# Entropy-Based Segregation Indices

Computes segregation indices, including the Index of Dissimilarity, as well as the information-theoretic indices developed by Theil (1971) , namely the Mutual Information Index (M) and Theil's Information Index (H). The M, further described by Mora and Ruiz-Castillo (2011) and Frankel and Volij (2011) , is a measure of segregation that is highly decomposable. The package provides tools to decompose the index by units and groups (local segregation), and by within and between terms. The package also provides a method to decompose differences in segregation as described by Elbers (2021) . The package includes standard error estimation by bootstrapping, which also corrects for small sample bias.

An R package to calculate and decompose entropy-based, multigroup segregation indices, with a focus on the Mutual Information Index (M) and Theil’s Information Index (H).

• calculate total, between, within, and local segregation
• decompose differences in total segregation over time
• estimate standard errors via bootstrapping
• every method returns a tidy data frame for easy post-processing and plotting
• it’s fast, because it uses the `data.table` package internally

## Usage

The package provides an easy way to calculate segregation measures, based on the Mutual Information Index (M) and Theil’s Entropy Index (H).

Standard errors in all functions can be estimated via boostrapping:

Decompose segregation into a between-state and a within-state term (the sum of these equals total segregation):

Local segregation (`ls`) is a decomposition by units (here racial groups). The sum of the proportion-weighted local segregation scores equals M:

Decompose the difference in M between 2000 and 2005, using iterative proportional fitting (IPF) and the Shapley decomposition, as suggested by Karmel and Maclachlan (1988) and Deutsch et al. (2006):

## How to install

To install the package from CRAN, use

To install the development version, use

## Papers using the Mutual information index

(list incomplete)

DiPrete, T. A., Eller, C. C., Bol, T., & van de Werfhorst, H. G. (2017). School-to-Work Linkages in the United States, Germany, and France. American Journal of Sociology, 122(6), 1869-1938. https://doi.org/10.1086/691327

Forster, A. G., & Bol, T. (2017). Vocational education and employment over the life course using a new measure of occupational specificity. Social Science Research, 70, 176-197. https://doi.org/10.1016/j.ssresearch.2017.11.004

Van Puyenbroeck, T., De Bruyne, K., & Sels, L. (2012). More than ‘Mutual Information’: Educational and sectoral gender segregation and their interaction on the Flemish labor market. Labour Economics, 19(1), 1-8. https://doi.org/10.1016/j.labeco.2011.05.002

Mora, R., & Ruiz-Castillo, J. (2003). Additively decomposable segregation indexes. The case of gender segregation by occupations and human capital levels in Spain. The Journal of Economic Inequality, 1(2), 147-179. https://doi.org/10.1023/A:1026198429377

## References on entropy-based segregation indices

Deutsch, J., Flückiger, Y. & Silber, J. (2009). Analyzing Changes in Occupational Segregation: The Case of Switzerland (1970–2000), in: Yves Flückiger, Sean F. Reardon, Jacques Silber (eds.) Occupational and Residential Segregation (Research on Economic Inequality, Volume 17), 171–202.

Theil, H. (1971). Principles of Econometrics. New York: Wiley.

Frankel, D. M., & Volij, O. (2011). Measuring school segregation. Journal of Economic Theory, 146(1), 1-38. https://doi.org/10.1016/j.jet.2010.10.008

Mora, R., & Ruiz-Castillo, J. (2009). The Invariance Properties of the Mutual Information Index of Multigroup Segregation, in: Yves Flückiger, Sean F. Reardon, Jacques Silber (eds.) Occupational and Residential Segregation (Research on Economic Inequality, Volume 17), 33-53.

Mora, R., & Ruiz-Castillo, J. (2011). Entropy-based Segregation Indices. Sociological Methodology, 41(1), 159–194. https://doi.org/10.1111/j.1467-9531.2011.01237.x

Karmel, T. & Maclachlan, M. (1988). Occupational Sex Segregation — Increasing or Decreasing? Economic Record 64: 187-195. https://doi.org/10.1111/j.1475-4932.1988.tb02057.x

Watts, M. The Use and Abuse of Entropy Based Segregation Indices. Working Paper. URL: http://www.ecineq.org/ecineq_lux15/FILESx2015/CR2/p217.pdf

# segregation 0.2.0

• add "shapley" decomposition method, revisit other difference decomposition methods
• better logging of bootstrap/IPF
• several small fixes
• add warning when attempting bootstrap with non-integer weights

# segregation 0.1.0

• switch group and unit definitions, to be consistent with the literature
• add Theil's Information Index (H)
• add mutual_within function to decompose weighted within indices
• add "wide" option to mutual_local and mutual_within
• add "ipf" (iterative proportional fitting) function and a difference decomposition based on IPF
• "mrc_adjusted" difference decomposition is defined only on overlap sample of units and groups
• internal refactoring

Initial release.

# Reference manual

install.packages("segregation")

0.6.0 by Benjamin Elbers, 5 months ago

https://elbersb.github.io/segregation/

Report a bug at https://github.com/elbersb/segregation/issues

Browse source code at https://github.com/cran/segregation

Authors: Benjamin Elbers [aut, cre]

Documentation:   PDF Manual