Language Structures Dataset Constructed for Gay et al. (2017). The language_data.dta file contains gender-related language structure data for 492 languages as well as their geographical location. See the README.txt file for more information about data sources, as well as the paper and its appendix. The file describes the original sources used to complement the language_data file.

Data Links

Below are links to public datasets used in my papers.

Economic Data

Barro-Lee Educational attainment data from 1950 to 2010 for 146 countries, disaggregated by sex and 5-year age intervals. See Barro and Lee (2013) "A New Data Set of Educational Attainment in the World, 1950-2010", Journal of Development Economics, 104, 184-198.

CEPII - GeoDIST Geographical variables for 225 countries, including bilateral distance measures and contiguity indicators. See Mayer and Zignago (2011) "Notes on CEPII's distances measures: the GeoDist Database", CEPII Working Paper 2011-25.

DESA Population, fertility, and migration data for a wide range of countries and years from the Population Division of the United Nations Department of Economic and Social Affairs (DESA).

Genetic Distance Genetic distance data for 206 countries. See Spolaore and Wacziarg (2018) "Ancestry and Development: New Evidence", Journal of Applied Econometrics.

ILOSTAT Labor statistics for a wide range of countries and years from the International Labour Organization.

IPUMS USA U.S. Census and American Community Survey microdata from 1850 to the present.

Penn World Table Income, output, input, and productivity data from 1950 to 2014 for 182 countries. See Feenstra, Inklaar, and Timmer (2015) "The Next Generation of the Penn World Table", American Economic Review, 105(10), 3150-3182.

Linguistic Data

AUTOTYP Typological data and geographical distribution for about 2,900 languages. See Bickel (2002) "The AUTOTYP Research Program".

World Atlas of Language Structures (WALS) Structural properties for about 2,700 languages.

Political Data

Autocratic Regimes Transition information for the 280 autocratic regimes in existence from 1946 to 2010. See Geddes, Wright, and Frantz (2014) "Autocratic Breakdown and Regime Transitions: A New Data Set", Perspectives on Politics, 12(2), 313-331.

DD Democracy and dictatorship database. Classification of political regimes across 202 countries from 1946 to 2008. See Cheibub, Gandhi, and Vreeland (2010) "Democracy and Dictatorship Revisited", Public Choice, 143, 67-101.

IPU PARLINE Information on 272 parliamentary chambers in all of the 193 countries where a national legislature exists.

Quota Project Worldwide information on quota provisions for women in parliament. See Dahlerup et al. (2014) Atlas of Electoral Gender Quotas.