This dataset covers about three million refugees who arrived in the United States between 1975 and 2018. We provide two complementary formats:
Individual-level refugee records (1975–2008)
Digitized from publicly available records as originally recorded by the Office of Refugee Resettlement.
Geo-coded dataset (1975–2018)
A geo-referenced version that extends coverage to 2018 and can be linked to the individual-level data.
Data website: www.refugeeresettlementdata.com/
Broadband Internet Data for Indian Villages (BharatNet)
This dataset tracks the rollout of BharatNet, the Indian government’s program to connect every Gram Panchayat (village council) to fiber-optic internet in several phases. Raw broadband connection point locations were extensively validated through web scraping of village directories and cross-checks with OpenStreetMap. The dataset contains 175,157 locations, identifies the phase in which each was connected, and can be easily linked to a wide range of datasets via SHRUG.
Data download: here
When using the data, please cite:
Matzat J. 2025. Rural Internet, Identities and Governance.
Local Political Polarization on Twitter (2016-2020)
We collected profile locations from millions of US-Twitter accounts via the Twitter API . We geocoded the locations and connected them with ideology scores provided by Pablo Barberá. Coming soon.