On this page you can find details on the data that I use in my research. Data is made available after publication or upon request. I also post packages that I help develop.
Universe of Swedish Plants: The Universe of Swedish Plants contains establishment-level observations of manufacturing establishment during the 20th century. For the purposes of compiling this database we have trained and utilized a TrOCR-model. The database is under progress and currently covers all the universe of Swedish manufacturing plants in 1916, 1925, 1929, and 1939.
Interwar Wage Database: The Interwar Wage Database is a biannual sample of the underlying survey material from the official historical wage statistics. The database covers 731,333 workers and 11,928 establishments between 1922 and 1936 in Sweden. Data is available at SND.
histocc: I've developed an R package to help code Swedish historical occupations by hisco, income scores, sector and worker type. The package utilizes existing crosswalks between occupational strings and worker classifications to speed up a common task for researchers. The package also allows for standardizing occupational strings both using a crosswalk and a fuzzy match approach. Find the package at: https://github.com/skoglundw/histocc/tree/main