Data

Some useful dataset that I have found over my research and that are open to use


Long Run Productivity Data from the Long Term Productivity Project (Bergeaud, Cette and Lecat). This dataset is uploaded yearly and offer estimates of TFP, labor productivity, GDP per capita and other derivatives for a set of around 20 countries since the end of the 19th century.

CH.DUPIN project provides yearly estimate of oil consumption for 16 countries since 1890.

Patent semantic classification from the paper "Classifying Patent Based on their Semantic Content" (Bergeaud, Potiron, Raimbault). This dataset links each USPTO patents from 1976 to 2013 to a category that have been constructed based on the content of each patent's abstract.

List of USPTO patents from US universities from the paper "Innovation and Top Income Inequality" (Aghion, Akcigit, Bergeaud, Blundell, HĂ©mous). This dataset lists all USPTO patent from 1969 to 2016 whose assignee is a univeristy and give the name and state of this university (originally taken from USPTO and improved).

PatentCity Database provides a geolocation of all USPTO patents from 1836 to 1924 at the county level from a joint work with Cyril Verluise. See the Github page of the project

HistPat database HistPat provides the geography of historical patents granted by the USPTO from 1790 to 1975. From Petralia, Sergio; Balland, Pierre-Alexandre; Rigby, David, 2016

BACI provides bilateral trade flows for more than 5000 products and 200 countries and is now free of charge. From CEPII (Guillaume Gaulier and Soledad Zignago)