Pitchbook (1. PE 2. private firms)
Capital IQ (All public and private firm fundamental data 2. Merger and Acquisition data 3. Conference Call)
Bloomberg (ESG rating, Disclosure divergence)
Nelson Consumer Data (retail scanner database)!
Nielsen Retail Scanner Data, which contain prices and revenues from retail chains across the US. Prior research has used this data to study the product market impacts of private equity (Fracassi et al., 2022), credit market disruption (Granja and Moreira, 2022; Kabir, 2022), common ownership (Aslan, 2023), and taxation (Baker et al., 2023). We extract prices and revenues at the weekly frequency and at a level as fine as each store and each Universal Product Code (UPC), which is a 12-digit barcode that identifies a unique traded item.
Gopalan, Radhakrishnan and Li, Renping and Zaldokas, Alminas, Board Connections, Firm Profitability, and Product Market Actions (April 12, 2022). European Corporate Governance Institute – Finance Working Paper No. 996/2024 , Available at SSRN: https://ssrn.com/abstract=4053853 or http://dx.doi.org/10.2139/ssrn.4053853
CIK-gvkey linking table https://sites.google.com/iu.edu/professorbrianpmiller/bog-data
Company Database Linking Matrix https://wrds-www.wharton.upenn.edu/pages/wrds-research/database-linking-matrix/
Labor Union (1. Election 2. Negotiation)
Health Care Cost and Pension Fund (Form 5500)
AS YOU SOW (DEI disclosure score)
NETS (PRI and PUB) - Lauren or Chengzhu
FactSet Reverse Relationship Global Supply Chain -Poduction Nework (Youchang WU)
EPA
Municipal Bond
Seeking Alpha (Conference call)
Investor Conference (Wall Street Horizon, Bushee et al., 2024 TAR)
Revelio Lab (PolyU-Individual data not Job Posting)
Congress Member Insider Trading (STOCK ACT)
Harvard Dataverse https://dataverse.harvard.edu/
Openicpsr https://www.openicpsr.org/openicpsr/
AFL-CIO https://aflcio.org/paywatch/company-pay-ratios (Collect Every Year! 2022 2023)
USPTO Patent -Jingjing Wang &Linda and some Harvard paper The Harvard USPTO Patent Dataset (HUPD)
Access World News -Library Access
Custom Data Download | CAMPD | US EPA CAMPD is from Xuanyu
Form 923 and other forms are from Scott
EIA emission data Emissions by plant and by region
EIA capacity and
Government exposure --Amstrong JAE
Bloomberg Supply Chain Data -Carbon intensity and ESG !!
Diversity data
GPTW's Culture Audit
International Labor Organization https://www.ilo.org/
Gender Equality and C-suite
LEHD Data
Census Bureau Employee gender education, and experience and Company working for
Environmental/Climate data
The database, developed by the Berkeley Carbon Trading Project, contains all carbon offset projects, credit issuances, and credit retirements listed globally by four major voluntary offset project registries—American Carbon Registry (ACR), Climate Action Reserve (CAR), Gold Standard, and Verra (VCS). These four registries generate almost all of the world's voluntary market offsets and also include credits eligible for use under the California / Quebec linked cap-and-trade programs and Washington's cap-and-invest program.
This database is meant to increase the transparency of the carbon offset market, providing researchers and offset buyers with the ability to better see offset credits and projects in a single database. Dynamic charts and tools allow users to see trends over time, and explore the projects and credits on the market by location, type, registry, etc.
Energy Information Agency’s (“EIA”)
Form 923 dataset. All electric generators that have a capacity of at least one megawatt hour and are connected to the electric grid.
ResponsibilityReport.com Apple Inc. - ResponsibilityReports.com
CSRwire CSRWire - CSR Reports
Corporate Register - Global CSR Resources CSR report CR data collecter
DEF 14A
ESG Disclosure
ESG compensation
Shareholder Proposal
ESG and anti-ESG Bills
Open State
State-level vote
State-level anti-ESG bills
Sustainable Stock Exchange Initiatives
https://sseinitiative.org/esg-guidance-database
CEO Compensation
Shareholder Proposals Proxy Analytics | Access Governance (proxy-analytics.com)
ESG hiring data
Live Data Technologies: Real-time Job Change Data
Incentive Lab -Compensation (Ferri et al JAE 2018)
*Morningstar proxy-voting database (*Voting on ESG: Ever-Widening Differences)
Voting on ESG: Ever-Widening Differences (harvard.edu)
ESG rating (MSCI Asset4, RepRisk, Bloomberg, MorningStar, Sustainalytics, Refinitiv, KLD)
Rating agency competition
AI investment and involvement of rating agency to reduce human judgement error?
How Artificial Intelligence is Reshaping ESG Ratings _ Nomura Connects.pdf
NGO data SigWatch
"We obtain data on NGOs’ E&S-washing allegations from SigWatch, a European data analytics firm specializing in monitoring and analyzing NGO activism campaigns. SigWatch has built a unique dataset that covers about 11,000 activist groups worldwide in over 75,000 campaigns involving over 20,000 target firms since 2011. The data are sourced from a variety of public sources, including NGO websites, their press releases, and research reports. For each NGO campaign, the information that SigWatch provides includes characteristics of the NGO and the firm that is targeted, a summary of the NGO’s allegations against the firm, and web links to the source documents. The clients of SigWatch include institutional investors interested in assessing their (or their portfolio firms’) reputation risks related to NGO campaigns, but also audit and consulting firms as well as the OECD and academics."
MTBS program wildfire Tiger/Line Shapefiles from Census Hot Dry Wendy Index (HDW)
B CORP CERTIFICATION
Measuring a company’s entire social and environmental impact.
Seeking Alpha (Retail Investor)
Earnings Conference Transcripts
Investor Social Media (Comments)
An integrated platform provides retail investors with seminal analysis and a place to comment. It also provides conference call script. Ladder - Yujie Song 宋豫洁 provides the merge list of seeking alpha and IBES
Investor conference (Different from earnings conference)
Bushee, B.J., Taylor, D.J. and Zhu, C., 2023. The dark side of investor conferences: Evidence of managerial opportunism. The Accounting Review, 98(4), pp.33-54.
Capital IQ
Earnings Conference Scripts and Q&A
Twitter provides (limited) access for academic research to extract and analyze Tweets.
rtweet (Kearney 2019)
Wayback Machine (Jacqueline (Jackie) Wegner)
Google trends
Google offers public access to global search volumes through its search engine through the Google Trends portal.
globaltrends (Puhr and Müllner 2021) and gtrends (Massicotte and Eddelbuettel 2022)
EPU policy uncertainty
National level and State level.
OpenState https://open.pluralpolicy.com/about/
Bills
Regions and Regional Elections (Amore and Minichilli, 2018)
Each Italian region has its own parliament and government. Regional elections are held every 5 years, and the winning coalition is determined by electoral systems that combine proportional and majoritarian rules. Table 1 illustrates the 69 regional elections held in the 20 Italian regions from 2000 to 2014.
RiskMetrics CEPD (Northwest University Only)
RiskMetrics Corporate Environmental Profiles Database (CEPD) has data for the following 19 environmental statutes.
MIT Election data and science lab
The president election of the United States:
All level federal, state, county
All units by state, by district, by county and by precinct
https://mtgis-portal.geo.census.gov/arcgis/apps/webappviewer/index.html?id=c754be823d6342949a4c50e519eb87be#
Voteview.com
House of Representatives
Senates
Census
https://www.census.gov/data/developers.html
Congressional Districts
https://www.govinfo.gov/app/details/USCODE-2023-title2/USCODE-2023-title2-chap1-sec2c
Map+Layer
https://researchguides.uoregon.edu/gis/data
https://freegisdata.rtwilson.com/
https://github.com/OpenSourceActivismTech/us-zipcodes-congress/tree/master
ZIP-CD Linking Table
https://www.huduser.gov/apps/public/uspscrosswalk/home
The STOCK Act requires members of Congress to file a Periodic Transaction Report (PTR) for any transactions of over $1,000 in publicly traded securities within forty-five days of the transaction date.
https://corpgov.law.harvard.edu/2024/07/29/negative-trading-in-congress/?utm_source=rss&utm_medium=rss&utm_campaign=negative-trading-in-congress
Mergent-Bond
The Federal Reserve Bank of St. Louis provides more than 818,000 US and international time series from 109 sources via the API FRED. The data is freely available and can be browsed online on the FRED homepage.
fredr (Boysel and Vaughan 2021) and alfred (Kleen 2021)
Capital IQ
All public and private firm fundamental data
Merger and Acquisition data
TRACE Corporate Bond
The Financial Industry Regulatory Authority (FINRA) provides the Trade Reporting and Compliance Engine (TRACE). In TRACE, dealers that trade corporate bonds must report such trades individually. Hence, we observe trade messages in TRACE that contain information on the bond traded, the trade time, price, and volume. TRACE comes in two variants: standard and enhanced TRACE.
IBES
Analyst following
Capex
Analyst forecast (include green investment forecast)
Municipal Securities Rulemaking Board MSRB
Form-605 Broker's transaction data
EDGAR Schedule 13D 是美国证券交易委员会(SEC)根据《证券交易法》规定的一种提交表格,用于披露在某些情况下购买股票后所拥有的股权。具体来说,Schedule 13D 主要适用于那些持有某公司股票超过 5% 的投资者或者投资者集团。
Shock to the efficiency of municipal bond trades
The Real-Time Transaction Reporting System (RTRS) reduced the delay in reporting municipal bond trades from one-day to 15 min.
The data provider CoinMarketCap provides cryptocurrency information and historical prices, as well as information on the exchanges they are listed on.
crypto2 (Stoeckl 2022)
CoinGecko is an alternative crypto data provider of current and historical data on a myriad of coins and exchanges.
geckor (Mastitsky 2021)
FED
The Federal Reserve Bank of St. Louis provides more than 818,000 US and international time series from 109 sources via the API FRED. The data is freely available and can be browsed online on the FRED homepage.
fredr (Boysel and Vaughan 2021) and alfred (Kleen 2021)
ECB
The European Central Bank’s Statistical Data Warehouse provides data on Euro area monetary policy, financial stability, and other topics relevant to the activities of the ECB and the European System of Central Banks (ESCB).
ecb (Persson 2021)
R&D (Lauren)
Income inequality
Gini coefficients 1993-2003 for international research; Theil index, see http://utip.gov.utexas.edu
World Bank
fiscal space database, sovereign debt, unemployment rate in percent, GDP per capita
FRED
county level household data (Zhou Ren)
FED Households Debt to income ratio quarterly based
Macro Predictors: (19) Home (google.com)
CEO Turnover/Executive turnover
Tainted CEO (Baer Jingjing Zhang TAR 2023)
We identify firms subject to securities class action lawsuits filed over the 2002–2017 period. To reduce the incidence
of frivolous cases of alleged fraud, following Dyck et al. (2010), we limit the sample to settled lawsuits. We also require nonmissing settling defendant information, yielding a sample of 1,623 lawsuits at 1,444 firms. Next, we match CEOs of public firms in BoardEx with lawsuit defendants using individual name and case period. For CEOs named in multiple cases over our sample period, we keep the lawsuit with the earliest filing date. Our initial sample consists of 1,282 CEOs named in 1,178 lawsuits at 1,125 firms.
CEO Pilot
We then hand collect data to identify pilot and nonpilot CEOs based on airmen certificate records.
https://amsrvs.registry.faa.gov/airmeninquiry/Main.aspx
Prosocial CEO
Specifically, we match the names of CEOs’ off-the-job organizations with organizations classified as charitable by the IRS.16 If a CEO has been involved with at least one charitable organization, we consider him or her to be prosocial, for whom an indicator variable, Charity, equals one. https://link.springer.com/article/10.1007/s11142-023-09761-0
Feng, M., Ge, W., Ling, Z. et al. Prosocial CEOs, corporate policies, and firm value. Rev Account Stud 29, 1854–1903 (2024). https://doi.org/10.1007/s11142-023-09761-0
CEO Pay ratio
2018 Setting, WONJAE CHANG, 2022; Execucomp (covering the S&P 1500 Index) and Equilar (covering the bottom half of the Russell 3000 Index) databases.
Spencer Stuart-Director Compensation Trends in Director Compensation (harvard.edu)
Executive compensation
DEF 14A
SeekingAlpha
Family-controlled and nonfamily-controlled companies
Italy (Amore and Minichilli, 2018)
Information on ownership and board positions is hand-collected from official public filings of the Italian Chamber of Commerce, with the family identity of executives and board members established via surname affinity with that of the controlling family.
China
Institutional Shareholder Services (ISS) withhold recommendations
Proxy statement about environment disclosure—>contentious director election
(Contentious director election) (Robinson, 2024 JMP)
Shock ISS updating to the ESG related criteria
Glass Lewis Guidance and ISS
Shareholder Proposal Voting
(Contentious director election) (Robinson, 2024 JMP)
Diligent Market Intelligence’s (DMI) Shareholder Voting
Employee Ownership Vs. Common Ownership
Regulator SEC FARB SASB
SEC Undisclosed investigation
Accounting professor at Oregon State University
Terrence Blackburne, John D. Kepler, Phillip J. Quinn, Daniel Taylor (2020) Undisclosed SEC Investigations. Management Science 67(6):3403-3418. https://doi.org/10.1287/mnsc.2020.3805
Whistleblower programs (CFTC and SEC Whistleblower office)
Recent literature has studied, in various settings, theeffectiveness of these tools, including whistleblower programs (Call et al.[2018], Soltes [2020], Dey et al. [2021], Berger and Lee [2022]), the dis-closure of regulatory actions (Duro et al. [2019], Kleymenova and Tomy[2022]), mandated firm disclosures (Christensen et al. [2017]), and en-forcement or prosecution (Correia [2014], Silvers [2016], Nguyen [2021]).
SEC.gov | Office of the Whistleblower
Government-appointed Monitors
We study a relatively new tool at the disposal of regulators in preventing corporate misconduct: a government-appointed, on-site corporate compliance monitor, also referred to as the “Corporate Monitor.” Monitors areappointed at large corporations that have already been exposed for wrong-doing, with the aim of reforming the firm and preventing further misconduct. However, given their relative novelty, less is known about their roleor effectiveness. In this paper, we examine whether the appointment of aCorporate Monitor reduces the incidence of repeat misconduct.
Good Job First
Discover Which Corporations are the Biggest Regulatory Violators and Lawbreakers Throughout the United States.
https://violationtracker.goodjobsfirst.org/
Financial Analyst Misconduct: Yuwen Yuan and Youchang (2023, WP)
IAPD reports available from the SEC investment public disclosure (IAPD) website (https://advisorinfo.sec.gov Unique CRD number
AAER Accounting fraud
Comment letter (SEC) Audit Analytics
Dichev, I.D., Qian, J. (2022) The Benefits of Transaction-Level Data: The Case of NielsenIQ Scanner Data. Journal of Accounting and Economics 74 (1), 101495.
Nelson Consumer Data (UO available)
Nielsen retail scanner database
Factset database, covers suppliers & customers relationships (international)
TAB UK data (Crowdfund Data)
formerly Crowdfund interface – on 1,126 (successful and unsuccessful) initial ECF campaigns over the 2012-2018 period in the UK. TAB was acquired by Thomson Reuters and added to its Eikon App Studio. Our dataset was augmented with firm-level data gathered from the UK Companies House.
EEOC
NLS Investigator
Sponsored by the Bureau of Labor Statistics, the National Longitudinal Surveys (NLS) are a family of surveys dedicated to tracking the labor market and other life experiences of American men and women. Intergeneration mobility decreased.
Employee Quality
The distance from an Ivy University
Plant Level and firm level incident rate
Violation of OSHA regulation at state level
Bureau of Labor Statistics (BLS)
Unemployment Rate
U.S. CENSUS DATA FOR SOCIAL, ECONOMIC, AND HEALTH RESEARCH
IPUMS provides census and survey data from around the world integrated across time and space. IPUMS integration and documentation makes it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community contexts. Data and services available free of charge.
The state level labor inflow and outflow tracker, with the industry category and salaries statistics.
Revelio labs (Employee data)
Revelio individual
Revelio Job Posting
Revelio Sentiment
Revelio Workforce Dynamics
ILR strike tracker/ Cornell University
https://striketracker.ilr.cornell.edu/
Lightcast sources job posting
from more than 65,000 websites, with 25 million job posting.
MSCI Empowering Women Index (WIN)
Taxation
SOI bases its county income data on the addresses reported on individual income tax returns filed with the IRS. Data are presented by county (including State totals) and are available for Tax Years 1989 through 2021. The data include:
Number of returns, which approximates the number of households
Number of personal exemptions, which approximates the population
Adjusted gross income
Wages and salaries
Dividends before exclusion
Interest received
The NBER Public Use Data Archive
The NBER Public Use Data Archive is an eclectic mix of public-use economic, demographic, and enterprise data obtained over the years to satisfy the specific requests of NBER-affiliated researchers for particular projects. Files here are often in more convenient formats than the original data source. However, files that receive updates at the source may not be updated here. The Public Use Data archive also serves as a repository of the outputs, be they data or code, of NBER projects that, when allowed by the sources, are intended for wider use or replication efforts.
https://www.nber.org/research/data?page=1&perPage=50
IMDB movies
Movie and information processing
Geocomputation with R (free online)
Spatial Data Science (free online)
RFS 2023 The Value of Differing Points of View: Evidence from Financial Analysts’ Geographic Diversity
Analyst locations
Our data on financial analyst locations comes from historical filings of the Uniform Application for Securities Industry Registration or Transfer (Form U4), which provides detailed accounts of analyst registrations and work histories including the street address of office location. Form U4 is filed by an employer when the analyst joins the firm and must be updated upon material changes, such as changing jobs. The Form U4 data are aggregated in a database called the Central Registration Depository (CRD), which is jointly operated by FINRA and state securities regulators. We obtain Form U4 data from a series of Freedom of Information Act requests to state regulators. Our universe of financial analysts consists of those registered in any of the states that respond to our requests during some point of their career. Analysts may register in multiple states, so we have data for many analysts in the states that do not supply information.
Parking lot data
To measure local firm performance, we use satellite imagery of daily parking lot car counts for major U.S. retail firms.We obtain data on parking lot car counts from Orbital Insight, a leading image processing company that uses machine learning to convert satellite images into quantitative data.6 The data include the company name and ticker, a unique identifier for each store location, the latitude and longitude of the store, the date and time the image was taken, and a count of the number of cars in the parking lot when the image was taken. Orbital Insight provides a normalized measure based on the raw car count data that accounts for the day of the week, time of day the satellite image is taken, and the maximum number of cars a parking lot can likely hold. This normalization is based on empirically validated predictable patterns in daily retail traffic.7 Similarly, Orbital Insight’s machine vision technology can detect and account for weather anomalies like cloud coverage. Our sample for daily car counts begins in January 2009 and ends in December 2015.
Political connections in a low corruption environment.