Many municipalities have begun providing frequently updated data on city activities as part of an effort to increase transparency in government. My hope in this project is to leverage these data to provide a better idea of how people are responding to the COVID-19 pandemic (and reducing social contact) in real time. Data used in this project encompass four broad categories: crime, automobile crashes, first responder calls and air pollution. When possible, I have aggregated these data into broad categories (e.g. property crime, violent crime, crime victims by race, etc.) so comparisons may be made. Data for each city are rescraped approximately daily. I currently have data for the following cities, and I continue to add more whenever I have free time:
Crime: Austin, Baltimore, Baton Rouge, Boston, Chattanooga, Chicago, Cincinnati, Detroit, Dallas, Denver, DC, Gainesville, Kansas City, Little Rock, Los Angeles, Louisville, Memphis, Mesa, Montgomery County (MD), Naperville, Nashville, Omaha, Phoenix, Raleigh, Riverside, Rochester, Rockford, Sacramento, Saint Paul, San Diego, San Francisco, San Jose, Santa Ana, Seattle, Sonoma County, St. Petersburg, Tacoma, Tucson, Virginia Beach, Wichita
Auto Accidents: Baltimore, Baton Rouge, Boston, Chicago, Cincinnati, Denver, Fort Worth, Los Angeles, Milwaukee, Montgomery County, NYC, Nashville, Riverside, Sacramento, San Diego, San Jose, Santa Ana, Scottsdale, Seattle, St. Petersburg, Tucson, Virginia Beach, Wichita
First Responder/Fire Dept: Baton Rouge, Cincinnati, Mesa, Milwaukee, San Diego, San Francisco, Scottsdale, Seattle, Virginia Beach, Wichita
In each plot, I mark when two events occurred: the order to close restaurants and bars (red line) and the shelter-in-place order (black line).
For the crime data, I aggregate data into various categories. The breakdowns are as follows:
For the crash data, I aggregate data into two categories:
For the Fire 911 data, I aggregate data into three categories:
Data Last Updated:
Caveats:
1) I have done almost no quality control on these data. This is meant as a quick and dirty project to look across as many cities as possible and see how behavior is changing. At a future point, I would like to go back and clean these data up, but that is not what is presented here. A particular concern is that the most recent day (or days) of data may not be fully updated, so one should be especially careful interpreting these points.
2) Particularly with the crime data, cities include very different events, so total crime counts are not comparable across cities. For instance, some cities include only UCR Part 1 crimes, some include all crimes, and some include police interactions, many of which are not crimes at all. Comparing across cities within a crime category is more valid, as the categories are always built from a standard set of crimes (Violent: Robbery, Murder, Rape and Assault/Battery), (Property: Theft/Larceny and Burglary, excluding Motor Vehicle Theft), (Motor Vehicle Theft: MV Theft), but even in these categories there is variation in reporting across cities. I have indicated roughly what each city includes in its crime data on each city's page.
3) For the air pollution data, values are not adjusted for weather conditions. Large changes can occur because of weather patterns even independent of any change in emissions.
4) Make sure to check where cities' data end. Cities update at different frequencies and so some cities are more up-to-date than others.
5) I have no skills at web design, so this website is very ugly.
6) If you see something in one of the graphs that looks obviously wrong, let me know and I can try to fix it.
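As a rough illustration of the category groupings described in caveat 2, here is a minimal Python sketch. It is not the project's Stata code: the function name categorize_offense, the keyword lists, and the "Other" bucket are all assumptions made for the example, and real city datasets label offenses differently enough that the keywords would need per-city adjustment.

```python
# Hypothetical keyword lists built from the categories named in caveat 2.
# Order matters: motor vehicle theft is checked first so that it is not
# swept into the Property category by the generic "theft" keyword.
CATEGORY_KEYWORDS = [
    ("Motor Vehicle Theft", ("motor vehicle theft", "mv theft")),
    ("Violent", ("robbery", "murder", "rape", "assault", "battery")),
    ("Property", ("theft", "larceny", "burglary")),
]


def categorize_offense(description):
    """Map a free-text offense description to a broad category.

    Returns "Other" (an assumed catch-all) when no keyword matches,
    e.g. for police interactions that are not crimes.
    """
    text = description.lower()
    for category, keywords in CATEGORY_KEYWORDS:
        if any(keyword in text for keyword in keywords):
            return category
    return "Other"
```

Simple substring matching like this is easy to audit city by city, but it will misclassify labels it has not seen, which is one reason the cross-city comparisons above should be read with care.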
Data and Code
Here is a link to my (again ugly, uncommented) Stata code, which pulls data from various cities' open data portals. All plots can be recreated directly from the code, and it should be moderately easy to tweak to create other cuts of the data (of course, because each city documents its crime differently, this requires digging at least partially into each city's data). Feel free to borrow it for any projects where it might be useful. Alternatively, here is a Stata dataset of all cities' crime collapsed to the daily level, broken out by the categories used in the charts.
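For readers who do not use Stata, the "collapsed to the daily level" step can be sketched in a few lines of Python. This is only an illustration of the idea, not a translation of my code: the function name collapse_to_daily and the input layout (pairs of an ISO 8601 timestamp string and a category label) are assumptions, and each city's portal will need its own date parsing.

```python
from collections import Counter
from datetime import datetime


def collapse_to_daily(incidents):
    """Count incidents per (date, category).

    `incidents` is an iterable of (timestamp, category) pairs, where the
    timestamp is assumed to be an ISO 8601 string such as
    "2020-03-16T21:05:00". Returns a dict keyed by (date, category).
    """
    counts = Counter()
    for timestamp, category in incidents:
        # Drop the time of day so all incidents on one date share a key.
        day = datetime.fromisoformat(timestamp).date()
        counts[(day, category)] += 1
    return dict(counts)


# Made-up records, purely to show the input and output shapes.
sample = [
    ("2020-03-16T21:05:00", "Violent"),
    ("2020-03-16T23:10:00", "Violent"),
    ("2020-03-17T01:00:00", "Property"),
]
daily = collapse_to_daily(sample)
```

Days with no incidents in a category simply do not appear in the result, so anyone plotting these counts would need to fill in zeros for missing days.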