Anonymized complaints dataset

The full complaints dataset is available via Google BigQuery, which support SQL queries and data exports. You can access it here; you need a Google account. The tables are now considered `legacySQL` by Google, so your queries need to declare that (see examples below).

For now, please contact feedback@jetnoise.net to get your Google account added to the access list.

Data fields

The complaints data is basically a single table, [serfr0-1000:public.comp]. Each row is a single, anonymized complaint. (serfr0-1000 is the behind-the-scenes name of stop.jetnoise.net in Google Cloud.)

The columns in that table contain data as follows:

The following fields relate to the flight identified for the complaint; if no flight was identified, then they will be empty or zero. (If the user manually entered a flightnumber, then some but not all of them will be empty or zero.)

A few notes on the data

How the data is anonymized

You can review the anonymizing code on github. It works like this:

Example queries

Count complaints and unique users, for each day:

#legacySQL

SELECT

  datepst,

  COUNT(datepst) AS NumComplaints,

  COUNT(DISTINCT(user)) AS UniqueUsers,

FROM [serfr0-1000:public.comp]

GROUP BY datepst

ORDER BY datepst DESC

Count complaints and unique users by city, on a particular date:

#legacySQL

SELECT

  City,

  COUNT(city) AS NumComplaints,

  COUNT(DISTINCT(user)) AS NumUsers

FROM

  [serfr0-1000:public.comp]

WHERE

  datepst = '2016-06-20'

GROUP BY city

ORDER BY NumComplaints DESC

There are a bunch of views already defined; each of them has a backing query that you can look at, too.