Info on Data

Data Source: https://www.epa.gov/egrid/download-data

Files: eGRID2018v2 Data File (XLSX), PLNT18 tab.

eGRID historical files (1996-2016) (ZIP) from this zip file you'll need,

eGRID2000_plant.xls, EGRDPLNT00 tab,

eGRID2010_Data.xls, PLTN10 tab.


How to clean Data:

A majority of the cleaning for these three data files was done on excel.

  1. Once you have downloaded all the data files delete all the tabs except for the ones needed, you should only have one tab per xls file.

  2. The only columns you should keep from all data files are the STATE, PNAME, LAT, LON, coal generated, oil generated, natural gas generated, nuclear generated, hydro generated, biomass generated, wind generated, solar generated, geothermal generated.

  3. To make the other source generated column, add the renaming other column such as fossils, waste, etc. into one column named GENAO. Do it for the first row adding the specific columns ei: SUM=(F2:G2,I2), then drag the right bottom square down to do the sum for all the other rows.

  4. To make the total amount of energy generated(TOTAL) for each row, do the SUM of the all the different energy sources generated per row.

  5. To make the total amount of renewable energy generated(GENATR), do the SUM of these columns with source geothermal,hydro,biomass,wind,solar per row.

  6. To make the total amount of non-renewable energy generated(GENATN), do the SUM of these columns with source coal,oil,natural gas, nuclear, and others. ei: SUM=(F40:K40,M40)

  7. The percent columns are included in the data file, but to make sure that non-renewable and renewable percent are correct. In the same column with percent(TNPR and TRPR) add up their specific energy source doing the SUM=(etc) for the first row and dragging it down for the remaining rows to do SUM for each row, and for it for each column TNPR and TRPR.

  8. To further clean the data delete any rows with NA or no data in LON and LAN column.

Repeat these steps for each data file eGRID2000_plant.xls, eGRID2010_Data.xls, eGRID2018v2.xlsx.