Classification Data Prep

Classification model data-prep

Preview of the clean dataset used for the classification methods (Decision Tree, Naive Bayes, etc.)

Python function for splitting the data into training and testing sets, with options for scaling the features and for how the split is made: https://github.com/Rokkaan5/5622-PublishedCode/blob/main/Classification/JK_preprocessing.py
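For readers who do not want to open the repository, the following is a minimal sketch of what such a split-and-scale helper can look like. It is not the code from JK_preprocessing.py: the function name `split_and_scale`, its parameters, and the toy data are all assumptions made for illustration, and it uses only the Python standard library so it runs anywhere.

```python
import random
from statistics import mean, pstdev


def split_and_scale(X, y, test_size=0.25, scale=False, seed=0):
    """Hypothetical helper: shuffle rows, split them, optionally standardize features."""
    if len(X) != len(y):
        raise ValueError("X and y must have the same number of rows")

    # Shuffle row indices reproducibly, then carve off the test portion.
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    n_test = max(1, round(len(X) * test_size))
    test_idx, train_idx = idx[:n_test], idx[n_test:]

    X_train = [list(X[i]) for i in train_idx]
    X_test = [list(X[i]) for i in test_idx]
    y_train = [y[i] for i in train_idx]
    y_test = [y[i] for i in test_idx]

    if scale:
        # Fit the mean and standard deviation on the training rows only,
        # then apply the same transform to both sets.
        columns = list(zip(*X_train))
        mus = [mean(c) for c in columns]
        sds = [pstdev(c) or 1.0 for c in columns]  # constant column: divide by 1

        def standardize(rows):
            return [[(v - m) / s for v, m, s in zip(row, mus, sds)] for row in rows]

        X_train, X_test = standardize(X_train), standardize(X_test)

    return X_train, X_test, y_train, y_test


# Made-up toy data: 8 rows, 2 features, 2 classes.
X = [[float(i), float(2 * i)] for i in range(8)]
y = ["low" if i < 4 else "high" for i in range(8)]

X_train, X_test, y_train, y_test = split_and_scale(
    X, y, test_size=0.25, scale=True, seed=42
)
```

Fitting the scaling statistics on the training rows alone keeps information from the test set out of the model. In practice, scikit-learn's `train_test_split` and `StandardScaler` cover the same ground.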

Data used:

Greenhouse gas data from "Our World in Data"
(used as the input features for the classification models)

Starting dataframe of NOAA API data, used as the targets for the classification models

Cleaned data, as explained on the NOAA API Data page

Selected specific columns and discretized the continuous NOAA API data to create classification labels
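The discretization step can be sketched as follows. This is only an illustration of the general technique: the helper name `discretize`, the readings, the bin edges, and the label names are all made up here and are not the values used for the NOAA data.

```python
from bisect import bisect_right


def discretize(values, edges, labels):
    """Hypothetical helper: map each continuous value to the label of its bin.

    `edges` must be sorted ascending. Bins are closed on the left, so a value
    equal to an edge falls into the higher bin.
    """
    if len(labels) != len(edges) + 1:
        raise ValueError("need exactly one more label than bin edges")
    return [labels[bisect_right(edges, v)] for v in values]


# Made-up continuous readings and thresholds, for illustration only.
readings = [-3.0, 4.5, 12.0, 18.2, 27.9]
classes = discretize(readings, edges=[5.0, 20.0], labels=["low", "medium", "high"])
# classes == ["low", "low", "medium", "medium", "high"]
```

With a pandas dataframe, `pandas.cut` (fixed edges) or `pandas.qcut` (quantile-based edges) performs the same kind of binning on a column.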
