Preview of clean dataset used for classification methods (Decision Tree , Naive Bayes , etc.)
Link to code: https://github.com/Rokkaan5/5622-PublishedCode/blob/main/Classification/labeled-data.py
Function for splitting training and testing datasets (in Python) with options to scale and how to split, etc. : https://github.com/Rokkaan5/5622-PublishedCode/blob/main/Classification/JK_preprocessing.py
Data used:
NOAA - API data as target/labels
OWID Greenhouse Gas data as features/input data
Greenhouse gas Data from "Our World In Data"
(going to use as input/feature/data for classification models)
Starting dataframe of NOAA API data to be used as targets for classification models
cleaned data, explained in NOAA API Data page
Selected specific columns for data, and discretized continuous data (of NOAA API data) to create classification labels