Preview of clean dataset used for classification methods (Decision Tree , Naive Bayes , etc.)
Link to code: https://github.com/Rokkaan5/5622-PublishedCode/blob/main/Regression/regression_data.py
Function for splitting training and testing datasets (in Python) with options to scale and how to split, etc. : https://github.com/Rokkaan5/5622-PublishedCode/blob/main/Classification/JK_preprocessing.py
Data used:
NOAA - API data as target/labels
OWID Greenhouse Gas data as features/input data
Greenhouse gas Data from "Our World In Data"
(going to use as input/feature/data for classification models)
Starting dataframe of NOAA API data to be used as targets for regression models
cleaned data, explained in NOAA API Data page
Final clean df of combined OWID and NOAA data
OWID data = Features; NOAA (temperature) datan = Targets