For this study, I used R to analyze my ~5.6 million observation dataset. I used multiple packages, such as ggplot, tidyverse, and bigstatsr.
I first cleaned the dataset, removing any rows with null values, converting variables to the correct type, and filtering those variables. I also created binary columns to use with my logit regression model.
For my Data Visualization class, I chose a company to generate and analyze data for. I created a FY 2022 dashboard with Tableau. The Tableau Dashboard is interactive, allowing the user to drill down into the data.
Due to the nature of this class, I created the data on my own using predictive analysis and outside research.
I used Excel to organize the data and formulas, then uploaded that to Tableau for visualization.