In this project, I created a dashboard for peer group comparisons of HCAHPS patient satisfaction scores for small, medium, and large hospitals by state. I used data from the CMS Patient survey (HCAHPS) - Hospital dataset, cleaned and prepared the data in PostgreSQL, and used the output to build a Tableau dashboard that allows hospitals to evaluate their performance relative to peer institutions and identify targets for improvement.
In this project, I created a dashboard tracking influenza (flu) vaccines administered in a simulated Massachusetts healthcare system. I used synthetic patient data generated by Synthea™, an open-source patient population simulation, and subsequently modified and shared by Joshua Matlock. I selected and prepared the data in PostgreSQL, then used the output to create a Tableau dashboard that provides access to flu vaccination coverage data for 2022.
In this project, I examined an HR dataset to predict whether an employee will leave the company and identified which variables were most important for the prediction. I used Python to perform exploratory data analysis, create visualizations to explore relationships between variables and evaluate model assumptions, and build, fit, and evaluate decision tree, random forest, and XGBoost classification models in order to choose the best model for predicting employee attrition. I then interpreted the chosen model and made recommendations based on my findings.
In this project, I explored the relationship between marketing promotional budgets and sales. I used Python to conduct exploratory data analysis, test model assumptions, build and fit simple and multiple linear regression models, perform one-way ANOVA and post hoc tests to evaluate differences between groups, and evaluate the model and test results.
In this comprehensive case study, I compared the use patterns of casual bike share users and members to inform marketing strategies. I used R to prepare, clean, manipulate, analyze, and visualize data, and presented key findings and recommendations. The project includes an R notebook, a presentation for stakeholders, and an appendix with supplemental analyses.
In this more focused project, I explored a comprehensive dataset using SQL (in Microsoft SQL Server) and produced a dashboard of visualizations in Tableau.