Dataset #1: Analyzing Opioid Prescription Rates by State and Provider Type
Source: Centers for Medicare and Medicaid Services – Medicaid Opioid Prescribing Rates by Geography
Summary of data and research questions it may address: The dataset contains opioid prescription rates at the national and state levels. It can help us address our research questions: Which states have the highest opioid prescription rates? How are prescription rates changing over time?
Collection method: Data was requested through the healthdata.gov API using the dataset's unique identifier and loaded directly into the analysis code. The JSON response was converted into a data frame and saved as a CSV file for analysis.
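The JSON-to-data-frame step described above can be sketched roughly as follows. This is a minimal illustration in Python with pandas, not the project's actual code: the function names, the column names (geo_lvl, geo_desc, year, opioid_rate), and the sample rows are all assumptions made up for the example, and the real API URL is left as a parameter because the source does not give it.

```python
import json
import urllib.request

import pandas as pd


def fetch_records(url: str) -> list:
    """Download a JSON array of row objects from `url`.

    Hypothetical helper; it is defined for illustration and not called below.
    """
    with urllib.request.urlopen(url) as response:
        return json.load(response)


def records_to_frame(records: list) -> pd.DataFrame:
    """Turn a list of JSON row objects into a data frame, one column per key."""
    return pd.DataFrame.from_records(records)


# Made-up placeholder rows standing in for the API response; not real figures.
sample_json = """
[
  {"geo_lvl": "National", "geo_desc": "National", "year": "2019", "opioid_rate": "1.0"},
  {"geo_lvl": "State", "geo_desc": "State A", "year": "2019", "opioid_rate": "2.0"}
]
"""

frame = records_to_frame(json.loads(sample_json))

# The sample values are strings, so convert the numeric columns explicitly.
frame["year"] = frame["year"].astype(int)
frame["opioid_rate"] = frame["opioid_rate"].astype(float)

# Keep a CSV copy alongside the data frame. With no path, to_csv returns the
# CSV text; pass a file path instead to write it to disk.
csv_text = frame.to_csv(index=False)
```

Converting types right after loading keeps later filtering and sorting from silently comparing strings instead of numbers.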
Cleaning and preprocessing:
Handling issues and noise: Redundant columns were dropped, the remaining columns were renamed for clarity, and the data was split into national and state-level subsets. Columns containing NA values were dropped because they were not relevant to the research questions. For the national data, rows for insurance-type subgroups were removed because only the overall totals will be used in the analysis. For the state-level data, only rows from 2019, the most recent year in this dataset, were kept so that states can be compared for that year.
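The cleaning steps above might look roughly like the sketch below in Python with pandas. The column names (geo_cd, plan_type, rate_change, and so on), the "All" label for total rows, and every value in the sample table are assumptions invented for illustration; the actual dataset's columns and labels may differ.

```python
import pandas as pd

# Made-up placeholder rows; not real figures from the dataset.
raw = pd.DataFrame({
    "geo_lvl": ["National", "National", "State", "State", "State"],
    "geo_desc": ["National", "National", "State A", "State B", "State A"],
    "geo_cd": ["00", "00", "01", "02", "01"],            # redundant with geo_desc
    "plan_type": ["All", "Managed Care", "All", "All", "All"],
    "year": [2019, 2019, 2019, 2019, 2018],
    "opioid_rate": [1.0, 1.5, 2.0, 3.0, 2.5],
    "rate_change": [None, None, 0.1, None, 0.2],         # contains NA values
})

clean = (
    raw.drop(columns=["geo_cd"])       # drop the redundant column
       .dropna(axis="columns")         # drop every column that contains an NA
       .rename(columns={"geo_lvl": "level", "geo_desc": "geography"})
)

# Split into national and state-level subsets.
national = clean[clean["level"] == "National"]
states = clean[clean["level"] == "State"]

# National: keep only the total rows, dropping insurance-type subgroups.
national_total = national[national["plan_type"] == "All"]

# State: keep only 2019 and rank states by prescription rate, highest first.
states_2019 = (
    states[states["year"] == 2019]
    .sort_values("opioid_rate", ascending=False)
    .reset_index(drop=True)
)
```

Note that `dropna(axis="columns")` removes any column with at least one missing value, which matches the description only if none of those columns are needed later; a column-by-column check would be safer on the real data.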