Statewide District Analysis for Equity and Need
Last Updated - September 2024
PURPOSE AND KEY QUESTIONS FOR UNDERSTANDING EQUITY
Purpose and Overview
DATA OVERVIEW: The statewide analysis highlights key equity and need metrics that governmental agencies already use for TK-12 schools and environmental justice. These metrics are included in this data initiative to help education leaders and advocacy organizations take an equity-driven approach to scaling environmental and climate action.
Key Questions
CORE QUESTIONS:
Which school districts are experiencing the highest inequities according to traditional equity indicators for TK–12 schools (e.g., socioeconomic status, race, special education, English language learners)?
Which school districts are experiencing the highest inequities related to environmental pollution?
ADDITIONAL QUESTION: Which school districts are experiencing the highest inequities AND the least access to environmental and climate action?
Data Methodology
Resources and Data Collection: CDE codes, school district listings, physical addresses, geographic coordinates, district classifications, expenses per average daily attendance (ADA), rates of unduplicated student populations, and the proportion of students eligible for free or reduced-price lunch (FRL) were sourced from the California Department of Education (CDE) Public Schools and Districts data files and from the Ed-Data website. Collection was automated using for-loops and similar strategies, producing an array or table of calculated values for subsequent reporting and analysis.
Data Cleaning: The process involved downloading Excel files and analyzing them in Google Colab using Python, mainly with the pandas package. The analysis combined geospatial techniques in QGIS with data manipulation in Google Colab. It started with collecting shapefiles, pollution data from CalEnviroScreen, school locations from HIFLD, and census data. In QGIS, California schools were selected by location, with manual adjustments for schools near water bodies, and then spatially joined with the pollution data. In Google Colab, the school data was refined by adding district names, cleaning the records, and grouping them by district. Finally, each district's average pollution score and average pollution percentile were calculated using NumPy's np.mean function.
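The final grouping step can be pictured with the minimal sketch below. It is an illustration only, not the team's actual notebook: the column names ("district", "pollution_score", "pollution_pctl") and all values are hypothetical, and dropping unmatched schools is an assumed cleaning rule.

```python
import numpy as np
import pandas as pd

# Hypothetical result of the QGIS spatial join: one row per school,
# carrying the pollution values of the census tract it falls in.
schools = pd.DataFrame({
    "district": ["District A", "District A", "District B", "District B"],
    "pollution_score": [4.0, 6.0, 2.0, np.nan],
    "pollution_pctl": [40.0, 60.0, 20.0, np.nan],
})

# Assumed cleaning rule: drop schools that did not match a census tract.
schools = schools.dropna(subset=["pollution_score", "pollution_pctl"])

# Loop over districts and average each pollution column with np.mean.
rows = []
for district, group in schools.groupby("district"):
    rows.append({
        "district": district,
        "avg_pollution_score": np.mean(group["pollution_score"]),
        "avg_pollution_pctl": np.mean(group["pollution_pctl"]),
    })
district_avg = pd.DataFrame(rows)
print(district_avg)
```

Each row of the resulting table is one district, which is the level at which the map reports pollution values.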
Data Team: This data set was originally compiled by a team of University of California at Berkeley (UCB) Data Discovery Interns in Fall 2022, and has been updated annually.
DATA VISUALS AND METRICS EXPLANATIONS
Visuals Overview
This video will show how to use the interactive features in the following visualizations (map and graph).
The map below includes data for school districts across the state of California and draws on several data sources, including Ed-Data and CalEnviroScreen. To learn more about these metrics, see the explanations below the map.
DATA FILTERING AND COLOR KEY:
Color Key: In general, scores closer to 0% are indicated with darker green, and scores closer to 100% are indicated with red.
Calculated Average: Use the filter dropdowns to include or exclude individual equity indicators in the calculated average. To see a single equity indicator, select "no" for all other indicators in the dropdowns.
Filtering: Use the slide bars to view only the districts within the selected range. Since % Free or Reduced Meals and % English Learners are part of the % Unduplicated, the slide bars make it possible to see which districts have higher or lower percentages of those demographics without double counting those equity indicators.
Hovering for More Data: Hover over a district to see other data, such as district enrollment and expense per ADA (the amount the district spends per pupil).
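For readers working with the underlying data rather than the map, the dropdown and slide-bar behavior described above can be approximated as in the sketch below. The column names and values are hypothetical, and treating the calculated average as an unweighted mean of the selected indicators is an assumption, since the page does not state the weighting.

```python
import pandas as pd

# Hypothetical district-level indicators, each on a 0-100 scale.
districts = pd.DataFrame({
    "district": ["District A", "District B", "District C"],
    "pct_unduplicated": [80.0, 40.0, 60.0],
    "pct_bipoc": [90.0, 30.0, 60.0],
    "pct_special_ed": [10.0, 20.0, 15.0],
    "pollution_pctl": [70.0, 20.0, 45.0],
})

# Mirrors the "yes"/"no" dropdowns: True keeps an indicator in the average.
include = {
    "pct_unduplicated": True,
    "pct_bipoc": True,
    "pct_special_ed": False,
    "pollution_pctl": True,
}
selected = [col for col, keep in include.items() if keep]

# Assumed definition: unweighted mean of the selected indicators.
districts["calculated_avg"] = districts[selected].mean(axis=1)

# Mirrors a slide bar: keep only districts within the chosen range (inclusive).
low, high = 50.0, 100.0
in_range = districts[districts["pct_unduplicated"].between(low, high)]
print(in_range[["district", "calculated_avg"]])
```

Setting every flag except one to False reproduces the single-indicator view.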
Metrics Explanations
Already Existing Metrics and Data:
LCFF Unduplicated Percent: A metric ranging from 0 - 100 that captures student needs in a local school district. This is the district's Local Control Funding Formula unduplicated percentage, or the percentage of students that fall into at least one of these categories: a) low-income, b) foster youth, or c) English learners. Learn more about LCFF and Unduplicated pupil counts here.
% BIPoC Students: The percentage of non-white students in a school district. Includes multiracial students and those who did not respond.
% Students Receiving Special Education Services: Percentage of students with disabilities qualifying for special services. Learn more about Special Education Services and the students who qualify here.
Pollution Burden Score: The CalEnviroScreen data in this data set was calculated through a geospatial join of CalEnviroScreen's census tract data with district boundary data. For a given census tract, the Pollution Burden score is calculated as described below. This description is taken directly from the CalEnviroScreen 4.0 Report:
The percentiles for all the individual indicators in a component are averaged. This becomes the score for that component (see image to the right for indicators). When combining the Exposures and Environmental Effects components, the Environmental Effects score was weighted half as much as the Exposures score. This was done because the contribution to possible pollutant burden from the Environmental Effects component was considered to be less than those from sources in the Exposures component.
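The weighting described above can be worked through for a single census tract. The sketch below uses a small, illustrative subset of indicators with made-up percentile values; it covers only the averaging and half-weighting steps from the quoted description and leaves out any later rescaling.

```python
import numpy as np

# Made-up indicator percentiles for one hypothetical census tract.
# Only a subset of indicators is shown, for illustration.
exposures = {"ozone": 80.0, "pm25": 60.0, "traffic": 40.0}
environmental_effects = {"cleanup_sites": 30.0, "solid_waste": 50.0}

# Component score = average of the indicator percentiles in that component.
exposures_score = np.mean(list(exposures.values()))
effects_score = np.mean(list(environmental_effects.values()))

# Environmental Effects counts half as much as Exposures,
# so the weights are 1.0 and 0.5 and the total weight is 1.5.
pollution_burden = (1.0 * exposures_score + 0.5 * effects_score) / 1.5
print(round(pollution_burden, 2))
```

With these values the Exposures component averages 60 and the Environmental Effects component averages 40, so the combined value is (60 + 20) / 1.5, or about 53.3.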
Learn more about the indicators included in this initiative in the Glossary of Indicators.
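As an illustration of why the LCFF percentage is called "unduplicated", the sketch below counts each student once even when the student falls into more than one category. The student records and column names are hypothetical.

```python
import pandas as pd

# Hypothetical student records; a student may fall into several categories.
students = pd.DataFrame({
    "student_id": [1, 2, 3, 4, 5],
    "low_income": [True, True, False, False, False],
    "foster_youth": [False, True, False, False, False],
    "english_learner": [True, False, True, False, False],
})
categories = ["low_income", "foster_youth", "english_learner"]

# A student is counted once if at least one category applies.
unduplicated = students[categories].any(axis=1)
unduplicated_pct = 100.0 * unduplicated.sum() / len(students)

# Summing the categories separately would double count students 1 and 2.
duplicated_count = int(students[categories].sum().sum())
print(unduplicated_pct, duplicated_count)
```

Here three of five students qualify, giving 60%, even though the three category columns contain five "True" flags in total.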
The graph below compares the average pollution burden percentile score with the average percent of unduplicated students in each county. Pollution burden is on the y-axis and unduplicated students are on the x-axis. Each circle represents a county; its size represents the % of BIPoC students and its color represents the CalEnviroScreen (CES) percentile (a combination of pollution burden and population characteristics).
The counties in the upper right quadrant are above average in both pollution burden and percent of unduplicated students, indicating greater combined need than other areas in California.
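The same upper-right selection can be reproduced from a county table, as in the sketch below. The county names, column names, and values are hypothetical, and splitting the quadrants at the mean of the plotted counties is an assumption about how "above average" is defined on the graph.

```python
import pandas as pd

# Hypothetical county-level averages for the two graph axes.
counties = pd.DataFrame({
    "county": ["County A", "County B", "County C", "County D"],
    "pct_unduplicated": [80.0, 40.0, 70.0, 30.0],  # x-axis
    "pollution_pctl": [75.0, 60.0, 35.0, 20.0],    # y-axis
})

# Assumed quadrant boundaries: the mean of each axis across counties.
x_mean = counties["pct_unduplicated"].mean()
y_mean = counties["pollution_pctl"].mean()

# Upper right quadrant: above average on both axes.
upper_right = counties[
    (counties["pct_unduplicated"] > x_mean)
    & (counties["pollution_pctl"] > y_mean)
]
print(upper_right["county"].tolist())
```

A county that is high on only one axis, such as County B or County C here, falls outside the quadrant.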
Key Takeaways
While each district has its own equity factors to consider, there are some trends that show areas of higher need throughout our state.
Unduplicated students are widely distributed across all regions of California.
Pollution burden tends to be higher in the Central Valley and greater Los Angeles areas.
Sensitive populations and socioeconomic issues are spread more widely throughout the state, with the highest scores running from the Central Valley down through Southern California. Unlike pollution burden, population characteristic scores are also high in the northern and far-northern regions of California.
When looking at all of the factors together, there are a few counties that have high percentages of each equity indicator (such as Tulare, Kings, Fresno, and Merced). These counties could potentially use additional funding and support to address pollution burden and equity gaps.