High School Matriculation & Attendance based on Average Household Income of Neighbourhood

Analyze NYC Schools data (Overall Score, Graduation Rates, Attendance, Enrollment) with the socioeconomic of neighbouring areas in which the school resides and reference this data for the year 2010-11

Datasets

Resources:

Citizen's Committee for Children - https://data.cccnewyork.org/

NYC Open Data - https://opendata.cityofnewyork.us/

School Progress Reports (2010-11)

Peer indexes & Overall Scores are calculated differently depending on School Level. Schools are only compared to other schools in the same School Level.

(e.g., Elementary, K-8, Middle, High, Transfer)

https://data.cityofnewyork.us/Education/2010-2011-School-Progress-Reports-All-Schools/yig9-9zum

Average Household Incomes by School District (2010)

Average Household Incomes (USD) calculated for each school district for the year 2010. Incomes were segregated in All Households, Families, Families with Children, Families without Children etc. based on School Districts.

https://data.cccnewyork.org/data/table/66/median-incomes#66/107/8/abbr/u

School Attendance & Enrollment by School District (2010-11)

Attendance and Enrollment statistics broken down by School District.

https://data.cityofnewyork.us/Education/2010-2011-School-Attendance-and-Enrollment-Statist/7z8d-msnt

Graduation Rates by School District (2011)

Graduation Rates of students passed out from 2011, graduated after 4 & 6 years respectively.

https://data.cccnewyork.org/data/map/121/graduation-rate#121/258/5/205/99/a/a


Tools Used

Pandas : Data Pre processing, Correlation. [pandas.pydata.org]

Pandasql : Aggregating Data [github.com/yhat/pandasql]

Plotly : Geo Spatial Visualization [plotly.com]

Colaboratory: Python Notebook for Data Science Project [research.google.com/colaboratory]