Data Cleaning Process
Ensured the correct data type was used for each field:
Age and salary responses were numeric.
State responses were text, with a maximum character count of two.
Responses to the "scale from 1 to 5" question were integers.
The "yes/no" responses were consistent (multiple choice).
Range: Checked that numerical responses fell within expected ranges:
Age values were within a reasonable human range (0<, >120).
Salary responses were checked to avoid any unrealistic figures, ensuring they aligned with typical teacher salaries in the U.S.
Required Fields: Ensured that all fields were mandatory and completed.
Verified that respondents selected one response for each multi-choice question (e.g., salary funding source and cost-of-living adjustment).
Techniques Used:
Filtering: Applied filters to isolate any data errors.
Sorting: I sorted responses to check for outliers, such as unusually high or low salary figures or age values.
Searching: Used the search function to identify duplicates.
Data Affected:
Fortunately only one response needed corrections. The salary was 0 which is an unrealistic response, the response was excluded.
Cleaned Data
Data Summary
This summary table displays the average salary of teachers in each state by title. The last column in the table gives the average salary for all teachers in each state. The data here was used to create this chart showing average teacher salary by state.
This summary table of responses to the Google Form displays respondent's idea of a fair starting salary with whether or not they believe teacher salaries should be adjusted. It shows that people who disagree with teacher salary increases think that teacher starting salaries should be lower than those who agree with salary increases.