Skip to content

Analysis to provide a School District with student performance data using Jupyter Notebook and Pandas.

Notifications You must be signed in to change notification settings

Karenjakins/School_District_Analysis

Repository files navigation

School District Analysis

Overview of Project

The purpose of this analysis is to provide a School District with the performance data of students in several high schools in math and reading to visualize the data and decide where funding is needed to improve the scores of all high schools.

The data was also altered to remove academic dishonesty and ensure the results reflect the real performance of all students in the district. 

Results

District Summary

  • The overall results of the district analysis were not severely affected as can be seen on the screenshots below, the average math score before the grades were taken off the calculations was 79% and after 78.9%. The reading scores remained the same for both data sets at an 81.9%.

    alt text alt text

School Summary

  • The overall results of the school analysis were not severely affected as can be seen on the screenshots below.

    alt text alt text

Results after Thomas High School data was nullfied

  • After removing the data from Thomas High School its is eveident that it does not affect the overall scores for the district as the change in the data is not significant. The only area for concern was the reading and math scores for 9th grades as can be seen below with the value NaN:

    alt text

  • The same trend can be observed on the scores by school spending, school type and school size, the data does not provide any significant change after having nullfied the data.

Summary

After comparing the data from both analysis, its hard to find major changes in the school district analysis since even after adding the NaNs to the data frame, the results were not significantly different. The passing percentages for both math and reading were not impacted by more than a grade point for the average. The values that were replaced by NaNs did not create any significant statistical difference to the results, so there could be more research conducted to see if how the results of the grade 9th student affected math and reading proficiences at Thomas High School.

Resources

Data Source: students_complete.csv, school_complete.csv

Software: Python 3.6.1, JupyterNotebook, Anaconda

About

Analysis to provide a School District with student performance data using Jupyter Notebook and Pandas.

Topics

Resources

Stars

Watchers

Forks