PyCitySchools Challenge

Overview of Project

Original Project

Maria, the chief data scientist for a city school district, asked for my assistance in preparing an analysis of testing proficiency across the district. I helped Maria analyze data on student funding and students’ standardized test scores. The project used Anaconda and Jupyter Notebook, along with the Pandas library, a widely used tool for data analysis. I created and activated a development environment for the project, and I created and cloned a GitHub repository for it. Maria walked me through a series of exercises working with the data, Anaconda, Jupyter Notebook, and the Pandas library. These included opening and inspecting CSV files containing data on the school district; cleaning the data; creating and merging DataFrames; filtering the data for specific analyses; and developing reports for the school board.
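As a rough illustration of the merging step, here is a minimal sketch using small in-memory tables in place of the project's CSV files. The column names (school_name, type, budget, student_name, math_score) and all values are assumptions made up for this example, not taken from the project data.

```python
import pandas as pd

# Made-up stand-ins for the school and student CSV files (hypothetical columns and values).
schools = pd.DataFrame({
    "school_name": ["Thomas High School", "Example High School"],
    "type": ["Charter", "District"],
    "budget": [1000000, 2000000],
})
students = pd.DataFrame({
    "student_name": ["A. Student", "B. Student", "C. Student"],
    "school_name": ["Thomas High School", "Thomas High School", "Example High School"],
    "math_score": [88, 72, 64],
})

# A left merge keeps every student row and attaches the matching school's columns.
merged = pd.merge(students, schools, on="school_name", how="left")
print(merged)
```

With real files, the two DataFrames would typically come from pd.read_csv instead of being built by hand.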

Revision of the Project

The school board subsequently notified Maria and her supervisor that the student_complete.csv file showed evidence of academic dishonesty: reading and math grades for Thomas High School ninth graders appeared to have been altered. Although the school board did not know the full extent of the academic dishonesty, they wanted to uphold state-testing standards and turned to Maria for help. Maria asked me to replace the math and reading scores for Thomas High School ninth graders with NaNs while keeping the rest of the data intact. Once I had replaced the scores, Maria asked me to repeat the earlier school district analysis and describe how the changes affected the overall analysis. My code can be found in the "PyCitySchools_Challenge.ipynb" file.
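One common way to do this kind of replacement in Pandas is a boolean mask combined with .loc. The sketch below is not copied from the notebook; the column names (school_name, grade, reading_score, math_score), the "9th" grade label, and the sample rows are assumptions for illustration.

```python
import numpy as np
import pandas as pd

# Tiny made-up sample; column names and values are hypothetical.
students = pd.DataFrame({
    "school_name": ["Thomas High School", "Thomas High School", "Other High School"],
    "grade": ["9th", "10th", "9th"],
    "reading_score": [90.0, 85.0, 70.0],
    "math_score": [88.0, 80.0, 65.0],
})

# Boolean mask selecting only Thomas High School ninth graders.
ths_ninth = (students["school_name"] == "Thomas High School") & (students["grade"] == "9th")

# Overwrite both score columns with NaN for the masked rows; all other rows are untouched.
students.loc[ths_ninth, ["reading_score", "math_score"]] = np.nan
print(students)
```

Because only the score cells are overwritten, the student rows themselves stay in the table, so student counts per school are unaffected while score-based statistics skip the missing values.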

Challenges

Throughout Maria’s data exercises and the challenge, my code often failed to run and raised errors. Sometimes the fix was simple, such as rerunning the entire notebook from the beginning rather than just the current cell. More often, though, finding the error was difficult and usually required reviewing existing code from a prior student, XGUILXR, and his “PyCitySchools_Challenge.ipynb” file. Most of my failing code had minor errors, such as a missing bracket or parenthesis. In some cases, however, the problem was larger, typically because I had not broken the code down enough. The screenshot below shows one example: I attempted to handle grades 10-12 of Thomas High School together in one line of code, and the solution was to break the work down by grade across multiple lines.
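The grade-by-grade approach might look roughly like the sketch below. It is a guess at the general shape rather than the code from the screenshot; the column names, grade labels, and scores are assumptions.

```python
import pandas as pd

# Made-up sample rows (hypothetical columns and values).
students = pd.DataFrame({
    "school_name": ["Thomas High School"] * 4,
    "grade": ["9th", "10th", "11th", "12th"],
    "math_score": [60, 75, 90, 65],
})

is_ths = students["school_name"] == "Thomas High School"

# One filter per grade, each on its own line, so a failing step is easy to spot.
tenth = students[is_ths & (students["grade"] == "10th")]
eleventh = students[is_ths & (students["grade"] == "11th")]
twelfth = students[is_ths & (students["grade"] == "12th")]

# Stack the three per-grade tables back into one.
upper_grades = pd.concat([tenth, eleventh, twelfth])
print(upper_grades)
```

Once each step works on its own, the three filters could be collapsed into a single mask such as students["grade"].isin(["10th", "11th", "12th"]), but the longer form is easier to debug.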

Results:

PyCitySchools Challenge provided the following key results:

The district summary changed slightly with the removal of the Thomas High School ninth-grade test results. The average math score for the district decreased slightly from 79.0% to 78.9%. Because the report rounds values, the summary does not show any other changes. See the original and revised District Summary below.
The school summary changed for Thomas High School, with the average math score decreasing from 83.418% to 83.351%, rounding to the nearest thousandth. However, Thomas High School’s average reading score increased slightly from 83.845% to 83.896%, again rounding to the nearest thousandth. See the original and revised Per School Summary below.
Replacing the ninth graders’ math and reading scores at Thomas High School did not affect Thomas High School’s performance relative to the other schools. The schools’ passing percentages remained unchanged when rounding the values to whole numbers.
Replacing the Thomas High School ninth-grade scores had the following effect, after rounding:
- No change in the scores by school spending.
- No change in the scores by school size.
- No change in the scores by school type.

This is understandable: removing a relatively small set of grades has a limited effect on averages and percentages computed across the district.
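To illustrate why grouped results can look unchanged after rounding, here is a small sketch with invented numbers. It relies on the fact that the Pandas mean() skips NaN values by default; the column names and scores are assumptions, not project data.

```python
import numpy as np
import pandas as pd

# Invented scores grouped by school type (hypothetical columns and values).
original = pd.DataFrame({
    "type": ["Charter"] * 5 + ["District"] * 2,
    "math_score": [85.0, 84.0, 84.0, 83.0, 86.0, 77.0, 75.0],
})

# Copy the table and blank out one Charter score, mimicking the NaN replacement.
revised = original.copy()
revised.loc[4, "math_score"] = np.nan

# mean() ignores NaN, so the Charter average is taken over the four remaining scores.
before = original.groupby("type")["math_score"].mean()
after = revised.groupby("type")["math_score"].mean()

print(before.round(0))
print(after.round(0))
```

In this toy data the Charter average moves from 84.4 to 84.0, a real change that disappears once both values are rounded to whole numbers.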

Key Reports

Maria and I generated the following key reports for the school board as part of the project:

Per School Summary
Size Summary
Spending Summary
Type Summary

Summary

The updated school district analysis, in which the ninth-grade reading and math scores at Thomas High School were replaced with NaNs, showed four changes:

The number of students with test scores in the district decreased by 461, from 39,170 to 38,709.
The school summary changed for Thomas High School, with the average math score decreasing from 83.418% to 83.351% and the average reading score increasing from 83.845% to 83.896%.
The number of Thomas High School students with test scores decreased by 461, from 1,635 to 1,174, after eliminating the ninth-grade scores.
The overall passing percentage for Thomas High School decreased from 90.948% to 90.630%, rounding to the nearest thousandth.
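The student counts above can be checked with simple arithmetic. The short sketch below uses only the figures reported in this summary; the variable names are made up for illustration.

```python
# Counts taken from the summary above.
district_before = 39170
ths_before = 1635
ninth_graders_removed = 461

# Students who still have test scores after the replacement.
district_after = district_before - ninth_graders_removed
ths_after = ths_before - ninth_graders_removed

# Share of the district's students whose scores were replaced, in percent.
share_of_district = ninth_graders_removed / district_before * 100

print(district_after, ths_after, round(share_of_district, 2))
```

The replaced scores belong to a little over 1% of the district's students, which is consistent with the district-level figures barely moving.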

