This Pandas Project helps a local school board and mayor make strategic decisions regarding future school budgets and priorities. The project analyzes the district-wide standardized test results using student's math and reading scores data, as well as various information about the schools they attend. The overarching task is to aggregate the data and showcase obvious trends in school performance as well as to draw conclusions on what factors might be propagatng these trends based on the given data.
In this sub-catergory I perform the necessary calculations and then create a high-level snapshot of the district's key metrics in a DataFrame to include the following:
- Total number of unique schools
- Total students
- Total budget
- Average math score
- Average reading score
- % passing math (the percentage of students who passed math)
- % passing reading (the percentage of students who passed reading)
- % overall passing (the percentage of students who passed math AND reading)
In this sub-catergory I perform the necessary calculations and then create a DataFrame that summarizes key metrics about each school to include the following:
- School name
- School type
- Total students
- Total school budget
- Per student budget
- Average math score
- Average reading score
- % passing math (the percentage of students who passed math)
- % passing reading (the percentage of students who passed reading) 10.% overall passing (the percentage of students who passed math AND reading)
In this sub-catergory I sort the schools by % Overall Passing in descending order and display the top 5 rows and save the results in a DataFrame called "top_schools".
In this sub-catergory I sort the schools by % Overall Passing in ascending order and display the top 5 rows and save the results in a DataFrame called "bottom_schools".
In this sub-catergory I perform the necessary calculations to create a DataFrame that lists the average math score for students of each grade level (9th, 10th, 11th, 12th) at each school.
In this sub-catergory I create a DataFrame that lists the average reading score for students of each grade level (9th, 10th, 11th, 12th) at each school.
In this sub-catergory I create a table that breaks down school performance based on average spending ranges (per student). I then use the scores above to create a DataFrame called "spending_summary" which includes the following metrics in the table:
- Average math score
- Average reading score
- % passing math (the percentage of students who passed math)
- % passing reading (the percentage of students who passed reading)
- % overall passing (the percentage of students who passed math AND reading)
In this sub-catergory I create a DataFrame called "size_summary" that breaks down school performance based on school size (small, medium, or large).
In this sub-catergory I create a DataFrame that shows school performance based on the "School Type"