Skip to content

Module 4 Pandas challenge for the Vanderbilt Data Analytics Bootcamp 2023

Notifications You must be signed in to change notification settings

A-Lounsbury/pandas-challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pandas-challenge

Andrew Lounsbury

Module 4 Pandas challenge for the Vanderbilt Data Analytics Bootcamp

Code from Stack Overflow

The cell

# Use `pd.cut` to categorize spending based on the bins.
# https://stackoverflow.com/questions/75808415/why-am-i-getting-typeerror-not-supported-between-instances-of-int-and-st
# We're trying to compare the integer bins to the budgets, which have dollar signs in them, making them strings. To avoid this, we take just the numbers in the string with .str[1:], and convert just that portion to a float. Then the comparison works. 
school_spending_df["Spending Ranges (Per Student)"] = pd.cut(school_spending_df["Per Student Budget"].str[1:].astype(float), bins=spending_bins, labels=labels)
school_spending_df

contains some code taken from this page.

Charts

The Initial Data Set

initial_df

District Summary

district summary

School Summary

school summary

Highest Performing Schools

highest performing schools

Bottom Performing Schools

bottom performing schools

Math Scores by Grade

math scores by grade

Reading Scores by Grade

reading scores by grade

Scores by School Spending

scores by school spending

Scores by School Size

scores by school size

Scores by School Type

scores by school type

Report

  • Overall, Charter schools greatly outperformed District schoools with Charter schools having an overall passing percentage of 90.43% and District schools having an overall passing percentage of 53.67%.
  • The school with the lowest number of students, Holden High School, had a high percent passing math (92.51%), an even higher percent passing reading (96.25%), and a relativley high percent overall passing (89.23%) despite its low spending rate per student (<585).
  • The school with the highest number of students had a low overall passing percentage (54.64%) despite having a higher spending range per student ($585-630) than Holden High School.

About

Module 4 Pandas challenge for the Vanderbilt Data Analytics Bootcamp 2023

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published