
royanin/collegescorecard


Unofficial CollegeScoreCard data visualization

This is the codebase of CollegeScoreCard.io, which is built with Dash by Plotly, and powered (mostly) by the official CollegeScoreCard dataset. This is a work-in-progress, but may be useful to others.

Notes

  1. The "requirements.txt" file is embarrassingly long. I plan to fix it later.

  2. After some processing steps, the official dataset is stored in two MySQL databases (or two SQLite databases). Here is the codebase to generate the databases.

  3. I plan to publish the processing steps once the related Jupyter notebooks are clean enough! For now, you can find a short description of the processes and the variables here.

  4. Most of the important data in CollegeScoreCard.io comes from the official sources, but errors might have crept in during my processing. Please do not make important life/financial decisions based on CollegeScoreCard.io or this codebase, at least for now.

  5. I highly appreciate your feedback -- please do let me know if you catch mistakes in the data analysis, find bugs, or have comments on the general usability of the website.
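
As a rough illustration of the "process, then store in SQLite" idea from note 2, here is a minimal sketch using only the Python standard library. It is not the actual pipeline used for CollegeScoreCard.io: the sample rows, the `schools` table, and the helper names `to_number` and `load_into_sqlite` are all hypothetical, and the column names (`UNITID`, `INSTNM`, `DEBT_MDN`) and the `PrivacySuppressed` marker are assumed here to mirror the official CSV.

```python
import csv
import io
import sqlite3

# Hypothetical sample standing in for a small slice of the official CSV.
SAMPLE_CSV = """UNITID,INSTNM,DEBT_MDN
1,Example College A,15000
2,Example College B,PrivacySuppressed
3,Example College C,21500
"""


def to_number(value):
    """Return a float, or None for blanks and non-numeric markers."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def load_into_sqlite(csv_text, conn):
    """Create a small 'schools' table (hypothetical schema) and fill it."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS schools ("
        "unitid INTEGER PRIMARY KEY, name TEXT, median_debt REAL)"
    )
    rows = [
        (int(r["UNITID"]), r["INSTNM"], to_number(r["DEBT_MDN"]))
        for r in csv.DictReader(io.StringIO(csv_text))
    ]
    conn.executemany("INSERT OR REPLACE INTO schools VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")  # use a file path for a persistent database
load_into_sqlite(SAMPLE_CSV, conn)

# List schools with a known median debt, lowest first.
query = (
    "SELECT name, median_debt FROM schools "
    "WHERE median_debt IS NOT NULL ORDER BY median_debt"
)
for name, debt in conn.execute(query):
    print(name, debt)
```

Storing suppressed or missing values as NULL keeps them out of sorts and comparisons instead of silently treating them as zero.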

References

  1. Dash user guide
  2. Dash/Plotly community forum
  3. Flask -- a Python-based web framework
  4. The Flask Mega-Tutorial by Miguel Grinberg
  5. Official CollegeScoreCard dataset

Background

At collegescorecard.ed.gov, run by the US Department of Education, we can see how US higher ed institutions fare on a variety of metrics. For example, one can find quantities such as the median debt of graduates, the standardized test scores required to get in, and so on. The goal of the official scorecard website is to help higher ed stakeholders (e.g., students, parents, teachers, and counselors) form a quantitative understanding of the institutions. This is a fantastic goal, especially since the dataset that powers the website is publicly available!

While the official CollegeScoreCard website is a good starting point for viewing the dataset, I wanted better ways to analyze and compare different schools, and visualize the results more interactively. Having come across Dash by Plotly recently, I decided to build CollegeScoreCard.io to incorporate some of these ideas.

A special shoutout to the Plotly team for creating the amazing Dash library that makes CollegeScoreCard.io possible!

About

Visualization of college scorecard data (from ed.gov) along with other information about colleges
