This project uses data from Data.gov 's College Scorecard https://collegescorecard.ed.gov/data/documentation/ to evaluate two methods for handling null values in unsupervised clustering: (1) imputing nulls via KNN prior to clustering and (2) clustering on complete observations and then assigning clusters to incomplete observations via KNN. Clusters are evaluated visually with pairplots and numerically with silhouette scores, where possible.
-
Notifications
You must be signed in to change notification settings - Fork 1
MetaDr0me/Clustering_engineering_schools
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Clustering engineering schools from data.gov API
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published