Skip to content

Unsupervised approach to discover patterns between NBA compensation and skillsets

Notifications You must be signed in to change notification settings

joemarlo/ML-NBA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Final project for NYU ML

Repo contains the final project for APSTA 2011 Supervised and Unsupervised Machine Learning. The project is an unsupervised approach to discover underlying patterns or groupings between NBA compensation vs. overall team skillsets. It uses K-means, hierarchical, and model-based clustering along with other techniques and tools such as principal component analysis, standardizing, scaling, and web-scraping.

To run the analyses, first ensure seasons_stats_clean.csv is within/Data. If it isn't, run Data/Clean_all_data.R which cleans the original data and outputs the .csv.

/Analyses contains the clustering analyses
/Data contains the cleaned data, cleaning scripts, and scraping scripts
/Inputs contains the raw data. Original data can be found on kaggle.com

See blog post: marlo.works/posts/nba/

About

Unsupervised approach to discover patterns between NBA compensation and skillsets

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages