Clustering to Find the Best Teams & Simulating a Data-Driven "Playoff" Between the Best WNBA Players Across Time
Photo: Angel McCoughtry of the Atlanta Dream playing against the Minnesota Lynx.
Using K-Means Clustering, I ranked all WNBA players into teams based on how well they performed compared to other players in their position.
First, I scraped all WNBA player stats from Basketball-Reference.com. I decided the best way to create ranked teams among all WNBA players was to divide all players by position, then cluster players within each position according to that position's best performance metrics, and then combine top ranking clusters into teams. I used K-Means Clustering where k = 8, therefore I created a total of eight teams. At the end of my notebook, I simulate a data-driven game between the top two teams.
I was pleased with results of my modeling process. Players were ranked according to how well they performed in multiple categories. Additionally, the number of games played was not a feature used to rank the players. Therefore, the players were ranked by their overall game strategy. After doing this project, I hope to continue exploring data from other womens sports.