An analysis on a dataset of 160k spotify tracks from 1921-2020. Done as part of Udacity's Data Science Nanodegree, project 1.
https://www.kaggle.com/yamaerenay/spotify-dataset-19212020-160k-tracks
Music is a big part of many people's lives, mine included. I love listening to songs, and I enjoy the various feelings I get from them. I'm also a huge fan of the artist Adele, and I've been listening soulfully to her heart-wrenching songs since 2013.
With these 2 goals in mind, I decided to analyze the dataset and find the answers to the following questions:
-
Are there any notable trends in recent years' popular songs?
-
Based on the dataset, how well is Adele doing compared to other singers in 2020?
-
What are the key differences between Adele and her top competitors?
1. Pandas
2. Numpy
3. Matplotlib
4. Seaborn
Specific version can be found on requirements.txt
spotify_analysis
|_data.csv # Data for analysis
|_requirements.txt # Libraries and versions used
|_udacity_project.ipynb # IPYNB notebook containing analysis code, graphs, etc.
git clone <repo>
#clone repocd <folder>
#cd to where you cloned the repopip install virtualenv
#install venv package (skip if already installed)virtualenv <name>
#create venv<name>\Scripts\activate
#activate venvpip install -r requirements.txt
#install dependencies- Run the cells of udacity_project.ipynb #Make sure to change input paths accordingly
Kaggle for providing kernels to do analysis on. Spotify for providing documentation for the data. Yamaç Eren Ay for putting together the spotify 160k track dataset.