Skip to content

Song and lyric data collected for Taylor Swift's 10 studio albums (up through Midnights)

Notifications You must be signed in to change notification settings

adashofdata/taylor_swift_data

Repository files navigation

Taylor Swift Data

This repo contains the data that I collected to do the analysis for this blog post: http://adashofdata.com/2023/03/01/a-data-scientist-breaks-down-all-10-taylor-swift-albums-the-extended-version/

You can also see the video summary of my analysis on YouTube: https://youtu.be/ZNFms8sUbTw

For fun, you can also check out a few of the Reddit threads here, here and here.

The data sets contain details about her songs from her first 10 studio albums (up through Midnights). I gathered the data through the Genius API (lyrics) and Spotify API (song metadata).

For the non-technical people

This main folder contains:

To access the data, you can click on any Taylor_Swift_Type_Data.xlsx spreadsheet and then click the Download button at the top right.

For the technical people

The Taylor_Swift_Genius folder contains:

  • Python code to extract the lyrics of Taylor Swift's songs from the Genius API
  • .csv file of the lyrics of Taylor Swift's songs
  • 10 folders containing the song lyrics for each album as .txt files

The Taylor_Swift_Spotify folder contains:

  • Python code to extract the metadata of Taylor Swift's songs from the Spotify API
  • .csv file of the metadata of Taylor Swift's songs (danceability, key, etc.)

The Taylor_Swift_Words folder contains:

  • Python code to break down her lyrics into words
  • .csv file of the words in Taylor Swift's songs

About

Song and lyric data collected for Taylor Swift's 10 studio albums (up through Midnights)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published