Skip to content
Collect, clean, and cluster song lyrics with Doc2Vec and t-SNE
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
lyrics_data
LICENSE.md
LyricsAnalyzer.py
README.md
helpers.py
params.py
run.py

README.md

Lyrics, Pt. 3: Rap Song Clustering with Doc2Vec

This project creates embeddings of song lyrics with Doc2Vec, reduces the resulting dimensionality with t-SNE, and compares artist-by-artist song clusters.

A full description of the project can be found at saisenberg.com.

Getting started

Prerequisite software

  • Python

Prerequisite libraries

  • Python:
contractions, collections, gensim, nltk, pandas, re, sklearn, string (```install any missing libraries with !pip install [library name]```)

Instructions for use

  • Change paths in params.py as appropriate.

  • Update artist dictionary in params.py as appropriate.

  • Change working directory in run.py as appropriate, and run entire file. Note that additional parameters are available for many LyricsAnalyzer methods; see LyricsAnalyzer.py for further details on available options.

Author

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Acknowledgements

You can’t perform that action at this time.