LyricsAudioBoost

Combining BERT and Spotify Valence Feature For Track Sentiment Analysis

In this notebook I combine Spotify audio feature and BERT word embedding to predict tracks sentiments. I use hugginface pre-trained BERT transformer as an embedding layer, and train an additional bidirectional GRU layer for the sentiment analysis regression task (point prediction in range [0-1]). To train the fine-tunning layer of the model I use Spotify valence attribute which I added to a lyrics dataset.

Motivation:

The examples below use NLTK demo and Spotify valence to measure a track's positivenesss. They demonstrate that using strictly audio OR lyrics might be inaccurate.

Positive Sentiment Example: Baz Luhrmann - Everybody's Free To Wear Sunscreen.
- NLTK sentiment classification: Negative.
- Spotify Valence: 0.8.
Negative Sentiment Example: Otis Redding- Mr. pitiful.
- NLTK sentiment classification: Negative.
- Spotify Valence: 0.9.

Steps to build model:

Database: gathering songs lyrics, adding Spotify valence attribute and pre-processing. I uploaded to Kaggle the final 150K Lyrics Labeled with Spotify Valence Dataset.
Model Design: Iteratively improved model capacity.
Evaluation: loss and accuracy metrics across 3 buckets - negative, neutral and positive sentiments.
Interpretation: Understanding what the model is learning using word clouds.

Example:

Words in the word cloud are sized by their respective difference on the model's prediction, and their positive (green) or negative (red) influence.

Positive Sentiment Example: Armin Van Buuren- Blah Blah Blah.

NLTK sentiment classification: Negative.
Spotify Valence: 0.18.
LyricsAudioBoost Model: 0.76.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
README.md		README.md
Spotify_Dataset.ipynb		Spotify_Dataset.ipynb
Tracks_Sentiment_Analysis.ipynb		Tracks_Sentiment_Analysis.ipynb
blah_good.png		blah_good.png
diversified.xlsx		diversified.xlsx
helpers.py		helpers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Spotify_Dataset.ipynb

Spotify_Dataset.ipynb

Tracks_Sentiment_Analysis.ipynb

Tracks_Sentiment_Analysis.ipynb

blah_good.png

blah_good.png

diversified.xlsx

diversified.xlsx

helpers.py

helpers.py

Repository files navigation

LyricsAudioBoost

Combining BERT and Spotify Valence Feature For Track Sentiment Analysis

Motivation:

Steps to build model:

Example:

About

Releases

Packages

Languages

EdenBD/lyrics-sentiment

Folders and files

Latest commit

History

Repository files navigation

LyricsAudioBoost

Combining BERT and Spotify Valence Feature For Track Sentiment Analysis

Motivation:

Steps to build model:

Example:

About

Resources

Stars

Watchers

Forks

Languages