A text classification model on song lyrics with an automated process of scraping for the artist and their song lyrics using Python packages of Selenium and Beautiful Soup.


shindesimantini6/lyrics-classifier

Lyrics Classifier

A text classification model built with logistic regression on song lyrics, with automated scraping of artists and their song lyrics using the Python packages Selenium and Beautiful Soup. The user is asked to input at least one artist name. As the code runs, the user is also asked whether they would like to download the song URLs and either all or a set number of songs for that artist. Once the songs are downloaded, logistic regression is applied to the lyrics and its accuracy is measured on the training and test data.
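The link-collection step can be sketched roughly as follows. This is a minimal illustration, not the repository's scraper: the HTML snippet, the `song` CSS class, the `example.com` base URL, and the `extract_song_links` helper are all hypothetical, and the real project would first obtain the page source (for instance through Selenium) from whichever lyrics site it targets.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

# Hypothetical page source; a real run would get this from the browser
# session or an HTTP response for the artist's page.
SAMPLE_HTML = """
<ul>
  <li><a class="song" href="/lyrics/song-one">Song One</a></li>
  <li><a class="song" href="/lyrics/song-two">Song Two</a></li>
  <li><a class="song" href="/lyrics/song-three">Song Three</a></li>
</ul>
"""


def extract_song_links(html, base_url, limit=None):
    """Return (title, absolute_url) pairs, optionally capped at `limit` songs."""
    soup = BeautifulSoup(html, "html.parser")
    links = [
        (a.get_text(strip=True), urljoin(base_url, a["href"]))
        for a in soup.select("a.song[href]")  # assumed markup, see lead-in
    ]
    return links if limit is None else links[:limit]


# "All songs" versus "a number of songs" maps onto the optional limit.
all_songs = extract_song_links(SAMPLE_HTML, "https://example.com")
first_two = extract_song_links(SAMPLE_HTML, "https://example.com", limit=2)
print(all_songs)
print(first_two)
```

The optional `limit` mirrors the choice offered to the user between downloading every song and downloading only a number of them.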

Before logistic regression, CountVectorizer is applied to tokenise the lyrics into token counts, and TfidfTransformer then reweights those counts by TF-IDF.
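The vectorise, reweight, and classify steps could be chained in a scikit-learn pipeline along these lines. This is only a sketch under stated assumptions: the lyric lines and artist labels are made-up placeholders rather than scraped data, and the split ratio, `random_state`, and `max_iter` values are illustrative choices, not settings taken from the repository.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

# Placeholder corpus standing in for scraped lyrics (invented lines).
lyrics = [
    "dancing all night under neon lights",
    "neon lights and dancing in the city",
    "the city never sleeps we dance all night",
    "dance with me under the city lights",
    "rain on the river and a lonesome road",
    "lonesome road back to the old river",
    "old guitar crying by the river in the rain",
    "the rain keeps falling on this dusty road",
]
artists = ["artist_a"] * 4 + ["artist_b"] * 4  # hypothetical labels

X_train, X_test, y_train, y_test = train_test_split(
    lyrics, artists, test_size=0.25, random_state=42, stratify=artists
)

# CountVectorizer -> raw token counts, TfidfTransformer -> TF-IDF weights,
# LogisticRegression -> artist prediction.
model = make_pipeline(
    CountVectorizer(),
    TfidfTransformer(),
    LogisticRegression(max_iter=1000),
)
model.fit(X_train, y_train)

train_accuracy = model.score(X_train, y_train)
test_accuracy = model.score(X_test, y_test)
print(f"train accuracy: {train_accuracy:.2f}, test accuracy: {test_accuracy:.2f}")
```

Keeping the vectoriser inside the pipeline means the vocabulary and IDF weights are learned from the training split only, so the test accuracy is not inflated by information from held-out lyrics.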


Requirements

  • Python 3.8 and above
  • Selenium

Usage

  • Run the main.py file with at least one artist name, given as a string.
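One way a script might accept one or more artist names is shown below. The README does not show how main.py reads its input, so this is an assumption: the `parse_args` helper and the positional `artists` argument are hypothetical, and the artist names are placeholders.

```python
import argparse


def parse_args(argv=None):
    """Parse one or more artist names (hypothetical interface)."""
    parser = argparse.ArgumentParser(
        description="Scrape lyrics for the given artists and train a classifier."
    )
    # nargs="+" enforces the "at least one artist" requirement.
    parser.add_argument("artists", nargs="+", help="artist name(s), quoted if they contain spaces")
    return parser.parse_args(argv)


# Passing an explicit list here simulates command-line arguments.
args = parse_args(["Artist One", "Artist Two"])
print(args.artists)
```

With `nargs="+"`, calling the script without any artist name makes argparse exit with a usage error instead of continuing with an empty list.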

Collaborators

  • Moritz
  • Simantini
