This repository contains a Music Genre Classification project, which includes Data Collection, Feature Extraction, Training, and Deployment.
With a virtual environment, you can install all the requirements in one go. To do so, follow these steps:
- Create a virtual environment
python -m venv env
- Activate the virtual environment
Windows: ./env/Scripts/activate
Linux: source env/bin/activate
- Install requirements
pip install -r requirements.txt
Data is collected in a four-step process.
- Song names are fetched from the Spotify API according to the provided genres.
- YouTube video URLs are fetched from the YouTube API using the song names.
- Songs are downloaded using those YouTube video URLs.
- Songs are segmented into clips of a specified duration (in seconds).
How to use
- Install required libraries
- FFmpeg (add to PATH)
- pytube
- python-dotenv
- requests
- Acquire required API Keys (Store all acquired Keys in SongCollection/.env)
- Spotify API Keys
- Log in to SpotifyDev
- Store keys as follows in .env file
SPOTIFY_CLIENT_ID = Your client id
SPOTIFY_CLIENT_SECRET = Your client secret
- Youtube API Keys
- Log in to GoogleCloudConsole
- Create a project
- Enable YouTubeAPI
- In the side tabs, go to Credentials > Create API Key
- Store keys as follows in the .env file (you can use more than one; one key can fetch URLs for about 500 songs). A minimal loading sketch follows the key examples below.
YOUTUBE_API_KEY0 = Your 1st api key
YOUTUBE_API_KEY1 = Your 2nd api key
. . .
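To check that the keys are picked up, here is a minimal sketch (the key names match the examples above; the loading code itself is illustrative and assumes python-dotenv):

# Illustrative: read the keys from SongCollection/.env with python-dotenv.
import os
from dotenv import load_dotenv

load_dotenv("SongCollection/.env")

spotify_id = os.getenv("SPOTIFY_CLIENT_ID")
spotify_secret = os.getenv("SPOTIFY_CLIENT_SECRET")

# Collect every YOUTUBE_API_KEY<n> that is present.
youtube_keys = []
i = 0
while os.getenv(f"YOUTUBE_API_KEY{i}") is not None:
    youtube_keys.append(os.getenv(f"YOUTUBE_API_KEY{i}"))
    i += 1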
- Run the program
For Song Names and URLs
- Make an APICall object as follows (a combined sketch of these steps appears after the list):
ac = APICall(genre_list, number_of_songs, number_of_youtube_keys)
Example:
ac = APICall(["classical", "rock"],50, 1)
- Get song names
ac.generate_song_list()
- Get song URLs
ac.generate_song_url()
- Access song names and URLs from the text files
song_name, song_url = ac.get_song_name_url()
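Putting the steps above together, a minimal sketch (the import path is an assumption; adjust it to the actual module in SongCollection):

# Illustrative end-to-end use of APICall as described above.
from api_call import APICall  # assumed module name

ac = APICall(["classical", "rock"], 50, 1)    # genre list, number of songs, number of YouTube keys
ac.generate_song_list()                       # fetch song names from Spotify
ac.generate_song_url()                        # fetch a YouTube URL for each song name
song_name, song_url = ac.get_song_name_url()  # read names and URLs back from the text files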
To Download and Segment
- Make a SongDownloader object as follows (a combined sketch appears after the list):
sd = SongDownloader(n_threads, segment_duration)
Example:
sd = SongDownloader(10, 30)
- Download and segment
sd.download_song(song_name, song_url)
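Putting these two steps together (again, the import path is an assumption):

# Illustrative use of SongDownloader as described above.
from song_downloader import SongDownloader  # assumed module name

sd = SongDownloader(10, 30)            # n_threads, segment_duration in seconds
sd.download_song(song_name, song_url)  # names and URLs from APICall.get_song_name_url()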
Feature Extraction makes use of Librosa to extract a total of 38 features from both the frequency domain and the time domain (a brief Librosa sketch follows the list below). The extracted features are the mean and standard deviation of:
- Amplitude Envelope
- Root Mean Squared Energy
- Zero Crossing Rate
- Band Energy Ratio
- Spectral Centroid
- Bandwidth
- 13 Mel Frequency Cepstral Coefficients (MFCCs)
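For reference, a minimal sketch of how a few of these statistics can be computed with Librosa for a single clip (the file name and dictionary keys are illustrative, not the repo's own code):

# Illustrative only: mean/std of some of the listed features for one clip.
import librosa

y, sr = librosa.load("clip.wav")  # placeholder path to one segmented clip

zcr = librosa.feature.zero_crossing_rate(y)[0]
centroid = librosa.feature.spectral_centroid(y=y, sr=sr)[0]
bandwidth = librosa.feature.spectral_bandwidth(y=y, sr=sr)[0]
rms = librosa.feature.rms(y=y)[0]
mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)  # 13 coefficients, as listed above

features = {
    "ZCR_Mean": zcr.mean(), "ZCR_Std": zcr.std(),
    "Centroid_Mean": centroid.mean(), "Centroid_Std": centroid.std(),
    "Bandwidth_Mean": bandwidth.mean(), "Bandwidth_Std": bandwidth.std(),
    "RMS_Mean": rms.mean(), "RMS_Std": rms.std(),
}
for i in range(13):
    features[f"MFCC{i + 1}_Mean"] = mfcc[i].mean()
    features[f"MFCC{i + 1}_Std"] = mfcc[i].std()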
How to use
- Complete the "How to use" steps of Data Collection
- Install required libraries
- Run the program
- Make a FeatureExtractor object as follows (a combined sketch appears after the list):
fe = FeatureExtractor(segment_duration, n_threads)
(Segment duration must be the same as in Data Collection)
Example:
fe = FeatureExtractor(30, 10)
- Extract features
fe.extract_features()
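Putting these steps together (import path assumed):

# Illustrative use of FeatureExtractor as described above.
from feature_extractor import FeatureExtractor  # assumed module name

fe = FeatureExtractor(30, 10)  # segment_duration (same as Data Collection), n_threads
fe.extract_features()          # extract the 38 features from the collected clips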
In this part, I have done some preliminary exploratory data analysis along with visualization of the different features that have been extracted.
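The kind of plot used looks roughly like the sketch below; the CSV path and column names are assumptions, not the repo's actual file layout:

# Illustrative EDA sketch; file and column names are assumed.
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

df = pd.read_csv("features.csv")   # assumed output of Feature Extraction
print(df["genre"].value_counts())  # check how imbalanced the classes are

sns.boxplot(data=df, x="genre", y="Centroid_Mean")  # one feature's distribution per genre
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()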
Different models were trained on the data, with the primary evaluation metrics being the F1 Score and the Confusion Matrix. Because the data was highly imbalanced, balancing techniques like undersampling and oversampling were used. Random Search was also employed to find the best hyperparameters for the model.
The best model was a RandomForestClassifier with a test F1 Score of 70%. This model was trained after dropping the BER_Mean and BER_Std columns, as these columns had multiple discrepancies.
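A condensed sketch of that setup (the file path, column names, and search space are assumptions; only the overall approach, oversampling plus Random Search over a RandomForestClassifier, matches the description above):

# Illustrative training sketch; names and paths are assumed.
import pandas as pd
from imblearn.over_sampling import RandomOverSampler
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import confusion_matrix, f1_score
from sklearn.model_selection import RandomizedSearchCV, train_test_split

df = pd.read_csv("features.csv")
X = df.drop(columns=["genre", "BER_Mean", "BER_Std"])  # BER columns dropped as noted above
y = df["genre"]

X_train, X_test, y_train, y_test = train_test_split(X, y, stratify=y, random_state=42)
X_train, y_train = RandomOverSampler(random_state=42).fit_resample(X_train, y_train)

search = RandomizedSearchCV(
    RandomForestClassifier(random_state=42),
    param_distributions={"n_estimators": [100, 300, 500], "max_depth": [None, 10, 20]},
    n_iter=5, scoring="f1_weighted", cv=3, random_state=42,
)
search.fit(X_train, y_train)

pred = search.best_estimator_.predict(X_test)
print(f1_score(y_test, pred, average="weighted"))
print(confusion_matrix(y_test, pred))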
The trained model was serialized to a file with pickle so it can be used in deployment.
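For example (file name assumed; search.best_estimator_ refers to the trained classifier from the sketch above):

# Save and load the trained model with pickle.
import pickle

with open("genre_model.pkl", "wb") as f:
    pickle.dump(search.best_estimator_, f)

with open("genre_model.pkl", "rb") as f:
    model = pickle.load(f)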
In Progress