Multi Speaker classification

In this project I have used supervised learning to classify multi speaker and single speaker segments from a audio file. The audio file chosen is a Republic TV debate available on YouTube.

The attributes taken for buidling the supervised learning models are :
👉 Mean and Standard deviation of audio signal over 2 sec intervals.
👉 Number and density of peaks
👉 Sub-band energy ratio

The following algorithms are used for classification:
▶️ SGDClassifier
▶️ KNN
▶️ XGBoost
▶️ Neural Networks

The best accuracy on unseen data come out to be 81% using Neaural Networks. The performance can be further imporved by adding more features using PyAudio library (Mel spectrum and other audio features)

Where can such analysis be used : ⏳ If you have spend 1 HR watching news, how much of it was really informational?

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
clip1.mp3		clip1.mp3
final_annotation.TextGrid		final_annotation.TextGrid
github_file.ipynb		github_file.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi Speaker classification

About

Releases

Packages

Languages

avinashladdha/independent_research

Folders and files

Latest commit

History

Repository files navigation

Multi Speaker classification

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages