Skip to content

avinashladdha/independent_research

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Multi Speaker classification

In this project I have used supervised learning to classify multi speaker and single speaker segments from a audio file. The audio file chosen is a Republic TV debate available on YouTube.

The attributes taken for buidling the supervised learning models are :
👉 Mean and Standard deviation of audio signal over 2 sec intervals.
👉 Number and density of peaks
👉 Sub-band energy ratio

The following algorithms are used for classification:
▶️ SGDClassifier
▶️ KNN
▶️ XGBoost
▶️ Neural Networks

The best accuracy on unseen data come out to be 81% using Neaural Networks. The performance can be further imporved by adding more features using PyAudio library (Mel spectrum and other audio features)

Where can such analysis be used : ⏳ If you have spend 1 HR watching news, how much of it was really informational?

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published