Skip to content

SER and audio classification using both a Wav2Vec2 based model and an ASR->Bert pipeline, as well as utilizing a multimodal late-fusion model

Notifications You must be signed in to change notification settings

viksit-siddhant/compare2023

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wav2Vec2Stats

This is my official entry for the ACM ComPaRe 2023 challenge, performing Audio classificaion and SER using a Wav2Vec2-based model. Run asr.py to generate transcripts for the respective datasets, wav2vec2.py to run Wav2Vec2Stats, bert.py for the rudimentary BERT-based classifier on the transcripts, and evaluate.ipynb to generate the submission CSVs

About

SER and audio classification using both a Wav2Vec2 based model and an ASR->Bert pipeline, as well as utilizing a multimodal late-fusion model

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published