zhangjxCS/Multilingual-Speech-Emotion-Recognition-System

Multilingual SER Systems

ENGI E4800 Columbia University Capstone Project Folder

team captain:

  • Jingxiang Zhang jz3313

team members:

  • Luwei Zhang lz2815
  • Ruoxi Liu rl3155
  • Wael Boukhobza wab2138
  • Shirley Gui xg2378

mentor: Akshat Gupta from JP Morgan
instructor: Sining Chen
CA: Aayush Verma

Speech emotion recognition (SER) is an important capability for systems that interact with humans. The aim of such systems is to gauge the emotions of human users. The emotion conveyed in speech depends on two factors: the linguistic content (what is said) and the paralinguistic content (how it is said). An SER system needs to incorporate both factors when deciding the emotion of input speech.

Robust SER systems exist for English. In this project, we work with multiple languages and focus on exploring methods to create SER systems for low-resource languages. One of our aims is to understand how emotions are expressed in different languages such as English, German, French, and Italian. We will use large pre-trained speech models such as wav2vec and HuBERT and study how well they are suited to speech emotion recognition in different languages. In the end, we want to create multilingual speech emotion recognition systems that work across multiple languages.
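As a rough illustration of the approach described above, the sketch below shows one common way to turn frame-level features from a pre-trained speech encoder into an utterance-level emotion prediction: mean pooling followed by a linear classifier. It is not this project's implementation. The label set `EMOTIONS`, the feature dimension, the random "encoder output", and the untrained weights are all assumptions made up for the example; in practice the frames would come from a model such as wav2vec or HuBERT and the weights would be learned.

```python
import math
import random

# Hypothetical label set, chosen only for this illustration.
EMOTIONS = ["neutral", "happy", "sad", "angry"]


def mean_pool(frames):
    """Average frame-level feature vectors into one utterance-level vector."""
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    highest = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - highest) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(frames, weights, bias):
    """Pool the frames, apply a linear layer, and return (label, probabilities)."""
    pooled = mean_pool(frames)
    logits = [
        sum(w * x for w, x in zip(row, pooled)) + b
        for row, b in zip(weights, bias)
    ]
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return EMOTIONS[best], probs


# Stand-in data: random values in place of real encoder output (assumption).
rng = random.Random(0)
dim = 8  # real encoders typically produce much larger feature vectors
frames = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(50)]

# Untrained, randomly initialized classifier head (assumption).
weights = [[rng.gauss(0, 0.1) for _ in range(dim)] for _ in EMOTIONS]
bias = [0.0] * len(EMOTIONS)

label, probs = classify(frames, weights, bias)
print(label, [round(p, 3) for p in probs])
```

Because the weights here are random, the printed label carries no meaning; the point is only the data flow from variable-length frame features to a fixed-size vector to a distribution over emotion classes.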

  1. add every team member, team captain, sponsor/mentor, instructor, and CA's name to your readme markdown file
  2. add at least one paragraph describing the project you are developing.
  3. rename your project repo into something meaningful and related to your project name
