Just a couple of questions #1

Open
mirix opened this issue Aug 8, 2023 · 1 comment

mirix commented Aug 8, 2023

Hi,

I see you have several models for Speech Emotion Recognition (SER).

Would you say Vesper is the best?

I have also noticed that you use acted databases for training. In your experience, does learning transfer well between acted and so-called natural databases?

I was considering training a unimodal (audio-only) model on CMU-MOSEI to check whether training on a natural database would produce a model that performs better in real-life scenarios.

What do you think?

Of course, one could argue that a significant percentage of the utterances from YouTube are also acted and do not reflect real emotions, in which case it would be better to go with professional actors than with amateur YouTubers...

Best,

Ed

mirix commented Aug 10, 2023

I have forked MOSEI to build a unimodal SER dataset:

https://github.com/mirix/messaih/tree/main
