Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Speech to Speech #71

Open
GeorgeS2019 opened this issue Apr 29, 2022 · 2 comments
Open

Speech to Speech #71

GeorgeS2019 opened this issue Apr 29, 2022 · 2 comments

Comments

@GeorgeS2019
Copy link

GeorgeS2019 commented Apr 29, 2022

For your looking-ahead inspiration: speech_to_speech

@kaiidams
Copy link
Owner

Thanks. They use k-mean clustered audio and seq2seq to translate them to translate Spanish-English. k-mean clustered audio can be used to replace CMU phonemes in Voice100. For Speech-to-Speech translation, I'm not sure it is good enough with a small model for mobiles.

@GeorgeS2019
Copy link
Author

image

Is it challenging to do this for German using NeMoOnnxSharp?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants