Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

poor performance on seen-to-unseen task while finetuning on Hindi language #79

Open
rgenai opened this issue May 16, 2023 · 2 comments
Open

Comments

@rgenai
Copy link

rgenai commented May 16, 2023

Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoints as the foundation, specifically on a Hindi dataset. While I've achieved impressive results in seen-to-seen and unseen-to-seen tasks, with a remarkable 95% match, I'm eager to enhance the performance in the seen-to-unseen task. Presently, I'm encountering a moderate 60% match when working with the reference speaker for unseen-to-unseen and seen-to-unseen tasks. I would greatly appreciate any insights or suggestions you have to improve these results further.

@EmreOzkose
Copy link

Hi @MuruganR96 , how did you train with another language ? Did you train wavlm ?

@mm3509
Copy link

mm3509 commented Dec 3, 2023

Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoints as the foundation, specifically on a Hindi dataset. While I've achieved impressive results in seen-to-seen and unseen-to-seen tasks, with a remarkable 95% match, I'm eager to enhance the performance in the seen-to-unseen task. Presently, I'm encountering a moderate 60% match when working with the reference speaker for unseen-to-unseen and seen-to-unseen tasks. I would greatly appreciate any insights or suggestions you have to improve these results further.

Hi @MuruganR96 , I want to do what you did and fine-tune FreeVC on a non-English dataset. Your results of 95% match on seen-to-seen would be perfect for my use case. Can you please provide guidance or share your code?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants