Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to run training script of Natural Speech 2 #43

Open
dongngm opened this issue Dec 19, 2023 · 8 comments
Open

Unable to run training script of Natural Speech 2 #43

dongngm opened this issue Dec 19, 2023 · 8 comments
Assignees
Labels
bug Something isn't working Status: in progress

Comments

@dongngm
Copy link

dongngm commented Dec 19, 2023

Hi,

I ran into multiple issues trying to run the training script:
In ns2_dataset.py:

  • self.utt2phone[utt] = utt_info["phones"]: where phones comes from? I suspect we need to run the phonemizer first? but I don't see extract_phone=True in the config file
  • utt_info["num_frames"] is utt_info["Duration"], right?

In exp_config_base.json:

  • use_code=true, use_pitch=true, use_phone, should extract_acoustic_token=true, extract_pitch=true, extract_phone=true also?
  • There seems to be some mismatch between tts/preprocessing.py and the config file. For example: code_dir should be acoustic_token_dir?
@RMSnow RMSnow assigned RMSnow and HeCheng0625 and unassigned RMSnow Dec 19, 2023
@HeCheng0625
Copy link
Collaborator

It has some differences for the data processing for NS2 between other TTS. We will update the data processing section as soon as possible.

@vn09
Copy link

vn09 commented Dec 22, 2023

Hi @HeCheng0625 ,

I hope this message finds you well. I understand that these things take time and effort, and I appreciate the work you're putting into it.

If possible, could you please provide an estimated timeline for when we might expect the update?

@HeCheng0625
Copy link
Collaborator

HeCheng0625 commented Dec 22, 2023

Hi, we will update a new checkpoint and data processing pipeline on a large dataset (> 1 w hours) in about two weeks. Now, we only use libritts to train the model.
Now, we use our pretrained model on libritts: https://huggingface.co/amphion/naturalspeech2_libritts
Or, try the toy demo: https://huggingface.co/spaces/amphion/NaturalSpeech2

@vn09
Copy link

vn09 commented Dec 22, 2023

Thanks @HeCheng0625.

@lmxue lmxue added the bug Something isn't working label Jan 6, 2024
@vn09
Copy link

vn09 commented Jan 6, 2024

Hi @HeCheng0625 ,
I just wanted to hear from you if there have been any updates on the data processing pipeline.

@shreeshailgan
Copy link

Any updates on the data preprocessing pipeline?

@CreepJoye
Copy link

Hello,@dongngm
I encountered the same issue and have the same confusion. Do you have a solution to this problem?
Any advice will be appreciated!

@chazo1994
Copy link

@HeCheng0625 @RMSnow Do you have any updates on the preprocessing pipeline for neuralspeech2?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Status: in progress
Projects
None yet
Development

No branches or pull requests

8 participants