Unable to run training script of Natural Speech 2 #43

dongngm · 2023-12-19T10:37:21Z

Hi,

I ran into multiple issues trying to run the training script:
In ns2_dataset.py:

self.utt2phone[utt] = utt_info["phones"]: where phones comes from? I suspect we need to run the phonemizer first? but I don't see extract_phone=True in the config file
utt_info["num_frames"] is utt_info["Duration"], right?

In exp_config_base.json:

use_code=true, use_pitch=true, use_phone, should extract_acoustic_token=true, extract_pitch=true, extract_phone=true also?
There seems to be some mismatch between tts/preprocessing.py and the config file. For example: code_dir should be acoustic_token_dir?

The text was updated successfully, but these errors were encountered:

HeCheng0625 · 2023-12-19T11:32:17Z

It has some differences for the data processing for NS2 between other TTS. We will update the data processing section as soon as possible.

vn09 · 2023-12-22T07:32:18Z

I hope this message finds you well. I understand that these things take time and effort, and I appreciate the work you're putting into it.

If possible, could you please provide an estimated timeline for when we might expect the update?

HeCheng0625 · 2023-12-22T08:52:02Z

Hi, we will update a new checkpoint and data processing pipeline on a large dataset (> 1 w hours) in about two weeks. Now, we only use libritts to train the model.
Now, we use our pretrained model on libritts: https://huggingface.co/amphion/naturalspeech2_libritts
Or, try the toy demo: https://huggingface.co/spaces/amphion/NaturalSpeech2

vn09 · 2023-12-22T10:11:53Z

vn09 · 2024-01-06T05:37:54Z

Hi @HeCheng0625 ,
I just wanted to hear from you if there have been any updates on the data processing pipeline.

shreeshailgan · 2024-04-01T10:21:47Z

Any updates on the data preprocessing pipeline?

CreepJoye · 2024-05-20T08:48:35Z

Hello，@dongngm
I encountered the same issue and have the same confusion. Do you have a solution to this problem?
Any advice will be appreciated!

chazo1994 · 2024-06-11T16:51:25Z

@HeCheng0625 @RMSnow Do you have any updates on the preprocessing pipeline for neuralspeech2?

RMSnow assigned RMSnow and HeCheng0625 and unassigned RMSnow Dec 19, 2023

lmxue added the bug Something isn't working label Jan 6, 2024

lmxue added the Status: in progress label Jan 6, 2024

dongngm mentioned this issue Feb 2, 2024

Natural Speech2 Training Speed #120

Closed

Provide feedback