The generated wav is not good #14

Open
pangtouyuqqq opened this issue Nov 30, 2022 · 1 comment

pangtouyuqqq commented Nov 30, 2022

Hi, thank you for open-sourcing this wonderful work!
I followed your instructions: 1) installed lightconv_cuda, 2) downloaded the checkpoint, and 3) downloaded the speaker embedding npy.
However, the generated audio does not sound good.

Below is the command I ran (the contents of run.sh), followed by its output:

python3 synthesize.py \
  --text "Hello world" \
  --speaker_id Actor_22 \
  --emotion_id sad \
  --restore_step 450000 \
  --mode single \
  --dataset RAVDESS
# sh run.sh 
2022-11-30 13:45:22.626404: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.11.0
Device of XSpkEmoTrans: cuda
Removing weight norm...
Raw Text Sequence: Hello world
Phoneme Sequence: {HH AH0 L OW1 W ER1 L D}

ENV

python 3.6.8
fairseq                 0.10.2
torch                   1.7.0+cu110
CUDA 11.0

Hello world_Actor_22_sad

Hello world_Actor_22_sad.wav.zip

Owner

keonlee9420 commented Dec 1, 2022

Hi @pangtouyuqqq, thanks for your attention. This happens because the RAVDESS dataset contains only two different texts, so the model has not learned to pronounce anything else (you will get more natural output if you try one of those two). If you need to synthesize unseen text, it may help to train on another dataset with more generic text-speech pairs. When you do that, it would also help to replace the light convolution with a transformer.
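
As a rough illustration of the point above, the following Python sketch checks whether an input text is one of the two RAVDESS statements and builds the matching `synthesize.py` command. The flags are copied from the command in the original post. The helper names (`is_seen_text`, `build_command`) are hypothetical and not part of this repository, and the two sentences are the standard RAVDESS statements, assumed here to be the texts the checkpoint was trained on.

```python
import shlex

# The two statements spoken in RAVDESS (assumed to be the only
# texts the released checkpoint has seen during training).
RAVDESS_TEXTS = (
    "Kids are talking by the door",
    "Dogs are sitting by the door",
)


def _normalize(text):
    """Lowercase, drop surrounding punctuation, collapse whitespace."""
    return " ".join(text.lower().strip(" .!?").split())


def is_seen_text(text):
    """Return True if `text` matches one of the two RAVDESS statements."""
    return _normalize(text) in {_normalize(t) for t in RAVDESS_TEXTS}


def build_command(text, speaker_id="Actor_22", emotion_id="sad",
                  restore_step=450000):
    """Build the synthesize.py call using the flags from the original post."""
    args = [
        "python3", "synthesize.py",
        "--text", text,
        "--speaker_id", speaker_id,
        "--emotion_id", emotion_id,
        "--restore_step", str(restore_step),
        "--mode", "single",
        "--dataset", "RAVDESS",
    ]
    # Quote each argument so the text survives the shell as one token.
    return " ".join(shlex.quote(a) for a in args)


if __name__ == "__main__":
    for text in ("Hello world", RAVDESS_TEXTS[0]):
        if is_seen_text(text):
            status = "seen in training"
        else:
            status = "unseen, expect degraded audio"
        print(f"{text!r}: {status}")
        print(build_command(text))
```

Running the printed command for one of the two statements should be a quick way to confirm that the setup itself works before retraining on a larger corpus.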
