Pytorch synthesizer #472

ghost · 2020-08-06T15:01:43Z

I have taken the Tacotron model from fatchord/WaveRNN and integrated it into this repo (#447). Aside from the new synthesizer model format (.pt), this change should be completely transparent to the end user.

Major Changes

Toolbox no longer requires tensorflow 🎉
Synthesizer is tacotron1 instead of tacotron2

Pretrained Model

A download link and instructions are provided here: #472 (comment)

ghost · 2021-02-07T17:41:48Z

Development of the pytorch synthesizer is complete. Please review the changes.

The pretrained model release consists of the pretrained encoder, along with synthesizer and vocoder models I have developed. Audio samples and model details: https://blue-fish.github.io/experiments/RTVC-7.html

CorentinJ · 2021-02-08T08:21:56Z

Wow, amazing work. I'll do my best to find the time to review within this week.

Garvit-32 · 2021-02-11T07:45:58Z

Hi @blue-fish, amazing work!
Can you tell me why you are using tacotron1 instead of tacotron2? Have you tested tacotron2?

ghost · 2021-02-11T09:31:10Z

@Garvit-32
The main reason to use Taco1 is that it shares a codebase with the vocoder (fatchord/WaveRNN). That commonality makes it much easier to write the training script and integrate it with the rest of the repo. Now that the supporting infrastructure is in place, the model can be swapped with relative ease.

I've already shared my thoughts on Taco1 vs. Taco2 in #472 (comment). They're close in performance. I prefer Taco2.

CorentinJ

Alright, I checked the code and played around with the toolbox. It all seems good.

Thank you again for your amazing work, feel free to merge whenever.

121898 · 2021-02-26T09:33:23Z

> Development of the pytorch synthesizer is complete. Please review the changes.
>
> The pretrained model release consists of the pretrained encoder, along with synthesizer and vocoder models I have developed. Audio samples and model details: https://blue-fish.github.io/experiments/RTVC-7.html

Hi, thanks for the amazing work. May I ask what the "Google" rows on your demo page mean? Thanks.

ghost · 2021-02-26T16:42:56Z

@121898 The "Google" rows are the audio samples from arXiv:1806.04558: https://google.github.io/tacotron/publications/speaker_adaptation/index.html

blue-fish added 30 commits July 30, 2020 09:39

Fork demo_cli.py to torch_cli.py

851616a

Minor changes to torch_cli.py for synthesizer_pt

be008ce

Update import locations for synthesizer_pt in inference

5e6b80a

Fix more import paths

b3e5fff

More fixes to get through a few more lines of torch_cli.py

8bf2977

Restore synthesizer_pt/hparams.py from WaveRNN repo

02cb0e7

Update hparams to add speaker embedding size as parameter

e609569

Temporarily set the speaker embedding size to zero for testing purposes with WaveRNN synthesizer pretrained model

ff8b206

Finally loaded WaveRNN tacotron model

9eefbbd

More changes to support inference with the WaveRNN pretrained tacotron model

f39d265

Minor fix to get torch_cli.py to run to completion

ead4b96

Fix import paths in synthesizer train script

f9aba0e

Fixed most of the obvious errors with synthesizer_train_pt.py

6cd1a15

Finally reaching some error messages in synthesizer_pt/train.py

d74df20

Merge fatchord train_tacotron.py with synthesizer/train.py

0b50677

Move tensorflow-based synthesizer_train.py to avoid confusion

7ab62f0

Fixed filename

bb25f97

synthesizer_train.py update to be consistent with vocoder_train.py

f124f6b

More synthesizer train updates

deadd3c

More synthesizer train debugging

eecaba5

More synthesizer_pt/train.py cleanup

68fbf13

More synthesizer_pt/train.py fixes

9fe18ea

Add optimizer back into the saved checkpoints

349963a

Fix indents

2f162da

synthesizer_pt/train.py almost does something now

7ed2a56

Getting closer to torch-based training for synthesizer

d9ec428

Fix the collate_synthesizer function

bf02834

Fix collate_synthesizer again

b883b6e

Fix synthesizer_dataset.py for training

9e771c5

Workaround for synthezier model load, so torch_cli.py runs to completion using WaveRNN pretrained model

fcd2938

blue-fish added 6 commits February 5, 2021 23:49

Mask attention

64beaae

Update dimensions for new pretrained model

97c6dea

Fixes for training bugs

1a6622b

Add a trim_silence feature for synthesizer preprocess

3dc08ac

Pretrained model responds to punctuation

8150481

Move webrtcvad to requirements.txt

8f73317

ghost requested a review from CorentinJ February 7, 2021 17:41

CorentinJ approved these changes Feb 14, 2021

ghost merged commit b5ba6d0 into CorentinJ:master Feb 14, 2021

neng668 added a commit to neng668/Real-Time-Voice-Cloning that referenced this pull request Apr 12, 2021

Revert "Pytorch synthesizer (CorentinJ#472)"

6cf165d

This reverts commit b5ba6d0.

ghost mentioned this pull request Aug 25, 2021

Difference about speaker_embedding part in Synthesizer? #790

Closed

ghost mentioned this pull request Oct 7, 2021

I used to have better quality outputs #867

Closed

This was referenced Oct 31, 2021

Followed your webpage instructions and receive ERROR after ERROR trying to install - final is 'sounddevice' #882

Closed

tacotron2 and waveglow #878

Closed

TTS outputing different words than the ones typed in #883

Closed

This was referenced Nov 15, 2021

GUIDE: Setting it up on Linux #615

Closed

Should I correct the warning "creating a tensor from a list of numpy.ndarray is extremely slow" ? #893

Closed

This pull request was closed.
