
Add Talromur2 recipe #4680

Merged — 43 commits merged into espnet:master on Nov 8, 2022

Conversation

@G-Thor (Contributor) commented Sep 30, 2022

This adds a recipe for the Talrómur 2 multi-speaker corpus. I've trained an x-vector conditioned Tacotron 2 using this recipe with decent results.

The commit history is a bit messy so feel free to squash if you decide to merge this PR.

I'm open to any and all comments on how to improve this recipe.
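For readers unfamiliar with espnet2 recipes, an invocation of this recipe might look like the sketch below. The recipe directory name and the flags other than the training config path (which is referenced later in this thread) are assumptions, not taken from this PR:

```sh
# Illustrative sketch only: the directory name and flags are assumptions,
# not confirmed by this PR.
cd egs2/talromur2/tts1
./run.sh \
    --ngpu 1 \
    --use_xvector true \
    --train_config conf/tuning/train_xvector_tacotron2.yaml
```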

Reduced batch size of fastspeech2 to facilitate 1-gpu training
Reduced VITS batch size to prevent OOM failures in 4-GPU setup
@sw005320 sw005320 added this to the v.202209 milestone Sep 30, 2022
@sw005320 sw005320 added Recipe TTS Text-to-speech labels Sep 30, 2022
@kan-bayashi (Member)

Could you merge the latest master to fix the CI?

@kan-bayashi kan-bayashi modified the milestones: v.202209, v.202211 Oct 4, 2022
@codecov — codecov bot commented Oct 4, 2022

Codecov Report

Merging #4680 (e52cfd8) into master (b221db0) will decrease coverage by 0.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #4680      +/-   ##
==========================================
- Coverage   80.32%   80.31%   -0.02%     
==========================================
  Files         527      527              
  Lines       46311    46311              
==========================================
- Hits        37200    37193       -7     
- Misses       9111     9118       +7     
Flag                     | Coverage Δ
test_integration_espnet1 | 66.23% <ø> (-0.14%) ⬇️
test_integration_espnet2 | 48.96% <ø> (ø)
test_python              | 68.56% <ø> (ø)
test_utils               | 23.30% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files                                        | Coverage Δ
...pnet/nets/pytorch_backend/transformer/optimizer.py | 86.11% <0.00%> (-2.78%) ⬇️
espnet/asr/asr_utils.py                               | 75.65% <0.00%> (-0.88%) ⬇️
...et/nets/pytorch_backend/e2e_asr_mix_transformer.py | 84.50% <0.00%> (-0.47%) ⬇️
espnet/tts/pytorch_backend/tts.py                     | 78.33% <0.00%> (-0.30%) ⬇️


@kan-bayashi (Member) left a comment

Sorry for the late review and thank you for adding a great recipe!
It looks almost perfect but I just left minor comments. Could you reflect them?

.gitignore Outdated
tools/anaconda
tools/ice-g2p*
tools/fairseq*
tools/featbin*

Please add a line break.


# TODO(G-Thor) add alignment download option

cd "${cwd}"

Please add a line break.

if [ ${stage} -le 1 ] && [ ${stop_stage} -ge 1 ]; then
log "stage 2: utils/subset_data_dir.sh"

./local/split_train_dev_test.py --data_dir "data/${full_set}" --train_dir "data/${train_set}" --dev_dir "data/${dev_set}" --test_dir "data/${eval_set}"

Suggested change
- ./local/split_train_dev_test.py --data_dir "data/${full_set}" --train_dir "data/${train_set}" --dev_dir "data/${dev_set}" --test_dir "data/${eval_set}"
+ ./local/split_train_dev_test.py \
+     --data_dir "data/${full_set}" \
+     --train_dir "data/${train_set}" \
+     --dev_dir "data/${dev_set}" \
+     --test_dir "data/${eval_set}"
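For intuition, here is a minimal sketch of what a split script like this might do internally. The function below is a hypothetical illustration, not the recipe's actual local/split_train_dev_test.py, and the split sizes are arbitrary:

```python
import random

def split_train_dev_test(utt_ids, dev_size, test_size, seed=0):
    """Reproducibly shuffle utterance IDs, then carve off dev/test subsets."""
    ids = sorted(utt_ids)              # deterministic starting order
    random.Random(seed).shuffle(ids)   # fixed seed -> same split every run
    test = ids[:test_size]
    dev = ids[test_size:test_size + dev_size]
    train = ids[test_size + dev_size:]
    return train, dev, test

train, dev, test = split_train_dev_test([f"utt{i:03d}" for i in range(100)], 10, 10)
print(len(train), len(dev), len(test))  # 80 10 10
```

A real split script would additionally filter the Kaldi data-dir files (text, wav.scp, utt2spk) down to each subset, e.g. via utils/subset_data_dir.sh.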


for dset in train dev eval1; do
utils/copy_data_dir.sh data/"${dset}"{,_phn};
${train_cmd} --gpu 1 --num-threads 1 data/"${dset}_phn/log/conversion.log" ./pyscripts/utils/convert_text_to_phn.py --nj 1 --g2p g2p_is data/"${dset}"{,_phn}/text;

I think --num-threads 1 is the default.

Suggested change
- ${train_cmd} --gpu 1 --num-threads 1 data/"${dset}_phn/log/conversion.log" ./pyscripts/utils/convert_text_to_phn.py --nj 1 --g2p g2p_is data/"${dset}"{,_phn}/text;
+ ${train_cmd} --gpu 1 data/"${dset}_phn/log/conversion.log" \
+     ./pyscripts/utils/convert_text_to_phn.py \
+     --nj 1 \
+     --g2p g2p_is \
+     data/"${dset}"{,_phn}/text
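Conceptually, this conversion step rewrites each line of a Kaldi-style text file ("utt_id transcript") with the transcript replaced by its phoneme sequence. A toy illustration follows; the stand-in g2p function is purely hypothetical (the recipe itself uses ice-g2p via --g2p g2p_is):

```python
def convert_text_to_phn(lines, g2p):
    """Rewrite 'utt_id transcript' lines as 'utt_id phoneme sequence'."""
    out = []
    for line in lines:
        utt_id, text = line.split(maxsplit=1)
        phones = g2p(text)                    # g2p returns a list of phoneme strings
        out.append(f"{utt_id} {' '.join(phones)}")
    return out

# Stand-in g2p for illustration only: one "phoneme" per character, spaces dropped.
toy_g2p = lambda s: [c for c in s.lower() if not c.isspace()]
result = convert_text_to_phn(["utt001 halló heimur"], toy_g2p)
print(result)  # ['utt001 h a l l ó h e i m u r']
```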

for dset in train dev eval1; do
utils/copy_data_dir.sh data/"${dset}"{,_phn};
${train_cmd} --gpu 1 --num-threads 1 data/"${dset}_phn/log/conversion.log" ./pyscripts/utils/convert_text_to_phn.py --nj 1 --g2p g2p_is data/"${dset}"{,_phn}/text;
# srun --gres=gpu:1 ./pyscripts/utils/convert_text_to_phn.py --nj 1 --g2p g2p_is --cleaner tacotron data/"${dset}"{,_phn}/text;

Suggested change
- # srun --gres=gpu:1 ./pyscripts/utils/convert_text_to_phn.py --nj 1 --g2p g2p_is --cleaner tacotron data/"${dset}"{,_phn}/text;
utils/copy_data_dir.sh data/"${dset}"{,_phn};
${train_cmd} --gpu 1 --num-threads 1 data/"${dset}_phn/log/conversion.log" ./pyscripts/utils/convert_text_to_phn.py --nj 1 --g2p g2p_is data/"${dset}"{,_phn}/text;
# srun --gres=gpu:1 ./pyscripts/utils/convert_text_to_phn.py --nj 1 --g2p g2p_is --cleaner tacotron data/"${dset}"{,_phn}/text;
done

Please add a line break.

# --valid_set "${valid_set}" \
# --test_sets "${test_sets}" \
# --expdir "${expdir}" \
# --srctexts "data/${train_set}/text" \

Please add a line break.

--ngpu 1 \
--expdir "$expdir" \
--train_config ./conf/tuning/train_xvector_tacotron2.yaml \
--inference_model valid.loss.ave_5best.pth

Please add a line break.

@@ -0,0 +1,45 @@
#!/bin/bash

This is just my preference, but could you use lower case? VITS -> vits

@mergify — mergify bot commented Oct 9, 2022

This pull request is now in conflict :(

@mergify mergify bot added the conflicts label Oct 9, 2022
@kan-bayashi (Member)

Hi @G-Thor, we want to merge your great recipe.
You may be busy with ICASSP, but could you address my review comments?

Also replaced data.sh with data_multi_speaker.sh.
- since that is used for this multi-speaker dataset
@mergify mergify bot added the Installation label Nov 7, 2022
@G-Thor (Contributor, Author) commented Nov 7, 2022

Hi @kan-bayashi, thanks for your review. I've been on leave and waiting for updates to the official corpus repo.
I've now applied your suggested changes as well as removing some unused code.

I also went ahead and changed the installation of ice-g2p to use the official PyPI version of that package rather than my personal fork, since my suggested changes to that project have been merged. I hope it is okay to apply this change here; if not, just let me know and I'll open a separate PR for it.

@mergify mergify bot removed the conflicts label Nov 7, 2022
@kan-bayashi (Member) left a comment

LGTM
Thank you for your great recipe :)

@kan-bayashi kan-bayashi merged commit 45ae496 into espnet:master Nov 8, 2022

4 participants