Refactor prep_segments in SVS #5210
Conversation
Codecov Report
```diff
@@            Coverage Diff             @@
##           master    #5210      +/-   ##
==========================================
+ Coverage   74.54%   74.98%   +0.44%
==========================================
  Files         640      655      +15
  Lines       57267    58552    +1285
==========================================
+ Hits        42688    43908    +1220
- Misses      14579    14644      +65
```
... and 58 files with indirect coverage changes
Super, thanks! It would be great if we could create better documentation in egs/TEMPLATE/svs1 to instruct people how to use the tool.
```python
label_info, text_info = process_text_info(
    os.path.join(src_data, folder, "{}.lab".format(folder))
)
# text_scp.write("{} {}\n".format(utt_id, text_info))
```
Why do you remove the write of text_scp and midiscp?
> Why do you remove the write of text_scp and midiscp?
I think they are not used in this dataset. I found that ofuton also doesn't use text_scp and midiscp, so I removed them. Should I keep them?
If they are not used, let's remove them.
The modification functions in the new prep_segments look great. But the file would become too long if we put all datasets in it.
I see. I think it would be a good idea to generate a dict in check_align.py and then use that dict in prep_segments.py. In that case, all the errors would be auto-fixed, and we wouldn't have to put those dicts inside the code. But currently I don't know how to save and load (and also append a function to) this dict.
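One possible workaround (a hypothetical sketch, not part of this PR; the file name `corrections.json` and the record fields are made up): since functions themselves are awkward to serialize, check_align.py could emit each detected error as plain data, which prep_segments.py can then reload and extend.

```python
import json

def save_corrections(path, corrections):
    """Write the dataset-error corrections dict as plain JSON data."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(corrections, f, ensure_ascii=False, indent=2)

def load_corrections(path):
    """Reload the corrections dict for use in prep_segments.py."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)

# check_align.py side: record each error as data, not as a function object
corrections = {
    "natsume": [
        {"op": "replace_lyrics", "utt": "seg01", "old": "ha", "new": "wa"}
    ]
}
save_corrections("corrections.json", corrections)

# prep_segments.py side: reload, then append a new rule for another dataset
rules = load_corrections("corrections.json")
rules.setdefault("ofuton", []).append({"op": "add_pause", "utt": "seg02"})
```

Each `op` string would then be dispatched to the matching correction function inside prep_segments.py, so the fixes live outside the code.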
Looks good to me. Could you also add the related docs in template/svs1/readme? I'm still not entirely sure about the potential steps we need to take for these updates.
LGTM! @A-Quarter-Mile could you double check it?
I still suggest removing natsume from the new prep_segments and keeping the one in local/. There are some special designs for it (mid_reader and align_lyric_notes) which are not applicable to other datasets. Besides, the modifications for natsume in the new prep_segments are based on manual inspection, so we can remove them safely.
Co-authored-by: Jiatong <728307998@qq.com>
LGTM! @A-Quarter-Mile could you double check this update? After your confirmation, and once CI is cleared, I will merge the PR.
Many thanks! Code merged.
@ftshijt @A-Quarter-Mile
I plan to use only one file for preprocessing labels and XML files, instead of using `local/prep_segments.py` and `local/prep_segments_from_xml.py` in every dataset; it will be placed under `pyscripts/utils`.

To fix dataset errors, we can use several functions (`replace_lyrics`, `replace_labels`, `skip_labels`, `add_missing_phoneme`, `add_pause`) to handle specific errors, and a dictionary `error_correction` to store the error-handling functions for each dataset.

In order to use this new function, the previous scripts for datasets that use XML files should be changed: call `pyscripts/utils/prep_segments.py` with parameters such as `--dataset natsume` and `--input_type hts/xml` (please see the Natsume example), and remove `local/prep_segments.py`, `local/prep_segments_from_xml.py` and `pyscripts/utils/prep_segments_from_xml.py`.
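The `error_correction` dictionary described above could look roughly like this (a hedged sketch: only the function names come from this PR; the signatures, the segment structure, and `apply_corrections` are assumptions for illustration).

```python
# Hypothetical correction helpers; the real signatures in the PR may differ.
def replace_lyrics(seg, index, new_lyric):
    """Replace one wrong lyric syllable in a segment."""
    seg["lyrics"][index] = new_lyric
    return seg

def add_pause(seg, index, duration):
    """Insert a missing pause phoneme at the given position."""
    seg["phns"].insert(index, ("pau", duration))
    return seg

# Map each dataset name to the list of fixes its labels/XML need.
error_correction = {
    "natsume": [lambda seg: replace_lyrics(seg, 0, "wa")],
    "ofuton": [lambda seg: add_pause(seg, 1, 0.05)],
}

def apply_corrections(dataset, seg):
    """Run every registered fix for a dataset over one segment."""
    for fix in error_correction.get(dataset, []):
        seg = fix(seg)
    return seg

seg = {"lyrics": ["ha", "na"], "phns": [("h", 0.1), ("a", 0.2)]}
fixed = apply_corrections("natsume", seg)
```

Keeping the per-dataset fixes in one dictionary like this lets `prep_segments.py` stay dataset-agnostic: the `--dataset` flag just selects which list of corrections to apply.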