
Running yaapt on-the-fly extremely slows the training #4

Closed

seungwonpark opened this issue Nov 18, 2021 · 4 comments

Comments

@seungwonpark

Hi, thanks for kindly releasing the code for the paper. (Also congratulations on the acceptance in INTERSPEECH!)

While I was running the code, I encountered a significant issue: pYAAPT.yaapt extremely slows the training.
Here's how I found this speed bottleneck:

  • I tried to run train_f0_vq.py as specified in the README.
  • However, training was far too slow: it looks like we need to train the f0 VQ model for 400,000 steps, but a single epoch (about 700 steps) took 2,657 seconds. GPU utilization was really low while the CPUs were running like crazy. (My server has a 3080 Ti with 64 CPU cores.)
  • I suspected pYAAPT.yaapt to be the cause. To test that, I forked the repository and added caching functionality: https://github.com/seungwonpark/speech-resynthesis
  • With caching, every epoch after the first (which populates the cache) took only 36 seconds.

So my question is: how did you manage to run yaapt on-the-fly without caching? Though I succeeded in training the model fast enough, I will need to disable caching again, since caching requires the _sample_interval method to sample the same interval for each audio file (i.e., it disables the data augmentation of randomly choosing the interval).
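For concreteness, the caching idea is roughly the following (a simplified sketch with hypothetical helper names, not the exact code in my fork). It caches the YAAPT pitch track of the full utterance and then slices a random interval out of the cached track, which should also let the random-interval augmentation stay enabled, assuming the F0 frames are spaced at a fixed hop:

```python
# Simplified sketch (hypothetical helpers, not the fork's actual code):
# cache the full-utterance YAAPT pitch once, then slice random intervals
# from the cached track so _sample_interval-style augmentation still works.
import os
import numpy as np
import amfm_decompy.basic_tools as basic
import amfm_decompy.pYAAPT as pYAAPT


def cached_full_f0(wav, sr, cache_path, frame_space_ms=5.0):
    """Return the per-frame F0 of the whole utterance, computing it only once."""
    if os.path.exists(cache_path):
        return np.load(cache_path)
    signal = basic.SignalObj(wav.astype(np.float64), sr)
    pitch = pYAAPT.yaapt(signal, frame_length=20.0, frame_space=frame_space_ms)
    f0 = pitch.samp_values.astype(np.float32)
    np.save(cache_path, f0)
    return f0


def sample_interval_with_f0(wav, f0, sr, seg_len, frame_space_ms=5.0):
    """Pick a random audio interval and slice the matching frames from the cached F0."""
    hop = int(sr * frame_space_ms / 1000)   # audio samples per F0 frame
    start = np.random.randint(0, max(1, len(wav) - seg_len))
    start -= start % hop                    # align the interval to an F0 frame boundary
    f0_start, f0_len = start // hop, seg_len // hop
    return wav[start:start + seg_len], f0[f0_start:f0_start + f0_len]
```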

@seungwonpark
Author

ping @adampolyak

@adampolyak
Contributor

Hi,

In our experiments, we were able to finish one epoch in ~760 seconds on the VCTK dataset. It is possible that our naive implementation simply ran faster on our hardware.

Going forward, it seems that adding caching speeds up training! Another option is to add a preprocessing step that extracts pitch values from all wav samples, and to update the dataset to load the preprocessed values instead of calculating them on the fly.
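A rough sketch of that preprocessing step could look like the following (the paths, file layout, and YAAPT parameters here are only illustrative, not something that exists in the repo):

```python
# Illustrative preprocessing sketch: run YAAPT over every wav once and
# store the pitch tracks as .npy files for the dataset to load later.
from pathlib import Path
import numpy as np
import soundfile as sf
import amfm_decompy.basic_tools as basic
import amfm_decompy.pYAAPT as pYAAPT

WAV_DIR, F0_DIR = Path("wavs"), Path("f0")   # illustrative locations
F0_DIR.mkdir(parents=True, exist_ok=True)

for wav_path in sorted(WAV_DIR.rglob("*.wav")):
    audio, sr = sf.read(wav_path)
    signal = basic.SignalObj(audio.astype(np.float64), sr)
    pitch = pYAAPT.yaapt(signal, frame_length=20.0, frame_space=5.0)
    # Assumes unique file stems; mirror the directory structure instead if needed.
    np.save(F0_DIR / (wav_path.stem + ".npy"), pitch.samp_values.astype(np.float32))
```

The dataset's __getitem__ would then load the matching .npy file instead of calling pYAAPT.yaapt during training.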

Happy to review and add any pull requests!

@seungwonpark
Author

Got it. Thanks for your reply!

Closing this issue.

@aereobert

Hi, I think you might be training on a shared platform with a weak CPU, e.g. a Google Colab container. When training on Colab, I got the same epoch time as yours. However, when I trained on my 1080 Ti, I got the same training speed as the author.
