Add dynamic batch size for VALLE #19
Conversation
models/tts/valle/valle_dataset.py
Outdated
@@ -82,6 +93,10 @@ def __getitem__(self, index):
        return single_feature

    def get_num_frames(self, index):
        utt_info = self.metadata[index]
        return int(utt_info['Duration'] * 75)
What's the meaning of 75? Is it a fixed parameter or a variable parameter?
It is a fixed parameter: 75 means one second corresponds to 75 tokens for EnCodec.
It would be better to move this parameter to the config file because it relies on the codec.
done
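A minimal sketch of the agreed-upon change, with the codec token rate read from configuration instead of hard-coded. Class and field names here (`ValleDatasetSketch`, `codec_frame_rate`) are illustrative assumptions, not the actual Amphion code:

```python
class ValleDatasetSketch:
    """Illustrative stand-in for the VALL-E dataset (not the real class)."""

    def __init__(self, metadata, codec_frame_rate):
        # metadata: list of dicts with a 'Duration' field in seconds.
        self.metadata = metadata
        # Tokens per second produced by the codec; 75 for EnCodec at 24 kHz.
        # Kept configurable because the value depends on the codec used.
        self.codec_frame_rate = codec_frame_rate

    def get_num_frames(self, index):
        # Convert utterance duration (seconds) into a codec token count.
        utt_info = self.metadata[index]
        return int(utt_info["Duration"] * self.codec_frame_rate)
```

Swapping codecs then only requires changing the configured rate, not the dataset code.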
models/tts/valle/valle_trainer.py
Outdated
if not self.cfg.train.use_dynamic_batchsize:
    return super()._build_dataloader()
Dataset, Collator = self._build_dataset()
train_dataset = Dataset(self.cfg, self.cfg.dataset[0], is_valid=False)
Does it only work on dataset[0] instead of all elements of the dataset list?
This path is only taken for dynamic batch-size training of VALL-E; otherwise it falls back to super()._build_dataloader().
cfg.dataset specifies a list of datasets to be processed, but your code seems to only process the first one (self.cfg.dataset[0]). What about the other datasets?
done
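Independent of how the multi-dataset issue is resolved, the core of dynamic batching is grouping utterances so that each batch stays under a total-frame budget rather than a fixed utterance count. A hedged sketch of that idea (function name and budget parameter are assumptions, not the PR's actual implementation):

```python
def build_dynamic_batches(num_frames, max_frames_per_batch):
    """Group utterance indices so each batch stays under a frame budget.

    num_frames: per-utterance token counts (e.g. from get_num_frames).
    Sorting by length first keeps utterances of similar length together,
    which reduces padding waste within each batch.
    """
    order = sorted(range(len(num_frames)), key=lambda i: num_frames[i])
    batches, current, current_frames = [], [], 0
    for i in order:
        # Start a new batch once adding this utterance would exceed the budget.
        if current and current_frames + num_frames[i] > max_frames_per_batch:
            batches.append(current)
            current, current_frames = [], 0
        current.append(i)
        current_frames += num_frames[i]
    if current:
        batches.append(current)
    return batches
```

In practice such a function would back a custom PyTorch batch sampler, which is why the trainer skips the default dataloader construction when dynamic batching is enabled.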
models/tts/base/tts_trainer.py
Outdated
if (self.model_type == "VALLE") and (not self.cfg.train.use_dynamic_batchsize):
    (
        self.train_dataloader,
        self.valid_dataloader,
    ) = self.accelerator.prepare(
        self.train_dataloader,
        self.valid_dataloader,
    )
This condition means that if the model_type is VITS or FastSpeech2, the dataloader will never be prepared. That is wrong!
Rewrote the accelerator.prepare call in the VALL-E trainer instead.
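The described fix amounts to moving the model-specific guard out of the shared base trainer and into a VALL-E override, so other models keep the default preparation. A sketch of that inheritance pattern under assumed names (`prepare_dataloaders` stands in for the real accelerator.prepare call):

```python
class BaseTTSTrainerSketch:
    """Illustrative base trainer: always prepares its dataloaders."""

    def __init__(self):
        self.prepared = False

    def prepare_dataloaders(self):
        # In the real code this would be self.accelerator.prepare(...).
        self.prepared = True


class ValleTrainerSketch(BaseTTSTrainerSketch):
    """VALL-E override: skip accelerator preparation when dynamic
    batching supplies its own sampler-driven dataloaders."""

    def __init__(self, use_dynamic_batchsize):
        super().__init__()
        self.use_dynamic_batchsize = use_dynamic_batchsize

    def prepare_dataloaders(self):
        if self.use_dynamic_batchsize:
            return  # the dynamic batch sampler handles batching itself
        super().prepare_dataloaders()
```

This keeps the base class model-agnostic: VITS or FastSpeech2 trainers inherit the default behavior unchanged.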