
Where to download the pretrained model? #31

Closed
OptimusPrimeCao opened this issue Jun 8, 2018 · 13 comments

@OptimusPrimeCao

Is there a way to get checkpoint_15500 in inference file?

@rafaelvalle
Contributor

rafaelvalle commented Jun 8, 2018

Checkpoint files are saved to the output_directory specified when running train.py. The example command in our repo saves them to outdir.
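For example, with the invocation used elsewhere in this thread, checkpoints land in outdir as checkpoint_<iteration> files (the save interval depends on your hparams):

python train.py --output_directory=outdir --log_directory=logdir
ls outdir   # checkpoint_<iteration> files, e.g. checkpoint_15500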

@OptimusPrimeCao
Author

@rafaelvalle
When I run train.py with the default hparams on an 8GB 1080 GPU, I get the error below. I changed the batch size to 24, but the error still occurs:

src/tcmalloc.cc:278] Attempt to free invalid pointer 0x100000009
Traceback (most recent call last):
  File "train.py", line 291, in <module>
    args.warm_start, args.n_gpus, args.rank, args.group_name, hparams)
  File "train.py", line 209, in train
    for i, batch in enumerate(train_loader):
  File "/data00/home/caoyuetian/share/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 330, in __next__
    idx, batch = self._get_batch()
  File "/data00/home/caoyuetian/share/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 309, in _get_batch
    return self.data_queue.get()
  File "/data00/home/caoyuetian/share/anaconda3/lib/python3.6/multiprocessing/queues.py", line 335, in get
    res = self._reader.recv_bytes()
  File "/data00/home/caoyuetian/share/anaconda3/lib/python3.6/multiprocessing/connection.py", line 216, in recv_bytes
    buf = self._recv_bytes(maxlength)
  File "/data00/home/caoyuetian/share/anaconda3/lib/python3.6/multiprocessing/connection.py", line 407, in _recv_bytes
    buf = self._recv(4)
  File "/data00/home/caoyuetian/share/anaconda3/lib/python3.6/multiprocessing/connection.py", line 379, in _recv
    chunk = read(handle, remaining)
  File "/data00/home/caoyuetian/share/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 227, in handler
    _error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 125263) is killed by signal: Aborted.
Segmentation fault (core dumped)

@rafaelvalle
Contributor

This is probably related to your CPU not being able to keep up with data loading.
Try reducing the number of DataLoader workers or using a smaller batch size.
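If it helps, the two knobs involved are the DataLoader's num_workers and batch_size. A minimal, self-contained sketch of reducing both, using a dummy dataset purely for illustration in place of the repo's actual (text, mel) loader:

import torch
from torch.utils.data import DataLoader, Dataset

class DummyMelDataset(Dataset):
    # Stand-in dataset, only to illustrate the DataLoader settings.
    def __len__(self):
        return 64
    def __getitem__(self, idx):
        return torch.randn(80, 100)   # fake 80-band mel spectrogram

loader = DataLoader(
    DummyMelDataset(),
    num_workers=0,    # 0 = load in the main process, so no worker can be killed
    batch_size=16,    # smaller batches lower peak host and GPU memory
    drop_last=True,
)

for batch in loader:
    pass              # a training step would go here

In the actual repo the DataLoader is built inside train.py, so that is where num_workers would need to change; batch_size is presumably set through the hparams, as done elsewhere in this thread.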

@MXGray

MXGray commented Jun 26, 2018

@rafaelvalle
Can you share your pretrained Tacotron2 model and hparams that generated the sample audio?

@rafaelvalle
Contributor

@OptimusPrimeCao The model uses approximately 300MB per sample. Try reducing your batch size to 16.
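Rough arithmetic with that figure: 24 samples × ~300MB ≈ 7.2GB, which leaves almost no headroom on an 8GB 1080 once model weights and the CUDA context are counted, while 16 × ~300MB ≈ 4.8GB fits much more comfortably.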

@rafaelvalle
Contributor

@MXGray We can share the hparams.
Please do not post the same message on multiple issues. I deleted your message from the Audio Examples issue.

@gsoul

gsoul commented Jun 26, 2018

Please share the hparams.

@vijaysumaravi

Is there a way I can continue training my model from a particular point?

To be specific, my training crashed at checkpoint_32000 because of a memory issue, which I have since fixed. Can I resume training from that checkpoint, or do I have to start again from scratch? If resuming is possible, how do I do it?

I wasn't sure whether this warranted a new issue, hence posting my comment here.

Any help is appreciated. Thanks!

@vijaysumaravi

Never mind, figured it out.

python train.py --output_directory=outdir --log_directory=logdir --checkpoint_path='outdir/checkpoint_32500'
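For reference, --checkpoint_path resumes training by loading the saved state back into the model and optimizer. A generic PyTorch sketch of the idea; the key names below are assumptions about the checkpoint format, so check the repo's train.py for the actual loading code:

import torch

def resume_from_checkpoint(model, optimizer, checkpoint_path):
    # Load onto CPU first; the caller can move the model back to GPU afterwards.
    checkpoint = torch.load(checkpoint_path, map_location='cpu')
    model.load_state_dict(checkpoint['state_dict'])      # assumed key name
    optimizer.load_state_dict(checkpoint['optimizer'])   # assumed key name
    return checkpoint.get('iteration', 0)                # resume step, if stored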

@rafaelvalle
Contributor

The pre-trained model has been made available on our README page.

@beknazar

beknazar commented Mar 2, 2019

> Never mind, figured it out.
>
> python train.py --output_directory=outdir --log_directory=logdir --checkpoint_path='outdir/checkpoint_32500'

@vijaysumaravi Mine also stopped after 32.5 epochs; did you figure out the reason?

@vijaysumaravi

Reducing my batch size helped. I was training it on a single GPU.

@ErfolgreichCharismatisch

Tutorial: Training on GPU with Colab, Inference with CPU on Server here.
