You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Traceback (most recent call last):
File "train.py", line 182, in <module>
generation_save_path=args.generation_save_path)
File "/disk/nfs/ostrom/s1717552/btmpg/utils/run.py", line 133, in __call__
self.run()
File "/disk/nfs/ostrom/s1717552/btmpg/utils/run.py", line 100, in run
max_length=self.max_length)
File "/disk/nfs/ostrom/s1717552/btmpg/model/VAE.py", line 206, in round
out_embed = self.embed(self.GS(sentence[:, -1:, :]))
File "/disk/nfs/ostrom/s1717552/btmpg/btmpgenv/lib64/python3.6/site-packages/torch/nn/modules/module.py", line 532, in __call__
result = self.forward(*input, **kwargs)
File "/disk/nfs/ostrom/s1717552/btmpg/model/gumbleSoftmax.py", line 17, in forward
sigma = min(self.tau_max, (self.tau_max ** (self.n / self.N)))
OverflowError: (34, 'Numerical result out of range')
This happens after a few days of training, around epoch 39 for MSCOCO and epoch 77 for Quora.
Hi, this overflow is because gumble_softmax did not set as mentioned in the paper. In the file run.py, ‘self.GS = gumble_softmax(3500, 100)‘, that is, n = 3500, tau_max = 100, check the code carefully, you will find that every step, n+= 1, with the number of training steps increase, n is getting bigger and bigger, self.tau_max ** (self.n / seld.n) will have an overflow error.
Hi,
During training, I get the following error:
This happens after a few days of training, around epoch 39 for MSCOCO and epoch 77 for Quora.
The command used was:
The text was updated successfully, but these errors were encountered: