Hello,
Thank you for your contributions. I tried to run the UniTS_pretrain_x128.sh script, but after a while the outputs became NaN, and the loss value turned to NaN as well. However, when I reduce d_model to 64, the problem does not occur. What is the reason for this?
That sometimes happens because co-training on cross-domain datasets is not always stable; this occurs not only for time series but also for foundation models in other fields. We rerun the experiments when we encounter NaN. You can also lower the learning rate and use a smaller grad-clip value.
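The suggested mitigations (a smaller grad-clip value, plus skipping unstable steps) can be sketched in plain Python. This is an illustrative sketch, not code from the UniTS repository; the function names, threshold, and SGD update are assumptions for demonstration:

```python
import math

def clip_grad_norm(grads, max_norm):
    """Scale gradients so their global L2 norm is at most max_norm.

    Returns the (possibly rescaled) gradients and the original norm.
    """
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm > max_norm:
        scale = max_norm / total_norm
        grads = [g * scale for g in grads]
    return grads, total_norm

def safe_step(params, grads, lr=1e-4, max_norm=1.0):
    """Apply one SGD step, skipping the update if any gradient is NaN/inf."""
    if any(not math.isfinite(g) for g in grads):
        # Skip the unstable step instead of propagating NaN into the weights.
        return params, False
    grads, _ = clip_grad_norm(grads, max_norm)
    return [p - lr * g for p, g in zip(params, grads)], True
```

In PyTorch the same effect is usually achieved with `torch.nn.utils.clip_grad_norm_` before `optimizer.step()`; lowering `max_norm` corresponds to the "smaller grad-clip value" suggested above.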