
NAN loss after one epoch #20

Open
sjbling opened this issue Dec 14, 2021 · 4 comments

Comments

@sjbling

sjbling commented Dec 14, 2021

Hi,
I trained the model on nuScenes with baseline.yml, but the loss becomes NaN after one epoch. How should I train the model from scratch? Looking forward to your reply!

@markus93

Hey,

Thanks for sharing the Fiery model and article publicly, it was a great read.

I also tried the same setup and have the same issue. I dug a little deeper and found that the running means and running variances of the batch normalization layers first go to -infinity and then to NaN, which in turn drives the loss to NaN. The loss seems to recover from the NaN values, but the model still outputs -infinities for the segmentation.
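For anyone debugging this, here is a minimal sketch of how one could flag diverged running statistics. It is not code from the repo: it assumes the state dict has already been flattened into plain float lists (in PyTorch you would iterate over `model.state_dict()` tensors instead).

```python
import math

def find_bad_running_stats(named_stats):
    """Return the names of BatchNorm running means/variances that contain
    non-finite values (inf or NaN).

    named_stats maps a parameter name to a flat list of floats, e.g.
    extracted from model.state_dict() in PyTorch.
    """
    bad = []
    for name, values in named_stats.items():
        is_running_stat = "running_mean" in name or "running_var" in name
        if is_running_stat and any(not math.isfinite(v) for v in values):
            bad.append(name)
    return bad
```

Calling this after every epoch (or every N steps) pinpoints which layer diverges first, which is usually more informative than only seeing the final NaN loss.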

Best regards,
Markus

@anthonyhu
Collaborator

Hey both of you, and sorry for the late answer. It is a known issue: training the whole network from scratch leads to instability. The fix is to first load pre-trained weights from a 1-timestep model (FIERY Static) before training the whole future prediction model, as discussed here: #8
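The two-stage recipe above amounts to a partial weight transfer: copy every parameter that exists in both the static and the full temporal model, and keep random initialization for the layers that only exist in the full model. A sketch of the idea using plain dicts in place of tensors (in PyTorch this would typically be `model.load_state_dict(merged, strict=False)` with shape checks on the tensors):

```python
def load_static_weights(model_state, static_state):
    """Merge weights from a pretrained 1-timestep checkpoint into the full
    model's state, copying only entries whose name and size match.

    Returns the merged state and the list of transferred parameter names.
    """
    merged = dict(model_state)
    loaded = []
    for name, weight in static_state.items():
        if name in merged and len(merged[name]) == len(weight):
            merged[name] = weight
            loaded.append(name)
    return merged, loaded
```

Layers unique to the temporal model (e.g. the future-prediction head) are left at their fresh initialization, while the shared encoder/decoder starts from the stable FIERY Static weights.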

@cs-hue

cs-hue commented Mar 20, 2023

Hey,

Thanks for sharing the Fiery model and article publicly, it was a great read.

After training the model for an epoch, I get a negative loss value. Do you know the reason?

Best regards,
Cara

@anthonyhu
Collaborator

Hey Cara, the negative loss value is due to the adaptive weighting of the losses using uncertainty. There is an additional loss term that prevents the weights from growing too large (see page 5 of https://arxiv.org/pdf/1705.07115.pdf).
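Concretely, the uncertainty weighting from that paper combines the per-task losses as `total = sum_i exp(-s_i) * L_i + s_i`, where `s_i = log(sigma_i^2)` is a learned log-variance and the `+ s_i` term is the regularizer mentioned above. Because `s_i` can be negative, the total can legitimately go below zero. A small self-contained illustration (not the repo's actual loss code):

```python
import math

def uncertainty_weighted_loss(losses, log_vars):
    """Adaptive multi-task weighting (Kendall et al., 2018):
    total = sum_i exp(-s_i) * L_i + s_i, with s_i = log(sigma_i^2).

    exp(-s_i) is the learned task weight; the + s_i term penalizes
    shrinking every task loss to zero, and it can make the total negative.
    """
    return sum(math.exp(-s) * l + s for l, s in zip(losses, log_vars))

# Small task losses with negative learned log-variances give a negative total:
print(uncertainty_weighted_loss([0.01, 0.02], [-2.0, -1.5]))
```

So a negative total loss is not a bug in itself; what matters is that the individual task losses keep decreasing.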
