
Use tf.softmax_cross_entropy_with_logits to calculate loss #181

Merged 2 commits into tensorflow:master on Jun 23, 2016

Conversation

@elezar commented Jun 6, 2016

This PR closes #166

When the cross-entropy loss is implemented manually, the training loss becomes NaN after a number of iterations (usually around 90 epochs for the cluttered MNIST example). Switching to the built-in softmax_cross_entropy_with_logits allows the network to train stably.

@denny1108 could you also just confirm that this solves the problem?
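
For reference, here is a minimal sketch of the two formulations (written against the TF1-style API; variable names and shapes are illustrative, not taken from this PR's diff):

```python
import tensorflow as tf

# Illustrative placeholders; shapes assume 10-class one-hot labels.
logits = tf.placeholder(tf.float32, [None, 10])  # raw network outputs
y_true = tf.placeholder(tf.float32, [None, 10])  # one-hot labels

# Manual cross entropy (the pattern this PR replaces): tf.log(y_pred)
# yields -inf when a predicted probability underflows to exactly 0,
# and 0 * -inf is NaN, so the loss can blow up mid-training.
y_pred = tf.nn.softmax(logits)
manual_loss = -tf.reduce_mean(
    tf.reduce_sum(y_true * tf.log(y_pred), axis=1))

# Built-in fused op: computes the softmax and the cross entropy
# together, operating on the logits directly in a numerically
# stable way, so the loss stays finite.
stable_loss = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits(labels=y_true, logits=logits))
```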

@denny1108 commented Jun 6, 2016

@elezar Thanks for your contribution. I ran the cluttered-MNIST example again, and within 500 epochs I did not see the 'nan' loss. I suspect the previous problem was caused by the negative log likelihood in the cross entropy: in some cases, when the probability of the true category is close to 0, the numerical loss becomes 'nan'. That could be triggered by labeling errors or by model divergence. I guess the built-in loss op has some scheme to avoid this?

Anyway, I think the problem has been solved for this example.
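
To make the suspected failure mode concrete, here is a small NumPy sketch (the values are illustrative, not taken from the example) of how 0 * log(0) produces NaN, and why computing on the logits with log-sum-exp, as the fused op does, stays finite:

```python
import numpy as np

# Hypothetical logits where one class dominates so strongly that the
# other class's softmax probability underflows to exactly 0 in float32.
logits = np.array([200.0, 0.0], dtype=np.float32)
probs = np.exp(logits - logits.max()).astype(np.float32)
probs /= probs.sum()                       # -> [1.0, 0.0]

# Manual cross entropy with a one-hot label on the dominant class:
# the non-target term is 0 * log(0) = 0 * -inf = NaN, which then
# poisons the whole loss.
y = np.array([1.0, 0.0], dtype=np.float32)
manual = -(y * np.log(probs)).sum()        # nan

# Working on logits via log-sum-exp stays finite:
# -log p_target = logsumexp(logits) - logits[target]
m = logits.max()
stable = (m + np.log(np.exp(logits - m).sum())) - logits[0]
print(manual, stable)                      # nan 0.0
```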

@elezar (Author) commented Jun 14, 2016

@martinwicke could you have a look again?

(@denny1108 it would be great if you could also just confirm that things still look good from your side)

@martinwicke (Member) commented
Looks good, thanks!

@martinwicke merged commit d816971 into tensorflow:master on Jun 23, 2016
@elezar deleted the bugfix/nan_loss branch on Jun 23, 2016 07:15
Successfully merging this pull request may close these issues.

Problem with spatial transform network: got 'nan' loss in the mnist example