
Error in loss function #18

Closed · ymzhang1919 opened this issue Jan 11, 2017 · 3 comments

@ymzhang1919

The current loss function adds epsilon to the logits before the softmax:

    epsilon = tf.constant(value=1e-4)
    logits = logits + epsilon
    softmax = tf.nn.softmax(logits)

It should be:

    epsilon = tf.constant(value=1e-4)
    softmax = tf.nn.softmax(logits) + epsilon
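In context, the fix looks something like the sketch below. This is a minimal illustration, not the repo's exact code: the function name, the `labels` tensor, and the sum reduction are assumptions, and tf.math.log is the current spelling of what the 2017-era API called tf.log.

    import tensorflow as tf

    def cross_entropy_loss(logits, labels, epsilon=1e-4):
        # Nudge the softmax output away from exact zero so the log()
        # that follows can never see 0.0 and return -inf.
        softmax = tf.nn.softmax(logits) + epsilon
        return -tf.reduce_sum(labels * tf.math.log(softmax), axis=-1)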

@MarvinTeichmann (Owner)

Why? The purpose of the epsilon is to avoid numerical instability.

@ymzhang1919 (Author) commented Jan 12, 2017

I understand the purpose, but I don't understand how it works here. Logits can be large negative numbers, so how does adding a small positive constant to them improve the numerical stability of softmax()? Since the scalar epsilon is added to every logit equally, and softmax is invariant to a constant shift of its input, the output doesn't change at all.

On the other hand, adding a small positive number to the softmax output makes the log() that follows robust: it can no longer see an exact zero produced by float underflow.

If I am wrong, can you explain it in detail? Thanks.
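A quick numerical check (a minimal sketch; it assumes a current TensorFlow with eager execution, so print() shows the tensor values directly):

    import tensorflow as tf

    logits = tf.constant([[-1000.0, 0.0]])  # exp(-1000) underflows to 0 in float32
    epsilon = tf.constant(1e-4)

    # Epsilon on the logits shifts every entry equally; softmax cancels the shift,
    # the first probability is still exactly 0, and log(0) is -inf:
    print(tf.math.log(tf.nn.softmax(logits + epsilon)))   # [[-inf  0.]]

    # Epsilon on the softmax output keeps log() finite:
    print(tf.math.log(tf.nn.softmax(logits) + epsilon))   # [[-9.2103  0.0001]]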

@MarvinTeichmann (Owner)

You are right; I have fixed it.
