Show averages in batch loss/accuracy? #66
Comments
I think it's very informative to see single-batch losses. They can be helpful for tuning parameters, but also for finding problematic training examples, as those tend to produce outlier losses.
It could be a command-line option. Overall averages are not really useful for pretraining (who cares what performance you had at batch 100 when you are at batch 10000). On the other hand, I see quite a lot of variability between batches, which is also annoying. Maybe the proper solution would be to support moving averages: they are informative about how the model performs at that point, and also easier to read than the jumpy raw loss/accuracy.
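A moving average along these lines could be as simple as an exponential moving average over the per-batch loss. A minimal sketch (the function name and `alpha` default are illustrative, not from this project):

```python
def ema_update(avg, value, alpha=0.1):
    """Return the updated exponential moving average.

    alpha: weight of the newest value; smaller alpha = smoother curve.
    """
    if avg is None:  # first batch: initialize with the raw value
        return value
    return alpha * value + (1 - alpha) * avg

# Example: smooth a jumpy sequence of batch losses.
avg = None
for loss in [2.0, 1.0, 3.0, 0.5]:
    avg = ema_update(avg, loss)
```

Unlike an epoch-wide average, this forgets old batches at a rate set by `alpha`, so it still tracks the current state of the model during long pretraining runs.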
If it's just an option, no hard feelings. Something related I implemented: I have a branch with Tensorboard support.
Tensorboard support would be really nice!
Then I'll do the PRs. |
While training, the batch loss/accuracy of the last trained batch is shown. However, these numbers tend to jump around quite a bit. Maybe we should show the average so far in the current epoch? This would help observing the trends a bit better.
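The "average so far in the current epoch" suggested above is just a cumulative running mean, reset at each epoch boundary. A hypothetical sketch (class and attribute names are mine, not from the codebase):

```python
class RunningMean:
    """Cumulative average of per-batch values within one epoch."""

    def __init__(self):
        self.total = 0.0
        self.count = 0

    def update(self, value):
        """Accumulate one batch's loss or accuracy."""
        self.total += value
        self.count += 1

    @property
    def mean(self):
        # 0.0 before any batch has been seen
        return self.total / self.count if self.count else 0.0
```

At the start of each epoch a fresh `RunningMean` would be created, and its `mean` reported alongside (or instead of) the raw last-batch number.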