This repository has been archived by the owner on Mar 19, 2021. It is now read-only.

Show averages in batch loss/accuracy? #66

Closed
danieldk opened this issue May 24, 2019 · 5 comments

Comments

@danieldk
Member

During training, only the loss/accuracy of the most recently trained batch is shown. However, these numbers tend to jump around quite a bit. Maybe we should show the average so far in the current epoch? That would make the trends easier to observe.

@twuebi
Collaborator

twuebi commented May 24, 2019

I think it's very informative to see single-batch losses. They can be helpful for tuning parameters, and also for finding bad training examples, since those tend to produce outlier losses.
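To illustrate the point about outlier losses, here is a minimal Python sketch that flags batches whose loss is far above the recent average. The function name `find_outlier_batches` and the `window` and `threshold` parameters are hypothetical and not part of this project; the rule (more than `threshold` standard deviations above the mean of the preceding losses) is just one simple heuristic.

```python
import statistics
from collections import deque


def find_outlier_batches(losses, window=50, threshold=3.0):
    """Return indices of batches with an unusually high loss.

    A batch is flagged when its loss exceeds the mean of the preceding
    `window` losses by more than `threshold` standard deviations.
    (Hypothetical helper; names and defaults are assumptions.)
    """
    recent = deque(maxlen=window)  # losses of the preceding batches
    outliers = []
    for index, loss in enumerate(losses):
        # At least two earlier values are needed for a standard deviation.
        if len(recent) >= 2:
            mean = statistics.mean(recent)
            stdev = statistics.stdev(recent)
            if stdev > 0 and loss > mean + threshold * stdev:
                outliers.append(index)
        recent.append(loss)
    return outliers


if __name__ == "__main__":
    batch_losses = [1.0, 1.1, 0.9, 1.0, 5.0, 1.0]
    print(find_outlier_batches(batch_losses, window=4))
```

Note that a flagged loss still enters the window, so it inflates the standard deviation for the batches that follow. This only works if the per-batch losses are available, which is the argument for keeping them visible.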

@danieldk
Member Author

It could be a command-line option. Overall averages are not really useful for pretraining (who cares what performance you had at batch 100 when you are at batch 10000?). On the other hand, I see quite a lot of variability between batches, which is also annoying. Maybe the proper solution would be to support moving averages: they are informative about how the model performs at that point, but also easier to read than the jumpy raw loss/accuracy.
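As a rough illustration of the difference, the following Python sketch compares an overall average with two kinds of moving average. The class names and the `window` and `alpha` parameters are hypothetical and not taken from this project.

```python
from collections import deque


class RunningMean:
    """Overall average of every value seen so far (e.g. within an epoch)."""

    def __init__(self):
        self.total = 0.0
        self.count = 0

    def update(self, value):
        self.total += value
        self.count += 1
        return self.total / self.count


class WindowedMean:
    """Moving average over the most recent `window` values only."""

    def __init__(self, window):
        # A bounded deque drops the oldest value automatically.
        self.values = deque(maxlen=window)

    def update(self, value):
        self.values.append(value)
        return sum(self.values) / len(self.values)


class ExponentialMean:
    """Exponential moving average; `alpha` is the weight of the newest value."""

    def __init__(self, alpha):
        self.alpha = alpha
        self.value = None

    def update(self, value):
        if self.value is None:
            self.value = value  # the first value initializes the average
        else:
            self.value = self.alpha * value + (1 - self.alpha) * self.value
        return self.value


if __name__ == "__main__":
    running, windowed, ema = RunningMean(), WindowedMean(2), ExponentialMean(0.5)
    for loss in [4.0, 2.0, 3.0, 1.0]:
        print(loss, running.update(loss), windowed.update(loss), ema.update(loss))
```

The overall average keeps weighting early batches equally, whereas the windowed and exponential variants forget them, which is what makes them more useful for long pretraining runs.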

@twuebi
Collaborator

twuebi commented May 24, 2019

> It could be a command-line option. Overall averages are not really useful for pretraining (who cares what performance you had at batch 100 when you are at batch 10000?). On the other hand, I see quite a lot of variability between batches, which is also annoying. Maybe the proper solution would be to support moving averages: they are informative about how the model performs at that point, but also easier to read than the jumpy raw loss/accuracy.

If it's just an option, no hard feelings.

Something related that I implemented: I have a branch with TensorBoard support (through tf.contrib.summary), which serves plots of accuracy, train loss, and gradient norms, and also visualizes the graph. It brings in the overhead of a browser, but I like the visualization it offers.

@danieldk
Member Author

TensorBoard support would be really nice!

@twuebi
Collaborator

twuebi commented May 24, 2019

Then I'll do the PRs.
