Reduce validation loss in training #30
Comments
First try: dropout = 0.1. Run: lunar-serenity-26.
Promising: clearly better results than with 0 dropout. Now trying …
Also tried 0.5 (worthy-sunset-28). In both cases the gap between validation loss and step loss stays slightly smaller initially than in the lower-dropout runs. However, in logical-butterfly-29 we see that after more training the validation loss still starts climbing again, while the training loss more or less vanishes, so the model again overfits the training set at some point. Performance is better, though, and still improving at this moment (epoch 130 of 300), so let's see where it goes.
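For reference, a minimal PyTorch sketch of where a dropout rate like the ones compared above plugs into a transformer encoder. The layer sizes and the helper name are placeholders, not the actual platalea_transformer configuration.

```python
import torch.nn as nn

# Minimal sketch (not the actual platalea configuration): the dropout
# probability compared in these runs is the `dropout` argument of the
# encoder layers; d_model/nhead/num_layers are placeholder values.
def build_encoder(dropout: float = 0.3) -> nn.TransformerEncoder:
    layer = nn.TransformerEncoderLayer(
        d_model=512,           # placeholder embedding size
        nhead=8,               # placeholder number of attention heads
        dim_feedforward=2048,  # placeholder feed-forward size
        dropout=dropout,       # the regularization knob varied across these runs
        batch_first=True,
    )
    return nn.TransformerEncoder(layer, num_layers=6)
```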
We ran another sweep over dropout values, and again values between 0.2 and 0.5 seem to perform best. However, validation loss still starts increasing in all runs after some time. As already mentioned in this report, we should probably look into additional regularization options to correct this overfitting on the training data. See #61.
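For context, a dropout sweep like this can be expressed as a W&B sweep definition. The grid of values and the metric name below are illustrative, not the exact sweep used; only the entity and project names come from the report URL in this thread.

```python
import wandb

# Illustrative sweep definition (values and metric name are placeholders,
# not the exact sweep referenced above): grid-search the dropout rate and
# optimize for validation loss.
sweep_config = {
    "method": "grid",
    "metric": {"name": "val_loss", "goal": "minimize"},
    "parameters": {
        "dropout": {"values": [0.0, 0.1, 0.2, 0.3, 0.4, 0.5]},
    },
}

sweep_id = wandb.sweep(sweep_config, entity="spokenlanguage", project="platalea_transformer")
# wandb.agent(sweep_id, function=train)  # `train` would read wandb.config.dropout
```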
I think we solved the validation loss (overfitting) issue during our regularization sweeps (https://wandb.ai/spokenlanguage/platalea_transformer/reports/Jan-29-Project-Update-Regularization-rates-conclusion--Vmlldzo0MzY3MDg). |
In run dainty-dawn-20 we saw the validation loss increasing again, starting from approximately epoch 20. We should find a way to train that reduces validation loss together with training loss.
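One generic way to guard against the pattern described in this thread (training loss shrinking while validation loss climbs) is to monitor both per epoch, keep the checkpoint with the lowest validation loss, and optionally stop early. This is a generic sketch, not the project's actual training loop; `train_one_epoch` and `evaluate` are assumed helpers.

```python
import copy

# Generic sketch, not the project's actual training loop: keep the weights
# from the epoch with the lowest validation loss and stop once validation
# loss has not improved for `patience` epochs, so the run does not drift
# into the regime where training loss vanishes but validation loss climbs.
def fit(model, train_one_epoch, evaluate, epochs=300, patience=20):
    best_val, best_state, stale = float("inf"), None, 0
    for epoch in range(epochs):
        train_loss = train_one_epoch(model)  # assumed helper: one pass over the training set
        val_loss = evaluate(model)           # assumed helper: loss on the validation set
        print(f"epoch {epoch}: train={train_loss:.4f} val={val_loss:.4f}")
        if val_loss < best_val:
            best_val, stale = val_loss, 0
            best_state = copy.deepcopy(model.state_dict())
        else:
            stale += 1
            if stale >= patience:
                break
    if best_state is not None:
        model.load_state_dict(best_state)    # restore the best-validation checkpoint
    return model
```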