
Fixing issue "Training beyond specified 't_total' steps with schedule 'warmup_linear'" reported in #556 #604

Merged
3 commits merged on Jun 14, 2019

Conversation

samuelbroscheit
Contributor

Fixing the issues reported in #556

The reason for the issue was that num_optimization_steps was computed from the number of examples, which differs from the actual length of the dataloader when an example is chunked into multiple instances.

The solution in this pull request is to compute num_optimization_steps directly from len(data_loader).

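For anyone hitting the same warning, here is a minimal sketch of the idea behind the change (not the actual diff; the variable names and the dummy dataset below are assumptions for illustration only):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Hypothetical values; in run_classifier.py these come from the command-line
# arguments and the feature-conversion step.
gradient_accumulation_steps = 2
num_train_epochs = 3
train_batch_size = 32

# Dataset whose length reflects the instances actually fed to the model,
# i.e. after long examples have been chunked into multiple instances.
train_data = TensorDataset(torch.zeros(1000, 128, dtype=torch.long))
train_dataloader = DataLoader(train_data, batch_size=train_batch_size)

# Before: steps were estimated from the raw example count, which undercounts
# when examples are chunked, so training runs past the scheduler's t_total.
# num_optimization_steps = int(
#     len(train_examples) / train_batch_size / gradient_accumulation_steps
# ) * num_train_epochs

# After: derive the step count from the dataloader, whose length is the real
# number of batches per epoch.
num_optimization_steps = (
    len(train_dataloader) // gradient_accumulation_steps
) * num_train_epochs
```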
@samuelbroscheit changed the title from Fixing the issues reported in https://github.com/huggingface/pytorch-pretrained-BERT/issues/556 to Fixing issue "Training beyond specified 't_total' steps with schedule 'warmup_linear'" reported in #556 on May 11, 2019
Review comment on examples/run_classifier.py (outdated, resolved)
@lukovnikov
Contributor

Looks good.

@thomwolf
Member

Ok merging thanks, sorry for the delay!


4 participants