Warmup schedulers in References #4411
Conversation
LGTM, thanks!
Just double-checking, did you run the new schedulers in a loop to compare before / after results?
```diff
     optimizer,
-    lambda x: (1 - x / (len(data_loader) * args.epochs)) ** 0.9)
+    lambda x: (1 - x / (iters_per_epoch * (args.epochs - args.lr_warmup_epochs))) ** 0.9)
```
Created an issue to track if we can now use a stock PyTorch scheduler for this #4438
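For context, the lambda in the diff above is a polynomial decay with power 0.9. A hedged sketch of the equivalence being tracked, assuming a PyTorch release that ships `PolynomialLR` (it was added after this PR, so availability depends on your version) and with placeholder values standing in for the `args.*` settings:

```python
import torch

model = torch.nn.Linear(2, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

# Placeholder values standing in for the reference script's args.* settings.
iters_per_epoch, epochs, lr_warmup_epochs = 100, 30, 1
main_iters = iters_per_epoch * (epochs - lr_warmup_epochs)

# The custom LambdaLR from the diff above: polynomial decay with power 0.9.
poly_lambda = torch.optim.lr_scheduler.LambdaLR(
    optimizer, lambda x: (1 - x / main_iters) ** 0.9)

# A stock alternative in newer PyTorch releases (assuming it is available):
# poly_stock = torch.optim.lr_scheduler.PolynomialLR(
#     optimizer, total_iters=main_iters, power=0.9)
```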
@fmassa Thanks! Yes I did. They match very closely (to the 5th decimal place) most of the time. For reference, here are the scripts used to test this:
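The scripts themselves are not included in this excerpt. As a hypothetical illustration of this kind of before/after check (not the author's actual scripts; names and values are made up), one could record the learning-rate sequence produced by a hand-rolled linear warmup, similar to what the detection reference previously used, and by the stock `LinearLR`, then compare them:

```python
import torch

def lr_sequence(make_scheduler, total_iters, lr=0.02):
    """Record the learning rate at every iteration for a given scheduler factory."""
    model = torch.nn.Linear(2, 2)
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)
    scheduler = make_scheduler(optimizer)
    lrs = []
    for _ in range(total_iters):
        lrs.append(optimizer.param_groups[0]["lr"])
        optimizer.step()
        scheduler.step()
    return lrs

warmup_iters, warmup_factor = 1000, 0.001

# "Before": a hand-rolled linear warmup expressed as a LambdaLR.
def old_warmup(optimizer):
    def f(x):
        if x >= warmup_iters:
            return 1.0
        alpha = x / warmup_iters
        return warmup_factor * (1 - alpha) + alpha
    return torch.optim.lr_scheduler.LambdaLR(optimizer, f)

# "After": the stock LinearLR scheduler.
def new_warmup(optimizer):
    return torch.optim.lr_scheduler.LinearLR(
        optimizer, start_factor=warmup_factor, total_iters=warmup_iters)

before = lr_sequence(old_warmup, warmup_iters + 10)
after = lr_sequence(new_warmup, warmup_iters + 10)
print(max(abs(a - b) for a, b in zip(before, after)))  # expect agreement to ~5 decimals
```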
Summary:

* Warmup on Classification references.
* Adjust epochs for cosine.
* Warmup on Segmentation references.
* Warmup on Video classification references.
* Adding support of both types of warmup in segmentation.
* Use LinearLR in detection.
* Fix deprecation warning.

Reviewed By: datumbox

Differential Revision: D31268039

fbshipit-source-id: d0fe7e334c01201c2413bac8b911d740b9a69bba
Resolves #4281
Adds warmup on the following recipes:

* Classification
* Segmentation
* Video classification
* Detection (now using LinearLR)
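As a rough sketch of the warmup pattern being added (not the exact reference code; names and values are illustrative), a linear warmup can be chained in front of the main schedule using the schedulers introduced in PyTorch 1.10:

```python
import torch
from torch.optim.lr_scheduler import LambdaLR, LinearLR, SequentialLR

model = torch.nn.Linear(2, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.02)

# Placeholder values standing in for the reference script's args.* settings.
iters_per_epoch, epochs, lr_warmup_epochs = 100, 30, 1
warmup_iters = iters_per_epoch * lr_warmup_epochs
main_iters = iters_per_epoch * (epochs - lr_warmup_epochs)

# Linear warmup for the first warmup_iters steps...
warmup = LinearLR(optimizer, start_factor=0.01, total_iters=warmup_iters)
# ...followed by the polynomial decay from the diff, defined over the remaining steps.
main = LambdaLR(optimizer, lambda x: (1 - x / main_iters) ** 0.9)

# Chain them so the warmup runs first and then hands over to the main schedule.
scheduler = SequentialLR(
    optimizer, schedulers=[warmup, main], milestones=[warmup_iters])
```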
This PR maintains the location where we call `scheduler.step()` in each recipe: Segmentation and Video classification step it on the iteration level, Classification steps it on the epoch level, and Detection uses a hybrid of the two. Though stepping on the iteration level provides more flexibility, making that switch would have slight effects on the reproducibility of existing models. These effects should be minor and largely overshadowed by other differences across runs (such as the randomness of the initialization scheme). The only reason I'm not making the switch here is that it requires extra work, which I'm deferring until we start retraining the models using the new utils of Batteries Included.
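A minimal sketch of the two stepping conventions described above, with a dummy model and `StepLR` standing in for a real training loop and scheduler:

```python
import torch

def make_training_state(lr=0.01):
    """Build a toy model, optimizer, and scheduler for illustration."""
    model = torch.nn.Linear(2, 2)
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10)
    return model, optimizer, scheduler

epochs, iters_per_epoch = 3, 5

# Epoch-level stepping (Classification recipe): the scheduler advances once per epoch.
model, optimizer, scheduler = make_training_state()
for epoch in range(epochs):
    for _ in range(iters_per_epoch):  # stands in for iterating over the data loader
        optimizer.step()              # stands in for a full forward/backward/update
    scheduler.step()

# Iteration-level stepping (Segmentation and Video recipes): the scheduler advances
# after every optimizer update, so schedule lengths must be counted in iterations.
model, optimizer, scheduler = make_training_state()
for epoch in range(epochs):
    for _ in range(iters_per_epoch):
        optimizer.step()
        scheduler.step()
```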