
[feat] Add flexible step-wise LR scheduler with minimum changes #256

Merged

Conversation

nabenabe0928
Contributor

Since this PR is now urgent, I made another PR so that the feature can be merged quickly.
The original PR is available here.

@codecov

codecov bot commented Jun 16, 2021

Codecov Report

Merging #256 (551666e) into development (999f3c3) will decrease coverage by 0.00%.
The diff coverage is 100.00%.


@@               Coverage Diff               @@
##           development     #256      +/-   ##
===============================================
- Coverage        81.62%   81.62%   -0.01%     
===============================================
  Files              150      151       +1     
  Lines             8625     8646      +21     
  Branches          1325     1328       +3     
===============================================
+ Hits              7040     7057      +17     
- Misses            1108     1111       +3     
- Partials           477      478       +1     
Impacted Files Coverage Δ
...pipeline/components/setup/lr_scheduler/__init__.py 74.32% <ø> (ø)
...h/pipeline/components/training/trainer/__init__.py 70.81% <ø> (ø)
...components/setup/lr_scheduler/CosineAnnealingLR.py 96.15% <100.00%> (-0.15%) ⬇️
.../setup/lr_scheduler/CosineAnnealingWarmRestarts.py 96.42% <100.00%> (-0.13%) ⬇️
...pipeline/components/setup/lr_scheduler/CyclicLR.py 91.42% <100.00%> (-5.80%) ⬇️
...ine/components/setup/lr_scheduler/ExponentialLR.py 96.15% <100.00%> (-0.15%) ⬇️
...eline/components/setup/lr_scheduler/NoScheduler.py 95.00% <100.00%> (-0.24%) ⬇️
...components/setup/lr_scheduler/ReduceLROnPlateau.py 96.66% <100.00%> (-0.11%) ⬇️
...h/pipeline/components/setup/lr_scheduler/StepLR.py 96.42% <100.00%> (-0.13%) ⬇️
...ne/components/setup/lr_scheduler/base_scheduler.py 86.20% <100.00%> (+7.25%) ⬆️
... and 10 more

Continue to review full report at Codecov.

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 999f3c3...551666e.

Contributor

@ravinkohli ravinkohli left a comment


Hey, thanks for this PR. The changes look good. However, I still don't see where epoch or batch is passed to an LR scheduler. So basically, how does the fit dictionary 'X' get updated to contain the step unit? To that end, could we add a test where we run a few schedulers in training and verify that the step was made at the appropriate time? This can be achieved by using the internal parameters of the torch scheduler; for example, CosineAnnealingWarmRestarts has T_0 and T_cur, the latter tracking the number of epochs since the last restart, and we can verify there whether the step is taken epoch-wise or batch-wise.
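
For concreteness, a minimal sketch of the kind of check being requested, using plain PyTorch; the setup below is illustrative rather than the project's actual test code:

```python
import torch
from torch.optim.lr_scheduler import CosineAnnealingWarmRestarts

model = torch.nn.Linear(2, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
# T_0 chosen large enough that no warm restart occurs during this check.
scheduler = CosineAnnealingWarmRestarts(optimizer, T_0=100)

n_epochs, n_batches = 2, 5
start = scheduler.T_cur  # epochs counted since the last restart

for epoch in range(n_epochs):
    for batch in range(n_batches):
        optimizer.step()
        scheduler.step()  # batch-wise stepping: T_cur advances every batch
    # scheduler.step()    # epoch-wise stepping would advance it here instead

# With batch-wise stepping, T_cur advanced once per batch.
assert scheduler.T_cur == start + n_epochs * n_batches
```

With epoch-wise stepping, the same assertion would instead expect `start + n_epochs`, which is what makes T_cur a convenient probe for the step unit.
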

@nabenabe0928
Contributor Author

However, I still don't see where epoch or batch is passed to an LR scheduler. So basically, how does the fit dictionary 'X' get updated to contain the step unit?

I will check promptly and push another commit soon.

To that end, could we add a test where we run a few schedulers in training and verify that the step was made at the appropriate time?

Should I do it separately, as in my latest commit of the test code, or should I really run it from the API?
I think the former is sufficient and faster, although if something goes wrong in the system we might fail to detect the errors.
Still, in my opinion tests should exercise specific parts, so the former is enough.
What do you think?

@ravinkohli
Contributor

Should I do it separately, as in my latest commit of the test code, or should I really run it from the API? [...] What do you think?

Yeah, so the idea is to train a pipeline with the different schedulers and make sure that the step is made at the right time. The code you added might run by itself, but the tests I would like to see should ensure that the chosen LR scheduler is passed the proper information and takes a step at the appropriate time. This will ensure that CosineAnnealing, or whichever other scheduler is selected in a configuration, takes the appropriate step.
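
As a sketch of the dispatch under discussion (the `step_unit` value follows this PR, but the trainer function below is illustrative, not Auto-PyTorch's actual trainer):

```python
import torch

def train_epoch(loader, model, optimizer, scheduler, step_unit="batch"):
    """One training epoch with the scheduler stepped per batch or per epoch."""
    for X, y in loader:
        optimizer.zero_grad()
        loss = torch.nn.functional.mse_loss(model(X), y)
        loss.backward()
        optimizer.step()
        if step_unit == "batch":
            scheduler.step()  # e.g. CyclicLR, CosineAnnealingWarmRestarts
    if step_unit == "epoch":
        scheduler.step()      # e.g. StepLR, ExponentialLR
```
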

Since we would like to merge this feature promptly, I cut this new branch from the branch hot-fix-adapt... and narrowed down the scope of this PR. The subsequent PR addresses the remaining issues, especially the formatting and the mypy typing.

The intention behind the change from torch.tensor to torch.Tensor is that `import torch.tensor` raised a ModuleNotFoundError; moreover, torch.tensor is not a tensor type class, but torch.Tensor is. Therefore, I changed torch.tensor to torch.Tensor.

Since the previous version always used batch-wise updates, I added step_unit = batch and thereby avoided the errors I got from pytest. Since the latest version only supports the batch-wise update, I just inserted step_unit == "batch" to be able to run the greedy portfolio selection.
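
For reference, the torch.tensor vs. torch.Tensor distinction described above; the import error is the one reported in this PR, since `torch.tensor` is a function rather than a submodule:

```python
import torch

x = torch.tensor([1.0, 2.0])        # torch.tensor is a factory function
assert isinstance(x, torch.Tensor)  # torch.Tensor is the tensor class,
                                    # usable in isinstance checks and
                                    # type annotations
# `import torch.tensor` raises ModuleNotFoundError, since there is no
# torch.tensor submodule to import.
```
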
@ravinkohli ravinkohli merged commit 76bdde7 into automl:development Jun 23, 2021
github-actions bot pushed a commit that referenced this pull request Jun 23, 2021