This repository has been archived by the owner on Oct 9, 2023. It is now read-only.

PoC: Revamp optimizer and scheduler experience using registries #777

Conversation

@karthikrangasai (Contributor) commented Sep 20, 2021

What does this PR do?

Fixes #752

The optimizer and scheduler can currently be passed as instances, but that doesn't match how the code works: you need the model to instantiate the optimizer, and the optimizer to instantiate the scheduler. The main idea is to use the FlashRegistry class for this. It blends in well with the idea of Flash being a library for fast prototyping of ML models.

This PR fixes the issue using a few ideas:

  • Pass a partial function which instantiates the optimizer or scheduler, given the object it needs to wrap.
  • Use registries, so that in theory you can select an optimizer or scheduler with a string.

How the API looks after this PR is applied:

The optimizer of choice can be passed as a (the snippets below assume ``import functools``, ``import torch``, and ``from flash.image import ImageClassifier``):

# - String value
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer="Adam", lr_scheduler=None)

# - Callable
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer=functools.partial(torch.optim.Adadelta, eps=0.5), lr_scheduler=None)

# - Tuple[string, dict]: (The dict takes in the optimizer kwargs)
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer=("AdaDelta", {"eps": 0.5}), lr_scheduler=None)

The scheduler of choice can be passed as a:

# - String value
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer="Adam", lr_scheduler="constant_schedule")

# - Callable (note: CyclicLR also requires base_lr and max_lr)
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer="Adam", lr_scheduler=functools.partial(torch.optim.lr_scheduler.CyclicLR, base_lr=1e-4, max_lr=1e-2, step_size_up=1500, mode='exp_range', gamma=0.5))

# - Tuple[string, dict]: (The dict takes in the scheduler kwargs)
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer="Adam", lr_scheduler=("StepLR", {"step_size": 10}))
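
For illustration, here is a minimal sketch of how the three accepted forms might be normalized internally (shown for optimizers; schedulers would follow the same pattern). OPTIMIZERS and resolve_optimizer are hypothetical names for this sketch, not Flash's actual FlashRegistry machinery:

from functools import partial
from typing import Callable, Dict, Tuple, Union

import torch

# Stand-in for a FlashRegistry: maps string keys to optimizer classes.
OPTIMIZERS: Dict[str, Callable] = {
    "Adam": torch.optim.Adam,
    "AdaDelta": torch.optim.Adadelta,
}

OptimizerSpec = Union[str, Callable, Tuple[str, dict]]


def resolve_optimizer(spec: OptimizerSpec, params, lr: float) -> torch.optim.Optimizer:
    """Turn a str / callable / (str, kwargs) spec into an optimizer instance."""
    if isinstance(spec, str):
        return OPTIMIZERS[spec](params, lr=lr)
    if isinstance(spec, tuple):
        name, kwargs = spec
        return OPTIMIZERS[name](params, lr=lr, **kwargs)
    # Otherwise assume a callable, e.g. functools.partial(torch.optim.Adadelta, eps=0.5)
    return spec(params, lr=lr)


# Each form resolves to an Adadelta instance:
net = torch.nn.Linear(4, 2)
opt_str = resolve_optimizer("AdaDelta", net.parameters(), lr=1e-3)
opt_tup = resolve_optimizer(("AdaDelta", {"eps": 0.5}), net.parameters(), lr=1e-3)
opt_fn = resolve_optimizer(partial(torch.optim.Adadelta, eps=0.5), net.parameters(), lr=1e-3)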

You can also register your own custom scheduler recipes beforehand and use them as shown above:

@ImageClassifier.lr_schedulers
def my_steplr_recipe(optimizer):
    return torch.optim.lr_scheduler.StepLR(optimizer, step_size=10)

model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer="Adam", lr_scheduler="my_steplr_recipe")

This keeps the work of instantiating and setting up the optimizer and scheduler inside the Flash library, mitigating errors a user might make if we allowed classes to be passed directly.
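
For illustration, a stripped-down registry that supports the bare-decorator registration shown above might look like this; SimpleRegistry is a hypothetical sketch, not Flash's actual FlashRegistry:

from typing import Callable, Dict

import torch


class SimpleRegistry:
    """Registers functions under their __name__, for lookup by string."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._fns: Dict[str, Callable] = {}

    def __call__(self, fn: Callable) -> Callable:
        # Used as a bare decorator: @registry
        self._fns[fn.__name__] = fn
        return fn

    def get(self, key: str) -> Callable:
        return self._fns[key]


lr_schedulers = SimpleRegistry("lr_schedulers")


@lr_schedulers
def my_steplr_recipe(optimizer):
    return torch.optim.lr_scheduler.StepLR(optimizer, step_size=10)


# The Task resolves the string to the recipe and calls it with the optimizer
# it has already built:
optimizer = torch.optim.Adam(torch.nn.Linear(4, 2).parameters(), lr=1e-3)
scheduler = lr_schedulers.get("my_steplr_recipe")(optimizer)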

Before submitting

  • Was this discussed/approved via a GitHub issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests? [not needed for typos/docs]
  • Did you verify new and existing tests pass locally with your changes?
  • If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

  • Is this pull request ready for review? (if not, please submit in draft mode)

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@karthikrangasai changed the title [WIP] PoC: Revamp optimizer and scheduler experience using registries → [skip CI][WIP] PoC: Revamp optimizer and scheduler experience using registries Sep 20, 2021
@tchaton (Contributor) commented Sep 27, 2021

Dear @karthikrangasai,

I added some modifications. Still WIP.

Do you think we should remove optimizer_kwargs and scheduler_kwargs entirely?
Adding support for a tuple seems slightly counter-intuitive, as users might want to control arguments through keyword arguments.

Best,
T.C
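
For context, the two styles being weighed look roughly like this (the first reflects the existing optimizer plus optimizer_kwargs arguments; the second is the tuple form this PR adds):

# Existing style on master: optimizer class plus separate kwargs
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer=torch.optim.Adadelta, optimizer_kwargs={"eps": 0.5})

# Tuple style introduced by this PR
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer=("AdaDelta", {"eps": 0.5}))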

codecov bot commented Sep 27, 2021

Codecov Report

Merging #777 (35f3834) into master (a94ed6c) will decrease coverage by 6.68%.
The diff coverage is 97.12%.


@@            Coverage Diff             @@
##           master     #777      +/-   ##
==========================================
- Coverage   85.18%   78.49%   -6.69%     
==========================================
  Files         228      230       +2     
  Lines       12566    12666     +100     
==========================================
- Hits        10704     9942     -762     
- Misses       1862     2724     +862     
Flag Coverage Δ
unittests 78.49% <97.12%> (-6.69%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
flash/core/model.py 88.15% <93.93%> (+0.57%) ⬆️
flash/audio/speech_recognition/model.py 100.00% <100.00%> (ø)
flash/core/optimizers/__init__.py 100.00% <100.00%> (ø)
flash/core/optimizers/optimizers.py 100.00% <100.00%> (ø)
flash/core/optimizers/schedulers.py 100.00% <100.00%> (ø)
flash/core/utilities/imports.py 90.90% <100.00%> (+0.06%) ⬆️
flash/core/utilities/types.py 100.00% <100.00%> (ø)
flash/graph/classification/model.py 100.00% <100.00%> (ø)
flash/image/classification/model.py 70.45% <100.00%> (-6.15%) ⬇️
flash/image/detection/model.py 73.33% <100.00%> (-26.67%) ⬇️
... and 61 more

Continue to review full report at Codecov.

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a94ed6c...35f3834.

@karthikrangasai changed the title [skip CI][WIP] PoC: Revamp optimizer and scheduler experience using registries → [WIP] PoC: Revamp optimizer and scheduler experience using registries Sep 29, 2021
flash/core/model.py, README.md — outdated review threads (resolved).

Review comment on flash/core/model.py (docstring context):

    optimizer_kwargs: Additional kwargs to use when creating the optimizer (if not passed as an instance).
    scheduler: The scheduler or scheduler class to use.
    scheduler_kwargs: Additional kwargs to use when creating the scheduler (if not passed as an instance).
    lr_scheduler: The scheduler or scheduler class to use.
    learning_rate: Learning rate to use for training, defaults to ``1e-3``.

Contributor: Let's remove the learning rate, which is obsolete now.

Additional review threads on flash/core/model.py (resolved).
@tchaton (Contributor) commented Oct 12, 2021

Hey @SeanNaren @ethanwharris,

Should we drop learning_rate from the Tasks and force kwargs directly?

With @karthikrangasai, we are currently thinking of removing it entirely. It will make accessibility worse, but it will be fully consistent across the codebase.

Best,
T.C

@SeanNaren (Contributor) commented, quoting @tchaton:

    Should we drop learning_rate from the Tasks and force kwargs directly?

    With @karthikrangasai, we are currently thinking of removing it entirely. It will make accessibility worse, but it will be fully consistent across the codebase.

Similar to the discussion of deprecating arguments in the Trainer, it's a convenience for the user. Do we feel it isn't warranted to add to the Task itself? Do we think asking users to just use kwargs all the time is intuitive enough to drop it? I'm personally unsure, but curious what others have to say!
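
Concretely, the trade-off looks something like this (illustrative call shapes only, assuming the tuple form from this PR):

# With the convenience argument:
model = ImageClassifier(backbone="resnet18", num_classes=2, learning_rate=1e-3)

# Without it, the learning rate travels with the optimizer spec:
model = ImageClassifier(backbone="resnet18", num_classes=2, optimizer=("Adam", {"lr": 1e-3}))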

mergify bot removed the "has conflicts" label Oct 13, 2021
mergify bot removed the "has conflicts" label Oct 15, 2021
@tchaton (Contributor) left a review comment:

After those updates, it should be good to go.

Review threads on docs/source/general/optimization.rst, flash/core/model.py, and flash/core/optimizers/schedulers.py (resolved).
mergify bot removed the "has conflicts" label Oct 18, 2021
Review thread on flash/core/model.py (resolved).
@tchaton merged commit b41722a into Lightning-Universe:master Oct 18, 2021
Merging this pull request may close: [RFC] Enable Lightning Flash to support scheduler on step or monitor

3 participants