Add gradient accumulation logic to SupervisedTrainer by jak0bw · Pull Request #6101 · Project-MONAI/MONAI

jak0bw · 2023-03-03T19:09:29Z

Description

A few sentences describing the changes proposed in this pull request.

Types of changes

Non-breaking change (fix or new feature that would not break existing functionality).
Breaking change (fix or new feature that would cause existing functionality to change).
New tests added to cover the changes.
Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
In-line docstrings updated.
Documentation updated, tested make html command in the docs/ folder.

jak0bw · 2023-03-03T19:11:58Z

(Source code is strongly (and shamelessly) influenced by https://pytorch.org/ignite/generated/ignite.engine.supervised_training_step.html

jak0bw · 2023-03-03T19:16:52Z

Needs Feedback if this feature is desired and if yes probably test(s) for the new parameter.

Nic-Ma · 2023-03-06T03:24:47Z

Hi @jak0bw ,

Thanks for your idea and contribution here.
The design of SupervisedTrainer follows the ignite engine logic, it just defines the standard (or default) computation logic for every iteration. If you have any customized logic, please pass it as a callback function:
https://github.com/Project-MONAI/MONAI/blob/dev/monai/engines/trainer.py#L101

Thanks.

jak0bw · 2023-03-06T11:03:17Z

Hi @Nic-Ma,

thank you for your answer.

I am sorry if I misunderstand something critical here but doesn't the standard ignite logic include gradient accumulation similarly to my proposed changes (since ignite 0.4.7)? (My code is more or less copy pasted from the ignite source code) Therefore a change in the way outlined as in this pr would just restore feature parity between ignite and monai and not be considered customized logic.

Link to ignite create_supervised_trainer: https://github.com/pytorch/ignite/blob/c7c0df0fbfdff2a86415476cf0e68f36a089c1d2/ignite/engine/__init__.py#L404

Link to the used step function(s):
https://github.com/pytorch/ignite/blob/c7c0df0fbfdff2a86415476cf0e68f36a089c1d2/ignite/engine/__init__.py#L44

Nic-Ma · 2023-03-06T14:12:52Z

Hi @jak0bw ,

Oh, I didn't notice this new option in ignite.
@vfdev-5 @wyli Do you think it's necessary to add it in MONAI trainer?

Thanks in advance.

wyli · 2023-03-06T14:21:10Z

I think if ignite's supervised_training_step is not directly usable, we should create a util function in monai.engine.utils:

def grad_accumulation_iteration(steps=...):
    def iteration(engine, ...):
        ...
        return engine.output
    return iteration

and the usage would be

monai.engine.SupervisedTrainer(..., iteration_update=monai.engine.utils.grad_accumulation_iteration(steps), ...)

jak0bw · 2023-03-06T15:20:42Z

As mentioned in #6100 it is possible to directly use Ignite's supervised_training_step but as it does not emit the same Events as the monai step function some monai handlers using these Events are not triggered correctly.

wyli · 2023-03-06T18:30:30Z

sure, please consider creating a function in monai.engines.utils
and the engine will prefer this iteration_update if it's provided:

MONAI/monai/engines/workflow.py

Lines 125 to 128 in e375f2a

    
           if iteration_update is not None: 
        
               super().__init__(iteration_update) 
        
           else: 
        
               super().__init__(self._iteration)

this is how we create various iteration_update functions, for example: https://github.com/Project-MONAI/MONAI/blob/dev/monai/apps/deepedit/interaction.py#LL26C7-L26C18

usage:
https://github.com/Project-MONAI/tutorials/blob/aa4ca78d3e7f08c6d8f5a5a009d5da508acdb6ad/deepedit/ignite/train.py#L250C37-L259

Signed-off-by: Jakob Weigand <jakob.weigand@tum.de>

jak0bw · 2023-03-28T14:24:09Z

@wyli I think I added a custom iteration update function as requested. The code still has a circular import error (the circular import of SupervisedTrainer) which I don't know how to resolve really as it kinda depends on how the monai project is structured (or deals with these problems in code).

Additionally tests are still missing.

@Nic-Ma Unfortunately, I don't really have time to further work on this pull request in the near future. Therefore this pull request (and/or the corresponding issue) can be marked for contribution wanted, closed or however the monai team wants to deal with it.

jak0bw mentioned this pull request Mar 3, 2023

Add gradient accumulation functionality to SupervisedTrainer #6100

Open

jak0bw force-pushed the add-gradient-accumulation-to-supervised-trainer branch from 6bc4652 to de2884a Compare March 3, 2023 19:14

jak0bw force-pushed the add-gradient-accumulation-to-supervised-trainer branch from 3c25c7c to 68177ac Compare March 3, 2023 19:36

jak0bw marked this pull request as draft March 4, 2023 12:37

jak0bw force-pushed the add-gradient-accumulation-to-supervised-trainer branch from 5538ec7 to da515e3 Compare March 6, 2023 19:05

jak0bw force-pushed the add-gradient-accumulation-to-supervised-trainer branch 3 times, most recently from d48d18d to 3301889 Compare March 20, 2023 14:25

jak0bw force-pushed the add-gradient-accumulation-to-supervised-trainer branch 6 times, most recently from 303a1a8 to 1d8415e Compare March 28, 2023 14:01

Add gradient accumulation supervised trainer update step function

fda5cb6

Signed-off-by: Jakob Weigand <jakob.weigand@tum.de>

jak0bw force-pushed the add-gradient-accumulation-to-supervised-trainer branch from dfe24ef to fda5cb6 Compare March 28, 2023 14:09

wyli added the Contribution wanted label Mar 29, 2023

aymuos15 mentioned this pull request Mar 3, 2026

Add GradientAccumulation utility for SupervisedTrainer #8763

Open

17 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add gradient accumulation logic to SupervisedTrainer#6101

Add gradient accumulation logic to SupervisedTrainer#6101
jak0bw wants to merge 1 commit intoProject-MONAI:devfrom
jak0bw:add-gradient-accumulation-to-supervised-trainer

jak0bw commented Mar 3, 2023 •

edited by wyli

Loading

Uh oh!

jak0bw commented Mar 3, 2023

Uh oh!

jak0bw commented Mar 3, 2023

Uh oh!

Nic-Ma commented Mar 6, 2023

Uh oh!

jak0bw commented Mar 6, 2023 •

edited

Loading

Uh oh!

Nic-Ma commented Mar 6, 2023

Uh oh!

wyli commented Mar 6, 2023

Uh oh!

jak0bw commented Mar 6, 2023

Uh oh!

wyli commented Mar 6, 2023

Uh oh!

jak0bw commented Mar 28, 2023 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jak0bw commented Mar 3, 2023 • edited by wyli Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Types of changes

Uh oh!

jak0bw commented Mar 3, 2023

Uh oh!

jak0bw commented Mar 3, 2023

Uh oh!

Nic-Ma commented Mar 6, 2023

Uh oh!

jak0bw commented Mar 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Nic-Ma commented Mar 6, 2023

Uh oh!

wyli commented Mar 6, 2023

Uh oh!

jak0bw commented Mar 6, 2023

Uh oh!

wyli commented Mar 6, 2023

Uh oh!

jak0bw commented Mar 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jak0bw commented Mar 3, 2023 •

edited by wyli

Loading

jak0bw commented Mar 6, 2023 •

edited

Loading

jak0bw commented Mar 28, 2023 •

edited

Loading