Loop Refactor 3/N - Evaluation Loop #7990

awaelchli · 2021-06-15T15:36:18Z

What does this PR do?

Introduces the evaluation loop under the new interface introduced in #7871

Three new classes:

DataLoaderLoop: This loop runs over a list of dataloaders
EvaluationDataLoaderLoop: A subclass of DataLoaderLoop running over a list of evaluation dataloaders (can be test or validation)
EvaluationEpochLoop: Runs a single epoch of validation or test.

In the next PR, we will introduce also the PredictionDataLoaderLoop and PredictionEpochLoop.

Before submitting

Was this discussed/approved via a GitHub issue? (not for typos and docs)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes? (if necessary)
Did you write any new necessary tests? (not for typos and docs)
Did you verify new and existing tests pass locally with your changes?
Did you update the CHANGELOG? (not for typos, docs, test updates, or internal minor changes/refactorings)

PR review

Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self-explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

Make sure you had fun coding 🙃

Co-authored-by: Justus Schock <justus.schock@rwth-aachen.de> Co-authored-by: Justus Schock <justus.schock@posteo.de> Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>

pep8speaks · 2021-06-15T15:36:21Z

Hello @awaelchli! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-06-18 11:00:30 UTC

for more information, see https://pre-commit.ci

codecov · 2021-06-15T15:37:44Z

Codecov Report

Merging #7990 (6545bc9) into master (3fece17) will decrease coverage by 0%.
The diff coverage is 96%.

@@          Coverage Diff           @@
##           master   #7990   +/-   ##
======================================
- Coverage      92%     91%   -0%     
======================================
  Files         207     210    +3     
  Lines       13479   13558   +79     
======================================
+ Hits        12346   12392   +46     
- Misses       1133    1166   +33

…eval' into refactor/loops/loops_everywhere_eval

for more information, see https://pre-commit.ci

…eval' into refactor/loops/loops_everywhere_eval

pytorch_lightning/loops/dataloader/dataloader_loop.py

pytorch_lightning/loops/dataloader/evaluation_dataloader_loop.py

pytorch_lightning/trainer/trainer.py

pytorch_lightning/loops/dataloader/dataloader_loop.py

pytorch_lightning/loops/dataloader/evaluation_dataloader_loop.py

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

…eval' into refactor/loops/loops_everywhere_eval

ethanwharris

LGTM

pytorch_lightning/loops/dataloader/evaluation_dataloader_loop.py

Co-authored-by: Ethan Harris <ewah1g13@soton.ac.uk>

tchaton

Looks awesome !

pytorch_lightning/loops/dataloader/evaluation_dataloader_loop.py

Borda · 2021-06-18T10:42:52Z

pytorch_lightning/loops/evaluation_epoch_loop.py

+        if self.trainer.testing:
+            self.trainer.lightning_module._current_fx_name = "test_step"
+            with self.trainer.profiler.profile("test_step"):
+                output = self.trainer.accelerator.test_step(step_kwargs)
+        else:
+            self.trainer.lightning_module._current_fx_name = "validation_step"
+            with self.trainer.profiler.profile("validation_step"):
+                output = self.trainer.accelerator.validation_step(step_kwargs)


Suggested change

if self.trainer.testing:

self.trainer.lightning_module._current_fx_name = "test_step"

with self.trainer.profiler.profile("test_step"):

output = self.trainer.accelerator.test_step(step_kwargs)

else:

self.trainer.lightning_module._current_fx_name = "validation_step"

with self.trainer.profiler.profile("validation_step"):

output = self.trainer.accelerator.validation_step(step_kwargs)

name_step = "test_step" if self.trainer.testing else "validation_step"

self.trainer.lightning_module._current_fx_name =name_step

with self.trainer.profiler.profile(name_step):

if self.trainer.testing:

output = self.trainer.accelerator.test_step(step_kwargs)

else:

output = self.trainer.accelerator.validation_step(step_kwargs)

pytorch_lightning/loops/evaluation_epoch_loop.py

Borda · 2021-06-18T10:48:04Z

pytorch_lightning/loops/evaluation_epoch_loop.py

+        if output is not None:
+            if isinstance(output, ResultCollection):
+                output = output.detach()
+                if self.trainer.move_metrics_to_cpu:
+                    output = output.cpu()
+            elif isinstance(output, dict):
+                output = recursive_detach(output, to_cpu=self.trainer.move_metrics_to_cpu)
+            elif isinstance(output, Tensor) and output.is_cuda and self.trainer.move_metrics_to_cpu:
+                output = output.cpu()
+            outputs.append(output)
+        return outputs


Suggested change

if output is not None:

if isinstance(output, ResultCollection):

output = output.detach()

if self.trainer.move_metrics_to_cpu:

output = output.cpu()

elif isinstance(output, dict):

output = recursive_detach(output, to_cpu=self.trainer.move_metrics_to_cpu)

elif isinstance(output, Tensor) and output.is_cuda and self.trainer.move_metrics_to_cpu:

output = output.cpu()

outputs.append(output)

return outputs

if output is None:

return outputs

if isinstance(output, ResultCollection):

output = output.detach()

if self.trainer.move_metrics_to_cpu:

output = output.cpu()

elif isinstance(output, dict):

output = recursive_detach(output, to_cpu=self.trainer.move_metrics_to_cpu)

elif isinstance(output, Tensor) and output.is_cuda and self.trainer.move_metrics_to_cpu:

output = output.cpu()

outputs.append(output)

return outputs

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>

…eval' into refactor/loops/loops_everywhere_eval

awaelchli and others added 4 commits June 15, 2021 15:52

add classes

d1ab532

Co-authored-by: Justus Schock <justus.schock@rwth-aachen.de> Co-authored-by: Justus Schock <justus.schock@posteo.de> Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>

trainer changes

781f26b

connect

7c0f96e

clean up

4533b2c

[pre-commit.ci] auto fixes from pre-commit.com hooks

ea48342

for more information, see https://pre-commit.ci

awaelchli added the refactor label Jun 15, 2021

awaelchli added this to the v1.4 milestone Jun 15, 2021

awaelchli and others added 8 commits June 15, 2021 17:40

update test renaming

9a3a908

Merge remote-tracking branch 'origin/refactor/loops/loops_everywhere_…

a7d2d86

…eval' into refactor/loops/loops_everywhere_eval

rename evaluation loop to evaluation epoch loop

d711a49

minor docstring improvements

e592423

update chlog

e3c0512

[pre-commit.ci] auto fixes from pre-commit.com hooks

9044c19

for more information, see https://pre-commit.ci

try ci fix

c5c02ec

Merge remote-tracking branch 'origin/refactor/loops/loops_everywhere_…

cc4c57c

…eval' into refactor/loops/loops_everywhere_eval

Borda assigned awaelchli Jun 15, 2021

awaelchli added 3 commits June 16, 2021 02:11

Merge branch 'master' into refactor/loops/loops_everywhere_eval

6d8af9f

update code owners for pl/loops

7e030e5

update mock path

d174d04

tchaton reviewed Jun 16, 2021

View reviewed changes

re-order

d135da8

awaelchli marked this pull request as ready for review June 16, 2021 21:48

awaelchli requested review from Borda, carmocca, justusschock, kaushikb11, SeanNaren and williamFalcon as code owners June 16, 2021 21:48

remove a todo that needs more discussion

0f4d536

carmocca reviewed Jun 18, 2021

View reviewed changes

pytorch_lightning/loops/dataloader/dataloader_loop.py Outdated Show resolved Hide resolved

pytorch_lightning/loops/dataloader/evaluation_dataloader_loop.py Outdated Show resolved Hide resolved

awaelchli and others added 5 commits June 18, 2021 02:42

combine _get_num_dataloaders with the property

9db3ddc

Update pytorch_lightning/loops/dataloader/dataloader_loop.py

90da366

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

Merge remote-tracking branch 'origin/refactor/loops/loops_everywhere_…

891a429

…eval' into refactor/loops/loops_everywhere_eval

black + yapf

e583d6e

avoid coverage on old unused eval loop

c7579ab

ethanwharris approved these changes Jun 18, 2021

View reviewed changes

pytorch_lightning/loops/dataloader/evaluation_dataloader_loop.py Outdated Show resolved Hide resolved

awaelchli added the ready PRs ready to be merged label Jun 18, 2021

awaelchli and others added 2 commits June 18, 2021 10:02

empty space in docstring

7f785f4

Co-authored-by: Ethan Harris <ewah1g13@soton.ac.uk>

resolve todo for args forwarding

89ba6fc

tchaton approved these changes Jun 18, 2021

View reviewed changes

tchaton enabled auto-merge (squash) June 18, 2021 10:20