
Remove message of checking apex.amp module and add tests for features of gradient accumulation/mixed precision training #46

Merged (6 commits merged into davidtvs:master on Jun 7, 2020)

Conversation

@NaleRaphael (Contributor) commented Jun 6, 2020

This PR fixes issue #45 and adds new test cases for the features implemented in PR #9.

A quick summary for this PR:

  1. The message `To enable mixed precision training, please install apex...` is removed. (solved in commit 227fc53)
  2. A silly mistake is fixed: batch_size was not passed into DataLoader, so the data loaders in the test cases had been running with the default batch_size=1. Though this does not affect the correctness of the tests, it still needed to be corrected. (solved in commit 1c549ec)
  3. Another mistake is fixed: the test cases never moved the model to the device declared in task.__init__(), so all tests were running on the CPU even when the pytest argument --cpu_only was not specified. (solved in commit c854714)
  4. More tests for the gradient accumulation and mixed precision features are added in commit e073ac8.

Note that a new dependency, pytest-mock, is added for the new test cases; a rough sketch of such a test follows below.
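
As an illustration only (this is not the test code added in this PR; the model, data, and loop below are placeholders), a pytest-mock based check for gradient accumulation could look roughly like this, using `mocker.spy` to count how often the optimizer actually steps:

```python
import torch
import torch.nn as nn


def test_gradient_accumulation_calls_step_once_per_accumulated_batch(mocker):
    # Tiny placeholder model and data; the real tests use their own Task helpers.
    model = nn.Linear(4, 1)
    optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
    criterion = nn.MSELoss()
    accumulation_steps, num_batches = 4, 8

    # Spy on the optimizer so the real method still runs but calls are recorded.
    step_spy = mocker.spy(optimizer, "step")

    optimizer.zero_grad()
    for i in range(num_batches):
        inputs, targets = torch.randn(2, 4), torch.randn(2, 1)
        loss = criterion(model(inputs), targets) / accumulation_steps
        loss.backward()  # gradients accumulate across iterations
        if (i + 1) % accumulation_steps == 0:
            optimizer.step()
            optimizer.zero_grad()

    # With 8 batches accumulated over 4, the optimizer should step exactly twice.
    assert step_spy.call_count == num_batches // accumulation_steps
```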

The original purpose of that message was to let users know that gradient
accumulation and mixed precision training are supported but require `apex`.

With the attention brought by issue davidtvs#45, the following things are
confirmed:

- Gradient accumulation still works properly without `apex.amp`,
  which is why it falls back to a plain `loss.backward()` when
  `apex.amp` is not available or `amp.initialize()` was not called.

- When mixed precision training is wanted, that is, the model
  and optimizer are wrapped by `amp.initialize()`, `amp.scale_loss()`
  is adopted automatically by the current implementation.

Therefore, the message about checking for the `apex.amp` module is
no longer necessary.
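
The fallback behaviour described above can be sketched roughly as follows (a sketch, not the library's exact implementation; it relies on the `_amp_stash` attribute that apex attaches to optimizers wrapped by `amp.initialize()` as the "amp is active" marker):

```python
IS_AMP_AVAILABLE = False
try:
    from apex import amp

    IS_AMP_AVAILABLE = True
except ImportError:
    pass


def backward_pass(loss, optimizer):
    if IS_AMP_AVAILABLE and hasattr(optimizer, "_amp_stash"):
        # Mixed precision: scale the loss so fp16 gradients do not underflow.
        with amp.scale_loss(loss, optimizer) as scaled_loss:
            scaled_loss.backward()
    else:
        # Fallback: plain backward pass; gradient accumulation still works.
        loss.backward()
```
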
This mistake made the batch size of every data loader fall back to the
default value of 1. Though it does not affect the correctness of the
test cases, it still needs to be corrected.

However, the `batch_size` of a `DataLoader` cannot be modified
after it is initialized. Therefore, it can only be determined
while generating tasks for the tests, which is why `batch_size`
and `steps` are moved into the signature of `__init__` of each
`Task`.
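
A hypothetical sketch of that change (the class and dataset below are placeholders, not the test suite's actual `Task` classes): because a `DataLoader`'s batch size is fixed at construction time, it has to be chosen when the task builds its loader, i.e. in `__init__`.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset


class ExampleTask:  # placeholder name for a test task
    def __init__(self, batch_size=8, steps=100):
        self.batch_size = batch_size
        self.steps = steps
        dataset = TensorDataset(torch.randn(64, 4), torch.randn(64, 1))
        # batch_size must be passed here; it cannot be changed afterwards.
        self.dataloader = DataLoader(dataset, batch_size=batch_size)
```
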
This functionality was not added before, which made all tests run
on the CPU even when the pytest argument `--cpu_only` was not specified.
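
A sketch of the device handling described above (with hypothetical names; the actual tests wire this through their Task classes and the `--cpu_only` pytest option):

```python
import torch
import torch.nn as nn


def select_device(cpu_only: bool = False) -> torch.device:
    # Fall back to CPU when requested via --cpu_only or when CUDA is absent.
    if cpu_only or not torch.cuda.is_available():
        return torch.device("cpu")
    return torch.device("cuda")


# Inside a task's __init__ the model is then moved explicitly:
model = nn.Linear(4, 1).to(select_device(cpu_only=False))
```
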
@davidtvs (Owner) left a comment:

Thanks for another good contribution

@@ -4,6 +4,14 @@
import task as mod_task


try:
    import apex
davidtvs (Owner) commented:

I think this line can be changed to `from apex import amp` and we can then remove the local imports from the functions

@NaleRaphael (Contributor, Author) replied:

Yeah, I'll fix it.
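
A rough sketch of the suggested pattern (not the final committed code; the flag name is a placeholder): import `amp` once at module level inside try/except, so the individual test functions no longer need their own local `from apex import amp` imports.

```python
try:
    from apex import amp  # noqa: F401

    IS_APEX_AVAILABLE = True
except ImportError:
    amp = None
    IS_APEX_AVAILABLE = False
```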

    reason="`apex` module and gpu is required to run this test."
)
def test_gradient_accumulation_with_apex_amp(self, mocker):
    from apex import amp
davidtvs (Owner) commented:

Remove line (see comment about import)

)
class TestMixedPrecision:
    def test_mixed_precision(self, mocker):
        from apex import amp
davidtvs (Owner) commented:

Remove line (see comment about import)

@davidtvs (Owner) left a comment:

Forgot to select the proper radio button

@davidtvs (Owner) commented Jun 6, 2020

/black-check

@github-actions (bot) left a comment:

No linting violations have been found in this PR.

@davidtvs (Owner) commented Jun 6, 2020

/flake8-lint

@github-actions (bot) left a comment:

Lintly has detected code quality issues in this pull request.

@@ -4,6 +4,14 @@
import task as mod_task


try:
    import apex
github-actions (bot) commented:

F401: 'apex' imported but unused

@davidtvs (Owner) left a comment:

Nice work. Thanks!

@davidtvs merged commit 23a23cf into davidtvs:master on Jun 7, 2020