
Conversation

@zou3519 zou3519 (Contributor) commented Sep 11, 2018

Also adds two tests that check for memory leaks while the relevant graph executors are alive:

  • (minimal test) Create a ScriptModule, keep it alive, and test that it does not leak memory while it is alive
  • (large test) Do MNIST training with a traced MNIST module and test that no memory is leaked while the traced module (with graph executor) is alive
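The leak-check pattern described by these tests can be sketched with the standard-library `tracemalloc` module. This is a hedged illustration only: the actual tests measure CUDA memory via PyTorch, and `FakeModule`, `check_no_leak`, and the 64 KiB threshold here are hypothetical stand-ins.

```python
import tracemalloc

class FakeModule:
    # Hypothetical stand-in for a ScriptModule with a graph executor.
    def __call__(self, x):
        return [v * 2 for v in x]

def check_no_leak(module, inputs, iters=100):
    # Warm up so caches and allocator pools reach a steady state first.
    for _ in range(10):
        module(inputs)
    tracemalloc.start()
    baseline, _ = tracemalloc.get_traced_memory()
    for _ in range(iters):
        module(inputs)
    current, _ = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    # While the module (and its executor) stays alive, repeated calls
    # should not keep accumulating memory; return the observed growth.
    return current - baseline

growth = check_no_leak(FakeModule(), [1.0, 2.0, 3.0])
print(growth < 64 * 1024)  # expect True: growth stays near zero
```

The key design point is measuring growth while the module is still alive, rather than after it is destroyed, so that leaks held by the live graph executor itself are caught.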

cc @apaszke @zdevito

@zou3519 zou3519 added the oncall: jit Add this issue/PR to JIT oncall triage queue label Sep 11, 2018
@apaszke apaszke (Contributor) left a comment

LGTM, but it would be great if we could make sure that the new tests don't take forever to run

test/test_jit.py Outdated

This comment was marked as off-topic.

@facebook-github-bot facebook-github-bot (Contributor) left a comment

zou3519 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


Also adds two tests:
- Create a ScriptModule, keep it alive, and test that it does not leak memory while it is alive
- Do MNIST training and test that no memory is leaked while the MNIST module (with graph executor) is alive

petrex pushed a commit to petrex/pytorch that referenced this pull request Sep 12, 2018
* master: (165 commits)
  Aibench for asr decoder
  Explicitly set locale on docs build. (pytorch#11595)
  Documentation for debugging JIT
  Fused weightnorm for ATen (pytorch#10842)
  Move Type, Tensor, TensorMethods to core.
  Add reminder % to the jit
  Fix reloading modules back into python (pytorch#11552)
  Add trigonometry functions to docs/source/onnx.rst
  Add EndToEndHybridModel CUDA tests (pytorch#11544)
  minor formatting error log (pytorch#11528)
  Warn that export+import module always load onto the CPU (pytorch#11485)
  caffe2::StorageImpl use at::DataPtr (pytorch#11282)
  Sync all libnccl soversions, not just libnccl.so.1 (pytorch#11575)
  Document BatchNorm and update default behavior (pytorch#11484)
  Typo fix in randomness.rst (pytorch#11571)
  Move some bmm/baddbmm to ATen (pytorch#11292)
  Make c10d test work on CPU only build (pytorch#11567)
  Clean up some C++ cruftiness in the script lexer.
  Allow setting deletion constant
  Make C10d support CPU only build (pytorch#11513)
  ...
@ssnl ssnl (Collaborator) commented Sep 12, 2018

Is it expected that test_mnist_training_leaks_no_memory_cuda gives me this warning:

test_mnist_training_leaks_no_memory_cuda (__main__.TestEndToEndHybridFrontendModels) ... test/test_jit.py:7046: TracerWarning: Trace had nondeterministic nodes. Nodes:
        %92 : Double(5, 50) = aten::dropout(%89, %90, %91), scope: MnistNet
This may cause errors in trace checking. To disable trace checking, pass check_trace=False to torch.jit.trace()
  traced_net = torch.jit.trace(net, [torch.randn(5, 1, 28, 28, device='cuda')])
test/test_jit.py:7046: TracerWarning: Output nr 1. of the traced function does not match the corresponding output of the Python function. Detailed error:
Not within tolerance rtol=1e-05 atol=1e-08 at input[4, 3] (-2.3382810849225493 vs. -2.803113561750903) and 49 other locations (100.00%)
  traced_net = torch.jit.trace(net, [torch.randn(5, 1, 28, 28, device='cuda')])
ok
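The mismatch in this warning comes from `aten::dropout` being stochastic: tracing runs the model once, trace checking re-runs it, and a nondeterministic op produces different outputs on each run, so the tolerance comparison fails (per the warning text, `check_trace=False` disables that re-check). A minimal standard-library illustration of why any stochastic op fails an output-comparison check; this `dropout` is a hypothetical stand-in for `aten::dropout`, not PyTorch code:

```python
import random

def dropout(xs, p=0.5):
    # Randomly zero elements and rescale the survivors, like inverted
    # dropout: nondeterministic by design.
    return [0.0 if random.random() < p else x / (1 - p) for x in xs]

xs = [1.0] * 50

random.seed(0)
first = dropout(xs)   # the "traced" run
random.seed(1)
second = dropout(xs)  # the re-check run

# The two runs disagree, so an elementwise tolerance comparison between
# the traced output and the re-check output reports mismatches.
print(first == second)  # False
```

This is why the warning is expected behavior for a model containing dropout, rather than a bug in the trace itself.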

@zou3519 zou3519 deleted the jit-cuda-tests3 branch September 13, 2018 15:35
@zou3519 zou3519 (Contributor, Author) commented Sep 13, 2018

@ssnl I'm fixing that in #11639

@ezyang ezyang added the merged label Jun 26, 2019



5 participants