
Disable dynamo on some opt methods and differentiable optimizer tests #103066

Closed
wants to merge 5 commits

Conversation

@mlazos (Contributor) commented Jun 6, 2023

  • Disables dynamo on the differentiable optimizer tests
  • Disables dynamo on some test methods that expose a very rare dynamo edge case
  • Disables dynamo on the optimizer state export/save methods, since dynamo shouldn't trace those anyway (see the sketch below)

I have a draft PR to fix the two tests marked as skipped due to unsupported mutation of `step`.
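A minimal sketch of what the method-level disabling looks like, assuming `torch._dynamo.disable` is applied directly as a decorator (the optimizer subclass below is an illustrative placeholder, not the exact diff in this PR):

```python
import torch
import torch._dynamo


class ExampleOptimizer(torch.optim.SGD):  # placeholder subclass, for illustration only
    # Dynamo should not trace optimizer state export/save, so fall back to eager here.
    @torch._dynamo.disable
    def state_dict(self):
        return super().state_dict()

    @torch._dynamo.disable
    def load_state_dict(self, state_dict):
        return super().load_state_dict(state_dict)
```

When compiled code reaches one of these methods, dynamo inserts a graph break and runs the body eagerly instead of tracing it.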

cc @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @ipiszy

@mlazos mlazos requested review from malfet and janeyx99 June 6, 2023 06:45
@pytorch-bot (bot) commented Jun 6, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/103066

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 65938d3:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@janeyx99 (Contributor) left a comment


  1. what is the rare dynamo edge case? should an issue be filed or a big NOTES comment be written so people are aware why certain things are disabled?
  2. do we not trace state dict info for modules as well?

CI still seems red :(

Review comment on test/optim/test_optim.py (outdated, resolved)
@janeyx99 (Contributor) commented Jun 6, 2023

Is this PR intended to fix the flakiness?

@mlazos (Contributor, Author) commented Jun 6, 2023

> Is this PR intended to fix the flakiness?

Sorry told Nikita in chat

There are still two more issues (with Adadelta and RMSProp) that I'm working on a separate PR for.
I can add more comments.

Re the edge case: I can file an issue, but I've never seen it repro in real workloads and haven't been able to create a minimal repro. I've only seen it while tracing test code, so I don't think it's worth handling.

@mlazos (Contributor, Author) commented Jun 6, 2023

@janeyx99 this should be green; I added skips for Adadelta and RMSprop because I have a more involved fix for those that I'm cleaning up right now.
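For reference, a test-side skip of this kind is usually spelled with the `skipIfTorchDynamo` decorator from `torch.testing._internal.common_utils`; the test class and method names below are placeholders, not the actual hunk from this PR:

```python
from torch.testing._internal.common_utils import TestCase, run_tests, skipIfTorchDynamo


class ExampleOptimTests(TestCase):  # placeholder test class, for illustration only
    @skipIfTorchDynamo("Adadelta/RMSprop need a separate dynamo fix")
    def test_adadelta_step_sketch(self):
        ...  # a real test body would exercise the optimizer step here


if __name__ == "__main__":
    run_tests()
```

The decorator only takes effect when the suite runs with dynamo enabled, so the test should keep running in the regular eager CI jobs.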

@mlazos mlazos requested a review from janeyx99 June 6, 2023 17:57
@janeyx99 (Contributor) commented Jun 6, 2023

> Is this PR intended to fix the flakiness?
>
> Sorry told Nikita in chat

Could the PR description link the PR that introduced the flakiness so we have a coherent story for later search? Even if it doesn't fix it all the way, it would be good to know the impact of this specific PR.

> There are still two more issues (with Adadelta and RMSProp) that I'm working on a separate PR for. I can add more comments.

Yes pls.

> Re the edge case: I can file an issue, but I've never seen it repro in real workloads and haven't been able to create a minimal repro. I've only seen it while tracing test code, so I don't think it's worth handling.

Then a NOTE comment in the code would suffice.

@janeyx99 (Contributor) left a comment


Approving to unblock but please do the due diligence of explaining (in the PR description/code) which of these will be fixed in the future vs never fixed.

@mlazos (Contributor, Author) commented Jun 6, 2023

> Approving to unblock but please do the due diligence of explaining (in the PR description/code) which of these will be fixed in the future vs never fixed.

Yeah, I requested review since I added notes in the code; all good, ty.

@malfet (Contributor) commented Jun 6, 2023

Feel free to either rebase or ignore the CUDA-12.1 failure (the workflow was added after you created the PR).

Review comment on test/optim/test_optim.py (outdated, resolved)
@pytorchmergebot (Collaborator)

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status.

@pytorchmergebot (Collaborator)

Merge failed

Reason: Comment with id 1579492388 not found

Details for Dev Infra team. Raised by workflow job.

@mlazos (Contributor, Author) commented Jun 6, 2023

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status.

@pytorchmergebot (Collaborator)

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team. Raised by workflow job.

Failing merge rule: Core Maintainers

@mlazos (Contributor, Author) commented Jun 7, 2023

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status.
