
Add function to port FX minified graph to HLO via StableHLO #109084

Closed
Wants to merge 1 commit.

Conversation


@awskila awskila commented Sep 12, 2023

If the `XLA_HLO_DEBUG` flag is enabled, generate a minified HLO graph when using the minifier. This function enables HLO minification support by porting the minified FX graph to StableHLO via the `save_torch_model_as_stablehlo` function.

This allows users to port the minified graph to compilers that are not compatible with the TorchDynamo/Inductor workflow and use XLA instead. The purpose of this PR is to help XLA users debug accuracy and compilation errors. It will also be helpful for the existing TorchDynamo/XLA workflow on the `torchxla_trace_once` backend.

Fixes #5461 in Torch XLA repo. CC @GleasonK @qihqi
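A minimal sketch of the gating described above (illustrative only, not the PR's actual code): the StableHLO dump happens only when `XLA_HLO_DEBUG` is set, and the export goes through `save_torch_model_as_stablehlo`. The helper name `dump_minified_hlo`, the exact truthiness check, and the `torch_xla.stablehlo` module path are assumptions for illustration.

```python
import os

def dump_minified_hlo(gm, example_inputs, output_dir="minified_hlo"):
    """Illustrative sketch: emit a StableHLO dump of a minified FX graph
    only when the XLA_HLO_DEBUG environment variable is enabled.

    The late import keeps this a no-op on installs without torch_xla;
    `dump_minified_hlo` itself is a hypothetical name, not PyTorch API.
    """
    if os.environ.get("XLA_HLO_DEBUG", "0") != "1":
        return None  # debugging disabled; do nothing
    # `save_torch_model_as_stablehlo` is the torch_xla helper named in
    # the PR description; the module path is assumed here.
    from torch_xla.stablehlo import save_torch_model_as_stablehlo
    save_torch_model_as_stablehlo(gm, example_inputs, output_dir)
    return output_dir
```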

@pytorch-bot pytorch-bot bot added the release notes: fx release notes category label Sep 12, 2023

pytorch-bot bot commented Sep 12, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/109084

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 198acf8 with merge base c1a2f35:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@janeyx99 janeyx99 added the triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module label Sep 12, 2023
@janeyx99
Contributor

Added @anijain2305 and @mlazos as reviewers as this is minifier related--feel free to reassign reviewers if you're not the right people.


awskila commented Sep 12, 2023

The two failures are unrelated to this PR. Both are due to an Android installation issue (maybe caused by a permissions issue or a read-only filesystem?):

ANDROID_HOME not a directory; did you install it under /opt/android/sdk?


awskila commented Sep 18, 2023

@janeyx99 Seems like @mlazos @anijain2305 may be busy at the moment. Is it possible to ping them, or add another reviewer? There's interest from my team (AWS SageMaker) and StableHLO to get this in as a debugging feature for XLA users.


awskila commented Sep 22, 2023

Is it possible to get a review on this PR? I have been waiting for about two weeks. Thanks!


awskila commented Sep 27, 2023

@anijain2305 @mlazos Can this PR be reviewed this week? We have interest from the AWS SageMaker team plus Google's StableHLO team. Thanks for your time!


@anijain2305 anijain2305 left a comment


I apologize for the delay. It lgtm!


awskila commented Sep 27, 2023

No worries @anijain2305. Thanks! Glad to see it get approved!


awskila commented Oct 2, 2023

@pytorchmergebot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 2, 2023
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).



kit1980 commented Oct 2, 2023

@awskila @anijain2305 After this PR a test started to time out:

https://github.com/pytorch/pytorch/actions/runs/6384661915/job/17328135872

dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_retracibility_dynamic_shapes <- test/dynamo/test_export.py Command took >30min, returning 124

I'm not sure if this PR is actually the cause of this, what do you think?


awskila commented Oct 2, 2023

Hmmm, I took a look at it, and I don't think this PR is the cause of the test failure. The main reason is that this function is only called when the XLA_HLO_DEBUG environment variable is set.

Per the docker container runtime arguments, XLA_HLO_DEBUG is not set.

-e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e PYTORCH_RETRY_TEST_CASES -e PYTORCH_OVERRIDE_FLAKY_SIGNAL -e PR_LABELS -e MAX_JOBS=14 -e SCCACHE_BUCKET -e SCCACHE_S3_KEY_PREFIX -e XLA_CUDA -e XLA_CLANG_CACHE_S3_BUCKET_NAME -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e SKIP_SCCACHE_INITIALIZATION=1 -e HUGGING_FACE_HUB_TOKEN -e DASHBOARD_TAG
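As a quick, illustrative sanity check (not part of the PR), the `-e` flags quoted above can be parsed mechanically to confirm that `XLA_HLO_DEBUG` is not among the variables forwarded into the container:

```python
# Illustrative check: parse the docker `-e VAR` flags from the CI job
# quoted above and confirm XLA_HLO_DEBUG is not forwarded.
docker_args = (
    "-e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY "
    "-e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER "
    "-e GITHUB_RUN_ATTEMPT -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION "
    "-e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS "
    "-e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e PYTORCH_RETRY_TEST_CASES "
    "-e PYTORCH_OVERRIDE_FLAKY_SIGNAL -e PR_LABELS -e MAX_JOBS=14 "
    "-e SCCACHE_BUCKET -e SCCACHE_S3_KEY_PREFIX -e XLA_CUDA "
    "-e XLA_CLANG_CACHE_S3_BUCKET_NAME -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK "
    "-e PYTORCH_TEST_RERUN_DISABLED_TESTS -e SKIP_SCCACHE_INITIALIZATION=1 "
    "-e HUGGING_FACE_HUB_TOKEN -e DASHBOARD_TAG"
)

tokens = docker_args.split()
# Every token that follows a `-e` names a forwarded environment variable,
# possibly with an inline value (e.g. MAX_JOBS=14), so split on "=".
forwarded = {
    tokens[i + 1].split("=")[0]
    for i, tok in enumerate(tokens)
    if tok == "-e"
}

print("XLA_HLO_DEBUG forwarded?", "XLA_HLO_DEBUG" in forwarded)
# → XLA_HLO_DEBUG forwarded? False
```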

Also, the FX minifier does not support dynamic shapes, and test_dynamic_shapes does not import the minifier.

Labels
ciflow/trunk — Trigger trunk jobs on your pull request
Merged
open source
release notes: fx — release notes category
triaged — This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
Development

Successfully merging this pull request may close these issues.

[RFC] Add HLO Minification API
6 participants