Do not fail on lack of default precision set. #6139

golechwierowicz · 2023-12-13T20:16:29Z

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class.

@cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134, but in general it can be done prior to moving the model to the device safely.

I discovered some models from the suite do not have the precision set so instead of failing the script we just log the case, and use the default precision, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. ecg@, note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134.

cota · 2023-12-13T20:29:12Z

Thanks Greg -- I had missed that your change was reverted as part of another revert.

Please land this (including the PR description when merging, otherwise annoyingly we lose that info in the repo) and I'll try to re-land your reverted change.

frgossen

One comment.
Thanks you for finding these thinsg!

benchmarks/torchbench_model.py

cota · 2023-12-13T22:08:50Z

Please land this (including the PR description when merging, otherwise annoyingly we lose that info in the repo) and I'll try to re-land your reverted change.

Given that Jack is looking into this at the bridge level, I'll wait for him to finish that investigation since that's the right place to fix this.
Thanks again for identifying this issue Greg!

@cota

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. @cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in pytorch#6134, but in general it can be done prior to moving the model to the device safely.

@cota

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. @cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134, but in general it can be done prior to moving the model to the device safely.

ysiraichi · 2024-01-23T19:20:23Z

benchmarks/torchbench_model.py

    elif precision == "amp":
-      raise f"AMP for PT/XLA:GPU is not implemented yet for torchbench models"
+      raise ValueError(
+          f"AMP for PT/XLA:GPU is not implemented yet for torchbench models")


@miladm @frgossen @golechwierowicz Isn't AMP supported on PT/XLA:GPU?

@cota

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. @cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134, but in general it can be done prior to moving the model to the device safely.

golechwierowicz requested review from cota and frgossen December 13, 2023 20:16

cota approved these changes Dec 13, 2023

View reviewed changes

frgossen approved these changes Dec 13, 2023

View reviewed changes

benchmarks/torchbench_model.py Show resolved Hide resolved

Add documentation to the apply_default_precision_config

2791a22

golechwierowicz merged commit 6b1344e into master Dec 13, 2023

ysiraichi reviewed Jan 23, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Do not fail on lack of default precision set. #6139

Do not fail on lack of default precision set. #6139

golechwierowicz commented Dec 13, 2023 •

edited

Loading

Uh oh!

cota commented Dec 13, 2023

Uh oh!

frgossen left a comment

Uh oh!

Uh oh!

cota commented Dec 13, 2023

Uh oh!

ysiraichi Jan 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Do not fail on lack of default precision set. #6139

Do not fail on lack of default precision set. #6139

Conversation

golechwierowicz commented Dec 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cota commented Dec 13, 2023

Uh oh!

frgossen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cota commented Dec 13, 2023

Uh oh!

ysiraichi Jan 23, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

golechwierowicz commented Dec 13, 2023 •

edited

Loading