Revert "Fix model initialization. (#6076)" #6134

frgossen · 2023-12-13T17:03:03Z

This reverts commit 7786d5d.

python benchmarks/experiment_runner.py --dynamo=openxla --dynamo=openxla_eval --dynamo=inductor --xla=PJRT --xla=None --test=eval --test=train --suite-name=torchbench --accelerator=cuda --filter=^alexnet$ --no-resume fails for openxla benchmarks.

zpcore · 2023-12-13T17:40:18Z

Sorry I didn't catch up with recent PRs. Do we already support automatically match dynamo backend openxla, openxla_eval with train and eval based on the cmdline in the description?

frgossen · 2023-12-13T17:57:10Z

We should support openxla_eval + eval, openxla + eval, and openxla + train
This issue is unrelated though. The flags are expanded to configs, so if the combination is not supported it should just skip it.
The failure I see is a segfault which did not happen before.

zpcore

Thanks for the clarification. LGTM! Can you also resolve the conflict?

This reverts commit 7786d5d. `python benchmarks/experiment_runner.py --dynamo=openxla --dynamo=openxla_eval --dynamo=inductor --xla=PJRT --xla=None --test=eval --test=train --suite-name=torchbench --accelerator=cuda --filter=^alexnet$ --no-resume` fails for openxla benchmarks.

I discovered some models from the suite do not have the precision set so instead of failing the script we just log the case, and use the default precision, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. ecg@, note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134.

@cota

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. @cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134, but in general it can be done prior to moving the model to the device safely.

@cota

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. @cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in pytorch#6134, but in general it can be done prior to moving the model to the device safely.

@cota

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. @cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134, but in general it can be done prior to moving the model to the device safely.

@cota

I discovered some models from the suite do not have the default precision set so instead of failing the script we just log the case, and do nothing, as no additional machinery should run for the Inductor anyway. Additionally I wrapped the exceptions with the ValueError so the logging message will not pollute with info about str not inheriting from Exception class. @cota , note that needs to be hooked "somewhere". Not sure where, as there was a revert in #6134, but in general it can be done prior to moving the model to the device safely.

frgossen requested review from JackCaoG, vanbasten23, ysiraichi and zpcore and removed request for JackCaoG December 13, 2023 17:03

zpcore approved these changes Dec 13, 2023

View reviewed changes

frgossen force-pushed the frg-revert-6076 branch from c67100e to 6e61d8d Compare December 13, 2023 18:59

frgossen force-pushed the frg-revert-6076 branch from 6e61d8d to 833ec06 Compare December 13, 2023 19:14

frgossen merged commit 788e1b5 into master Dec 13, 2023

frgossen deleted the frg-revert-6076 branch December 13, 2023 19:15

golechwierowicz mentioned this pull request Dec 13, 2023

Do not fail on lack of default precision set. #6139

Merged

chunnienc pushed a commit to chunnienc/xla that referenced this pull request Dec 14, 2023

Revert "Fix model initialization. (pytorch#6076)" (pytorch#6134)

d5cf17c

golechwierowicz pushed a commit that referenced this pull request Jan 12, 2024

Revert "Fix model initialization. (#6076)" (#6134)

ebfbb46

bhavya01 pushed a commit that referenced this pull request Apr 22, 2024

Revert "Fix model initialization. (#6076)" (#6134)

e73e48f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Revert "Fix model initialization. (#6076)" #6134

Revert "Fix model initialization. (#6076)" #6134

Uh oh!

frgossen commented Dec 13, 2023

Uh oh!

zpcore commented Dec 13, 2023

Uh oh!

frgossen commented Dec 13, 2023

Uh oh!

zpcore left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Revert "Fix model initialization. (#6076)" #6134

Revert "Fix model initialization. (#6076)" #6134

Uh oh!

Conversation

frgossen commented Dec 13, 2023

Uh oh!

zpcore commented Dec 13, 2023

Uh oh!

frgossen commented Dec 13, 2023

Uh oh!

zpcore left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants