fp32 training failures in timm_models after enabling optimizer

The problem appeared after https://github.com/pytorch/pytorch/pull/90956.

Repro:
```
python benchmarks/dynamo/timm_models.py --accuracy --device cuda --backend aot_eager --float32  --training --only   volo_d1_224
```
The failure is consistent for volo_d1_224.

```
for i in {1..10}; do python benchmarks/dynamo/timm_models.py --accuracy --device cuda --backend aot_eager --float32  --training --only   fbnetv3_b; done
```
The failure is random for fbnetv3_b.


cc @ezyang @soumith @msaroufim @wconstab @ngimel @bdhirsh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fp32 training failures in timm_models after enabling optimizer #93490

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

fp32 training failures in timm_models after enabling optimizer #93490

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions