Track the accuracy regress for HF with max-autotune enabled

### 🐛 Describe the bug

HF accuracy check starts to regress [link](https://hud.pytorch.org/benchmark/compilers?startTime=Sun%2C%2020%20Aug%202023%2023%3A46%3A52%20GMT&stopTime=Tue%2C%2019%20Sep%202023%2023%3A46%3A52%20GMT&granularity=day&suite=huggingface&mode=training&dtype=amp&lBranch=main&lCommit=d8da2a7c8523015740ee8a73b7bed54f4b1ea70a&rBranch=main&rCommit=1b3dc05c3e703841e64e0277d473a0baf3296671)

Repro command:
```
TORCHINDUCTOR_MAX_AUTOTUNE=1 CUDA_VISIBLE_DEVICES=3 python benchmarks/dynamo/huggingface.py --backend inductor --amp --accuracy --only PLBartForCausalLM --training --cold-start-latency
```

Here are the things I have tried:
- try to build pytorch and run the repro on both the old commit (1b3dc05c3e) and new commit (d8da2a7c85). Both fail the accuracy check
- try to rollback triton to older pin and run the repro on pytorch commit 1b3dc05c3e. Accuracy check fail
- try to rollback huggingface to the older pin and run the repro on pytorch commit 1b3dc05c3e. Accuracy check pass.

The cause should be huggingface upgrade.
This log may be the reason 'WARNING:common:fp64 golden ref were not generated for PLBartForCausalLM. Setting accuracy check to cosine'

### Error logs

_No response_

### Minified repro

_No response_

### Versions

x

cc @ezyang @msaroufim @wconstab @bdhirsh @anijain2305

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Track the accuracy regress for HF with max-autotune enabled #109736

🐛 Describe the bug

Error logs

Minified repro

Versions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Track the accuracy regress for HF with max-autotune enabled #109736

Description

🐛 Describe the bug

Error logs

Minified repro

Versions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions