Skip to content

[inductor][cpu]opacus_cifar10 fp32 static shape default wrapper multiple threads performance regression #117744

@zxd1997066

Description

@zxd1997066

🐛 Describe the bug

name batch_size_new speed_up_new inductor_new eager_new compilation_latency_new batch_size_old speed_up_old inductor_old eager_old compilation_latency_old Ratio Speedup(New/old) Eager Ratio(old/new) Inductor Ratio(old/new) Compilation_latency_Ratio(old/new)
opacus_cifar10 64 0.613604 0.011064978 0.006789514760712 59.939124 64 0.739715 0.009038787 0.006686126325705 27.589113 0.83 0.98 0.82 0.46

SW info

SW Nightly commit Main commit
Pytorch ea7e229
Torchbench / 770d5cf7
torchaudio dc83b38
torchtext c11d758
torchvision 58366ab
torchdata 1d231d1
dynamo_benchmarks f228c8b /

Repro:
bash inductor_single_test.sh multiple inference performance torchbench opacus_cifar10 float32 first static default 0
Suspected guilty commit: 9862c71
torchbench-opacus_cifar10-inference-float32-static-default-performance-multiple-drop_guilty_commit.log
cc @ezyang @msaroufim @bdhirsh @anijain2305 @zou3519 @WeizhuoZhang-intel @chuanqi129

Metadata

Metadata

Labels

oncall: cpu inductorCPU Inductor issues for Intel team to triageoncall: pt2triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions