[inductor][cpu]lennard_jones, pyhpc_equation_of_state and pyhpc_isoneutral_mixing performance regression  #121882

@zxd1997066

🐛 Describe the bug

fp32 static shape default wrapper regression in 2024-03-10

| suite | name | thread | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old | inductor_old | eager_old | compilation_latency_old | Ratio Speedup(New/old) | Eager Ratio(old/new) | Inductor Ratio(old/new) | Compilation_latency_Ratio(old/new) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| torchbench | lennard_jones | single | 1 | 1.442252 | 4.8045e-05 | 6.929299734e-05 | 5.983722 | 1.0 | 1.833953 | 3.8604e-05 | 7.079792161199999e-05 | 4.163693 | 0.79 | 1.02 | 0.8 | 0.7 |
| torchbench | pyhpc_equation_of_state | single | 1 | 18.71236 | 5.3968e-05 | 0.00100986864448 | 8.857691 | 1.0 | 23.188102 | 4.51e-05 | 0.0010457834002 | 6.913662 | 0.81 | 1.04 | 0.84 | 0.78 |
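
The ratio columns can be cross-checked against the raw timings. The following is a minimal sketch, assuming the column meanings implied by the header names (`speed_up` is `eager / inductor`, and each ratio is the quotient named in its header). These definitions are inferred from the table, not taken from the benchmark harness, and the helper names `speedup` and `ratios` are hypothetical. The inputs are the lennard_jones values from the fp32 static shape default wrapper run.

```python
# Values copied from the lennard_jones row (fp32, static shape, default wrapper).
new = {"inductor": 4.8045e-05, "eager": 6.929299734e-05, "compile": 5.983722}
old = {"inductor": 3.8604e-05, "eager": 7.079792161199999e-05, "compile": 4.163693}


def speedup(run):
    """Assumed definition of the speed_up column: eager time over inductor time."""
    return run["eager"] / run["inductor"]


def ratios(new, old):
    """Recompute the four ratio columns from the raw measurements."""
    return {
        "speedup_new_over_old": speedup(new) / speedup(old),
        "eager_old_over_new": old["eager"] / new["eager"],
        "inductor_old_over_new": old["inductor"] / new["inductor"],
        "compilation_latency_old_over_new": old["compile"] / new["compile"],
    }


# Round to two decimals, matching the precision used in the report.
result = {key: round(value, 2) for key, value in ratios(new, old).items()}
print(result)
```

Under these assumptions the recomputed values round to the 0.79, 1.02, 0.8 and 0.7 reported in that row. Eager time is essentially unchanged, so the drop in speedup comes from the slower inductor run.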

fp32 dynamic shape default wrapper regression in 2024-03-10

| suite | name | thread | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old | inductor_old | eager_old | compilation_latency_old | Ratio Speedup(New/old) | Eager Ratio(old/new) | Inductor Ratio(old/new) | Compilation_latency_Ratio(old/new) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| torchbench | lennard_jones | single | 1 | 1.431658 | 4.8693e-05 | 6.9711722994e-05 | 6.040358 | 1.0 | 1.775623 | 3.9242e-05 | 6.9678997766e-05 | 4.229882 | 0.81 | 1.0 | 0.81 | 0.7 |
| torchbench | pyhpc_equation_of_state | single | 1 | 18.8031 | 5.4025e-05 | 0.0010158374775 | 8.885931 | 1.0 | 23.240634 | 4.4543e-05 | 0.001035207560262 | 6.99299 | 0.81 | 1.02 | 0.82 | 0.79 |

fp32 static shape cpp wrapper regression in 2024-03-10

| suite | name | thread | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old | inductor_old | eager_old | compilation_latency_old | Ratio Speedup(New/old) | Eager Ratio(old/new) | Inductor Ratio(old/new) | Compilation_latency_Ratio(old/new) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| torchbench | lennard_jones | single | 1 | 1.810503 | 3.8294e-05 | 6.9331401882e-05 | 14.183242 | 1.0 | 2.379847 | 2.8666e-05 | 6.822069410199999e-05 | 12.376875 | 0.76 | 0.98 | 0.75 | 0.87 |
| torchbench | pyhpc_equation_of_state | single | 1 | 23.197532 | 4.4514e-05 | 0.001032614939448 | 17.09731 | 1.0 | 29.022945 | 3.5698e-05 | 0.00103606109061 | 15.172577 | 0.8 | 1.0 | 0.8 | 0.89 |

fp32 dynamic shape cpp wrapper regression in 2024-03-10

| suite | name | thread | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old | inductor_old | eager_old | compilation_latency_old | Ratio Speedup(New/old) | Eager Ratio(old/new) | Inductor Ratio(old/new) | Compilation_latency_Ratio(old/new) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| torchbench | lennard_jones | single | 1 | 1.76331 | 3.8149000000000004e-05 | 6.726851319e-05 | 14.189571 | 1.0 | 2.273742 | 2.8757000000000003e-05 | 6.5385998694e-05 | 12.387977 | 0.78 | 0.97 | 0.75 | 0.87 |
| torchbench | pyhpc_equation_of_state | single | 1 | 22.763 | 4.464e-05 | 0.00101614032 | 17.139039 | 1.0 | 31.649198 | 3.4811e-05 | 0.001101740231578 | 15.198502 | 0.72 | 1.08 | 0.78 | 0.89 |
| torchbench | pyhpc_isoneutral_mixing | single | 1 | 47.525991 | 5.6787e-05 | 0.0026988584509169996 | 20.419413 | 1.0 | 54.58171 | 4.9508e-05 | 0.0027022312986800003 | 18.502454 | 0.87 | 1.0 | 0.87 | 0.91 |

SW info

| name | target_branch | target_commit | refer_branch | refer_commit |
| --- | --- | --- | --- | --- |
| torchbench | main | 1ef0a39e | main | ff42d907 |
| torch | main | 41286f1 | main | 581fe26 |
| torchvision | main | 0.18.0a0+2c127da | main | 0.18.0a0+2c127da |
| torchtext | main | 0.16.0a0+b0ebddc | main | 0.16.0a0+b0ebddc |
| torchaudio | main | 2.2.0a0+87aeb55 | main | 2.2.0a0+87aeb55 |
| torchdata | main | 0.7.1a0+0790338 | main | 0.7.1a0+0790338 |
| dynamo_benchmarks | main | nightly | main | nightly |

Repro:
Script: inductor_single_run.sh
Command: bash inductor_single_run.sh thread inference performance torchbench model float32/amp first static/dynamic default/cpp
Log: torchbench-lennard_jones-inference-float32-static-default-single-performance-drop_guilty_commit.log
Suspected guilty commit: a7e93c3
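
The repro command is given as a template with slash-separated choices (`float32/amp`, `static/dynamic`, `default/cpp`) and the placeholders `thread` and `model`. The sketch below fills it in for one configuration. It assumes that `thread` takes the value `single` shown in the tables and that `model` is the benchmark name, both inferred from the attached log file name rather than from the script itself. The helper `build_repro_command` is hypothetical and only assembles the argument list; it does not run anything.

```python
import shlex

# Choices taken from the slash-separated alternatives in the command template.
PRECISIONS = {"float32", "amp"}
SHAPES = {"static", "dynamic"}
WRAPPERS = {"default", "cpp"}


def build_repro_command(thread, model, precision, shape, wrapper):
    """Assemble the positional arguments in the order given by the template.

    Hypothetical helper: the meaning of each position is inferred from the
    template and the log file name, not from inductor_single_run.sh.
    """
    if precision not in PRECISIONS:
        raise ValueError(f"precision must be one of {sorted(PRECISIONS)}")
    if shape not in SHAPES:
        raise ValueError(f"shape must be one of {sorted(SHAPES)}")
    if wrapper not in WRAPPERS:
        raise ValueError(f"wrapper must be one of {sorted(WRAPPERS)}")
    return [
        "bash", "inductor_single_run.sh",
        thread, "inference", "performance", "torchbench",
        model, precision, "first", shape, wrapper,
    ]


# Configuration matching the attached lennard_jones log (assumed values).
cmd = build_repro_command("single", "lennard_jones", "float32", "static", "default")
print(shlex.join(cmd))
```

Changing the last two arguments to `dynamic` or `cpp` would cover the other three configurations reported above, again assuming the script accepts them positionally as the template suggests.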
cc @ezyang @msaroufim @bdhirsh @anijain2305 @zou3519 @chauhang @WeizhuoZhang-intel @chuanqi129
