Skip to content

Conversation

dhonnappa-amd
Copy link

Cherry-pick of #2597

cherry-pick of pytorch#161700

Our compiler is generating inefficient code for the offsetCalc in
certain situations. The root-cause for this needs to be identified. For
now specialized unrolling based on 'dims' notably helps perf.

Fixes SWDEV-545713, SWDEV-545710
@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Sep 3, 2025

Jenkins build for 33d172bf9c9078514b47a93c32b42d14d50859ad commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@jerrymannil jerrymannil marked this pull request as ready for review September 3, 2025 19:47
@jerrymannil jerrymannil self-assigned this Sep 3, 2025
@jerrymannil jerrymannil merged commit 9ea02c4 into release/2.8 Sep 3, 2025
1 of 3 checks passed
@jerrymannil jerrymannil deleted the autogenerated/release/2.8_cherry-pick_pr-2597 branch September 3, 2025 19:48
pragupta pushed a commit that referenced this pull request Oct 8, 2025
…2598)

Cherry-pick of #2597

Co-authored-by: Jerry Mannil <65309407+jerrymannil@users.noreply.github.com>
(cherry picked from commit 9ea02c4)
jithunnair-amd pushed a commit that referenced this pull request Oct 10, 2025
…2598)

Cherry-pick of #2597

Co-authored-by: Jerry Mannil <65309407+jerrymannil@users.noreply.github.com>
(cherry picked from commit 9ea02c4)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants