@naromero77amd naromero77amd commented Sep 15, 2025

This config improves the performance of a 1D pointwise kernel by 20% as measured on MI350.

@naromero77amd naromero77amd marked this pull request as draft September 15, 2025 23:50

rocm-repo-management-api bot commented Sep 15, 2025

Jenkins build for de203dad7b730af9f0c1c2e0216ad7653007f01d commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@jataylo jataylo marked this pull request as ready for review September 16, 2025 17:54
@naromero77amd naromero77amd changed the title from "[WIP][ROCm] Additional pointwise tunings" to "[inductor][ROCm] Additional pointwise tunings" Sep 16, 2025
@naromero77amd (Author)

@jataylo For pointwise, there are no other tunings. Once you merge it in, I will create another PR for the _inductor ROCm internal branch.

@naromero77amd naromero77amd changed the title from "[inductor][ROCm] Additional pointwise tunings" to "[ROCm][inductor] Additional pointwise tunings" Sep 17, 2025
@naromero77amd naromero77amd merged commit a7bac0a into iupaikov_perf_tuning_rocm71 Sep 17, 2025
1 of 3 checks passed
@naromero77amd naromero77amd deleted the iupaikov_perf_tuning_rocm71_additional_pointwise_tunings branch September 17, 2025 20:30
@naromero77amd (Author)

!cherry-pick onto rocm7.1_internal_testing_inductor

naromero77amd added a commit that referenced this pull request Sep 17, 2025
This config improves the performance of a 1D pointwise kernel by 20% as
measured on MI350.

(cherry picked from commit a7bac0a)
naromero77amd added a commit that referenced this pull request Sep 18, 2025
This config improves the performance of a 1D pointwise kernel by 20% as
measured on MI350.

(cherry picked from commit a7bac0a)
jataylo pushed a commit that referenced this pull request Sep 19, 2025
This config improves the performance of a 1D pointwise kernel by 20% as
measured on MI350.

(cherry picked from commit a7bac0a)
jataylo pushed a commit that referenced this pull request Sep 19, 2025
…ise tunings (#2653)

This config improves the performance of a 1D pointwise kernel by 20% as
measured on MI350.

(cherry picked from commit a7bac0a)

Duplicate of this #2642
pytorchmergebot pushed a commit that referenced this pull request Sep 26, 2025
This config improves the performance of a 1D pointwise kernel by 20% as
measured on MI350.

(cherry picked from commit a7bac0a)