Skip to content

StreamK SGPR Reduction#592

Merged
aliry95amd merged 7 commits intoROCm:developfrom
aliry95amd:StreamK_SGPR_MagicDivision
Jul 31, 2025
Merged

StreamK SGPR Reduction#592
aliry95amd merged 7 commits intoROCm:developfrom
aliry95amd:StreamK_SGPR_MagicDivision

Conversation

@aliry95amd
Copy link
Contributor

@aliry95amd aliry95amd commented Jul 10, 2025

This PR:

  • removes magic division in StreamK and frees up 6 related SGPRs
  • packs skGrid and skTile into one SGPR and free up 1 SGPR

@msujon-AMD
Copy link
Collaborator

Since it changes the ABI of assembly code, we have to change the custom kernel as well, right?

@aliry95amd
Copy link
Contributor Author

@msujon-AMD Yeah, that's true! I have been discussing it with Brian to see what our options are.
There is also another PR #520 that requires change to custom kernels.

Copy link
Contributor

@babakpst babakpst left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

until we fix custom kernels

ammallya pushed a commit that referenced this pull request Jul 22, 2025
* Update changelog and version number

[ROCm/hipSPARSE commit: 574e846]
@aliry95amd aliry95amd requested a review from a team as a code owner July 24, 2025 22:55
@aliry95amd aliry95amd changed the title Replace Magic Division Algorithm Used for StreamK StreamK SGPR Reduction Jul 24, 2025
@aliry95amd
Copy link
Contributor Author

aliry95amd commented Jul 25, 2025

GFX950 hipblaslt-test and tox results:

[==========] 20379 tests from 12 test suites ran. (2023285 ms total)
[  PASSED  ] 20379 tests.
hipBLASLt version: 100100
hipBLASLt git version: 63af362792
command line: ./hipblaslt-test 


=========== 87 passed, 74 skipped, 60 warnings in 6337.28s (1:45:37) ===========
___________________________________ summary ____________________________________
  py3: commands succeeded
  congratulations :)

@aliry95amd aliry95amd requested a review from babakpst July 25, 2025 18:51
@aliry95amd aliry95amd merged commit 58d4aff into ROCm:develop Jul 31, 2025
10 of 12 checks passed
assistant-librarian bot pushed a commit to ROCm/hipBLASLt that referenced this pull request Jul 31, 2025
StreamK SGPR Reduction

This PR:
- removes magic division in StreamK and frees up 6 related SGPRs
- packs skGrid and skTile into one SGPR and free up 1 SGPR
b-shi added a commit that referenced this pull request Aug 14, 2025
Bring in some SK related tile calculations from
#592. This change was missed.
Added tensilelite tests.
assistant-librarian bot pushed a commit to ROCm/hipBLASLt that referenced this pull request Aug 14, 2025
[hipblaslt] Fix typo in custom bbs tn kernel

Bring in some SK related tile calculations from
ROCm/rocm-libraries#592. This change was missed.
Added tensilelite tests.
@aliry95amd aliry95amd deleted the StreamK_SGPR_MagicDivision branch August 26, 2025 23:05
aferoz21 pushed a commit that referenced this pull request Sep 18, 2025
This PR:
- removes magic division in StreamK and frees up 6 related SGPRs
- packs skGrid and skTile into one SGPR and free up 1 SGPR

---------

Co-authored-by: b-shi <brianshi@amd.com>
bsyrowik pushed a commit that referenced this pull request Oct 27, 2025
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.22.0 to 1.25.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.25.0/CHANGELOG.md)
- [Commits](ROCm/rocm-docs-core@v1.22.0...v1.25.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-version: 1.25.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

[ROCm/rocWMMA commit: e94dab6]
bsyrowik pushed a commit that referenced this pull request Oct 27, 2025
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.22.0 to 1.25.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.25.0/CHANGELOG.md)
- [Commits](ROCm/rocm-docs-core@v1.22.0...v1.25.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-version: 1.25.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

[ROCm/rocWMMA commit: e94dab6]
ammallya pushed a commit that referenced this pull request Oct 27, 2025
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.22.0 to 1.25.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.25.0/CHANGELOG.md)
- [Commits](ROCm/rocm-docs-core@v1.22.0...v1.25.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-version: 1.25.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
ammallya pushed a commit that referenced this pull request Oct 28, 2025
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.22.0 to 1.25.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.25.0/CHANGELOG.md)
- [Commits](ROCm/rocm-docs-core@v1.22.0...v1.25.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-version: 1.25.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

[ROCm/rocwmma commit: e94dab6]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants