Fix SHOW_ITERATIONS table truncation with multiple transfers by nileshnegi · Pull Request #271 · ROCm/TransferBench

nileshnegi · 2026-04-27T03:14:19Z

Motivation

SHOW_ITERATIONS output malformation when executing multiple transfers/using wildcard for transfer patterns

Technical Details

numRows budget counted (numTimedIterations + 1) once per executor, but the render loop emits that many rows per transfer. With N transfers, the table ran out of pre-allocated rows after the first one or two, silently dropping the rest.

Test Plan

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

…utor numRows budget counted (numTimedIterations + 1) once per executor, but the render loop emits that many rows per transfer. With N transfers the table ran out of pre-allocated rows after the first one or two, silently dropping the rest. Co-authored-by: Claude <claude@anthropic.com>

Copilot

Pull request overview

Fixes SHOW_ITERATIONS table truncation when multiple transfers are executed (e.g., wildcard transfer patterns) by correctly budgeting table rows per transfer rather than per executor.

Changes:

Adjust row pre-allocation for the SHOW_ITERATIONS render path to account for (numTimedIterations + 1) rows per transfer per executor.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

- Initial pod communication support (#235) - cuda + MNNVL update & pod presets (#241) - Increase CQ size for high qps (#244) - fix hang when NVML is present but fabricmanager isnt (#246) - Adding nica2a preset (#248) - Adding HBM read bandwidth preset (#250) - Pod Ring preset (#251) - gfxsweep preset (#254) (#256) - Adding Batched DMA support (hipMemcpyBatchAsync), and bmasweep preset (#255) - Adding a wallclock consistency detection preset (#258) - Adding smoketest preset for simple correctness tests (#266) - Help / envvars / presets presets (#267) - Modernize CMake build (#268) - Replace version-based pod/amd-smi detection with compile-time API probes (#269) - Fix collective mismatch hangs in multi-rank error paths (#270) - Fix SHOW_ITERATIONS table truncation with multiple transfers per executor (#271) - Reformat a2asweep output to match gfxsweep style (#272) - Gfx sweep update (#274) - Increasing flush frequency in smoketest (#275) - Adding new experimental copy-only GFX kernel, gfxsweep update (#277) - Fixes for cuMem compilation and invalid device ordinal (#278) - Simplifying socket connect, allow for using host address (#279) - Updating podring to run on single node without need to force single pod (#280) - Adding SHOW_PERCENTILES to show extra per-iteration statistics (#281) --------- Co-authored-by: AtlantaPepsi <timhu102@gmail.com> Co-authored-by: Pak Nin Lui <pak.lui@amd.com> Co-authored-by: pierreantoineH <PierreAntoine.Harraud@amd.com> Co-authored-by: Nilesh M Negi <Nilesh.Negi@amd.com> Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

- Initial pod communication support (#235) - cuda + MNNVL update & pod presets (#241) - Increase CQ size for high qps (#244) - fix hang when NVML is present but fabricmanager isnt (#246) - Adding nica2a preset (#248) - Adding HBM read bandwidth preset (#250) - Pod Ring preset (#251) - gfxsweep preset (#254) (#256) - Adding Batched DMA support (hipMemcpyBatchAsync), and bmasweep preset (#255) - Adding a wallclock consistency detection preset (#258) - Adding smoketest preset for simple correctness tests (#266) - Help / envvars / presets presets (#267) - Modernize CMake build (#268) - Replace version-based pod/amd-smi detection with compile-time API probes (#269) - Fix collective mismatch hangs in multi-rank error paths (#270) - Fix SHOW_ITERATIONS table truncation with multiple transfers per executor (#271) - Reformat a2asweep output to match gfxsweep style (#272) - Gfx sweep update (#274) - Increasing flush frequency in smoketest (#275) - Adding new experimental copy-only GFX kernel, gfxsweep update (#277) - Fixes for cuMem compilation and invalid device ordinal (#278) - Simplifying socket connect, allow for using host address (#279) - Updating podring to run on single node without need to force single pod (#280) - Adding SHOW_PERCENTILES to show extra per-iteration statistics (#281) --------- Co-authored-by: Tim <43156029+AtlantaPepsi@users.noreply.github.com> Co-authored-by: Pak Nin Lui <pak.lui@amd.com> Co-authored-by: pierreantoineH <PierreAntoine.Harraud@amd.com> Co-authored-by: Nilesh M Negi <Nilesh.Negi@amd.com> Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings April 27, 2026 03:14

nileshnegi requested a review from a team as a code owner April 27, 2026 03:14

gilbertlee-amd approved these changes Apr 27, 2026

View reviewed changes

Copilot started reviewing on behalf of nileshnegi April 27, 2026 03:16 View session

Copilot AI reviewed Apr 27, 2026

View reviewed changes

nileshnegi merged commit 9c6c0e1 into candidate Apr 27, 2026
8 checks passed

nileshnegi deleted the users/nileshnegi/fix/multi-transfer-show-iterations branch April 27, 2026 04:24

nileshnegi mentioned this pull request Apr 27, 2026

TransferBench v1.67.0 #273

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix SHOW_ITERATIONS table truncation with multiple transfers#271

Fix SHOW_ITERATIONS table truncation with multiple transfers#271
nileshnegi merged 1 commit intocandidatefrom
users/nileshnegi/fix/multi-transfer-show-iterations

nileshnegi commented Apr 27, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

nileshnegi commented Apr 27, 2026

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants