Use arch when writing to tuningDB, perfRunner looks for arch by umangyadav · Pull Request #2259 · ROCm/rocMLIR

umangyadav · 2026-02-27T15:41:38Z

Motivation

The tuning DB has a column named "arch", but tuningRunner currently writes the "chip" value into it.

rocMLIR/mlir/utils/performance/tuningRunner.py

Line 935 in 88adf76

self.options.chip,

Arch is the full target name, e.g. gfx950:sramecc+:xnack-.
Chip is only the base name, e.g. gfx950.

perfRunner looks up entries by "arch":

rocMLIR/mlir/utils/performance/perfRunner.py

Line 1812 in 88adf76

if (arch, num_cu, num_chiplets, config_str) in tuning_db:

When perfRunner cannot find a matching entry, it sets TFlops to NaN.
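The mismatch can be illustrated with a minimal sketch. It only mirrors the tuple-keyed lookup quoted above; `lookup_tflops`, the compute-unit and chiplet counts, the config string, and the stored value are all hypothetical placeholders, not taken from perfRunner.py.

```python
import math


def lookup_tflops(tuning_db, arch, num_cu, num_chiplets, config_str):
    """Return the stored value for the key, or NaN when the key is missing.

    Hypothetical helper: it only imitates the membership test quoted above.
    """
    key = (arch, num_cu, num_chiplets, config_str)
    if key in tuning_db:
        return tuning_db[key]
    return math.nan


full_arch = "gfx950:sramecc+:xnack-"
chip = full_arch.split(":")[0]  # "gfx950"

# Placeholder values, chosen only for illustration.
num_cu, num_chiplets, config_str = 256, 8, "example-config"

# Before the fix: the "arch" slot of the key holds the chip name.
db_with_chip = {(chip, num_cu, num_chiplets, config_str): 123.0}
# After the fix: the "arch" slot holds the full arch string.
db_with_arch = {(full_arch, num_cu, num_chiplets, config_str): 123.0}

miss = lookup_tflops(db_with_chip, full_arch, num_cu, num_chiplets, config_str)
hit = lookup_tflops(db_with_arch, full_arch, num_cu, num_chiplets, config_str)

print(math.isnan(miss), hit)
```

Because the key is compared as a whole tuple, a difference in the first element alone is enough to make every lookup miss.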

Technical Details

Make tuningRunner store "arch" instead of "chip".

Test Plan

Steps to reproduce:

python3 ./bin/tuningRunner.py --operation gemm -c ../mlir/utils/performance/configs/tier1-gemm-configs --tuning-space=quick -o gemm_quick.tsv | python3 ./bin/perfRunner.py --operation gemm -c ../mlir/utils/performance/configs/tier1-gemm-configs -t gemm_quick.tsv -o gemm.csv --batch_mlir

Before this PR, this produces NaNs.

After this PR, running the same commands produces correct TFlops values.
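A quick way to confirm the result is to count NaN entries in the output CSV. The sketch below is a generic helper, not part of the repository; the column name `TFlops` and the sample rows are assumptions, so adjust them to whatever header gemm.csv actually contains.

```python
import csv
import io
import math


def count_nan_rows(csv_text, column):
    """Count rows whose value in `column` parses to NaN."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if column not in (reader.fieldnames or []):
        raise KeyError(f"column {column!r} not found in CSV header")
    nan_rows = 0
    for row in reader:
        try:
            value = float(row[column])
        except (TypeError, ValueError):
            # Skip cells that are empty or not numeric.
            continue
        if math.isnan(value):
            nan_rows += 1
    return nan_rows


# Made-up sample data, for illustration only.
sample = "config,TFlops\ncfg_a,NaN\ncfg_b,41.5\n"
nan_count = count_nan_rows(sample, "TFlops")
print(nan_count)
```

To check a real run, read the file contents first, e.g. `count_nan_rows(open("gemm.csv").read(), "TFlops")`, again assuming that column name.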

Copilot

Pull request overview

This PR fixes a bug where tuningRunner was writing the chip name (e.g., "gfx950") to the tuning database instead of the full architecture string (e.g., "gfx950:sramecc+:xnack-"). This caused perfRunner to fail to find matching entries when looking up tuning configurations, resulting in NaN TFlops values.

Changes:

Changed tuningRunner to write arch instead of chip to the tuning database, aligning with the database schema and perfRunner's lookup expectations

Co-authored-by: Mirza Halilčević <109971222+mirza-halilcevic@users.noreply.github.com>

Automated weekly review of merged PRs #2234 #2240 #2248 #2249 #2251 #2254 #2257 #2258 #2259 #2270 #2271. Identifies 6 areas with weak test coverage and meaningful business risk:

1. ConcurrentQueue (no unit tests, multi-threaded, silent deadlock risk)
2. parse_tuning_db_line / read_tuning_db key schema change (no Python tests)
3. BooleanElementwiseConverter missing f16/unsigned dtype coverage
4. Attention MaxNumFOp vs MaximumFOp NaN correctness (no dedicated test)
5. firstCausalMaskIter off-by-one risk (no non-trivial shape test)
6. Sliding window attention edge cases (windowSize=0/>=seqLen/unaligned)

The GitHub discussion API returned FORBIDDEN (read-only CI token); analysis committed here as a permanent record.

Co-authored-by: Djordje Antic <djordje.antic@amd.com>

Use arch when writing to tuningDB, perfRunner looks for arch

beda90d

umangyadav requested a review from causten as a code owner February 27, 2026 15:41

umangyadav requested review from Copilot, dhernandez0, justinrosner, mirza-halilcevic and pabloantoniom and removed request for causten, Copilot and pabloantoniom February 27, 2026 15:41

umangyadav self-assigned this Feb 27, 2026

Copilot started reviewing on behalf of umangyadav February 27, 2026 15:43 View session

dhernandez0 approved these changes Feb 27, 2026

justinrosner approved these changes Feb 27, 2026

Copilot AI reviewed Feb 27, 2026

dorde-antic approved these changes Feb 27, 2026

Mr-Anyone approved these changes Feb 27, 2026

Merge branch 'develop' into fixPerfRunner

92d9d52

mirza-halilcevic approved these changes Feb 27, 2026

Comment thread mlir/utils/performance/tuningRunner.py Outdated

Comment thread mlir/utils/performance/tuningRunner.py Outdated

umangyadav and others added 4 commits February 27, 2026 15:29

Update mlir/utils/performance/tuningRunner.py

f6d050e

Co-authored-by: Mirza Halilčević <109971222+mirza-halilcevic@users.noreply.github.com>

Update mlir/utils/performance/tuningRunner.py

bc8dc78

Co-authored-by: Mirza Halilčević <109971222+mirza-halilcevic@users.noreply.github.com>

Merge branch 'develop' into fixPerfRunner

bcb1236

Merge branch 'develop' into fixPerfRunner

b684bb3

umangyadav merged commit 11d5c9d into develop Mar 3, 2026
7 of 14 checks passed

umangyadav deleted the fixPerfRunner branch March 3, 2026 15:45

umangyadav merged 6 commits into develop from fixPerfRunner

umangyadav commented Feb 27, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

umangyadav commented Feb 27, 2026

Motivation

Technical Details

Test Plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants