[Update] Add benchmark and autotune for group_gemm #26

Merged: xjmxyt merged 2 commits into main from jinmanx/dev on Jan 5, 2026

xjmxyt (Collaborator) commented Jan 2, 2026

Description

Previously, group_gemm had neither an autotuner nor benchmarks. This PR adds both.
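For readers unfamiliar with the idea, here is a minimal sketch of what autotuning a grouped GEMM involves: time the same operation under each candidate configuration and keep the fastest. It is not the code from this PR. The `group_gemm` reference function, the `autotune` helper, and the `tile_m` / `tile_n` parameters are all hypothetical names chosen for illustration, and the pure-Python loops stand in for a real GPU kernel.

```python
import time
from itertools import product


def group_gemm(groups, tile_m, tile_n):
    """Hypothetical reference grouped GEMM: multiply each (A, B) pair independently.

    tile_m and tile_n only change how the output is blocked, not the result.
    """
    outputs = []
    for a, b in groups:
        m, k, n = len(a), len(b), len(b[0])
        c = [[0.0] * n for _ in range(m)]
        # Walk the output in tile_m x tile_n blocks.
        for i0 in range(0, m, tile_m):
            for j0 in range(0, n, tile_n):
                for i in range(i0, min(i0 + tile_m, m)):
                    for j in range(j0, min(j0 + tile_n, n)):
                        c[i][j] = sum(a[i][p] * b[p][j] for p in range(k))
        outputs.append(c)
    return outputs


def autotune(fn, args, search_space, warmup=1, reps=3):
    """Hypothetical exhaustive autotuner: returns (best_config, mean_seconds)."""
    best_cfg, best_time = None, float("inf")
    for values in product(*search_space.values()):
        cfg = dict(zip(search_space.keys(), values))
        for _ in range(warmup):  # untimed warm-up runs
            fn(*args, **cfg)
        start = time.perf_counter()
        for _ in range(reps):
            fn(*args, **cfg)
        elapsed = (time.perf_counter() - start) / reps
        if elapsed < best_time:
            best_cfg, best_time = cfg, elapsed
    return best_cfg, best_time


# Two independent problems of different shapes form one "group".
groups = [
    ([[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0], [7.0, 8.0]]),  # 2x2 @ 2x2
    ([[1.0, 0.0, 2.0]], [[1.0], [2.0], [3.0]]),            # 1x3 @ 3x1
]
best_cfg, best_time = autotune(
    group_gemm, (groups,), {"tile_m": [1, 2], "tile_n": [1, 2]}
)
print(best_cfg, best_time)
```

Which configuration wins depends on the machine and the problem shapes, so the printed result will vary from run to run. A benchmark is the same timing loop reported for a fixed set of shapes rather than used to pick a configuration.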

CI Configuration

config:
  build: true
  # valid options are "ops" and "benchmark"
  test: ["ops", "benchmark"]

Checklist

  • Code formatted and imports sorted via repo specifications (./format.sh)
  • Documentation updated (if needed)
  • CI configuration reviewed

copy-pr-bot (bot) commented Jan 2, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

xjmxyt (Collaborator, Author) commented Jan 2, 2026

/ok to test f67f4e3

xjmxyt (Collaborator, Author) commented Jan 2, 2026

/ok to test 69d3bbd

xjmxyt requested a review from hannahli-nv on Jan 2, 2026, 09:02
xjmxyt changed the title from "[Update] Add benchmark and autotune for group_gemm" to "Draft: [Update] Add benchmark and autotune for group_gemm" on Jan 2, 2026

hannahli-nv (Collaborator) left a comment

LGTM, thx.

xjmxyt changed the title from "Draft: [Update] Add benchmark and autotune for group_gemm" to "[Update] Add benchmark and autotune for group_gemm" on Jan 5, 2026
xjmxyt merged commit 164e3e7 into main on Jan 5, 2026
10 checks passed
xjmxyt deleted the jinmanx/dev branch on Jan 5, 2026, 03:21