
Fix deep-gemm alignment: per-group alignment -> output alignment #35

Merged
MasterJH5574 merged 1 commit into mlc-ai:main from haok1402:0421-fix-deepgemm-alignment on Apr 28, 2026

Conversation

@haok1402 (Collaborator)

Drop redundant 1024-row over-rounding; per-group 128-alignment already covers FP8 grouped-GEMM tile requirements.
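The arithmetic behind this change can be sketched as follows. This is an illustrative example, not the actual `token_scatter.py` code: the helper `align_up`, the constant names `PER_GROUP_ALIGNMENT`, and the sample `group_sizes` are assumptions; only `_GEMM_ALLOC_ALIGNMENT` and `actual_M` are names taken from the PR text.

```python
# Sketch of why the 1024-row over-rounding is redundant, assuming each
# expert group is already padded to a multiple of 128 rows as the PR states.

def align_up(n: int, alignment: int) -> int:
    """Round n up to the nearest multiple of alignment."""
    return (n + alignment - 1) // alignment * alignment

PER_GROUP_ALIGNMENT = 128     # per-group tile requirement for FP8 grouped GEMM
_GEMM_ALLOC_ALIGNMENT = 1024  # old output-level over-rounding, now dropped

group_sizes = [37, 200, 5]    # hypothetical per-expert token counts

# Each group is padded to a multiple of 128; actual_M is their sum.
aligned_group_sizes = [align_up(g, PER_GROUP_ALIGNMENT) for g in group_sizes]
actual_M = sum(aligned_group_sizes)  # 128 + 256 + 128 = 512

# Old behaviour: additionally round the output slice up to 1024 rows,
# leaving extra rows beyond the sum of the group sizes.
old_M = align_up(actual_M, _GEMM_ALLOC_ALIGNMENT)  # 1024

# New behaviour: slice the output directly to actual_M, so the row count
# exactly equals the sum of the already-aligned group sizes, which is what
# the FP8 kernels expect.
assert actual_M % PER_GROUP_ALIGNMENT == 0
print(actual_M, old_M)  # 512 1024
```

With the old rounding, the output slice could contain up to 1023 padding rows that no group accounts for; slicing to `actual_M` removes that mismatch.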


@gemini-code-assist (Bot) left a comment

Code Review

This pull request simplifies token slicing and removes redundant truncation logic. In gpt_oss.py, the manual truncation of input tensors based on group sizes was removed. In token_scatter.py, the rounding of the output token slice up to _GEMM_ALLOC_ALIGNMENT was replaced with a direct slice to actual_M, since the group sizes are already guaranteed to be correctly aligned. This ensures compatibility with DeepGEMM's FP8 kernels, which require the data row count to exactly match the sum of the group sizes. No review comments were submitted.

@MasterJH5574 MasterJH5574 merged commit 23db182 into mlc-ai:main Apr 28, 2026
1 check passed
