
Filter SM120 mixed 8-bit tiles for FP6 ElementD #3247

Open
zhils wants to merge 1 commit into NVIDIA:main from zhils:fix/sm120-fp6-manifest-tile-3211


@zhils zhils commented May 19, 2026

The epilogue builder requires CTA_N (RowMajor D) or CTA_M (ColumnMajor D) to be divisible by 128 when ElementD is e2m3 or e3m2. This PR filters tile_descriptions before emitting kernels, so tiles that violate the constraint are never generated, and adds regression tests.

Fixes #3211
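
For readers following along, the constraint described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code in this PR: `Tile`, `tile_supports_fp6_d`, and `filter_tile_descriptions` are hypothetical names, and the element types and layouts are plain strings rather than the generator's own types.

```python
from dataclasses import dataclass
from typing import List

# FP6 output types named in the PR description.
FP6_ELEMENT_D = {"e2m3", "e3m2"}
# Divisibility requirement stated in the PR description.
REQUIRED_MULTIPLE = 128


@dataclass(frozen=True)
class Tile:
    """Hypothetical stand-in for a tile description (CTA shape only)."""
    cta_m: int
    cta_n: int
    cta_k: int


def tile_supports_fp6_d(tile: Tile, element_d: str, layout_d: str) -> bool:
    """Return True if the tile satisfies the FP6 ElementD constraint.

    Non-FP6 output types are unaffected by the constraint.
    """
    if element_d not in FP6_ELEMENT_D:
        return True
    if layout_d == "RowMajor":
        extent = tile.cta_n   # RowMajor D: CTA_N must be a multiple of 128
    elif layout_d == "ColumnMajor":
        extent = tile.cta_m   # ColumnMajor D: CTA_M must be a multiple of 128
    else:
        raise ValueError(f"unexpected layout for D: {layout_d!r}")
    return extent % REQUIRED_MULTIPLE == 0


def filter_tile_descriptions(
    tiles: List[Tile], element_d: str, layout_d: str
) -> List[Tile]:
    """Keep only the tiles that may be emitted for this ElementD and layout."""
    return [t for t in tiles if tile_supports_fp6_d(t, element_d, layout_d)]
```

With tiles of shape 128x128x128, 128x64x128, and 64x128x128, this sketch drops the 128x64x128 tile for a RowMajor e2m3 output and the 64x128x128 tile for a ColumnMajor e3m2 output, and keeps all three for a non-FP6 output type.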

Co-authored-by: Cursor <cursoragent@cursor.com>
hwu36 (Collaborator) commented May 19, 2026

@depaulmillz, could you please review?


Development

Successfully merging this pull request may close these issues.

[BUG] build failed on spark caused by commit b46b16d003484063bca4ed365e44095c4c6ed633
