[Tutorial] Add Blackwell matmul V4 tutorial (tile rasterization) #129
Merged
yaoyaoding merged 2 commits into main on Apr 15, 2026
yaoyaoding (Member) commented on Apr 15, 2026:
- Tile rasterization: educational explanation with concrete working set example (8x8 grid, wave=16), SVG diagram, formulation, fast_divmod hint
- Pipeline abstraction: async pipeline mindset with producer/consumer/ring buffer, SVG diagram showing 5-stage pipeline state, Pipeline class API
- TMA epilogue: motivation (register/smem pressure), dataflow SVG, step-by-step instruction sequence with completion mechanism note
- Add tilus.Class API page with docstring
- Add comments to matmul_v4.py for new instructions/optimizations
- Forward reference from V3 to V4
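To make the tile rasterization item concrete, here is a minimal host-side sketch of the working-set effect for the 8x8 grid with a wave of 16 tiles mentioned above. It is not the code from `matmul_v4.py`: the function names (`row_major`, `rasterized`, `working_set`) and the group height `group_m=4` are hypothetical, chosen only for illustration, and plain Python `divmod` stands in for the `fast_divmod` that the tutorial hints at for device code.

```python
def row_major(tile_id, grid_n):
    """Baseline order: consecutive tile ids walk along one row of output tiles."""
    return divmod(tile_id, grid_n)  # (tile_m, tile_n)


def rasterized(tile_id, grid_m, grid_n, group_m):
    """Grouped order (assumed scheme): fill a band of group_m rows column by column."""
    tiles_per_group = group_m * grid_n
    group_id, local = divmod(tile_id, tiles_per_group)
    first_m = group_id * group_m
    rows = min(group_m, grid_m - first_m)  # the last band may be shorter
    tile_n, m_in_group = divmod(local, rows)
    return first_m + m_in_group, tile_n


def working_set(order, wave_idx, wave):
    """Count the distinct A row-strips plus B column-strips touched by one wave."""
    tiles = [order(t) for t in range(wave_idx * wave, (wave_idx + 1) * wave)]
    rows = {m for m, _ in tiles}
    cols = {n for _, n in tiles}
    return len(rows) + len(cols)


GRID_M, GRID_N, WAVE, GROUP_M = 8, 8, 16, 4  # GROUP_M is an illustrative choice

baseline = working_set(lambda t: row_major(t, GRID_N), 0, WAVE)
grouped = working_set(lambda t: rasterized(t, GRID_M, GRID_N, GROUP_M), 0, WAVE)

print(baseline)  # 2 rows of A + 8 columns of B = 10 strips
print(grouped)   # 4 rows of A + 4 columns of B = 8 strips
```

Under these assumptions the first wave touches 10 operand strips in row-major order and 8 in the grouped order, so more of each wave's loads can be served from data already fetched by a neighbouring tile. The group height that works best in practice depends on the problem shape and the cache size.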
…pilogue)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>