Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add fused top-k page construction kernels for DSA
#2215 opened Dec 13, 2025 by yzh119 Loading…
5 tasks
misc: support checks for gemm
#2214 opened Dec 13, 2025 by jimmyzho Loading…
5 tasks
cicd: Add sanity test script
#2212 opened Dec 12, 2025 by kahyunnam Loading…
5 tasks done
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx Draft
5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148 opened Nov 28, 2025 by nvpohanh Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2131 opened Nov 22, 2025 by katec846 Loading…
3 of 5 tasks
Port TRT-LLM communication kernels to flashinfer
#2102 opened Nov 18, 2025 by djns99 Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic Loading…
5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119 Loading…
5 tasks
add blockwise gemm cute dsl
#1922 opened Oct 13, 2025 by Amir-19 Loading…
5 tasks
ProTip! Exclude everything labeled bug with -label:bug.