Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update CI docker container to use latest cudnn
#1362 opened Jul 31, 2025 by yzh119 Loading…
5 tasks
[Draft] Update autotune results
#1361 opened Jul 31, 2025 by kaixih Loading…
support trtllm-gen prefill fp4 output
#1360 opened Jul 31, 2025 by weireweire Loading…
3 of 5 tasks
hotfix: update mxfp4 groupwise-scaled gemm unittests
#1359 opened Jul 31, 2025 by yzh119 Loading…
5 tasks
feature: add fp4 mm using trtllm backend
#1355 opened Jul 30, 2025 by ttyio Loading…
4 of 5 tasks
feat: Fused rope fp8 quantize kernel for MLA
#1339 opened Jul 28, 2025 by yzh119 Loading…
5 tasks
[WIP]: Masked layout fp4 gemm using cute-dsl
#1331 opened Jul 25, 2025 by yzh119 Draft
5 tasks
refactor: Improved metainfo for trtllm-gen kernels
#1328 opened Jul 25, 2025 by cyx-6 Loading…
5 tasks
Add moe benchmark routine
#1327 opened Jul 25, 2025 by aleozlx Draft
3 of 5 tasks
Add k_scale and v_scale to persistent attention
#1322 opened Jul 24, 2025 by Edenzzzz Loading…
5 tasks
Allow cudnn prefill kernels to be called natively
#1317 opened Jul 24, 2025 by Anerudhan Draft
5 tasks done
Wrap cudnn backend to unified interface
#1312 opened Jul 23, 2025 by cyx-6 Loading…
5 tasks
Api regression test for trtllmgen fp8 moe
#1308 opened Jul 23, 2025 by aleozlx Loading…
5 tasks done
fix: a workaround to make fp8 kv-cache work for prefill
#1304 opened Jul 22, 2025 by chenyang78 Loading…
2 tasks
3rparty: upgrade cutlass dependency to v4.1.0
#1299 opened Jul 22, 2025 by yzh119 Loading…
5 tasks
add mm_fp4 use cutlass backend for large bs
#1296 opened Jul 21, 2025 by ttyio Loading…
5 tasks done
ci: add github actions to upload sdist to pypi
#1270 opened Jul 16, 2025 by yzh119 Loading…
5 tasks
feat(aot): add nvshmem module for aot compilation
#1261 opened Jul 15, 2025 by EmilienM Loading…
3 of 5 tasks
refactor: separate SM100 and legacy TRT-LLM comm modules
#1259 opened Jul 15, 2025 by EmilienM Loading…
3 of 5 tasks
ProTip! Filter pull requests by the default branch with base:main.