-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[ROCm] Use triton vision attention to replace FA3 on AMD GPU
#8656
opened Aug 1, 2025 by
zhangnju
Loading…
6 tasks
fix(cache): Respect sliding_window_size in SWAChunkCache eviction
#8655
opened Aug 1, 2025 by
ppraneth
Loading…
6 tasks
Support page first layout zero copy for mooncake store
#8651
opened Aug 1, 2025 by
huangtingwei9988
Loading…
7 tasks
disable tp for shared experts when enable ep moe for GLM4.5 model
#8647
opened Aug 1, 2025 by
zminglei
Loading…
6 tasks
[router] introduce dp worker abstraction
#8639
opened Jul 31, 2025 by
slin1237
Loading…
2 of 6 tasks
[router] Add Bearer Token Authentication API Support
router
#8637
opened Jul 31, 2025 by
slin1237
Loading…
3 of 6 tasks
Do layernorm before allgather for DP attention
#8631
opened Jul 31, 2025 by
trevor-m
Loading…
6 tasks
[PD metrics] Fix some uncompleted PD related metrics
#8627
opened Jul 31, 2025 by
acelyc111
Loading…
1 of 6 tasks
Use Tensor Core Decode when gqa group size >= 4
#8624
opened Jul 31, 2025 by
Edenzzzz
Loading…
6 tasks
[bugfix]: use correct cache location for cross attention in torch native backend
#8622
opened Jul 31, 2025 by
MahmoudAshraf97
Loading…
1 of 6 tasks
[Quantization] Supported w8a8 int8 quantized Gemma3 and Qwen-VL models
#8619
opened Jul 31, 2025 by
ichernob
Loading…
1 of 6 tasks
[bugfix] Add 'disaggregation_mode' parameter to warmup function when compile deep_gemm manually
#8618
opened Jul 31, 2025 by
lbh2001
Loading…
6 tasks
Support MHA with chunked prefix cache for flashinfer backend
#8616
opened Jul 31, 2025 by
xu-yfei
Loading…
1 of 6 tasks
[Feature] Support BurstGPT for server benchmark.
#8605
opened Jul 31, 2025 by
VincentXWD
Loading…
6 tasks
[bugfix] Remove the invalid initialization of weight_scale and input_scale for W4AFp8 EPMoE
#8597
opened Jul 31, 2025 by
huangzhilin-hzl
Loading…
6 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.