-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Docker] optimize dockerfile remove deepep and blackwell merge it to…
#7343
opened Jun 19, 2025 by
whybeyoung
Loading…
[BugFix] Fix AssertionError: res=<Response [502]>, res.text=''
#7341
opened Jun 19, 2025 by
gty111
Loading…
1 of 6 tasks
fix the abnormal GPU memory occupation of multimodal model continued to increase until OOM
#7340
opened Jun 19, 2025 by
huangtingwei9988
Loading…
6 tasks
[OAI Server Refactor] [ChatCompletions & Completions] Support Return Hidden State
#7329
opened Jun 18, 2025 by
key4ng
Loading…
4 of 8 tasks
[Fix] fix decode OOM due to wrong estimation when ignore_eos
#7328
opened Jun 18, 2025 by
DarkSharpness
Loading…
1 of 6 tasks
Add FlashInfer NVFP4 MoE for Blackwell with MOE-EP support
high priority
#7327
opened Jun 18, 2025 by
trevor-m
Loading…
6 tasks
Add the missing logic to update existing PD monitoring metrics
#7317
opened Jun 18, 2025 by
SCDESPERTATE
•
Draft
Let EP prefill support new DeepGEMM
high priority
#7310
opened Jun 18, 2025 by
fzyzcjy
Loading…
6 tasks
Let ep_scatter support arbitrary strides / ue8m0 format
high priority
#7309
opened Jun 18, 2025 by
fzyzcjy
Loading…
6 tasks
[Refactor] Clean up radix cache related API
#7303
opened Jun 18, 2025 by
DarkSharpness
Loading…
1 of 6 tasks
Support FP4 quantized models on AMD CDNA2/CDNA3 GPUs
#7302
opened Jun 18, 2025 by
haohui
Loading…
6 tasks done
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.