Pull requests: sgl-project/sglang
refactor(test): reorganize OpenAI test file structure
#7408
opened Jun 21, 2025 by
CatherineSue
2 of 6 tasks
Tiny fix two-batch overlap incompatible with new DeepGEMM
#7405
opened Jun 21, 2025 by
fzyzcjy
6 tasks
Fix prefill OOM due to wrong token calculation when page > 1
#7397
opened Jun 20, 2025 by
hnyls2002
[BugFix]: fix EmbeddingReqInput single input error
#7396
opened Jun 20, 2025 by
woodx9
1 of 6 tasks
[BugFix]fix qwen25 invoke function call streaming responses with curly braces as the starting indicator
#7394
opened Jun 20, 2025 by
ehuaa
6 tasks
[Quantization] Add int4fp8_moe online quantization on ROCm
#7392
opened Jun 20, 2025 by
fxmarty-amd
Draft
6 tasks
Fix torch compile run
aiter
AI Tensor Engine ROCm
bug
Something isn't working
#7391
opened Jun 20, 2025 by
kkHuang-amd
5 tasks
enable ptpc gemm_a8w8_bpreshuffle_CK form rocm aliter
#7383
opened Jun 20, 2025 by
Yuechguo
6 tasks done
[feat] support minimum token load balance in dp attention
#7379
opened Jun 20, 2025 by
WANG-GH
2 of 3 tasks
Quick fix for DeepGemm requant to also cover MTP.
high priority
#7378
opened Jun 20, 2025 by
pyc96
6 tasks
Fix MTP with Deepseek R1 Fp4
bug
Something isn't working
high priority
#7376
opened Jun 20, 2025 by
pyc96
6 tasks
Reduce overhead for fa by not calling heavy CUDA property check
#7375
opened Jun 20, 2025 by
oraluben
1 of 6 tasks
[misc] Add PD service discovery support in router
high priority
#7361
opened Jun 19, 2025 by
slin1237
3 of 6 tasks
feat(oai refactor): Remove openai_api with entrypoints/openai
collaboration
high priority
#7351
opened Jun 19, 2025 by
CatherineSue
2 of 6 tasks