Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP]Add gemma3n
#7344 opened Jun 19, 2025 by JustinTong0323 Draft
6 tasks
[BugFix] Fix AssertionError: res=<Response [502]>, res.text=''
#7341 opened Jun 19, 2025 by gty111 Loading…
1 of 6 tasks
[NVIDIA] Add Jetson Orin/Thor and Spark Codegen
#7337 opened Jun 19, 2025 by johnnynunez Loading…
[NVIDIA] Jetson Compatibility
#7336 opened Jun 19, 2025 by johnnynunez Loading…
[Fix] fix decode OOM due to wrong estimation when ignore_eos
#7328 opened Jun 18, 2025 by DarkSharpness Loading…
1 of 6 tasks
Purge VerlEngine
#7326 opened Jun 18, 2025 by MrAta Loading…
1 of 5 tasks
[NVIDIA] Fix Jetson Build
#7323 opened Jun 18, 2025 by johnnynunez Loading…
Feat/refactor embedding server
#7322 opened Jun 18, 2025 by woodx9 Loading…
1 of 6 tasks
Fix GuidanceGrammar bitmask
#7321 opened Jun 18, 2025 by YorkSu Loading…
1 of 6 tasks
[doc] update lws doc for pd
#7318 opened Jun 18, 2025 by whybeyoung Loading…
[NVIDIA] Avoid OOM issues aarch64 wheels
#7316 opened Jun 18, 2025 by johnnynunez Loading…
support Qwen3ForSequenceClassification
#7314 opened Jun 18, 2025 by bzantium Loading…
3 of 6 tasks
Kernels for efficient KV cache IO
#7313 opened Jun 18, 2025 by xiezhq-hermann Loading…
1 of 6 tasks
Add get_hidden_dim to qwen3.py for correct lora
#7312 opened Jun 18, 2025 by logachevpa Loading…
Let EP prefill support new DeepGEMM high priority
#7310 opened Jun 18, 2025 by fzyzcjy Loading…
6 tasks
Tiny add logs for expert location updater
#7308 opened Jun 18, 2025 by fzyzcjy Loading…
6 tasks
[Refactor] Clean up radix cache related API
#7303 opened Jun 18, 2025 by DarkSharpness Loading…
1 of 6 tasks
Support FP4 quantized models on AMD CDNA2/CDNA3 GPUs
#7302 opened Jun 18, 2025 by haohui Loading…
6 tasks done
[wip] use trt-llm(flashinfer) cutlass moe in b200
#7296 opened Jun 18, 2025 by BBuf Draft
6 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.