sgl-project / sglang Public

Notifications You must be signed in to change notification settings
Fork 2.2k
Star 15.5k

Code
Issues 484
Pull requests 386
Discussions
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security
Insights

Pull requests: sgl-project/sglang

Labels 46 Milestones 0

New pull request New

386 Open 4,534 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[PD] Raise error for incompatible mooncake version and some minor fixes

#7527 opened Jun 25, 2025 by ShangmingCai

Loading…

6 tasks

P/D load balancer forwards profiling requests to instances

#7525 opened Jun 25, 2025 by gronsti-amd

Loading…

1 of 6 tasks

[CPU] add c++ kernel to bind CPU cores and memory node

#7524 opened Jun 25, 2025 by chunyuan-w

Loading…

test: improve test for omni models

#7519 opened Jun 25, 2025 by mickqian

Loading…

6 tasks

fix: incorrect dtype when load_model

#7517 opened Jun 25, 2025 by yudian0504

Loading…

6 tasks

load draft model fix

#7506 opened Jun 24, 2025 by yilian49

Loading…

[CPU] remove process_group from inputs of shm_allreduce and shm_allgather cpu

cpu backend performance optimization

intel sgl-kernel

#7486 opened Jun 24, 2025 by chunyuan-w

Loading…

[AMD] Remove vllm's scaled_fp8_quant and moe_sum when SGLANG_USE_AITER=1 high priority

#7484 opened Jun 23, 2025 by hubertlu-tw

Loading…

3 of 6 tasks

[Feature] dynamic server payload size limit

#7475 opened Jun 23, 2025 by khan-yin

Loading…

4 of 6 tasks

fix(bench_serving): handle None tokenizer.bos_token when apply_chat_template==True

#7466 opened Jun 23, 2025 by renne444

Loading…

1 of 6 tasks

[BugFix] Destroy nccl Comm to fix cuda memory leak of destroy_model_parallel

#7465 opened Jun 23, 2025 by wcsjtu

Loading…

2 of 6 tasks

Support non-contiguous query input for extend/decode attention cpu

cpu backend performance optimization

intel sgl-kernel

#7462 opened Jun 23, 2025 by yanbing-j

Loading…

6 tasks

[benchmark] print final benchmark args in json format

#7455 opened Jun 23, 2025 by staugust

Loading…

1 of 6 tasks

Fix for fp8 quantization failure of qwen 2.5 VL 7B model. high priority

#7448 opened Jun 22, 2025 by PanJason

Loading…

2 of 6 tasks

Support dynamic LoRA loading / unloading in engine/server API ready-for-review

#7446 opened Jun 22, 2025 by lifuhuang

Loading…

2 of 6 tasks

fix: fix apply_shuffle_mul_sum

#7444 opened Jun 22, 2025 by mickqian

Loading…

6 tasks

fix: minor fix cutlass moe

#7442 opened Jun 22, 2025 by mickqian

Loading…

6 tasks

OPTForCasualLM Support (facebook/opt Series) new-model

#7440 opened Jun 22, 2025 by b8zhong

Loading…

Fix: remove duplicate initial assignments in PrefillBootstrapQueue

#7438 opened Jun 22, 2025 by hzh0425

Loading…

1 of 6 tasks

Fix: resolve prefill of retracted request out-of-memory issue when ignore_eos is enabled high priority

#7434 opened Jun 22, 2025 by GaoYusong

Loading…

1 of 6 tasks

Fix stream reasoning parser and Adds Kimi reasoning parser

#7432 opened Jun 22, 2025 by JustinTong0323

Loading…

2 of 6 tasks

[env] Organize sglang environ variables.

#7431 opened Jun 22, 2025 by hnyls2002 • Draft

6 tasks

[WIP][RL] fix fp8 update weight

#7421 opened Jun 21, 2025 by zhuzilin • Draft

1 of 6 tasks

[RL] add pause and continue generation for async rl training

#7419 opened Jun 21, 2025 by zhuzilin

Loading…

1 of 6 tasks

[RL] Add --nccl-port and --other-ports to prevent port conflict

#7418 opened Jun 21, 2025 by zhuzilin

Loading…

1 of 6 tasks

Previous 1 2 3 4 5 … 15 16 Next

Previous Next

ProTip! no:milestone will show everything without a milestone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!