-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Usage] How to run multiple model instances on a single GPU
#5507
opened Apr 17, 2025 by
spitzblattr
[Bug] Incorrect passing of ForwardBatch parameter in TpModelWorker.forward_batch_generation
#5506
opened Apr 17, 2025 by
u4lr451
2 of 5 tasks
Large Discrepancy in Speedup Between SGLang + Eagle and Eagle Repo Code
#5502
opened Apr 17, 2025 by
motigrez
[Bug] Qwen-gme embedding model: cannot get fused embedding from text+image, and image input format may be incorrect
#5498
opened Apr 17, 2025 by
wwqq
5 tasks done
[performance] 1P1D deployment of DeepSeek-R1 on two H200 machines does not meet performance expectations.
#5490
opened Apr 17, 2025 by
ZhenshengWu
5 tasks done
[Bug] Potentially create too much process when using verl-sglang init_model
#5483
opened Apr 17, 2025 by
SwordFaith
5 tasks done
[RFC][Feature][Model] Add templated fallback HF
transformers
model backend in SRT
#5471
opened Apr 16, 2025 by
XuehaiPan
1 of 6 tasks
[Bug] SGLang Server Freezes During High Traffic Periods with 16-GPU DeepSeek v3 Setup
#5465
opened Apr 16, 2025 by
moqimoqidea
5 tasks done
[Bug] 模型一直停留在Prefill batch阶段,无法进行decode 出现卡死情况,最终导致计算节点Watchdog Timeout
#5458
opened Apr 16, 2025 by
WALLE-AI
5 tasks
[Bug] rank 0 could finish the model loading, but there are other ranks that didn't finish loading. It is likely due to unexpected failures (eg,OOM) or a slow node.
#5457
opened Apr 16, 2025 by
Alex-9827
5 tasks done
[Bug] incorrect output from same-prefix requests for Qwen2.5VL
#5455
opened Apr 16, 2025 by
KivenChen
5 tasks done
What is the relationship between ModelRunner and the model(deepseek.py,llama.py..etc)?
#5453
opened Apr 16, 2025 by
wangzhen2271
[Bug] sglang.bench_serving IndexError when add extra body stream_options.include_usage
#5451
opened Apr 16, 2025 by
morty-zxb
5 tasks done
[Bug] PD disaggregation, KV transfer slow down under high concurrency
#5450
opened Apr 16, 2025 by
MtFitzRoy
5 tasks done
[Bug] v0.4.5 NameError: name 'VLLM_AVAILABLE' is not defined from compressed_tensors.py
#5443
opened Apr 16, 2025 by
kratorado
5 tasks done
[Bug] ValueError: Model architectures ['Glm4ForCausalLM'] are not supported for now.
#5441
opened Apr 16, 2025 by
chunxingque
5 tasks done
[Feature] optimize SegmentPackBits
high priority
speculative-decoding
#5437
opened Apr 15, 2025 by
zhyncs
2 tasks
Help Needed: Slow Inference on 2xH100x8 Setup with DeepSeek-R1
#5429
opened Apr 15, 2025 by
seungduk-yanolja
Previous Next
ProTip!
Adding no:label will show everything without a label.