-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] compressed_tensors requirement missing in new 0.4.4.post2 release
#4818
opened Mar 27, 2025 by
ocss884
5 tasks done
[Bug] sglang[all]>=0.4.4.post2 installation environment is very confusing
#4812
opened Mar 27, 2025 by
17Reset
5 tasks done
[Bug] The Gemma 3 model generates garbage on long generations
#4807
opened Mar 27, 2025 by
Swipe4057
4 of 5 tasks
[Feature] VLM performance optimization
high priority
performance
#4805
opened Mar 27, 2025 by
zhyncs
2 tasks
[feature] how to use save_sharded_state.py for use two nodes model
#4803
opened Mar 27, 2025 by
diggle001
2 tasks
[Bug] an error when run the static benchmark with MTP
#4801
opened Mar 27, 2025 by
yuqie
2 of 5 tasks
[Feature] Support openai responses API interface
#4793
opened Mar 26, 2025 by
ron1x1-abba
2 tasks done
The speed difference between Qwen2-VL and Qwen2.5 is very large
#4786
opened Mar 26, 2025 by
tingyuyan
ValueError: '<class 'sglang.srt.configs.qwen2_5_vl_config.Qwen2_5_VLConfig'>' is already used by a Transformers model.
#4785
opened Mar 26, 2025 by
ujjwal0005
1 of 5 tasks
[Bug] Master node didn't detect worker node not functional, requests hang until timeout
#4780
opened Mar 26, 2025 by
junliu-mde
4 of 5 tasks
[Bug] ImportError: libcuda.so.1: cannot open shared object file: No such file or directory
#4778
opened Mar 26, 2025 by
githust66
5 tasks done
[Feature] adopt trt llm fp8_blockscale_gemm
high priority
#4776
opened Mar 26, 2025 by
zhyncs
2 tasks
[Bug] Random output when using image input with https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct
#4774
opened Mar 25, 2025 by
atbe
5 tasks done
[Bug] Extraneous/incorrect outputs when using response_format on DeepSeek models and MTP
#4771
opened Mar 25, 2025 by
jondurbin
5 tasks done
[Feature] RemoteModelLoader should support pull sharded model
#4762
opened Mar 25, 2025 by
AllenXu93
2 tasks done
[Bug]Throughput decreases after using speculative sampling
#4759
opened Mar 25, 2025 by
bitbooboo
4 of 5 tasks
[Bug] GGUF model with architecture deepseek2 is not supported yet
#4756
opened Mar 25, 2025 by
ciaoyizhen
5 tasks done
Previous Next
ProTip!
no:milestone will show everything without a milestone.