Skip to content

Tags: neuralmagic/vllm

Tags

v0.10.0

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add think chunk (vllm-project#21333)

Signed-off-by: Julien Denize <julien.denize@mistral.ai>

v0.10.0rc2

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add think chunk (vllm-project#21333)

Signed-off-by: Julien Denize <julien.denize@mistral.ai>

v0.10.0rc1

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Enable v1 metrics tests (vllm-project#20953)

Signed-off-by: Seiji Eicher <seiji@anyscale.com>

v0.9.2

Revert "[V0 deprecation] Remove V0 CPU/XPU/TPU backends (vllm-project…

…#20412)"

This reverts commit e202dd2.

v0.9.2rc2

Revert "[V0 deprecation] Remove V0 CPU/XPU/TPU backends (vllm-project…

…#20412)"

This reverts commit e202dd2.

v0.9.2rc1

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[Misc] Remove _maybe_ignore_quant_config from GLM4.1v (vllm-project#2…

…0432)

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

v0.9.1

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[Misc] Slight improvement of the BNB (vllm-project#19418)

Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

v0.9.1rc2

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[Misc] Slight improvement of the BNB (vllm-project#19418)

Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

v0.9.1rc1

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[Misc] Fix a config typo in disable_hybrid_kv_cache_manager configura…

…tion (vllm-project#19383)

Signed-off-by: Siyuan Liu <lsiyuan@google.com>

v0.9.0.1

[BugFix] FA2 MLA Accuracy Issue (vllm-project#18807)

Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>