Bump version to v0.5.0 #5384
Conversation
While it's not for NVIDIA GPUs, I'd like to include #5323 since it contains some critical fixes for MI300x. The PR is almost ready for merge.
Thanks @WoosukKwon. #5323 should be ready to merge now.
#5354 should benefit the tp > 1 case in general, especially for high-end GPUs. The code is ready, but I'm not sure if @njhill or @zhuohan123 can finish the review today.
I'll update the tag after those PRs are merged. Merging this now to generate wheels for testing.
* upstream/main: (126 commits)
  [Bugfix][Frontend] Cleanup "fix chat logprobs" (vllm-project#5026)
  [Bugfix] OpenAI entrypoint limits logprobs while ignoring server defined --max-logprobs (vllm-project#5312)
  [Misc] Various simplifications and typing fixes (vllm-project#5368)
  [ci] Fix Buildkite agent path (vllm-project#5392)
  [Doc] Add documentation for FP8 W8A8 (vllm-project#5388)
  Bump version to v0.5.0 (vllm-project#5384)
  [Docs] Alphabetically sort sponsors (vllm-project#5386)
  [Docs] Add Docs on Limitations of VLM Support (vllm-project#5383)
  [ci] Mount buildkite agent on Docker container to upload benchmark results (vllm-project#5330)
  [ci] Use small_cpu_queue for doc build (vllm-project#5331)
  [Bugfix] Fix LLaVA-NeXT (vllm-project#5380)
  [Feature][Frontend]: Continued `stream_options` implementation also in CompletionRequest (vllm-project#5319)
  [Model] Initial support for LLaVA-NeXT (vllm-project#4199)
  [Misc] Improve error message when LoRA parsing fails (vllm-project#5194)
  [misc][typo] fix typo (vllm-project#5372)
  [Frontend][Misc] Enforce Pixel Values as Input Type for VLMs in API Server (vllm-project#5374)
  [Misc] Update to comply with the new `compressed-tensors` config (vllm-project#5350)
  [Bugfix] Fix KeyError: 1 When Using LoRA adapters (vllm-project#5164)
  [Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (vllm-project#5047)
  [mis][ci/test] fix flaky test in test_sharded_state_loader.py (vllm-project#5361)
  ...
No description provided.