Skip to content

Activity

fixed language model test

robertgshaw2-redhatpushed 1 commit to v1-default • 4122e63…0a97603 • 
4 minutes ago

Merge branch 'main' into v1-default

robertgshaw2-redhatpushed 10 commits to v1-default • c6733e2…4122e63 • 
18 minutes ago

Merge branch 'main' into v1-default

robertgshaw2-redhatpushed 4 commits to v1-default • ab68565…c6733e2 • 
32 minutes ago

revert spurious change

robertgshaw2-redhatpushed 1 commit to v1-default • d4b7469…ab68565 • 
1 hour ago

fixed plugin tests

robertgshaw2-redhatpushed 1 commit to v1-default • 2bf2350…d4b7469 • 
1 hour ago

fix weight loading

robertgshaw2-redhatpushed 1 commit to v1-default • 1de1f20…2bf2350 • 
1 hour ago

fixed lora tests

robertgshaw2-redhatpushed 7 commits to v1-default • ff8bf5d…1de1f20 • 
1 hour ago

[Misc] Ensure out-of-tree quantization method recognize by cli args (#…

Pull request merge
DarkLight1337pushed 1 commit to main • 212007b…a21076e • 
4 hours ago

[Hardware][TPU] Fix the recompiling issue in logits processor after w…

Pull request merge
mgoinpushed 1 commit to main • fb16eea…212007b • 
6 hours ago

[Bugfix] Revert QKVCrossParallelLinear usage in Mllama to keep BNB qu…

Pull request merge
DarkLight1337pushed 1 commit to main • 73ae0b4…fb16eea • 
11 hours ago

[Bugfix] Fix tqdm progress bar when SamplingParams.n > 1 (#12428)

Pull request merge
WoosukKwonpushed 1 commit to main • 6d7f037…73ae0b4 • 
12 hours ago

[Feat] Support chunked prefill for LMCache connector (#14505)

Pull request merge
vllm-botpushed 1 commit to main • 10f7552…6d7f037 • 
12 hours ago

[V1][TPU] Remove unnecessary padding for running on TPU. (#14467)

Pull request merge
mgoinpushed 1 commit to main • b0d5419…10f7552 • 
13 hours ago

[Attention] Default to FlashMLA backend for MLA (#14451)

Pull request merge
simon-mopushed 1 commit to main • 5f0b53c…b0d5419 • 
14 hours ago

Deleted branch

ywang96deleted revert-13776-fix-memory • 
14 hours ago

Revert "[V1][Core] Fix memory issue with logits & sampling" (#14504)

Pull request merge
ywang96pushed 1 commit to main • eb8b5eb…5f0b53c • 
14 hours ago

Update benchmark_throughput

WoosukKwoncreated fix-parallel-sample • c33cfb4 • 
16 hours ago

ultravox not working on V1

robertgshaw2-redhatpushed 1 commit to v1-default • 276f79e…ff8bf5d • 
16 hours ago

patch

ywang96pushed 1 commit to revert-13776-fix-memory • bbc5893…4557c4c • 
17 hours ago

Merge branch 'main' into revert-13776-fix-memory

ywang96pushed 2 commits to revert-13776-fix-memory • d52253c…bbc5893 • 
17 hours ago

fix lora and quantization

robertgshaw2-redhatpushed 1 commit to v1-default • 8e273d3…276f79e • 
17 hours ago

[V1] Support bad_words in sampler (#13376)

Pull request merge
simon-mopushed 1 commit to main • 9513290…eb8b5eb • 
17 hours ago

Revert "[V1][Core] Fix memory issue with logits & sampling (#13776)"

robertgshaw2-redhatpushed 1 commit to v1-default • bdee6b0…8e273d3 • 
17 hours ago

Revert "[V1][Core] Fix memory issue with logits & sampling (#13776)"

robertgshaw2-redhatcreated revert-13776-fix-memory • d52253c • 
17 hours ago

Deleted branch

Revert "[Bugfix] Fix profiling OOM and decouple encoder multimodal pr…

robertgshaw2-redhatcreated revert-14361-fix-mm-profiling • b9272cb • 
17 hours ago

Deleted branch

Revert "[Bugfix] Make the deviceprofiler include LoRA memory. (#14469)"

robertgshaw2-redhatcreated revert-14469-fix-lora-profile • 85b8564 • 
18 hours ago

fix sampler, metrics, tracing

robertgshaw2-redhatpushed 2 commits to v1-default • af33b95…bdee6b0 • 
18 hours ago

fix engine

robertgshaw2-redhatpushed 3 commits to v1-default • e613909…af33b95 • 
19 hours ago