Skip to content

Issues: vllm-project/aibrix

v0.3.0 roadmap
#698 opened Feb 18, 2025 by Jeffwan
Open 8
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Provide production grade overlay manifests area/installation kind/enhancement New feature or request priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#847 opened Mar 11, 2025 by Jeffwan
[RFC]: Make API Gateway interface OpenAI compatible area/gateway kind/enhancement New feature or request priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
#846 opened Mar 11, 2025 by Jeffwan
[Observation] Improve AIBrix control plane monitoring area/stability kind/feature Categorizes issue or PR as related to a new feature. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete.
#845 opened Mar 11, 2025 by Jeffwan
[Docs] Provide AIBrix upgrade guidance area/installation kind/documentation Improvements or additions to documentation priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#844 opened Mar 11, 2025 by Jeffwan
Ask for testing suggestions kind/support Categorizes issue as a support question. triage/needs-information Indicates an issue needs more information in order to work on it.
#842 opened Mar 10, 2025 by ying2025
Some prompts with special character fail the benchmark script area/benchmark kind/bug Something isn't working priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#832 opened Mar 9, 2025 by Jeffwan
Making prefix-cache-and-load-aware routing more general area/gateway area/performance kind/enhancement New feature or request kind/feature Categorizes issue or PR as related to a new feature. priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
#814 opened Mar 7, 2025 by gangmuk
Prefix sharing workload generation area/benchmark area/performance priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#813 opened Mar 7, 2025 by gangmuk
ModelAdapter seems to be working abnormally area/lora kind/support Categorizes issue as a support question. triage/needs-information Indicates an issue needs more information in order to work on it.
#801 opened Mar 5, 2025 by ying2025
Do LLM Cache Support V100 hardware?
#791 opened Mar 4, 2025 by jlcoo
Does aibrix support to do load balance against managed model endpoints area/gateway triage/needs-information Indicates an issue needs more information in order to work on it.
#784 opened Mar 3, 2025 by Colstuwjx
Failed to run benchmark scripts against the endpoint area/gateway kind/bug Something isn't working priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
#783 opened Mar 3, 2025 by Jeffwan
Add probe usage practice for super large models, including multi-node case area/performance kind/documentation Improvements or additions to documentation kind/enhancement New feature or request priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
#782 opened Mar 3, 2025 by Jeffwan
We still see some errors that not explainable if httpRoute is missing area/gateway kind/enhancement New feature or request priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#778 opened Mar 1, 2025 by Jeffwan v0.3.0
ProTip! Type g i on any issue or pull request to go back to the issue listing page.