Issues: vllm-project/aibrix
Issues list
Provide production grade overlay manifests
area/installation
kind/enhancement
New feature or request
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#847
opened Mar 11, 2025 by
Jeffwan
[RFC]: Make API Gateway interface OpenAI compatible
area/gateway
kind/enhancement
New feature or request
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#846
opened Mar 11, 2025 by
Jeffwan
[Observation] Improve AIBrix control plane monitoring
area/stability
kind/feature
Categorizes issue or PR as related to a new feature.
priority/important-longterm
Important over the long term, but may not be staffed and/or may need multiple releases to complete.
#845
opened Mar 11, 2025 by
Jeffwan
[Docs] Provide AIBrix upgrade guidance
area/installation
kind/documentation
Improvements or additions to documentation
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#844
opened Mar 11, 2025 by
Jeffwan
Ask for testing suggestions
kind/support
Categorizes issue as a support question.
triage/needs-information
Indicates an issue needs more information in order to work on it.
#842
opened Mar 10, 2025 by
ying2025
Some prompts with special characters fail the benchmark script
area/benchmark
kind/bug
Something isn't working
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#832
opened Mar 9, 2025 by
Jeffwan
Making prefix-cache-and-load-aware routing more general
area/gateway
area/performance
kind/enhancement
New feature or request
kind/feature
Categorizes issue or PR as related to a new feature.
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#814
opened Mar 7, 2025 by
gangmuk
Prefix sharing workload generation
area/benchmark
area/performance
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#813
opened Mar 7, 2025 by
gangmuk
ModelAdapter seems to be working abnormally
area/lora
kind/support
Categorizes issue as a support question.
triage/needs-information
Indicates an issue needs more information in order to work on it.
#801
opened Mar 5, 2025 by
ying2025
Making max-tokens configurable in the benchmark client.
area/benchmark
#797
opened Mar 5, 2025 by
gangmuk
Recording request routing (target-pod) in the benchmark client
area/benchmark
#796
opened Mar 5, 2025 by
gangmuk
Piggybacking more information in response header
area/benchmark
area/gateway
#795
opened Mar 5, 2025 by
gangmuk
Does AIBrix support load balancing against managed model endpoints?
area/gateway
triage/needs-information
Indicates an issue needs more information in order to work on it.
#784
opened Mar 3, 2025 by
Colstuwjx
Failed to run benchmark scripts against the endpoint
area/gateway
kind/bug
Something isn't working
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#783
opened Mar 3, 2025 by
Jeffwan
Add probe usage practice for super large models, including multi-node case
area/performance
kind/documentation
Improvements or additions to documentation
kind/enhancement
New feature or request
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#782
opened Mar 3, 2025 by
Jeffwan
We still see some errors that are not explainable if httpRoute is missing
area/gateway
kind/enhancement
New feature or request
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
making future benchmarks use this utils properly as well.
area/benchmark
#768
opened Feb 28, 2025 by
gangmuk