Issues: InftyAI/llmaz
Issues list
What is the difference between llmaz and lws?
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
support
Categorizes issue or PR as related to support.
#333
opened Mar 28, 2025 by
zhaizhch
after backend runtime update, should we re-create the playground pod?
bug
Categorizes issue or PR as related to a bug.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#332
opened Mar 26, 2025 by
pacoxu
Support preStop lifecycle for backendRuntimes
feature
Categorizes issue or PR as related to a new feature.
help wanted
Extra attention is needed
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#320
opened Mar 18, 2025 by
kerthcet
6 tasks
feature: metrics support for controller
feature
Categorizes issue or PR as related to a new feature.
important-critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#307
opened Mar 11, 2025 by
googs1025
3 tasks
Lora Autoscaler
feature
Categorizes issue or PR as related to a new feature.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Add popular open source models as in-tree support
cleanup
Categorizes issue or PR as related to cleaning up code, process, or technical debt.
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Introduce inference reserve config for standby instance
important-longterm
Important over the long term, but may not be staffed and/or may need multiple releases to complete.
needs-kind
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Milestone v0.2.0
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#259
opened Jan 27, 2025 by
kerthcet
2 tasks
vllm only has /health endpoint
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#241
opened Jan 17, 2025 by
kerthcet
3 tasks
Support speculative decoding with llama.cpp
feature
Categorizes issue or PR as related to a new feature.
help wanted
Extra attention is needed
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#240
opened Jan 16, 2025 by
kerthcet
3 tasks
Able to set the toRender parameters dynamically
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#239
opened Jan 16, 2025 by
kerthcet
1 of 3 tasks
Unify the chat api for all inference servers
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#218
opened Dec 10, 2024 by
kerthcet
3 tasks
Add TensorRT-LLM support as another backend
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#205
opened Nov 18, 2024 by
kerthcet
3 tasks
Install lws controller together with llmaz controller in the same namespace
cleanup
Categorizes issue or PR as related to cleaning up code, process, or technical debt.
help wanted
Extra attention is needed
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#202
opened Nov 13, 2024 by
kerthcet
Support speculative decoding with llama.cpp
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#197
opened Nov 5, 2024 by
kerthcet
3 tasks
Serverless support
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Support to serving Stable Diffusion models
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#189
opened Oct 23, 2024 by
kerthcet
1 of 3 tasks
Is there any early proposal or document about integrating with Gateway API ?
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#165
opened Sep 15, 2024 by
caozhuozi
[WebUI] Add support for webui
feature
Categorizes issue or PR as related to a new feature.
help wanted
Extra attention is needed
needs-kind
Indicates a PR lacks a label and requires one.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
[Umbrella] Improve test coverages
cleanup
Categorizes issue or PR as related to cleaning up code, process, or technical debt.
good first issue
Good for newcomers
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#156
opened Sep 12, 2024 by
kerthcet
Support traditional models
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#133
opened Sep 9, 2024 by
kerthcet
3 tasks done
Loading model weights more efficiently
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
Support scaling with Spot instances for cost saving
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
question
Further information is requested
#106
opened Aug 27, 2024 by
kerthcet
3 tasks
Support filesystems
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#100
opened Aug 19, 2024 by
kerthcet
3 tasks done
Model aware scheduling
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a label and requires one.
needs-triage
Indicates an issue or PR lacks a label and requires one.
#96
opened Aug 19, 2024 by
kerthcet
2 of 3 tasks