v0.4 Release Tracker #681

Open
1 of 17 tasks
kfswain opened this issue Apr 13, 2025 · 1 comment
Labels
triage/accepted Indicates an issue or PR is ready to be actively worked on.

Comments

@kfswain
Collaborator

kfswain commented Apr 13, 2025

Required for release

Documentation improvement

EPP refactor focused on extension

Production hardening of reference implementation

EPP

LoRA Syncer issues

Algorithm development improvement

Prefix-aware routing

Queuing/Criticality Enforcement

Stretch

BBR improvements

Conformance testing

Concrete example of a multi-workload InferencePool in use

InferenceModel extensible routing

  • Not just LoRA (e.g., RAG and system prompts; leave room for potential expansion to things such as activation engineering)
@ahg-g
Contributor

ahg-g commented Apr 15, 2025

A couple more suggestions:

[kfswain edit] Added these to the EPP refactor section, though they could be seen as net-new features that warrant their own bullet.

kfswain pinned this issue Apr 21, 2025
kfswain added the triage/accepted label Apr 25, 2025