We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The text was updated successfully, but these errors were encountered:
A couple more suggestions:
[kfswain edit] Added these to the EPP refactor section, though they could be seen as Net New features and warrant their own bullet
Sorry, something went wrong.
No branches or pull requests
Required for release
Documentation improvement
EPP refactor focused on extension
Production hardening of reference implementation
EPP
LoRA Syncer issues
Algorithm development improvement
Prefix-aware routing
Queuing/Criticality Enforcement
Stretch
BBR improvements
Conformance testing
Concrete example of a multi-workload InferencePool in use
InferenceModel extensible routing
The text was updated successfully, but these errors were encountered: