Releases: InftyAI/llmaz
Releases · InftyAI/llmaz
v0.0.8
What's Changed
🚀 Major Features:
✨ Features:
- feat:support apply llmaz to any ns by @qinguoyi in #172
- feat:update model loader by @qinguoyi in #178
🐛 Bugs:
♻️ Cleanups:
- Add release checklist by @kerthcet in #159
- chore: bump LWS version to v0.4.0 by @googs1025 in #162
- Bump sigs.k8s.io/lws from 0.4.0 to 0.4.1 by @dependabot in #185
- feature(webhook): add BackendRuntimeConfig resources validation by @googs1025 in #170
- fix:load models cost seconds by @qinguoyi in #175
- Update Revision default to main by @kerthcet in #176
- Downsize model-loader image by @qinguoyi in #179
New Contributors
- @googs1025 made their first contribution in #162
- @qinguoyi made their first contribution in #168
Full Changelog: v0.0.7...v0.0.8
v0.0.7
What's Changed
🚀 Major Features:
- [1/N] Add backendRuntime CRD by @kerthcet in #138
- [2/N] Add backendRuntime implementation by @kerthcet in #139
- Add helm chart support by @kerthcet in #142
✨ Features:
🐛 Bugs:
- Fix resource limits could be small than requests by @kerthcet in #136
- Fix filename error by @kerthcet in #147
♻️ Cleanups:
Full Changelog: v0.0.6...v0.0.7
v0.0.6
What's Changed
🚀 Major Features:
✨ Features:
- Add model label to Playground by @kerthcet in #111
- Add new conditions to Playground by @kerthcet in #120
- Change ModelClaims API by @kerthcet in #125
🐛 Bugs:
- fix wrong field path in the openmodel webhook by @carlory in #107
- Playground should be triggered to create Services and then Pods once the model is created by @carlory in #109
- Fix watch for changes to LeaderWorkerSet created by llmaz and trigger a Reconcile for the owner by @carlory in #108
♻️ Cleanups:
New Contributors
Full Changelog: v0.0.5...v0.0.6
v0.0.5
v0.0.4
v0.0.3
v0.0.2
What's Changed
✨ Feature:
- Feat: support modelscope by @vicoooo26 in #45
- Feat: support sglang backend by @vicoooo26 in #46
- Support Secrets to store HF_TOKEN by @kerthcet in #71
♻️ Cleanup:
- Add support for multithread when downloading weights by @kerthcet in #41
- Add dependabot and issue & pr template by @kerthcet in #51
- Add baseline for tests by @kerthcet in #57
🐛 Bug:
New Contributors
- @dependabot made their first contribution in #53
- @vicoooo26 made their first contribution in #45
Full Changelog: v0.0.1...v0.0.2
First release v0.0.1
What's Changed
- Add new core APIs, e.g.
Model
- Add new inference APIs, e.g.
Playground
,Inference
- Support serving models from huggingface
- Support sharing models weights on the same node
- Integrate with vllm by default
- Add webhooks for APIs
- Add installation documents and examples