Releases: helixml/helix
0.8.1
0.8.0 - App tools & internal qapair gen
What's Changed
API Tools in Helix Apps
You can now version control configuration for OpenAPI tools integrations and spawn embedded widget sessions from such apps. You can also use the chat completion API with the new per-app API key to get tools enabled on that chat session.
Example config: https://github.com/helixml/example-helix-app
Fork the repo above and add it as an app inside Helix. Then embed the widget in an html page, activate it, and ask it about the hiring pipeline (demo API).
https://twitter.com/lmarsden/status/1787846737870127426
Fully local qapair generation
We now also have basic support (without schema enforcement, yet) for configuring the qapair generator in fine-tuning to use fully local models.
- Feature/qa pair gen by @rusenask in #284
- Feature/app tools by @binocarlos in #286
- fix subrouter by @rusenask in #287
Full Changelog: 0.7.5...0.8.0
0.7.5 - adjust ollama3-70b memory
Llama3-70B now runs quickly in 39GB of vRAM on latest ollama build.
Full Changelog: 0.7.4...0.7.5
0.7.4
0.7.3 - switch to 4bit quant of llama3-70b
What's Changed
- switch to 4bit quant of llama3-70b by @lukemarsden in #278
Full Changelog: 0.7.2...0.7.3
0.7.2 - GPU charts fix, faster Llama3-70B
What's Changed
- Stop running keycloak in dev mode in production by @chocobar in #273
- Fix/gpu limits for runner chart by @rusenask in #274
- bump 70b memory usage by @lukemarsden in #276
Full Changelog: 0.7.1...0.7.2
0.7.1 - hotfixes to default models and passthru HF_TOKEN
What's Changed
- helm runner models by @rusenask in #271
- hotfix: add passthru HF_TOKEN env var by @lukemarsden in #272
Full Changelog: 0.7.0...0.7.1
0.7.0 - llama3 and helix apps
What's Changed
- Feature/helixapps by @binocarlos in #266
- Feature/runner helm chart by @rusenask in #267
- Feature/helm runner public repo install by @rusenask in #268
- Fix/runner name as pod name for the helm chart by @rusenask in #269
- Feature/llama3 inference by @rusenask in #270
Full Changelog: 0.6.11...0.7.0
0.6.11
0.6.10
What's Changed
- Feature/helm chart by @rusenask in #251
- frontend explanation in keycloak config by @rusenask in #252
- Fix/nocloak by @rusenask in #253
- Widget embed by @binocarlos in #255
- centered message li formatting by @bigadamknight in #250
- remove widget from controlplane now we have it in another repo by @binocarlos in #258
- Fix/use envconfig by @binocarlos in #256
- add tracking with rudder stack by @binocarlos in #259
- Feature/chunking and concurrency limits by @rusenask in #254
- Fix/react lint issue by @rusenask in #260
- pass the lora dir into start req by @rusenask in #261
Full Changelog: 0.6.9...0.6.10