v0.6.0
π Features
π· Built-in distributed workers support: running completion / chat model worker on different process / machine.
π¬ Conversation history in chat playground.
π§° Fixes and Improvements
- Fix the slow repository indexing due to constraint memory arena in tantivy index writer.
- Command line argument
--model
is now optional, so users can create a chat only instance. - New command line argument
--parallelism
to control the throughput and VRAM usage: #727 - New api path
/metrics
endpoint for prometheus metrics collection.
π« New Contributors
- @liangfung made their first contribution in #702
- @erfanium made their first contribution in #742
- @costanzo made their first contribution in #748
- @darknight made their first contribution in #750
- @suside made their first contribution in #775
- @jpoisso made their first contribution in #838
- @Squadrick made their first contribution in #849
- @sonique6784 made their first contribution in #813
Full Changelog: v0.5.5...v0.6.0