You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
[0.1.4] - 2026-06-14
Added
Added high-level streaming completion APIs, including create_completion_stream, create_completion_stream_with_sampler, CompletionChunk, StreamControl and richer completion logprob
metadata.
Added llama-crab-server, an HTTP server binary for local inference
with completions, chat completions, embeddings, reranking,
tokenization, detokenization, SSE streaming and optional multimodal
chat support.
Added OpenAI-style high-level convenience helpers for text, chat and
embeddings with token accounting.
Added the server_lfm example wrapper and an lfm-text download
target for launching the HTTP server with LFM text models.
Added the streaming example to demonstrate callback-driven text
generation.
Added tauri-plugin-llama-crab, a Tauri IPC runtime for loading
GGUF models and exposing OpenAI-like chat, completion, embedding,
rerank, tokenization and model-management commands.
Added the @llama-crab/core and @llama-crab/tauri TypeScript
packages with shared OpenAI-like contracts, request mappers and a
Tauri client.
Added the tauri-chat-lfm desktop example and smoke coverage for
the Tauri chat workflow.
Added mobile-oriented runtime presets through MobilePreset and LlamaParams::with_mobile_preset.
Added broader tool-call streaming support, including OpenAI-style
tool-call deltas.
Added documentation deployment for the project guide.
Changed
Migrated the user guide from mdBook/MkDocs-era documentation to
Docusaurus, with expanded server, mobile, Tauri, TypeScript,
streaming, chat, embeddings and grammar coverage.
Reorganized the repository into crates/ and packages/ workspaces
so Rust crates, TypeScript packages and examples share one release
surface.
README files now point users to the new Docusaurus guide hosted at the
GitHub Pages site.
CI and release workflows now build, test and publish llama-crab-server, tauri-plugin-llama-crab and TypeScript
packages alongside the library crates.
CI workflows now run through manual dispatch instead of push triggers,
and documentation jobs use nightly Cargo where required.
The hf-tokenizer dependency now enables the onig feature for
tokenizer compatibility.
Rustdoc crate logos now reference the current Canarim Crab asset.
Fixed
Removed unused placeholder OpenAI-compat wrapper bindings from llama-crab-sys and the old chat module export.
Gated the Metal backend build configuration to macOS targets.
Hardened documentation builds and docs deployment workflow behavior.
Cleaned up server and example runner support for the new server and
mobile workflows.