Skip to content

v0.1.4

Choose a tag to compare

@DominguesM DominguesM released this 14 Jun 23:08
· 25 commits to develop since this release
9357775

[0.1.4] - 2026-06-14

Added

  • Added high-level streaming completion APIs, including
    create_completion_stream, create_completion_stream_with_sampler,
    CompletionChunk, StreamControl and richer completion logprob
    metadata.
  • Added llama-crab-server, an HTTP server binary for local inference
    with completions, chat completions, embeddings, reranking,
    tokenization, detokenization, SSE streaming and optional multimodal
    chat support.
  • Added OpenAI-style high-level convenience helpers for text, chat and
    embeddings with token accounting.
  • Added the server_lfm example wrapper and an lfm-text download
    target for launching the HTTP server with LFM text models.
  • Added the streaming example to demonstrate callback-driven text
    generation.
  • Added tauri-plugin-llama-crab, a Tauri IPC runtime for loading
    GGUF models and exposing OpenAI-like chat, completion, embedding,
    rerank, tokenization and model-management commands.
  • Added the @llama-crab/core and @llama-crab/tauri TypeScript
    packages with shared OpenAI-like contracts, request mappers and a
    Tauri client.
  • Added the tauri-chat-lfm desktop example and smoke coverage for
    the Tauri chat workflow.
  • Added mobile-oriented runtime presets through MobilePreset and
    LlamaParams::with_mobile_preset.
  • Added broader tool-call streaming support, including OpenAI-style
    tool-call deltas.
  • Added documentation deployment for the project guide.

Changed

  • Migrated the user guide from mdBook/MkDocs-era documentation to
    Docusaurus, with expanded server, mobile, Tauri, TypeScript,
    streaming, chat, embeddings and grammar coverage.
  • Reorganized the repository into crates/ and packages/ workspaces
    so Rust crates, TypeScript packages and examples share one release
    surface.
  • README files now point users to the new Docusaurus guide hosted at the
    GitHub Pages site.
  • CI and release workflows now build, test and publish
    llama-crab-server, tauri-plugin-llama-crab and TypeScript
    packages alongside the library crates.
  • CI workflows now run through manual dispatch instead of push triggers,
    and documentation jobs use nightly Cargo where required.
  • The hf-tokenizer dependency now enables the onig feature for
    tokenizer compatibility.
  • Rustdoc crate logos now reference the current Canarim Crab asset.

Fixed

  • Removed unused placeholder OpenAI-compat wrapper bindings from
    llama-crab-sys and the old chat module export.
  • Gated the Metal backend build configuration to macOS targets.
  • Hardened documentation builds and docs deployment workflow behavior.
  • Cleaned up server and example runner support for the new server and
    mobile workflows.