Skip to content

Revisions

  • docs(wiki): v0.8.0 — auto ctx-size, Gemma-4 tool-calling, vf-clide; KV-FP8 now required - New page: vf-clide (the standalone CLI chat client) + Sidebar/Home/Usage/Installation links. - Automatic context sizing (v0.8.0): new Configuration section, Usage/Home/Troubleshooting notes, the 16384 RDNA4 LDS ceiling (explicit --ctx-size above it aborts, not clamped). - Gemma-4 native tool/function calling documented (Usage, Supported-Models). - KV-FP8 corrected from "recommended" to REQUIRED for the Gemma-4-26B-A4B MoE across Supported-Models / Configuration / Usage / Troubleshooting / Architecture / Choosing-a-Model: the non-FP8 KV path is known-broken and the engine fail-loud aborts without VULKANFORGE_KV_FP8=1 (debug override VULKANFORGE_ALLOW_BROKEN_KV=1). - Home/Configuration now reference the shipped v0.8.0 (perf matrix provenance stays v0.7.0).

    @maeddesg maeddesg committed Jun 12, 2026
  • docs(wiki): VulkanForge user wiki @ v0.7.0 9 content pages + sidebar + footer, sourced strictly from the v0.7.0-tagged repo (README, CHANGELOG, docs/BENCHMARKS.md, docs/MODELS.md, Cargo.toml, LICENSE, real flag/CLI code): Home, Installation, Hardware-and-Compatibility, Supported-Models, Usage, Configuration, Benchmarks, Architecture, Troubleshooting. User-facing, English, honest (v0.7.0 same-run matrix vs llama.cpp Vulkan: dense prefill parity 0.93-1.04x, Gemma-MoE @2048 0.83-0.89x, decode 0.87-0.97x). No pre-v0.7 numbers, no internal (sprint reports / coding-standards / bisect) leaks; flags grepped from code with verified defaults. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

    maeddesg committed Jun 9, 2026