docs(wiki): v0.8.0 — auto ctx-size, Gemma-4 tool-calling, vf-clide; KV-FP8 now required
- New page: vf-clide (the standalone CLI chat client) + Sidebar/Home/Usage/Installation links.
- Automatic context sizing (v0.8.0): new Configuration section, Usage/Home/Troubleshooting notes,
and the 16384 RDNA4 LDS ceiling (an explicit --ctx-size above it aborts rather than being clamped).
- Gemma-4 native tool/function calling documented (Usage, Supported-Models).
- KV-FP8 corrected from "recommended" to REQUIRED for the Gemma-4-26B-A4B MoE across
Supported-Models / Configuration / Usage / Troubleshooting / Architecture / Choosing-a-Model:
the non-FP8 KV path is known to be broken, and the engine fails loudly (aborts) unless
VULKANFORGE_KV_FP8=1 is set (debug override: VULKANFORGE_ALLOW_BROKEN_KV=1).
- Home/Configuration now reference the shipped v0.8.0 (perf matrix provenance stays v0.7.0).
docs(wiki): VulkanForge user wiki @ v0.7.0
9 content pages + sidebar + footer, sourced strictly from the v0.7.0-tagged repo (README,
CHANGELOG, docs/BENCHMARKS.md, docs/MODELS.md, Cargo.toml, LICENSE, real flag/CLI code):
Home, Installation, Hardware-and-Compatibility, Supported-Models, Usage, Configuration,
Benchmarks, Architecture, Troubleshooting.
User-facing, English, honest (v0.7.0 same-run matrix vs llama.cpp Vulkan: dense prefill parity
0.93-1.04x, Gemma-MoE @2048 0.83-0.89x, decode 0.87-0.97x). No pre-v0.7 numbers and no leaks of
internal material (sprint reports, coding standards, bisect); flags grepped from code with verified defaults.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>