docs(wiki): add "Choosing a Model for Coding" comparison page (neutral, no default)
Side-by-side of the three coding-capable models (Gemma-4-26B-A4B Q3_K_M,
Gemma-4-26B-A4B QAT/Q4_0, Qwen3.6-27B Q3_K_S) by quality / speed / context,
from VulkanForge's own coding tests on the 16 GB RX 9070 XT. No single
"best" — user picks by priority. Honest caveats (small sample; not a
quant-controlled comparison; 16 GB-specific). Linked from _Sidebar,
Supported-Models, and Benchmarks; softened the stale Q3_K_M "default" note.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
docs(wiki): VulkanForge user wiki @ v0.7.0
9 content pages + sidebar + footer, sourced strictly from the v0.7.0-tagged repo (README,
CHANGELOG, docs/BENCHMARKS.md, docs/MODELS.md, Cargo.toml, LICENSE, real flag/CLI code):
Home, Installation, Hardware-and-Compatibility, Supported-Models, Usage, Configuration,
Benchmarks, Architecture, Troubleshooting.
User-facing, English, honest (v0.7.0 same-run matrix vs llama.cpp Vulkan: dense prefill parity
0.93-1.04x, Gemma-MoE @2048 0.83-0.89x, decode 0.87-0.97x). No pre-v0.7 numbers, no internal
(sprint reports / coding-standards / bisect) leaks; flags grepped from code with verified defaults.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>