History / Benchmarks

Revisions

docs(wiki): add "Choosing a Model for Coding" comparison page (neutral, no default) Side-by-side of the three coding-capable models (Gemma-4-26B-A4B Q3_K_M, Gemma-4-26B-A4B QAT/Q4_0, Qwen3.6-27B Q3_K_S) by quality / speed / context, from VulkanForge's own coding tests on the 16 GB RX 9070 XT. No single "best" — user picks by priority. Honest caveats (small sample; not a quant-controlled comparison; 16 GB-specific). Linked from _Sidebar, Supported-Models, and Benchmarks; softened the stale Q3_K_M "default" note. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

maeddesg committed Jun 10, 2026

d744b8f
docs(wiki): VulkanForge user wiki @ v0.7.0 9 content pages + sidebar + footer, sourced strictly from the v0.7.0-tagged repo (README, CHANGELOG, docs/BENCHMARKS.md, docs/MODELS.md, Cargo.toml, LICENSE, real flag/CLI code): Home, Installation, Hardware-and-Compatibility, Supported-Models, Usage, Configuration, Benchmarks, Architecture, Troubleshooting. User-facing, English, honest (v0.7.0 same-run matrix vs llama.cpp Vulkan: dense prefill parity 0.93-1.04x, Gemma-MoE @2048 0.83-0.89x, decode 0.87-0.97x). No pre-v0.7 numbers, no internal (sprint reports / coding-standards / bisect) leaks; flags grepped from code with verified defaults. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

maeddesg committed Jun 9, 2026

02792bd