Public release v1.8.0
What's new
- Support mining ModelOS and Pearl (MDL+PRL)
- Support 9070XT
- Fix error on 6700XT
What's changed
- Intel Arc XMX: Implements an 8×8×32 INT8 cooperative matrix fold shader using SIMD8. Each lane XORs its own values, and subgroupXor then combines the lanes across the four 8×8 fragments that form a single 16×16 hash tile, producing a bit-exact transcript. Validated against CPU results.
- RDNA1 / Vega 56 (SWINT8): Emulates the INT8 dot product in software when shaderIntegerDotProduct is unavailable, manually sign-extending the operands and performing INT32 multiply-add operations. Shares a unified execution path with the DP4A implementation.
- Architecture gating: Now uses the exact RDNA2 device ID range (0x73BF–0x7447), so RDNA4 devices (0x7550) correctly bypass the RDNA2-specific path.
- Vendor detection: vendor_id is now captured during initialization. When it matches Intel's vendor ID (0x8086), the cooperative matrix capability probe accepts the XMX 8×8×K matrix shape.
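
The Intel Arc XMX fold in the list above can be illustrated with a small CPU-side simulation. This is only a sketch, not the shader itself: it assumes each of the 8 lanes owns one row of every 8×8 fragment and that the fold reduces the tile to a single word. The function names (split_fragments, lane_partial, subgroup_xor, fold_tile, cpu_reference) are hypothetical. It shows why the result can be bit-exact: XOR is associative and commutative, so the lane and fragment partitioning does not affect the outcome.

```python
from functools import reduce
from operator import xor

FRAG = 8            # fragment edge length (8x8 fragments)
SUBGROUP_SIZE = 8   # SIMD8: eight lanes per subgroup


def split_fragments(tile):
    """Split a 16x16 tile into its four 8x8 fragments."""
    frags = []
    for r0 in (0, FRAG):
        for c0 in (0, FRAG):
            frags.append([row[c0:c0 + FRAG] for row in tile[r0:r0 + FRAG]])
    return frags


def lane_partial(fragments, lane):
    """Per-lane XOR: assumed layout where a lane owns row `lane` of each fragment."""
    acc = 0
    for frag in fragments:
        for value in frag[lane]:
            acc ^= value
    return acc


def subgroup_xor(lane_values):
    """Stand-in for the GLSL subgroupXor reduction across all lanes."""
    return reduce(xor, lane_values, 0)


def fold_tile(tile):
    """Fold a 16x16 tile: per-lane XOR first, then a cross-lane XOR."""
    frags = split_fragments(tile)
    partials = [lane_partial(frags, lane) for lane in range(SUBGROUP_SIZE)]
    return subgroup_xor(partials)


def cpu_reference(tile):
    """Flat XOR over every element, used as the CPU check."""
    return reduce(xor, (value for row in tile for value in row), 0)
```

Comparing fold_tile against cpu_reference on the same tile mirrors the "validated against CPU results" step in spirit, under the layout assumptions stated above.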
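
The SWINT8 fallback in the list above amounts to a four-way signed INT8 dot product built from sign extension and 32-bit multiply-adds. A minimal Python sketch of that arithmetic follows, assuming four INT8 values packed into one 32-bit word with lane 0 in the low byte. The helper names (sign_extend8, to_int32, pack_i8x4, dot4_i8_emulated) are hypothetical and do not come from the release.

```python
def sign_extend8(byte: int) -> int:
    """Interpret the low 8 bits as a signed two's-complement value."""
    byte &= 0xFF
    return byte - 0x100 if byte & 0x80 else byte


def to_int32(value: int) -> int:
    """Wrap a Python int to the signed 32-bit range, as a 32-bit register would."""
    value &= 0xFFFFFFFF
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value


def pack_i8x4(values) -> int:
    """Pack four signed 8-bit values into one 32-bit word (lane 0 = low byte)."""
    word = 0
    for lane, v in enumerate(values):
        word |= (v & 0xFF) << (8 * lane)
    return word


def dot4_i8_emulated(a_packed: int, b_packed: int, acc: int = 0) -> int:
    """Emulated DP4A: acc + sum of a[i] * b[i] over four signed INT8 lanes."""
    for lane in range(4):
        shift = 8 * lane
        a = sign_extend8(a_packed >> shift)  # manual sign extension
        b = sign_extend8(b_packed >> shift)
        acc = to_int32(acc + a * b)          # INT32 multiply-add
    return acc
```

For example, dot4_i8_emulated(pack_i8x4([1, -2, 3, -4]), pack_i8x4([5, 6, -7, 8])) evaluates 5 - 12 - 21 - 32 and returns -60. Because the function takes and returns the same packed words and accumulator a hardware DP4A would, it could slot into the same code path, which is one plausible reading of the unified execution path mentioned above.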
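
The architecture gating and vendor detection items above reduce to two small predicates. The sketch below uses the ID values quoted in these notes. The function names are hypothetical, and the rule that a 16×16 shape is accepted for every vendor is an assumption made for illustration, not something the release states.

```python
RDNA2_DEVICE_ID_MIN = 0x73BF  # lower bound of the RDNA2 range from the notes
RDNA2_DEVICE_ID_MAX = 0x7447  # upper bound of the RDNA2 range from the notes
INTEL_VENDOR_ID = 0x8086      # Intel's PCI vendor ID


def is_rdna2(device_id: int) -> bool:
    """True only for device IDs inside the inclusive RDNA2 range."""
    return RDNA2_DEVICE_ID_MIN <= device_id <= RDNA2_DEVICE_ID_MAX


def accepts_coop_matrix_shape(vendor_id: int, m: int, n: int) -> bool:
    """Decide whether the capability probe should accept an MxNxK shape."""
    if (m, n) == (16, 16):
        return True  # assumption: the 16x16 shape is accepted on any vendor
    # The XMX 8x8xK shape is accepted only when the device reports Intel.
    return vendor_id == INTEL_VENDOR_ID and (m, n) == (8, 8)
```

With these bounds, is_rdna2(0x7550) is False, so an RDNA4 device skips the RDNA2-specific path, while both endpoints 0x73BF and 0x7447 still match.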