Release v0.1.0 · PaytonWebber/model2vec-zig

model2vec/potion inference in pure Zig: WordPiece tokenization, mean pooling, L2 normalization, parity-tested against the Python reference (max abs diff under 1e-5).

f32 and i8 safetensors; i8 quantizer output is byte-identical to the reference implementation's
TurboQuant-style 4-bit format (m2v-quantize --tq4): 8x smaller than f32, costs 0.0020 mean NDCG@10 on the MTEB(eng, v2) retrieval suite (see docs/turboquant.md)
Model.loadFromBytes for @embedFile-shipped models, Model.fingerprint() for keying persisted vectors
4.1 us per embed of a 17-token text on potion-base-8M (x86_64, ReleaseFast)

Requires Zig 0.16.0.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.1.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!