ONNX Model Files v1

byx-darwin released this 01 Jun 11:07
· 53 commits to master since this release

all-MiniLM-L6-v2 ONNX model and tokenizer for tokenless semantic-aware compression (Level 2).

Files:

  • all-MiniLM-L6-v2.onnx (~86MB) — FP32 ONNX model
  • tokenizer.json (~455KB) — HuggingFace tokenizer configuration

Usage: Place both files in ~/.tokenless/models/ or run make models-install.
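
For the manual route, the copy step could be sketched roughly as follows. This is only an illustration, assuming both files were already downloaded into a local directory. The helper name install_models and the constants are hypothetical and not part of tokenless, and nothing here reflects what make models-install actually does.

```python
import shutil
from pathlib import Path

# File names as listed in this release.
MODEL_FILES = ("all-MiniLM-L6-v2.onnx", "tokenizer.json")

# Target directory named in the usage note above.
DEFAULT_MODEL_DIR = Path.home() / ".tokenless" / "models"


def install_models(src_dir, dest_dir=DEFAULT_MODEL_DIR):
    """Copy both release files from src_dir into dest_dir (hypothetical helper).

    Raises FileNotFoundError if either file is missing, so that a partial
    install (model without tokenizer, or the reverse) is never left behind.
    """
    src_dir = Path(src_dir)
    dest_dir = Path(dest_dir)

    # Check for both files before copying anything.
    missing = [name for name in MODEL_FILES if not (src_dir / name).is_file()]
    if missing:
        raise FileNotFoundError(f"missing in {src_dir}: {', '.join(missing)}")

    dest_dir.mkdir(parents=True, exist_ok=True)

    installed = []
    for name in MODEL_FILES:
        target = dest_dir / name
        shutil.copy2(src_dir / name, target)  # copies contents and metadata
        installed.append(target)
    return installed


# Example call, assuming the files were saved to ~/Downloads:
#   install_models(Path.home() / "Downloads")
```

Checking for both files up front is a design choice of this sketch: the release describes the model and tokenizer as a pair, so copying only one of them is treated as an error.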