Skip to content

v0.9.0 — tree command, model2vec-rs integration, README rewrite

Choose a tag to compare

@johunsang johunsang released this 15 May 22:44
· 8 commits to main since this release

tree — codebase map without the ls -R token explosion

New semble_rs tree command prints a gitignore-aware codebase tree. Measured on real repos:

Project tree ls -R Reduction
this repo (Rust + target/) 533 B 398 KB 747×
6,693-file Python backend 3,950 B 254 KB 64×

Options: -d, --max-depth N, --symbols (top-level fn / struct / class per file), --lang rust,python.

encode — embedding model as a CLI

semble_rs encode "<text>" returns the Model2Vec embedding as JSON. Useful for scripting, debugging, and external pipelines.

semble_rs encode "search result scoring"
echo -e "auth\nlogin\ntoken" | semble_rs encode
semble_rs encode "x" --model minishlab/potion-multilingual-128M

model2vec-rs integration

The hand-rolled safetensors loader is replaced with model2vec-rs's StaticModel. SIF token weights are now applied (previously silently ignored). Drops 4 direct dependencies (tokenizers, safetensors, hf-hub, half) — encoder code shrinks from ~140 lines to ~60.

--model option on search / find-related / plan

semble_rs search "auth flow" . --model minishlab/potion-multilingual-128M

Priority: --model > SEMBLE_MODEL_PATH env > default (minishlab/potion-code-16M).

README rewrite

README.md and README.ko.md rewritten in upstream MinishLab/semble style: tagline + nav + feature sections + collapsible ranking signals + benchmark tables + acknowledgements.

Notes

  • All commands keep their existing flags; this release is additive.
  • Default model unchanged (minishlab/potion-code-16M).
  • 100-query self-benchmark unchanged.