v0.9.0 — tree command, model2vec-rs integration, README rewrite
tree — codebase map without the ls -R token explosion
New semble_rs tree command prints a gitignore-aware codebase tree. Measured on real repos:
| Project | tree |
ls -R |
Reduction |
|---|---|---|---|
this repo (Rust + target/) |
533 B | 398 KB | 747× |
| 6,693-file Python backend | 3,950 B | 254 KB | 64× |
Options: -d, --max-depth N, --symbols (top-level fn / struct / class per file), --lang rust,python.
encode — embedding model as a CLI
semble_rs encode "<text>" returns the Model2Vec embedding as JSON. Useful for scripting, debugging, and external pipelines.
semble_rs encode "search result scoring"
echo -e "auth\nlogin\ntoken" | semble_rs encode
semble_rs encode "x" --model minishlab/potion-multilingual-128Mmodel2vec-rs integration
The hand-rolled safetensors loader is replaced with model2vec-rs's StaticModel. SIF token weights are now applied (previously silently ignored). Drops 4 direct dependencies (tokenizers, safetensors, hf-hub, half) — encoder code shrinks from ~140 lines to ~60.
--model option on search / find-related / plan
semble_rs search "auth flow" . --model minishlab/potion-multilingual-128MPriority: --model > SEMBLE_MODEL_PATH env > default (minishlab/potion-code-16M).
README rewrite
README.md and README.ko.md rewritten in upstream MinishLab/semble style: tagline + nav + feature sections + collapsible ranking signals + benchmark tables + acknowledgements.
Notes
- All commands keep their existing flags; this release is additive.
- Default model unchanged (
minishlab/potion-code-16M). - 100-query self-benchmark unchanged.