Local semantic search for the command line. Store, search, and delete text using vector embeddings — no backend, no API keys, everything stays on your machine.
Semantic search tools today assume you have infrastructure: a vector database to run, an embedding API to call, credentials to manage. That's fine for production systems, but overkill when all you need is a lightweight way to index and recall text — especially for AI agents that benefit from long-term memory.
semlocal is a single binary you install with npm and run immediately. Embeddings are generated locally via ONNX Runtime, stored in a SQLite file, and searched with brute-force cosine similarity. No servers, no API keys, no Docker containers. Just a CLI that reads and writes to disk.
Use it to give agents persistent, searchable memory without any compute or infrastructure overhead.
Install with npm:

```
npm install -g semlocal
```

Write an entry:

```
semlocal write "Rust is a systems programming language focused on safety"
# prints: a1b2c3d4-e5f6-7890-abcd-ef1234567890
```

You can also pipe text from another command or a file:

```
cat README.md | semlocal write
echo "hello world" | semlocal write -
```

Search:

```
semlocal search "safe low-level language"
# [0.87] a1b2c3d4-... Rust is a systems programming language focused on safety
```

Return results as JSON:

```
semlocal search "safe low-level language" --json --top 3
```

Delete an entry by ID:

```
semlocal delete a1b2c3d4-e5f6-7890-abcd-ef1234567890
```

Entries are organized into collections. If not specified, all operations use the default collection. Use --collection to partition your data:
```
semlocal write "Rust is fast" --collection languages
semlocal search "performance" --collection languages
semlocal delete a1b2c3d4-... --collection languages
```

Collections are created implicitly on first write and removed automatically when their last entry is deleted.
By default the index is stored in .semlocal/ in the current working directory. Use --src to change this:

```
semlocal write "hello world" --src ~/my-index
semlocal search "greeting" --src ~/my-index
```

semlocal uses FastEmbed (ONNX Runtime) to generate 384-dimensional vector embeddings with the all-MiniLM-L6-v2 model. Embeddings are stored in a local SQLite database. Search is performed via brute-force cosine similarity over all stored entries.
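The brute-force search described above can be sketched in a few lines. The following Python snippet is an illustration of the technique, not semlocal's actual implementation; the `entries` data and the toy 3-dimensional vectors are hypothetical stand-ins for the real 384-dimensional embeddings loaded from SQLite:

```python
import math

def cosine(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def search(query_vec, entries, top=5):
    # entries: list of (id, vector) pairs, e.g. loaded from the index.
    # Score every stored vector against the query, keep the best `top`.
    scored = [(cosine(query_vec, vec), entry_id) for entry_id, vec in entries]
    scored.sort(reverse=True)
    return scored[:top]

# Hypothetical toy data standing in for stored embeddings.
entries = [
    ("rust-entry", [0.9, 0.1, 0.0]),
    ("cooking-entry", [0.0, 0.2, 0.9]),
]
print(search([1.0, 0.0, 0.0], entries, top=1))
# The "rust-entry" vector scores highest for this query.
```

Because every entry is scored on every query, search cost grows linearly with the number of stored entries, which is a reasonable trade-off for a local, infrastructure-free index.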
The embedding model (~25 MB) is downloaded automatically on first use and cached in ~/.semlocal/models/. Subsequent runs start instantly.
Pre-built binaries are provided for:
| OS | x64 | arm64 |
|---|---|---|
| Linux | ✓ | ✓ |
| macOS | ✓ | |
| Windows | ✓ | |
MIT