Rust-first LLM inference runtime.
Ocelotl is an early-stage workspace for a local LLM runtime with explicit model, loader, tokenizer, kernel, runtime, and serving boundaries. The first milestone is a narrow, correct single-process runtime before adding broad model coverage or high-scale serving features.
New contributors should start with docs/start-here.md.
Core orientation docs:
- Overview
- Architecture
- Crate Boundaries
- Interface Sketches
- Error Design
- Roadmap
- Milestone Task Backlog
- Model Target
- CI Policy
- TDD Policy
Workspace crates:
- `ocelotl-core`: shared types, errors, model metadata, and device contracts.
- `ocelotl-loader`: model artifact loading and validation.
- `ocelotl-tokenizer`: tokenizer and chat-template boundary.
- `ocelotl-kernels`: portable kernel dispatch boundary.
- `ocelotl-models`: model-family implementations.
- `ocelotl-runtime`: request lifecycle, KV cache, scheduling, and generation.
- `ocelotl-server`: API/server integration layer.
- `ocelotl`: root crate and CLI entrypoint.
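To make the layering concrete, here is a minimal sketch of how the tokenizer, model, and runtime boundaries might compose. All names here (`Tokenizer`, `Model`, `Runtime`, and the toy implementations) are hypothetical illustrations, not Ocelotl's actual public APIs, which are intentionally still small.

```rust
use std::cell::Cell;

// Hypothetical tokenizer boundary (ocelotl-tokenizer): text <-> token ids.
trait Tokenizer {
    fn encode(&self, text: &str) -> Vec<u32>;
    fn decode(&self, ids: &[u32]) -> String;
}

// Hypothetical model boundary (ocelotl-models): one decode step over the sequence.
trait Model {
    fn next_token(&self, context: &[u32]) -> Option<u32>;
}

// Hypothetical runtime boundary (ocelotl-runtime): drives the generation loop.
struct Runtime<T: Tokenizer, M: Model> {
    tokenizer: T,
    model: M,
    max_new_tokens: usize,
}

impl<T: Tokenizer, M: Model> Runtime<T, M> {
    fn generate(&self, prompt: &str) -> String {
        let mut ids = self.tokenizer.encode(prompt);
        for _ in 0..self.max_new_tokens {
            match self.model.next_token(&ids) {
                Some(id) => ids.push(id),
                None => break, // model signalled end of sequence
            }
        }
        self.tokenizer.decode(&ids)
    }
}

// Toy stand-ins so the sketch runs end to end.
struct ByteTokenizer;
impl Tokenizer for ByteTokenizer {
    fn encode(&self, text: &str) -> Vec<u32> {
        text.bytes().map(u32::from).collect()
    }
    fn decode(&self, ids: &[u32]) -> String {
        ids.iter().map(|&id| id as u8 as char).collect()
    }
}

// Emits a fixed token sequence, then stops.
struct FixedModel {
    reply: Vec<u32>,
    pos: Cell<usize>,
}
impl Model for FixedModel {
    fn next_token(&self, _context: &[u32]) -> Option<u32> {
        let id = self.reply.get(self.pos.get()).copied();
        self.pos.set(self.pos.get() + 1);
        id
    }
}

fn main() {
    let runtime = Runtime {
        tokenizer: ByteTokenizer,
        model: FixedModel {
            reply: b"!".iter().map(|&b| u32::from(b)).collect(),
            pos: Cell::new(0),
        },
        max_new_tokens: 8,
    };
    assert_eq!(runtime.generate("hi"), "hi!");
}
```

The point of the sketch is the direction of dependencies: the runtime owns the generation loop and talks to the tokenizer and model only through narrow trait boundaries, which is the shape the crate split above is meant to enforce.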
Development workflow:

```sh
cargo fmt --all
cargo check --workspace
cargo test --workspace
```

This is a project skeleton. Public APIs are intentionally small while the runtime shape is established.