Cognis is a Rust-native framework for building LLM-powered applications — chains, agents, RAG pipelines, and stateful workflows. If you've used LangChain in Python, this is the same mental model with Rust's performance and compile-time guarantees.
- Compile-time safety — Tool schemas, message types, and state transitions are checked before your code runs. No more runtime surprises.
- Pay only for what you use — LLM providers are behind feature flags. Your binary doesn't include OpenAI code if you only use Anthropic.
- Async-native streaming — Built on `tokio` with `futures::Stream`. Stream tokens from any provider with the same API.
- Production patterns included — Circuit breakers, retry with backoff, rate limiting, PII redaction, and human-in-the-loop are built into the middleware pipeline.
- One workspace, full stack — Chains, agents, RAG, graph workflows, and high-level agent orchestration all live together with strict dependency boundaries.
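The retry-with-backoff pattern mentioned above can be sketched in a few lines of plain Rust. This is an illustration of the idea only, not Cognis's middleware API: delays double on each failed attempt, capped at a maximum.

```rust
use std::time::Duration;

/// Compute capped exponential-backoff delays: base, 2x base, 4x base, ...
/// clamped to `max_ms`. A hypothetical helper for illustration.
fn backoff_delays(base_ms: u64, max_ms: u64, attempts: u32) -> Vec<Duration> {
    (0..attempts)
        .map(|n| Duration::from_millis((base_ms << n).min(max_ms)))
        .collect()
}

fn main() {
    // 100ms, 200ms, 400ms, 800ms, then capped at 1s.
    for d in backoff_delays(100, 1000, 5) {
        println!("wait {:?}", d);
    }
}
```

A production middleware would typically also add jitter and wire these delays into the retry loop; this sketch only shows the capped doubling.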
Add to your Cargo.toml:
```toml
[dependencies]
cognis = { version = "0.1", features = ["openai"] }
cognis-core = "0.1"
tokio = { version = "1", features = ["full"] }
serde_json = "1"
```

```rust
use std::sync::Arc;

use serde_json::json;

use cognis_core::chain;
use cognis_core::language_models::{ChatModelRunnable, FakeListChatModel};
use cognis_core::output_parsers::StrOutputParser;
use cognis_core::prompts::ChatPromptTemplate;
use cognis_core::runnables::Runnable;

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    let prompt = ChatPromptTemplate::from_messages(vec![
        ("system", "You are a helpful assistant."),
        ("human", "Explain {topic} in one sentence."),
    ])?;

    let model = FakeListChatModel::new(vec![
        "Rust is a systems language focused on safety and speed.".into(),
    ]);

    let chain = chain!(
        prompt,
        ChatModelRunnable::new(Arc::new(model)),
        StrOutputParser
    )?;

    let result = chain.invoke(json!({"topic": "Rust"}), None).await?;
    println!("{}", result.as_str().unwrap());
    Ok(())
}
```

Swap `FakeListChatModel` for `ChatOpenAI`, `ChatAnthropic`, `ChatGoogleGenAI`, or `ChatOllama` to make real LLM calls.
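The `chain!` pipeline above relies on every stage exposing a common runnable interface, so the compiler can check that each stage's output feeds the next. Here is a deliberately simplified, self-contained sketch of that composition idea; it is not Cognis's actual `Runnable` trait, which is async and fallible.

```rust
// A toy, synchronous stand-in for the runnable interface.
trait Step {
    fn invoke(&self, input: String) -> String;
}

struct Uppercase;
impl Step for Uppercase {
    fn invoke(&self, input: String) -> String {
        input.to_uppercase()
    }
}

struct Exclaim;
impl Step for Exclaim {
    fn invoke(&self, input: String) -> String {
        format!("{input}!")
    }
}

/// Run steps left to right, feeding each output into the next step.
fn chain(steps: &[&dyn Step], input: String) -> String {
    steps.iter().fold(input, |acc, step| step.invoke(acc))
}

fn main() {
    let out = chain(&[&Uppercase, &Exclaim], "hello".into());
    println!("{out}"); // HELLO!
}
```

In the real framework the types flowing between stages differ per stage, which is exactly what gets verified at compile time rather than at runtime.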
Build multi-step workflows with conditional branching, checkpointing, and human-in-the-loop:
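The routing decision behind a conditional branch reduces to a plain function of the current state that names the next node. A self-contained sketch of that idea in plain Rust (hypothetical, not cognisgraph's API):

```rust
// Inspect the state and pick the next node by name.
// The node names here are illustrative.
fn route(input: &str) -> &'static str {
    if input.contains("error") {
        "issue_handler"
    } else {
        "general_handler"
    }
}

fn main() {
    println!("{}", route("There is an error")); // issue_handler
    println!("{}", route("hello"));             // general_handler
}
```

The full example below applies the same check inside its `classify` node.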
```rust
use std::sync::Arc;

use serde_json::{json, Value};

use cognisgraph::graph::state::{AsyncNodeAction, StateGraph};

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    let classify: AsyncNodeAction = Arc::new(|state: Value| {
        Box::pin(async move {
            let input = state["input"].as_str().unwrap_or("");
            let category = if input.contains("error") { "issue" } else { "general" };
            Ok(json!({ "category": category }))
        })
    });

    let respond: AsyncNodeAction = Arc::new(|state: Value| {
        Box::pin(async move {
            let cat = state["category"].as_str().unwrap_or("unknown");
            Ok(json!({ "response": format!("Handling as: {cat}") }))
        })
    });

    let graph = StateGraph::new()
        .add_node("classify", classify)
        .add_node("respond", respond)
        .add_edge("__start__", "classify")
        .add_edge("classify", "respond")
        .add_edge("respond", "__end__")
        .compile()?;

    let result = graph.invoke(json!({ "input": "There is an error" })).await?;
    println!("{}", result);
    Ok(())
}
```

Load documents, split them into overlapping chunks, embed them, and run a similarity search:

```rust
use cognis::document_loaders::text::TextLoader;
use cognis::text_splitter::{RecursiveCharacterTextSplitter, TextSplitter};
use cognis_core::document_loaders::BaseLoader;
use cognis_core::vectorstores::{base::VectorStore, in_memory::InMemoryVectorStore};

let docs = TextLoader::new("data.txt").load().await?;

let splitter = RecursiveCharacterTextSplitter::new()
    .with_chunk_size(500)
    .with_chunk_overlap(50);
let chunks = splitter.split_documents(&docs);

// `embedding_model` is any configured embeddings implementation.
let store = InMemoryVectorStore::new(embedding_model);
store.add_documents(chunks, None).await?;

let results = store.similarity_search("your question", 3).await?;
```

Stream tokens as they arrive, with the same API for every provider:

```rust
use futures::StreamExt;

use cognis_core::language_models::chat_model::BaseChatModel;
use cognis_core::messages::{HumanMessage, Message};

let messages = vec![Message::Human(HumanMessage::new("Tell me a story"))];
let mut stream = model._stream(&messages, None).await?;

while let Some(chunk) = stream.next().await {
    print!("{}", chunk?.message.base.content.text());
}
```

| Layer | Crate | What it does |
|---|---|---|
| Foundation | `cognis-core` | Base traits (`ChatModel`, `Tool`, `Runnable`, `VectorStore`), message types, prompt templates, output parsers, callbacks |
| Implementation | `cognis` | 5 LLM providers, 19 chain types, 14 retrievers, 11 memory types, 6 vector stores, document loaders, text splitters, tools |
| Orchestration | `cognisgraph` | State graphs, Pregel execution engine, checkpointing (SQLite/Postgres), streaming, human-in-the-loop, subgraph composition |
| Application | `cognisagent` | Zero-boilerplate agent factory, middleware pipeline, sandboxed execution, planning, plugins, workflow engine |
Enable only what you need via feature flags:
```toml
# Pick your providers
cognis = { version = "0.1", features = ["anthropic", "openai"] }

# Or enable everything
cognis = { version = "0.1", features = ["all-providers"] }

# Graph workflows with persistence
cognisgraph = { version = "0.1", features = ["sqlite"] }
```

| Flag | Provider |
|---|---|
| `openai` | OpenAI GPT models + embeddings |
| `anthropic` | Anthropic Claude models + embeddings |
| `google` | Google Gemini models + embeddings |
| `ollama` | Ollama local models + embeddings |
| `azure` | Azure OpenAI |
| `qdrant` / `pinecone` / `weaviate` / `chroma` / `faiss` | Vector store backends |
| `sqlite` / `postgres` | Checkpoint persistence (cognisgraph) |
All examples work without API keys using mock models:
```shell
git clone https://github.com/0xvasanth/cognis.git
cd cognis

cargo run --example simple_chain           # Basic chain composition
cargo run --example tool_agent             # Agent with tool calling
cargo run --example rag_pipeline           # Full RAG pipeline
cargo run --example cognisgraph_agent      # Stateful graph agent
cargo run --example streaming              # Token streaming
cargo run --example graph_with_checkpoints # Persistent graph workflows
cargo run --example memory_types           # Conversation memory
cargo run --example plan_and_execute       # Planning agent
```

See the `examples/` directory for 35+ runnable demos.
See CONTRIBUTING.md for guidelines, project structure, and conventions.
Cognis is heavily inspired by the LangChain, LangGraph, and DeepAgents Python ecosystem. Huge thanks to the LangChain team for pioneering the composable LLM framework paradigm — their design patterns, abstractions, and developer experience were the foundation that made this Rust port possible.
MIT