Persistent long-term memory for LLM agents and AI chatbots.
Give your AI agents the ability to remember across conversations — with hybrid vector search, semantic memory, episodic recall, and automatic summarization.
Docs · npm · Report Bug · Request Feature
Every LLM call is stateless by default. Your AI agent forgets everything the moment the conversation ends. agent-memory gives it a real, persistent, searchable memory — the same way humans remember things.
User: "What programming language do I prefer?"
❌ Without agent-memory → "I don't have that information."
✅ With agent-memory → "You prefer TypeScript — you mentioned it earlier."
Works with any LLM — OpenAI, Anthropic Claude, Google Gemini, NVIDIA NIM, Mistral, Cohere, and local models via Ollama.
- 🔍 Hybrid vector + semantic search — `score = 0.6·similarity + 0.3·recency + 0.1·importance` (see the scoring sketch below)
- 🧩 3-layer memory model — episodic (conversations) + semantic (facts) + summary (compressed history)
- 🏭 9 built-in providers — OpenAI, NVIDIA, Mistral, Azure, Cohere, Google Gemini, Anthropic, Voyage AI, Ollama
- 📦 Pluggable storage — in-memory, SQLite (serverless/edge), PostgreSQL + pgvector (production)
- ⚡ Auto-embedding — entries are embedded on store; `recall()` works without vectors too
- 🔧 Middleware pattern — wrap any agent function with `withMemory()` for automatic memory injection
- ⚛️ React hook — `useMemory()` with loading/error state for chat UIs
- 🛡️ TypeScript-first — fully typed, ESM + CJS builds, zero `any`
- 🌐 Provider-agnostic — bring your own `embedFn` for any embedding API
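The hybrid score is a weighted blend of three signals. The sketch below shows one way such a score could be computed; the 0.6/0.3/0.1 weights come from the formula above, but the recency decay curve and field names are illustrative assumptions, not the package's internals.

```ts
// Illustrative sketch of hybrid scoring. The weights match the documented
// formula; the exponential recency decay and field names are assumptions.
interface ScoredItem {
  similarity: number; // cosine similarity to the query embedding (0..1)
  createdAt: number;  // epoch milliseconds when the item was stored
  importance: number; // importance supplied at store time (0..1)
}

function hybridScore(item: ScoredItem, now = Date.now()): number {
  const ageDays = (now - item.createdAt) / 86_400_000;
  const recency = Math.pow(0.5, ageDays / 7); // assumed ~7-day half-life
  return 0.6 * item.similarity + 0.3 * recency + 0.1 * item.importance;
}
```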
```
User message
      │
      ▼
┌────────────────────────────────────────────────────────┐
│                      AgentMemory                       │
│                                                        │
│  ┌──────────────┐  ┌─────────────┐  ┌──────────────┐   │
│  │   Episodic   │  │  Semantic   │  │   Summary    │   │
│  │  (messages)  │  │   (facts)   │  │ (compressed) │   │
│  └──────────────┘  └─────────────┘  └──────────────┘   │
│                                                        │
│  score = 0.6 × vector_similarity                       │
│        + 0.3 × recency_decay                           │
│        + 0.1 × importance                              │
└──────────────────────┬─────────────────────────────────┘
                       │  MemoryAdapter interface
        ┌──────────────┼──────────────┐
        ▼              ▼              ▼
    InMemory        SQLite        Postgres
   (dev/test)    (edge/local)   (production)
```
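Every storage backend plugs in behind the `MemoryAdapter` interface shown above. The actual contract ships with the core package; the sketch below only illustrates the general shape such an adapter needs (method names and signatures here are assumptions, not the real API).

```ts
// Hypothetical adapter contract, inferred from the architecture diagram.
// The real MemoryAdapter interface in the core package may differ.
interface MemoryItem {
  id: string;
  sessionId: string;
  kind: "entry" | "fact" | "summary";
  content: string;
  embedding?: number[];
  importance?: number;
  createdAt: number;
}

interface MemoryAdapter {
  put(item: MemoryItem): Promise<void>;
  // Return candidates for hybrid scoring. Postgres can do native ANN via
  // pgvector; the in-memory and SQLite adapters fall back to JS cosine similarity.
  query(sessionId: string, embedding?: number[], topK?: number): Promise<MemoryItem[]>;
  stats(sessionId: string): Promise<{ total: number; byKind: Record<string, number> }>;
  clear(sessionId: string): Promise<void>;
}
```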
| Package | Description | Version |
|---|---|---|
| `@namitjain.india/agent-memory` | Core engine + providers + in-memory adapter | |
| `@namitjain.india/agent-memory-sqlite` | SQLite adapter (serverless, edge, local) | |
| `@namitjain.india/agent-memory-postgres` | PostgreSQL + pgvector adapter (production ANN) | |
| `@namitjain.india/agent-memory-react` | React hook for chat UIs | |
```bash
npm install @namitjain.india/agent-memory
```

```ts
import { createProvider, withMemory } from "@namitjain.india/agent-memory";
import OpenAI from "openai";
const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
// Pick your provider — works with any OpenAI-compatible API
const provider = createProvider("openai", { apiKey: process.env.OPENAI_API_KEY });
const runAgent = withMemory(
async (messages) => {
const res = await client.chat.completions.create({ model: "gpt-4o-mini", messages });
return res.choices[0]?.message?.content ?? "";
},
{ embedding: provider, sessionId: "user-123" }
);
// Turn 1 — agent remembers this
await runAgent([{ role: "user", content: "My name is Alex and I prefer TypeScript." }]);
// Turn 2 — agent recalls from memory
const reply = await runAgent([{ role: "user", content: "What's my name?" }]);
// → "Your name is Alex!"import { AgentMemory, createProvider } from "@namitjain.india/agent-memory";
const provider = createProvider("openai", { apiKey: process.env.OPENAI_API_KEY });
const memory = new AgentMemory({
embedding: provider,
retrieval: { topK: 5, weights: { similarity: 0.6, recency: 0.3, importance: 0.1 } },
summarisation: { maxTurns: 30, keepRecentTurns: 10, summariseFn: provider.summarise }
});
// Store a semantic fact
await memory.remember({ kind: "fact", sessionId: "s1", key: "language", value: "TypeScript", importance: 1 });
// Store a conversation entry (auto-embedded)
await memory.remember({ role: "user", content: "I build AI agents for a living", sessionId: "s1" });
// Hybrid recall — vector + recency + importance
const results = await memory.recall("What does the user do professionally?", {
sessionId: "s1",
topK: 3,
filter: (item) => item.kind === "fact" // optional predicate
});
// Session stats
const stats = await memory.stats("s1");
// → { total: 2, byKind: { entry: 1, fact: 1, summary: 0 } }
```

In React apps, the `useMemory()` hook exposes the same operations with loading and error state:

```tsx
import { useMemory } from "@namitjain.india/agent-memory-react";
function Chat() {
const { remember, recall, inject, clearSession, isLoading, error } = useMemory("session-1");
const handleSend = async (text: string) => {
await remember({ role: "user", content: text });
const context = await recall(text, { topK: 4 });
// build your prompt with context...
};
return <button onClick={() => clearSession()}>Reset Memory</button>;
}
```

Pick any API with `createProvider()` — zero extra dependencies, pure `fetch`.

```ts
import { createProvider } from "@namitjain.india/agent-memory";
// OpenAI
const p = createProvider("openai", { apiKey: process.env.OPENAI_API_KEY });
// NVIDIA NIM (OpenAI-compatible)
const p = createProvider("nvidia", { apiKey: process.env.NVIDIA_API_KEY });
// Google Gemini
const p = createProvider("google", { apiKey: process.env.GOOGLE_API_KEY });
// Anthropic Claude (summarisation only)
const p = createProvider("anthropic", { apiKey: process.env.ANTHROPIC_API_KEY });
// Cohere
const p = createProvider("cohere", { apiKey: process.env.COHERE_API_KEY });
// Mistral
const p = createProvider("mistral", { apiKey: process.env.MISTRAL_API_KEY });
// Voyage AI (embeddings only)
const p = createProvider("voyage", { apiKey: process.env.VOYAGE_API_KEY });
// Azure OpenAI
const p = createProvider("azure", {
apiKey: process.env.AZURE_OPENAI_KEY,
endpoint: "https://my-resource.openai.azure.com",
embeddingDeployment: "text-embedding-3-small",
chatDeployment: "gpt-4o-mini"
});
// Ollama (local — no API key needed)
const p = createProvider("ollama");| Provider | Embeddings | Summarise | Default models |
|---|---|---|---|
openai |
✅ | ✅ | text-embedding-3-small + gpt-4o-mini |
nvidia |
✅ | ✅ | nv-embedqa-e5-v5 + llama-3.1-8b-instruct |
google |
✅ | ✅ | text-embedding-004 + gemini-1.5-flash |
anthropic |
— | ✅ | claude-3-5-haiku-20241022 |
cohere |
✅ | ✅ | embed-english-v3.0 + command-r-plus |
mistral |
✅ | ✅ | mistral-embed + mistral-small-latest |
azure |
✅ | ✅ | deployment-based |
voyage |
✅ | — | voyage-3 |
ollama |
✅ | ✅ | nomic-embed-text + llama3.2 |
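The feature list also mentions bringing your own `embedFn` for any embedding API. Below is a minimal sketch of such a function, written against a generic OpenAI-style embeddings endpoint; the endpoint URL, model name, env var, and the exact option it gets passed to are assumptions, so check the docs for the real hook-up point.

```ts
// Hand-rolled embedding function for an OpenAI-compatible endpoint.
// The URL, model name, and env var are placeholders; how it is passed to
// agent-memory (e.g. an `embedFn` option) should be confirmed in the docs.
async function embedFn(texts: string[]): Promise<number[][]> {
  const res = await fetch("https://api.example.com/v1/embeddings", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.MY_EMBEDDING_API_KEY}`,
    },
    body: JSON.stringify({ model: "my-embedding-model", input: texts }),
  });
  if (!res.ok) throw new Error(`Embedding request failed: ${res.status}`);
  const data = await res.json();
  // OpenAI-style response shape: { data: [{ embedding: number[] }, ...] }
  return data.data.map((d: { embedding: number[] }) => d.embedding);
}
```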
| Adapter | Best for | Vector search |
|---|---|---|
| `InMemoryAdapter` (built-in) | Development, testing, serverless functions | JS cosine similarity |
| `SQLiteAdapter` | Edge runtimes, local apps, single-server | JS cosine similarity |
| `PostgresAdapter` | Production at scale, multi-tenant | pgvector HNSW (native ANN) |
- 🤖 AI chatbots — remember user preferences, names, and past conversations across sessions
- 🧑💼 Personal AI assistants — retain facts about the user over weeks and months
- 🔍 RAG pipelines — augment LLM context with semantically relevant prior interactions
- 🕹️ AI game NPCs — persistent NPC memory of player interactions
- 📞 Customer support bots — recall ticket history and user issues automatically
- 🔬 Research agents — long-running autonomous agents that accumulate knowledge
- 📝 Writing assistants — remember document context, user style, and prior drafts
| | Raw vector DB | agent-memory |
|---|---|---|
| Hybrid scoring (recency + importance) | ❌ | ✅ |
| 3-layer memory model (episodic / semantic / summary) | ❌ | ✅ |
| Auto-summarisation of old turns | ❌ | ✅ |
| Provider selection in 1 line | ❌ | ✅ |
| Works without embeddings (graceful degradation) | ❌ | ✅ |
| TypeScript-first, framework-agnostic | varies | ✅ |
| React hook included | ❌ | ✅ |
```bash
git clone https://github.com/Namitjain07/agent-memory.git
cd agent-memory
npm install

npm test        # 46 tests across 2 suites
npm run build   # builds all 4 packages
```

```bash
# Set your API key — no key is stored in the repo
$env:NVIDIA_API_KEY = "your-key-here"    # PowerShell
export NVIDIA_API_KEY="your-key-here"    # bash

node packages/agent-memory/tests/integration-nvidia.mjs
```

See CONTRIBUTING.md. PRs welcome!
MIT © Namit Jain