MemoryArena

Minimal, modular framework for connecting agents, environments, and memory.

Implementation for MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks (https://arxiv.org/abs/2602.16313).

This code is preview version. We are still actively maintaining and improving this codebase.

Quick Start

[Important] Check setup_web_shopping.md, setup_travel.md, setup_web_search_env.md, and setup_formal_reasoning.mdto follow step-by-step structions for each test environment.

Repository Structure

agent/: task agent implementations.
env/: environment server, client, and environment systems.
memory/: memory client and memory systems including long-context, letta, mirix, mem0, mem0-g, ReasoningBank, BM25, Text-embedding RAG, GraphRAG, and MemoRAG.

API Keys

Make sure you have your OPENAI_API_KEY, OPENAI_BASE_URL, GOOGLE_API_KEY, ANTHROPIC_API_KEY, OPENROUTER_API_KEY, OPENROUTER_API_BASE_URL set ready in either your bashrc file or in configs following each setup md.
For Letta, Mirix, Mem0 (including Mem0-g), make sure you have their memory system api keys ready LETTA_API_KEY, MIRIX_API_KEY, MEM0_API_KEY in your bashrc file.

Example Flow

Task prompt → memory wraps prompt
Agent generates action
Env step() executes tool or accepts final
Observation + reward returned
Memory stores action/observation/reward(optional)

Cite Our Paper:

If you are using this repo, please cite our paper at:

@article{he2026memoryarena,
  title={MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks},
  author={He, Zexue and Wang, Yu and Zhi, Churan and Hu, Yuanzhe and Chen, Tzu-Ping and Yin, Lang and Chen, Ze and Wu, Tong Arthur and Ouyang, Siru and Wang, Zihan and others},
  journal={arXiv preprint arXiv:2602.16313},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
agent		agent
configs		configs
env		env
memory		memory
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
example_travel_planner.py		example_travel_planner.py
run_math.py		run_math.py
run_search.py		run_search.py
run_shopping.py		run_shopping.py
run_travel.py		run_travel.py
setup_formal_reasoning.md		setup_formal_reasoning.md
setup_travel.md		setup_travel.md
setup_web_search_env.md		setup_web_search_env.md
setup_web_shopping.md		setup_web_shopping.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MemoryArena

Quick Start

Repository Structure

API Keys

Example Flow

Cite Our Paper:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MemoryArena

Quick Start

Repository Structure

API Keys

Example Flow

Cite Our Paper:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages