Awesome Agent Memory Papers

A curated list of papers on memory for LLM / multimodal agents — methods, benchmarks, and surveys — covering episodic, semantic, procedural, and multimodal memory, with both parametric (internal) and retrieval-based (external) storage, learned via prompting, supervised finetuning, or reinforcement learning.

90 papers · 7 surveys · 31 benchmarks · 52 methods · last updated 2026-04-21

Interactive dashboard with multi-tag filtering: https://yyyujintang.github.io/Awesome-Agent-Memory-Papers/

Contributions welcome — open an issue or PR with new papers.

Surveys
Benchmarks
- QA-based Memory Evaluation (5)
- Web Navigation (7)
- Desktop / Mobile GUI (6)
- Embodied & Game Environments (6)
- General Long-Horizon / Office (7)
Methods
- Multimodal Memory (16)
- Procedural Memory (10)
- Episodic Memory (18)
- Semantic Memory (2)
- Internal / Parametric Memory (4)
- Other Methods (2)
Tag Legend

Surveys

Rethinking Memory Mechanisms of Foundation Agents in the Second Half
2026-01-14 · Jiawei Han, Philip Yu
Survey
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
2025-12-29 · [code]
Survey
Memory in the Age of AI Agents
2025-12-15 · Shuicheng Yan, Guibin Zhang · [code]
Survey
Measuring Agents in Production
2025-12-02 · Shuicheng Yan, Guibin Zhang
Survey
Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook
2025-03-23 · Xuming Hu
Survey
Episodic memory in AI agents poses risks that should be studied and mitigate
2025-01-20
Survey
A Survey on the Memory Mechanism of Large Language Model based Agents
2024-04-21
Survey

Benchmarks

Evaluation suites for agent memory, split by interaction mode.

Methods

Each paper is placed in exactly one primary section (Multimodal > Procedural > Episodic > Semantic > External > Internal). Tag badges on each entry show the full tag vector — use the website for true multi-axis filtering.

Multimodal Memory

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory
2026-04-01
Method External Prompt-based Episodic Multimodal Procedural Semantic
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models
2026-01-27 · Mingsheng Long Bytedance Seed
Method Internal SFT Multimodal
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
2026-01-26
Method External Prompt-based Episodic Multimodal
MemVerse: Multimodal Memory for Lifelong Learning Agents
2025-12-03 · [code]
Method External Prompt-based Episodic Multimodal Procedural Semantic
ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
2025-11-26 · CVPR26 · [code]
Method External Multimodal Semantic
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
2025-11-25 · CVPR26
Method External Prompt-based Multimodal Procedural
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
2025-11-14 · Shuicheng Yan
Method Internal SFT Multimodal
VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
2025-10-19 · NeurIPS25 · [code]
Method Internal RL-based Episodic Multimodal
VideoLucy: Deep Memory Backtracking for Long Video Understanding
2025-10-14 · NeurIPS25
Method External SFT Episodic Multimodal
(M3-Agent) Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
2025-08-13 · ICLR26 · ByteDance Seed · [code]
Method External SFT Episodic Multimodal Semantic
MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling
2025-08-11
Method External Prompt-based Episodic Multimodal
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Token
2025-06-20 · Chuang Gan · [code]
Method Internal Multimodal
3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model
2025-05-28
Method External Prompt-based Episodic Multimodal Semantic
Towards General Continuous Memory for Vision-Language Models
2025-05-23 · NeurIPS25
Method External Internal SFT Episodic Multimodal Semantic
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation
2025-01-30 · [code]
Method External Prompt-based Episodic Multimodal
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
2024-08-07 · NeurIPS24 · [code]
Method External Prompt-based Multimodal

Procedural Memory

A Subgoal-driven Framework for Improving Long-Horizon LLM Agents
2026-03-20
Method External Prompt-based Training-free Episodic Procedural
Plan-MCTS: Plan Exploration for Action Exploitation in Web Navigation
2026-02-15 · Weinan Zhang
Method RL-based Procedural
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
2026-02-02
Method External RL-based Procedural
TokMem: Tokenized Procedural Memory for Large Language Models
2025-10-01
Method Internal SFT Procedural
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
2025-09-29 · ICLR26 · Siru Ouyang
Method External Prompt-based Episodic Procedural
Memory Management and Contextual Consistency for Long-Running Low-Code Agents
2025-09-27
Method External Prompt-based Episodic Procedural
Memory OS of AI Agent
2025-05-30 · EMNLIP25 Main
Method External Prompt-based Episodic Procedural Semantic
A-MEM: Agentic Memory for LLM Agents
2025-02-17 · NeurIPS25 · [code]
Method External Prompt-based Episodic Procedural Semantic
Agent Workflow Memory (AWM)
2024-09-11 · ICML26 · [code]
Method External Prompt-based Procedural
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
2023-06-13 · [code]
Method External Prompt-based Episodic Procedural

Episodic Memory

Gated Memory Policy
2026-04-21 · Shuran Song
Method Internal RL-based Episodic
HiGMem: A Hierarchical and LLM-Guided Memory System for Long-Term Conversational Agents
2026-04-20
Method External Prompt-based Training-free Episodic Semantic
PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents
2026-02-23 · [code]
Method External Prompt-based Training-free Episodic Semantic
Modeling Distinct Human Interaction in Web Agents
2026-02-19
Method External Prompt-based Episodic
REMem: Reasoning with Episodic Memory in Language Agent
2026-02-13 · Yu Su, Huan Sun
Method External Prompt-based Episodic
TraceMem: Weaving Narrative Memory Schemata from User Conversational Traces
2026-02-10 · HKU
Method External Prompt-based Episodic Semantic
Learning to Continually Learn via Meta-learning Agentic Memory Designs
2026-02-08 · [code]
Method External RL-based Episodic
Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory
2026-01-27
Method External Prompt-based Episodic
CAST: Character-and-Scene Episodic Memory for Agents
2026-01-14
Method External Prompt-based Episodic
SimpleMem: Efficient Lifelong Memory for LLM Agents
2026-01-05
Method External Prompt-based Episodic Semantic
Hindsight is 20/20: Building Agent Memory that Retains, Recalls, and Reflects
2025-12-14
Method External Prompt-based Episodic Semantic
A neural network model of free recall learns multiple memory strategies
2025-09-25 · [code]
Method Internal Episodic
PRIME: Large Language Model Personalization with Cognitive Dual-Memory and Personalized Thought Process
2025-07-07 · EMNLP25, Main
Method External Prompt-based Episodic Semantic
Ella: Embodied Social Agents with Lifelong Memory
2025-06-30 · Chuang Gan
Method External Prompt-based Episodic Semantic
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
2025-04-28
Method External Prompt-based Episodic Semantic
R3Mem: Bridging Memory Retention and Retrieval via Reversible Compressio n
2025-02-21
Method External Prompt-based Episodic
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
2024-05-23 · NeurIPS24 · Yu Su · [code]
Method External Prompt-based Training-free Episodic Semantic
MemoryBank: Enhancing Large Language Models with Long-Term Memory
2023-05-17
Method External Prompt-based Episodic Semantic

Semantic Memory

Explicit v.s. Implicit Memory: Exploring Multi-hop Complex Reasoning Over Personalized Information
2025-08-15 · SIGKDD 26 · Zeyu Zhang
Method External Internal Prompt-based Semantic
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models (HippoRAG 2)
2025-02-20 · ICML25 · [code]
Method External Internal Prompt-based Semantic

Internal / Parametric Memory

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
2026-02-11 · Bytedance Seed
Method Internal SFT
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management
2025-12-25
Method Internal SFT
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents
2025-09-29 · Shuicheng Yan, Guibin Zhang · [code]
Method Internal
Scaling Test-time Compute for LLM Agents
2025-06-15 · ICLR26
Method Internal Prompt-based

Other Methods

Agentic Reasoning for Large Language Models
2026-01-18 · Heng Ji
Method Prompt-based Training-free
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
2025-10-05 · Jie Tang
Method RL-based

Tag Legend

Axis	Values
Category	`Survey` · `Benchmark` · `Method`
Benchmark Type	`QA` · `Web` · `GUI` · `Embodied` · `Long-Horizon`
Storage	`Internal` (parametric — weights / latent tokens) · `External` (non-parametric — retrieval)
Learning	`Prompt-based` · `RL-based` · `SFT` · `Training-free`
Memory Type	`Episodic` · `Semantic` · `Procedural` · `Multimodal`

Citation

If this list is useful in your work, please consider starring the repo.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome Agent Memory Papers

Contents

Surveys

Benchmarks

QA-based Memory Evaluation

Web Navigation

Desktop / Mobile GUI

Embodied & Game Environments

General Long-Horizon / Office

Methods

Multimodal Memory

Procedural Memory

Episodic Memory

Semantic Memory

Internal / Parametric Memory

Other Methods

Tag Legend

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Awesome Agent Memory Papers

Contents

Surveys

Benchmarks

QA-based Memory Evaluation

Web Navigation

Desktop / Mobile GUI

Embodied & Game Environments

General Long-Horizon / Office

Methods

Multimodal Memory

Procedural Memory

Episodic Memory

Semantic Memory

Internal / Parametric Memory

Other Methods

Tag Legend

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages