Figure 1: Roadmap of foundation agent memory (2023–2025)
- 🎉 2026-02-09 — Our paper is now available on arXiv! Check it out: Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey.
- 🚀 2026-01-14 — Repository initialized with paper list, taxonomy figures, and full contents.
As AI enters the second half, the core challenge shifts from chasing benchmark gains to delivering real utility in long-horizon, dynamic, and user-dependent environments—where agents face context explosion and must continuously accumulate, manage, and selectively reuse information across extended interactions.
This repository accompanies the survey Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey. The survey is based on a systematic literature collection and curates 218 key articles published between 2023 Q1 and 2025 Q4, and organizes foundation-agent memory via a unified taxonomy along three core design dimensions: memory substrates, cognitive mechanisms, and memory subjects. From a system perspective, it further analyzes memory operations in single-agent and multi-agent settings, highlights the growing role of learning memory policies, discusses scaling with context length and environment complexity, reviews evaluation metrics and benchmarks, and outlines six open challenges to guide next generation memory systems design.
💡 We will continuously update this repository with newly released papers and resources. Contributions and open new issues are highly welcome.
We categorize foundation agent memory along three orthogonal perspectives in Figure 2: Memory Substrate, Memory Cognitive Mechanism, and Memory Subject.
Figure 2: Taxonomy of Foundation Agent Memory — organized by (1) Memory Substrate (internal and external),
(2) Memory Cognitive Mechanism (episodic, semantic, sensory, working, procedural), and
(3) Memory Subject (user-centric and agent-centric).
- External memory: non-parametric stores (e.g., databases, vector stores, logs) that can be written/read by the agent.
- Internal memory: information internalized into model states or parameters.
- Sensory memory: captures high-frequency, immediate or time-sensitive signals from recent inputs for rapid perception and filtering.
- Working memory: maintains short-term, task-relevant variables (goals, intermediate states, tool results) to support ongoing reasoning and action.
- Episodic memory: stores time-series interaction traces and experiences for later recall in similar situations.
- Semantic memory: abstracts stable facts and concepts from experiences/knowledge sources to enable generalization beyond specific episodes.
- Procedural memory: encodes reusable skills, routines, and action policies that improve how the agent acts over time.
- User-centric memory: persistent user facts, preferences, and interaction history for personalization.
- Agent-centric memory: the agent’s own experience/trajectories/skills for task performance and self-improvement.
For memory operations & management, learning policies, scalability, and evaluation, please refer to Sections 4–7 of our survey.
Foundation agent memory is a key component for long-horizon performance and personalization across a wide range of real-world domains, including education, scientific research, gaming & simulation, robotics, dialog systems, healthcare, workflow automation, software engineering, online streaming & recommendation, information search, finance & accounting, and legal & consulting. In practice, these settings often require agents to accumulate experiences, distill reusable skills, and maintain coherent histories over time. Please refer to application in out survey for more details.
Figure 3: Applications of the Foundation Agent Memory System.
We curate and organize representative papers on foundation agent memory using the taxonomy in the survey (Substrate, Cognitive Mechanism, and Subject). Below is a structured list to help you quickly navigate the design space.
-
2026-02-10 [arxiv 2026] TraceMem: Weaving Narrative Memory Schemata from User Conversational Traces
-
2026-02-09 [arxiv 2026] STaR: Scalable Task-Conditioned Retrieval for Long-Horizon Multimodal Robot Memory
-
2026-02-03 [arxiv 2026] MEMORA: A Harmonic Memory Representation Balancing Abstraction and Specificity
-
2026-01-30 [arxiv 2026] Mem-T: Densifying Rewards for Long-Horizon Memory Agents
-
2026-01-30 [arxiv 2026] Darwinian Memory: A Training-Free Self-Regulating Memory System for GUI Agent Evolution
-
2026-01-29 [arxiv 2026] E-mem: Multi-agent based Episodic Context Reconstruction for LLM Agent Memory
-
2026-01-29 [arxiv 2026] ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory
-
2026-01-29 [arxiv 2026] MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
-
2026-01-28 [arxiv 2026] MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents
-
2026-01-28 [arxiv 2026] BMAM: Brain-inspired Multi-Agent Memory Framework
-
2026-01-28 [arxiv 2026] AMA: Adaptive Memory via Multi-Agent Collaboration
-
2026-01-28 [arxiv 2026] Me-Agent: A Personalized Mobile Agent with Two-Level Use
-
2026-01-27 [arxiv 2026] MAGNET: Towards Adaptive GUI Agents with Memory-Driven Knowledge Evolution
-
2026-01-26 [arxiv 2026] MemWeaver: Weaving Hybrid Memories for Traceable Long-Horizon Agentic Reasoning
-
2026-01-09 [arxiv 2026] Self-Evolving Distributed Memory Architecture for Scalable AI Systems
-
2025-12-16 [arxiv 2025] Astraea: A State-Aware Scheduling Engine for LLM-Powered Agents
-
2025-12-14 [arxiv 2025] Memoria: A Scalable Agentic Memory Framework for Personalized Conversational AI
-
2025-12-14 [arxiv 2025] HINDSIGHT IS 20/20: BUILDING AGENT MEMORY THAT RETAINS, RECALLS, AND REFLECTS
-
2025-12-14 [arxiv 2025] Forgetful but Faithful: A Cognitive Memory Architecture and Benchmark for Privacy-Aware Generative Agents
-
2025-12-14 [arxiv 2025] QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management
-
2025-12-13 [arxiv 2025] V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval
-
2025-12-13 [arxiv 2025] VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding
-
2025-12-12 [arxiv 2025] Unifying Dynamic Tool Creation and Cross-Task Experience Sharing through Cognitive Memory Architecture
-
2025-12-11 [arxiv 2025] Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases
-
2025-12-11 [arxiv 2025] Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution
-
2025-12-08 [arxiv 2025] SIT-GRAPH: STATE INTEGRATED TOOL GRAPH FOR MULTI-TURN AGENTS
-
2025-12-08 [arxiv 2025] DeepCode: Open Agentic Coding
-
2025-12-07 [arxiv 2025] PERSONAMEM-V2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory
-
2025-12-04 [arxiv 2025] Topology Matters: Measuring Memory Leakage in Multi-Agent LLMs
-
2025-12-04 [arxiv 2025] SEAL: Self-Evolving Agentic Learning for Conversational Question Answering over Knowledge Graphs
-
2025-12-03 [arxiv 2025] From Static to Adaptive: Immune Memory-based Jailbreak Detection for Large Language Models
-
2025-12-03 [arxiv 2025] MemVerse: Multimodal Memory for Lifelong Learning Agents
-
2025-12-02 [arxiv 2025] Vision to Geometry: 3D Spatial Memory for Sequential Embodied MLLM Reasoning and Exploration
-
2025-12-02 [arxiv 2025] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
-
2025-12-01 [arxiv 2025] CuES: A Curiosity-driven and Environment-grounded Synthesis Framework for Agentic RL
-
2025-11-29 [arxiv 2025] Describe Anything Anywhere At Any Moment
-
2025-11-28 [arxiv 2025] Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
-
2025-11-28 [arxiv 2025] Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent
-
2025-11-27 [arxiv 2025] Solving Context Window Overflow in AI Agents
-
2025-11-27 [arxiv 2025] MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory
-
2025-11-26 [arxiv 2025] LOOM: Personalized Learning Informed by Daily LLM Conversations Toward Long-Term Mastery via a Dynamic Learner Memory Graph
-
2025-11-26 [arxiv 2025] Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
-
2025-11-26 [arxiv 2025] MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning
-
2025-11-25 [arxiv 2025] Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
-
2025-11-25 [arxiv 2025] Improving Language Agents through BREW
-
2025-11-25 [arxiv 2025] Latent Collaboration in Multi-Agent Systems
-
2025-11-23 [arxiv 2025] General Agentic Memory Via Deep Research
-
2025-11-21 [arxiv 2025] A Benchmark for Procedural Memory Retrieval in Lauguage Aagents
-
2025-11-21 [arxiv 2025] Episodic Memory in Agentic Frameworks: Suggesting Next Tasks
-
2025-11-21 [arxiv 2025] A Simple Yet Strong Baseline for Long-Term Conversational Memory of LLM Agents
-
2025-11-21 [arxiv 2025] MirrorMind: Empowering OmniScientist with the Expert Perspectives and Collective Knowledge of Human Scientists
-
2025-11-20 [arxiv 2025] Goal-Directed Search Outperforms Goal-agnostic Memory Compression in Long-context Memory Tasks
-
2025-11-18 [arxiv 2025] AISAC: An Integrated multi-agent System for Transparent, Retrieval-Grounded Scientific Assistance
-
2025-11-18 [arxiv 2025] O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
-
2025-11-17 [arxiv 2025] Mem-PAL: Towards Memory-based Personalized Dialogue Assistants for Long-term User-Agent Interaction
-
2025-11-17 [arxiv 2025] WebCoach: Self-evolving Web Agents with Cross-session Memory Guidance
-
2025-11-15 [arxiv 2025] GCAgent: Long-Video Understanding via Schematic and Narrative Episodic Memory
-
2025-11-13 [arxiv 2025] Multi-agent In-context Coordination via Decentralized Memory Retrieval
-
2025-11-12 [arxiv 2025] History-Aware Reasoning for GUI Agent
-
2025-11-11 [arxiv 2025] From Experience to Strategy: Empowering LLM Agents with Trainable Graph Memory
-
2025-11-11 [arxiv 2025] Smarter Together: Creating Agentic Communities of Practice through Shared Experiential Learning
-
2025-11-10 [arxiv 2025] Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces
-
2025-11-10 [arxiv 2025] IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction
-
2025-11-09 [arxiv 2025] MemoriesDB: A Temporal-Semantic-Relational Database for Long-Term Agent Memory
-
2025-11-07 [arxiv 2025] Nested Learning: The Illusion of Deep Learning Architectures
-
2025-11-05 [arxiv 2025] Towards Realistic Project-Level Code Generation via Multi-Agent Collaboration and Semantic Architecture Modeling
-
2025-11-05 [arxiv 2025] HaluMem: Evaluating Hallucinations in Memory Systems of Agents
-
2025-11-04 [arxiv 2025] MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
-
2025-10-31 [arxiv 2025] EvoMem: Improving Multi-Agent Planning with Dual-Evolving Memory
-
2025-10-31 [arxiv 2025] LiCoMemory: Lightweight and Cognitive Agentic Memory for Efficient Long-Term Reasoning
-
2025-10-31 [arxiv 2025] Dynamic Affective Memory Management for Personalized LLM Agents
-
2025-10-29 [arxiv 2025] TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling
-
2025-10-29 [arxiv 2025] CRMWeaver: Building Powerful Business Agent via Agentic RL and Shared Memories
-
2025-10-28 [arxiv 2025] AgentFold: Long-Horizon Web Agents with Proactive Context Management
-
2025-10-28 [arxiv 2025] MGA: Memory-Driven GUI Agent for Observation-Centric Interaction
-
2025-10-27 [arxiv 2025] Evaluating Long-Term Memory for Long-Context Question Answering
-
2025-10-27 [arxiv 2025] DeepAgent: A General Reasoning Agent with Scalable Toolsets
-
2025-10-22 [NeurIPS 2025] Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning
-
2025-10-21 [arxiv 2025] LightMem: Lightweight and Efficient Memory-Augmented Generation
-
2025-10-18 [arxiv 2025] Branch-and-Browse: Efficient and Controllable Web Exploration with Tree-Structured Reasoning and Action Memory
-
2025-10-18 [arxiv 2025] RGMem: Renormalization Group-based Memory Evolution for Language Agent User Profile
-
2025-10-16 [arxiv 2025] MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems
-
2025-10-15 [arxiv 2025] D-SMART: Enhancing LLM Dialogue Consistency via Dynamic Structured Memory And Reasoning Tree
-
2025-10-15 [arxiv 2025] MemoTime: Memory-Augmented Temporal Knowledge Graph Enhanced Large Language Model Reasoning
-
2025-10-14 [arxiv 2025] Memory As Action: Autonomous Context Curation for Long-Horizon Agentic Tasks
-
2025-10-13 [arxiv 2025] Scaling Long-Horizon LLM Agent via Context-Folding
-
2025-10-12 [arxiv 2025] PISA: A Pragmatic Psych-Inspired Unified Memory System for Enhanced AI Agency
-
2025-10-11 [arxiv 2025] EpiCache: Episodic KV Cache Management for Long Conversational Question Answering
-
2025-10-10 [arxiv 2025] Preference-Aware Memory Update for Long-Term LLM Agents
-
2025-10-09 [arxiv 2025] Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
-
2025-10-09 [arxiv 2025] Multiple Memory Systems for Enhancing the Long-term Memory of Agent
-
2025-10-09 [arxiv 2025] MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation
-
2025-10-09 [arxiv 2025] Enabling Personalized Long-term Interactions in LLM-based Agents through Persistent Memory and User Profiles
-
2025-10-08 [arxiv 2025] Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
-
2025-10-08 [arxiv 2025] ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
-
2025-10-07 [NeurIPS 2025] CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension
-
2025-10-06 [arxiv 2025] Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
-
2025-10-06 [arxiv 2025] LEGOMem: Modular Procedural Memory for Multi-agent LLM Systems for Workflow Automation
-
2025-10-06 [arxiv 2025] Pretraining with hierarchical memories: separating long-tail and common knowledge
-
2025-10-01 [arxiv 2025] ACON: Optimizing Context Compression for Long-horizon LLM Agents
-
2025-09-30 [arxiv 2025] Mem-α: Learning Memory Construction via Reinforcement Learning
-
2025-09-29 [arxiv 2025] MemGen: Weaving Generative Latent Memory for Self-Evolving AgentsMemGen: Weaving Generative Latent Memory for Self-Evolving Agents
-
2025-09-29 [arxiv 2025] ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
-
2025-09-29 [arxiv 2025] ViReSkill: Vision-Grounded Replanning with Skill Memory for LLM-Based Planning in Lifelong Robot Learning
-
2025-09-27 [arxiv 2025] Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents
-
2025-09-17 [arxiv 2025] ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
-
2025-09-03 [arxiv 2025] Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
-
2025-08-25 [arxiv 2025] Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
-
2025-08-15 [arxiv 2025] Learn to Memorize: Optimizing LLM-based Agents with Adaptive Memory Framework
-
2025-08-13 [arxiv 2025] Memp: Exploring Agent Procedural Memory
-
2025-08-13 [arxiv 2025] Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models
-
2025-08-05 [arxiv 2025] Nemori: Self-Organizing Agent Memory Inspired by Cognitive Science
-
2025-08-03 [arxiv 2025] MLP Memory: A Retriever-Pretrained Memory for Large Language Models
-
2025-07-31 [arxiv 2025] MemInsight: Autonomous Memory Augmentation for LLM Agents
-
2025-07-28 [arxiv 2025] In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents
-
2025-07-27 [ACL 2025] M2PA: A Multi-Memory Planning Agent for Open Worlds Inspired by Cognitive Theory
-
2025-07-25 [arxiv 2025] MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
-
2025-07-10 [arxiv 2025] MIRIX: Multi-Agent Memory System for LLM-Based Agents
-
2025-07-07 [arxiv 2025] Evaluating memory in llm agents via incremental multi-turn interactions
-
2025-06-20 [arxiv 2025] MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents
-
2025-06-16 [arxiv 2025] G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems
-
2025-06-14 [NeurIPS 2024 Datasets and Benchmarks] BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
-
2025-05-30 [arxiv 2025] Memory OS of AI Agent
-
2025-05-30 [arxiv 2025] Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning
-
2025-05-27 [arxiv 2025] Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions
-
2025-05-23 [arxiv 2025] Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control
-
2025-05-21 [arxiv 2025] Pre-training Limited Memory Language Models with Internal and External Knowledge
-
2025-05-21 [arxiv 2025] How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior
-
2025-05-13 [arxiv 2025] ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking
-
2025-05-11 [arxiv 2025] Long Term Memory : The Foundation of AI Self-Evolution
-
2025-05-06 [ACM UMAP 2025] Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents
-
2025-04-28 [arxiv 2025] Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
-
2025-04-09 [arxiv 2025] Inducing Programmatic Skills for Agentic Tasks
-
2025-04-09 [arxiv 2025] SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills
-
2025-03-31 [arxiv 2025] Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
-
2025-03-12 [arxiv 2025] VideoScan: Enabling Efficient Streaming Video Understanding via Frame-level Semantic Carriers
-
2025-03-09 [ICCV 2025] Online Dense Point Tracking with Streaming Memory
-
2025-03-07 [arxiv 2025] Enhancing Reasoning with Collaboration and Memory
-
2025-03-07 [arxiv 2025] Interpersonal Memory Matters: A New Task for Proactive Dialogue Utilizing Conversational History
-
2025-03-05 [ICLR 25 Workshop] LM2: Large Memory Models for Long Context Reasoning
-
2025-03-05 [CVPR 2025] EgoLife: Towards Egocentric Life Assistant
-
2025-03-03 [ICLR 2025] SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents
-
2025-03-01 [ICLR 2025] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
-
2025-02-22 [arxiv 2025] Echo: A Large Language Model with Temporal Episodic Memory
-
2025-02-20 [ICML 2025] From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
-
2025-02-17 [arxiv 2025] MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation
-
2025-02-17 [NeurIPS 2025] A-Mem: Agentic Memory for LLM Agents
-
2025-02-13 [arxiv 2025] R3Mem: Bridging Memory Retention and Retrieval via Reversible Compression
-
2025-02-04 [arxiv 2025] Classroom Simulacra: Building Contextual Student Generative Agents in Online Education for Learning Behavioral Simulation
-
2025-02-01 [arxiv 2025] M+: Extending MemoryLLM with Scalable Long-Term Memory
-
2025-02-01 [NeurIPS 2025] ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
-
2025-01-30 [arxiv 2025] TeachTune: Reviewing Pedagogical Agents Against Diverse Student Profiles with Simulated Students
-
2025-01-23 [ICLR 2025] Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge
-
2025-01-22 [arxiv 2025] SRMT: Shared Memory for Multi-agent Life-long Pathfinding
-
2025-01-20 [arxiv 2025] Zep: a temporal knowledge graph architecture for agent memory
-
2024-12-31 [arxiv 2024] Titans: Learning to Memorize at Test Time
-
2024-12-10 [arxiv 2024] MemReasoner: A Memory-augmented LLM Architecture for Multi-hop Reasoning
-
2024-10-24 [ICLR 2025] Longmemeval: Benchmarking chat assistants on long-term interactive memory
-
2024-10-24 [arxiv 2024] RET-LLM: Towards a General Read-Write Memory for Large Language Models
-
2024-10-17 [arxiv 2024] PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction
-
2024-10-17 [ICLR 2025] From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs
-
2024-10-13 [arxiv 2024] Personalized Large Language Model Assistant with Evolving Conditional Memory
-
2024-10-11 [UIST 2024] Memolet: Reifying the Reuse of User-AI Conversational Memories
-
2024-10-10 [arxiv 2024] Agents Thinking Fast and Slow: A Talker-Reasoner Architecture
-
2024-10-01 [ICLR 2025] Self-updatable large language models by integrating context into model parameters
-
2024-10-01 [arxiv 2024] WALT: Web Agents that Learn Tools
-
2024-09-30 [ECCV 2024] VideoAgent: A Memory-Augmented Multimodal Agent for Video Understanding
-
2024-09-30 [arxiv 2024] MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
-
2024-09-28 [EMNLP 2024] Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs
-
2024-09-23 [NAACL 2025] MADial-Bench: Towards Real-world Evaluation of Memory-Augmented Dialogue Generation
-
2024-09-11 [arxiv 2024] Agent Workflow Memory
-
2024-09-09 [WWW 2025] MemoRAG: Boosting Long Context Processing with Global Memory-Enhanced Retrieval Augmentation
-
2024-08-29 [arxiv 2024] VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
-
2024-08-28 [arxiv 2024] AI-native Memory: A Pathway from LLMs Towards AGI
-
2024-08-01 [arxiv 2024] SAM 2: Segment Anything in Images and Videos
-
2024-07-31 [IEEE 2024] VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Continual Learning
-
2024-07-22 [arxiv 2024] A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
-
2024-07-11 [SIGIR 2024] On Generative Agents in Recommendation
-
2024-07-09 [arxiv 2024] FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
-
2024-07-06 [arxiv 2024] MemoCRS: Memory-enhanced Sequential Conversational Recommender Systems with Large Language Models
-
2024-07-01 [arxiv 2024] Memory3 : Language Modeling with Explicit Memory
-
2024-06-26 [ICRA 2024] Lifelong Robot Library Learning: Bootstrapping Composable and Generalizable Skills for Embodied Control with Language Models
-
2024-06-17 [CVPR 2024] VideoLLM-online: Online Video Large Language Model for Streaming Video
-
2024-06-09 [NAACL 2025] Hello Again! LLM-powered Personalized Agent for Long-term Dialogue
-
2024-06-06 [NeurIPS 2024] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
-
2024-06-03 [arxiv 2024] Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration
-
2024-05-25 [NeurIPS 2024] AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning
-
2024-05-25 [arxiv 2024] Streaming Long Video Understanding with Large Language Models
-
2024-05-23 [NeurIPS 2024] WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
-
2024-05-23 [arxiv 2024] Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution
-
2024-05-09 [NAACL 2025] HMT: Hierarchical Memory Transformer for Efficient Long Context Language Processing
-
2024-04-02 [arxiv 2024] Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization
-
2024-03-31 [CHI 2024] "My agent understands me better": Integrating Dynamic Human-like Memory Recall and Consolidation in LLM-Based
-
2024-03-23 [arxiv 2024] EduAgent: Generative Student Agents in Learning
-
2024-03-13 [NeurIPS 2024] AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents
-
2024-03-07 [NeurIPS 2024] Online Adaptation of Language Models with a Memory of Amortized Contexts
-
2024-02-28 [ICCPR 2023] Solving the lunar lander problem with multiple uncertainties using a deep Q-learning based short-term memory agent
-
2024-02-27 [ACL 2024] Evaluating Very Long-Term Conversational Memory of LLM Agents
-
2024-02-26 [ACL 2024] Unveiling the Truth and Facilitating Change: Towards Agent-based Large-scale Social Movement Simulation
-
2024-02-19 [arxiv 2024] Beyond Retrieval: Embracing Compressive Memory in Real-World Long-Term Conversations
-
2024-02-17 [ACL 2025] KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph
-
2024-02-16 [arxiv 2024] In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss
-
2024-02-15 [arxiv 2024] User Behavior Simulation with Large Language Model based Agents
-
2024-02-12 [arxiv 2024] MemGPT: Towards LLMs as Operating Systems
-
2024-02-07 [ICML 2024] MEMORYLLM: Towards Self-Updatable Large Language Models
-
2024-02-06 [arxiv 2024] QuantAgent: Seeking Holy Grail in Trading by Self-Improving Large Language Model
-
2024-02-06 [arxiv 2024] RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
-
2024-02-02 [arxiv 2024] A Multi-Agent Conversational Recommender System
-
2024-01-30 [arxiv 2024] War and Peace (WarAgent): LLM-based Multi-Agent Simulation of World Wars
-
2024-01-26 [arxiv 2024] Developing ChemDFM as a Large Language Foundation Model for Chemistry
-
2024-01-23 [arxiv 2024] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks
-
2024-01-05 [arxiv 2024] From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models
-
2023-11-23 [arxiv 2023] FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
-
2023-11-15 [arxiv 2023] Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term Memory
-
2023-11-10 [arxiv 2023] JARV IS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
-
2023-10-24 [ACM Computing Surveys 2024] Knowledge Editing for Large Language Models: A Survey
-
2023-10-16 [EMNLP 2023] Character-LLM: A Trainable Agent for Role-Playing
-
2023-10-13 [arxiv 2023] AgentCF: Collaborative Learning with Autonomous Language Agents for Recommender Systems
-
2023-10-12 [arxiv 2023] GameGPT: Multi-agent Collaborative Framework for Game Development
-
2023-10-10 [arxiv 2023] MetaAgents: Large Language Model Based Agents for Decision-making on Teaming
-
2023-10-01 [arxiv 2023] RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
-
2023-09-23 [ILCAI 2024] AutoAgents: A Framework for Automatic Agent Generation
-
2023-09-12 [SOSP 2023] Efficient Memory Management for Large Language Model Serving with PagedAttention
-
2023-09-07 [arxiv 2023] TradingGPT: Multi-Agent System with Layered Memory and Distinct Characters for Enhanced Financial Trading Performance
-
2023-08-31 [arxiv 2023] Recommender AI Agent: Integrating Large Language Models for Interactive
-
2023-08-28 [NAACL 2024] RecMind: Large Language Model Powered Agent For Recommendation
-
2023-08-25 [ICCV 2023] Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin Memory
-
2023-08-23 [arxiv 2023] MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
-
2023-08-14 [ICCV 2023] Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents
-
2023-08-13 [arxiv 2023] ChatHaruhi: Reviving Anime Character in Reality via Large Language Model
-
2023-08-04 [ICLR 2024] Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
-
2023-08-01 [ICLR 2024] MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework
-
2023-07-29 [ICCV 2023] XMem++: Production-level Video Segmentation From Few Annotated Frames
-
2023-07-27 [arxiv 2023] S3 : Social-network Simulation System with Large Language Model-Empowered Agents
-
2023-07-24 [ICCV 2023] GridMM: Grid Memory Map for Vision-and-Language Navigation
-
2023-07-16 [ACL 2024] ChatDev: Communicative Agents for Software Development
-
2023-07-06 [NeurIPS 2023] Focused Transformer: Contrastive Training for Context Scaling
-
2023-06-24 [Energy Conversion and Management 2023] Effective pre-training of a deep reinforcement learning agent by means of long short-term memory models for thermal energy management in buildings
-
2023-06-13 [ICLR 2024] Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
-
2023-06-12 [arxiv 2023] Augmenting Language Models with Long-Term Memory
-
2023-06-06 [arxiv 2023] ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory
-
2023-06-01 [arxiv 2023] Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
-
2023-05-26 [arxiv 2023] AdaPlanner: Adaptive Planning from Feedback with Language Models
-
2023-05-24 [EMNLP 2023] Reasoning with Language Model is Planning with World Model
-
2023-05-23 [arxiv 2023] Voyager: An Open-Ended Embodied Agent with Large Language Models
-
2023-05-21 [AAAI 2024] MemoryBank: Enhancing Large Language Models with Long-Term Memory
-
2023-05-08 [ACL 2023] Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
-
2023-05-07 [Neurocomputing 2025] Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models
-
2023-04-26 [DASFAA 2025] SCM:Enhancing Large Language Model with Self-Controlled Memory Framework
-
2023-04-07 [UIST 2023] Generative Agents: Interactive Simulacra of Human Behavior
-
2023-03-20 [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
-
2021-07-15 [ACL 2022] Beyond Goldfish Memory∗: Long-Term Open-Domain Conversation