Agent Memory System
Working memory and subgoal/chunk-based short-term memory management (In-trial/Working Memory)
Focuses on working memory (in-context memory) and on memory selection, updating, and capacity management within a single decision process. The common thread is using hierarchy, chunking, and selection mechanisms to cut redundancy and improve stability and efficiency on long tasks (e.g., using subgoals as memory chunks, reinforcement-learning-based chunk selection, coupling working memory with action planning, and OS-like cache hierarchies to speed up retrieval).
- HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model(Mengkang Hu, Tianxing Chen, Qiguang Chen, Yi Mu, Wenqi Shao, Ping Luo, 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
- A working memory model improves cognitive control in agents and robots(Michele Persiani, A. Franchi, G. Gini, 2018, Cognitive Systems Research)
- Enhancing intelligent agents with episodic memory(Andrew Nuxoll, John E. Laird, 2012, Cognitive Systems Research)
- Cooperative Scheduling and Hierarchical Memory Model for Multi-Agent Systems(Huhai Zou, Rongzhen Li, Tianhao Sun, Fei Wang, Ta-Hsin Li, Kai Liu, 2024, 2024 IEEE International Symposium on Product Compliance Engineering - Asia (ISPCE-ASIA))
Long-term memory modeling: consolidation/forgetting, semantic-episodic interaction, and spatiotemporally consistent representations (Long-term Memory Modeling)
Centers on the formation-consolidation-forgetting cycle of long-term memory and on semantic/episodic representations. The common thread is borrowing neural/cognitive mechanisms or graph-based/structured representations to achieve scalable long-term knowledge extraction and consistency maintenance, emphasizing transferable semantics, temporal/spatial/logical constraints, and memory compression/formatting for usability.
- Memory formation, consolidation, and forgetting in learning agents(B. Subagdja, Wenwen Wang, A. Tan, Yuan-Sin Tan, Loo-Nin Teow, 2012, International Joint Conference on Autonomous Agents and Multiagent Systems)
- Semantic Memory Modeling and Memory Interaction in Learning Agents(Wenwen Wang, Ah‐Hwee Tan, Loo-Nin Teow, 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems)
- Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces(Shreyas Rajesh, Pavan Holur, Chenda Duan, David Chong, Vwani Roychowdhury, 2026, Proceedings of the AAAI Conference on Artificial Intelligence)
- HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models(Yu Gu, Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Su, Michihiro Yasunaga, 2024, Advances in Neural Information Processing Systems 37)
- DSRd: A Proposal for a Low-Latency, Distributed Working Memory for CORTEX(P. Bustos, Juan C. García, R. Cintas, Esteban Martirena, P. Bachiller, Pedro Núñez Trujillo, A. Bandera, 2020, Advances in Intelligent Systems and Computing)
- AgenticMemory: A Binary Graph Format for Persistent, Portable, and Navigable AI Agent Memory(Omoshola S. Owolabi, 2026, … , and Navigable AI Agent Memory (February 18, 2026))
From RAG to LTM: global memory-enhanced retrieval and multi-turn/dynamic memory retrieval (RAG→LTM via Retrieval/Updating)
Takes RAG and LLM long-context tasks as the main thread and discusses how to make retrieval dynamic, memory-augmented, and multi-level. The common thread is coupling the memory module with retrieval and generation (draft-then-final answering, multi-turn probing with global-memory updates, adaptive retrieval across multi-scale memory tiers, and the evolution from RAG to LTM).
- MemoRAG: Boosting Long Context Processing with Global Memory-Enhanced Retrieval Augmentation(Hongjin Qian, Zheng Liu, Peitian Zhang, Kelong Mao, Defu Lian, Zhicheng Dou, Tiejun Huang, 2024, Proceedings of the ACM on Web Conference 2025)
- ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning(Juyuan Wang, Rongchen Zhao, Wei Wei, Yufeng Wang, M. K. Yu, Jie Zhou, Jin Xu, Liyan Xu, 2026, Proceedings of the AAAI Conference on Artificial Intelligence)
- Dynamic Memory Retrieval in RAG Models: Enhancing Long-Context Reasoning(Changqing Dong, 2025, 2025 6th International Conference on Artificial Intelligence and Computer Engineering (ICAICE))
- Conversational Agents: From RAG to LTM(Dell Zhang, Yue Feng, Haiming Liu, Changzhi Sun, Jixiang Luo, Xiangyu Chen, Xuelong Li, 2025, Proceedings of the 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region)
- RAG-Driven Memory Architectures in Conversational LLMs—A Literature Review With Insights Into Emerging Agriculture Data Sharing(Nur Arifin Akbar, Rahool Dembani, B. Lenzitti, Domenico Tegolo, 2025, IEEE Access)
- Dynamic Memory Updating in RAG: Lifelong Learning and Adaptation(Sivarama Krishna Akhil Koduri, 2026, The paper is also available on Zenodo: https://doi.org …)
- HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models(Yu Gu, Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Su, Michihiro Yasunaga, 2024, Advances in Neural Information Processing Systems 37)
- Vector Databases and Language Models: Synergies and Challenges(Toni Taipalus, 2025, Communications in Computer and Information Science)
Memory infrastructure and system architecture: MemoryOS, Memory Fabric, vector databases, and long-term retrieval storage (Systems & Storage Infrastructure)
Focuses on the engineering and architecture of memory systems, memory operating systems, vector databases, and infrastructure: the technical substrate for vector storage and retrieval, memory operation pipelines for long-term dialogue (updating/retrieval/generation), and techniques such as KV compression, forgetting curves, and tiered storage to improve consistency and cost efficiency. The common thread is systematizing memory capabilities into implementable modules and data-management strategies.
- Memory Fabric for Conversational AI Agents: Enabling Shared and Persistent Memory Across Users(A Tiwari, V Gupta, 2025, Authorea Preprints)
- A memory fabric for conversational AI agents enabling shared and persistent multiuser memory(A. Tiwari, Vibhuti Gupta, 2026, Discover Artificial Intelligence)
- Vector Database Management Techniques and Systems(J. Pan, Jianguo Wang, Guoliang Li, 2024, Companion of the 2024 International Conference on Management of Data)
- Vector database management systems: Fundamental concepts, use-cases, and current challenges(Toni Taipalus, 2023, Cognitive Systems Research)
- Vector Databases and Language Models: Synergies and Challenges(Toni Taipalus, 2025, Communications in Computer and Information Science)
- Vector Storage Based Long-term Memory Research on LLM(Kun Li, Xin Jing, Chengang Jing, 2024, International Journal of Advanced Network, Monitoring and Controls)
- Memory OS of AI Agent(Jie Kang, Mingming Ji, Zhe Zhao, Ting Bai, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)
- Memory Matters: The Need to Improve Long-Term Memory in LLM-Agents(Kostas Hatalis, Despina Christou, Joshua Myers, Steven Jones, Keith A. Lambert, Adam Amos-Binks, Zohreh Dannenhauer, Dustin Dannenhauer, 2024, Proceedings of the AAAI Symposium Series)
Persistent/distributed memory management for multi-agent and interactive systems (Multi-agent & Distributed/Persistent Memory)
Centers on persistence and distributed memory management in multi-agent or complex interactive systems: hierarchical architectures with persistent supervision for multi-agent collaboration, cross-session long-term memory assistants, graph-structured persistent memory for GUI/computer-use agents that avoids re-solving subtasks, and real-time negotiation of memory operations in distributed/embedded-database and pub/sub settings. The common thread is designing memory as the foundational capability that supports multi-agent collaboration and stable cross-session operation.
- Functional Stability and Adaptive Control in LLM-Based Computer Use Agents via Graph-Structured Persistent Memory(Danylo Vorvul, Andrii Musienko, Iryna Galchenko, Mykola Myroniuk, Андрій Собчук, 2026, Preprints.org)
- Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces(Shreyas Rajesh, Pavan Holur, Chenda Duan, David Chong, Vwani Roychowdhury, 2026, Proceedings of the AAAI Conference on Artificial Intelligence)
- An Intelligent Multi-Agent Memory Assistant(Ângelo Costa, P. Novais, 2011, Communications in Medical and Care Compunetics)
- Multi-agent Personal Memory Assistant(Ângelo Costa, P. Novais, Ricardo Costa, J. Corchado, J. Neves, 2010, Advances in Intelligent and Soft Computing)
- Society Agent: A Hierarchical Multi-Agent Architecture with Autonomous Persistent and Ephemeral Agents and Persistent Evolving Knowledge(I. Chrysochos, 2026, Authorea Preprints)
- High-Performance Implementation of Multi-Agent Web Systems: Integrating Vector Memory with Strictly Typed React Architectures(Mykhailo Nykoliuk, 2025, Universal Library of Engineering Technology)
- MemIndex: Agentic Event-based Distributed Memory Management for Multi-agent Systems(Alaa Saleh, Sasu Tarkoma, Anders Lindgren, Praveen Kumar Donta, S. Dustdar, Susanna Pirttikangas, Lauri Lovén, 2025, ACM Transactions on Autonomous and Adaptive Systems)
- AEVUM: An Agent-Native Persistent Memory Database System with Autonomous Data Management and Multi-Stage Compression(JR Maligireddy, 2026, Authorea Preprints)
- CoMMA: a multi-agent system for corporate memory management(F. Bergenti, A. Poggi, G. Rimassa, Paola Turci, 2002, Proceedings of the first international joint conference on Autonomous agents and multiagent systems part 3 - AAMAS '02)
- A multi-agent system for building project memories to facilitate the design process(D. Monticolo, Vincent Hilaire, S. Gomes, A. Koukam, 2008, Integrated Computer-Aided Engineering)
Overall, the literature can be split along the memory lifecycle and its systems realization into five parallel threads: 1) chunking, selection, and hierarchical management of in-session working memory; 2) cognitively inspired long-term memory formation, consolidation-forgetting, and interacting semantic/episodic representations; 3) the RAG-to-LTM evolution for long contexts, via dynamic retrieval and multi-turn updating; 4) engineering-oriented memory operating systems, memory fabrics, and vector-database substrates; 5) persistent, distributed memory management and stability control in multi-agent and interactive task settings.
36 related papers in total.
Large Language Models (LLMs) face a crucial challenge from fixed context windows and inadequate memory management, leading to a severe shortage of long-term memory capabilities and limited personalization in the interactive experience with AI agents. To overcome this challenge, we innovatively propose a Memory Operating System, i.e., MemoryOS, to achieve comprehensive and efficient memory management for AI agents. Inspired by the memory management principles in operating systems, MemoryOS designs a hierarchical storage architecture and consists of four key modules: Memory Storage, Updating, Retrieval, and Generation. Specifically, the architecture comprises three levels of storage units: short-term memory, mid-term memory, and long-term personal memory. Key operations within MemoryOS include dynamic updates between storage units: short-term to mid-term updates follow a dialogue-chain-based FIFO principle, while mid-term to long-term updates use a segmented page organization strategy. Extensive experiments on the LoCoMo benchmark show an average improvement of 49.11% on F1 and 46.18% on BLEU-1 over the baselines on GPT-4o-mini, demonstrating contextual coherence and personalized memory retention in long conversations.
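The dialogue-chain FIFO and segmented-paging updates described above can be sketched as a toy three-level store. Class names, capacities, and the page format here are illustrative assumptions, not the MemoryOS implementation:

```python
from collections import deque

class HierarchicalMemory:
    """Toy three-level store: short-term -> mid-term -> long-term.

    Short-term holds the most recent dialogue turns; on overflow the
    oldest turn is evicted FIFO-style into mid-term. Mid-term groups
    evicted turns into fixed-size "pages" that are flushed to long-term.
    """

    def __init__(self, short_cap=3, page_size=2):
        self.short = deque()      # most recent dialogue turns
        self.mid = []             # evicted turns awaiting paging
        self.long_term = []       # list of consolidated pages
        self.short_cap = short_cap
        self.page_size = page_size

    def add_turn(self, turn):
        self.short.append(turn)
        if len(self.short) > self.short_cap:
            # FIFO eviction: oldest turn moves down one level
            self.mid.append(self.short.popleft())
        if len(self.mid) >= self.page_size:
            # Segmented paging: flush a full page to long-term storage
            self.long_term.append(tuple(self.mid[:self.page_size]))
            del self.mid[:self.page_size]

mem = HierarchicalMemory()
for i in range(7):
    mem.add_turn(f"turn-{i}")
print(list(mem.short))   # the 3 most recent turns
print(mem.long_term)     # pages of older turns
```

A real system would replace the raw turn strings with summarized dialogue chains and attach retrieval indexes to each level; the sketch only shows the promotion mechanics.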
Interactive applications are latency-sensitive systems that enable dynamic responses to user inputs in domains such as robotics, industrial automation, and autonomous control. These applications require efficient application protocols for communication, with the pub/sub model being one of the most promising approaches. However, existing pub/sub systems are architecturally constrained, particularly by limited memory capacity and inefficiencies in dynamic environments. Addressing these challenges requires effective distributed memory management, yet this aspect has received limited attention in existing research. This paper addresses the gap by proposing MemIndex, an adaptive and autonomous distributed memory-management framework with an intent-indexed bipartite graph architecture. It is designed for LM-based multi-agent pub/sub systems, enabling agents to autonomously negotiate memory operations in real time through dynamic index spaces for efficient reasoning. We evaluate MemIndex using diverse models against two baselines. Experimental results show MemIndex outperforms both baselines across storage, retrieval, update, and deletion operations, achieving average reductions of about 34% and 56% in elapsed time, 57% and 75% in CPU utilization, and 23% and 76% in memory usage. Scalability tests further demonstrate that MemIndex maintains low end-to-end delay as submissions and agents grow, confirming that its negotiation-driven offloading enables efficient distributed memory management in interactive applications.
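The core data structure, an intent-indexed bipartite graph with intents on one side and memory items on the other, can be illustrated with a minimal two-sided index. The class and method names below are hypothetical, not MemIndex's API:

```python
class IntentIndex:
    """Toy intent-indexed bipartite store: intents on one side, memory
    items on the other, with edges linking them. Storage, retrieval,
    and deletion all route through the intent side of the graph."""

    def __init__(self):
        self.intent_to_items = {}   # intent -> set of item ids
        self.items = {}             # item id -> payload

    def store(self, item_id, payload, intents):
        self.items[item_id] = payload
        for intent in intents:
            self.intent_to_items.setdefault(intent, set()).add(item_id)

    def retrieve(self, intent):
        ids = sorted(self.intent_to_items.get(intent, set()))
        return [self.items[i] for i in ids]

    def delete(self, item_id):
        self.items.pop(item_id, None)
        for ids in self.intent_to_items.values():
            ids.discard(item_id)   # drop dangling edges

idx = IntentIndex()
idx.store(1, "robot pose at t=3", {"telemetry", "pose"})
idx.store(2, "conveyor speed 0.4 m/s", {"telemetry"})
print(idx.retrieve("telemetry"))
idx.delete(1)
print(idx.retrieve("pose"))
```

The paper's contribution is the autonomous, negotiated placement of such index entries across distributed agents; this sketch only shows the single-node indexing shape.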
… -agent system for the management of a corporate memory. The innovative aspect of the system … were generally used separately until now: agent technology, knowledge modeling, XML …
… Section 4 presents the structure of our project memories model. We describe a multi-agent system used to build these project memories in section 5. Finally Section 6 describes the …
Constrained by the cost and ethical concerns of involving real seekers in AI-driven mental health, researchers develop LLM-based conversational agents (CAs) with tailored configurations, such as profiles, symptoms, and scenarios, to simulate seekers. While these efforts advance AI in mental health, achieving more realistic seeker simulation remains hindered by two key challenges: dynamic evolution and multi-session memory. Seekers' mental states often fluctuate during counseling, which typically spans multiple sessions. To address this, we propose AnnaAgent, an emotional and cognitive dynamic agent system equipped with tertiary memory. AnnaAgent incorporates an emotion modulator and a complaint elicitor trained on real counseling dialogues, enabling dynamic control of the simulator's configurations. Additionally, its tertiary memory mechanism effectively integrates short-term and long-term memory across sessions. Evaluation results, both automated and manual, demonstrate that AnnaAgent achieves more realistic seeker simulation in psychological counseling compared to existing baselines. The ethically reviewed and screened code can be found on https://github.com/sci-m-wang/AnnaAgent.
… present a single memory system, since often such a system will depend on … memory systems that are compatible in a general manner with computational attention and emotion systems. …
Large Language Models (LLMs) are leading a technological revolution. This gives agents based on LLMs renewed vitality, and multi-agent collaboration is showing the potential to foster new forms of intelligence. However, current multi-agent systems face two major challenges: the first is the issue of resource coordination and scheduling within multi-agent systems, and the second is the limitation of the context window in large models, which hinders the practical application of agents in long-term conversational scenarios, urgently requiring improvements in memory capabilities. To address these challenges, we propose a collaborative scheduling strategy and hierarchical memory model for LLM-based multi-agent systems inspired by operating systems. First, we design a time-sharing scheduling strategy, analogous to process scheduling in operating systems, which divides the resource usage cycle into finer-grained single-step workflows, allocating independent resource windows to different agents to reduce resource contention and conflicts. Second, we introduce a hierarchical memory model based on the multi-level cache architecture of operating systems, segmenting the agents' memory into core memory, main memory, and vague memory areas, thereby significantly improving memory retention and retrieval efficiency in LLM-based agents when handling complex tasks. Experimental results demonstrate that our proposed method achieves efficient resource allocation in multi-agent systems while significantly enhancing the memory capabilities of agents and overall system performance.
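The core/main/vague segmentation works like a multi-level cache: hot items live in a small fast tier, and access promotes an item back up while overflow demotes the least-recently-used item down. A minimal sketch of that promotion/demotion policy (tier names and capacities are illustrative, not the paper's implementation):

```python
from collections import OrderedDict

class TieredMemory:
    """Toy cache-style hierarchy: 'core' (small, hot), 'main', 'vague'.
    Accessing an item promotes it to core; overflow demotes the
    least-recently-used item one tier down."""

    def __init__(self, core_cap=2, main_cap=3):
        self.tiers = [OrderedDict(), OrderedDict(), OrderedDict()]  # core, main, vague
        self.caps = [core_cap, main_cap, float("inf")]

    def put(self, key, value):
        self._insert(0, key, value)

    def _insert(self, level, key, value):
        tier = self.tiers[level]
        tier[key] = value
        tier.move_to_end(key)                       # mark most-recent
        if len(tier) > self.caps[level]:
            old_key, old_val = tier.popitem(last=False)  # LRU demotion
            self._insert(level + 1, old_key, old_val)

    def get(self, key):
        for level, tier in enumerate(self.tiers):
            if key in tier:
                value = tier.pop(key)
                self._insert(0, key, value)          # promote on access
                return value
        return None

mem = TieredMemory()
for k in "abcde":
    mem.put(k, k.upper())
mem.get("a")                                         # pull 'a' back into core
print([list(t) for t in mem.tiers])
```

Demoted entries would, in a real agent, also be compressed (e.g., summarized) on their way to the "vague" tier; here they are passed through unchanged to keep the mechanics visible.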
… reference system for modelling spatial prepositions, the “state transition semantics” system … language sentences, the agent’s episodic memory system and associated reflective demons, …
… can fulfil the development requisites of Memory Assistants is the Multi-Agent System one [12]. The … Generally, agent platforms support the interaction between the agents by means of …
… On top of this platform a Personal Memory Assistant and a Social Enabler where developed … a distributed system approach is adequate for developing multi-agent systems for healthcare …
… Turing Machines and Key-Value Memory Networks to recent systems like AutoGen, … memory fabric. It also addresses work on dialogue and agent memory, multi-agent shared-memory …
Large Language Model (LLM)-based agents exhibit significant potential across various domains, operating as interactive systems that process environmental observations to generate executable actions for target tasks. The effectiveness of these agents is significantly influenced by their memory mechanism, which records historical experiences as sequences of action-observation pairs. We categorize memory into two types: cross-trial memory, accumulated across multiple attempts, and in-trial memory (working memory), accumulated within a single attempt. While considerable research has optimized performance through cross-trial memory, the enhancement of agent performance through improved working memory utilization remains underexplored. Instead, existing approaches often involve directly inputting entire historical action-observation pairs into LLMs, leading to redundancy in long-horizon tasks. Inspired by human problem-solving strategies, this paper introduces HiAgent, a framework that leverages subgoals as memory chunks to manage the working memory of LLM-based agents hierarchically. Specifically, HiAgent prompts LLMs to formulate subgoals before generating executable actions and enables LLMs to decide proactively to replace previous subgoals with summarized observations, retaining only the action-observation pairs relevant to the current subgoal. Experimental results across five long-horizon tasks demonstrate that HiAgent achieves a twofold increase in success rate and reduces the average number of steps required by 3.8. Additionally, our analysis shows that HiAgent consistently improves performance across various steps, highlighting its robustness and generalizability.
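The subgoal-as-chunk idea above, keep verbatim pairs only for the current subgoal and collapse finished subgoals into summaries, can be sketched as follows. The summarizer is a stub standing in for an LLM call, and all names are illustrative rather than HiAgent's actual interface:

```python
def summarize(pairs):
    """Stand-in for an LLM summarizer: collapse a list of
    (action, observation) pairs into one line."""
    return f"[summary of {len(pairs)} steps, last obs: {pairs[-1][1]}]"

class ChunkedWorkingMemory:
    """Toy subgoal-chunked working memory: only the current subgoal's
    action-observation pairs are kept verbatim; completed subgoals are
    replaced by one-line summaries."""

    def __init__(self):
        self.summaries = []     # one entry per completed subgoal
        self.subgoal = None
        self.pairs = []         # verbatim steps for the current subgoal

    def start_subgoal(self, subgoal):
        if self.pairs:
            # Chunk boundary: compress the finished subgoal's steps
            self.summaries.append((self.subgoal, summarize(self.pairs)))
        self.subgoal, self.pairs = subgoal, []

    def record(self, action, observation):
        self.pairs.append((action, observation))

    def context(self):
        """What would be fed back to the LLM at the next step."""
        return self.summaries + [(self.subgoal, self.pairs)]

wm = ChunkedWorkingMemory()
wm.start_subgoal("find the key")
wm.record("open drawer", "drawer empty")
wm.record("look under mat", "key found")
wm.start_subgoal("open the door")
wm.record("insert key", "door unlocked")
print(wm.context())
```

The prompt length therefore grows with the number of subgoals rather than the number of raw steps, which is the source of the long-horizon savings the abstract reports.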
… dialogue systems enhanced with external or persistent memory [22]. Unlike traditional … have explored persistent memory, shared retrieval, or multi-agent coordination, the notion of …
… model agents operate without persistent memory, losing accumulated knowledge at every session boundary. Current approaches to agent memory… structure that makes memory useful. …
Background: Traditional AI coding assistants operate as single agents responding to immediate user requests, lacking persistence, organizational structure, and the ability to coordinate complex, long-running tasks. Existing multi-agent systems typically use ephemeral agents with flat architectures and no long-term memory. Objectives: We introduce Society Agent, a supervised multi-agent system that transforms standalone AI assistants into coordinated teams capable of autonomous, long-running work. The system's hierarchical architecture can model both human organizations (companies, departments, teams) and large software systems (modules, components, services). Methods: We design and implement a hierarchical agent architecture with persistent supervisors and ephemeral workers, integrating the Mind-Tool file-based memory system for persistent evolving knowledge. The system includes cron-based task scheduling, zero-token heartbeat monitoring, self-reconfiguration through folder reorganization, and a web dashboard for human oversight. Results: Our evaluation across three use cases (automated software development, organizational simulation, and self-reengineering systems) demonstrates that Society Agent successfully coordinates multiple agents across hierarchical departments while maintaining persistent knowledge and enabling autonomous task execution without continuous human intervention. Conclusions: Society Agent represents a paradigm shift from task-execution tools to organizational AI systems capable of modeling and eventually augmenting entire companies, departments, and development teams. The combination of hierarchical structure, persistent memory, and autonomous operation enables a new class of AI applications.
Large language model (LLM)-driven computer use agents (CUAs) automate graphical user interface (GUI) tasks but often re-solve previously encountered subtasks, increasing token use, latency, and instability. We address this limitation with a directed graph-based persistent memory in which nodes represent observable GUI states and edges encode executable action sequences. We formalize the memory-augmented agent as S=〈A,Σ,G,δ,π,Φ〉, define stability conditions by analogy with functional stability theory, and derive token-cost efficiency bounds. In control-theoretic terms, the Manager–Worker architecture becomes a closed-loop system where memory provides experience-based feedback, and selecting between memory retrieval and fresh LLM planning is treated as adaptive control. Experiments on OSWorld show that the proposed agent cuts both LLM token consumption and execution time by about 50% versus a memoryless baseline while preserving comparable success rates (≈36.9% on 15-step and ≈46.9% on 50-step tasks). Structured graph memory therefore improves robustness under perturbation and supports convergent efficiency gains over time.
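The graph memory and the retrieval-versus-planning choice described above can be sketched as a directed graph of GUI states with action sequences on the edges. A breadth-first lookup stands in for the paper's formal machinery, and the planner is a stub:

```python
class GuiMemoryGraph:
    """Toy persistent memory for a computer-use agent: nodes are
    observable GUI states, edges store executable action sequences."""

    def __init__(self):
        self.edges = {}   # state -> {next_state: action_sequence}

    def record(self, src, dst, actions):
        self.edges.setdefault(src, {})[dst] = actions

    def lookup_path(self, src, dst):
        """BFS over remembered states; returns concatenated actions or None."""
        frontier, seen = [(src, [])], {src}
        while frontier:
            state, actions = frontier.pop(0)
            if state == dst and actions:
                return actions
            for nxt, acts in self.edges.get(state, {}).items():
                if nxt not in seen:
                    seen.add(nxt)
                    frontier.append((nxt, actions + acts))
        return None

def act(memory, src, dst, plan_fn):
    """Adaptive-control flavor of the paper: replay memory when a
    known path exists, otherwise fall back to fresh LLM planning."""
    cached = memory.lookup_path(src, dst)
    return ("memory", cached) if cached else ("planner", plan_fn(src, dst))

g = GuiMemoryGraph()
g.record("desktop", "browser_open", ["click browser icon"])
g.record("browser_open", "settings_page", ["open menu", "click settings"])
print(act(g, "desktop", "settings_page", lambda s, d: ["<fresh plan>"]))
```

Replaying a cached path costs zero LLM tokens, which is where the reported ~50% token and latency savings come from; a production version would also verify each intermediate GUI state before executing the next edge.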
… In this work, we propose HippoRAG, a RAG framework that serves as a long-term memory … We model the three components of human long-term memory to mimic its pattern separation …
Large Language Models (LLMs) have significantly advanced Artificial Intelligence (AI), demonstrating impressive capabilities in language understanding, reasoning, and generation. However, their fixed context windows fundamentally limit their utility in sustained, complex human-computer interactions, leading to issues such as forgetting previous turns, lacking consistent personas, and an inability to perform long-horizon reasoning. While Retrieval-Augmented Generation (RAG) offers a promising solution by externalizing knowledge and providing LLMs with relevant information from external corpora, its traditional static retrieve-then-generate pipeline often struggles with dynamic knowledge integration, introduces noise, and overlooks structural relationships. This tutorial introduces the evolution from traditional RAG to advanced Long-Term Memory (LTM) mechanisms that equip LLM-based conversational agents with human-like memory capabilities. We will explore various LTM architectures, including textual, graph-based, and parametric memory, detailing their forms, operations (such as dynamic indexing, retrieval, updating, and consolidation), and multimodal integration strategies. The tutorial will cover cutting-edge systems (like Mem0), illustrating how they enable agents to maintain coherent conversations, personalize interactions, and perform complex reasoning over extended periods. We will also delve into evaluation benchmarks (e.g., LoCoMo and ZH-4O) as well as metrics that comprehensively assess these long-term memory capabilities. Finally, we will discuss current limitations and promising future research directions, particularly focusing on AI self-evolution, multimodal memory, and ethical considerations. This tutorial aims to provide a comprehensive understanding for researchers and practitioners interested in building the next generation of intelligent, memory-aware conversational AI agents.
Retrieval-Augmented Generation (RAG) models have revolutionized the way large language models (LLMs) access and utilize external knowledge. However, traditional RAG pipelines often rely on static retrieval mechanisms, which limit adaptability and degrade performance in long-context or evolving knowledge scenarios. This paper introduces Dynamic Memory Retrieval (DMR) — a framework designed to enhance RAG’s reasoning capabilities by enabling adaptive, context-aware retrieval across multi-scale memory hierarchies. We explore its theoretical foundations, architectural design, and implications for long-context reasoning, highlighting improvements in interpretability, stability, and efficiency.
Narrative comprehension on long stories and novels has been a challenging domain attributed to their intricate plotlines and entangled, often evolving relations among characters and entities. Given the LLM's diminished reasoning over extended context and its high computational cost, retrieval-based approaches remain a pivotal role in practice. However, traditional RAG methods could fall short due to their stateless, single-step retrieval process, which often overlooks the dynamic nature of capturing interconnected relations within long-range context. In this work, we propose ComoRAG, holding the principle that narrative reasoning is not a one-shot process, but a dynamic, evolving interplay between new evidence acquisition and past knowledge consolidation, analogous to human cognition on reasoning with memory-related signals in the brain. Specifically, when encountering a reasoning impasse, ComoRAG undergoes iterative reasoning cycles while interacting with a dynamic memory workspace. In each cycle, it generates probing queries to devise new exploratory paths, then integrates the retrieved evidence of new aspects into a global memory pool, thereby supporting the emergence of a coherent context for the query resolution. Across four challenging long-context narrative benchmarks (200K+ tokens), ComoRAG outperforms strong RAG baselines with consistent relative gains up to 11% compared to the strongest baseline. Further analysis reveals that ComoRAG is particularly advantageous for complex queries requiring global comprehension, offering a principled, cognitively motivated paradigm for retrieval-based stateful reasoning.
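The iterative probe-retrieve-consolidate cycle described above can be reduced to a short control loop. The probe, retriever, and answerer below are stubs standing in for LLM and retrieval components; the function name and signature are illustrative:

```python
def comorag_style_loop(question, retrieve, probe, try_answer, max_cycles=3):
    """Toy iterative reasoning loop: each cycle issues a probing query
    to explore a new aspect, merges the retrieved evidence into a
    global memory pool, then re-attempts to answer from the pool."""
    memory_pool = []
    for cycle in range(max_cycles):
        query = probe(question, memory_pool)     # devise a new exploratory path
        memory_pool.extend(retrieve(query))      # consolidate new evidence
        answer = try_answer(question, memory_pool)
        if answer is not None:                   # impasse resolved
            return answer, cycle + 1
    return None, max_cycles

# Stub components standing in for the LLM and retriever.
corpus = {"who": ["Ana met Ben in ch.2"], "where": ["They met in Prague"]}
probe = lambda q, pool: "who" if not pool else "where"
retrieve = lambda query: corpus[query]
try_answer = lambda q, pool: "Prague" if any("Prague" in e for e in pool) else None

print(comorag_style_loop("Where did Ana meet Ben?", retrieve, probe, try_answer))
```

The key difference from single-shot RAG is that the memory pool persists across cycles, so later probes can build on evidence that earlier cycles already consolidated.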
Processing long contexts presents a significant challenge for large language models (LLMs). While recent advancements allow LLMs to handle much longer contexts than before (e.g., 32K or 128K tokens), it is computationally expensive and can still be insufficient for many applications. Retrieval-Augmented Generation (RAG) is considered a promising strategy to address this problem. However, conventional RAG methods face inherent limitations because of two underlying requirements: 1) explicitly stated queries, and 2) well-structured knowledge. These conditions, however, do not hold in general long-context processing tasks. In this work, we propose MemoRAG, a novel RAG framework empowered by global memory-augmented retrieval. MemoRAG features a dual-system architecture. First, it employs a light but long-range system to create a global memory of the long context. Once a task is presented, it generates draft answers, providing useful clues for the retrieval tools to locate relevant information within the long context. Second, it leverages an expensive but expressive system, which generates the final answer based on the retrieved information. Building upon this fundamental framework, we realize the memory module in the form of KV compression, and reinforce its memorization and cluing capacity from the Generation quality's Feedback (a.k.a. RLGF). In our experiments, MemoRAG achieves superior performances across a variety of long-context evaluation tasks, not only complex scenarios where traditional RAG methods struggle, but also simpler ones where RAG is typically applied.
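The dual-system, draft-then-final flow above, a light model drafts clue answers from a compressed global memory, and the clues steer retrieval for an expressive final model, can be sketched as a three-stage pipeline. All component names are stubs, not MemoRAG's API:

```python
def memorag_style_answer(task, global_memory, draft_fn, retrieve_fn, final_fn):
    """Toy dual-system pipeline: a light model drafts clue answers
    from a compressed global memory of the long context; the clues
    steer retrieval; an expressive model writes the final answer."""
    clues = draft_fn(task, global_memory)   # cheap, possibly rough draft
    evidence = retrieve_fn(clues)           # clue-guided retrieval
    return final_fn(task, evidence)         # expensive, precise generation

# Stub components; names and data are illustrative only.
memory = "summary: report covers Q3 revenue and churn"
chunks = {"revenue": "Q3 revenue was $12M", "churn": "churn fell to 2%"}
draft = lambda task, mem: ["revenue"] if "revenue" in task else ["churn"]
retrieve = lambda clues: [chunks[c] for c in clues]
final = lambda task, ev: f"Answer based on: {'; '.join(ev)}"

print(memorag_style_answer("What was Q3 revenue?", memory, draft, retrieve, final))
```

The design point is that the draft need not be correct, only suggestive enough to locate the right evidence, which is why a cheap KV-compressed memory model suffices for the first stage.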
… edge bases restricts adaptability and long-term scalability. This paper synthesizes recent literature on RAG system design, … Jiang et al., “Long term memory: The foundation of AI self- …
Large Language Models (LLMs) face fundamental challenges in long-context reasoning: many documents exceed their finite context windows, while performance on texts that do fit degrades with sequence length, necessitating their augmentation with external memory frameworks. Current solutions, which have evolved from retrieval using semantic embeddings to more sophisticated structured knowledge graphs representations for improved sense-making and associativity, are tailored for fact-based retrieval and fail to build the space-time-anchored narrative representations required for tracking entities through episodic events. To bridge this gap, we propose the Generative Semantic Workspace (GSW), a neuro-inspired generative memory framework that builds structured, interpretable representations of evolving situations, enabling LLMs to reason over evolving roles, actions, and spatiotemporal contexts. Our framework comprises an Operator, which maps incoming observations to intermediate semantic structures, and a Reconciler, which integrates these into a persistent workspace that enforces temporal, spatial, and logical coherence. On the Episodic Memory Benchmark (EpBench) comprising corpora ranging from 100k to 1M tokens in length, GSW outperforms existing RAG based baselines by up to 20%. Furthermore, GSW is highly efficient, reducing query-time context tokens by 51% compared to the next most token-efficient baseline, reducing inference time costs considerably. More broadly, GSW offers a concrete blueprint for endowing LLMs with human-like episodic memory, paving the way for more capable agents that can reason over long horizons.
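The Operator/Reconciler split described above can be sketched minimally: the Operator maps an observation to a semantic structure, and the Reconciler merges it into a persistent per-entity workspace while enforcing temporal coherence. Field names and the coherence rule are simplified assumptions, not GSW's actual schema:

```python
def operator(observation):
    """Stand-in for GSW's Operator: map a raw observation to an
    intermediate semantic structure (entity, role, place, time)."""
    entity, role, place, time = observation
    return {"entity": entity, "role": role, "place": place, "time": time}

def reconcile(workspace, structure):
    """Stand-in for the Reconciler: merge a structure into the
    persistent workspace, keeping the latest state per entity and
    rejecting updates that would move an entity backwards in time."""
    prev = workspace.get(structure["entity"])
    if prev is None or structure["time"] >= prev["time"]:
        workspace[structure["entity"]] = structure
    return workspace

workspace = {}
for obs in [("Ana", "witness", "station", 1),
            ("Ana", "suspect", "harbor", 3),
            ("Ana", "witness", "station", 2)]:   # stale update, arrives late
    reconcile(workspace, operator(obs))
print(workspace["Ana"])
```

Because queries read the reconciled workspace rather than raw retrieved chunks, the query-time context stays small, which matches the token-efficiency claim in the abstract.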
Despite significant advances in natural language processing, conversational AI systems face persistent challenges in maintaining extensive and contextually coherent dialogues, particularly regarding long-term memory management. This literature review synthesizes current approaches to memory architectures in conversational AI, examining the transition from basic dialogue agents to more sophisticated, agentic frameworks. We analyze how vector databases and Retrieval-Augmented Generation (RAG) address fundamental challenges in storing and retrieving conversational context, maintaining system responsiveness, managing user-specific data ethically, and integrating domain-specific information. Through systematic review of papers, we identify critical limitations of vector embedding in capturing extended conversational context, particularly in agentic domains requiring semantic, episodic, procedural, and emotional memory. We evaluate how RAG frameworks can augment vector databases to handle memory-intensive tasks requiring real-time updates and domain-specific knowledge integration. Furthermore, we examine alternative architectures including knowledge graphs, finite state machines, and hybrid solutions, highlighting the data quality and ethical challenges that must be addressed for scalable, reliable AI memory management. Our analysis provides a structured framework for understanding memory evolution in conversational AI, identifies gaps in current RAG solutions, proposes hybrid memory designs, and outlines future research directions emphasizing cross-domain applications in agriculture.
Abstract Cognition entails those mental processes enabling understanding the current situation through senses, experience, and thought, and supporting the acquisition of new knowledge. A fundamental contribution in cognition is offered by the working memory, that is a small, short-term memory containing and protecting from interference goal-relevant pieces of information. Grounding our work on biological and neuroscientific studies, we modeled and implemented working memory processes in a software model, IDRA-WM, that can simultaneously act as short-term memory and actions generator, thanks to the use of a reinforcement-driven mechanism for chunk selection. Moreover our system integrates the functions of the working memory with a basic action planner. We tested the model with robot relevant tasks to assess whether the proposed solution can learn to solve a problem on the basis of a delayed reward. The experimental results indicate that IDRA-WM is able to solve even those tasks that do not provide immediate reward after an action.
Semantic memory plays a critical role in reasoning and decision making. It enables an agent to abstract useful knowledge learned from its past experience. Based on an extension of fusion adaptive resonance theory network, this paper presents a novel self-organizing memory model to represent and learn various types of semantic knowledge in a unified manner. The proposed model, called fusion adaptive resonance theory for multimemory learning, incorporates a set of neural processes, through which it may transfer knowledge and cooperate with other long-term memory systems, including episodic memory and procedural memory. Specifically, we present a generic learning process, under which various types of semantic knowledge can be consolidated and transferred from the specific experience encoded in episodic memory. We also identify and formalize two forms of memory interactions between semantic memory and procedural memory, through which more effective decision making can be achieved. We present experimental studies, wherein the proposed model is used to encode various types of semantic knowledge in different domains, including a first-person shooting game called Unreal Tournament, the Toads and Frogs puzzle, and a strategic game known as StarCraft Broodwar. Our experiments show that the proposed knowledge transfer process from episodic memory to semantic memory is able to extract useful knowledge to enhance the performance of decision making. In addition, cooperative interaction between semantic knowledge and procedural skills can lead to a significant improvement in both learning efficiency and performance of the learning agents.
… One of CORTEX’s main elements is a working memory designed as a graph-like data structure … The new working memory presents important advantages over existing designs that are …
… working memory, then that production fires and performs its actions, which consist of creating or removing elements from working memory … create representations in working memory of the …
Memory enables past experiences to be remembered and acquired as useful knowledge to support decision making, especially when perception and computational resources are limited. This paper presents a neuropsychological-inspired dual memory model for agents, consisting of an episodic memory that records the agent's experience in real time and a semantic memory that captures factual knowledge through a parallel consolidation process. In addition, the model incorporates a natural forgetting mechanism that prevents memory overloading by removing transient memory traces. Our experimental study based on a real-time first-person-shooter video game has indicated that the memory consolidation and forgetting processes are not only able to extract valuable knowledge and regulate the memory capacity, but they can mutually improve the effectiveness of learning the knowledge for the given task in hand. Interestingly, a moderate level of forgetting may even improve the task performance rather than disadvantaging it. We suggest that the interplay between rapid memory formation, consolidation, and forgetting processes points to a practical and effective approach for learning agents to acquire and maintain useful knowledge from experiences in a scalable manner.
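The consolidation-and-forgetting interplay described above can be sketched minimally: episodic traces decay each step, while traces rehearsed often enough are promoted to a semantic store. The class, thresholds, and decay schedule below are assumptions for illustration; the paper's actual model is built on adaptive resonance theory networks:

```python
class DualMemory:
    """Episodic-to-semantic consolidation with natural forgetting
    (minimal sketch; thresholds and decay are illustrative)."""

    def __init__(self, decay=0.9, consolidate_at=3, forget_below=0.1):
        self.decay = decay                    # per-step trace decay
        self.consolidate_at = consolidate_at  # rehearsals needed to consolidate
        self.forget_below = forget_below      # strength below which traces vanish
        self.episodic = {}                    # trace -> (strength, rehearsals)
        self.semantic = set()                 # consolidated factual knowledge

    def record(self, trace):
        """Real-time episodic recording; rehearsal refreshes strength."""
        _, count = self.episodic.get(trace, (0.0, 0))
        self.episodic[trace] = (1.0, count + 1)

    def step(self):
        """Parallel consolidation pass plus decay-based forgetting."""
        survivors = {}
        for trace, (strength, count) in self.episodic.items():
            if count >= self.consolidate_at:
                self.semantic.add(trace)     # promote well-rehearsed traces
                continue
            strength *= self.decay           # natural forgetting
            if strength >= self.forget_below:
                survivors[trace] = (strength, count)
        self.episodic = survivors
```

The forgetting pass is what regulates capacity: one-off traces drain away without ever reaching the semantic store, which mirrors the paper's finding that moderate forgetting can help rather than hurt.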
Vector database management systems have emerged as an important component in modern data management, driven by the growing need to computationally describe rich data such as text, images, and video in domains such as recommender systems, similarity search, and chatbots. These data descriptions are captured as numerical vectors that are computationally inexpensive to store and compare. However, the unique characteristics of vectorized data, including high dimensionality and sparsity, demand specialized solutions for efficient storage, retrieval, and processing. This narrative literature review provides an accessible introduction to the fundamental concepts, use cases, and current challenges associated with vector database management systems, offering an overview for researchers and practitioners seeking to facilitate effective vector data management.
Feature vectors are now mission-critical for many applications, including retrieval-based large language models (LLMs). Traditional database management systems are not equipped to deal with the unique characteristics of feature vectors, such as the vague notion of semantic similarity, the large size of vectors, expensive similarity comparisons, the lack of indexable structure, and the difficulty of answering "hybrid" queries that combine structured attributes with feature vectors. A number of vector database management systems (VDBMSs) have been developed to address these challenges, combining novel techniques for query processing, storage and indexing, and query optimization and execution, culminating in a spectrum of performance and accuracy characteristics and capabilities. In this tutorial, we review the existing vector database management techniques and systems. For query processing, we review similarity score design and selection, vector query types, and vector query interfaces. For storage and indexing, we review various indexes and discuss compression as well as disk-resident indexes. For query optimization and execution, we review hybrid query processing, hardware acceleration, and distributed search. We then review existing systems, search engines and libraries, and benchmarks. Finally, we present research challenges and open problems.
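Of the hybrid-query strategies the tutorial surveys, pre-filtering is the simplest to sketch: structured predicates prune the candidates first, then exact similarity ranks the survivors. The brute-force cosine scoring below stands in for an ANN index; the function names are illustrative:

```python
import math

def cosine(a, b):
    """Exact cosine similarity (an ANN index would approximate this)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def hybrid_search(items, query_vec, predicate, k=2):
    """Pre-filtering hybrid query: structured predicate first,
    then similarity ranking over the surviving candidates."""
    candidates = [it for it in items if predicate(it["attrs"])]
    candidates.sort(key=lambda it: cosine(it["vec"], query_vec), reverse=True)
    return candidates[:k]
```

Post-filtering (rank first, filter afterwards) and single-stage filtered-ANN indexes are the alternatives; pre-filtering stays exact but degenerates to a linear scan when the predicate is unselective.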
In this paper, we provide a review of the current efforts to develop LLM agents, which are autonomous agents that leverage large language models. We examine the memory management approaches used in these agents. One crucial aspect of these agents is their long-term memory, which is often implemented using vector databases. We describe how vector databases are utilized to store and retrieve information in LLM agents. Moreover we highlight open problems, such as the separation of different types of memories and the management of memory over the agent's lifetime. Lastly, we propose several topics for future research to address these challenges and further enhance the capabilities of LLM agents, including the use of metadata in procedural and semantic memory and the integration of external knowledge sources with vector databases.
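The open problem of separating memory types can be sketched by tagging each stored record with a kind (episodic, semantic, procedural) and filtering on it at retrieval time, much as one would use metadata filters in a vector database. The word-overlap similarity below is a stand-in for real embeddings; all names are illustrative:

```python
def overlap(a, b):
    """Word-overlap (Jaccard) similarity — a stand-in for embeddings."""
    sa, sb = set(a.split()), set(b.split())
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

class TypedMemoryStore:
    """Memories tagged by kind, so retrieval can target one memory type."""

    def __init__(self):
        self.records = []                    # list of (kind, text)

    def add(self, kind, text):
        self.records.append((kind, text))

    def search(self, query, kind=None, k=1):
        pool = [(overlap(query, text), text)
                for rec_kind, text in self.records
                if kind is None or rec_kind == kind]
        pool.sort(reverse=True)
        return [text for _, text in pool[:k]]
```

In a production VDBMS the `kind` tag would live in the record's metadata payload, letting the same index serve all memory types while keeping retrieval scoped.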
Current large language model (LLM) agents face the challenges of high inference cost and low decision quality when dealing with complex tasks, and are especially deficient in maintaining context coherence during long tasks. This research presents an innovative vector-storage long-term memory mechanism (VIMBank) to enhance the long-term context retention and task execution efficiency of LLM agents by storing and retrieving historical interaction data through a vector database. VIMBank employs a dynamic memory-updating strategy together with the Ebbinghaus forgetting-curve theory to manage agent memory efficiently: critical information is reinforced, unimportant data is forgotten, and storage and reasoning costs are optimized. The experimental results show that VIMBank significantly improves the decision quality and efficiency of LLM agents in multi-tasking scenarios while reducing computational cost. Compared with baseline agents, the task-decision success rate increases by 10% to 20% and the reasoning cost is reduced by about 23%, providing a theoretical basis and practical support for the future development of agents with long-term memory and adaptive learning abilities.
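The Ebbinghaus-style retention score underlying such a mechanism is commonly modeled as R = exp(-t / S), with elapsed time t since last access and a stability S that grows with each retrieval. The sketch below applies that score as an eviction rule; the thresholds and field names are assumptions, not VIMBank's internals:

```python
import math

class ForgettingStore:
    """Ebbinghaus-style retention R = exp(-t / S): t is time since last
    access, S a stability that grows with each retrieval. Thresholds and
    names are illustrative, not VIMBank's internals."""

    def __init__(self, forget_below=0.2):
        self.forget_below = forget_below
        self.items = {}                  # key -> [stability, last_access]

    def put(self, key, now):
        self.items[key] = [1.0, now]

    def retrieve(self, key, now):
        self.items[key][0] += 1.0        # rehearsal strengthens the trace
        self.items[key][1] = now

    def retention(self, key, now):
        stability, last = self.items[key]
        return math.exp(-(now - last) / stability)

    def sweep(self, now):
        """Forget items whose retention has dropped below threshold."""
        self.items = {k: v for k, v in self.items.items()
                      if math.exp(-(now - v[1]) / v[0]) >= self.forget_below}
```

Retrieval thus doubles as reinforcement: frequently accessed memories flatten their forgetting curve and survive sweeps, while untouched ones decay out, bounding storage cost.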
… In the retrieval-augmented generation (RAG) paradigm, a language model’s parametric knowledge is supplemented with non-parametric memory from an external vector database. This …
This paper addresses the software engineering challenges of integrating autonomous agents into production-grade web applications. While traditional implementations suffer from high latency and state synchronization issues, this study presents a full-stack solution based on TypeScript and React 19 Server Components. The paper details the implementation of a RAMP (Reflect, Act, Memory, Plan) execution loop at the code level, using Qdrant for low-latency (<100 ms) vector retrieval and Next.js for server-side orchestration. A key engineering contribution is the development of a strictly typed data contract that synchronizes server-side agent reasoning with client-side state management (via TanStack Query). Experimental results confirm that this stack architecture significantly reduces response times and prevents runtime type errors, offering a reproducible pattern for building scalable, high-load web platforms.
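One plausible ordering of the four RAMP phases can be sketched as a single loop step: recall from memory, plan, act, persist the observation, then reflect to produce the next state. The Python below is an interface sketch only, with a trivial exact-match store standing in for Qdrant; the paper's server-side TypeScript implementation is not reproduced here:

```python
class ListMemory:
    """Trivial stand-in for a vector store: exact-match recall."""

    def __init__(self):
        self.log = []                        # list of (state, observation)

    def search(self, state):
        return [obs for s, obs in self.log if s == state]

    def add(self, state, observation):
        self.log.append((state, observation))

def ramp_step(state, memory, plan, act, reflect):
    """One loop iteration over the four RAMP phases (ordering assumed)."""
    context = memory.search(state)           # recall relevant memories
    goal = plan(state, context)              # Plan: decide what to pursue
    observation = act(goal)                  # Act: execute and observe
    memory.add(state, observation)           # Memory: persist the outcome
    return reflect(state, observation)       # Reflect: produce next state
```

Keeping each phase a plain function makes the loop easy to type strictly, which is the data-contract idea the paper pursues on its TypeScript stack.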
… records, vector memory, and audit logs into uncoordinated silos. We present AEVUM, an agent-native embedded database system that inverts this model. In AEVUM the AI agent owns, …
Overall, the literature can be organized along the memory lifecycle and its system realization into five parallel threads: 1) chunking, selection, and hierarchical management of in-session working memory; 2) cognition-inspired long-term memory formation, consolidation and forgetting, and semantic/episodic interaction representations; 3) the evolution from RAG toward long-term memory for long contexts: dynamic retrieval and multi-turn updating; 4) engineering-oriented memory operating systems and memory fabrics built on vector-database foundations; 5) persistent, distributed memory management and stability control in multi-agent and interactive task scenarios.