Agent Memory Systems
Memory tiering, lifecycle management, and consolidation/trigger strategies for interactive LLM agents
The common thread is the memory lifecycle and retrieval-trigger mechanisms of conversational/interactive LLM agents: memory tiering (short/mid/long-term, or working/episodic/semantic), cross-tier update and consolidation strategies, and retrieval and generation conditioned on context, subgoals, and temporal factors, all aimed at improving long-conversation consistency and personalization. Some of this work also covers performance evaluation over very long conversations or reflective memory management.
- Memory OS of AI Agent (Jie Kang, Mingming Ji, Zhe Zhao, Ting Bai, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)
- A Hybrid, Multi-Layered Memory Architecture for Collaborative Reasoning in Multi-Agent Systems (M. Ilin, Dmitry Pavlyuk, 2025, 2025 3rd International Conference on Foundation and Large Language Models (FLLM))
- HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model (Mengkang Hu, Tianxing Chen, Qiguang Chen, Yi Mu, Wenqi Shao, Ping Luo, 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
- "My agent understands me better": Integrating Dynamic Human-like Memory Recall and Consolidation in LLM-Based Agents (Yuki Hou, Haruki Tamoto, Homei Miyashita, 2024, Extended Abstracts of the CHI Conference on Human Factors in Computing Systems)
- MemInsight: Autonomous Memory Augmentation for LLM Agents (Rana Salama, Jason Cai, Mingzhe Yuan, Anna Currey, Mahendra K. Sunkara, Yi Zhang, Yassine Benajiba, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)
- In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents (Tan Zhen, Jun Yan, I-Hung Hsu, R.J. Han, Zifeng Wang, Duc Long Le, Yong Sang Song, Yanfei Chen, Hamid Palangi, George Lee, Aarti Iyer, Tianlong Chen, Huan Liu, Chen-Yu Lee, Tomas Pfister, 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
- Evaluating Very Long-Term Conversational Memory of LLM Agents (Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang, 2024, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
Memory storage and retrieval substrates: vector databases, persistence systems, and memory data structures/frameworks
The common thread is treating external, persistent memory storage and retrieval as the core infrastructure or methodology: vector databases working in concert with RAG, update and forgetting mechanisms for vector stores, and agent-native persistent databases or memory data structures (graphs, embedded databases, memory fabrics). Some surveys and frameworks also discuss open problems in separating memory types and in long-term memory management, with an emphasis on engineering-level feasibility.
- Vector Databases and Language Models: Synergies and Challenges (Toni Taipalus, 2025, Communications in Computer and Information Science)
- Vector Storage Based Long-term Memory Research on LLM (Kun Li, Xin Jing, Chengang Jing, 2024, International Journal of Advanced Network, Monitoring and Controls)
- AgenticMemory: A Binary Graph Format for Persistent, Portable, and Navigable AI Agent Memory (Omoshola S. Owolabi, 2026, … , and Navigable AI Agent Memory (February 18, 2026))
- AEVUM: An Agent-Native Persistent Memory Database System with Autonomous Data Management and Multi-Stage Compression (JR Maligireddy, 2026, Authorea Preprints)
- Memory Fabric for Conversational AI Agents: Enabling Shared and Persistent Memory Across Users (A Tiwari, V Gupta, 2025, Authorea Preprints)
- An emotion understanding framework for intelligent agents based on episodic and semantic memories (M. Kazemifard, N. Ghasem-Aghaee, Bryan L. Koenig, T. Ören, 2013, Autonomous Agents and Multi-Agent Systems)
- Memory Matters: The Need to Improve Long-Term Memory in LLM-Agents (Kostas Hatalis, Despina Christou, Joshua Myers, Steven Jones, Keith A. Lambert, Adam Amos-Binks, Zohreh Dannenhauer, Dustin Dannenhauer, 2024, Proceedings of the AAAI Symposium Series)
- MemInsight: Autonomous Memory Augmentation for LLM Agents (Rana Salama, Jason Cai, Mingzhe Yuan, Anna Currey, Mahendra K. Sunkara, Yi Zhang, Yassine Benajiba, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)
Memory-retrieval augmentation and relevance modeling (retrieval modules, attention, filtering)
The common thread is modeling and optimizing memory-retrieval quality: better retrieval scoring and attention allocation, memory augmentation and filtering (to prune irrelevant memories), or dedicated retrieval modules that improve the adaptability and behavioral consistency of generative agents. This work also bears directly on retrieval effectiveness in long-conversation settings.
- Enhancing memory retrieval in generative agents through LLM-trained cross attention networks (Chuanyang Hong, Qingyun He, 2025, Frontiers in Psychology)
- MemInsight: Autonomous Memory Augmentation for LLM Agents (Rana Salama, Jason Cai, Mingzhe Yuan, Anna Currey, Mahendra K. Sunkara, Yi Zhang, Yassine Benajiba, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)
- "My agent understands me better": Integrating Dynamic Human-like Memory Recall and Consolidation in LLM-Based Agents (Yuki Hou, Haruki Tamoto, Homei Miyashita, 2024, Extended Abstracts of the CHI Conference on Human Factors in Computing Systems)
- Evaluating Very Long-Term Conversational Memory of LLM Agents (Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang, 2024, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
Episodic memory: representation, triggering, and applications
The common thread is treating episodic memory as a key memory type and studying its representation, triggering, and applications: symbolic/structured episodic memory over temporally rich domains, pipelines from short-term episodic buffers to long-term storage, and capabilities aimed at affective interaction or human-like behavior.
- Episodic memory formulation and its application in long-term HRI (Markos Sigalas, M. Maniadakis, P. Trahanias, 2017, 2017 26th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN))
- Enhancing intelligent agents with episodic memory (Andrew Nuxoll, John E. Laird, 2012, Cognitive Systems Research)
- Towards episodic memory-based long-term affective interaction with a human-like robot (Zerrin Kasap, N. Magnenat-Thalmann, 2010, 19th International Symposium in Robot and Human Interactive Communication)
- Episodic memory for autonomous agents (T. Deutsch, A. Gruber, R. Lang, R. Velik, 2008, 2008 Conference on Human System Interactions)
- Episodic Memory for Human-like Agents and Human-like Agents for Episodic Memory (C. Brom, J. Lukavský, 2010, International Journal of Machine Consciousness)
- Different Ways to Cue a Coherent Memory System: A Theory for Episodic, Semantic, and Procedural Tasks (M. Humphreys, J. Bain, R. Pike, 1989, Psychological Review)
Memory use in long-horizon tasks: modeling cross-temporal dependencies and closing the cognitive loop
The common thread is handling cross-temporal dependencies in long-horizon decision making: attention- or Transformer-style memory policies, explicitly wiring long-term memory retrieval into the cognitive loop so that it shapes decision logic, or analyses of how long-term memory affects long-task execution and consistency. The emphasis is on how memory is exploited across time.
- Towards episodic memory-based long-term affective interaction with a human-like robot (Zerrin Kasap, N. Magnenat-Thalmann, 2010, 19th International Symposium in Robot and Human Interactive Communication)
- Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks (Kuan Fang, Alexander Toshev, Li Fei-Fei, S. Savarese, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Cognitive Modeling for Long-Horizon Agent Learning via Integrated Long-Term Memory and Reasoning (Linghao Yang, Tian Guan, Yumeng Ma, Zhongkang Li, Zhou Fang, Feiyang Wang, 2026, … Networks and Machine …)
- Memory Matters: The Need to Improve Long-Term Memory in LLM-Agents (Kostas Hatalis, Despina Christou, Joshua Myers, Steven Jones, Keith A. Lambert, Adam Amos-Binks, Zohreh Dannenhauer, Dustin Dannenhauer, 2024, Proceedings of the AAAI Symposium Series)
Privacy and security risks of memory systems (memory leakage and the need for safeguards)
The common thread is examining memory systems from a security and risk perspective: the privacy leakage that can occur once an agent writes user interactions into memory, proposed extraction attacks against memory together with analyses of the factors influencing them, and the recoverability/extractability risks that long-term memory introduces.
- Unveiling Privacy Risks in LLM Agent Memory (Bo Wang, Weiyi He, Shenglai Zeng, Zhen Xiang, Yue Xing, Jiliang Tang, Pengfei He, 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
- Evaluating Very Long-Term Conversational Memory of LLM Agents (Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang, 2024, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
Agent-memory system architectures, surveys, and engineering deployments (production and domain applications)
The common thread is overall architecture and real-world deployment: surveys of the four pillars of LLM agents (perception, planning, memory, action) and of memory management, embedding memory capabilities into production systems (multi-agent web systems, domain-specific operations memory architectures), and validating long-term mechanisms built on vector storage. The emphasis is on systematization and deployability.
- Memory Fabric for Conversational AI Agents: Enabling Shared and Persistent Memory Across Users (A Tiwari, V Gupta, 2025, Authorea Preprints)
- A Survey of LLM-based Agents: Theories, Technologies, Applications and Suggestions (Xiaofei Dong, Xueqiang Zhang, Weixin Bu, Dan Zhang, Feng Cao, 2024, 2024 3rd International Conference on Artificial Intelligence, Internet of Things and Cloud Computing Technology (AIoTC))
- Memory Matters: The Need to Improve Long-Term Memory in LLM-Agents (Kostas Hatalis, Despina Christou, Joshua Myers, Steven Jones, Keith A. Lambert, Adam Amos-Binks, Zohreh Dannenhauer, Dustin Dannenhauer, 2024, Proceedings of the AAAI Symposium Series)
- High-Performance Implementation of Multi-Agent Web Systems: Integrating Vector Memory with Strictly Typed React Architectures (Mykhailo Nykoliuk, 2025, Universal Library of Engineering Technology)
- Mind-Tool: Domain Memory Architecture for AI Agents (Ioannis Chrysochos, 2026, Journal of Engineering and Artificial Intelligence)
- Vector Storage Based Long-term Memory Research on LLM (Kun Li, Xin Jing, Chengang Jing, 2024, International Journal of Advanced Network, Monitoring and Controls)
Memory-reasoning coupling and performance optimization under embodied, multi-agent, and real-time constraints
The common thread is deployment and performance trade-offs in multi-agent, embodied, or real-time settings: how long-term memory couples with the policy/reasoning loop under partial observability, long-horizon control, or production-grade low-latency constraints, while preserving throughput and response speed.
- Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks (Kuan Fang, Alexander Toshev, Li Fei-Fei, S. Savarese, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Cognitive Modeling for Long-Horizon Agent Learning via Integrated Long-Term Memory and Reasoning (Linghao Yang, Tian Guan, Yumeng Ma, Zhongkang Li, Zhou Fang, Feiyang Wang, 2026, … Networks and Machine …)
- High-Performance Implementation of Multi-Agent Web Systems: Integrating Vector Memory with Strictly Typed React Architectures (Mykhailo Nykoliuk, 2025, Universal Library of Engineering Technology)
Taken together, the literature centers on how long-term memory is written, tiered, triggered for retrieval, and used to support consistent long-horizon decisions. It builds memory substrates (vector stores, persistent databases, data structures, memory fabrics, MemoryOS, and the like); improves memory usability through retrieval augmentation (relevance modeling, filtering, attention, quantified consolidation); studies, at the model level, episodic memory and the exploitation of long-range temporal dependencies (episodic/scene memory, cognitive loops); addresses, on the risk side, the privacy leakage that memory introduces; and, through surveys and engineering work, covers the path from theoretical frameworks to production systems and embodied/multi-agent deployments.
29 related papers in total.
In this paper, we provide a review of the current efforts to develop LLM agents, which are autonomous agents that leverage large language models. We examine the memory management approaches used in these agents. One crucial aspect of these agents is their long-term memory, which is often implemented using vector databases. We describe how vector databases are utilized to store and retrieve information in LLM agents. Moreover, we highlight open problems, such as the separation of different types of memories and the management of memory over the agent's lifetime. Lastly, we propose several topics for future research to address these challenges and further enhance the capabilities of LLM agents, including the use of metadata in procedural and semantic memory and the integration of external knowledge sources with vector databases.
Large Language Models (LLMs) face a crucial challenge from fixed context windows and inadequate memory management, leading to a severe shortage of long-term memory capabilities and limited personalization in the interactive experience with AI agents. To overcome this challenge, we innovatively propose a Memory Operating System, i.e., MemoryOS, to achieve comprehensive and efficient memory management for AI agents. Inspired by the memory management principles in operating systems, MemoryOS designs a hierarchical storage architecture and consists of four key modules: Memory Storage, Updating, Retrieval, and Generation. Specifically, the architecture comprises three levels of storage units: short-term memory, mid-term memory, and long-term personal memory. Key operations within MemoryOS include dynamic updates between storage units: short-term to mid-term updates follow a dialogue-chain-based FIFO principle, while mid-term to long-term updates use a segmented page organization strategy. Extensive experiments on the LoCoMo benchmark show an average improvement of 49.11% on F1 and 46.18% on BLEU-1 over the baselines on GPT-4o-mini, showing contextual coherence and personalized memory retention in long conversations.
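The tier-update flow this abstract describes — FIFO promotion from short- to mid-term, paged flushes from mid- to long-term — can be sketched minimally. The class, capacities, and page size below are illustrative assumptions, not MemoryOS's actual implementation:

```python
from collections import deque

class TieredMemory:
    """Minimal sketch of a three-tier memory in the spirit of MemoryOS.

    Tier names follow the abstract; capacities, page size, and the
    promotion logic here are illustrative assumptions only.
    """

    def __init__(self, short_cap=4, mid_cap=8):
        self.short = deque()   # short-term: most recent dialogue turns
        self.mid = deque()     # mid-term: turns evicted from short-term
        self.long = []         # long-term personal memory, stored as pages
        self.short_cap = short_cap
        self.mid_cap = mid_cap

    def add_turn(self, turn):
        self.short.append(turn)
        # Short-term -> mid-term: FIFO, the oldest turn is evicted first.
        while len(self.short) > self.short_cap:
            self.mid.append(self.short.popleft())
        # Mid-term -> long-term: flush a fixed-size segment ("page")
        # whenever the mid-term tier overflows.
        while len(self.mid) > self.mid_cap:
            page = [self.mid.popleft() for _ in range(max(1, self.mid_cap // 2))]
            self.long.append(page)

mem = TieredMemory(short_cap=2, mid_cap=2)
for i in range(6):
    mem.add_turn(f"turn-{i}")
# The oldest turns end up paged out to long-term memory.
```

Keeping long-term memory in coarse pages rather than individual turns mirrors the segmented-page organization the abstract names, at the cost of coarser-grained recall.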
Many robotic applications require the agent to perform long-horizon tasks in partially observable environments. In such applications, decision making at any step can depend on observations received far in the past. Hence, being able to properly memorize and utilize the long-term history is crucial. In this work, we propose a novel memory-based policy, named Scene Memory Transformer (SMT). The proposed policy embeds and adds each observation to a memory and uses the attention mechanism to exploit spatio-temporal dependencies. This model is generic and can be efficiently trained with reinforcement learning over long episodes. On a range of visual navigation tasks, SMT demonstrates superior performance to existing reactive and memory-based policies by a margin.
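The memory readout at the heart of SMT — attend over all stored observation embeddings using the current state as the query — can be illustrated with plain scaled dot-product attention. The single-head, projection-free form below is a simplification for illustration, not the paper's model:

```python
import math

def attention_readout(query, memory):
    """Scaled dot-product attention over stored observation embeddings.

    Scores every memory entry against the query, softmaxes the scores,
    and returns the weighted sum of entries. No learned projections.
    """
    scores = [sum(q * m for q, m in zip(query, entry)) / math.sqrt(len(query))
              for entry in memory]
    max_s = max(scores)                    # stabilize the softmax
    exps = [math.exp(s - max_s) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(memory[0])
    return [sum(w * entry[d] for w, entry in zip(weights, memory))
            for d in range(dim)]

# Entries aligned with the query dominate the readout.
memory = [[1.0, 0.0], [0.0, 1.0]]
out = attention_readout([10.0, 0.0], memory)
```

Because the readout is a soft sum over the whole memory, the policy can exploit spatio-temporal dependencies without deciding in advance which observations to keep.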
This study focuses on the tendency of agents in long-horizon sequential tasks to rely on short-term states and to underutilize historical information, and proposes a cognitive modeling and learning framework with long-term memory and reasoning capabilities. The framework provides a unified cognitive description of the agent's decision process. It introduces a structured long-term memory mechanism to support continuous storage and selective updating of cross-temporal key information. On this basis, a memory retrieval-driven reasoning module is constructed so that experience can explicitly participate in the formation of current decision logic. To address the separation between memory and decision making in conventional policy models, the framework tightly couples perception representation, memory management, reasoning processes, and policy generation into an end-to-end cognitive loop. This design strengthens goal consistency and behavioral stability in long-horizon interactive environments. Comparative evaluations in open source interactive task settings demonstrate consistent advantages in task completion quality, decision efficiency, and long-term information utilization. The results indicate that the proposed cognitive modeling framework effectively mitigates decision difficulties caused by long-range dependencies and partial observability. Overall, the study shows that integrating long-term memory and reasoning within a unified learning framework is an important approach for improving sustained decision-making capability in complex environments.
Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2024.
With the rise of smart personal devices, service-oriented human-agent interactions have become increasingly prevalent. This trend highlights the need for personalized dialogue assistants that can understand user-specific traits to accurately interpret requirements and tailor responses to individual preferences. However, existing approaches often overlook the complexities of long-term interactions and fail to capture users’ subjective characteristics. To address these gaps, we present PAL-Bench, a new benchmark designed to evaluate the personalization capabilities of service-oriented assistants in long-term user-agent interactions. In the absence of available real-world data, we develop a multi-step LLM-based synthesis pipeline, which is further verified and refined by human annotators. This process yields PAL-Set, the first Chinese dataset comprising multi-session user logs and dialogue histories, which serves as the foundation for PAL-Bench. Furthermore, to improve personalized service-oriented interactions, we propose H2Memory, a hierarchical and heterogeneous memory framework that incorporates retrieval-augmented generation to improve personalized response generation. Comprehensive experiments on both our PAL-Bench and an external dataset demonstrate the effectiveness of the proposed memory framework.
… First, rigid memory granularity fails to capture the natural semantic structure of conversations… Reflective Memory Management (RMM), a novel mechanism for long-term dialogue agents, …
Large Language Model (LLM)-based agents exhibit significant potential across various domains, operating as interactive systems that process environmental observations to generate executable actions for target tasks. The effectiveness of these agents is significantly influenced by their memory mechanism, which records historical experiences as sequences of action-observation pairs. We categorize memory into two types: cross-trial memory, accumulated across multiple attempts, and in-trial memory (working memory), accumulated within a single attempt. While considerable research has optimized performance through cross-trial memory, the enhancement of agent performance through improved working memory utilization remains underexplored. Instead, existing approaches often involve directly inputting entire historical action-observation pairs into LLMs, leading to redundancy in long-horizon tasks. Inspired by human problem-solving strategies, this paper introduces HiAgent, a framework that leverages subgoals as memory chunks to manage the working memory of LLM-based agents hierarchically. Specifically, HiAgent prompts LLMs to formulate subgoals before generating executable actions and enables LLMs to decide proactively to replace previous subgoals with summarized observations, retaining only the action-observation pairs relevant to the current subgoal. Experimental results across five long-horizon tasks demonstrate that HiAgent achieves a twofold increase in success rate and reduces the average number of steps required by 3.8. Additionally, our analysis shows that HiAgent consistently improves performance across various steps, highlighting its robustness and generalizability.
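The subgoal-chunking idea can be sketched as follows; the class and its trivial stand-in summarizer are illustrative assumptions (HiAgent uses LLM prompting for both subgoal formulation and summarization):

```python
class SubgoalWorkingMemory:
    """Sketch of subgoal-chunked working memory in the spirit of HiAgent.

    Finished subgoals keep only a compact summary; full action-observation
    detail is retained for the current subgoal only. The default summarizer
    is a trivial stand-in for the LLM summarization the paper describes.
    """

    def __init__(self, summarize=None):
        self.completed = []      # (subgoal, summary) pairs
        self.current_goal = None
        self.current_pairs = []  # full action-observation detail
        self.summarize = summarize or (
            lambda goal, pairs: f"{goal}: done in {len(pairs)} steps")

    def start_subgoal(self, goal):
        if self.current_goal is not None:
            # Replace the finished subgoal's detail with its summary.
            summary = self.summarize(self.current_goal, self.current_pairs)
            self.completed.append((self.current_goal, summary))
        self.current_goal = goal
        self.current_pairs = []

    def record(self, action, observation):
        self.current_pairs.append((action, observation))

    def context(self):
        """The (much shorter) history fed to the LLM at the next step."""
        return [s for _, s in self.completed] + self.current_pairs

wm = SubgoalWorkingMemory()
wm.start_subgoal("find key")
wm.record("open drawer", "empty")
wm.record("look under mat", "key found")
wm.start_subgoal("open door")
wm.record("use key", "door opens")
```

The context grows with the number of subgoals rather than the number of steps, which is what removes the redundancy in long-horizon tasks.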
… is called episodic memory. This work describes an episodic memory architecture derived from … 143], the human memory can be divided into a short term memory (alternatively: a working …
… agent’s ability to sense its environment, reason, and learn. We demonstrate that episodic memory enables agents … Working memory is a short-term declarative memory that encapsulates …
… The memory integrates following submodels: visual short-term … a generic and believable agent with episodic memory abilities, which … Such a generic agent has not been developed yet, …
… episodic memory model was not specifically designed for intelligent conversational agents, this description of memory … Episodes are first saved into short-term episodic memory (STEM). …
… The very short-term memory is sensory memory, basically a … Short-term memory (also referred to as working memory) … information retrieved from long-term memory [52]. Long-term …
Episodic memory endows agents with numerous general cognitive capabilities, such as action modeling and virtual sensing. However, for long-lived agents, there are numerous unexplored computational challenges in supporting useful episodic-memory functions while maintaining real-time reactivity. In this paper, we review the implementation of episodic memory in Soar and present an expansive evaluation of that system. We demonstrate useful applications of episodic memory across a variety of domains, including games, mobile robotics, planning, and linguistics. In these domains, we characterize properties of environments, tasks, and episodic cues that affect performance, and evaluate the ability of Soar’s episodic memory to support hours to days of real-time operation.
… episodic memory for autonomous artificial agents, which encodes symbolic information on a temporally rich domain. In particular, memory … Beaufays, “Long short-term memory recurrent …
… When developing the Matrix model to encompass both episodic and semantic memories, we sought to preserve earlier distributed models as special cases. We achieved this by …
Introduction: The surge in the capabilities of large language models (LLMs) has propelled the development of Artificial General Intelligence (AGI), highlighting generative agents as pivotal components for emulating complex AI behaviors. Given the high costs associated with individually training LLMs for each AI agent, there is a critical need for advanced memory retrieval mechanisms to maintain the unique characteristics and memories of individual AI agents.
Methods: In this research, we developed a text-based simulation of a generative agent world, constructing a community with multiple agents and locations in which certain levels of interaction were enabled. Within this framework, we introduced a novel memory retrieval system using an Auxiliary Cross Attention Network (ACAN). This system calculates and ranks attention weights between an agent's current state and stored memories, selecting the most relevant memories for any given situation. In a novel approach, we incorporated LLM assistance, comparing memories retrieved by our model with those extracted using a base method during training, and constructing a novel loss function based on these comparisons to optimize the training process effectively. To our knowledge, this is the first study to utilize LLMs to train a dedicated agent memory retrieval network.
Results: Our empirical evaluations demonstrate that this approach substantially enhances the quality of memory retrieval, thereby increasing the adaptability and behavioral consistency of agents in fluctuating environments.
Discussion: Our findings not only introduce new perspectives and methodologies for memory retrieval in generative agents but also extend the utility of LLMs in memory management across varied AI agent applications.
… and apply memory retrieval methods that leverage the generated memory augmentations to filter out irrelevant memory while … Memory Retrieval For this task we evaluate attribute-based …
In this study, we propose a novel human-like memory architecture designed for enhancing the cognitive abilities of large language model (LLM)-based dialogue agents. Our proposed architecture enables agents to autonomously recall memories necessary for response generation, effectively addressing a limitation in the temporal cognition of LLMs. We adopt the human memory cue recall as a trigger for accurate and efficient memory recall. Moreover, we developed a mathematical model that dynamically quantifies memory consolidation, considering factors such as contextual relevance, elapsed time, and recall frequency. The agent stores memories retrieved from the user’s interaction history in a database that encapsulates each memory’s content and temporal context. Thus, this strategic storage allows agents to recall specific memories and understand their significance to the user in a temporal context, similar to how humans recognize and recall past experiences.
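A consolidation score combining the three factors this abstract names might look like the sketch below; the exponential-decay form and both constants are assumptions for illustration, not the paper's actual mathematical model:

```python
import math

def consolidation_score(relevance, hours_elapsed, recall_count,
                        decay_rate=0.1, recall_boost=0.2):
    """Combine contextual relevance, elapsed time, and recall frequency
    into one consolidation score. The decay form and the two constants
    are illustrative assumptions only.
    """
    retention = math.exp(-decay_rate * hours_elapsed)   # older memories fade
    reinforcement = 1.0 + recall_boost * recall_count   # recall strengthens
    return relevance * retention * reinforcement

# A frequently recalled older memory can outscore a fresh, never-recalled one.
fresh = consolidation_score(relevance=0.8, hours_elapsed=1, recall_count=0)
rehearsed = consolidation_score(relevance=0.8, hours_elapsed=10, recall_count=8)
```

Ranking candidate memories by such a score is one way recall can stay both accurate (relevance) and temporally aware (decay plus reinforcement).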
AI agents have shown potential towards Artificial General Intelligence (AGI): they are expected to autonomously perceive their environments, make decisions, and take actions. However, most existing AI agents are trained in confined environments with limited knowledge, yielding sub-optimal performance. Benefiting from the remarkable progress of large language models (LLMs), diverse LLM-based agents have emerged. These agents employ an LLM as the central brain to perceive, plan, memorize, and so on, exhibiting human-level intelligence across multifarious applications with satisfactory performance. In this paper, we survey LLM-based agents from the perspectives of theories, technologies, applications, and suggestions. Specifically, we first deliver a recapitulative review of the theoretical foundations, including Large Language Models, Chain of Thought and AI Alignment, Retrieval-Augmented Generation, Embodied AI, etc. We then present the key technologies, comprising four critical components: Perception, Planning, Memory, and Action. Subsequently, we briefly explore domain-related and evaluation applications. Finally, we provide pertinent suggestions based on observations of significant challenges for LLM-based agents.
Large Language Model (LLM) agents have become increasingly prevalent across various real-world applications. They enhance decision-making by storing private user-agent interactions in the memory module for demonstrations, introducing new privacy risks for LLM agents. In this work, we systematically investigate the vulnerability of LLM agents to our proposed Memory EXTRaction Attack (MEXTRA) under a black-box setting. To extract private information from memory, we propose an effective attacking prompt design and an automated prompt generation method based on different levels of knowledge about the LLM agent. Experiments on two representative agents demonstrate the effectiveness of MEXTRA. Moreover, we explore key factors influencing memory leakage from both the agent designer's and the attacker's perspectives. Our findings highlight the urgent need for effective memory safeguards in LLM agent design and deployment.
… Turing Machines and Key-Value Memory Networks to recent systems like AutoGen, … memory fabric. It also addresses work on dialogue and agent memory, multi-agent shared-memory …
While Large Language Model (LLM) based multi-agent systems (MAS) show promise, their capabilities are constrained by simplistic, monolithic memory models. To address this, we propose a novel, four-tier hierarchical memory architecture inspired by cognitive science. Our architecture decomposes memory into an L1 Active Context, L2 Working Memory for significant facts, L3 Episodic Memory for experiences, and L4 Semantic Memory for distilled knowledge. Our core innovation lies in the autonomous lifecycle engines that govern information flow between these tiers, including a CIAR scoring model for significance-based Promotion and a multi-stage Consolidation/Distillation pipeline for long-term learning. This principled, production-ready design enables more robust and auditable reasoning. We validate the performance of the underlying storage layer through a micro-benchmark, demonstrating the high reliability and throughput required for mission-critical agentic systems.
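Significance-based promotion between tiers can be sketched as below; since the CIAR scoring model's details are not given in the abstract, a precomputed placeholder score and a fixed threshold stand in:

```python
def promote(items, threshold=0.5):
    """Significance-based promotion between memory tiers.

    `items` is a list of (fact, significance_score) pairs; the scoring
    itself (CIAR in the paper) is abstracted away here. Facts at or
    above the threshold move to the next tier; the rest stay behind.
    Returns (promoted, retained).
    """
    promoted = [item for item, score in items if score >= threshold]
    retained = [item for item, score in items if score < threshold]
    return promoted, retained

# L1 active context -> L2 working memory: keep only significant facts.
l1_context = [("user prefers dark mode", 0.9),
              ("small talk about weather", 0.1),
              ("project deadline is Friday", 0.7)]
l2_working, l1_rest = promote(l1_context)
```

Running the same gate between L2/L3 and L3/L4 (with summarization or distillation applied at each hop) gives the lifecycle-engine flavor the abstract describes.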
We present Mind-Tool, an AI-augmented system implementing domain memory architecture for operational infrastructure management. Unlike conventional AI assistants that operate statelessly, Mind-Tool maintains an organized memory layer (persistent knowledge files), a desired-state model (conversational goal tracking), and a continuous reasoning engine that updates digital assets over time. Deployed for managing complex IT infrastructure (Proxmox clusters, Kubernetes, networking, security systems) over a 90-day production period, Mind-Tool achieved a 94% task success rate with 68% workflow automation and a 62% time reduction compared to manual approaches. Our architecture independently validates recent parallel research by Anthropic demonstrating that effective AI agents require persistent domain memory rather than relying solely on large context windows. We provide quantitative results from production deployment, identify key architectural differences between autonomous coding agents and operational infrastructure agents, and demonstrate that competitive advantage in AI agent systems lies in domain memory design rather than model intelligence, confirming through independent development and extended operational use that domain memory represents a fundamental pattern for practical agent systems in human-collaborative domains.
… Large language model agents operate without persistent memory, losing accumulated knowledge at every session boundary. Current approaches to agent memory—vector databases, …
Current large language model (LLM) agents face the challenges of high inference cost and low decision quality when dealing with complex tasks, and are especially deficient in maintaining context coherence during long tasks. This research presents an innovative vector-storage long-term memory mechanism (VIMBank) to enhance the long-term context retention and task-execution efficiency of LLM agents by storing and retrieving historical interaction data through a vector database. VIMBank uses a dynamic memory-updating strategy and the Ebbinghaus forgetting curve to manage agent memory efficiently: reinforcing critical information, forgetting unimportant data, and optimizing storage and reasoning costs. Experimental results show that VIMBank significantly improves the decision quality and efficiency of LLM agents in multi-tasking scenarios and reduces computational cost. Compared with different agents, task-decision success rate increases by 10% to 20% and reasoning cost falls by about 23%, providing a theoretical basis and practical support for the future development of agents with long-term memory and adaptive learning ability.
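The Ebbinghaus-style forgetting mechanism can be sketched as follows; the retention formula R = exp(-t/S) is the classic form of the curve, while the pruning threshold and the strength values are illustrative assumptions, not VIMBank's actual parameters:

```python
import math

def retention(hours_elapsed, strength):
    """Classic Ebbinghaus form R = exp(-t / S): low-strength memories
    decay quickly. Reinforcing a memory raises its strength S."""
    return math.exp(-hours_elapsed / strength)

def prune(store, now_hours, threshold=0.3):
    """Drop entries whose retention has fallen below the threshold.

    The store layout {key: (created_at_hours, strength)} and the
    threshold are assumptions for this sketch.
    """
    return {key: (t0, s) for key, (t0, s) in store.items()
            if retention(now_hours - t0, s) >= threshold}

store = {"critical fact": (0.0, 100.0),   # reinforced: high strength
         "trivial detail": (0.0, 1.0)}    # never reinforced: fades fast
kept = prune(store, now_hours=5.0)
```

Tying deletion to a decaying retention score rather than raw age is what lets critical, frequently reinforced information persist while storage cost stays bounded.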
… In the retrieval-augmented generation (RAG) paradigm, a language model’s parametric knowledge is supplemented with non-parametric memory from an external vector database. This …
This paper addresses the software engineering challenges of integrating autonomous agents into production-grade web applications. While traditional implementations suffer from high latency and state synchronization issues, this study presents a full-stack solution based on TypeScript and React 19 Server Components. This paper details the implementation of a RAMP (Reflect, Act, Memory, Plan) execution loop at the code level, using Qdrant to produce low-latency (<100ms) vectors and Next.js for server-side orchestration. A key engineering contribution is the development of a strictly typed data contract that synchronizes server-side agent reasoning with client-side state management (via TanStack Query). Experimental results confirm that this specific stack architecture significantly reduces response times and prevents runtime type errors, offering a reproducible pattern for building scalable, high-load web platforms.
… records, vector memory, and audit logs into uncoordinated silos. We present AEVUM, an agent-native embedded database system that inverts this model. In AEVUM the AI agent owns, …