Agent memory systems
Hierarchical / OS-style memory architectures and memory-management mechanisms for LLM agents
These works explicitly model memory as a systems-engineering problem of hierarchical storage plus cross-tier data flow, updating, retrieval, and generation. Drawing on operating-system designs or cognitive hierarchies (short/mid/long-term; working/episodic/semantic), they focus on lifecycle management, routing, and controllability, thereby mitigating context-window limits and improving performance in long dialogues and long-term personalization.
- Memory OS of AI Agent (Jie Kang, Mingming Ji, Zhe Zhao, Ting Bai, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)
- A Hybrid, Multi-Layered Memory Architecture for Collaborative Reasoning in Multi-Agent Systems (M. Ilin, Dmitry Pavlyuk, 2025, 2025 3rd International Conference on Foundation and Large Language Models (FLLM))
- Cooperative Scheduling and Hierarchical Memory Model for Multi-Agent Systems (Huhai Zou, Rongzhen Li, Tianhao Sun, Fei Wang, Ta-Hsin Li, Kai Liu, 2024, 2024 IEEE International Symposium on Product Compliance Engineering - Asia (ISPCE-ASIA))
- A Hybrid Agent Model, Mixing Short Term and Long Term Memory Abilities (F. Torterolo, C. Garbay, 1998, Lecture Notes in Computer Science)
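The tiered designs surveyed above can be sketched as a toy three-level store with FIFO promotion between tiers. This is a minimal illustration, not any paper's actual implementation: the capacities, the FIFO rule, and the placeholder summary step are all assumptions.

```python
from collections import deque
from dataclasses import dataclass, field

@dataclass
class HierarchicalMemory:
    """Toy three-tier store: short-term (FIFO queue), mid-term (segments),
    long-term (consolidated summaries)."""
    short_capacity: int = 3
    mid_capacity: int = 2
    short_term: deque = field(default_factory=deque)
    mid_term: list = field(default_factory=list)
    long_term: list = field(default_factory=list)

    def add_turn(self, turn: str) -> None:
        # New dialogue turns enter short-term memory; overflow is promoted FIFO.
        self.short_term.append(turn)
        if len(self.short_term) > self.short_capacity:
            self._promote_to_mid(self.short_term.popleft())

    def _promote_to_mid(self, turn: str) -> None:
        # A full mid-term tier consolidates its oldest segment into a
        # long-term summary (here a trivial string; real systems use the LLM).
        self.mid_term.append(turn)
        if len(self.mid_term) > self.mid_capacity:
            oldest = self.mid_term.pop(0)
            self.long_term.append(f"summary({oldest})")

    def context(self) -> list:
        # Retrieval for generation: long-term facts first, then recent turns.
        return self.long_term + self.mid_term + list(self.short_term)
```

With the default capacities, feeding seven turns leaves the three most recent verbatim in short-term, two in mid-term, and two consolidated summaries in long-term.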
Retrieval-augmented generation (RAG) and external memory banks: retrieve-rerank-generate with adaptive updating
What these papers share: they organize external memory (language-program pairs, experience snippets, evidence, vector-store entries, etc.) as a retrievable data source, and center on how to retrieve, how to reduce noise, how to update/partition/rerank, and how to inject retrieval results into generation. The core goals are higher correctness on open-domain tasks and better long-term decision quality, while cutting the redundancy, noise, and compute overhead that retrieval introduces.
- Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models (Gabriel Sarch, Yue Wu, Michael J. Tarr, Katerina Fragkiadaki, 2023, Findings of the Association for Computational Linguistics: EMNLP 2023)
- MemInsight: Autonomous Memory Augmentation for LLM Agents (Rana Salama, Jason Cai, Mingzhe Yuan, Anna Currey, Mahendra K. Sunkara, Yi Zhang, Yassine Benajiba, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)
- Agentic memory-augmented retrieval and evidence grounding for medical question-answering tasks (Shuyue Jia, Subhrangshu Bit, Varuna Jasodanand, Yi Liu, Vijaya B. Kolachalama, 2026, International Journal of Medical Informatics)
- Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs (Zheng Wang, Zhongyang Li, Zeren Jiang, Dandan Tu, W. Shi, 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing)
- Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation (Qitao Qin, Yucong Luo, Yihang Lu, Zhibo Chu, Xiaoman Liu, Xianwei Meng, 2025, Findings of the Association for Computational Linguistics: ACL 2025)
- M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions (Zheng Wang, Shu Mei Teo, Jieer Ouyang, Yongjun Xu, Wei Shi, 2024, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
- Retrieval-Augmented Embodied Agents (Yichen Zhu, Zhicai Ou, Xiaofeng Mou, Jian Tang, 2024, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Vector Storage Based Long-term Memory Research on LLM (Kun Li, Xin Jing, Chengang Jing, 2024, International Journal of Advanced Network, Monitoring and Controls)
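The retrieve-rerank-inject pattern common to this group can be outlined with a deliberately simple term-overlap scorer standing in for embedding similarity; all function names, thresholds, and the prompt template are illustrative assumptions.

```python
def retrieve(memory_bank, query_terms, k=4):
    # First-stage retrieval: score every stored memory by word overlap
    # with the query (a stand-in for vector similarity search).
    scored = sorted(memory_bank,
                    key=lambda m: -len(set(m.split()) & set(query_terms)))
    return [m for m in scored[:k] if set(m.split()) & set(query_terms)]

def rerank_and_filter(candidates, query_terms, min_overlap=2):
    # Second stage: drop weakly matching candidates to cut retrieval noise.
    return [m for m in candidates
            if len(set(m.split()) & set(query_terms)) >= min_overlap]

def build_prompt(query, evidence):
    # Inject the surviving evidence into the generation prompt.
    lines = [f"[{i}] {e}" for i, e in enumerate(evidence, 1)]
    return "Evidence:\n" + "\n".join(lines) + f"\nQuestion: {query}"
```

The adaptive-updating variants above differ mainly in what happens between the stages: rewriting the query, deciding when to stop retrieving, or writing new entries back into the bank.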
Personalized / long-term user memory: hierarchical representation and retrieval adapted to user preferences and habits
These studies focus on memory organization and personalized service in long-term user interaction: via hierarchical memory frameworks, retrieval and generation oriented to user sessions and preferences, or access and organization principles derived from user cognition and expectations, so that agents remain consistent, controllable, and usable across sessions.
- Mem-PAL: Towards Memory-based Personalized Dialogue Assistants for Long-term User-Agent Interaction (Zhaopei Huang, Qifeng Dai, Guozheng Wu, Xiaopeng Wu, Xubin Li, Tiezheng Ge, Wenxuan Wang, Qin Jin, 2026, Proceedings of the AAAI Conference on Artificial Intelligence)
- Memory Fabric for Conversational AI Agents: Enabling Shared and Persistent Memory Across Users (A Tiwari, V Gupta, 2025, Authorea Preprints)
- Users' Expectations and Practices with Agent Memory (Brennan Jones, Kelsey Stemmler, Emily Su, Young-Ho Kim, A. Kuzminykh, 2025, Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems)
- Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs (Zheng Wang, Zhongyang Li, Zeren Jiang, Dandan Tu, W. Shi, 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing)
- Multi-agent Personal Memory Assistant (Ângelo Costa, P. Novais, Ricardo Costa, J. Corchado, J. Neves, 2010, Advances in Intelligent and Soft Computing)
Working-memory optimization and hierarchical subgoal memory chunking (scalable context management for long-horizon tasks)
Their common thread: they attribute the long-horizon capability bottleneck to poor working-memory/context management, and use chunking/summarization/subgoals or end-to-end memory-reasoning-policy coupling to reduce redundant input, improve the use of historical information, and stabilize decisions, targeting long-horizon sequential tasks and partially observable environments in particular.
- HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model (Mengkang Hu, Tianxing Chen, Qiguang Chen, Yi Mu, Wenqi Shao, Ping Luo, 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))
- Memory Matters: The Need to Improve Long-Term Memory in LLM-Agents (Kostas Hatalis, Despina Christou, Joshua Myers, Steven Jones, Keith A. Lambert, Adam Amos-Binks, Zohreh Dannenhauer, Dustin Dannenhauer, 2024, Proceedings of the AAAI Symposium Series)
- Cognitive Modeling for Long-Horizon Agent Learning via Integrated Long-Term Memory and Reasoning (Linghao Yang, Tian Guan, Yumeng Ma, Zhongkang Li, Zhou Fang, Feiyang Wang, 2026, … Networks and Machine …)
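The subgoal-as-memory-chunk idea (HiAgent and related work above) can be sketched as follows. The one-line summarizer is a placeholder assumption: the actual systems ask the LLM to summarize a finished subgoal's trace.

```python
class SubgoalWorkingMemory:
    """Subgoal-chunked working memory: only the active subgoal's raw
    action-observation pairs are kept verbatim; finished subgoals collapse
    to a one-line summary."""

    def __init__(self):
        self.completed = []        # (subgoal, summary) pairs
        self.active_subgoal = None
        self.active_trace = []     # raw (action, observation) pairs

    def start_subgoal(self, subgoal):
        # Opening a new subgoal seals the previous one first.
        if self.active_subgoal is not None:
            self.finish_subgoal()
        self.active_subgoal = subgoal

    def record(self, action, observation):
        self.active_trace.append((action, observation))

    def finish_subgoal(self):
        # Stand-in summarizer: keep only the final observation.
        summary = self.active_trace[-1][1] if self.active_trace else "no-op"
        self.completed.append((self.active_subgoal, summary))
        self.active_subgoal, self.active_trace = None, []

    def prompt_context(self):
        # Compact context: summaries of done subgoals + the live trace only,
        # instead of the entire action-observation history.
        done = [f"done: {g} -> {s}" for g, s in self.completed]
        live = [f"{a} => {o}" for a, o in self.active_trace]
        return done + [f"current: {self.active_subgoal}"] + live
```

The point of the structure is that context size grows with the number of subgoals, not with the number of raw steps.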
Episodic / event-based / neuro-inspired continual memory and value governance (memory consolidation and alignment)
This group emphasizes event-based, episodic storage of experience; utility-based consolidation, decay, and governance; and memory fusion and alignment control across sessions or in dynamic systems. The shared goal is to let agents accumulate experience continually and stay robust across multi-session or multi-stage behavior, while using explicit mechanisms to constrain output quality and consistency.
- Enhancing intelligent agents with episodic memory (Andrew Nuxoll, John E. Laird, 2012, Cognitive Systems Research)
- Value-Driven Memory-Augmented Generation for Agentic LLMs: Towards Structured and Adaptive Knowledge Utilization (C. H. Tan, 2026, Proceedings of the AAAI Conference on Artificial Intelligence)
- An Intelligent Multi-Agent Memory Assistant (Ângelo Costa, P. Novais, 2011, Communications in Medical and Care Compunetics)
- Designing Synthetic Memory Systems for Supporting Autonomous Embodied Agent Behaviour (Christopher E. Peters, 2006, ROMAN 2006 - The 15th IEEE International Symposium on Robot and Human Interactive Communication)
- AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation (Ming Wang, Peidong Wang, L. Wu, Xiaocui Yang, Daling Wang, Shi Feng, Yuxin Chen, Bixuan Wang, Yifei Zhang, 2025, Findings of the Association for Computational Linguistics: ACL 2025)
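The utility-driven consolidation/decay theme above can be illustrated with an exponential-retention rule in which utility slows forgetting. The decay law, the coefficients, and the threshold are illustrative assumptions, not any paper's calibration.

```python
import math

def consolidate(episodes, now, keep_threshold=0.35):
    """Utility-gated consolidation sketch. Each episode is a dict with
    'content', 't' (creation step), and 'utility' in [0, 1]. Retention decays
    exponentially with age, but higher-utility episodes decay more slowly;
    episodes below the threshold are forgotten, survivors are reinforced."""
    kept = []
    for ep in episodes:
        strength = 1.0 + 4.0 * ep["utility"]           # utility slows decay
        score = math.exp(-(now - ep["t"]) / strength)  # exponential retention
        if score >= keep_threshold:
            # Rehearsal effect: surviving a consolidation pass raises utility.
            kept.append({**ep, "utility": min(1.0, ep["utility"] + 0.05)})
    return kept
```

A value-governance layer of the kind described above would sit downstream of this, vetting candidate outputs before the surviving memories influence final responses.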
Graph / binary / structured memory formats and auditable, computable memory substrates
These works treat memory as a computable data structure or storage medium (e.g., a binary graph format, an agent-native embedded database, a cache-and-prune memory bank for evidence retrieval), focusing on efficient navigation, persistence, auditability, and low-latency retrieval, moving memory systems from "text concatenation" toward designs that are structured, engineerable, controllable, and traceable.
- AgenticMemory: A Binary Graph Format for Persistent, Portable, and Navigable AI Agent Memory (Omoshola S. Owolabi, 2026)
- AEVUM: An Agent-Native Persistent Memory Database System with Autonomous Data Management and Multi-Stage Compression (JR Maligireddy, 2026, Authorea Preprints)
- Agentic memory-augmented retrieval and evidence grounding for medical question-answering tasks (Shuyue Jia, Subhrangshu Bit, Varuna Jasodanand, Yi Liu, Vijaya B. Kolachalama, 2026, International Journal of Medical Informatics)
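The "retrieval as graph navigation, not text search" framing (AgenticMemory above) can be sketched with a plain adjacency-list store. The schema and relation names are illustrative; the real systems add binary encoding, compression, and audit logs on top.

```python
class GraphMemory:
    """Minimal memory graph: retrieval is navigation from a seed node along
    typed edges rather than full-text search over concatenated history."""

    def __init__(self):
        self.nodes = {}   # node_id -> payload dict
        self.edges = {}   # node_id -> list of (relation, node_id)

    def add_node(self, node_id, payload):
        self.nodes[node_id] = payload
        self.edges.setdefault(node_id, [])

    def add_edge(self, src, relation, dst):
        self.edges[src].append((relation, dst))

    def navigate(self, start, relation, depth=1):
        # Follow edges of one relation type outward from a seed node,
        # up to `depth` hops, and return the payloads reached.
        frontier, seen = {start}, set()
        for _ in range(depth):
            nxt = set()
            for node in frontier:
                for rel, dst in self.edges.get(node, []):
                    if rel == relation and dst not in seen:
                        nxt.add(dst)
                        seen.add(dst)
            frontier = nxt
        return [self.nodes[n] for n in sorted(seen)]
```

Because every read is an explicit traversal over typed edges, each retrieval can be logged and replayed, which is what makes this class of designs auditable.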
Multi-agent shared / corporate / distributed memory management and protocols (including scheduling and real-time concerns)
Common thread: in multi-agent or organizational-collaboration settings, these works handle the creation, dissemination, reuse, and consistency of shared memory, and further discuss real-time protocols and efficiency optimizations for distributed storage, retrieval, update, and deletion (e.g., event-driven designs, intent-indexed graphs, negotiation and offloading), emphasizing system-level scalability and low latency.
- A multi-agent system for building project memories to facilitate the design process (D. Monticolo, Vincent Hilaire, S. Gomes, A. Koukam, 2008, Integrated Computer-Aided Engineering)
- Multi-Agent Corporate Memory Management System (Fabien Gandon, Agostino Poggi, Giovanni Rimassa, Paola Turci, 2002, Applied Artificial Intelligence)
- An Intelligent Multi-Agent Memory Assistant (Ângelo Costa, P. Novais, 2011, Communications in Medical and Care Compunetics)
- MemIndex: Agentic Event-based Distributed Memory Management for Multi-agent Systems (Alaa Saleh, Sasu Tarkoma, Anders Lindgren, Praveen Kumar Donta, S. Dustdar, Susanna Pirttikangas, Lauri Lovén, 2025, ACM Transactions on Autonomous and Adaptive Systems)
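The intent-indexed organization that MemIndex describes can be gestured at with a toy bipartite mapping between intents and memory keys. This is purely illustrative: the actual system adds negotiation, offloading, and physical distribution across agents.

```python
class IntentIndexedMemory:
    """Toy intent-indexed shared store: a bipartite mapping between intents
    and memory keys lets agents resolve reads and deletes through intents
    instead of scanning the whole store."""

    def __init__(self):
        self.store = {}            # key -> value
        self.intent_to_keys = {}   # intent -> set of keys

    def put(self, key, value, intents):
        # Storage: index the entry under every intent it serves.
        self.store[key] = value
        for intent in intents:
            self.intent_to_keys.setdefault(intent, set()).add(key)

    def get_by_intent(self, intent):
        # Retrieval: only the keys registered for this intent are touched.
        return {k: self.store[k] for k in self.intent_to_keys.get(intent, set())}

    def delete(self, key):
        # Deletion must keep both sides of the bipartite index consistent.
        self.store.pop(key, None)
        for keys in self.intent_to_keys.values():
            keys.discard(key)
```

The four operations the MemIndex evaluation measures (storage, retrieval, update, deletion) all reduce to index lookups here, which is where the latency and CPU savings reported above would come from.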
Embodied and multimodal memory: long-term robot navigation, policy reuse, and scene-dependent memory
The common thread here is scene-history-dependent memory for embodied agents (robots) in long-horizon, partially observable environments. Multimodal memory (VLM descriptions, visual-action history) combined with retrievable policy banks or attention-based memory policies improves long-term task success rates and generalization.
- Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks (Kuan Fang, Alexander Toshev, Li Fei-Fei, S. Savarese, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models (Gabriel Sarch, Yue Wu, Michael J. Tarr, Katerina Fragkiadaki, 2023, Findings of the Association for Computational Linguistics: EMNLP 2023)
- Retrieval-Augmented Embodied Agents (Yichen Zhu, Zhicai Ou, Xiaofeng Mou, Jian Tang, 2024, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
Software-engineering deployment of memory systems: production-grade implementation and execution loops (RAMP / vector-retrieval engineering)
This paper places the memory system inside a production-grade engineering architecture, built around a Reflect-Act-Memory-Plan execution loop, low-latency vector retrieval (e.g., Qdrant), and strictly typed data contracts. It addresses latency, state synchronization, and reproducibility, illustrating a practical engineering path for memory systems in real applications.
- High-Performance Implementation of Multi-Agent Web Systems: Integrating Vector Memory with Strictly Typed React Architectures (Mykhailo Nykoliuk, 2025, Universal Library of Engineering Technology)
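The shape of a Reflect-Act-Memory-Plan cycle can be sketched in a few lines, with plain callables standing in for the model-, tool-, and retrieval-backed components of the production stack described above; the signatures are assumptions for illustration.

```python
def ramp_step(state, memory, reflect, act, plan):
    """One turn of a Reflect-Act-Memory-Plan loop."""
    note = reflect(state, memory)                # Reflect: read state + recalled memory
    action, observation = act(state, note)       # Act: execute and observe the result
    memory.append((state, action, observation))  # Memory: persist the experience
    return plan(observation, memory), memory     # Plan: choose the next state
```

For example, with `reflect = lambda s, m: len(m)`, `act = lambda s, n: ("inc", s + 1)`, and `plan = lambda obs, m: obs`, three iterations from state 0 end with state 3 and three records in memory. In the production setting, `reflect` would query the vector store and the typed data contract would constrain what `memory.append` is allowed to persist.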
Overall, the literature falls into seven to eight main threads: (1) hierarchical/OS-style memory-management architectures; (2) RAG and external memory banks with retrieve-rerank-generate and adaptive updating; (3) user-facing long-term personalized memory organization and usability design; (4) chunked/subgoal-based context management for working memory and long-horizon tasks; (5) episodic/event-based continual memory with utility-driven consolidation and value governance; (6) structured/graph/binary/agent-native persistent memory substrates; (7) collaboration protocols and real-time concerns for multi-agent shared and distributed memory; and (8) scene-dependent memory and robot policy reuse for embodied tasks, plus a small number of implementation studies emphasizing production-grade engineering.
A total of 34 related papers.
Large Language Models (LLMs) face a crucial challenge from fixed context windows and inadequate memory management, leading to a severe shortage of long-term memory capabilities and limited personalization in the interactive experience with AI agents. To overcome this challenge, we innovatively propose a Memory Operating System, i.e., MemoryOS, to achieve comprehensive and efficient memory management for AI agents. Inspired by the memory management principles in operating systems, MemoryOS designs a hierarchical storage architecture and consists of four key modules: Memory Storage, Updating, Retrieval, and Generation. Specifically, the architecture comprises three levels of storage units: short-term memory, mid-term memory, and long-term personal memory. Key operations within MemoryOS include dynamic updates between storage units: short-term to mid-term updates follow a dialogue-chain-based FIFO principle, while mid-term to long-term updates use a segmented page organization strategy. Extensive experiments on the LoCoMo benchmark show an average improvement of 49.11% on F1 and 46.18% on BLEU-1 over the baselines on GPT-4o-mini, showing contextual coherence and personalized memory retention in long conversations.
Interactive applications are latency-sensitive systems that enable dynamic responses to user inputs in domains such as robotics, industrial automation, and autonomous control. These applications require efficient application protocols for communication, with the pub/sub model being one of the most promising approaches. However, existing pub/sub systems are architecturally constrained, particularly by limited memory capacity and inefficiencies in dynamic environments. Addressing these challenges requires effective distributed memory management, yet this aspect has received limited attention in existing research. This paper addresses the gap by proposing MemIndex, an adaptive and autonomous distributed memory-management framework with an intent-indexed bipartite graph architecture. It is designed for LLM-based multi-agent pub/sub systems, enabling agents to autonomously negotiate memory operations in real time through dynamic index spaces for efficient reasoning. We evaluate MemIndex using diverse models against two baselines. Experimental results show MemIndex outperforms both baselines across storage, retrieval, update, and deletion operations, achieving average reductions of about 34% and 56% in elapsed time, 57% and 75% in CPU utilization, and 23% and 76% in memory usage. Scalability tests further demonstrate that MemIndex maintains low end-to-end delay as submissions and agents grow, confirming that its negotiation-driven offloading enables efficient distributed memory management in interactive applications.
… -agent system for the management of a corporate memory. The innovative aspect of the system … were generally used separately until now: agent technology, knowledge modeling, XML …
… Section 4 presents the structure of our project memories model. We describe a multi-agent system used to build these project memories in section 5. Finally Section 6 describes the …
Constrained by the cost and ethical concerns of involving real seekers in AI-driven mental health, researchers develop LLM-based conversational agents (CAs) with tailored configurations, such as profiles, symptoms, and scenarios, to simulate seekers. While these efforts advance AI in mental health, achieving more realistic seeker simulation remains hindered by two key challenges: dynamic evolution and multi-session memory. Seekers' mental states often fluctuate during counseling, which typically spans multiple sessions. To address this, we propose AnnaAgent, an emotional and cognitive dynamic agent system equipped with tertiary memory. AnnaAgent incorporates an emotion modulator and a complaint elicitor trained on real counseling dialogues, enabling dynamic control of the simulator's configurations. Additionally, its tertiary memory mechanism effectively integrates short-term and long-term memory across sessions. Evaluation results, both automated and manual, demonstrate that AnnaAgent achieves more realistic seeker simulation in psychological counseling compared to existing baselines. The ethically reviewed and screened code can be found on https://github.com/sci-m-wang/AnnaAgent.
… present a single memory system, since often such a system will depend on … memory systems that are compatible in a general manner with computational attention and emotion systems. …
Large Language Models (LLMs) are leading a technological revolution. This gives agents based on LLMs renewed vitality, and multi-agent collaboration is showing the potential to foster new forms of intelligence. However, current multi-agent systems face two major challenges: the first is the issue of resource coordination and scheduling within multi-agent systems, and the second is the limitation of the context window in large models, which hinders the practical application of agents in long-term conversational scenarios, urgently requiring improvements in memory capabilities. To address these challenges, we propose a collaborative scheduling strategy and hierarchical memory model for LLM-based multi-agent systems inspired by operating systems. First, we design a time-sharing scheduling strategy, analogous to process scheduling in operating systems, which divides the resource usage cycle into finer-grained single-step workflows, allocating independent resource windows to different agents to reduce resource contention and conflicts. Second, we introduce a hierarchical memory model based on the multi-level cache architecture of operating systems, segmenting the agents' memory into core memory, main memory, and vague memory areas, thereby significantly improving memory retention and retrieval efficiency in LLM-based agents when handling complex tasks. Experimental results demonstrate that our proposed method achieves efficient resource allocation in multi-agent systems while significantly enhancing the memory capabilities of agents and overall system performance.
… reference system for modelling spatial prepositions, the “state transition semantics” system … language sentences, the agent’s episodic memory system and associated reflective demons, …
… can fulfil the development requisites of Memory Assistants is the Multi-Agent System one [12]. The … Generally, agent platforms support the interaction between the agents by means of …
… On top of this platform a Personal Memory Assistant and a Social Enabler where developed … a distributed system approach is adequate for developing multi-agent systems for healthcare …
AI agents have the potential to provide long-term personalized assistance to users, and this relies on effective long-term memory. While memory in agents has been extensively covered by prior work, there is little understanding of users’ expectations and practices with agent memory. As a preliminary investigation, we interviewed people who use AI tools with memory and analyzed online discussion posts of people’s experiences with such tools. We found that users often have incomplete mental models of how agents remember and recall information, and how their memories affect their behaviours. Users generally consider agents’ memories as belonging to different categories along a hierarchy from more generalized knowledge to more specific knowledge about the user or task. Users often desire the system’s memories to be cleanly organized by these categories. These findings reveal opportunities to design agent memory mechanisms to organize and control access to memories based on users’ task-based needs.
… that agent memory is a graph navigation problem, not a text search problem. When an agent … We present AgenticMemory, a binary graph format purpose-built for AI agent memory. The …
… and retrieval and we have used it to create agents for a variety of tasks. Our research suggests that episodic memory enhances the performance of AI agents and may be a “missing link” …
While Large Language Model (LLM) based multi-agent systems (MAS) show promise, their capabilities are constrained by simplistic, monolithic memory models. To address this, we propose a novel, four-tier hierarchical memory architecture inspired by cognitive science. Our architecture decomposes memory into an L1 Active Context, L2 Working Memory for significant facts, L3 Episodic Memory for experiences, and L4 Semantic Memory for distilled knowledge. Our core innovation lies in the autonomous lifecycle engines that govern information flow between these tiers, including a CIAR scoring model for significance-based Promotion and a multi-stage Consolidation/Distillation pipeline for long-term learning. This principled, production-ready design enables more robust and auditable reasoning. We validate the performance of the underlying storage layer through a micro-benchmark, demonstrating the high reliability and throughput required for mission-critical agentic systems.
… Memory Exchange Protocol evaluated through the lens of memory fabric. It also addresses work on dialogue and agent memory, multi-agent shared-memory designs, privacy and …
… Turing Machines and Key-Value Memory Networks to recent systems like AutoGen, … memory fabric. It also addresses work on dialogue and agent memory, multi-agent shared-memory …
This paper presents an approach to design a multi-agent system managing a corporate memory in the form of a distributed semantic web and describes the resulting architecture. The system was designed during the CoMMA European project (Corporate Memory Management through Agents) and aims at helping users in the management of a corporate memory, facilitating the creation, dissemination, transmission and reuse of knowledge in an organisation. The implementation integrated several emerging technologies: multi-agents system technology (using the JADE FIPA-compliant platform), knowledge modelling and XML technology for information retrieval (using the CORESE semantic search engine) and machine learning techniques. Here, we describe the agent roles and interactions, we explain the design rationale for the agent societies and we discuss the configuration and implementation issues.
In this paper, we provide a review of the current efforts to develop LLM agents, which are autonomous agents that leverage large language models. We examine the memory management approaches used in these agents. One crucial aspect of these agents is their long-term memory, which is often implemented using vector databases. We describe how vector databases are utilized to store and retrieve information in LLM agents. Moreover we highlight open problems, such as the separation of different types of memories and the management of memory over the agent's lifetime. Lastly, we propose several topics for future research to address these challenges and further enhance the capabilities of LLM agents, including the use of metadata in procedural and semantic memory and the integration of external knowledge sources with vector databases.
This study focuses on the tendency of agents in long-horizon sequential tasks to rely on short-term states and to underutilize historical information, and proposes a cognitive modeling and learning framework with long-term memory and reasoning capabilities. The framework provides a unified cognitive description of the agent's decision process. It introduces a structured long-term memory mechanism to support continuous storage and selective updating of cross-temporal key information. On this basis, a memory retrieval-driven reasoning module is constructed so that experience can explicitly participate in the formation of current decision logic. To address the separation between memory and decision making in conventional policy models, the framework tightly couples perception representation, memory management, reasoning processes, and policy generation into an end-to-end cognitive loop. This design strengthens goal consistency and behavioral stability in long-horizon interactive environments. Comparative evaluations in open source interactive task settings demonstrate consistent advantages in task completion quality, decision efficiency, and long-term information utilization. The results indicate that the proposed cognitive modeling framework effectively mitigates decision difficulties caused by long-range dependencies and partial observability. Overall, the study shows that integrating long-term memory and reasoning within a unified learning framework is an important approach for improving sustained decision-making capability in complex environments.
Many robotic applications require the agent to perform long-horizon tasks in partially observable environments. In such applications, decision making at any step can depend on observations received far in the past. Hence, being able to properly memorize and utilize the long-term history is crucial. In this work, we propose a novel memory-based policy, named Scene Memory Transformer (SMT). The proposed policy embeds and adds each observation to a memory and uses the attention mechanism to exploit spatio-temporal dependencies. This model is generic and can be efficiently trained with reinforcement learning over long episodes. On a range of visual navigation tasks, SMT demonstrates superior performance to existing reactive and memory-based policies by a margin.
Large Language Model (LLM)-based agents exhibit significant potential across various domains, operating as interactive systems that process environmental observations to generate executable actions for target tasks. The effectiveness of these agents is significantly influenced by their memory mechanism, which records historical experiences as sequences of action-observation pairs. We categorize memory into two types: cross-trial memory, accumulated across multiple attempts, and in-trial memory (working memory), accumulated within a single attempt. While considerable research has optimized performance through cross-trial memory, the enhancement of agent performance through improved working memory utilization remains underexplored. Instead, existing approaches often involve directly inputting entire historical action-observation pairs into LLMs, leading to redundancy in long-horizon tasks. Inspired by human problem-solving strategies, this paper introduces HiAgent, a framework that leverages subgoals as memory chunks to manage the working memory of LLM-based agents hierarchically. Specifically, HiAgent prompts LLMs to formulate subgoals before generating executable actions and enables LLMs to decide proactively to replace previous subgoals with summarized observations, retaining only the action-observation pairs relevant to the current subgoal. Experimental results across five long-horizon tasks demonstrate that HiAgent achieves a twofold increase in success rate and reduces the average number of steps required by 3.8. Additionally, our analysis shows that HiAgent consistently improves performance across various steps, highlighting its robustness and generalizability.
… We try to clarify in this section the de nition and use of basic notions like the ones of reactive and cognitive agents, or short-term and long-term memory. Some emphasis is put on the …
In the age of mobile internet, user data, often referred to as memories, is continuously generated on personal devices. Effectively managing and utilizing this data to deliver services to users is a compelling research topic. In this paper, we introduce a novel task of crafting personalized agents powered by large language models (LLMs), which utilize a user's smartphone memories to enhance downstream applications with advanced LLM capabilities. To achieve this goal, we introduce EMG-RAG, a solution that combines Retrieval-Augmented Generation (RAG) techniques with an Editable Memory Graph (EMG). This approach is further optimized using Reinforcement Learning to address three distinct challenges: data collection, editability, and selectability. Extensive experiments on a real-world dataset validate the effectiveness of EMG-RAG, achieving an improvement of approximately 10% over the best existing approach. Additionally, the personalized agents have been transferred into a real smartphone AI assistant, which leads to enhanced usability.
Retrieval-Augmented Generation (RAG), by integrating non-parametric knowledge from external knowledge bases into models, has emerged as a promising approach to enhancing response accuracy while mitigating factual errors and hallucinations. This method has been widely applied in tasks such as Question Answering (QA). However, existing RAG methods struggle with open-domain QA tasks because they perform independent retrieval operations and directly incorporate the retrieved information into generation without maintaining a summarizing memory or using adaptive retrieval strategies, leading to noise from redundant information and insufficient information integration. To address these challenges, we propose Adaptive memory-based optimization for enhanced RAG (Amber) for open-domain QA tasks, which comprises an Agent-based Memory Updater, an Adaptive Information Collector, and a Multi-granular Content Filter, working together within an iterative memory updating paradigm. Specifically, Amber integrates and optimizes the language model's memory through a multi-agent collaborative approach, ensuring comprehensive knowledge integration from previous retrieval steps. It dynamically adjusts retrieval queries and decides when to stop retrieval based on the accumulated knowledge, enhancing retrieval efficiency and effectiveness. Additionally, it reduces noise by filtering irrelevant content at multiple levels, retaining essential information to improve overall model performance. We conduct extensive experiments on several open-domain QA datasets, and the results demonstrate the superiority and effectiveness of our method and its components. The source code is available.
Embodied agents operating in complex and uncertain environments face considerable challenges. While some advanced agents handle complex manipulation tasks with proficiency, their success often hinges on extensive training data to develop their capabilities. In contrast, humans typically rely on recalling past experiences and analogous situations to solve new problems. Aiming to emulate this human approach in robotics, we introduce the Retrieval-Augmented Embodied Agent (RAEA). This innovative system equips robots with a form of shared memory, significantly enhancing their performance. Our approach integrates a policy retriever, allowing robots to access relevant strategies from an external policy memory bank based on multi-modal inputs. Additionally, a policy generator is employed to assimilate these strategies into the learning process, enabling robots to formulate effective responses to tasks. Extensive testing of RAEA in both simulated and real-world scenarios demonstrates its superior performance over traditional methods, representing a major leap forward in robotic technology.
… Our system features a lightweight retrieval-augmented generation pipeline for efficient evidence retrieval and reranking, coupled with a cache-and-prune memory bank, enabling …
… memory augmentation approach, MemInsight, to enhance semantic data representation and retrieval … • We design and apply memory retrieval methods that leverage the generated …
Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by retrieving relevant memories from an external database. However, existing RAG methods typically organize all memories in a whole database, potentially limiting focus on crucial memories and introducing noise. In this paper, we introduce a multiple partition paradigm for RAG (called M-RAG), where each database partition serves as a basic unit for RAG execution. Based on this paradigm, we propose a novel framework that leverages LLMs with Multi-Agent Reinforcement Learning to optimize different language generation tasks explicitly. Through comprehensive experiments conducted on seven datasets, spanning three language generation tasks and involving three distinct language model architectures, we confirm that M-RAG consistently outperforms various baseline methods, achieving improvements of 11%, 8%, and 12% for text summarization, machine translation, and dialogue generation, respectively.
With the rise of smart personal devices, service-oriented human-agent interactions have become increasingly prevalent. This trend highlights the need for personalized dialogue assistants that can understand user-specific traits to accurately interpret requirements and tailor responses to individual preferences. However, existing approaches often overlook the complexities of long-term interactions and fail to capture users’ subjective characteristics. To address these gaps, we present PAL-Bench, a new benchmark designed to evaluate the personalization capabilities of service-oriented assistants in long-term user-agent interactions. In the absence of available real-world data, we develop a multi-step LLM-based synthesis pipeline, which is further verified and refined by human annotators. This process yields PAL-Set, the first Chinese dataset comprising multi-session user logs and dialogue histories, which serves as the foundation for PAL-Bench. Furthermore, to improve personalized service-oriented interactions, we propose H2Memory, a hierarchical and heterogeneous memory framework that incorporates retrieval-augmented generation to improve personalized response generation. Comprehensive experiments on both our PAL-Bench and an external dataset demonstrate the effectiveness of the proposed memory framework.
Pre-trained and frozen LLMs can effectively map simple scene re-arrangement instructions to programs over a robot's visuomotor functions through appropriate few-shot example prompting. To parse open-domain natural language and adapt to a user's idiosyncratic procedures, not known during prompt engineering time, fixed prompts fall short. In this paper, we introduce HELPER, an embodied agent equipped with an external memory of language-program pairs that parses free-form human-robot dialogue into action programs through retrieval-augmented LLM prompting: relevant memories are retrieved based on the current dialogue, instruction, correction or VLM description, and used as in-context prompt examples for LLM querying. The memory is expanded during deployment to include pairs of user's language and action plans, to assist future inferences and personalize them to the user's language and routines. HELPER sets a new state-of-the-art in the TEACh benchmark in both Execution from Dialog History (EDH) and Trajectory from Dialogue (TfD), with 1.7x improvement over the previous SOTA for TfD. Our models, code and video results can be found in our project's website: https://helper-agent-llm.github.io.
Large Language Models (LLMs) have demonstrated remarkable capabilities in reasoning, yet their efficacy is constrained by a fundamental memory limitation: a static context window that resets with each interaction. This prevents them from accumulating experience and adapting to dynamic, long-term tasks. To address the limitations of long-term memory in agentic LLMs, this work introduces a neuro-inspired framework with two key contributions. First, we propose ARTEM (Agentic Retrieval with Temporal-Episodic Memory), a system that organizes experiences into structured events and manages utility-based memory consolidation. Second, we extend this framework with a distinct governance component, Value-driven ARTEM, that validates candidate outputs against core principles before finalization. Together, these components equip LLM agents with continual learning, adaptive reasoning, and robust value-aligned decision-making. Looking forward, we outline future directions including dynamic memory adaptation, memory decay mechanisms, and applications in interactive multi-agent environments.
Current large language model (LLM) agents face the challenges of high inference cost and low decision quality when dealing with complex tasks, and are especially deficient in maintaining context coherence during long tasks. This research presents an innovative vector-storage long-term memory mechanism (VIMBank) to enhance the long-term context retention and task execution efficiency of LLM agents by storing and retrieving historical interaction data through a vector database. VIMBank uses a dynamic memory updating strategy and the Ebbinghaus forgetting curve to manage agent memory efficiently: reinforcing critical information, forgetting unimportant data, and optimizing storage and reasoning costs. The experimental results show that VIMBank significantly improves the decision quality and efficiency of LLM agents in multi-tasking scenarios and reduces computational cost. Compared with different agents, the task decision success rate increases by 10% to 20%, and the reasoning cost is reduced by about 23%, providing a theoretical basis and practical support for the future development of agents with long-term memory and adaptive learning ability.
This paper addresses the software engineering challenges of integrating autonomous agents into production-grade web applications. While traditional implementations suffer from high latency and state synchronization issues, this study presents a full-stack solution based on TypeScript and React 19 Server Components. This paper details the implementation of a RAMP (Reflect, Act, Memory, Plan) execution loop at the code level, using Qdrant for low-latency (<100 ms) vector retrieval and Next.js for server-side orchestration. A key engineering contribution is the development of a strictly typed data contract that synchronizes server-side agent reasoning with client-side state management (via TanStack Query). Experimental results confirm that this specific stack architecture significantly reduces response times and prevents runtime type errors, offering a reproducible pattern for building scalable, high-load web platforms.
… records, vector memory, and audit logs into uncoordinated silos. We present AEVUM, an agent-native embedded database system that inverts this model. In AEVUM the AI agent owns, …