agent rag

Agentic RAG总体范式：与传统RAG的差异、定位与能力机制

对比并界定传统RAG与Agentic RAG的差异与能力定位，回答“为何/如何通过自治、迭代与编排代理能力嵌入RAG以提升适应性与性能”，属于范式框架与总体定位类工作。

Traditional rag vs. agentic rag: A comparative study of retrieval-augmented systems（F Neha, D Bhati, 2025, Authorea Preprints）
Agentic RAG Systems for Improving Adaptability and Performance in AI-Driven Information Retrieval（Ajit Singh, 2025, Available at SSRN 5188363）

层级/协同多代理架构：任务分解—多源检索—融合决策

以多智能体协作为核心：通过任务分解、分模态/多源检索与多路结果融合（投票/一致性/专家细化等）提升答案质量；其中实体解析方向体现“任务专用代理”如何协同完成复杂认知步骤。

HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation（Pei Liu, Xin Liu, Ruoyu Yao, Junming Liu, Siyuan Meng, Ding Wang, Jun Ma, 2025, Proceedings of the 33rd ACM International Conference on Multimedia）
Multi-agent Retrieval-Augmented Generation for Enhancing Answer Generation and Knowledge Retrieval（Deepak Kumar, Bhavesh Jain, 2025, Lecture Notes in Computer Science）
Multi-Agent RAG Framework for Entity Resolution: Advancing Beyond Single-LLM Approaches with Specialized Agent Coordination（A. T. Muhammad, Muzakkiruddin Ahmed Mohammed, Mariofanna Milanova, John Talburt, Mert Can Cakmak, 2025, Computers）

迭代检索与检索轨迹建模：自适应重检索、上下文压缩与多跳推进

聚焦“检索过程自身的智能化”：通过迭代检索、检索轨迹建模与检索压缩（摘要/计划推进、上下文管理）来优化多轮重检索与多跳信息获取；强调检索—生成耦合中的过程控制。

Adaptive iterative retrieval for enhanced retrieval-augmented generation（Wenhan Han, Xiao Xiao, Yaohang Li, Jun Wang, Mykola Pechenizkiy, Meng Fang, 2025, Neurocomputing）
Learning to Retrieve Iteratively for In-Context Learning（Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme, 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing）
Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering with an Iterative Approach（Zhouyu Jiang, Mengshu Sun, Lei Liang, Zhiqiang Zhang, 2024, Companion Proceedings of the ACM on Web Conference 2025）
MAIN-RAG: Multi-Agent Filtering Retrieval-Augmented Generation（Chia-Yuan Chang, Zhimeng Jiang, Vineeth Rakesh, Menghai Pan, Chin‐Chia Michael Yeh, Guanchu Wang, Mingzhi Hu, Zhichao Xu, Yan Zheng, Mahashweta Das, Na Zou, 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)）

面向事实核查/安全合规的闭环代理：检索—验证—停止条件

在高风险/高可靠需求场景中构建闭环代理流程：以证据检索为前提，引入验证与停止条件（置信度/需求推导驱动），把“检索相关性+核查/安全推导”形成受控闭环。

FIRE: Fact-checking with Iterative Retrieval and Verification（Zhuohan Xie, Rui Xing, Yuxia Wang, Jiahui Geng, Iqbal Hasan, Dhruv Sahnan, Iryna Gurevych, Preslav Nakov, 2025, Findings of the Association for Computational Linguistics: NAACL 2025）
Towards Automated Safety Requirements Derivation Using Agent-based RAG（Balahari Vignesh Balu, Florian Geißler, Francesco Carella, João-Vitor Zacchi, Josef Jiru, Núria Mata, R. Stollé, 2025, Proceedings of the AAAI Symposium Series）

领域应用型多代理RAG：临床决策、能耗/对话决策与危害场景

将Agentic RAG落地到具体业务的多代理系统：医疗/临床侧强调自查询与检索质量提升；决策对话侧强调角色化多代理支持理解—检索—分析—展示；危害/风险场景侧构建面向用户的证据整合与生成流程。

Enhancing Clinical Decision Support with Adaptive Iterative Self-Query Retrieval for Retrieval-Augmented Large Language Models（Srinivagasam Prabha, C. A. Gomez-Cabello, S. A. Haider, Ariana Genovese, Maissa Trabilsy, Nadia G. Wood, Sanjay Bagaria, Cui Tao, AJ Forte, 2025, Bioengineering）
Multi-Agent RAG Chatbot Architecture for Decision Support in Net-Zero Emission Energy Systems（Gihan Gamage, Nishan Mills, Daswin De Silva, Milos Manic, Harsha Moraliyage, Andrew Jennings, D. Alahakoon, 2024, 2024 IEEE International Conference on Industrial Technology (ICIT)）
MARSHA: multi-agent RAG system for hazard adaptation（Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, J. Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. R. Alexander, Robert B. Ross, Yan Feng, L. Levy, Weijie J. Su, C. J. Taylor, 2025, npj Climate Action）

移动端/跨应用长时任务的双级检索增强：高层策略—低层执行

面向移动端/真实世界长时、多应用、UI执行型任务：区分高层规划与低层操作所需知识，并分别进行检索增强（如Manager/Operator双级思路），解决长时任务中的知识获取与执行协同问题。

Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation（Yuxiang Zhou, Jichang Li, Yanhao Zhang, Haonan Lu, Guanbin Li, 2026, Proceedings of the AAAI Conference on Artificial Intelligence）

多模态Agentic RAG与ReAct式多跳规划：跨模态证据整合与关系推理

把RAG扩展到多模态证据与多跳推理：对图像/表格/多模态关系等非纯文本信息进行联合检索与证据整合，并通过代理逻辑（协作/规划/推理范式）生成更完整、更可信的描述或答案。

A Multimodal Retrieval-Augmented Generation System with ReAct Agent Logic for Multi-Hop Reasoning（Denys Yuvzhenko, Valentyn Chymshyr, V. Shymkovych, Kyrylo Znova, Grzegorz Nowakowski, Sergii Telenyk, 2025, Information, Computing and Intelligent systems）
Relational Reasoning Image Captioning Via Multi-Agent Retrieval-Augmented Generation（Aiwen Jiang, Duan Wang, Chao Peng, Mingwen Wang, 2025, Knowledge-Based Systems）
CollEX – A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections（Florian Schneider, Narges Baba Ahmadi, Niloufar Baba Ahmadi, Iris Vogel, Martin Semmann, Chris Biemann, 2025, Proceedings of the 1st Workshop on Multimodal Augmented Generation via Multimodal Retrieval (MAGMaR 2025)）

可解释性评测与RAG交互动力学分析（如RAGTrace类）

面向可解释与可诊断：提供对检索—生成动力学的可追踪评测，分析系统为何失败、检索相关性与生成保真度如何随交互演化，从而支持迭代改进。

RAGTrace: Understanding and Refining Retrieval-Generation Dynamics in Retrieval-Augmented Generation（Sizhe Cheng, Jiaping Li, Huanchen Wang, Yuxin Ma, 2025, Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology）

动态/通用数据源检索：Agent-based Universal RAG（AU-RAG）

解决动态数据源与通用检索访问适配：通过代理结合描述性元数据实现对不断变化的数据源的学习式访问与动态检索，而非依赖一次性预编码索引。

AU-RAG: Agent-based Universal Retrieval Augmented Generation（Jisoo Jang, Wen-Syan Li, 2024, Proceedings of the 2024 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region）

工具/流程编排型Agent ReAct/RAG：Tool Retrieval与复杂任务工作流

强调工具链/流程编排：将检索作为工具的一部分融入复杂任务执行流程（ReAct/tool retrieval逻辑），突出工程实现层面的流程与推理结合方式，而非仅做检索相关性或多代理通用架构。

ToolReAGt: Tool Retrieval for LLM-based Complex Task Solution via Retrieval Augmented Generation（N. Braunschweiler, R. Doddipatla, T. Zorila, 2025, Proceedings of the 3rd Workshop on Towards Knowledgeable Foundation Models (KnowFM)）

可信可靠：Critic/可解释/系统化推理驱动的评估纠错与透明化

以可靠性与透明度为核心：使用critic驱动的自我纠错/误差挖掘工作流，以及系统化推理与可解释AI方法论来提升Agentic RAG的可控性与可审计性，而不是单纯提升召回。

RAG-Critic: Leveraging Automated Critic-Guided Agentic Workflow for Retrieval Augmented Generation（Guanting Dong, Jiajie Jin, Xiaoxi Li, Yutao Zhu, Zhicheng Dou, Ji-Rong Wen, 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)）
Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges（Jintao Liang, Sugang, Huifeng Lin, You Wu, Rui Zhao, Ziyue Li, 2025, Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics）
Towards Explainable AI in Agentic Retrieval-Augmented Generation: A Systematic Review（Afnan Habib, Osamah F. Abdulmahmod, Mukhlis Raza, Y. Gu, Murat Aydoğan, M. A. Al-antari, 2025, 2025 9th International Artificial Intelligence and Data Processing Symposium (IDAP)）

可信/受控Agentic RAG工作流：人类/专家参与、访问控制与端到端工程架构

共性在于“可运行的Agentic RAG工作流与可信/受控访问”：通过角色约束、工具/数据访问控制、人类/专家参与与反馈、以及可追溯的工程化架构，把RAG升级为受控、可信的端到端工作流（含行业落地）。其中框架化流水线作为通用实现参照。

Agentic AI Meets RAG: A Framework for Domain-Specific Knowledge Evaluation（V. K, Abirami A M, Uma K V, Viswanath V S, 2025, 2025 IEEE 32nd International Conference on High Performance Computing, Data and Analytics Workshop (HiPCW)）
Agentic RAG for Maritime AIoT: Natural Language Access to Structured Data（Oxana Sachenkova, Melker Andreasson, Dongzhu Tan, Alisa Lincke, 2026, Sensors）
Agentic RAG for Personalized Learning: Design of an AI-Powered Learning Agent Using Open-Source Small Language Models（Shilpi Taneja, S. Biswas, Bhavya Alankar, Harleen Kaur, 2025, Electronic Journal of e-Learning）
CyberRAG: An agentic RAG cyber attack classification and reporting tool（Francesco Blefari, Cristian Cosentino, F. A. Pironti, Angelo Furfaro, Fabrizio Marozzo, 2025, Future Generation Computer Systems）
An Agent-Based RAG Architecture for Intelligent Tourism Assistance: The Valencia Case Study（Andrea Bonetti, Adrián Salcedo-Puche, Joan Vila-Francés, Xaro Benavent-Garcia, Emilio Fernández-Vargas, R. Magdalena-Benedito, E. Soria-Olivas, 2025, Tourism and Hospitality）
Agentic RAG with Human-in-the-Retrieval（Xiwei Xu, Dawen Zhang, Qing Liu, Qinghua Lu, Liming Zhu, 2025, 2025 IEEE 22nd International Conference on Software Architecture Companion (ICSA-C)）

多智能体编排与协作架构（框架/服务化视角）：分工—迭代—系统化实现

以多智能体编排与协作框架/服务化为主线：围绕多代理专业分工、迭代周期与框架化组织RAG流程；既包含面向多代理的迭代检索思想，也包含从RAG到多代理系统的架构/服务视角总结与落地。

KAIR: Knowledge-Aware Iterative Retrieval for Multiagent Systems（Seyoung Song, Siddansh Chawla, Vijay K. Madisetti, 2026, IEEE Transactions on Computational Social Systems）
… multi-agent paradigm for smart urban mobility: Opportunities and challenges for integrating large language models and retrieval-augmented generation with intelligent …（H Xu, J Yuan, A Zhou, G Xu, W Li, XJ Ban, 2025, Urban Human …）
Sustainable Digitalization of Business with Multi-Agent RAG and LLM（Muhammad Arslan, Saba Munawar, Christophe Cruz, 2025, Procedia Computer Science）
An Agent-Based Service Architecture for Smart Greenhouses: Telemetry Analytics and Decision Support with RAG-grounded LLM Agents（S. Pardo-Pina, J. Germer, R. Suay-Cortes, Manuel Platero-Horcajadas, Francisco-Javier Ferrández-Pastor, 2026, Smart Agricultural Technology）
RAG-Enhanced Collaborative LLM Agents for Drug Discovery（Namkyeong Lee, Edward De Brouwer, Ehsan Hajiramezanali, Tommaso Biancalani, Chanyoung Park, Gabriele Scalia, 2026, Proceedings of the AAAI Conference on Artificial Intelligence）
Multi-Agent Retrieval Augmented Generation for Clinical Decision Support: A Systematic Review and Integrative Conceptual Framework（Tarisai Mugambiwa, B. Ndlovu, 2026, Journal of Applied Informatics and Computing）
From RAG to Multi-Agent Systems: A Survey of Modern Approaches in LLM Development（Gustavo Aquino, Nádila da Silva de Azevedo, Leandro Youiti Silva Okimoto, Leonardo Yuto Suzuki Camelo, Hendrio Bragança, Rubens de Andrade Fernandes, André Luiz Printes, Fábio Florença Cardoso, Raimundo Cláudio Souza Gomes, Israel Gondres Torné, 2025, Preprints.org）

面向异构知识形态的专门化RAG：多模态与表格/异构文档支持

针对“异构知识形态”的结构化改造：分别面向多模态（如病理/图像等跨模态证据）与表格/异构文档（保持表结构与SQL执行、多跳推理），强调数据形态约束下检索与证据利用机制的专门化。

Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning（Wenchuan Zhang, Jingru Guo, Hengzhe Zhang, Penghao Zhang, Jie Chen, Shuwan Zhang, Zhang Zhang, Yuhao Yi, Hong Bu, 2026, Proceedings of the AAAI Conference on Artificial Intelligence）
TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning（Xiaohan Yu, Pu Jian, Chong Chen, 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing）

推理范式驱动的RAG可靠性（System化推理组织）

（用于覆盖“系统化推理范式”这一子点）该文强调以System 1/2式推理组织RAG推理过程，体现Agentic RAG中推理范式与可靠生成的关系。

Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges（Jintao Liang, Sugang, Huifeng Lin, You Wu, Rui Zhao, Ziyue Li, 2025, Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics）

危害/风险场景的多代理RAG系统（hazard同主题并列条目）

hazard场景的并列命名重复条目，用于补齐同一主题的同义/重复引用，避免丢失提供的bibkey。

MARSHA: multi-agent RAG system for hazard adaptation（Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, J. Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. R. Alexander, Robert B. Ross, Yan Feng, L. Levy, Weijie J. Su, C. J. Taylor, 2025, npj Climate Action）
From RAG to Multi-Agent Systems: A Survey of Modern Approaches in LLM Development（Gustavo Aquino, Nádila da Silva de Azevedo, Leandro Youiti Silva Okimoto, Leonardo Yuto Suzuki Camelo, Hendrio Bragança, Rubens de Andrade Fernandes, André Luiz Printes, Fábio Florença Cardoso, Raimundo Cláudio Souza Gomes, Israel Gondres Torné, 2025, Preprints.org）

agent rag

合并后，agentic RAG的研究可以并列归纳为：①范式总体定位；②层级/协同多代理架构；③迭代检索与检索轨迹过程控制；④面向事实核查与安全合规的闭环（检索—验证—停止）；⑤领域应用型多代理落地（临床/决策/危害等）；⑥移动端长时任务的双级检索增强；⑦多模态证据整合与多跳推理；⑧可解释与评测工具以理解检索-生成动力学；⑨动态通用数据源检索（AU-RAG）；⑩工具/流程编排型ReAct/RAG；以及在可靠性与可信工程层面的工作（critic/XAI、受控访问与人类参与、框架/服务化协作），并进一步覆盖针对异构知识形态的专门化RAG（多模态、表格）与个别推理范式细分。

共 42 篇文献，16 个研究方向

Agentic RAG总体范式：与传统RAG的差异、定位与能力机制

对比并界定传统RAG与Agentic RAG的差异与能力定位，回答“为何/如何通过自治、迭代与编排代理能力嵌入RAG以提升适应性与性能”，属于范式框架与总体定位类工作。相关文献: F Neha et. al, 2025 等 2 篇文献

层级/协同多代理架构：任务分解—多源检索—融合决策

以多智能体协作为核心：通过任务分解、分模态/多源检索与多路结果融合（投票/一致性/专家细化等）提升答案质量；其中实体解析方向体现“任务专用代理”如何协同完成复杂认知步骤。相关文献: Pei Liu et. al, 2025 等 3 篇文献

迭代检索与检索轨迹建模：自适应重检索、上下文压缩与多跳推进

聚焦“检索过程自身的智能化”：通过迭代检索、检索轨迹建模与检索压缩（摘要/计划推进、上下文管理）来优化多轮重检索与多跳信息获取；强调检索—生成耦合中的过程控制。相关文献: Wenhan Han et. al, 2025 等 4 篇文献

面向事实核查/安全合规的闭环代理：检索—验证—停止条件

在高风险/高可靠需求场景中构建闭环代理流程：以证据检索为前提，引入验证与停止条件（置信度/需求推导驱动），把“检索相关性+核查/安全推导”形成受控闭环。相关文献: Zhuohan Xie et. al, 2025 等 2 篇文献

领域应用型多代理RAG：临床决策、能耗/对话决策与危害场景

将Agentic RAG落地到具体业务的多代理系统：医疗/临床侧强调自查询与检索质量提升；决策对话侧强调角色化多代理支持理解—检索—分析—展示；危害/风险场景侧构建面向用户的证据整合与生成流程。相关文献: Srinivagasam Prabha et. al, 2025 等 3 篇文献

移动端/跨应用长时任务的双级检索增强：高层策略—低层执行

面向移动端/真实世界长时、多应用、UI执行型任务：区分高层规划与低层操作所需知识，并分别进行检索增强（如Manager/Operator双级思路），解决长时任务中的知识获取与执行协同问题。相关文献: Yuxiang Zhou et. al, 2026

多模态Agentic RAG与ReAct式多跳规划：跨模态证据整合与关系推理

把RAG扩展到多模态证据与多跳推理：对图像/表格/多模态关系等非纯文本信息进行联合检索与证据整合，并通过代理逻辑（协作/规划/推理范式）生成更完整、更可信的描述或答案。相关文献: Denys Yuvzhenko et. al, 2025 等 3 篇文献

可解释性评测与RAG交互动力学分析（如RAGTrace类）

面向可解释与可诊断：提供对检索—生成动力学的可追踪评测，分析系统为何失败、检索相关性与生成保真度如何随交互演化，从而支持迭代改进。相关文献: Sizhe Cheng et. al, 2025

动态/通用数据源检索：Agent-based Universal RAG（AU-RAG）

解决动态数据源与通用检索访问适配：通过代理结合描述性元数据实现对不断变化的数据源的学习式访问与动态检索，而非依赖一次性预编码索引。相关文献: Jisoo Jang et. al, 2024

工具/流程编排型Agent ReAct/RAG：Tool Retrieval与复杂任务工作流

强调工具链/流程编排：将检索作为工具的一部分融入复杂任务执行流程（ReAct/tool retrieval逻辑），突出工程实现层面的流程与推理结合方式，而非仅做检索相关性或多代理通用架构。相关文献: N. Braunschweiler et. al, 2025

可信可靠：Critic/可解释/系统化推理驱动的评估纠错与透明化

以可靠性与透明度为核心：使用critic驱动的自我纠错/误差挖掘工作流，以及系统化推理与可解释AI方法论来提升Agentic RAG的可控性与可审计性，而不是单纯提升召回。相关文献: Guanting Dong et. al, 2025 等 3 篇文献

可信/受控Agentic RAG工作流：人类/专家参与、访问控制与端到端工程架构

共性在于“可运行的Agentic RAG工作流与可信/受控访问”：通过角色约束、工具/数据访问控制、人类/专家参与与反馈、以及可追溯的工程化架构，把RAG升级为受控、可信的端到端工作流（含行业落地）。其中框架化流水线作为通用实现参照。相关文献: V. K et. al, 2025 等 6 篇文献

多智能体编排与协作架构（框架/服务化视角）：分工—迭代—系统化实现

以多智能体编排与协作框架/服务化为主线：围绕多代理专业分工、迭代周期与框架化组织RAG流程；既包含面向多代理的迭代检索思想，也包含从RAG到多代理系统的架构/服务视角总结与落地。相关文献: Seyoung Song et. al, 2026 等 7 篇文献

面向异构知识形态的专门化RAG：多模态与表格/异构文档支持

针对“异构知识形态”的结构化改造：分别面向多模态（如病理/图像等跨模态证据）与表格/异构文档（保持表结构与SQL执行、多跳推理），强调数据形态约束下检索与证据利用机制的专门化。相关文献: Wenchuan Zhang et. al, 2026 等 2 篇文献

推理范式驱动的RAG可靠性（System化推理组织）

（用于覆盖“系统化推理范式”这一子点）该文强调以System 1/2式推理组织RAG推理过程，体现Agentic RAG中推理范式与可靠生成的关系。相关文献: Jintao Liang et. al, 2025

危害/风险场景的多代理RAG系统（hazard同主题并列条目）

hazard场景的并列命名重复条目，用于补齐同一主题的同义/重复引用，避免丢失提供的bibkey。相关文献: Yangxinyu Xie et. al, 2025 等 2 篇文献

总计44篇相关文献

Agentic RAG with Human-in-the-Retrieval

具有检索中人的代理式RAG

doi.org-Xiwei Xu, Dawen Zhang, Qing Liu 等, 2025-2025 IEEE 22nd International Conference on Software Architecture Companion (ICSA-C)

Retrieval-Augmented Generation (RAG) has emerged as a promising solution to address key challenges faced by GenAI, such as hallucination, outdated or non-removable parametric knowledge, and non-traceable reasoning processes. Existing RAG frameworks introduce dynamism into RAG process through adaptive, recursive and interactive usage of retriever and generator. More recently, agentic RAG adds another layer of intelligence to RAG by leveraging GenAI agents to further enhance dynamism by autonomously planning the retrieval process as a complex orchestration workflow with various external tools. However, current RAG architectures often overlook the significant role that domain experts can play in the retrieval process, alongside passive knowledge bases. This paper introduces a new paradigm for agentic RAG systems, capable of integrating external passive knowledge bases as well as active domain experts. This integration further enhances the versatility and factual accuracy of RAG systems. The paper discusses the key components of this new paradigm and examines the associated design challenges.