Large Language Models Agentic AI

本报告对 Large Language Models Agentic AI 领域的文献进行了系统性梳理，划分为五大逻辑板块：架构与工程框架确保系统的稳健性与可扩展性；多智能体协作与社会动力学研究核心在于交互与群体智慧；推理、规划与强化学习探讨智能核心的自主进化能力；安全、可靠与评估治理确立了应用边界与可信度底线；最后通过广泛的垂直领域与具身智能应用展现了其在真实物理与数字场景中的工业落地价值。

共 280 篇文献，5 个研究方向

代理架构与系统工程框架

集中于构建模块化、可扩展的LLM代理软件架构，涵盖系统初始化、生命周期管理、API设计、工程实践及标准化部署方案。相关文献: Chunhao Tian et. al, 2025 等 28 篇文献

多智能体协作与社会动力学研究

研究多代理系统（MAS）的交互协议、角色分配、团队协作机制、沟通拓扑以及模拟复杂人类社会行为的动态协作研究。相关文献: Hui Yi Leong et. al, 2025 等 38 篇文献

代理推理、规划与强化学习自进化

探讨代理如何通过复杂推理、多步规划、过程监督强化学习、自我演化（Self-evolution）及负面轨迹学习提升自主决策与性能上限。相关文献: Zhihao Jia et. al, 2025 等 44 篇文献

安全性、可靠性评估与伦理治理

关注代理部署中的安全风险（如注入、后门、幻觉），提出评估基准、合规治理、可信度量与人类可控机制。相关文献: Olivier Schipper et. al, 2025 等 43 篇文献

具身智能、垂直行业与应用实践

研究LLM代理在物理世界（自动驾驶、机器人）、医疗、工业控制、金融及科学研究等特定领域的落地应用与决策优化。相关文献: Ayan Banerjee et. al, 2024 等 127 篇文献

总计292篇相关文献

DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent

DRC-Coder：基于LLM自主代理的自动化DRC检查器代码生成

Chen-Chia Chang, Chia-Tung Ho, Yaguang Li 等, 2024-Proceedings of the 2025 International Symposium on Physical Design

In the advanced technology nodes, the integrated design rule checker (DRC) is often utilized in place and route tools for fast optimization loops for power-performance-area. Implementing integrated DRC checkers to meet the standard of commercial DRC tools demands extensive human expertise to interpret foundry specifications, analyze layouts, and debug code iteratively. However, this labor-intensive process, requiring to be repeated by every update of technology nodes, prolongs the turnaround time of designing circuits. In this paper, we present DRC-Coder, a multi-agent framework with vision capabilities for automated DRC code generation. By incorporating vision language models and large language models (LLM), DRC-Coder can effectively process textual, visual, and layout information to perform rule interpretation and coding by two specialized LLMs. We also design an auto-evaluation function for LLMs to enable DRC code debugging. Experimental results show that targeting on a sub-3nm technology node for a state-of-the-art standard cell layout tool, DRC-Coder achieves perfect F1 score 1.000 in generating DRC codes for meeting the standard of a commercial DRC tool, highly outperforming standard prompting techniques (F1=0.631). DRC-Coder can generate code for each design rule within four minutes on average, which significantly accelerates technology advancement and reduces engineering costs.