用于材料科学研究的智能体进展研究进展：149篇文献的方向梳理与核心结论

用于材料科学研究的智能体进展

本报告综合了用于材料科学研究的智能体进展，揭示了该领域正从单一的LLM辅助工具向高度自主的“AI科学家”范式演进。最终分组涵盖了从宏观战略路线图到微观技术实现的完整链条：包括知识层的文献挖掘与多模态表征、逻辑层的架构设计与假设生成、计算层的仿真自动化、以及物理层的自驱动实验室集成。同时，研究重点正转向系统可靠性、标准化基准测试及特定材料体系（如电池、合金）的深度闭环发现，旨在构建可信、高效且具备物理逻辑推理能力的自主科研基础设施。

共 149 篇文献，8 个研究方向

领域综述、科研范式演进与发展路线图

这类文献提供了材料科学从AI4Science向Agentic Science演进的宏观视角，探讨了LLM在科研中的角色定位（如Oracle, Surrogate, Arbiter），并提出了针对特定领域（如2D材料、半导体）的战略发展路径。相关文献: Huan Zhang et. al, 2026 等 29 篇文献

文献知识挖掘、多模态表征与科学数据库构建

侧重于利用LLM和视觉-语言模型（VLM）从海量非结构化文献、图表、光谱及显微图像中自动提取材料属性、合成路线和微观结构特征，构建结构化知识图谱或领域数据库。相关文献: Xiang Liu et. al, 2025 等 29 篇文献

智能体系统架构、多机协作与通用推理框架

研究智能体的底层逻辑架构，包括Planner-Executor模式、主动工具发现（MCP协议）、多智能体辩论/黑板协作机制，以及提高长程任务可靠性与对齐的方法。相关文献: R. Yin et. al, 2025 等 20 篇文献

科学假设生成、逆向设计与逻辑推理

专注于利用智能体生成科学假设、进行物理化学逻辑推演，并结合生成式AI与强化学习进行材料的逆向设计、晶体结构预测及目标导向的分子建模。相关文献: Heegyu Kim et. al, 2025 等 14 篇文献

计算材料学自动化与高通量仿真智能体

探讨智能体如何自动化执行复杂的计算任务，如DFT模拟（VASP）、分子动力学（LAMMPS）、势能面导航及本构模型生成，实现计算流的端到端管理。相关文献: Ziqi Wang et. al, 2025 等 10 篇文献

自驱动实验室（SDLs）与物理实验闭环集成

关注智能体与机器人硬件、自动化仪器的深度集成，实现从实验设计、自动化合成、原位表征到闭环优化的全流程物理自动化系统。相关文献: Yahao Dai et. al, 2025 等 22 篇文献

特定材料体系应用与模型优化技术

展示智能体在合金、电池电解质、聚合物、MOFs等具体领域的应用实效，并涉及针对材料科学任务的LLM微调、压缩及强化学习优化技术。相关文献: Théo Jaffrelot Inizan et. al, 2025 等 17 篇文献

基准测试、系统可靠性与科研基础设施

专注于建立标准化的评估框架（如MatTools, MADE），探讨智能体行为的溯源（Provenance）、元数据标准化以及人类在环（HILT）的协作模式。相关文献: Jack Foxabbott et. al, 2025 等 8 篇文献

总计149篇相关文献

34 Examples of LLM Applications in Materials Science and Chemistry: Towards Automation, Assistants, Agents, and Accelerated Scientific Discovery

34个材料科学与化学领域大型语言模型应用的实例：迈向自动化、助手、代理和加速科学发现

Yoel Zimmermann, Adib Bazgir, Alexander Al-Feghali 等, 2025-ArXiv

Large Language Models (LLMs) are reshaping many aspects of materials science and chemistry research, enabling advances in molecular property prediction, materials design, scientific automation, knowledge extraction, and more. Recent developments demonstrate that the latest class of models are able to integrate structured and unstructured data, assist in hypothesis generation, and streamline research workflows. To explore the frontier of LLM capabilities across the research lifecycle, we review applications of LLMs through 34 total projects developed during the second annual Large Language Model Hackathon for Applications in Materials Science and Chemistry, a global hybrid event. These projects spanned seven key research areas: (1) molecular and material property prediction, (2) molecular and material design, (3) automation and novel interfaces, (4) scientific communication and education, (5) research data management and automation, (6) hypothesis generation and evaluation, and (7) knowledge extraction and reasoning from the scientific literature. Collectively, these applications illustrate how LLMs serve as versatile predictive models, platforms for rapid prototyping of domain-specific tools, and much more. In particular, improvements in both open source and proprietary LLM performance through the addition of reasoning, additional training data, and new techniques have expanded effectiveness, particularly in low-data environments and interdisciplinary research. As LLMs continue to improve, their integration into scientific workflows presents both new opportunities and new challenges, requiring ongoing exploration, continued refinement, and further research to address reliability, interpretability, and reproducibility.