Multi-Agent Game Characters
Cognitive Architectures and Collaborative Reasoning for LLM-Based Agents
This cluster examines how the logical reasoning, memory management, and natural language capabilities of large language models (LLMs) can be harnessed to build multi-agent systems. Key themes include character consistency, task planning, intention propagation, and improving system robustness and collaborative efficiency through reflection mechanisms and self-organizing architectures.
- Reinforce LLM Reasoning through Multi-Agent Reflection(Yurun Yuan, Tengyang Xie, 2025, ArXiv)
- MIRIX: Multi-Agent Memory System for LLM-Based Agents(Yu Wang, Xi Chen, 2025, ArXiv)
- AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing(Zhenhua Xu, Dongsheng Chen, Shuo Wang, Jian Li, Chengjie Wang, Meng Han, Yabiao Wang, 2026, ArXiv)
- Role-Specific Reward Design with Large Language Model for StarCraft II(Sijia Li, Haonan Lou, Xu Zhang, Xin Zeng, Zhixuan Shen, Tianrui Li, 2025, ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))
- ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks(Heng Zhou, Hejia Geng, Xiangyuan Xue, Zhenfei Yin, Lei Bai, 2025, No journal)
- BlockAgents: Towards Byzantine-Robust LLM-Based Multi-Agent Coordination via Blockchain(Bei Chen, Gaolei Li, Xi Lin, Zheng Wang, Jianhua Li, 2024, Proceedings of the ACM Turing Award Celebration Conference - China 2024)
- Multi-Agent Collaboration via Evolving Orchestration(Yufan Dang, Cheng Qian, Xu Luo, Jingru Fan, Zihao Xie, Ruijie Shi, Weize Chen, Cheng Yang, Xiaoyin Che, Ye Tian, Xuantang Xiong, Lei Han, Zhiyuan Liu, Maosong Sun, 2025, ArXiv)
- Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games(Dekun Wu, Haochen Shi, Zhiyuan Sun, Bang Liu, 2023, No journal)
- YOLO-MARL: You Only LLM Once for Multi-Agent Reinforcement Learning(Zhuang Yuan, Yi Shen, Zhili Zhang, Yuxiao Chen, Fei Miao, 2024, 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))
- MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems(Rui Ye, Shuo Tang, Rui Ge, Yaxin Du, Zhen-fei Yin, Siheng Chen, Jing Shao, 2025, ArXiv)
- Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent(Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Peng Hao, Liehuang Zhu, 2024, No journal)
- Hybrid Voting-Based Task Assignment in Role-Playing Games(Daniel Weiner, Raj Korpan, 2025, ArXiv)
- Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models(Xihe Qiu, Haoyu Wang, Xiaoyu Tan, Chao Qu, Yujie Xiong, Yuan Cheng, Yinghui Xu, Wei Chu, Yuan Qi, 2024, ArXiv)
- LLM Multi-agent Decision Optimization(J. Curtò, I. D. Zarzà, C. Calafate, 2024, No journal)
- AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration(Bo Pan, Jiaying Lu, Ke Wang, Li Zheng, Zhen Wen, Yingchaojie Feng, Minfeng Zhu, Wei Chen, 2024, ArXiv)
Coordination, Communication, and Game-Theoretic Optimization in Multi-Agent Reinforcement Learning (MARL)
This cluster focuses on reinforcement learning approaches to cooperation challenges in multi-agent environments. Topics include communication via graph neural networks (GNNs), value decomposition, mutual information regularization, evolutionary optimization, and strategy balancing and opponent modeling in imperfect-information games such as MOBAs and soccer.
- Enhancing Graph-based Coordination with Evolutionary Algorithms for Episodic Multi-agent Reinforcement Learning(Kexing Peng, Pengyi Li, Jianye Hao, 2025, No journal)
- Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach(Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan, 2024, No journal)
- Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents(Joseph Suarez, Yilun Du, Phillip Isola, Igor Mordatch, 2019, ArXiv)
- Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers(Lei Yuan, Zifei Zhang, Ke Xue, Hao Yin, F. Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu, 2023, No journal)
- Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning(Wei Duan, Jie Lu, Junyu Xuan, 2024, ArXiv)
- ACORN: Acyclic Coordination with Reachability Network to Reduce Communication Redundancy in Multi-Agent Systems(Yi Xie, Ziqing Zhou, Chun Ouyang, Siao Liu, Linqiang Hu, Zhongxue Gan, 2025, No journal)
- Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data(Fuxiang Zhang, Chengxing Jia, Yi-Chen Li, Lei Yuan, Yang Yu, Zongzhang Zhang, 2023, No journal)
- Towards General Cooperative Game Playing(J. Marinheiro, Henrique Lopes Cardoso, 2018, Trans. Comput. Collect. Intell.)
- A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning(Woojun Kim, Whiyoung Jung, Myungsik Cho, Young-Jin Sung, 2023, ArXiv)
- FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning(Wenzhe Li, Zihan Ding, Seth Karten, Chi Jin, 2024, ArXiv)
- Collaborative museum heist with reinforcement learning(Eleni Evripidou, A. Aristidou, Panayiotis Charalambous, 2023, Computer Animation and Virtual Worlds)
- Learning Pre-Trained Tacit Behavior for Efficient Multi-Agent Adversarial Coordination(Shiqing Yao, Jiajun Chai, Haixin Yu, Yongzhe Chang, Yuanheng Zhu, Xueqian Wang, 2025, No journal)
- Balancing Intransitive Relationships in MOBA Games Using Deep Reinforcement Learning(Conor Stephens, Chris Exton, 2020, Proceedings of the 14th International Conference on Interfaces and Human Computer Interaction 2020 and 13th International Conference on Game and Entertainment Technologies 2020)
- A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem(Paul Barde, J. Foerster, D. Nowrouzezahrai, Amy Zhang, 2023, ArXiv)
- Team formation through an assessor: choosing MARL agents in pursuit–evasion games(Yue Zhao, Lushan Ju, J. Hernández-Orallo, 2024, Complex & Intelligent Systems)
Social Intelligence: Believable Character Modeling, Emotional Interaction, and Moral Reasoning
This cluster centers on improving the believability and psychological depth of NPCs. It covers psychological models (e.g., HEXACO), moral judgment, social conflict modeling, emotion synthesis, and neuroscientific measurements of human players' responses to AI characters, with the aim of deepening narrative immersion and the sense of teamwork in human-agent collaboration.
- Agent communication for believable human-like interactions between virtual characters(J. V. Oijen, F. Dignum, 2012, No journal)
- Modeling believable game characters(Hanneke Kersjes, P. Spronck, 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG))
- Perspective-Taking of Non-Player Characters in Prosocial Virtual Reality Games: Effects on Closeness, Empathy, and Game Immersion(Jeffrey C. F. Ho, Ryan Ng, 2020, Behaviour & Information Technology)
- Modeling Morality-Based Argumentation for Believable Game Characters: A Design Postmortem(Rehaf Aljammaz, Michael Mateas, Noah Wardrip-Fruin, 2023, No journal)
- Smells Like Team Spirit: Investigating the Player Experience with Multiple Interlocutors in a VR Game(Nima Zargham, Michael Bonfert, Georg Volkmar, R. Porzel, R. Malaka, 2020, Extended Abstracts of the 2020 Annual Symposium on Computer-Human Interaction in Play)
- Dyadic cooperation with human and artificial agents: Event-related potentials trace dynamic role taking during an interactive game.(Karl-Philipp Flösch, Tobias Flaisch, Martin A. Imhof, H. Schupp, 2023, Psychophysiology)
- Multimodal emotion estimation and emotional synthesize for interaction virtual agent(Minghao Yang, J. Tao, Hao Li, Kaihui Mu, 2012, 2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems)
- Playing with Social and Emotional Game Companions(Andry Chowanda, Martin Flintham, P. Blanchfield, M. Valstar, 2016, No journal)
- Emotional input for character-based interactive storytelling(M. Cavazza, D. Pizzi, Fred Charles, Thurid Vogt, E. André, 2009, No journal)
- Creating adaptive affective autonomous NPCs(M. Lim, João Dias, R. Aylett, Ana Paiva, 2010, Autonomous Agents and Multi-Agent Systems)
- Computational Models of Emotion, Personality, and Social Relationships for Interactions in Games: (Extended Abstract)(Andry Chowanda, P. Blanchfield, Martin Flintham, M. Valstar, 2016, No journal)
- Towards Simulated Morality Systems: Role-Playing Games as Artificial Societies(Joan Casas-Roma, M. Nelson, J. Arnedo-Moreno, S. Gaudl, Rob Saunders, 2019, No journal)
- Capturing and generating social behavior with the restaurant game(Jeff Orkin, D. Roy, 2010, No journal)
- Affect-Aware Agents for Emergent Social Conflict in Games(Weilun Deng, 2025, Applied and Computational Engineering)
- Meta-evaluating the Effects of Social Preferences on NPC-evaluators in an Energy Community Game(Andrés Isaza-Giraldo, Paulo Bala, Anna Jiskrová, Luiz Sachser, Pedro F. Campos, Lucas Pereira, 2025, Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems)
- Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations of Human Trust(Amogh Mannekote, Adam Davies, Guohao Li, K. Boyer, Chengxiang Zhai, Bonnie J. Dorr, Francesco Pinto, 2025, ArXiv)
Embodied Intelligence, Physics-Based Animation, and Multimodal Behavior Generation
This cluster concerns how agents perform in physical or visually complex environments, including action decision-making with vision-language models (VLMs), limb coordination via generative adversarial imitation learning (GAIL), and physically coupled multi-agent motion such as cooperative object carrying.
- MAAIP(Mohamed Younes, Ewa Kijak, R. Kulpa, Simon Malinowski, Franck Multon, 2023, Proceedings of the ACM on Computer Graphics and Interactive Techniques)
- CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics(Jiawei Gao, Ziqin Wang, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang, 2024, ArXiv)
- Can VLMs Play Action Role-Playing Games? Take Black Myth Wukong as a Study Case(Peng Chen, Pi Bu, Jun Song, Yuan Gao, Bo Zheng, 2024, ArXiv)
- Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning(Isadora White, Kolby Nottingham, Ayush Maniar, Max Robinson, H. Lillemark, M. Maheshwari, Lianhui Qin, Prithviraj Ammanabrolu, 2025, ArXiv)
- Facilitating Video Story Interaction with Multi-Agent Collaborative System(Yiwen Zhang, Jianing Hao, Zhan Wang, Hongling Sheng, Wei Zeng, 2025, ArXiv)
- Behavior NPC Prediction Using Deep Learning(A Y Maulana, Supeno Mardi, E. M. Yuniarno, Y. Suprapto, 2022, 2022 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM))
- Multimodal emotion estimation and emotional synthesize for interaction virtual agent(Minghao Yang, J. Tao, Hao Li, Kaihui Mu, 2012, 2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems)
System Architectures, Narrative Control, and Serious Game Applications
This cluster covers the engineering foundations of multi-agent systems, including BDI models, finite state machines (FSMs), holonic architectures, and game engine middleware. It also examines director-control systems for interactive storytelling and serious game applications in education, healthcare, and safety training.
- Implementation of Finite State Machine Models on the Artificial Intelligence System of Characters in The Game "MMORPG" using RPG Maker(Tengku Syahdina, M. Pardede, Fuzy Yustika Manik, 2023, Journal of Artificial Intelligence and Engineering Applications (JAIEA))
- Amorphous Fortress Online: Collaboratively Designing Open-Ended Multi-Agent AI and Game Environments(M. Charity, Mayu Wilson, Steven Lee, Dipika Rajesh, Sam Earle, Julian Togelius, 2025, ArXiv)
- Multi-agent system for managing a game settlement with an expert-based behavior selection system for game characters(A.A. Yarovyi, I.R. Arseniuk, A. Kozlovskyi, D.P. Palamarchuk, O.O. Korolenko, 2026, Optoelectronic Information-Power Technologies)
- A BDI Game Master Agent for Computer Role-Playing Games(Bao Vo Luong, John Thangarajah, Fabio Zambetta, M. Hasan, 2017, Computers in Entertainment (CIE))
- Scripting embodied agents behaviour with CML: character markup language(Yasmine Arafa, E. Mamdani, 2003, No journal)
- Agents for games and simulations(F. Dignum, 2012, Autonomous Agents and Multi-Agent Systems)
- Implementation of a Holonic Multi-agent System in Mixed or Augmented Reality for Large Scale Interactions(Dani Manjah, K. Hagihara, G. Basso, B. Macq, 2020, No journal)
- A MultiAgent Architecture for Collaborative Serious Game applied to Crisis Management Training: Improving Adaptability of Non Player Characters(M'hammed Ali Oulhaci, Erwan Tranvouez, S. Fournier, B. Espinasse, 2014, EAI Endorsed Trans. Serious Games)
- Constella: Supporting Storywriters’ Interconnected Character Creation through LLM-based Multi-Agents(Syemin Park, Soobin Park, Youn-kyung Lim, 2025, ACM Transactions on Computer-Human Interaction)
- Evaluating directorial control in a character-centric interactive narrative framework(Mei Si, S. Marsella, David V. Pynadath, 2010, No journal)
- Teaching STEM through a Role-Playing Serious Game and Intelligent Pedagogical Agents(A. Terracina, Riccardo Berta, F. Bordini, R. Damilano, Massimo Mecella, 2016, 2016 IEEE 16th International Conference on Advanced Learning Technologies (ICALT))
- Multi-Agent Interactive Game Simulation for Enhancing Child Safety(C. Valliyammai, Keziah Jennie, K. Rameshbabu, D. P. Prshanth, 2024, 2024 2nd International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS))
- Spyke3D: A new computer games oriented BDI Agent Framework(Luca Palazzo, Gianluca Dolcini, A. Claudi, Gianluigi Biancucci, Paolo Sernani, L. Ippoliti, Lorenzo Salladini, A. Dragoni, 2013, Proceedings of CGAMES'2013 USA)
Evaluation Benchmarks, Development Tools, and Interaction Prototyping
This cluster provides supporting infrastructure for multi-agent research: benchmarks for zero-shot coordination (ZSC) and role-playing ability, rapid prototyping tools such as PaintBoard, and Wizard-of-Oz frameworks for human-in-the-loop experiments.
- FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline(Haotian Wu, Shufan Jiang, Mingyu Chen, Yiyang Feng, Hehai Lin, Heqing Zou, Yao Shu, Chengwei Qin, 2025, ArXiv)
- ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination(Xihuai Wang, Shao Zhang, Wenhao Zhang, Wentao Dong, Jingxiao Chen, Ying Wen, Weinan Zhang, 2023, Advances in Neural Information Processing Systems 37)
- Tachikuma: Understading Complex Interactions with Multi-Character and Novel Objects by Large Language Models(Yuanzhi Liang, Linchao Zhu, Yezhou Yang, 2023, ArXiv)
- LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models(Saaket Agashe, Yue Fan, Anthony Reyna, Xin Eric Wang, 2023, No journal)
- Fever Basketball: A Complex, Flexible, and Asynchronized Sports Game Environment for Multi-agent Reinforcement Learning(Hangtian Jia, Yujing Hu, Yingfeng Chen, Chunxu Ren, Tangjie Lv, Changjie Fan, Chongjie Zhang, 2020, ArXiv)
- QuickWoZ: a multi-purpose wizard-of-oz framework for experiments with embodied conversational agents(Jan David Smeddinck, K. Wajda, A. Naveed, Leen Touma, Yuting Chen, Muhammad Abu Hasan, Muhammad Waqas Latif, R. Porzel, 2010, No journal)
- PaintBoard: prototyping interactive character behaviors by digitally painting storyboards(Daniel J. Rea, T. Igarashi, J. Young, 2014, Proceedings of the second international conference on Human-agent interaction)
- Tigrito: a multi-mode interactive improvisational agent(Heidy Maldonado, A. Picard, Patrick Doyle, B. Hayes-Roth, 1998, No journal)
- From Text to Tactic: Evaluating LLMs Playing the Game of Avalon(Jonathan Light, Min Cai, Sheng Shen, Ziniu Hu, 2023, ArXiv)
Intention Recognition and Strategy Modeling in Specific Games
This cluster collects behavior models built for specific game genres, such as tank battles, board and card games, and traffic simulation. The emphasis is on predicting opponent behavior, recognizing player intent, mitigating negative side effects of cooperation, and optimizing decision logic for particular scenarios.
- Explaining and Predicting the Behavior of BDI-Based Agents in Role-Playing Games(M. Sindlar, M. Dastani, F. Dignum, J. Meyer, 2009, No journal)
- Reinforcement Learning Agents Playing Ticket to Ride–A Complex Imperfect Information Board Game With Delayed Rewards(Shuo Yang, M. Barlow, Thomas Townsend, Xuejie Liu, Dilini Samarasinghe, E. Lakshika, Glenn Moy, Timothy Lynar, Benjamin P. Turnbull, 2023, IEEE Access)
- Towards a multi-agent non-player character road network: a Reinforcement Learning approach(Stela Makri, Panayiotis Charalambous, 2021, 2021 IEEE Conference on Games (CoG))
- Combine Intent Recognition with Behavior Modeling in Teaching Competition Military Simulation Platform(Yi Zhang, Shuilin Li, Chuan Ai, Yong Peng, Kai Xu, 2024, No journal)
- Reinforcement Learning-Based Autonomous Soccer Agents: A Study in Multi-Agent Coordination and Strategy Development(Biplov Paneru, B. Paneru, Ramhari Poudyal, K. Poudyal, 2025, Buana Information Technology and Computer Sciences (BIT and CS))
- Minimizing Negative Side Effects in Cooperative Multi-Agent Systems using Distributed Coordination(Moumita Choudhury, Sandhya Saisubramanian, Hao Zhang, S. Zilberstein, 2024, No journal)
- AIP: Adversarial Interaction Priors for Multi-Agent Physics-based Character Control(Mohamed Younes, Ewa Kijak, Simon Malinowski, R. Kulpa, F. Multon, 2022, SIGGRAPH Asia 2022 Posters)
- Dungeons & Replicants II: Automated Game Balancing Across Multiple Difficulty Dimensions via Deep Player Behavior Modeling(Johannes Pfau, Antonios Liapis, Georgios N. Yannakakis, R. Malaka, 2023, IEEE Transactions on Games)
- Design of a Decision Maker Agent for a Distributed Role Playing Game - Experience of the SimParc Project(Jean-Pierre Briot, Alessandro Sordoni, Eurico Vasconcelos Filho, M. Irving, Gustavo Melo, Vinícius Sebba-Patto, Isabelle Alvarez, 2009, No journal)
This report synthesizes the state of research on multi-agent game characters, tracing the field's shift from traditional rule-based architectures to systems driven by deep reinforcement learning and large language models. At the algorithmic level, MARL research has delivered more efficient communication and coordination; at the cognitive level, LLM frameworks contribute long-term memory and logical reasoning; and at the socio-psychological level, studies probe characters' morality, emotions, and believability. Meanwhile, work on embodied intelligence and multimodal interaction is moving NPCs from simple scripted control toward rich physical and visual perception, and maturing evaluation benchmarks and prototyping toolchains give the field rigorous yardsticks. Together, these threads push game characters toward greater intelligence, sociality, and human-likeness.
A total of 118 related publications.
This article presents a novel approach to modeling the intelligent behavior of game characters through a multi-agent system integrated with an expert system for behavior selection. By incorporating this system, game characters acquire the ability to adapt their behavior according to their individual traits and interactions with other characters. To model personality traits, several well-known psychological frameworks were considered, including FFM (The Five Factor Model), HEXACO, and AD (Affiliation and Dominance Model). After comparing the models, a combination of HEXACO and AD was chosen, as this approach allows for detailed modeling of both individual game character traits and their interpersonal relationships. To select the appropriate behavior for a game character, a scoring system was developed that evaluates behavior templates based on input data: the character’s mood, personality traits, relationship types, and knowledge about the behavior of other game characters. This data is used to calculate a total score for each behavior template, determining the character's final action. The calculation process is performed using compound behavior matching matrices that align templates with character traits. The introduction of a random deviation ensures variability in game character behavior and prevents deterministic outcomes. The scoring system is formalized as a mathematical model that describes the influence of each factor on behavior selection through scoring functions and corresponding weighting coefficients. To test the expert system, a game prototype was developed on the Unity platform, where game characters perform tasks to maintain the settlement's functionality. They independently choose tasks based on the current environment state and interact with one another according to behavior templates selected by the expert system. 
The proposed approach enables the creation of a dynamic game environment with unpredictable character actions, determined by their personality traits and interpersonal relationships. This opens new opportunities for improving behavior systems in games.
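The weighted scoring mechanism described above can be sketched as follows. The template names, factor weights, and random-deviation range are illustrative assumptions, not values from the paper; the paper's compound behavior matching matrices are reduced here to precomputed per-factor fit scores.

```python
import random

# Hypothetical factor weights for mood, personality traits, relationship
# type, and knowledge about other characters (illustrative values).
WEIGHTS = {"mood": 0.3, "traits": 0.4, "relationship": 0.2, "knowledge": 0.1}

def total_score(template, deviation=0.1, rng=random):
    """Weighted sum of per-factor fit scores, plus a small random
    deviation so behavior selection is not fully deterministic."""
    base = sum(WEIGHTS[f] * template["fit"][f] for f in WEIGHTS)
    return base + rng.uniform(-deviation, deviation)

def choose_behavior(templates, deviation=0.1, rng=random):
    """Select the behavior template with the highest total score."""
    return max(templates, key=lambda t: total_score(t, deviation, rng))

# Illustrative behavior templates with precomputed fit scores per factor.
templates = [
    {"name": "gather_food", "fit": {"mood": 0.8, "traits": 0.6, "relationship": 0.5, "knowledge": 0.4}},
    {"name": "guard_gate",  "fit": {"mood": 0.2, "traits": 0.9, "relationship": 0.3, "knowledge": 0.7}},
    {"name": "socialize",   "fit": {"mood": 0.9, "traits": 0.4, "relationship": 0.9, "knowledge": 0.2}},
]

print(choose_behavior(templates, deviation=0.0)["name"])  # -> socialize
```

With the deviation set to zero the choice is deterministic; a nonzero deviation lets closely scored templates (here, `socialize` at 0.63 vs. `gather_food` at 0.62) occasionally swap, producing the behavioral variability the paper aims for.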
Amorphous Fortress Online: Collaboratively Designing Open-Ended Multi-Agent AI and Game Environments
This work introduces Amorphous Fortress Online -- a web-based platform where users can design petri-dish-like environments and games consisting of multi-agent AI characters. Users can play, create, and share artificial life and game environments made up of microscopic but transparent finite-state machine agents that interact with each other. The website features multiple interactive editors and accessible settings to view the multi-agent interactions directly from the browser. This system serves to provide a database of thematically diverse AI and game environments that use the emergent behaviors of simple AI agents.
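The "microscopic but transparent finite-state machine agents" described above can be illustrated with a minimal FSM agent. The states, events, and transitions below are invented for illustration and are not taken from the Amorphous Fortress system.

```python
class FSMAgent:
    """A tiny finite-state-machine agent: a current state plus a
    transition table mapping (state, event) pairs to next states."""

    def __init__(self, transitions, state="idle"):
        self.state = state
        self.transitions = transitions  # (state, event) -> next state

    def on(self, event):
        # Unknown (state, event) pairs leave the state unchanged.
        self.state = self.transitions.get((self.state, event), self.state)
        return self.state

# Illustrative creature that chases food, eats it, then returns to idle.
critter = FSMAgent({
    ("idle", "sees_food"): "chase",
    ("chase", "touches_food"): "eat",
    ("eat", "food_gone"): "idle",
})
critter.on("sees_food")  # -> "chase"
```

Emergent behavior in such environments comes from many of these simple agents reading each other's states as events, not from any complexity inside a single machine.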
The proposed multi-agent interactive game simulation combines immersive storytelling with state-of-the-art language models and artificial intelligence (AI) characters that help children uncover the mysteries behind derelict buildings. The simulated personalities provide personalized advice, equipping these little explorers with knowledge and safety awareness within an exciting virtual environment. Simulated dialogues between a child and the different characters found inside a dilapidated structure are generated using fine-tuned versions of a Large Language Model (LLM). Each avatar in the game offers accurate guidance or encouragement to the child in its own unique way. The AI system is deployed on a server that allows seamless real-time interaction between the child and the virtual characters. This use of modern technology helps children make wise choices while remaining safe in urban places. The proposed multi-agent interactive game simulation thus enhances child safety in derelict buildings by mimicking gameplay.
Collaboration is ubiquitous and essential in day-to-day life -- from exchanging ideas, to delegating tasks, to generating plans together. This work studies how LLMs can adaptively collaborate to perform complex embodied reasoning tasks. To this end we introduce MINDcraft, an easily extensible platform built to enable LLM agents to control characters in the open-world game of Minecraft; and MineCollab, a benchmark to test the different dimensions of embodied and collaborative reasoning. An experimental study finds that the primary bottleneck in collaborating effectively for current state-of-the-art agents is efficient natural language communication, with agent performance dropping as much as 15% when they are required to communicate detailed task completion plans. We conclude that existing LLM agents are ill-optimized for multi-agent collaboration, especially in embodied scenarios, and highlight the need to employ methods beyond in-context and imitation learning. Our website can be found here: https://mindcraft-minecollab.github.io/
The development of deep reinforcement learning (DRL) has benefited from the emergence of a variety of game environments in which new challenging problems are proposed and new algorithms can be tested safely and quickly, such as board games, RTS, FPS, and MOBA games. However, many existing environments lack complexity and flexibility and assume that actions are executed synchronously in multi-agent settings, which limits their value. We introduce the Fever Basketball game, a novel reinforcement learning environment where agents are trained to play basketball. It is a complex and challenging environment that supports multiple characters, multiple positions, and both single-agent and multi-agent player control modes. In addition, to better simulate real-world basketball games, the execution time of actions differs among players, which makes Fever Basketball a novel asynchronized environment. We evaluate commonly used multi-agent algorithms for both independent learners and joint-action learners in three game scenarios of varying difficulty, and heuristically propose two baseline methods to diminish the extra non-stationarity introduced by asynchronism in Fever Basketball benchmarks. Besides, we propose an integrated curricula training (ICT) framework to better handle Fever Basketball problems, which includes several game-rule-based cascading curricula learners and a coordination curricula switcher focused on enhancing coordination within the team. The results show that the game remains challenging and can serve as a benchmark environment for studies of long time horizons, sparse rewards, credit assignment, non-stationarity, and related problems in multi-agent settings.
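The asynchronism described above, where each agent's action occupies a different amount of time, can be illustrated with a small event-queue loop. The agents, action durations, and policies below are invented for illustration; real asynchronized environments like Fever Basketball are far richer.

```python
import heapq

def run(agents, horizon):
    """Simulate agents whose actions take different durations.
    agents: dict name -> (policy, action_duration).
    Returns a log of (time, agent, action) decision events."""
    queue = [(0.0, name) for name in agents]  # (next decision time, agent)
    heapq.heapify(queue)
    log = []
    while queue:
        t, name = heapq.heappop(queue)
        if t >= horizon:
            continue  # past the horizon: drop without rescheduling
        policy, duration = agents[name]
        log.append((t, name, policy(t)))
        # The agent is busy until t + duration, so its next decision
        # point is misaligned with the other agents' decision points.
        heapq.heappush(queue, (t + duration, name))
    return log

# Illustrative agents: shooting takes longer than defending.
agents = {
    "shooter":  (lambda t: "shoot",  1.5),
    "defender": (lambda t: "defend", 1.0),
}
log = run(agents, horizon=3.0)
```

Because decision points drift apart (the defender acts at t = 0, 1, 2 while the shooter acts at t = 0, 1.5), each agent observes teammates mid-action, which is the extra non-stationarity the abstract's baseline methods aim to diminish.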
Recent advances in reinforcement learning (RL) heavily rely on a variety of well-designed benchmarks, which provide environmental platforms and consistent criteria to evaluate existing and novel algorithms. Specifically, in multi-agent RL (MARL), a plethora of benchmarks based on cooperative games have spurred the development of algorithms that improve the scalability of cooperative multi-agent systems. However, for the competitive setting, a lightweight and open-sourced benchmark with challenging gaming dynamics and visual inputs has not yet been established. In this work, we present FightLadder, a real-time fighting game platform, to empower competitive MARL research. Along with the platform, we provide implementations of state-of-the-art MARL algorithms for competitive games, as well as a set of evaluation metrics to characterize the performance and exploitability of agents. We demonstrate the feasibility of this platform by training a general agent that consistently defeats 12 built-in characters in single-player mode, and expose the difficulty of training a non-exploitable agent without human knowledge and demonstrations in two-player mode. FightLadder provides meticulously designed environments to address critical challenges in competitive MARL research, aiming to catalyze a new era of discovery and advancement in the field. Videos and code at https://sites.google.com/view/fightladder/home.
This paper presents MultiStyle, a multi-agent centralized heuristic search planner that incorporates distinct agent playstyles to generate solution plans where characters express individual preferences while cooperating to reach a goal. We include algorithmic details, an example domain, and multiple different solution plans generated with unique agent playstyle sets. We discuss our intent to incorporate this planner in a tool for game level designers to help them anticipate and understand how teams of players with distinct playstyles may play through their levels. Ultimately, MultiStyle generates solution plans with a novel and increased expressive range by attempting to satisfy sets of action and proposition preferences for each agent.
Text-adventure games and text role-playing games are grand challenges for reinforcement learning game playing agents. Text role-playing games are open-ended environments where an agent must faithfully play a particular character. We consider the distinction between characters and actors, where an actor agent has the ability to play multiple characters. We present a framework we call a thespian agent that can learn to emulate multiple characters along with a soft prompt that can be used to direct it as to which character to play at any time. We further describe an attention mechanism that allows the agent to learn new characters that are based on previously learned characters in a few-shot fashion. We show that our agent outperforms the state of the art agent framework in multi-character learning and few-shot learning.
Creating detailed and interactive game environments is an area of great importance in the video game industry. This includes creating realistic Non-Player Characters that respond seamlessly to the player's actions. Machine learning has made great contributions to the area, overcoming the scalability and robustness shortcomings of hand-scripted models. We introduce the early results of a reinforcement learning approach to building a simulation environment for heterogeneous, multi-agent non-player characters in a dynamic road-network game scene.
Recent advancements in natural language and Large Language Models (LLMs) have enabled AI agents to simulate human-like interactions within virtual worlds. However, these interactions still face limitations in complexity and flexibility, particularly in scenarios involving multiple characters and novel objects. Pre-defining all interactable objects in the agent's world model presents challenges, and conveying implicit intentions to multiple characters through complex interactions remains difficult. To address these issues, we propose integrating virtual Game Masters (GMs) into the agent's world model, drawing inspiration from Tabletop Role-Playing Games (TRPGs). GMs play a crucial role in overseeing information, estimating players' intentions, providing environment descriptions, and offering feedback, compensating for current world model deficiencies. To facilitate future explorations for complex interactions, we introduce a benchmark named Tachikuma, comprising a Multiple character and novel Object based interaction Estimation (MOE) task and a supporting dataset. MOE challenges models to understand characters' intentions and accurately determine their actions within intricate contexts involving multi-character and novel object interactions. Besides, the dataset captures log data from real-time communications during gameplay, providing diverse, grounded, and complex interactions for further explorations. Finally, we present a simple prompting baseline and evaluate its performance, demonstrating its effectiveness in enhancing interaction understanding. We hope that our dataset and task will inspire further research in complex interactions with natural language, fostering the development of more advanced AI agents.
Serious Games (SGs) are increasingly used for training, as in the crisis management domain, where several hundred stakeholders can be involved, causing various organizational difficulties in field exercises. SGs' specific benefits include player immersion and detailed tracking of players' actions during a virtual exercise. Moreover, Non-Player Characters (NPCs) can adapt the crisis management exercise perimeter to the available stakeholders or to specific training objectives. In this paper we present a Multi-Agent System architecture supporting behavioral simulation as well as monitoring and assessment of human players. An NPC is enacted by a Game Agent that reproduces the behavior of a human actor, based on a deliberative (Belief-Desire-Intention) model. To facilitate scenario design, an agent editor allows a designer to configure agents' behaviors. The behavior simulation was implemented within the pre-existing SIMFOR project, a serious game for training in crisis management.
Team coordination of non-player characters can create a deeper sense of immersion in real-time games by allowing characters to work together to produce better tactics and strategy. Achieving multi-agent coordination can be a difficult problem, and can incur substantial computational costs. Our goal with this work is to produce a reactive method for coordinating game characters that will allow computationally inexpensive team coordination. Reactive teaming creates teams of agents through the use of simple constant-time agent interactions without increasing the difficulty of authoring game characters.
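One way to read the "simple constant-time agent interactions" above is as pairwise handoff rules evaluated each tick. The roles, stamina attribute, and handoff rule below are invented for illustration and are not the paper's method.

```python
from dataclasses import dataclass

@dataclass
class Agent:
    name: str
    role: str      # "attacker" or "support" (illustrative roles)
    stamina: float

def interact(a, b):
    """Constant-time pairwise rule: the fresher agent takes the
    attacker role, the other drops back to support."""
    if a.stamina >= b.stamina:
        a.role, b.role = "attacker", "support"
    else:
        a.role, b.role = "support", "attacker"

def tick(team):
    """One reactive coordination step: O(1) work per interaction,
    O(n) per tick, and no global planner or search."""
    for a, b in zip(team, team[1:]):
        interact(a, b)

team = [Agent("a", "support", 0.9),
        Agent("b", "attacker", 0.4),
        Agent("c", "support", 0.7)]
tick(team)  # roles become: a=attacker, b=support, c=attacker
```

The appeal of this style is that team-level structure emerges from local rules, so adding an agent does not raise the authoring or computational cost of any individual interaction.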
Verbal communication is a central component in collaborative multiplayer gaming and creates a feeling of companionship among the players. In single-player games, this aspect is often missing. Advancements in speech recognition now open new potentials for voice-activated single-player experiences. In this work, we integrated voice interaction to a single-player virtual reality (VR) game. To create a sense of team spirit, we enabled players to talk to a multiplicity of agents using natural language. We hypothesize that conversing with only one agent cannot produce the same level of camaraderie. We conducted a preliminary qualitative user study (N=10) to explore how players experience talking with the in-game characters in the single-agent and the multi-agent condition. Early results suggest that our participants prefer interacting with the group of interlocutors. They perceived the multi-agent condition as more entertaining and liked the feeling of being part of a team.
There is a high demand for high-quality Non-Player Characters (NPCs) in video games. Hand-crafting their behavior is a labor intensive and error prone engineering process with limited controls exposed to the game designers. We propose to create such NPC behaviors interactively by training an agent in the target environment using imitation learning with a human in the loop. While traditional behavior cloning may fall short of achieving the desired performance, we show that interactivity can substantially improve it with a modest amount of human efforts. The model we train is a multi-resolution ensemble of Markov models, which can be used as is or can be further "compressed" into a more compact model for inference on consumer devices. We illustrate our approach on an example in OpenAI Gym, where a human can help to quickly train an agent with only a handful of interactive demonstrations. We also outline our experiments with NPC training for a first-person shooter game currently in development.
Affect-aware agents are one way to make social conflict between non-player characters in games feel more believable. Instead of relying only on fixed scripts, these agents maintain an internal model of emotion and social preferences that reacts to in-game events and to how other characters behave. When anger, gratitude, fear, or ambition change over time, the result may be alliance, betrayal, sacrifice, or a struggle for power between NPCs or between an NPC and the player. This paper presents a narrative literature review on how AI can recognize or simulate such emotions and social preferences to support emergent conflict. Prior work is organized into three strands: affective and motivation modeling for individual characters, social preferences and multi-agent learning for groups of agents, and drama or experience management systems that control the pacing and intensity of conflict events. For each strand, the review examines how state is represented, how conflict is triggered, and how player experience and safety are evaluated. On the basis of this review, a simple mechanism view is proposed that links emotional state and social values to typical conflict behaviors, and key design choices are identified that can make conflicts both understandable and controllable. The goal is to provide game designers and AI practitioners with practical ideas for using affect-aware agents when building games that rely on emergent social conflict.
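The "simple mechanism view" above, linking emotional state and social values to typical conflict behaviors, could be sketched as a threshold rule table. All field names, thresholds, and behavior labels below are illustrative assumptions, not taken from the reviewed papers.

```python
from dataclasses import dataclass

@dataclass
class AffectState:
    # All values in [0, 1]; the four affects named in the review.
    anger: float
    gratitude: float
    fear: float
    ambition: float

def conflict_behavior(state: AffectState) -> str:
    """Map the current affect state to a typical conflict behavior.

    Rules are checked in priority order; thresholds are arbitrary
    illustrative choices a designer would tune.
    """
    if state.fear > 0.7:
        return "flee"
    if state.anger > 0.6 and state.ambition > 0.5:
        return "power_struggle"
    if state.anger > 0.6:
        return "betrayal"
    if state.gratitude > 0.6:
        return "alliance"
    return "neutral"

print(conflict_behavior(AffectState(anger=0.8, gratitude=0.1, fear=0.2, ambition=0.7)))
# prints power_struggle
```

A real affect model would update these values continuously from in-game events; the point here is only that an explicit state-to-behavior mapping keeps emergent conflict both understandable and controllable.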
We describe a social robot acting as a game master in an interactive tabletop role-playing game. The Robot Game Master (RGM) takes on the role of different characters, which the human players meet during the adventure, as well as of the narrator. The demonstration presents a novel software and hardware platform that allows the robot to (1) proactively lead through the storyline and to (2) react to changes in the ongoing game in real-time, while (3) fostering players' collaborations.
No abstract available
World models improve a learning agent's ability to efficiently operate in interactive and situated environments. This work focuses on the task of building world models of text-based game environments. Text-based games, or interactive narratives, are reinforcement learning environments in which agents perceive and interact with the world using textual natural language. These environments contain long, multi-step puzzles or quests woven through a world that is filled with hundreds of characters, locations, and objects. Our world model learns to simultaneously: (1) predict changes in the world caused by an agent's actions when representing the world as a knowledge graph; and (2) generate the set of contextually relevant natural language actions required to operate in the world. We frame this task as a Set of Sequences generation problem by exploiting the inherent structure of knowledge graphs and actions and introduce both a transformer-based multi-task architecture and a loss function to train it. A zero-shot ablation study on never-before-seen textual worlds shows that our methodology significantly outperforms existing textual world modeling techniques as well as the importance of each of our contributions.
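Representing the world as a knowledge graph, as the abstract describes, means an action's effect can be expressed as a set of triple additions and deletions. This is a minimal sketch of that idea; the triple format and delta encoding are assumptions for illustration, not the paper's actual representation.

```python
# World state as a set of (subject, relation, object) triples; an action's
# predicted effect is a "delta" of triples to delete and add.

def apply_delta(graph: set, delta: dict) -> set:
    """Apply a predicted graph delta to the current knowledge graph."""
    return (graph - set(delta.get("del", []))) | set(delta.get("add", []))

world = {("player", "in", "kitchen"), ("key", "on", "table")}
# Hypothetical predicted effect of the action "take key":
delta = {"del": [("key", "on", "table")], "add": [("player", "has", "key")]}
world = apply_delta(world, delta)
print(("player", "has", "key") in world)  # prints True
```

In the paper's setting, a transformer predicts both this delta and the set of contextually valid next actions; here the delta is simply given.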
Simulating realistic interactions and motions for physics-based characters is of great interest for interactive applications and for automatic secondary character animation in the movie and video game industries. Recent works in reinforcement learning have shown impressive results for single-character simulation, especially those that use imitation-learning-based techniques. However, imitating the interactions and motions of multiple characters requires also modeling their interactions. In this paper, we propose a novel Multi-Agent Generative Adversarial Imitation Learning based approach that generalizes the idea of motion imitation for one character to deal with both the interactions and the motions of multiple physics-based characters. Two unstructured datasets are given as inputs: 1) a single-actor dataset containing motions of a single actor performing a set of motions linked to a specific application, and 2) an interaction dataset containing a few examples of interactions between multiple actors. Based on these datasets, our system trains control policies allowing each character to imitate the interactive skills associated with each actor, while preserving the intrinsic style. This approach has been tested on two different fighting styles, boxing and full-body martial arts, to demonstrate the ability of the method to imitate different styles.
Artificial intelligence and its applications have stood out over the last decade, presenting effective solutions in the most diverse sectors and areas of knowledge. In the environmental area, for example, several challenges are faced, mainly in managing natural resources, due to the numerous conflicts caused by increased demand for and scarcity of such resources. In this scenario, applications using multi-agent systems show promise. The main objective of this work is to present an organizational-level agent model for studying the complexity and functionalities of the characters of a role-playing game. The case study of this work is Gorim, an RPG game developed in the scenario of the watershed of Lagoa Mirim and Canal São Gonçalo in Rio Grande do Sul, Brazil. The organization modeling was developed in the MOISE+ model and verified on the JaCaMo platform through hypothetical scenarios. We demonstrated that the results of such scenarios validate the structural, functional, and deontic/normative levels of the Gorim RPG.
Non-playable characters (NPCs) play a crucial role in enhancing immersion in video games. However, traditional NPC behaviors are often hard-coded using methods such as Finite State Machines, Decision Trees, and Behavior Trees. This has two main limitations: it is difficult to implement complex cooperative behaviors, and it makes it easy for human players to identify and exploit patterns in behavior. To overcome these challenges, Reinforcement Learning (RL) can be used to generate dynamic, real-time NPC responses to human player actions. In this paper, we report first results of applying RL techniques to a non-zero-sum, adversarial asymmetric game, using a multi-agent team. The game environment simulates a museum heist, where the objective of the successfully trained team of robbers with different skills (Locksmith, Technician) is to steal valuable items from the museum without being detected by the scripted security guards and cameras. Both agents were trained concurrently with separate policies and received both individual and group reward signals. Through this training process, the agents learned to cooperate effectively and use their skills to maximize both individual and team benefits. These results demonstrate the feasibility of realizing the full game, where both robbers and security guards are trained at the same time to achieve their adversarial goals.
No abstract available
No abstract available
We present a novel computational model of emotion, personality and social relationships, implemented and evaluated in an existing commercial game (The Elder Scrolls V: Skyrim). We argue that Non-Player Characters (NPCs) with such capabilities afford a new experience in playing games, and we provide evidence supporting this. Applying the ERiSA Framework [1, 2] to the Skyrim Creation Kit, we designed a simple quest and two unique NPCs to interact with. When the ERiSA framework is used, players reported significant changes in their social relationship with the two NPCs compared to the baseline. Most importantly, the results further indicate that the models provide a new experience in playing games. In particular, players report enhanced emotional attachment to the NPCs and appear to forge relationships with the NPCs. Finally, the implemented models result in significant changes in the game engagement and game immersion scores.
Serious computer games have become increasingly popular; they also require more elaborate and natural behavior on the part of Non-Playing Characters. The more elaborate the interactions among characters are during a game, the more difficult it is to design these characters without the use of specialized tools geared towards implementing intelligent agents in a modular way. This thus seems to be an excellent area for the application of intelligent agent technology, which for the past two decades has been developed based on design concepts such as Goals, Intentions, Plans and Beliefs. A first attempt at connecting game engines with these types of agents has been made with Gamebots [1]. Gamebots provides an infrastructure that allows the interfacing of any agent platform to the computer game Unreal Tournament. Gamebots manages the provision of relevant information regarding the game state, while delivering commands for actions from the agents to Unreal. More recently, this package was used as the basis for more extensive middleware called Pogamut [2]. Although the aforementioned middleware does allow the interfacing of agents to the game engine, that in itself does not guarantee proper behavior of the agents in the game. In the workshop series on Agents for Games and Simulations [3,4], started in 2009, issues regarding the connection of agent technology to game engines have been discussed. Most of these issues derive from the fact that game engines are typically designed to be in total control of the game's progress. On the other hand, one of the major attributes of agents developed on MAS platforms is that they are autonomous (to some extent) and interact asynchronously. We want to exploit the benefits of having agents decide intelligently and autonomously about their next actions, while not losing control of the game. This balancing act leads to three broad categories of issues.
The first category is that of technical issues; an important issue in this category is that of coping with real-time environments. Unfortunately, agent technology has hardly bothered with real-time issues up until now. A major exception is the use of agent technology in robotics, where one obviously also has to deal with real-time environments. Maybe this is one of the reasons that, in robotics, people do not use standard (BDI) agent platforms as basis for
The Restaurant Game demonstrates an end-to-end system that captures and generates social behavior for virtual agents. Over 15,000 people have played The Restaurant Game, and we have developed a system to automatically learn patterns of interaction and dialogue from logs of their gameplay sessions. These patterns guide a case-based planning system, which generates behavior and dialogue for a virtual customer or waitress who can interact with a human, or with another agent. The Restaurant Game demonstrates a first step toward empowering non-programmers to realize socially intelligent characters for a wide range of applications.
No abstract available
No abstract available
Recent advancements in multi-agent systems based on large language models (LLM) have shown potential for problem-solving and planning tasks. However, most existing LLM-based multi-agent approaches show vulnerability against Byzantine attacks. First, agents instantiated on diverse LLMs may inherit biases present in the LLMs and thus exhibit deceptive behavior. Second, as the number of agents grows, collusive behavior among multiple malicious agents poses a potential threat. In this paper, we propose BlockAgents, an innovative framework that integrates blockchain into LLM-based cooperative multi-agent systems to mitigate Byzantine behaviors. BlockAgents completes multi-agent collaboration through a unified workflow including role assignment, proposal statement, evaluation, and decision-making. To help the agent who contributes the most to the group thinking process acquire accounting rights, we propose a proof-of-thought (PoT) consensus mechanism combined with stake-based miner designation and multi-round debate-style voting. To effectively distinguish valid and abnormal answers, we design a multi-metric prompt-based evaluation method for each evaluator to score each proposal by carefully and comprehensively considering multiple dimensions. Experiments on three datasets show that BlockAgents reduces the interference of poisoning attacks on accuracy to less than 3% and reduces the success rate of backdoor attacks to less than 5%, demonstrating its resistance to Byzantine attacks.
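The multi-metric evaluation step feeding the proof-of-thought selection could look roughly like the sketch below: each evaluator scores each proposal on several dimensions, and the proposal with the highest aggregate score wins. Metric names, the averaging rule, and the data layout are all illustrative assumptions, not BlockAgents' actual protocol.

```python
# proposals: agent_id -> list of per-evaluator metric-score dicts.

def aggregate_score(metric_scores: dict) -> float:
    """Average one evaluator's per-metric scores for a proposal."""
    return sum(metric_scores.values()) / len(metric_scores)

def select_winner(proposals: dict) -> str:
    """Pick the proposer whose proposal has the best mean evaluator score."""
    totals = {
        agent: sum(aggregate_score(s) for s in scores) / len(scores)
        for agent, scores in proposals.items()
    }
    return max(totals, key=totals.get)

proposals = {
    "agent_a": [{"correctness": 0.9, "coherence": 0.8},
                {"correctness": 0.7, "coherence": 0.9}],
    "agent_b": [{"correctness": 0.4, "coherence": 0.6},
                {"correctness": 0.5, "coherence": 0.5}],
}
print(select_winner(proposals))  # prints agent_a
```

In the full system this selection would additionally be weighted by stake and run over multiple debate rounds; the sketch shows only the scoring core.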
No abstract available
The potential of automatic task-solving through Large Language Model (LLM)-based multi-agent collaboration has recently garnered widespread attention from both the research community and industry. While utilizing natural language to coordinate multiple agents presents a promising avenue for democratizing agent technology for general users, designing coordination strategies remains challenging with existing coordination frameworks. This difficulty stems from the inherent ambiguity of natural language for specifying the collaboration process and the significant cognitive effort required to extract crucial information (e.g. agent relationship, task dependency, result correspondence) from a vast amount of text-form content during exploration. In this work, we present a visual exploration framework to facilitate the design of coordination strategies in multi-agent collaboration. We first establish a structured representation for LLM-based multi-agent coordination strategy to regularize the ambiguity of natural language. Based on this structure, we devise a three-stage generation method that leverages LLMs to convert a user's general goal into an executable initial coordination strategy. Users can further intervene at any stage of the generation process, utilizing LLMs and a set of interactions to explore alternative strategies. Whenever a satisfactory strategy is identified, users can commence the collaboration and examine the visually enhanced execution result. We develop AgentCoord, a prototype interactive system, and conduct a formal user study to demonstrate the feasibility and effectiveness of our approach.
Effective communication is essential in multi-agent reinforcement learning (MARL) for coordinating actions and maximizing collective rewards. Two common approaches for establishing communication are Graph Neural Networks (GNNs) and Transformers. Both methods introduce communication redundancy in complex scenarios. GNN-based methods model agent relationships through entire graph structures, leading to increased computational time. Transformers also increase computations due to self-attention calculations at each node. In this study, the ACORN (Acyclic Coordination with Reachability Networks) framework was introduced, utilizing acyclic coordination combined with a reachability-based attention mechanism. Only the most relevant nodes and connections in the GNN graph are used for self-attention calculations. Time complexity is reduced to O(|V| · n_k · d), which is significantly better than the O(|V|² · d) complexity of standard Transformers. Acyclicity is ensured through Auto-Regressive Policy Learning and Sequence-Based Critic Learning. Experiments demonstrate that ACORN outperforms state-of-the-art methods, achieving an average improvement of 11% over MAT in challenging SMACV2 tasks and a 17% improvement within the same training time and steps.
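The complexity gap above comes from attending over only the k most relevant neighbors per node rather than all |V| nodes. The sketch below shows that pattern with plain dot-product relevance; ACORN's actual reachability criterion is more involved, so treat this as a generic top-k attention illustration, not the paper's algorithm.

```python
import math

def topk_attention(queries, keys, values, k):
    """Softmax attention restricted to each query's k highest-scoring keys.

    Per query this costs O(k * d) for the weighted sum instead of O(|V| * d),
    which is the source of the complexity reduction discussed above
    (the key-scoring pass here is still dense for simplicity).
    """
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, key)) for key in keys]
        top = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
        weights = [math.exp(scores[i]) for i in top]
        z = sum(weights)
        out.append([
            sum(w / z * values[i][d] for w, i in zip(weights, top))
            for d in range(len(values[0]))
        ])
    return out

# One query attending to its 2 best-matching keys out of 3.
out = topk_attention([[1.0, 0.0]],
                     [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]],
                     [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]], k=2)
```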
Multi-agent Reinforcement Learning (MARL) has made significant progress in addressing coordination problems, but two key challenges persist in environments with partial observability: limited exploration and inaccurate evaluation of individual agents. To address these challenges, we propose a novel MARL framework that integrates Evolutionary Algorithms (EAs), episodic learning, and curiosity-driven exploration to optimize the coordination of joint policies using graph-based methods, named EECG. EAs are employed for their global optimization capabilities, particularly through population diversity and a gradient-free search mechanism, to enhance policy exploration. Initially, multiple agent teams explore and learn independently while sharing a common experience pool to enable data diversity. During the evolution phase, new joint policies are generated through crossover, mutation, and Pareto-based selection. During the RL phase, diverse data is used to model and update the relationships among agents via Graph Neural Networks (GNNs), which help evaluate the effectiveness of individual agents' behaviors. GNNs treat agents as nodes and their interactions as edges, capturing coordination relationships effectively while dynamically assigning representations to nodes and edges. Furthermore, curiosity-based exploration motivates teams to discover new states, while a memory system stores high-reward experiences. We evaluated EECG on several benchmarks, including StarCraft II, SUMO autonomous driving, and the Multi-Agent Particle Environment. Our empirical results show that EECG consistently outperforms current baselines, with its components significantly contributing to faster convergence, especially by improving exploration and agent coordination. Our code is available: https://github.com/MercyM/EECG.
In addressing the multi-agent adversarial coordination problem, existing multi-agent reinforcement learning algorithms primarily rely on team-based rewards to guide agent policy updates, often neglecting the utilization of inter-agent relationships, which limits their performance. Drawing inspiration from human tactics, we introduce the concept of tacit behavior to improve the efficiency of multi-agent reinforcement learning by refining the learning process. This paper presents a novel two-phase framework for learning Pre-trained Tacit Behavior for efficient multi-agent adversarial Coordination (PTBC), comprising a tacit pre-training phase and a centralized adversarial training phase. We demonstrate the superiority of our method through comparisons with several algorithms, each of which possesses distinct strengths.
Cooperative Multi-Agent Reinforcement Learning (MARL) necessitates seamless collaboration among agents, often represented by an underlying relation graph. Existing methods for learning this graph primarily focus on agent-pair relations, neglecting higher-order relationships. While several approaches attempt to extend cooperation modelling to encompass behaviour similarities within groups, they commonly fall short in concurrently learning the latent graph, thereby constraining the information exchange among partially observed agents. To overcome these limitations, we present a novel approach to infer the Group-Aware Coordination Graph (GACG), which is designed to capture both the cooperation between agent pairs based on current observations and group-level dependencies from behaviour patterns observed across trajectories. This graph is further used in graph convolution for information exchange between agents during decision-making. To further ensure behavioural consistency among agents within the same group, we introduce a group distance loss, which promotes group cohesion and encourages specialization between groups. Our evaluations, conducted on StarCraft II micromanagement tasks, demonstrate GACG's superior performance. An ablation study further provides experimental evidence of the effectiveness of each component of our method.
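The group distance loss described above, which "promotes group cohesion and encourages specialization between groups", can be sketched as a contrastive-style objective over agent embeddings: pull same-group embeddings together, push different groups apart up to a margin. The margin formulation is an assumption for illustration; GACG's exact loss may differ.

```python
def group_distance_loss(embeddings, groups, margin=1.0):
    """Sum of intra-group distances (cohesion term) plus margin-hinged
    inter-group similarity (specialization term), over all agent pairs.

    embeddings: list of equal-length vectors, one per agent.
    groups: list of group ids, aligned with embeddings.
    """
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    intra, inter = 0.0, 0.0
    n = len(embeddings)
    for i in range(n):
        for j in range(i + 1, n):
            d = dist(embeddings[i], embeddings[j])
            if groups[i] == groups[j]:
                intra += d                      # same group: minimize distance
            else:
                inter += max(0.0, margin - d)   # different groups: keep apart
    return intra + inter
```

Minimizing this drives same-group agents toward consistent behavior representations while keeping distinct groups at least `margin` apart.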
No abstract available
Effective collaboration in multi-agent systems requires communicating goals and intentions between agents. Current agent frameworks often suffer from dependencies on single-agent execution and lack robust inter-module communication, frequently leading to suboptimal multi-agent reinforcement learning (MARL) policies and inadequate task coordination. To address these challenges, we present a framework for training large language models (LLMs) as collaborative agents to enable coordinated behaviors in cooperative MARL. Each agent maintains a private intention consisting of its current goal and associated sub-tasks. Agents broadcast their intentions periodically, allowing other agents to infer coordination tasks. A propagation network transforms broadcast intentions into teammate-specific communication messages, sharing relevant goals with designated teammates. The architecture of our framework is structured into planning, grounding, and execution modules. During execution, multiple agents interact in a downstream environment and communicate intentions to enable coordinated behaviors. The grounding module dynamically adapts comprehension strategies based on emerging coordination patterns, while feedback from execution agents influences the planning module, enabling the dynamic re-planning of sub-tasks. Results in collaborative environment simulation demonstrate intention propagation reduces miscoordination errors by aligning sub-task dependencies between agents. Agents learn when to communicate intentions and which teammates require task details, resulting in emergent coordinated behaviors. This demonstrates the efficacy of intention sharing for cooperative multi-agent RL based on LLMs.
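The broadcast-then-filter step of intention propagation can be sketched as follows. Here the learned propagation network is replaced by a fixed relevance table, and the intention fields (`goal`, `subtasks`) are assumed names for illustration.

```python
def propagate(intentions: dict, relevance: dict) -> dict:
    """Turn broadcast intentions into teammate-specific messages.

    intentions: agent_id -> {"goal": str, "subtasks": [str, ...]}
    relevance: (sender, receiver) -> set of sub-task names worth forwarding
               (a stand-in for the learned propagation network).
    Returns: agent_id -> list of filtered messages from teammates.
    """
    inbox = {agent: [] for agent in intentions}
    for sender, intent in intentions.items():
        for receiver in intentions:
            if receiver == sender:
                continue
            shared = [t for t in intent["subtasks"]
                      if t in relevance.get((sender, receiver), set())]
            if shared:
                inbox[receiver].append(
                    {"from": sender, "goal": intent["goal"], "subtasks": shared})
    return inbox

intentions = {
    "a": {"goal": "open vault", "subtasks": ["pick_lock", "disable_camera"]},
    "b": {"goal": "scout", "subtasks": ["watch_exit"]},
}
relevance = {("a", "b"): {"disable_camera"}}
inbox = propagate(intentions, relevance)
```

Only the sub-task relevant to agent "b" is forwarded, which is the filtering behavior that lets agents learn "which teammates require task details".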
This paper addresses the design and coordination challenges of controlling non-player characters' (NPCs) behaviors in multi-agent systems (MAS) using behavior trees (BTs), which are preferred over finite state machines (FSMs) due to their hierarchical structure and ease of maintenance. While BTs effectively resolve the question of "What to do next?" for individual NPCs, their application in MAS, particularly with the integration of a blackboard system for central control, reveals limitations in efficiency, robustness, and heuristic capacity as system complexity increases. To explore solutions to these challenges, this study analyzes various algorithms that enhance the functionality of behavior trees within MAS. The research focuses on three primary areas: the optimization of behavior trees, the development of behavior tree search algorithms, and the improvement of communication algorithms within BTs. Methods involve a comparative analysis of existing and new algorithmic approaches to identify and address inefficiencies in NPC coordination. The findings indicate that advanced behavior tree configurations, when combined with innovative search and communication strategies, significantly improve the coordination, robustness, and operational efficiency of MAS. These enhancements allow for more dynamic and responsive NPC interactions in complex gaming environments.
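The behavior-tree-plus-blackboard pattern discussed above can be illustrated with a toy implementation: nodes read and write a shared blackboard dict, and a sequence node ticks its children in order. This is a generic sketch, not tied to any engine or to the specific algorithms the study analyzes.

```python
class Sequence:
    """Ticks children in order; fails as soon as one child fails."""
    def __init__(self, *children):
        self.children = children
    def tick(self, bb):
        for child in self.children:
            if child.tick(bb) != "SUCCESS":
                return "FAILURE"
        return "SUCCESS"

class Condition:
    """Succeeds if the blackboard key is truthy."""
    def __init__(self, key):
        self.key = key
    def tick(self, bb):
        return "SUCCESS" if bb.get(self.key) else "FAILURE"

class Action:
    """Runs a callable against the blackboard and reports success."""
    def __init__(self, fn):
        self.fn = fn
    def tick(self, bb):
        self.fn(bb)
        return "SUCCESS"

bb = {"enemy_visible": True}  # shared blackboard for central control
tree = Sequence(Condition("enemy_visible"),
                Action(lambda b: b.update(state="attack")))
print(tree.tick(bb), bb["state"])  # prints SUCCESS attack
```

In a MAS setting, many NPC trees would tick against the same blackboard, which is exactly where the coordination and robustness issues the study examines arise.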
Autonomous agents operating in real-world environments frequently encounter undesirable outcomes or negative side effects (NSEs) when working collaboratively alongside other agents. Even when agents can execute their primary task optimally when operating in isolation, their training may not account for potential negative interactions that arise in the presence of other agents. We frame the challenge of minimizing NSEs as a lexicographic decentralized Markov decision process in which we assume independence of rewards and transitions with respect to the primary assigned tasks, but recognize that addressing negative side effects creates a form of dependence among the agents. We present a lexicographic Q-learning approach to mitigate the NSEs using human feedback models while maintaining near-optimality with respect to the assigned tasks---up to some given slack. Our empirical evaluation across two domains demonstrates that our collaborative approach effectively mitigates NSEs, outperforming non-collaborative methods.
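The lexicographic idea above, near-optimality on the primary task "up to some given slack", with side effects minimized within that set, can be sketched as an action-selection rule over two Q-functions. The toy dict-based Q-tables and names below are illustrative assumptions, not the paper's implementation.

```python
def lexicographic_action(q_primary: dict, q_nse: dict, slack: float) -> str:
    """Pick the lowest-side-effect action among near-optimal primary actions.

    q_primary: action -> value for the assigned task.
    q_nse: action -> expected negative-side-effect penalty (lower is better).
    slack: allowed loss relative to the best primary-task value.
    """
    best = max(q_primary.values())
    acceptable = [a for a, q in q_primary.items() if q >= best - slack]
    return min(acceptable, key=lambda a: q_nse[a])

q_primary = {"left": 10.0, "right": 9.6, "wait": 4.0}
q_nse = {"left": 2.0, "right": 0.5, "wait": 0.0}
print(lexicographic_action(q_primary, q_nse, slack=0.5))  # prints right
```

With zero slack the agent ignores side effects entirely; widening the slack trades a bounded amount of task value for fewer NSEs, which is the knob the lexicographic formulation exposes.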
Large Language Models (LLMs) have demonstrated emergent common-sense reasoning and Theory of Mind (ToM) capabilities, making them promising candidates for developing coordination agents. This study introduces the LLM-Coordination Benchmark, a novel benchmark for analyzing LLMs in the context of Pure Coordination Settings, where agents must cooperate to maximize gains. Our benchmark evaluates LLMs through two distinct tasks. The first is Agentic Coordination, where LLMs act as proactive participants in four pure coordination games. The second is Coordination Question Answering (CoordQA), which tests LLMs on 198 multiple-choice questions across these games to evaluate three key abilities: Environment Comprehension, ToM Reasoning, and Joint Planning. Results from Agentic Coordination experiments reveal that LLM-Agents excel in multi-agent coordination settings where decision-making primarily relies on environmental variables but face challenges in scenarios requiring active consideration of partners' beliefs and intentions. The CoordQA experiments further highlight significant room for improvement in LLMs' Theory of Mind reasoning and joint planning capabilities. Zero-Shot Coordination (ZSC) experiments in the Agentic Coordination setting demonstrate that LLM agents, unlike RL methods, exhibit robustness to unseen partners. These findings indicate the potential of LLMs as Agents in pure coordination setups and underscore areas for improvement. Code Available at https://github.com/eric-ai-lab/llm_coordination.
No abstract available
Cooperative Multi-agent Reinforcement Learning (CMARL) has shown to be promising for many real-world applications. Previous works mainly focus on improving coordination ability via solving MARL-specific challenges (e.g., non-stationarity, credit assignment, scalability), but ignore the policy perturbation issue when testing in a different environment. This issue has not been considered in problem formulation or efficient algorithm design. To address this issue, we first model the problem as a Limited Policy Adversary Dec-POMDP (LPA-Dec-POMDP), where some coordinators from a team might accidentally and unpredictably encounter a limited number of malicious action attacks, but the regular coordinators still strive for the intended goal. Then, we propose Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers (ROMANCE), which enables the trained policy to encounter diversified and strong auxiliary adversarial attacks during training, thus achieving high robustness under various policy perturbations. Concretely, to avoid the ego-system overfitting to a specific attacker, we maintain a set of attackers, which is optimized to guarantee the attackers high attacking quality and behavior diversity. The goal of quality is to minimize the ego-system coordination effect, and a novel diversity regularizer based on sparse action is applied to diversify the behaviors among attackers. The ego-system is then paired with a population of attackers selected from the maintained attacker set, and alternately trained against the constantly evolving attackers. Extensive experiments on multiple scenarios from SMAC indicate our ROMANCE provides comparable or better robustness and generalization ability than other baselines.
Large language models (LLMs) have achieved remarkable results across diverse downstream tasks, but their monolithic nature restricts scalability and efficiency in complex problem-solving. While recent research explores multi-agent collaboration among LLMs, most approaches rely on static organizational structures that struggle to adapt as task complexity and agent numbers grow, resulting in coordination overhead and inefficiencies. To this end, we propose a puppeteer-style paradigm for LLM-based multi-agent collaboration, where a centralized orchestrator ("puppeteer") dynamically directs agents ("puppets") in response to evolving task states. This orchestrator is trained via reinforcement learning to adaptively sequence and prioritize agents, enabling flexible and evolvable collective reasoning. Experiments on closed- and open-domain scenarios show that this method achieves superior performance with reduced computational costs. Analyses further reveal that the key improvements consistently stem from the emergence of more compact, cyclic reasoning structures under the orchestrator's evolution. Our code is available at https://github.com/OpenBMB/ChatDev/tree/puppeteer.
Zero-shot coordination (ZSC) is a new cooperative multi-agent reinforcement learning (MARL) challenge that aims to train an ego agent to work with diverse, unseen partners during deployment. The significant difference between the deployment-time partners' distribution and the training partners' distribution determined by the training algorithm makes ZSC a unique out-of-distribution (OOD) generalization challenge. The potential distribution gap between evaluation and deployment-time partners leads to inadequate evaluation, which is exacerbated by the lack of appropriate evaluation metrics. In this paper, we present ZSC-Eval, the first evaluation toolkit and benchmark for ZSC algorithms. ZSC-Eval consists of: 1) Generation of evaluation partner candidates through behavior-preferring rewards to approximate deployment-time partners' distribution; 2) Selection of evaluation partners by Best-Response Diversity (BR-Div); 3) Measurement of generalization performance with various evaluation partners via the Best-Response Proximity (BR-Prox) metric. We use ZSC-Eval to benchmark ZSC algorithms in Overcooked and Google Research Football environments and get novel empirical findings. We also conduct a human experiment of current ZSC algorithms to verify the ZSC-Eval's consistency with human evaluation. ZSC-Eval is now available at https://github.com/sjtu-marl/ZSC-Eval.
Training multiple agents to coordinate is an essential problem with applications in robotics, game theory, economics, and social sciences. However, most existing Multi-Agent Reinforcement Learning (MARL) methods are online and thus impractical for real-world applications in which collecting new interactions is costly or dangerous. While these algorithms should leverage offline data when available, doing so gives rise to what we call the offline coordination problem. Specifically, we identify and formalize the strategy agreement (SA) and the strategy fine-tuning (SFT) coordination challenges, two issues at which current offline MARL algorithms fail. Concretely, we reveal that the prevalent model-free methods are severely deficient and cannot handle coordination-intensive offline multi-agent tasks in either toy or MuJoCo domains. To address this setback, we emphasize the importance of inter-agent interactions and propose the very first model-based offline MARL method. Our resulting algorithm, Model-based Offline Multi-Agent Proximal Policy Optimization (MOMA-PPO) generates synthetic interaction data and enables agents to converge on a strategy while fine-tuning their policies accordingly. This simple model-based solution solves the coordination-intensive offline tasks, significantly outperforming the prevalent model-free methods even under severe partial observability and with learned world models.
In this paper, we propose a maximum mutual information (MMI) framework for multi-agent reinforcement learning (MARL) to enable multiple agents to learn coordinated behaviors by regularizing the accumulated return with the simultaneous mutual information between multi-agent actions. By introducing a latent variable to induce nonzero mutual information between multi-agent actions and applying a variational bound, we derive a tractable lower bound on the considered MMI-regularized objective function. The derived tractable objective can be interpreted as maximum entropy reinforcement learning combined with uncertainty reduction of other agents' actions. Applying policy iteration to maximize the derived lower bound, we propose a practical algorithm named variational maximum mutual information multi-agent actor-critic (VM3-AC), which follows centralized learning with decentralized execution (CTDE). We evaluated VM3-AC for several games requiring coordination, and numerical results show that VM3-AC outperforms other MARL algorithms in multi-agent tasks requiring high-quality coordination.
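The MMI-regularized objective described above, return plus mutual information among simultaneous actions, can be written schematically as follows; the notation (discount $\gamma$, weight $\alpha$, $N$ agents) is assumed here and may differ from the paper's exact formulation:

```latex
\max_{\pi} \; \mathbb{E}_{\pi}\!\left[ \sum_{t} \gamma^{t}
  \Big( r_t + \alpha \, I\!\big(a_t^{1}; \dots; a_t^{N} \mid s_t\big) \Big) \right]
```

The mutual information term $I(\cdot)$ is intractable in general, which is why the paper introduces a latent variable and a variational bound to obtain the trainable lower bound that decomposes into maximum-entropy RL plus a term reducing uncertainty about other agents' actions.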
As LLMs are increasingly studied as role-playing agents to generate synthetic data for human behavioral research, ensuring that their outputs remain coherent with their assigned roles has become a critical concern. In this paper, we investigate how consistently LLM-based role-playing agents' stated beliefs about the behavior of the people they are asked to role-play ("what they say") correspond to their actual behavior during role-play ("how they act"). Specifically, we establish an evaluation framework to rigorously measure how well beliefs obtained by prompting the model can predict simulation outcomes in advance. Using an augmented version of the GenAgents persona bank and the Trust Game (a standard economic game used to quantify players' trust and reciprocity), we introduce a belief-behavior consistency metric to systematically investigate how it is affected by factors such as: (1) the types of beliefs we elicit from LLMs, like expected outcomes of simulations versus task-relevant attributes of individual characters LLMs are asked to simulate; (2) when and how we present LLMs with relevant information about the Trust Game; and (3) how far into the future we ask the model to forecast its actions. We also explore how feasible it is to impose a researcher's own theoretical priors in the event that the originally elicited beliefs are misaligned with research objectives. Our results reveal systematic inconsistencies between LLMs' stated (or imposed) beliefs and the outcomes of their role-playing simulations, at both the individual and population level. Specifically, we find that, even when models appear to encode plausible beliefs, they may fail to apply them in a consistent way. These findings highlight the need to identify how and when LLMs' stated beliefs align with their simulated behavior, allowing researchers to use LLM-based agents appropriately in behavioral studies.
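A minimal sketch of one way such a belief-behavior consistency metric could be computed, assuming a toy Trust Game with a 10-unit endowment; the scoring rule and numbers are invented, not the paper's:

```python
def consistency(beliefs, behaviors):
    """One possible belief-behavior consistency score: 1 minus the
    normalized mean absolute gap between what an agent predicted it
    would send in the Trust Game and what it actually sent."""
    assert len(beliefs) == len(behaviors)
    max_send = 10  # assumed endowment in the toy Trust Game
    gaps = [abs(b - a) / max_send for b, a in zip(beliefs, behaviors)]
    return 1 - sum(gaps) / len(gaps)

stated = [7, 3, 5, 9]  # amounts each persona said it would send
acted  = [4, 3, 6, 5]  # amounts sent during the role-play simulation
print(round(consistency(stated, acted), 3))  # 0.8
```

A score of 1 would mean perfectly aligned beliefs and behavior; the paper's point is that real elicited scores fall systematically short of that.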
No abstract available
No abstract available
Much of NLP research has focused on crowd-sourced static datasets and the supervised learning paradigm of training once and then evaluating test performance. As argued in de Vries et al. (2020), crowdsourced data has the issues of lack of naturalness and relevance to real-world use cases, while the static dataset paradigm does not allow for a model to learn from its experiences of using language (Silver et al., 2013). In contrast, one might hope for machine learning systems that become more useful as they interact with people. In this work, we build and deploy a role-playing game, whereby human players converse with learning agents situated in an open-domain fantasy world. We show that by training models on the conversations they have with humans in the game the models progressively improve, as measured by automatic metrics and online engagement scores. This learning is shown to be more efficient than crowdsourced data when applied to conversations with real users, as well as being far cheaper to collect.
Recently, large language model (LLM)-based agents have made significant advances across various fields. One of the most popular research areas involves applying these agents to video games. Traditionally, these methods have relied on game APIs to access in-game environmental and action data. However, this approach is limited by the availability of APIs and does not reflect how humans play games. With the advent of vision language models (VLMs), agents now have enhanced visual understanding capabilities, enabling them to interact with games using only visual inputs. Despite these advances, current approaches still face challenges in action-oriented tasks, particularly in action role-playing games (ARPGs), where reinforcement learning methods are prevalent but suffer from poor generalization and require extensive training. To address these limitations, we select an ARPG, ``Black Myth: Wukong'', as a research platform to explore the capability boundaries of existing VLMs in scenarios requiring visual-only input and complex action output. We define 12 tasks within the game, with 75% focusing on combat, and incorporate several state-of-the-art VLMs into this benchmark. Additionally, we will release a human operation dataset containing recorded gameplay videos and operation logs, including mouse and keyboard actions. Moreover, we propose a novel VARP (Vision Action Role-Playing) agent framework, consisting of an action planning system and a visual trajectory system. Our framework demonstrates the ability to perform basic tasks and succeed in 90% of easy and medium-level combat scenarios. This research aims to provide new insights and directions for applying multimodal agents in complex action game environments. The code and datasets will be made available at https://varp-agent.github.io/.
No abstract available
In role-playing games (RPGs), the level of immersion is critical, especially when an in-game agent conveys tasks, hints, or ideas to the player. For an agent to accurately interpret the player's emotional state and contextual nuances, a foundational level of understanding is required, which can be achieved using a Large Language Model (LLM). Maintaining the LLM's focus across multiple context changes, however, necessitates a more robust approach, such as integrating the LLM with a dedicated task allocation model to guide its performance throughout gameplay. In response to this need, we introduce Voting-Based Task Assignment (VBTA), a framework inspired by human reasoning in task allocation and completion. VBTA assigns capability profiles to agents and task descriptions to tasks, then generates a suitability matrix that quantifies the alignment between an agent's abilities and a task's requirements. Leveraging six distinct voting methods, a pre-trained LLM, and integrating conflict-based search (CBS) for path planning, VBTA efficiently identifies and assigns the most suitable agent to each task. While existing approaches focus on generating individual aspects of gameplay, such as single quests or combat encounters, our method shows promise when generating both unique combat encounters and narratives because of its generalizable nature.
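The suitability-matrix idea can be sketched as follows; the skill axes, dot-product scoring, and greedy assignment are assumptions standing in for VBTA's capability profiles and its six voting methods:

```python
# Toy capability profiles and task requirements over the same skill axes.
agents = {"guard": [0.9, 0.2, 0.4], "scout": [0.3, 0.9, 0.5]}
tasks  = {"patrol": [1.0, 0.0, 0.5], "recon":  [0.0, 1.0, 0.2]}

def suitability(agent_vec, task_vec):
    # Dot product as one simple alignment score between abilities and
    # requirements; the paper's exact scoring is not specified here.
    return sum(a * t for a, t in zip(agent_vec, task_vec))

def assign(agents, tasks):
    """Greedy stand-in for the voting stage: each task goes to the
    agent with the highest suitability score."""
    matrix = {(a, t): suitability(av, tv)
              for a, av in agents.items() for t, tv in tasks.items()}
    return {t: max(agents, key=lambda a: matrix[(a, t)]) for t in tasks}

print(assign(agents, tasks))  # {'patrol': 'guard', 'recon': 'scout'}
```

In VBTA the matrix would instead feed the voting methods, and CBS would plan conflict-free paths for the chosen agents.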
Board games are extensively studied in the AI community because of their ability to reflect real-world problems with a high level of abstraction, and their irreplaceable role as testbeds of state-of-the-art AI algorithms. Modern board games are commonly featured with partially observable state spaces and imperfect information. Despite some recent successes in AI tackling perfect information board games like chess and Go, most imperfect information games are still challenging and have yet to be solved. This paper empirically explores the capabilities of a state-of-the-art Reinforcement Learning (RL) algorithm – Proximal Policy Optimization (PPO) – in playing Ticket to Ride, a popular board game with features of imperfect information, large state-action space, and delayed rewards. This paper explores the feasibility of the proposed generalizable modelling and training schemes using a general-purpose RL algorithm with no domain knowledge-based heuristics beyond game rules, game states and scores to tackle this complex imperfect information game. The performance of the proposed methodology is demonstrated in a scaled-down version of Ticket to Ride with a range of RL agents obtained with different training schemes. All RL agents achieve clear advantages over a set of well-designed heuristic agents. The agent constructed through a self-play training scheme outperforms the other RL agents in a Round Robin tournament. The high performance and versatility of this self-play agent provide a solid demonstration of the capabilities of this framework.
Humans are highly co-operative and thus cognitively, affectively, and motivationally tuned to pursue shared goals. Yet, cooperative tasks typically require people to constantly take and switch individual roles. Task relevance is dictated by these roles and thereby dynamically changing. Here, we designed a dyadic game to test whether the family of P3 components can trace this dynamic allocation of task relevance. We demonstrate that late positive event-related potential (ERP) modulations not only reflect predictable asymmetries between receiving and sending information but also differentiate whether the receiver's role is related to correct decision making or action monitoring. Furthermore, similar results were observed when playing the game with a computer, suggesting that experimental games may motivate humans to similarly cooperate with an artificial agent. Overall, late positive ERP waves provide a real-time measure of how role taking dynamically shapes the meaning and relevance of stimuli within collaborative contexts. Our results, therefore, shed light on how the processes of mutual coordination unfold during dyadic cooperation.
No abstract available
In this paper, we explore the potential of Large Language Models (LLMs) Agents in playing the strategic social deduction game, Resistance Avalon. Players in Avalon are challenged not only to make informed decisions based on dynamically evolving game phases, but also to engage in discussions where they must deceive, deduce, and negotiate with other players. These characteristics make Avalon a compelling test-bed to study the decision-making and language-processing capabilities of LLM Agents. To facilitate research in this line, we introduce AvalonBench - a comprehensive game environment tailored for evaluating multi-agent LLM Agents. This benchmark incorporates: (1) a game environment for Avalon, (2) rule-based bots as baseline opponents, and (3) ReAct-style LLM agents with tailored prompts for each role. Notably, our evaluations based on AvalonBench highlight a clear capability gap. For instance, models like ChatGPT playing the good role achieve a win rate of 22.2% against rule-based bots playing evil, while a good-role bot achieves a 38.2% win rate in the same setting. We envision AvalonBench could be a good test-bed for developing more advanced LLMs (with self-playing) and agent frameworks that can effectively model the layered complexities of such game environments.
No abstract available
No abstract available
Computer role-playing games (RPGs) often include a simulated morality system as a core design element. Games’ morality systems can include both god’s eye view aspects, in which certain actions are inherently judged by the simulated world to be good or evil, as well as social simulations, in which non-player characters (NPCs) react to judgments of the player’s and each other’s activities. Games with a larger amount of social simulation have clear affinities to multi-agent systems (MAS) research on artificial societies. They differ in a number of key respects, however, due to a mixture of pragmatic game-design considerations and their typically strong embeddedness in narrative arcs, resulting in many important aspects of moral systems being represented using explicitly scripted scenarios rather than through agent-based simulations. In this position paper, we argue that these similarities and differences make RPGs a promising challenge domain for MAS research, highlighting features such as moral dilemmas situated in more organic settings than seen in game-theoretic models of social dilemmas, and heterogeneous representations of morality that use both moral calculus systems and social simulation. We illustrate some possible approaches using a case study of the morality systems in the game The Elder Scrolls IV: Oblivion.
No abstract available
No abstract available
No abstract available
The emergence of complex life on Earth is often attributed to the arms race that ensued from a huge number of organisms all competing for finite resources. We present an artificial intelligence research environment, inspired by the human game genre of MMORPGs (Massively Multiplayer Online Role-Playing Games, a.k.a. MMOs), that aims to simulate this setting in microcosm. As with MMORPGs and the real world alike, our environment is persistent and supports a large and variable number of agents. Our environment is well suited to the study of large-scale multiagent interaction: it requires that agents learn robust combat and navigation policies in the presence of large populations attempting to do the same. Baseline experiments reveal that population size magnifies and incentivizes the development of skillful behaviors and results in agents that outcompete agents trained in smaller populations. We further show that the policies of agents with unshared weights naturally diverge to fill different niches in order to avoid competition.
Game-based benchmarks have been playing an essential role in the development of Artificial Intelligence (AI) techniques. Providing diverse challenges is crucial to push research toward innovation and understanding in modern techniques. Rinascimento provides a parameterised partially-observable multiplayer card-based board game; these parameters can easily modify the rules, objectives and items in the game. We describe the framework and all its features, as well as the game-playing challenge, providing baseline game-playing AIs and an analysis of their skills. We give agents’ hyper-parameter tuning a central role in the experiments, highlighting how heavily it can influence performance. The baseline agents contain several additional contributions to Statistical Forward Planning algorithms.
No abstract available
Energy communities (ECs) are emerging frameworks where citizens collectively share renewable energy. Leveraging knowledge about this topic is challenging given how varied these communities can be and how many actors are involved in decision-making. We are developing En-join, a game in which the player has to solve open-ended challenges that are mediated and evaluated by conversational agents representing members of an EC. We implemented and prompted an LLM (Phi-4) to perform role-playing and evaluation simultaneously. We tested prompt variants indicating personality and behavior and meta-evaluated the evaluation performance using six predefined answers across three levels. Our results suggest that indicating social preferences noticeably affects evaluation behavior. We contribute to the field of games and serious games by showing how LLMs can be used as conversational characters and evaluator agents simultaneously, and suggest that role-playing may affect evaluation behavior in any LLM implementation.
No abstract available
No abstract available
Machine learning is having a significant impact on video games, from speeding up the development of new games to the development of new methods in gaming. This study discusses the impact of using ML models in the development and improvement of video games. The examples cited in this paper relate to the ways in which NPCs behave. With the aid of Unity's ML-Agents toolkit, developers can create NPCs that respond to different strategies in real time, making games more exciting and complex. The paper also outlines the possibilities of single-player games, such as first-person shooters and role-playing games, for observing NPC behavior during gameplay. In these gaming styles, RL frameworks can be applied to NPCs to enable them to learn and adapt to the environment, creating an interactive experience for the player. The conclusions put an emphasis on the evolution of video games with the advent of ML tools; as predicted, NPC behavior becomes more intuitive and less predictable.
This study explores the application of generative artificial intelligence (AI) in video games, focusing on improving content richness and personalized player experience. Traditional non-player character (NPC) systems have limitations in adaptability and interactivity. This study explores how generative AI can improve NPC behavior and dialogue. Then, this paper proposes a framework that combines generative agents, Transformer-based dialogue models, and reinforcement learning. Specifically, the generative agent simulates memory-driven planning, the Transformer model generates context-aware dialogues, and reinforcement learning supports adaptive interactions. This study is based on a generative agent dataset and a Role-playing game (RPG) dialogue corpus. The results show that the proposed method enhances the realism of NPCs, the coherence of game narratives, and the responsiveness of player interactions, and improves player immersion and the diversity of interactions. This provides practical insights for scalable intelligent game development and shows the potential of artificial intelligence in automating complex content creation and points out the direction for the future combination of games and artificial intelligence.
Reward acts as a signal to guide the agent’s learning process in Reinforcement Learning (RL), evaluating and assigning rewards to the agent’s actions based on their alignment with goals. Designing rewards is challenging in multi-agent environments such as the StarCraft II benchmark, since agents face credit allocation and role adaptation problems. Recent studies have successfully exploited the language understanding and reasoning capabilities of large language models (LLMs) to learn manipulation tasks. Impressed by the remarkable power of LLMs, this paper employs LLMs as role-specific reward designers for playing StarCraft II, making rewards more flexible and task-oriented. Firstly, we develop an interactive text and multi-agent RL environment to study real-time strategy generation in StarCraft II. Secondly, we use LLMs to interpret the game situation and understand agent roles from user instructions. Then, by assigning appropriate subtasks, LLMs quantify the completion of these subtasks to generate role-specific rewards. Further, the credit assignment problem is addressed by introducing dynamic reward weights in the value decomposition method. In StarCraft II maps, experiments show that role-aligned RL agents trained with our framework achieve superior policy performance, and win-rate results demonstrate the effectiveness of our approach in decision-making for micromanagement and long-term planning.
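A minimal sketch of role-specific reward shaping in this spirit, where per-role subtask completion scores are combined with dynamic weights; the roles, subtasks, and numbers are invented for illustration:

```python
def role_reward(subtask_scores, weights):
    """Weighted sum of per-subtask completion scores, mirroring the idea
    of dynamic reward weights in value decomposition; numbers are made up."""
    assert len(subtask_scores) == len(weights)
    return sum(w * s for w, s in zip(weights, subtask_scores))

# An LLM might assign a "harass" role the subtasks (deal damage, stay alive)
# and a "tank" role (absorb damage, protect allies), with different weights.
harass = role_reward([0.8, 0.5], weights=[0.7, 0.3])
tank   = role_reward([0.6, 0.9], weights=[0.4, 0.6])
print(round(harass, 2), round(tank, 2))  # 0.71 0.78
```

In the paper's setting the LLM would propose both the subtasks and their completion quantification; here both are fixed by hand.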
Decision-making is a basic component of agents’ (e.g., intelligent sensors) behaviors, in which one’s cognition plays a crucial role in the process and outcome. Extensive games, a class of interactive decision-making scenarios, have been studied in diverse fields. Recently, a model of extensive games was proposed in which agent cognition of the structure of the underlying game and the quality of the game situations are encoded by artificial neural networks. This model refines the classic model of extensive games, and the corresponding equilibrium concept—cognitive perfect equilibrium (CPE)—differs from the classic subgame perfect equilibrium, since CPE takes agent cognition into consideration. However, this model neglects the consideration that game-playing processes are greatly affected by agents’ cognition of their opponents. To this end, in this work, we go one step further by proposing a framework in which agents’ cognition of their opponents is incorporated. A method is presented for evaluating opponents’ cognition about the game being played, and thus, an algorithm designed for playing such games is analyzed. The resulting equilibrium concept is defined as adversarial cognition equilibrium (ACE). By means of a running example, we demonstrate that the ACE is more realistic than the CPE, since it involves learning about opponents’ cognition. Further results are presented regarding the computational complexity, soundness, and completeness of the game-solving algorithm and the existence of the equilibrium solution. This model suggests the possibility of enhancing an agent’s strategic ability by evaluating opponents’ cognition.
No abstract available
Although memory capabilities of AI agents are gaining increasing attention, existing solutions remain fundamentally limited. Most rely on flat, narrowly scoped memory components, constraining their ability to personalize, abstract, and reliably recall user-specific information over time. To this end, we introduce MIRIX, a modular, multi-agent memory system that redefines the future of AI memory by solving the field's most critical challenge: enabling language models to truly remember. Unlike prior approaches, MIRIX transcends text to embrace rich visual and multimodal experiences, making memory genuinely useful in real-world scenarios. MIRIX consists of six distinct, carefully structured memory types: Core, Episodic, Semantic, Procedural, Resource Memory, and Knowledge Vault, coupled with a multi-agent framework that dynamically controls and coordinates updates and retrieval. This design enables agents to persist, reason over, and accurately retrieve diverse, long-term user data at scale. We validate MIRIX in two demanding settings. First, on ScreenshotVQA, a challenging multimodal benchmark comprising nearly 20,000 high-resolution computer screenshots per sequence, requiring deep contextual understanding and where no existing memory systems can be applied, MIRIX achieves 35% higher accuracy than the RAG baseline while reducing storage requirements by 99.9%. Second, on LOCOMO, a long-form conversation benchmark with single-modal textual input, MIRIX attains state-of-the-art performance of 85.4%, far surpassing existing baselines. These results show that MIRIX sets a new performance standard for memory-augmented LLM agents. To allow users to experience our memory system, we provide a packaged application powered by MIRIX. It monitors the screen in real time, builds a personalized memory base, and offers intuitive visualization and secure local storage to ensure privacy.
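The six memory types can be pictured as a routed store, as in this sketch; the routing rules and item strings are invented, and only the type names come from the abstract:

```python
# The six MIRIX memory types, with a toy router that files an incoming
# observation under one type; the real system coordinates updates and
# retrieval through a multi-agent framework.
MEMORY_TYPES = ["core", "episodic", "semantic", "procedural",
                "resource", "knowledge_vault"]

store = {m: [] for m in MEMORY_TYPES}

def route(item, kind):
    if kind not in MEMORY_TYPES:
        raise ValueError(f"unknown memory type: {kind}")
    store[kind].append(item)

route("user prefers dark mode", "core")
route("opened report.pdf at 9:05", "episodic")
route("steps to export a chart", "procedural")
print([m for m in MEMORY_TYPES if store[m]])
# ['core', 'episodic', 'procedural']
```

In MIRIX the routing decision itself is made by coordinating agents rather than by an explicit label, which is the part this sketch deliberately simplifies.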
LLM-based multi-agent systems (MAS) have shown significant potential in tackling diverse tasks. However, to design effective MAS, existing approaches heavily rely on manual configurations or multiple calls of advanced LLMs, resulting in inadaptability and high inference costs. In this paper, we simplify the process of building an MAS by reframing it as a generative language task, where the input is a user query and the output is a corresponding MAS. To address this novel task, we unify the representation of MAS as executable code and propose a consistency-oriented data construction pipeline to create a high-quality dataset comprising coherent and consistent query-MAS pairs. Using this dataset, we train MAS-GPT, an open-source medium-sized LLM that is capable of generating query-adaptive MAS within a single LLM inference. The generated MAS can be seamlessly applied to process user queries and deliver high-quality responses. Extensive experiments on 9 benchmarks and 5 LLMs show that the proposed MAS-GPT consistently outperforms 10+ baseline MAS methods on diverse settings, indicating MAS-GPT's high effectiveness, efficiency and strong generalization ability. Code will be available at https://github.com/rui-ye/MAS-GPT.
Multi-agent systems (MAS) have emerged as a promising approach for enhancing the reasoning capabilities of large language models in complex problem-solving; however, current MAS frameworks suffer from poor flexibility and scalability with underdeveloped optimization strategies. To address these challenges, we propose ReSo, which integrates task graph generation with a reward-driven two-stage agent selection process centered on our Collaborative Reward Model that provides fine-grained reward signals to optimize MAS cooperation. We also introduce an automated data synthesis framework for generating MAS benchmarks without any human annotations. Experimental results show that ReSo matches or outperforms existing methods, achieving 33.7 percent accuracy on Math-MAS and 32.3 percent accuracy on SciBench-MAS, where other approaches completely fail.
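The reward-driven second-stage selection can be sketched as scoring candidates with a reward model and keeping the top k; the candidate names and scores below are hypothetical:

```python
def select_agents(candidates, reward_model, top_k=2):
    """Second-stage selection: score each candidate with a collaborative
    reward model and keep the top performers; scores are illustrative."""
    scored = sorted(candidates, key=reward_model, reverse=True)
    return scored[:top_k]

# Hypothetical fine-grained reward signals for four candidate agents.
rewards = {"math_solver": 0.92, "coder": 0.75, "critic": 0.88, "planner": 0.60}
chosen = select_agents(list(rewards), rewards.get, top_k=2)
print(chosen)  # ['math_solver', 'critic']
```

In ReSo this selection sits downstream of task graph generation, so each subtask of the graph gets its own reward-ranked agents.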
Leveraging more test-time computation has proven to be an effective way to boost the reasoning capabilities of large language models (LLMs). Among various methods, the verify-and-improve paradigm stands out for enabling dynamic solution exploration and feedback incorporation. However, existing approaches often suffer from restricted feedback spaces and lack of coordinated training of different parties, leading to suboptimal performance. To address this, we model this multi-turn refinement process as a Markov Decision Process and introduce DPSDP (Direct Policy Search by Dynamic Programming), a reinforcement learning algorithm that trains an actor-critic LLM system to iteratively refine answers via direct preference learning on self-generated data. Theoretically, DPSDP can match the performance of any policy within the training distribution. Empirically, we instantiate DPSDP with various base models and show improvements on both in- and out-of-distribution benchmarks. For example, on benchmark MATH 500, majority voting over five refinement steps increases first-turn accuracy from 58.2% to 63.2% with Ministral-based models. An ablation study further confirms the benefits of multi-agent collaboration and out-of-distribution generalization.
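A toy picture of the multi-turn refine-then-vote loop described above; the `refine` rule is an invented placeholder for critic feedback, not DPSDP's learned actor-critic policy:

```python
from collections import Counter

def refine(answer):
    """Toy critic step: move the current answer halfway toward a fixed
    target, standing in for feedback-driven improvement (numbers invented)."""
    return answer + (63 - answer) // 2

def vote_over_refinements(initial, steps=5):
    # Majority vote over the answers produced across refinement turns,
    # echoing the verify-and-improve loop the paper formalizes as an MDP.
    answers, a = [initial], initial
    for _ in range(steps):
        a = refine(a)
        answers.append(a)
    return Counter(answers).most_common(1)[0][0]

print(vote_over_refinements(50))  # 62
```

The point of the sketch is only the control flow: repeated refinement turns whose outputs are aggregated by majority vote, as in the MATH 500 evaluation.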
Balanced intransitive relationships are critical to the depth of strategy and player retention within esports games. Intransitive relationships comprise the metagame, a collection of strategies and play styles that are viable, each providing counterplay for other viable strategies. This work presents a framework for testing the balance of massive online battle arena (MOBA) games using deep reinforcement learning to identify the synergies between characters by measuring their effectiveness against the other compositions within the game's character roster. This research is designed for game designers and developers to show how multi-agent reinforcement learning (MARL) can accelerate the balancing process and highlight potential game-balance issues during the development process. Our findings conclude that accurate measurements of game balance can be found with under 10 hours of simulation and show imbalances that traditional cost curve analysis approaches failed to capture. Furthermore, we discovered that this approach reduced the imbalance in each character's win rate by 20% in our example project, a key measurement that would previously have been impossible to obtain without collecting data from hundreds of human-controlled games. The project's source code is publicly available at https://github.com/Taikatou/top-down-shooter.
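One simple balance statistic over a simulated win-rate matrix, in the spirit of the measurements above; the characters and rates are made up:

```python
def winrate_imbalance(win_matrix):
    """Average absolute deviation of each character's overall win rate
    from 50%, a simple balance statistic over simulated matchups."""
    devs = []
    for row in win_matrix.values():
        avg = sum(row.values()) / len(row)
        devs.append(abs(avg - 0.5))
    return sum(devs) / len(devs)

# Toy win rates from self-play simulations (row character vs. column).
matrix = {
    "knight": {"mage": 0.70, "rogue": 0.55},
    "mage":   {"knight": 0.30, "rogue": 0.60},
    "rogue":  {"knight": 0.45, "mage": 0.40},
}
print(round(winrate_imbalance(matrix), 3))  # 0.083
```

A designer would rerun MARL self-play after each balance patch and watch this number move toward zero, which is roughly the loop the framework automates.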
An ability to morally reason is crucial to the believability of many fictional characters, from Jane Austen’s heroines to the denizens of The Good Place. These works often foreground the complexity of moral questions and the circumstances under which different forms of behavior might be justified. Morality is also foregrounded in many games, from Black and White to Mass Effect 3. Yet, most in-game characters judge other characters (or the player) based on a single reputation scale or binary values of right and wrong. There has been little exploration in games of the relationship between character values and beliefs and moral reasoning. In keeping with this year’s conference theme, “Oh the Humanity,” this design postmortem paper describes the design and development of Argument Box, a model of moral argumentation and reasoning based on Lakoff’s metaphor theory of moral politics. We describe our design approach, iterations, and authoring concerns, covering what went right and wrong in our attempts to model morality-based argumentation for believable game characters.
Technological developments are the main drivers of global social, economic, and cultural change, including in the rapidly growing gaming industry. The Role-Playing Game (RPG) genre, in which players portray characters in the game's story, is gaining popularity. The application of FSM models and AI technology in character development and RPG game interaction not only results in engaging entertainment but also inspires similar uses in various fields. With AI, characters interact dynamically with players and the environment, and with FSM models governing complex character behavior, the game experience becomes even more immersive. RPG Maker, one of the popular engines, simplifies the process of creating RPG games with an easy-to-use interface. The FSM model is implemented through events and switches, directing storylines and character situations with structured logic. This study analyzes the application of the FSM model in MMORPG games. Through the design, testing, and analysis stages, the FSM proved effective in creating games that combine entertainment with learning. The game invites players to seek out requirements and challenges in order to proceed to the next level. The result is an MMORPG played on a PC with the Windows operating system, providing an educational and entertaining gaming experience.
This study explores the effects of the perspective-taking of non-player characters (NPCs) on enhancing game immersion in prosocial virtual reality (VR) games. Prosocial games are games focusing on helping others. Game researchers have been keen to investigate factors that influence the immersive experience in digital games. Previous studies show that VR allows people to take the perspective of others, inducing empathy and prosocial behaviour in the real world. In this lab-based study, we explore whether and how taking the perspective of other game characters – NPCs in a prosocial VR game – influences players’ in-game empathy towards NPCs and game immersion. Participants first experienced either a robot’s perspective of being destroyed by fire in VR or read a text description about the same event. Then, they participated in a prosocial VR game in which they saved robots. The findings show that perspective-taking experiences indirectly enhance participants’ game immersion via the effects of closeness with the destroyed robot and empathy towards the four robots protected by the player. This indirect effect is moderated by players’ weekly exposure to video games. These results suggest that VR-based perspective-taking of NPCs can indirectly enhance gameplay experiences in prosocial VR games. Theoretical and game design implications are discussed.
No abstract available
Advancements in deep multi-agent reinforcement learning (MARL) have positioned it as a promising approach for decision-making in cooperative games. However, it still remains challenging for MARL agents to learn cooperative strategies for some game environments. Recently, large language models (LLMs) have demonstrated emergent reasoning capabilities, making them promising candidates for enhancing coordination among the agents. However, due to the model size of LLMs, it can be expensive to frequently infer LLMs for actions that agents can take. In this work, we propose You Only LLM Once for MARL (YOLO-MARL), a novel framework that leverages the high-level task planning capabilities of LLMs to improve the policy learning process of multi-agents in cooperative games. Notably, for each game environment, YOLO-MARL only requires a one-time interaction with LLMs in the proposed strategy generation, state interpretation and planning function generation modules, before the MARL policy training process. This avoids the ongoing costs and computational time associated with frequent LLM API calls during training. Moreover, trained decentralized policies based on normal-sized neural networks operate independently of the LLM. We evaluate our method across two different environments and demonstrate that YOLO-MARL outperforms traditional MARL algorithms. The Github repository of our code can be found at https://github.com/paulzyzy/YOLO-MARL.
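The one-time planning-function idea can be sketched as follows; `planning_fn` plays the role of the code the LLM would generate once per environment, and the reward-shaping bonus is an assumed design, not YOLO-MARL's exact mechanism:

```python
# A planning function of the kind the LLM would generate once per
# environment: it maps a state summary to a high-level goal. The rule
# below is an invented placeholder for the LLM-generated code.
def planning_fn(state):
    if state["ally_hp"] < 0.3:
        return "retreat"
    if state["enemy_visible"]:
        return "engage"
    return "explore"

def shaped_reward(env_reward, state, action_goal):
    # Small bonus when the executed high-level goal matches the plan,
    # so MARL training benefits from the LLM without further API calls.
    bonus = 0.1 if planning_fn(state) == action_goal else 0.0
    return env_reward + bonus

r = shaped_reward(1.0, {"ally_hp": 0.8, "enemy_visible": True}, "engage")
print(r)  # 1.1
```

Because `planning_fn` is plain code, it can be called millions of times during MARL training at no LLM inference cost, which is the framework's central efficiency argument.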
Can social agents be assertive and persuade users? To what extent do the persuasion abilities of robots depend on the users' own traits? In this paper, we describe the results of a study in which participants interacted with robotic Non-Player Characters (NPC) displaying different levels of assertiveness (high and low), in a storytelling scenario. We sought to understand how the level of assertiveness displayed by the robots impacted the participants' decision-making process and game experience. Our results suggest that NPCs displaying lower levels of assertiveness evoke more positive emotional responses but are not more effective at influencing players' decisions when compared to NPCs displaying higher levels of this trait. However, NPCs displaying a personality trait are more effective persuaders than NPCs not displaying this feature. Overall, this paper highlights the importance of considering the player's personality and its relation to task-specific attributes during the process of game design.
Team formation in multi-agent systems usually assumes the capabilities of each team member are known, and the best formation can be derived from that information. As AI agents become more sophisticated, this characterisation is becoming more elusive and less predictive about the performance of a team in cooperative or competitive situations. In this paper, we introduce a general and flexible way of anticipating the outcome of a game for any lineups (the agents, sociality regimes and any other hyperparameters for the team). To this purpose, we simply train an assessor using an appropriate team representation and standard machine learning techniques. We illustrate how we can interrogate the assessor to find the best formations in a pursuit–evasion game for several scenarios: offline team formation, where teams have to be decided before the game and not changed afterwards, and online team formation, where teams can see the lineups of the other teams and can be changed at any time.
No abstract available
Video game testing has become a major investment of time, labor, and expense in the game industry. In particular, the balancing of in-game units, characters, and classes can cause long-lasting issues that persist years after a game's launch. While approaches incorporating artificial intelligence have already shown successes in reducing manual effort and enhancing game development processes, most of these draw on heuristic, generalized, or optimal behavior routines, while actual low-level decisions from individual players and their resulting playing styles are rarely considered. In this article, we apply deep player behavior modeling to turn atomic actions of 213 players from six months of single-player instances within the MMORPG Aion into generative models that capture and reproduce particular playing strategies. In a subsequent simulation, the resulting generative agents ("replicants") were tested against common NPC opponent types of MMORPGs that iteratively increased in difficulty with respect to the primary factor that constitutes each enemy type (Melee, Ranged, Rogue, Buffer, Debuffer, Healer, Tank, or Group). As a result, imbalances between classes as well as strengths and weaknesses regarding particular combat challenges could be identified and regulated automatically.
Playing video games is one way to break up the monotony of an otherwise boring day, yet many players quickly tire of the games available to them. To circumvent this problem, this study develops a strategy that gives non-player characters (NPCs) rapid responses during combat and may boost their abilities. NPCs are among the most important elements of a game: autonomous, adaptive NPCs can change their behavior in reaction to the player's decisions and to conditions in their environment. Previous research has used neural networks to forecast NPC behavior; however, the predicted behavior is not always as desired, leading to poor accuracy. This study addresses that accuracy problem using three input parameters: the NPC's power, the distance between the player and the NPC, and the opponent's power. The test outcomes indicate that the machine learning technique can identify the results of the NPC behavior analysis as well as the level of accuracy reached.
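A decision rule over the three inputs the study names could look like the sketch below. This is purely illustrative, not the paper's learned model: the thresholds, action names, and the rule structure are invented for demonstration; only the three input parameters (NPC power, player-NPC distance, opponent power) come from the abstract.

```python
# Illustrative rule over the study's three inputs (thresholds are invented):
# NPC power, distance between player and NPC, and opponent power.

def npc_decide(npc_power: float, distance: float, opponent_power: float) -> str:
    advantage = npc_power - opponent_power
    if distance > 10.0:
        return "approach"   # too far to engage, close the distance first
    if advantage >= 0:
        return "attack"     # NPC is at least as strong as the opponent
    return "retreat"        # outmatched at close range
```

A learned model would replace this hand-written rule with a classifier fitted to gameplay data, but the input/output contract stays the same.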
Intent recognition refers to obtaining the observations of an agent and then using those observations to reason about its current state and predict its future actions. Behavior modeling, describing the behavior or performance of an agent, is an important research area in intent recognition. However, few studies have combined behavior modeling with intent recognition to investigate its real-world applications. In this paper, we study behavior modeling for intent recognition for cognitive intelligence, aiming to enhance the situational awareness capability of AI and expand its applications in multiple fields. Taking the combat environment and tanks as the research object, and based on the behavior tree and the SBR recognition algorithm, this paper designs the framework and experiments for behavior modeling and intent recognition. First, it uses an evolutionary behavior tree algorithm to autonomously generate a behavior model adapted to the environment. Second, it uses the SBR algorithm to effectively recognize the actions and planned paths of the enemy tank, guiding the agent's own tank in the TankSimV1.20 simulation platform. The results show that the tank survival rate increases by 80% under the guidance of the intent recognition results, and that the method in this paper provides effective guidance for intent-recognition behavior modeling, with broad application prospects.
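The behavior-tree side of such a model can be sketched minimally. This is a generic behavior-tree illustration under invented node names, not the paper's evolved tree: a selector tries children in order until one succeeds, and a sequence succeeds only if all children do.

```python
# Minimal behavior-tree sketch (node names invented) of the kind of model
# evolved for tank agents: selector = fallback, sequence = all-must-succeed.

def selector(*children):
    return lambda state: any(child(state) for child in children)

def sequence(*children):
    return lambda state: all(child(state) for child in children)

def condition(key):
    return lambda state: bool(state.get(key, False))

def action(name, log):
    def run(state):
        log.append(name)   # record the executed action for inspection
        return True
    return run

log = []
tank = selector(
    sequence(condition("enemy_visible"), action("fire", log)),
    action("patrol", log),   # fallback when no enemy is visible
)
tank({"enemy_visible": False})   # no enemy: falls through to patrol
```

An evolutionary algorithm would search over such tree structures, scoring each candidate by simulated performance (e.g. survival rate).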
Video story interaction enables viewers to engage with and explore narrative content for personalized experiences. However, existing methods are limited to user selection, specially designed narratives, and lack customization. To address this, we propose an interactive system based on user intent. Our system uses a Vision Language Model (VLM) to enable machines to understand video stories, combining Retrieval-Augmented Generation (RAG) and a Multi-Agent System (MAS) to create evolving characters and scene experiences. It includes three stages: 1) Video story processing, utilizing VLM and prior knowledge to simulate human understanding of stories across three modalities. 2) Multi-space chat, creating growth-oriented characters through MAS interactions based on user queries and story stages. 3) Scene customization, expanding and visualizing various story scenes mentioned in dialogue. Applied to the Harry Potter series, our study shows the system effectively portrays emergent character social behavior and growth, enhancing the interactive experience in the video story world.
Large Language Models (LLMs) have revolutionized open-domain dialogue agents but encounter challenges in multi-character role-playing (MCRP) scenarios. To address the issue, we present Neeko, an innovative framework designed for efficient multi-character imitation. Neeko employs a dynamic low-rank adapter (LoRA) strategy, enabling it to adapt seamlessly to diverse characters. Our framework breaks down the role-playing process into agent pre-training, multiple-character playing, and character incremental learning, effectively handling both seen and unseen roles. This dynamic approach, coupled with distinct LoRA blocks for each character, enhances Neeko's adaptability to unique attributes, personalities, and speaking patterns. As a result, Neeko demonstrates superior performance in MCRP over most existing methods, offering more engaging and versatile user interaction experiences.
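The per-character adapter routing can be sketched conceptually. This is not Neeko's implementation: the class, the string stand-ins for LoRA weights, and the unseen-role fallback are assumptions; only the idea of one LoRA block per character, swapped dynamically, comes from the abstract.

```python
# Conceptual sketch of dynamic per-character LoRA routing (contents are
# placeholder strings; a real system would swap adapter weight tensors).

class MultiCharacterAgent:
    def __init__(self):
        self.adapters = {}    # character name -> LoRA block (stub)
        self.active = None

    def register(self, character: str, lora_block):
        # Incremental learning hook: new characters add a new adapter
        # without retraining the shared base model.
        self.adapters[character] = lora_block

    def switch(self, character: str):
        # Unseen roles fall back to a default adapter (an assumption here).
        self.active = self.adapters.get(character, "default-adapter")

agent = MultiCharacterAgent()
agent.register("Hermione", "lora-hermione")
agent.switch("Hermione")
```

The design point the sketch isolates is that character identity is a routing key, so adding a role never touches the base model or the other adapters.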
We address the problem of controlling and simulating interactions between multiple physics-based characters, using short unlabeled motion clips. We propose Adversarial Interaction Priors (AIP), a multi-agent generative adversarial imitation learning (MAGAIL) approach, which extends recent deep reinforcement learning (RL) works aimed at imitating single-character example motions. The main contribution of this work is to extend the idea of motion imitation of a single character to interaction imitation between multiple characters. Our method uses a control policy for each character to imitate interactive behaviors provided by short example motion clips, and associates a discriminator with each character, trained on actor-specific interactive motion clips. The discriminator returns interaction rewards that measure the similarity between generated behaviors and those demonstrated in the reference motion clips. The policies and discriminators are trained in a multi-agent adversarial reinforcement learning procedure to improve the quality of the behaviors generated by each agent. Initial results show the effectiveness of our method on the interactive task of shadowboxing between two fighters.
LLM role-playing aims to portray arbitrary characters in interactive narratives, yet existing systems often suffer from limited immersion and adaptability. They typically under-model dynamic environmental information and assume largely static scenes and casts, offering insufficient support for multi-character orchestration, scene transitions, and on-the-fly character introduction. We propose an adaptive multi-agent role-playing framework, AdaMARP, featuring an immersive message format that interleaves [Thought], (Action), and Speech, together with an explicit Scene Manager that governs role-playing through discrete actions (init_scene, pick_speaker, switch_scene, add_role, end) accompanied by rationales. To train these capabilities, we construct AdaRPSet for the Actor Model and AdaSMSet for supervising orchestration decisions, and introduce AdaptiveBench for trajectory-level evaluation. Experiments across multiple backbones and model scales demonstrate consistent improvements: AdaRPSet enhances character consistency, environment grounding, and narrative coherence, with an 8B actor outperforming several commercial LLMs, while AdaSMSet enables smoother scene transitions and more natural role introductions, surpassing Claude Sonnet 4.5 using only a 14B LLM.
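The Scene Manager's discrete-action interface can be pictured as a small dispatcher. Only the five action names come from the abstract; the state fields and handler bodies below are invented placeholders, and a real orchestrator would emit each action with an LLM-generated rationale.

```python
# Sketch of a Scene Manager as a discrete-action dispatcher (handler bodies
# are placeholders; only the action names come from the paper).

class SceneManager:
    def __init__(self):
        self.scene = None     # current scene description
        self.roles = []       # characters currently in the scene
        self.turns = []       # speaker order chosen so far

    def step(self, act: str, arg=None):
        if act == "init_scene":
            self.scene = arg
        elif act == "pick_speaker":
            self.turns.append(arg)
        elif act == "switch_scene":
            self.scene = arg
        elif act == "add_role":
            self.roles.append(arg)   # on-the-fly character introduction
        elif act == "end":
            self.scene = None
        return self.scene

sm = SceneManager()
sm.step("init_scene", "tavern")
sm.step("add_role", "bard")
sm.step("pick_speaker", "bard")
```

Casting orchestration as a small closed action vocabulary is what makes the decisions supervisable from a dataset like AdaSMSet.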
As large language models (LLMs) advance in role-playing (RP) tasks, existing benchmarks quickly become obsolete due to their narrow scope, outdated interaction paradigms, and limited adaptability across diverse application scenarios. To address this gap, we introduce FURINA-Builder, a novel multi-agent collaboration pipeline that automatically constructs fully customizable RP benchmarks at any scale. It enables evaluation of arbitrary characters across diverse scenarios and prompt formats, as the first benchmark builder in RP area for adaptable assessment. FURINA-Builder simulates dialogues between a test character and other characters drawn from a well-constructed character-scene pool, while an LLM judge selects fine-grained evaluation dimensions and adjusts the test character's responses into final test utterances. Using this pipeline, we build FURINA-Bench, a new comprehensive role-playing benchmark featuring both established and synthesized test characters, each assessed with dimension-specific evaluation criteria. Human evaluation and preliminary separability analysis justify our pipeline and benchmark design. We conduct extensive evaluations of cutting-edge LLMs and find that o3 and DeepSeek-R1 achieve the best performance on English and Chinese RP tasks, respectively. Across all models, established characters consistently outperform synthesized ones, with reasoning capabilities further amplifying this disparity. Interestingly, we observe that model scale does not monotonically reduce hallucinations. More critically, for reasoning LLMs, we uncover a novel trade-off: reasoning improves RP performance but simultaneously increases RP hallucinations. This trade-off extends to a broader Pareto frontier between RP performance and reliability for all LLMs. These findings demonstrate the effectiveness of FURINA-Builder and the challenge posed by FURINA-Bench.
Reinforcement learning (RL) approaches, particularly Q-learning, have emerged as strong tools for autonomous agent training, allowing agents to acquire optimal decision-making policies through interaction with their surroundings. This research investigates the use of Q-learning in the context of training autonomous agents for robotic soccer, a complex and dynamic arena that necessitates strategic planning, coordination, and adaptation. We studied the learning progress and performance of agents trained using Q-learning in a series of experiments carried out in a simulated soccer setting. During training, agents interacted with the soccer environment, iteratively updating their Q-values in response to observed rewards and behaviors. Despite the high-dimensional and stochastic character of the soccer domain, Q-learning helped the agents develop effective tactics and decision-making capabilities. Notably, our study found that, on average, the agents required 64 steps to reach a stable policy with an average reward of -1.
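The Q-value update the study relies on is the textbook tabular rule. The environment, state and action names, and learning-rate values below are toy stand-ins; only the update rule itself is standard Q-learning.

```python
# Textbook tabular Q-learning update (states, actions, and rewards here are
# toy stand-ins, not the robotic-soccer environment):
#   Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
from collections import defaultdict

def q_update(Q, s, a, r, s_next, alpha=0.5, gamma=0.9,
             actions=("kick", "pass")):
    best_next = max(Q[(s_next, a2)] for a2 in actions)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
    return Q[(s, a)]

Q = defaultdict(float)           # unseen (state, action) pairs default to 0.0
q_update(Q, "midfield", "pass", 1.0, "attack")
```

With all Q-values initialized to zero, the first update moves `Q[("midfield", "pass")]` to `alpha * r = 0.5`; repeated interaction propagates discounted future rewards backward through the table.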
Constella: Supporting Storywriters’ Interconnected Character Creation through LLM-based Multi-Agents
Creating a cast of characters by attending to their relational dynamics is a critical aspect of most long-form storywriting. However, our formative study (N=14) reveals that writers struggle to envision new characters that could influence existing ones, balance similarities and differences among characters, and intricately flesh out their relationships. Based on these observations, we designed Constella, an LLM-based multi-agent tool that supports storywriters’ interconnected character creation process. Constella suggests related characters (FRIENDS DISCOVERY feature), reveals the inner mindscapes of several characters simultaneously (JOURNALS feature), and manifests relationships through inter-character responses (COMMENTS feature). Our 7–8 day deployment study with storywriters (N=11) shows that Constella enabled the creation of expansive communities composed of related characters, facilitated the comparison of characters’ thoughts and emotions, and deepened writers’ understanding of character relationships. We conclude by discussing how multi-agent interactions can help distribute writers’ attention and effort across the character cast.
In this study, we explore the application of Large Language Models (LLMs) in \textit{Jubensha}, a Chinese detective role-playing game and a novel area in Artificial Intelligence (AI) driven gaming. We introduce the first dataset specifically for Jubensha, including character scripts and game rules, to foster AI agent development in this complex narrative environment. Our work also presents a unique multi-agent interaction framework using LLMs, allowing AI agents to autonomously engage in this game. To evaluate the gaming performance of these AI agents, we developed novel methods measuring their mastery of case information and reasoning skills. Furthermore, we incorporated the latest advancements in in-context learning to improve the agents' performance in information gathering, murderer identification, and logical reasoning. The experimental results validate the effectiveness of our proposed methods. This work aims to offer a novel perspective on understanding LLM capabilities and establish a new benchmark for evaluating large language model-based agents.
No abstract available
Enabling humanoid robots to clean rooms has long been a pursued dream within humanoid research communities. However, many tasks require multi-humanoid collaboration, such as carrying large and heavy furniture together. Given the scarcity of motion capture data on multi-humanoid collaboration and the efficiency challenges associated with multi-agent learning, these tasks cannot be straightforwardly addressed using training paradigms designed for single-agent scenarios. In this paper, we introduce Cooperative Human-Object Interaction (CooHOI), a framework designed to tackle the challenge of multi-humanoid object transportation problem through a two-phase learning paradigm: individual skill learning and subsequent policy transfer. First, a single humanoid character learns to interact with objects through imitation learning from human motion priors. Then, the humanoid learns to collaborate with others by considering the shared dynamics of the manipulated object using centralized training and decentralized execution (CTDE) multi-agent RL algorithms. When one agent interacts with the object, resulting in specific object dynamics changes, the other agents learn to respond appropriately, thereby achieving implicit communication and coordination between teammates. Unlike previous approaches that relied on tracking-based methods for multi-humanoid HOI, CooHOI is inherently efficient, does not depend on motion capture data of multi-humanoid interactions, and can be seamlessly extended to include more participants and a wide range of object types.
No abstract available
No abstract available
No abstract available
No abstract available
Recent advances in artificial intelligence technologies have begun to transform the gaming industry, especially in the areas of player-character interaction and narrative development. Traditionally, game stories and character relationships are predefined through scripted dialogues and sequences, which requires developers to invest a lot of time and effort. However, AI-driven approaches such as large language models (LLMs) and deep learning techniques offer a dynamic alternative that can enable more flexible, player-driven interactions and adaptive AI behaviors. This paper comprehensively reviews the current role of AI in game design from multiple perspectives, including applications in multi-agent interaction, procedural level and game content generation, and game development process optimization. In addition, this study explores the advantages and limitations of AI technology, coping with technical challenges, and ethical issues that may arise during the implementation of AI. The results are intended to provide a reference for the future application of AI in game design and provide recommendations for coping with emerging risks.
No abstract available
No abstract available
No abstract available
In most Interactive Storytelling systems, user interaction is based on natural language communication with virtual agents, either through isolated utterances or through dialogue. Natural language communication is also an essential element of interactive narratives in which the user is supposed to impersonate one of the story's characters. Whilst techniques for narrative generation and agent behaviour have made significant progress in recent years, natural language processing remains a bottleneck hampering the scalability of Interactive Storytelling systems. In this paper, we introduce a novel interaction technique based solely on emotional speech recognition. It allows the user to take part in dialogue with virtual actors without any constraints on style or expressivity, by mapping the recognised emotional categories to narrative situations and virtual characters' feelings. Our Interactive Storytelling system uses an emotional planner to drive characters' behaviours. The main feature of this approach is that characters' feelings are part of the planning domain and are at the heart of narrative representations. The emotional speech recogniser analyses the speech signal to produce a variety of features which can be used to define ad hoc categories on which to train the system. The content of our interactive narrative is an adaptation of one chapter of the 19th-century classic novel Madame Bovary, which is well suited to a formalisation in terms of characters' feelings. At various stages of the narrative, the user can address the main character or respond to her, impersonating her lover. The emotional category extracted from the user utterance can be analysed in terms of the current narrative context, which includes characters' beliefs, feelings and expectations, to produce a specific influence on the target character, which becomes visible through a change in its behaviour, achieving a high level of realism for the interaction. A limited number of emotional categories is sufficient to drive the narrative across multiple courses of action, since it comprises over thirty narrative functions. We report results from a fully implemented prototype, both as a proof of concept and in terms of usability through a preliminary user study.
No abstract available
No abstract available
Interactive narrative allows the user to play a role in a story and interact with other characters controlled by the system. Directorial control is a procedure for dynamically tuning the interaction towards the author's desired effects. Most existing approaches for directorial control are built within plot-centric frameworks for interactive narrative and do not have a systematic way to ensure that the characters are always well-motivated during the interaction. Thespian is a character-centric framework for interactive narrative. In our previous work on Thespian, we presented an approach for applying directorial control while not affecting the consistency of characters' motivations. This work evaluates the effectiveness of our directorial control approach. Given the priority of generating only well-motivated characters' behaviors, we empirically evaluate how often the author's desired effects are achieved. We also discuss how the directorial control procedure can save the author effort in configuring the characters.
QuickWoZ: a multi-purpose wizard-of-oz framework for experiments with embodied conversational agents
No abstract available
No abstract available
This report synthesizes the state of research on multi-agent game characters, presenting a complete picture of the field's shift from traditional rule-based architectures to deep reinforcement learning and LLM-driven approaches. At the technical level, the research delivers more efficient communication and coordination algorithms (MARL); at the cognitive level, it introduces LLM frameworks equipped with long-term memory and logical reasoning; and at the socio-psychological level, it examines characters' morality, emotion, and believability in depth. In addition, work on embodied intelligence and multimodal interaction is pushing NPCs from simple scripted control toward complex physical and visual perception. Mature evaluation benchmarks and prototype toolchains provide scientific yardsticks for the field's continued evolution, jointly driving game characters toward greater intelligence, sociality, and human-likeness.