CLIP 分类异常检测

基于CLIP的异常检测研究已从初步的语义迁移发展为包含零样本通用检测、小样本提示学习、视频时序分析及多模态推理的成熟生态。研究核心正由全局特征对齐转向针对特定领域架构适配、多模态逻辑增强及开放词汇场景的深度开发，通过系统化的架构设计解决了零样本与小样本场景下的泛化瓶颈与推理难题。

共 58 篇文献，5 个研究方向

零样本(Zero-shot)工业与医疗异常检测框架

侧重于利用预训练CLIP的通用泛化能力，设计多模态交互或特征对齐模块，在无需目标样本的情况下实现工业缺陷检测与医学影像分析。相关文献: Shuai Lyu et. al, 2025 等 27 篇文献

小样本(Few-shot)提示学习与参数适配

通过设计可学习的Prompt工程、适配器或精细化特征微调策略，利用少量领域样本提升模型在特定工业或医疗场景下的识别精度。相关文献: Zhaowei Zeng et. al, 2026 等 14 篇文献

视频异常检测与时序行为分析

专门针对视频中的时序依赖与异常行为定位，利用CLIP的跨模态语义对齐优势处理视频流的复杂逻辑。相关文献: Min Li et. al, 2025 等 3 篇文献

多模态大模型推理与开放词汇架构

结合MLLMs、GPT-4V等大模型能力或开放词汇探测技术，解决多场景、跨类别异常检测中对复杂语境的推理与描述需求。相关文献: Jinjin Zhang et. al, 2025 等 12 篇文献

领域综述与前沿应用扩展

包含异常检测领域的基础综述及针对医学影像等垂直领域的应用性研究。相关文献: Keming Mao et. al, 2026 等 2 篇文献

总计58篇相关文献

AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection

AdaptCLIP：通用视觉异常检测的CLIP模型自适应

ojs.aaai.org-Bin-Bin Gao, Yue Zhou, Jiangtao Yan 等, 2026-Proceedings of the AAAI Conference on Artificial Intelligence

Universal visual anomaly detection aims to identify anomalies from novel or unseen vision domains without additional fine-tuning, which is critical in open scenarios. Recent studies have demonstrated that pre-trained vision-language models like CLIP exhibit strong generalization with just zero or a few normal images. However, existing methods struggle to design prompt templates, handle complex token interactions, or require fine-tuning on target domains, resulting in limited flexibility. In this work, we present a simple yet effective AdaptCLIP based on two key insights. First, adaptive visual and textual representations should be learned alternately rather than jointly. Second, comparative learning between query and normal image prompt should incorporate both contextual and aligned residual features, rather than relying solely on residual features. AdaptCLIP treats CLIP models as a foundational service, adding only three simple adapters, visual adapter, textual adapter, and prompt-query adapter, at its input or output ends. AdaptCLIP supports zero-/few-shot generalization across domains and provides a training-free approach on target domains once trained on a base dataset. AdaptCLIP achieves state-of-the-art performance on 12 anomaly detection benchmarks from industrial and medical domains, significantly outperforming existing competitive methods.