PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection (2404.05231v2)

Published 8 Apr 2024 in cs.CV

Abstract: The vision-LLM has brought great improvement to few-shot industrial anomaly detection, which usually needs to design of hundreds of prompts through prompt engineering. For automated scenarios, we first use conventional prompt learning with many-class paradigm as the baseline to automatically learn prompts but found that it can not work well in one-class anomaly detection. To address the above problem, this paper proposes a one-class prompt learning method for few-shot anomaly detection, termed PromptAD. First, we propose semantic concatenation which can transpose normal prompts into anomaly prompts by concatenating normal prompts with anomaly suffixes, thus constructing a large number of negative samples used to guide prompt learning in one-class setting. Furthermore, to mitigate the training challenge caused by the absence of anomaly images, we introduce the concept of explicit anomaly margin, which is used to explicitly control the margin between normal prompt features and anomaly prompt features through a hyper-parameter. For image-level/pixel-level anomaly detection, PromptAD achieves first place in 11/12 few-shot settings on MVTec and VisA.

References (61)

Citations (14)

View on Semantic Scholar

Summary

The paper introduces PromptAD, a novel method that learns prompts from only normal samples to perform few-shot anomaly detection.
It employs Semantic Concatenation to convert normal prompts into pseudo-anomaly prompts, enriching training without requiring anomaly samples.
It leverages an Explicit Anomaly Margin to enforce feature separation, achieving state-of-the-art performance on image- and pixel-level benchmarks.

An Expert Overview of "PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection"

The paper under discussion introduces a novel methodology termed PromptAD, which pivots around the idea of applying one-class prompt learning to few-shot anomaly detection using only normal samples. This paper addresses key challenges faced in unsupervised industrial anomaly detection, particularly within the framework of one-class classification (OCC). PromptAD leverages the inherent capabilities of vision-LLMs, specifically CLIP, but innovates by introducing a new paradigm to efficiently learn prompts in scenarios where only normal data samples are accessible during training.

Key Contributions and Methodologies

The authors thoughtfully dissect the inefficacies of conventional prompt learning in anomaly detection domains and propose solutions to bridge this gap. The main contributions of the paper are structured around two pivotal components: semantic concatenation (SC) and explicit anomaly margin (EAM).

Semantic Concatenation (SC): The technique of semantic concatenation is proposed to address the absence of anomaly samples in typical one-class anomaly detection tasks. SC enables the transformation of semantics for normal prompts into anomaly prompts through the strategic concatenation of normal prompts with anomaly-specific suffixes. This not only enriches the range of prompts available for training but also constructs a pseudo-negative prompt landscape that aids in effective prompt learning.
Explicit Anomaly Margin (EAM): To overcome the lack of actual negative samples in the training phase, EAM is introduced as a mechanism to enforce an explicit margin between the learned features of normal prompts and transposed anomaly prompts. By controlling this margin through a hyper-parameter, the model enhances its ability to distinguish between normal and anomalous scenarios without direct exposure to anomalous samples during training.

The integration of these methods into the CLIP architecture results in a prompt-guided anomaly detection process that significantly improves upon existing strategies, particularly in few-shot settings.

Empirical Evaluation and Results

The paper reports comprehensive experimental results demonstrating PromptAD’s performance on standard anomaly detection benchmarks, including MVTec and VisA. Notably, PromptAD secures the leading position in 11 out of 12 few-shot tasks across both image-level and pixel-level settings.

Image-Level Results: PromptAD outperforms existing methodologies such as WinCLIP+ and RWDA, reporting substantial improvements in few-shot scenarios, underscoring its robustness when very few samples are available.
Pixel-Level Results: While traditionally more challenging, pixel-level anomaly detection results also indicate that PromptAD exhibits marked improvements, particularly when employing its proposed innovations, namely SC and EAM.

The paper supplements its findings with rigorous ablation studies, highlighting the significant contributions of SC and EAM to the overall efficacy of PromptAD in anomaly detection tasks.

Implications and Future Directions

The findings underscore the potential for PromptAD to redefine approaches to anomaly detection, especially in industrial scenarios where rapid adaptation to new conditions is paramount, and sample scarcity is prevalent. The capability of PromptAD to learn effective prompts with minimal data represents a noteworthy advancement towards more generalized and automated anomaly detection systems.

Theoretically, this research extends the understanding of prompt learning applicability beyond general classification tasks into more specialized and complex inferencing domains. Practically, it opens up pathways for further exploration into anomaly detection frameworks where minimal labeled data is required for robust model performance.

Future research might explore the feasibility of integrating PromptAD with other multimodal data and more complex anomaly types, potentially expanding its application across varied domains. Additionally, advancing the theoretical understanding of margin-based learning in zero-shot and few-shot contexts might yield further improvements in related tasks.

In conclusion, this paper presents a substantial contribution to the field of anomaly detection, providing both a solid theoretical framework and demonstrating empirical success, offering promising insights for applied computer vision research.

GitHub

GitHub - FuNz-0/PromptAD: Official implementation of CVPR 2024 PromptAD: Learning Prompts with Only Normal Samples for Few-Shot Anomaly Detection (153 stars)