Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey (2407.21794v1)

Published 31 Jul 2024 in cs.CV, cs.AI, and cs.LG

Abstract: Detecting out-of-distribution (OOD) samples is crucial for ensuring the safety of machine learning systems and has shaped the field of OOD detection. Meanwhile, several other problems are closely related to OOD detection, including anomaly detection (AD), novelty detection (ND), open set recognition (OSR), and outlier detection (OD). To unify these problems, a generalized OOD detection framework was proposed, taxonomically categorizing these five problems. However, Vision LLMs (VLMs) such as CLIP have significantly changed the paradigm and blurred the boundaries between these fields, again confusing researchers. In this survey, we first present a generalized OOD detection v2, encapsulating the evolution of AD, ND, OSR, OOD detection, and OD in the VLM era. Our framework reveals that, with some field inactivity and integration, the demanding challenges have become OOD detection and AD. In addition, we also highlight the significant shift in the definition, problem settings, and benchmarks; we thus feature a comprehensive review of the methodology for OOD detection, including the discussion over other related tasks to clarify their relationship to OOD detection. Finally, we explore the advancements in the emerging Large Vision LLM (LVLM) era, such as GPT-4V. We conclude this survey with open challenges and future directions.

Authors (13)

Atsuyuki Miyai (10 papers)
Jingkang Yang (36 papers)
Jingyang Zhang (58 papers)
Yifei Ming (26 papers)
Yueqian Lin (13 papers)
Qing Yu (45 papers)
Go Irie (16 papers)
Shafiq Joty (187 papers)
Yixuan Li (183 papers)
Hai Li (159 papers)
Ziwei Liu (368 papers)
Toshihiko Yamasaki (74 papers)
Kiyoharu Aizawa (67 papers)

Citations (4)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/fly51fly/status/1819129340392813015

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey (2407.21794v1)

Summary

Related Papers

Tweets