Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 75 tok/s
Gemini 2.5 Pro 51 tok/s Pro
GPT-5 Medium 20 tok/s Pro
GPT-5 High 18 tok/s Pro
GPT-4o 95 tok/s Pro
Kimi K2 193 tok/s Pro
GPT OSS 120B 467 tok/s Pro
Claude Sonnet 4 37 tok/s Pro
2000 character limit reached

Negative Label Guided OOD Detection with Pretrained Vision-Language Models (2403.20078v1)

Published 29 Mar 2024 in cs.CV and cs.LG

Abstract: Out-of-distribution (OOD) detection aims at identifying samples from unknown classes, playing a crucial role in trustworthy models against errors on unexpected inputs. Extensive research has been dedicated to exploring OOD detection in the vision modality. Vision-LLMs (VLMs) can leverage both textual and visual information for various multi-modal applications, whereas few OOD detection methods take into account information from the text modality. In this paper, we propose a novel post hoc OOD detection method, called NegLabel, which takes a vast number of negative labels from extensive corpus databases. We design a novel scheme for the OOD score collaborated with negative labels. Theoretical analysis helps to understand the mechanism of negative labels. Extensive experiments demonstrate that our method NegLabel achieves state-of-the-art performance on various OOD detection benchmarks and generalizes well on multiple VLM architectures. Furthermore, our method NegLabel exhibits remarkable robustness against diverse domain shifts. The codes are available at https://github.com/tmlr-group/NegLabel.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (54)
  1. Certifiably adversarially robust detection of out-of-distribution data. In NeurIPS, 2020.
  2. Food-101–mining discriminative components with random forests. In ECCV, 2014.
  3. Altclip: Altering the language encoder in clip for extended language capabilities. arXiv preprint, 2022.
  4. Describing textures in the wild. In CVPR, 2014.
  5. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
  6. Learning confidence for out-of-distribution detection in neural networks. arXiv preprint arXiv:1802.04865, 2018.
  7. Extremely simple activation shaping for out-of-distribution detection. 2023.
  8. Unknown-aware object detection: Learning what you don’t know from videos in the wild. In CVPR, 2022.
  9. Zero-shot out-of-distribution detection based on the pre-trained model clip. In AAAI, 2022.
  10. Christiane Fellbaum. WordNet: An Electronic Lexical Database. Bradford Books, 1998. URL https://mitpress.mit.edu/9780262561167/.
  11. Exploring the limits of out-of-distribution detection. In NeurIPS, 2021.
  12. A baseline for detecting misclassified and out-of-distribution examples in neural networks. In ICLR, 2017.
  13. The many faces of robustness: A critical analysis of out-of-distribution generalization. ICCV, 2021a.
  14. Natural adversarial examples. CVPR, 2021b.
  15. The inaturalist species classification and detection dataset. In CVPR, 2018.
  16. Generalized ODIN: detecting out-of-distribution image without learning from out-of-distribution data. In CVPR, 2020a.
  17. Generalized odin: Detecting out-of-distribution image without learning from out-of-distribution data. In CVPR, 2020b.
  18. On the importance of gradients for detecting distributional shifts in the wild. In NeurIPS, 2021.
  19. Scaling up visual and vision-language representation learning with noisy text supervision. In ICML, 2021.
  20. Detecting out-of-distribution data through in-distribution class prior. In ICML, 2023.
  21. 3d object representations for fine-grained categorization. ICCV, 2013.
  22. Alex Krizhevsky. Learning multiple layers of features from tiny images. 2009.
  23. A tutorial on energy-based learning. Predicting structured data, 1(0), 2006.
  24. A simple unified framework for detecting out-of-distribution samples and adversarial attacks. In NeurIPS, 2018.
  25. Enhancing the reliability of out-of-distribution image detection in neural networks. In ICLR, 2018.
  26. Microsoft coco: Common objects in context. In ECCV, 2014.
  27. Visual instruction tuning. In NeurIPS, 2023.
  28. Energy-based out-of-distribution detection. In NeurIPS, 2020.
  29. Large-scale long-tailed recognition in an open world. In CVPR, 2019.
  30. Delving into out-of-distribution detection with vision-language representations. In NeurIPS, 2022a.
  31. On the impact of spurious correlation for out-of-distribution detection. In AAAI, 2022b.
  32. Cider: Exploiting hyperspherical embeddings for out-of-distribution detection. ICLR, 2023.
  33. Ravi Parameswaran. Statistics for experimenters: an introduction to design, data analysis, and model building. JMR, Journal of Marketing Research, 16(000002):291, 1979.
  34. Cats and dogs. In CVPR, 2012.
  35. The case against accuracy estimation for comparing induction algorithms. In ICML, 1998.
  36. Learning transferable visual models from natural language supervision. In ICML, 2021.
  37. Do imagenet classifiers generalize to imagenet? In ICML, 2019.
  38. Dice: Leveraging sparsification for out-of-distribution detection. In ECCV, 2022.
  39. React: Out-of-distribution detection with rectified activations. In NeurIPS, 2021.
  40. Out-of-distribution detection with deep nearest neighbors. ICML, 2022.
  41. Csi: Novelty detection via contrastive learning on distributionally shifted instances. NeurIPS, 2020.
  42. Non-parametric outlier synthesis. ICLR, 2023.
  43. Open-set recognition: A good closed-set classifier is all you need. In ICLR, 2022.
  44. The caltech-ucsd birds-200-2011 dataset. 2011.
  45. Learning robust global representations by penalizing local predictive power. In NeurIPS, 2019.
  46. Vim: Out-of-distribution with virtual-logit matching. In CVPR, 2022.
  47. Can multi-label classification networks know what they don’t know? In NeurIPS, 2021a.
  48. Clipn for zero-shot ood detection: Teaching clip to say no. ICCV, 2023.
  49. Energy-based open-world uncertainty modeling for confidence calibration. In ICCV, 2021b.
  50. Mitigating neural network overconfidence with logit normalization. In ICML, 2022.
  51. SUN database: Large-scale scene recognition from abbey to zoo. In CVPR, 2010.
  52. Groupvit: Semantic segmentation emerges from text supervision. In CVPR, 2022.
  53. Semantically coherent out-of-distribution detection. In ICCV, 2021.
  54. Places: A 10 million image database for scene recognition. IEEE transactions on pattern analysis and machine intelligence, 40(6):1452–1464, 2018.
Citations (18)

Summary

We haven't generated a summary for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube