Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Iterative Few-shot Semantic Segmentation from Image Label Text (2303.05646v1)

Published 10 Mar 2023 in cs.CV

Abstract: Few-shot semantic segmentation aims to learn to segment unseen class objects with the guidance of only a few support images. Most previous methods rely on the pixel-level label of support images. In this paper, we focus on a more challenging setting, in which only the image-level labels are available. We propose a general framework to firstly generate coarse masks with the help of the powerful vision-LLM CLIP, and then iteratively and mutually refine the mask predictions of support and query images. Extensive experiments on PASCAL-5i and COCO-20i datasets demonstrate that our method not only outperforms the state-of-the-art weakly supervised approaches by a significant margin, but also achieves comparable or better results to recent supervised methods. Moreover, our method owns an excellent generalization ability for the images in the wild and uncommon classes. Code will be available at https://github.com/Whileherham/IMR-HSNet.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Haohan Wang (96 papers)
  2. Liang Liu (237 papers)
  3. Wuhao Zhang (4 papers)
  4. Jiangning Zhang (102 papers)
  5. Zhenye Gan (22 papers)
  6. Yabiao Wang (93 papers)
  7. Chengjie Wang (178 papers)
  8. Haoqian Wang (74 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.