Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CLIPVG: Text-Guided Image Manipulation Using Differentiable Vector Graphics (2212.02122v2)

Published 5 Dec 2022 in cs.CV

Abstract: Considerable progress has recently been made in leveraging CLIP (Contrastive Language-Image Pre-Training) models for text-guided image manipulation. However, all existing works rely on additional generative models to ensure the quality of results, because CLIP alone cannot provide enough guidance information for fine-scale pixel-level changes. In this paper, we introduce CLIPVG, a text-guided image manipulation framework using differentiable vector graphics, which is also the first CLIP-based general image manipulation framework that does not require any additional generative models. We demonstrate that CLIPVG can not only achieve state-of-art performance in both semantic correctness and synthesis quality, but also is flexible enough to support various applications far beyond the capability of all existing methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yiren Song (30 papers)
  2. Xuning Shao (3 papers)
  3. Kang Chen (61 papers)
  4. Weidong Zhang (41 papers)
  5. Minzhe Li (4 papers)
  6. Zhongliang Jing (11 papers)
Citations (19)

Summary

We haven't generated a summary for this paper yet.