Stable Signature is Unstable: Removing Image Watermark from Diffusion Models (2405.07145v1)

Published 12 May 2024 in cs.CR and cs.CV

Abstract: Watermarking has been widely deployed by industry to detect AI-generated images. A recent watermarking framework called Stable Signature (proposed by Meta) roots a watermark into the parameters of a diffusion model's decoder so that its generated images are inherently watermarked. Stable Signature makes it possible to watermark images generated by open-source diffusion models and was claimed to be robust against removal attacks. In this work, we propose a new attack that removes the watermark from a diffusion model by fine-tuning it. Our results show that the attack effectively removes the watermark, so that the model's generated images are non-watermarked, while maintaining their visual quality. Our results highlight that Stable Signature is not as stable as previously thought.
