Explaining CLIP through Co-Creative Drawings and Interaction (2306.07429v1)

Published 12 Jun 2023 in cs.AI, cs.CV, and cs.CY

Abstract: This paper analyses a visual archive of drawings produced by an interactive robotic art installation where audience members narrated their dreams into a system powered by CLIPdraw deep learning (DL) model that interpreted and transformed their dreams into images. The resulting archive of prompt-image pairs were examined and clustered based on concept representation accuracy. As a result of the analysis, the paper proposes four groupings for describing and explaining CLIP-generated results: clear concept, text-to-text as image, indeterminacy and confusion, and lost in translation. This article offers a glimpse into a collection of dreams interpreted, mediated and given form by AI, showcasing oftentimes unexpected, visually compelling or, indeed, the dream-like output of the system, with the emphasis on processes and results of translations between languages, sign-systems and various modules of the installation. In the end, the paper argues that proposed clusters support better understanding of the neural model.

References (12)

Authors (3)

Varvara Guljajeva (9 papers)
Isaac Joseph Clarke (2 papers)
Mar Canet Solà (4 papers)

Citations (3)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Explaining CLIP through Co-Creative Drawings and Interaction (2306.07429v1)

Summary

Related Papers