
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation (2210.03765v4)

Published 7 Oct 2022 in cs.CL and cs.AI

Abstract: Recent advances in text-to-image synthesis make it possible to visualize machine imaginations for a given context. On the other hand, when generating text, human writers are gifted at creative visualization, which enhances their writings by forming imaginations as blueprints before putting down the stories in words. Inspired by such a cognitive process, we ask the natural question of whether we can endow machines with the same ability to utilize visual information and construct a general picture of the context to guide text generation. In this work, we propose iNLG that uses machine-generated images to guide LLMs in open-ended text generation. The experiments and analyses demonstrate the effectiveness of iNLG on open-ended text generation tasks, including text completion, story generation, and concept-to-text generation in both few-shot and full-data scenarios. Both automatic metrics and human evaluations verify that the text snippets generated by our iNLG are coherent and informative while displaying minor degeneration.

Authors (7)
  1. Wanrong Zhu
  2. An Yan
  3. Yujie Lu
  4. Wenda Xu
  5. Xin Eric Wang
  6. Miguel Eckstein
  7. William Yang Wang
Citations (33)
