Assessing Discourse Relations in Language Generation from GPT-2 (2004.12506v3)
Abstract: Recent advances in NLP have been attributed to the emergence of large-scale pre-trained language models. GPT-2, in particular, is suited for generation tasks given its left-to-right language modeling objective, yet the linguistic quality of its generated text has largely remained unexplored. Our work takes a step toward understanding GPT-2's outputs in terms of discourse coherence. We perform a comprehensive study on the validity of explicit discourse relations in GPT-2's outputs under both organic generation and fine-tuned scenarios. Results show GPT-2 does not always generate text containing valid discourse relations; nevertheless, its text is more aligned with human expectation in the fine-tuned scenario. We propose a decoupled strategy to mitigate these problems and highlight the importance of explicitly modeling discourse information.
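To make the study setup concrete, below is a minimal sketch of the kind of pipeline the abstract implies: sampling text from GPT-2 and flagging explicit discourse connectives in the output. This is not the paper's actual method; the Hugging Face `transformers` calls, the prompt, and the small connective list are illustrative assumptions, and judging whether a flagged relation is actually valid would still require human or model-based annotation.

```python
# Illustrative sketch (not the paper's pipeline): generate text with GPT-2
# and flag explicit discourse connectives in the output.
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Small sample of explicit, PDTB-style connectives; placeholder inventory,
# not the one used in the paper.
CONNECTIVES = {"however", "because", "therefore", "although", "meanwhile"}

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

prompt = "The experiment failed."  # hypothetical prompt
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_length=60,
    do_sample=True,          # "organic" sampling, as opposed to fine-tuned decoding
    top_p=0.9,
    pad_token_id=tokenizer.eos_token_id,
)
text = tokenizer.decode(outputs[0], skip_special_tokens=True)

# Flag surface connectives; whether the signaled relation actually holds
# is the validity question the paper investigates.
found = [w for w in text.lower().split() if w.strip(".,;") in CONNECTIVES]
print(text)
print("Explicit connectives found:", found)
```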