Joint Repetition Suppression and Content Moderation of Large Language Models (2304.10611v2)

Published 20 Apr 2023 in cs.CL and cs.LG

Abstract: Natural language generation (NLG) is one of the most impactful fields in NLP, and recent years have witnessed its evolution brought about by LLMs. As the key instrument for writing assistance applications, they are generally prone to replicating or extending offensive content provided in the input. In low-resource data regime, they can also lead to repetitive outputs. Usually, offensive content and repetitions are mitigated with post-hoc methods, including n-gram level blocklists, top-k and nucleus sampling. In this paper, we apply non-exact repetition suppression using token and sequence level unlikelihood loss, and further explore the framework of unlikelihood training objective in order to jointly endow the model with abilities to avoid generating offensive words and phrases from the beginning. Finally, with comprehensive experiments, we demonstrate that our proposed methods work exceptionally in controlling the repetition and content quality of LLM outputs.

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Joint Repetition Suppression and Content Moderation of Large Language Models (2304.10611v2)

Summary

Related Papers