Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed? (2312.12683v2)

Published 20 Dec 2023 in cs.CL

Abstract: The vast majority of today's LLMs are English-centric, having been pretrained predominantly on English text. Yet, in order to meet user expectations, models need to be able to respond appropriately in multiple languages once deployed in downstream applications. This requires strong cross-lingual transfer abilities. In this work, we investigate the minimal amount of multilinguality required during finetuning to elicit cross-lingual generalisation in English-centric LLMs. In experiments across four LLMs, we find that multilingual instruction tuning with as few as two to three languages is both necessary and sufficient to elicit effective cross-lingual generalisation, with the limiting factor being the degree to which a target language is seen during pretraining. Evaluations on five different tasks further reveal that multilingual instruction tuning is most beneficial for generative tasks that assume input/output language agreement, such as in chat settings, while being of less importance for highly structured classification-style tasks. Our code and data is available at https://github.com/ZurichNLP/multilingual-instruction-tuning.

References (60)

Citations (28)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - ZurichNLP/multilingual-instruction-tuning: Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" (23 stars)

Tweets

https://twitter.com/BlancheMinerva/status/1780553699422613996

https://twitter.com/Wenhao_NLP/status/1768217676088832340

https://twitter.com/1656575057630576641/status/1737852800045036011

https://twitter.com/RhadamisteX/status/1790663644684132501

Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed? (2312.12683v2)

Summary

Related Papers

GitHub

Tweets