Exploring the Maze of Multilingual Modeling (2310.05404v2)

Published 9 Oct 2023 in cs.CL

Abstract: Multilingual LLMs have gained significant attention in recent years, enabling the development of applications that meet diverse linguistic contexts. In this paper, we present a comprehensive evaluation of three popular multilingual LLMs: mBERT, XLM-R, and GPT-3. We assess their performance across a diverse set of languages, with a focus on understanding the impact of resource availability (general and model-specific), language family, script type, and word order on model performance, under two distinct tasks - text classification and text generation. Our findings reveal that while the amount of language-specific pretraining data plays a crucial role in model performance, we also identify other factors such as general resource availability, language family, and script type, as important features. We hope that our study contributes to a deeper understanding of multilingual LLMs to enhance their performance across languages and linguistic contexts.

References (29)

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Exploring the Maze of Multilingual Modeling (2310.05404v2)

Summary

Related Papers