Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca (2309.08958v2)

Published 16 Sep 2023 in cs.CL and cs.AI

Abstract: Foundational LLMs can be instruction-tuned to perform open-domain question answering, facilitating applications like chat assistants. While such efforts are often carried out in a single language, we empirically analyze cost-efficient strategies for multilingual scenarios. Our study employs the Alpaca dataset and machine translations of it to form multilingual data, which is then used to tune LLMs through either low-rank adaptation or full-parameter training. Under a controlled computation budget, comparisons show that multilingual tuning is on par or better than tuning a model for each language. Furthermore, multilingual tuning with downsampled data can be as powerful and more robust. Our findings serve as a guide for expanding language support through instruction tuning.

Authors (6)

Pinzhen Chen (27 papers)
Shaoxiong Ji (39 papers)
Nikolay Bogoychev (17 papers)
Barry Haddow (59 papers)
Kenneth Heafield (24 papers)
Andrey Kutuzov (41 papers)

Citations (35)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/hplt_eu/status/1755913523375243540

https://twitter.com/hplt_eu/status/1755903261662437410

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca (2309.08958v2)

Summary

Related Papers

Tweets