Eliciting Better Multilingual Structured Reasoning from LLMs through Code (2403.02567v2)

Published 5 Mar 2024 in cs.CL and cs.AI

Abstract: The development of LLMs (LLM) has shown progress on reasoning, though studies have largely considered either English or simple reasoning tasks. To address this, we introduce a multilingual structured reasoning and explanation dataset, termed xSTREET, that covers four tasks across six languages. xSTREET exposes a gap in base LLM performance between English and non-English reasoning tasks. We then propose two methods to remedy this gap, building on the insight that LLMs trained on code are better reasoners. First, at training time, we augment a code dataset with multilingual comments using machine translation while keeping program code as-is. Second, at inference time, we bridge the gap between training and inference by employing a prompt structure that incorporates step-by-step code primitives to derive new facts and find a solution. Our methods show improved multilingual performance on xSTREET, most notably on the scientific commonsense reasoning subtask. Furthermore, the models show no regression on non-reasoning tasks, thus demonstrating our techniques maintain general-purpose abilities.

References (23)

Citations (2)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/LLMSherpa/status/1844033614050394422

https://twitter.com/LLMSherpa/status/1843020283172864503

https://twitter.com/knishimae0531/status/1806526460280357225

https://twitter.com/calculito/status/1843872992343765250

HackerNews

Eliciting Better Multilingual Structured Reasoning from LLMs Through Code (2 points, 0 comments)

Eliciting Better Multilingual Structured Reasoning from LLMs through Code (2403.02567v2)

Summary

Related Papers

Tweets

HackerNews