
Over-Reasoning and Redundant Calculation of Large Language Models (2401.11467v2)

Published 21 Jan 2024 in cs.CL

Abstract: LLMs can solve problems step-by-step. While this chain-of-thought (CoT) reasoning boosts LLMs' performance, it is unclear whether LLMs "know" when to use CoT and whether that reasoning is always necessary to answer the question. This paper shows that LLMs tend to generate redundant calculations and reasoning on GSM8K-Zero, a manually constructed math QA dataset. GSM8K-Zero is constructed so that its questions can be answered without any calculation, yet LLMs, including the Llama-2 models and Claude-2, tend to produce lengthy and unnecessary calculations when answering them. We also conduct experiments to explain why LLMs generate redundant calculations and reasoning. GSM8K-Zero is publicly available at https://github.com/d223302/Over-Reasoning-of-LLMs and https://huggingface.co/datasets/dcml0714/GSM8K-Zero.
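To make the phenomenon concrete, here is a minimal sketch of how one might flag redundant calculation in a model's response to a GSM8K-Zero-style question. The question text, the detection regex, and the helper name are illustrative assumptions, not the paper's actual construction or evaluation procedure:

```python
import re

def contains_calculation(response: str) -> bool:
    """Heuristic (illustrative, not the paper's metric): flag responses
    that include an arithmetic expression such as '7 + 0 = 7', even
    though the question already states its answer directly."""
    return re.search(r"\d+\s*[+\-*/x×]\s*\d+\s*=", response) is not None

# A GSM8K-Zero-style question states its answer directly:
question = "Jane has 7 apples. How many apples does Jane have?"

direct = "Jane has 7 apples."
redundant = "Jane starts with 7 apples. 7 + 0 = 7, so she has 7 apples."

print(contains_calculation(direct))     # False: no calculation needed or present
print(contains_calculation(redundant))  # True: unnecessary calculation detected
```

The point of the dataset is that the second, calculation-laden answer style is what LLMs tend to produce even when the first suffices.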
