FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models (2407.01046v2)

Published 1 Jul 2024 in cs.AI and cs.CL

Abstract: Fuzzy reasoning is vital due to the frequent use of imprecise information in daily contexts. However, the ability of current LLMs to handle such reasoning remains largely uncharted. In this paper, we introduce a new benchmark, FRoG, for fuzzy reasoning, featuring real-world mathematical word problems that incorporate generalized quantifiers. Our experimental findings reveal that fuzzy reasoning continues to pose significant challenges for LLMs. Moreover, we find that existing methods designed to enhance reasoning do not consistently improve performance in tasks involving fuzzy logic. Additionally, our results show an inverse scaling effect in the performance of LLMs on FRoG. Interestingly, we also demonstrate that strong mathematical reasoning skills are not necessarily indicative of success on our benchmark.
