PALRACE: Reading Comprehension Dataset with Human Data and Labeled Rationales (2106.12373v2)

Published 23 Jun 2021 in cs.CL and cs.AI

Abstract: Pre-trained LLMs achieve high performance on machine reading comprehension (MRC) tasks, but their results are hard to explain. An appealing approach to making models explainable is to provide rationales for their decisions. To investigate whether human rationales can further improve current models and to facilitate supervised learning of human rationales, we present PALRACE (Pruned And Labeled RACE), a new MRC dataset with human-labeled rationales for 800 passages selected from the RACE dataset. We further classified the questions for each passage into 6 types. Each passage was read by at least 26 human readers, who labeled their rationales for answering the question. We demonstrate that models such as RoBERTa-large outperform human readers on all 6 types of questions, including inference questions, but that their performance can be further improved when they have access to the human rationales. Simpler models and pre-trained models that are not fine-tuned on the task benefit more from human rationales, and their performance can be boosted by more than 30% by rationales. With access to human rationales, a simple model based on the GloVe word embedding can reach the performance of BERT-base.