Pre-trained Language Model for Biomedical Question Answering (1909.08229v1)

Published 18 Sep 2019 in cs.CL

Abstract: The recent success of question answering systems is largely attributed to pre-trained language models. However, as language models are mostly pre-trained on general domain corpora such as Wikipedia, they often have difficulty in understanding biomedical questions. In this paper, we investigate the performance of BioBERT, a pre-trained biomedical language model, in answering biomedical questions including factoid, list, and yes/no type questions. BioBERT uses almost the same structure across various question types and achieved the best performance in the 7th BioASQ Challenge (Task 7b, Phase B). BioBERT pre-trained on SQuAD or SQuAD 2.0 easily outperformed previous state-of-the-art models. BioBERT obtains the best performance when it uses the appropriate pre-/post-processing strategies for questions, passages, and answers.
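
To make the extractive-QA setup concrete, here is a minimal sketch (not the authors' released code) of pairing a BioBERT checkpoint with a span-prediction head using the HuggingFace transformers library. The checkpoint name and the example question/passage are illustrative assumptions, and the QA head is randomly initialized until fine-tuned on SQuAD or BioASQ, so its predictions here are untrained.

```python
# Minimal sketch: BioBERT with an extractive QA head (start/end span prediction).
# Checkpoint name "dmis-lab/biobert-base-cased-v1.1" is an assumption; the QA
# head on top is untrained until fine-tuned on SQuAD/BioASQ.
import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

model_name = "dmis-lab/biobert-base-cased-v1.1"  # assumed BioBERT checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForQuestionAnswering.from_pretrained(model_name)

# Illustrative factoid-style biomedical question and context passage.
question = "Which gene is associated with cystic fibrosis?"
passage = ("Cystic fibrosis is caused by mutations in the CFTR gene, "
           "which encodes a chloride channel.")

inputs = tokenizer(question, passage, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Extractive QA: select the answer span with the highest start/end logits.
start = torch.argmax(outputs.start_logits)
end = torch.argmax(outputs.end_logits) + 1
answer = tokenizer.decode(inputs["input_ids"][0][start:end])
print(answer)
```

The same model structure serves factoid and list questions (span extraction), while yes/no questions would swap the span head for a binary classifier over the pooled representation.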

Citations (84)
