Answering Science Exam Questions Using Query Reformulation with Background Knowledge

Ryan Musa; Xiaoyan Wang; Achille Fokoue; Nicholas Mattei; Maria Chang; Pavan Kapanipathi; Bassem Makni; Kartik Talamadupula; Michael Witbrock

Answering Science Exam Questions Using Query Reformulation with Background Knowledge

Ryan Musa, Xiaoyan Wang, Achille Fokoue, Nicholas Mattei, Maria Chang, Pavan Kapanipathi, Bassem Makni, Kartik Talamadupula, Michael Witbrock

Published: 25 Feb 2019, Last Modified: 22 Jun 2025AKBC 2019Readers: Everyone

Keywords: open-domain question answering, science question answering, multiple-choice question answering, passage retrieval, query reformulation

TL;DR: We explore how using background knowledge with query reformulation can help retrieve better supporting evidence when answering multiple-choice science questions.

Abstract: Open-domain question answering (QA) is an important problem in AI and NLP that is emerging as a bellwether for progress on the generalizability of AI methods and techniques. Much of the progress in open-domain QA systems has been realized through advances in information retrieval methods and corpus construction. In this paper, we focus on the recently introduced ARC Challenge dataset, which contains 2,590 multiple choice questions authored for grade-school science exams. These questions are selected to be the most challenging for current QA systems, and current state of the art performance is only slightly better than random chance. We present a system that reformulates a given question into queries that are used to retrieve supporting text from a large corpus of science-related text. Our rewriter is able to incorporate background knowledge from ConceptNet and -- in tandem with a generic textual entailment system trained on SciTail that identifies support in the retrieved results -- outperforms several strong baselines on the end-to-end QA task despite only being trained to identify essential terms in the original source question. We use a generalizable decision methodology over the retrieved evidence and answer candidates to select the best answer. By combining query reformulation, background knowledge, and textual entailment our system is able to outperform several strong baselines on the ARC dataset.

Archival Status: Archival

Subject Areas: Natural Language Processing, Question Answering

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/answering-science-exam-questions-using-query/code)

8 Replies

Loading