Unifying Question Answering, Text Classification, and Regression via Span Extraction

Nitish Shirish Keskar; Bryan McCann; Caiming Xiong; Richard Socher

Unifying Question Answering, Text Classification, and Regression via Span Extraction

Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher

25 Sept 2019 (modified: 05 May 2023)ICLR 2020 Conference Blind SubmissionReaders: Everyone

Keywords: NLP, span-extraction, BERT

TL;DR: Question answering, regression and classification can be unified as span extraction with improved generality and performance.

Abstract: Even as pre-trained language encoders such as BERT are shared across many tasks, the output layers of question answering, text classification, and regression models are significantly different. Span decoders are frequently used for question answering, fixed-class, classification layers for text classification, and similarity-scoring layers for regression tasks. We show that this distinction is not necessary and that all three can be unified as span extraction. A unified, span-extraction approach leads to superior or comparable performance in supplementary supervised pre-trained, low-data, and multi-task learning experiments on several question answering, text classification, and regression benchmarks.

Original Pdf: pdf

7 Replies

Loading