Collecting Verified COVID-19 Question Answer PairsDownload PDF

Published: 05 Jul 2022, Last Modified: 24 May 2023NLP-COVID19-EMNLP PosterReaders: Everyone
Keywords: Question-Answering, COVID-19, Data Aggregation
TL;DR: Efforts to collect a verified COVID-19 question-answer dataset
Abstract: We release a dataset of over 2,100 COVID19 related Frequently asked Question-Answer pairs scraped from over 40 trusted websites. We include an additional 24, 000 questions pulled from online sources that have been aligned by experts with existing answered questions from our dataset. This paper describes our efforts in collecting the dataset and summarizes the resulting data. Our dataset is automatically updated daily and available at https://github.com/JHU-COVID-QA/ scraping-qas. So far, this data has been used to develop a chatbot providing users information about COVID-19. We encourage others to build analytics and tools upon this dataset as well.
6 Replies

Loading