Collecting Verified COVID-19 Question Answer Pairs

Aug 12, 2020 (edited Oct 09, 2020) | EMNLP 2020 Workshop NLP-COVID Submission | Readers: Everyone
  • Keywords: Question-Answering, COVID-19, Data Aggregation
  • TL;DR: Efforts to collect a verified COVID-19 question-answer dataset
  • Abstract: We release a dataset of over 2,100 COVID-19-related frequently asked question-answer pairs scraped from over 40 trusted websites. We include an additional 24,000 questions pulled from online sources that experts have aligned with existing answered questions from our dataset. This paper describes our efforts in collecting the dataset and summarizes the resulting data. Our dataset is automatically updated daily and is available at https://github.com/JHU-COVID-QA/scraping-qas. So far, this data has been used to develop a chatbot that provides users with information about COVID-19. We encourage others to build analytics and tools upon this dataset as well.