GATS:
http://download.tensorflow.org/data/questions-words.txt

BATS:
https://vecto.space/

Wikipedia corpus:
https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2