Embracing Data Abundance

Ondrej Bajgar, Rudolf Kadlec and Jan Kleindienst

Feb 17, 2017 (modified: Feb 17, 2017) ICLR 2017 workshop submission readers: everyone
  • Abstract: There is a practically unlimited amount of natural language data available. Still, recent work in text comprehension has focused on datasets that are small relative to current computing possibilities. This article makes a case for the community to move to larger data and offers the BookTest dataset as a step in that direction.
  • Conflicts: ibm.com
  • Keywords: Transfer Learning, Semi-Supervised Learning, Natural Language Processing, Deep Learning
