Learning When Not to Answer: a Ternary Reward Structure for Reinforcement Learning Based Question AnsweringDownload PDFOpen Website

2019 (modified: 12 Nov 2022)NAACL-HLT (2) 2019Readers: Everyone
Abstract: Fréderic Godin, Anjishnu Kumar, Arpit Mittal. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers). 2019.
0 Replies

Loading