Learning When Not to Answer: a Ternary Reward Structure for Reinforcement Learning Based Question Answering

Fréderic Godin, Anjishnu Kumar, Arpit Mittal

2019 (modified: 12 Nov 2022)NAACL-HLT (2) 2019Readers: Everyone

Abstract: Fréderic Godin, Anjishnu Kumar, Arpit Mittal. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers). 2019.

0 Replies