Learning When Not to Answer: a Ternary Reward Structure for Reinforcement Learning Based Question Answering

Abstract: Fréderic Godin, Anjishnu Kumar, Arpit Mittal. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers). 2019.
0 Replies
Loading