An Alternative Softmax Operator for Reinforcement LearningDownload PDFOpen Website

2017 (modified: 11 Nov 2022)ICML 2017Readers: Everyone
Abstract: A softmax operator applied to a set of values acts somewhat like the maximization function and somewhat like an average. In sequential decision making, softmax is often used in settings where it is...
0 Replies

Loading