An Approximate Dynamic Programming Algorithm for Monotone Value Functions

2015 (modified: 04 Nov 2022), Oper. Res. 2015, Readers: Everyone
Abstract: Many sequential decision problems can be formulated as Markov decision processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in som...