An Approximate Dynamic Programming Algorithm for Monotone Value Functions

Daniel R. Jiang, Warren B. Powell

2015 (modified: 04 Nov 2022) · Oper. Res. 2015 · Readers: Everyone

Abstract: Many sequential decision problems can be formulated as Markov decision processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in som...
