Neural-progressive hedging: Enforcing constraints in reinforcement learning with stochastic programming

Abstract: We propose a framework, called neural-progressive hedging (NP), that leverages stochastic programming during the online phase of executing a reinforcement learning (RL) policy. The goal is to ensur...
0 Replies
Loading