Neural-progressive hedging: Enforcing constraints in reinforcement learning with stochastic programming

Supriyo Ghosh, Laura Wynter, Shiau Hong Lim, Duc Thien Nguyen

2022 (modified: 10 Nov 2022)UAI 2022Readers: Everyone

Abstract: We propose a framework, called neural-progressive hedging (NP), that leverages stochastic programming during the online phase of executing a reinforcement learning (RL) policy. The goal is to ensur...

0 Replies