PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos

2018 (modified: 11 Nov 2022), ICML 2018, Readers: Everyone
Abstract: Previously, the exploding gradient problem has been explained to be central in deep learning and model-based reinforcement learning, because it causes numerical issues and instability in optimizati...