Learning to Score Behaviors for Guided Policy OptimizationDownload PDFOpen Website

2020 (modified: 12 May 2023)ICML 2020Readers: Everyone
Abstract: We introduce a new approach for comparing reinforcement learning policies, using Wasserstein distances (WDs) in a newly defined latent behavioral space. We show that by utilizing the dual formulati...
0 Replies

Loading