DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion FramesDownload PDFOpen Website

2020 (modified: 30 Oct 2022)ICLR 2020Readers: Everyone
Abstract: We present Decentralized Distributed Proximal Policy Optimization (DD-PPO), a method for distributed reinforcement learning in resource-intensive simulated environments. DD-PPO is distributed (uses...
0 Replies

Loading