Global Policy Construction in Modular Reinforcement LearningOpen Website

2015 (modified: 16 Jul 2019)AAAI 2015Readers: Everyone
Abstract: We propose a modular reinforcement learning algorithm which decomposes a Markov decision process into independent modules. Each module is trained using Sarsa(λ). We introduce three algorithms for forming global policy from modules policies, and demonstrate our results using a 2D grid world.
0 Replies

Loading