Using Bisimulation for Policy Transfer in MDPs

Pablo Samuel Castro, Doina Precup

2010 (modified: 16 Jul 2019)AAAI 2010Readers: Everyone

Abstract: Much of the work on using Markov Decision Processes (MDPs) in artificial intelligence (AI) focuses on solving a single problem. However, AI agents often exist over a long period of time, during which they may be required to solve several related tasks. This type of scenario has motivated a significant amount of recent research in knowledge transfer methods for MDPs. The idea is to allow an agent to continue to re-use the expertise accumulated while solving past tasks over its lifetime (see Taylor & Stone, 2009, for a comprehensive survey).

0 Replies