Generalized Point Based Value Iteration for Interactive POMDPs

Prashant Doshi, Dennis Perez

2008 (modified: 16 Jul 2019)AAAI 2008Readers: Everyone

Abstract: We develop a point based method for solving finitely nested interactive POMDPs approximately. Analogously to point based value iteration (PBVI) in POMDPs, we maintain a set of belief points and form value functions composed of those value vectors that are optimal at these points. However, as we focus on muItiagent settings, the beliefs are nested and computation of the value vectors relies on predicted actions of others. Consequently, we develop a novel interactive gen eralization of PBVI applicable to muItiagent settings.

0 Replies