Modeling Partially Observable Systems using Graph-Based Memory and Topological Priors

Steven D. Morad, Stephan Liwicki, Ryan Kortvelesy, Roberto Mecca, Amanda Prorok

2022 (modified: 25 Apr 2023)L4DC 2022Readers: Everyone

Abstract: Solving partially observable Markov decision processes (POMDPs) is critical when applying reinforcement learning to real-world problems, where agents have an incomplete view of the world. Recurrent...

0 Replies