Learning to Take Concurrent ActionsDownload PDFOpen Website

2002 (modified: 11 Nov 2022)NIPS 2002Readers: Everyone
Abstract: We investigate a general semi-Markov Decision Process (SMDP) framework for modeling concurrent decision making, where agents learn optimal plans over concurrent temporally extended actions. We introduce three types of parallel termination schemes { all, any and continue { and theoretically and experimentally compare them.
0 Replies

Loading