Learning Portable Skills by Identifying Generalizing Features with an Attention-Based Ensemble

Anita Taosheng De Mello Koch; Zhiyuan Zhou; Akhil Bagaria; Haotian Fu; Cameron Allen; George Konidaris

Learning Portable Skills by Identifying Generalizing Features with an Attention-Based Ensemble

Anita Taosheng De Mello Koch, Zhiyuan Zhou, Akhil Bagaria, Haotian Fu, Cameron Allen, George Konidaris

Published: 01 Feb 2023, Last Modified: 13 Feb 2023Submitted to ICLR 2023Readers: Everyone

Keywords: Hierarchical reinforcement learning, Skill transfer, Ensembling

TL;DR: We learn a generalizing state feature for skill transfer using an attention-based ensemble.

Abstract: The ability to rapidly generalize is crucial for reinforcement learning to be practical in real-world tasks. However, generalization is complicated by the fact that, in many settings, some state features reliably support generalization while others do not. We consider the problem of learning generalizable policies and skills (in the form of options) by identifying feature sets that generalize across instances. We propose an attention-ensemble approach, where a collection of minimally overlapping feature masks is learned, each of which individually maximizes performance on the source instance. Subsequent tasks are instantiated using the ensemble, and transfer performance is used to update the estimated probability that each feature set will generalize in the future. We show that our approach leads to fast policy generalization for eight tasks in the Procgen benchmark. We then show its use in learning portable options in Montezuma's Revenge, where it is able to generalize skills learned in the first screen to the remainder of the game.

Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.

No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Submission Guidelines: Yes

Please Choose The Closest Area That Your Submission Falls Into: Reinforcement Learning (eg, decision and control, planning, hierarchical RL, robotics)

Supplementary Material: zip

10 Replies

Loading