Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

Kibeom Kim; Min Whoo Lee; Yoonsung Kim; JeHwan Ryu; Minsu Lee; Byoung-Tak Zhang

Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

Kibeom Kim, Min Whoo Lee, Yoonsung Kim, JeHwan Ryu, Minsu Lee, Byoung-Tak Zhang

Published: 09 Nov 2021, Last Modified: 04 May 2025NeurIPS 2021 PosterReaders: Everyone

Keywords: goal-aware, attention, reinforcement learning, multi-target environment, visual navigation, manipulation

Abstract: Learning in a multi-target environment without prior knowledge about the targets requires a large amount of samples and makes generalization difficult. To solve this problem, it is important to be able to discriminate targets through semantic understanding. In this paper, we propose goal-aware cross-entropy (GACE) loss, that can be utilized in a self-supervised way using auto-labeled goal states alongside reinforcement learning. Based on the loss, we then devise goal-discriminative attention networks (GDAN) which utilize the goal-relevant information to focus on the given instruction. We evaluate the proposed methods on visual navigation and robot arm manipulation tasks with multi-target environments and show that GDAN outperforms the state-of-the-art methods in terms of task success ratio, sample efficiency, and generalization. Additionally, qualitative analyses demonstrate that our proposed method can help the agent become aware of and focus on the given instruction clearly, promoting goal-directed behavior.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

TL;DR: We propose goal-aware cross-entropy loss and attention networks in multi-target environments.

Supplementary Material: pdf

Code: https://github.com/kibeomKim/GACE-GDAN

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/goal-aware-cross-entropy-for-multi-target/code)

15 Replies

Loading