Deep Inverse Reinforcement Learning via Adversarial One-Class Classification

Daiko Kishikawa; Sachiyo Arai

Deep Inverse Reinforcement Learning via Adversarial One-Class Classification

Daiko Kishikawa, Sachiyo Arai

Published: 28 Jan 2022, Last Modified: 13 Feb 2023ICLR 2022 SubmittedReaders: Everyone

Keywords: inverse reinforcement learning, one-class classification

Abstract: Traditional inverse reinforcement learning (IRL) methods require a loop to find the optimal policy for each reward update (called an inner loop), resulting in very time-consuming reward estimation. In contrast, classification-based IRL methods, which have been studied recently, do not require an inner loop and estimate rewards quickly, although it is difficult to prepare an appropriate baseline corresponding to the expert trajectory. In this study, we introduced adversarial one-class classification into the classification-based IRL framework, and consequently developed a novel IRL method that requires only expert trajectories. We experimentally verified that the developed method can achieve the same performance as existing methods.

One-sentence Summary: A novel deep inverse reinforcement learning method was proposed that requires only expert trajectories via adversarial one-class classification.

Supplementary Material: zip

7 Replies

Loading