LISA: Learning Interpretable Skill Abstractions from Language

Divyansh Garg; Skanda Vaidyanath; Kuno Kim; Jiaming Song; Stefano Ermon

LISA: Learning Interpretable Skill Abstractions from Language

Divyansh Garg, Skanda Vaidyanath, Kuno Kim, Jiaming Song, Stefano Ermon

Published: 31 Oct 2022, Last Modified: 13 Jan 2023NeurIPS 2022 AcceptReaders: Everyone

Keywords: Imitation Learning, Natural language processing, compositional representation learning

TL;DR: Learning interpretable, compositional representations for natural language imitation learning tasks.

Abstract: Learning policies that effectively utilize language instructions in complex, multi-task environments is an important problem in imitation learning. While it is possible to condition on the entire language instruction directly, such an approach could suffer from generalization issues. To encode complex instructions into skills that can generalize to unseen instructions, we propose Learning Interpretable Skill Abstractions (LISA), a hierarchical imitation learning framework that can learn diverse, interpretable skills from language-conditioned demonstrations. LISA uses vector quantization to learn discrete skill codes that are highly correlated with language instructions and the behavior of the learned policy. In navigation and robotic manipulation environments, LISA is able to outperform a strong non-hierarchical baseline in the low data regime and compose learned skills to solve tasks containing unseen long-range instructions. Our method demonstrates a more natural way to condition on language in sequential decision-making problems and achieve interpretable and controllable behavior with the learned skills.

Supplementary Material: pdf

18 Replies

Loading