Informing Reinforcement Learning Agents by Grounding Language to Markov Decision Processes

Benjamin Adin Spiegel; Ziyi Yang; William Jurayj; Ben Bachmann; Stefanie Tellex; George Konidaris

Informing Reinforcement Learning Agents by Grounding Language to Markov Decision Processes

Benjamin Adin Spiegel, Ziyi Yang, William Jurayj, Ben Bachmann, Stefanie Tellex, George Konidaris

Published: 20 Jun 2024, Last Modified: 07 Aug 2024TAFM@RLC 2024EveryoneRevisionsBibTeXCC BY 4.0

Track Selection: Full paper track.

Keywords: RLang, Natural Language, NLP, Grounding, LLM

TL;DR: We use foundation models to ground natural language to components of MDPs.

Abstract: While significant efforts have been made to leverage natural language to accelerate reinforcement learning, utilizing diverse forms of language efficiently remains unsolved. Existing methods focus on mapping natural language to individual elements of MDPs such as reward functions or policies, but such approaches limit the scope of language they consider to make such mappings possible. We present an approach for leveraging general language advice by translating sentences to a grounded formal language for expressing information about *every* element of an MDP and its solution including policies, plans, reward functions, and transition functions. We also introduce a new model-based reinforcement learning algorithm, RLang-Dyna-Q, capable of leveraging all such advice, and demonstrate in two sets of experiments that grounding language to every element of an MDP leads to significant performance gains.

Submission Number: 5

Loading