Abstract: An interactive robot framework accomplishes long-horizon task planning and can easily generalize to new goals or distinct tasks, even during execution. However, most traditional methods require predefined module designs, which makes them hard to generalize to different goals. Recent approaches based on large language models allow for more open-ended planning but often require heavy prompt engineering or domain-specific pretrained models. To tackle this, we propose a simple framework that achieves interactive task planning with language models by incorporating both high-level planning and low-level skill execution through function calling, leveraging pretrained vision models to ground the scene in language. We verify the robustness of our system on the real-world task of making milk tea drinks. Our system is able to generate novel high-level instructions for unseen objectives and successfully accomplishes user tasks. Furthermore, when the user sends a new request, our system is able to replan precisely based on the new request, the task guidelines, and the previously executed steps. Our approach is easy to adapt to different tasks by merely substituting the task guidelines, without the need for additional complex prompt engineering.
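The function-calling design described in the abstract can be illustrated with a minimal sketch: low-level skills are registered alongside JSON schemas that are advertised to the language model as callable tools, and the model's emitted calls are dispatched to the matching skill. Everything below is a hypothetical illustration under assumed names (the skills `pick` and `pour`, their schemas, and the `dispatch` helper), not the paper's actual interface.

```python
# Minimal sketch of exposing low-level robot skills to an LLM via function
# calling. Skill names, schemas, and the dispatch loop are hypothetical.
import json
from typing import Callable, Dict

# Hypothetical low-level skills the high-level planner can invoke.
def pick(object_name: str) -> str:
    """Pick up an object found in the language-grounded scene description."""
    return f"picked {object_name}"

def pour(source: str, target: str, amount_ml: int) -> str:
    """Pour a given amount from one container into another."""
    return f"poured {amount_ml} ml from {source} into {target}"

SKILLS: Dict[str, Callable[..., str]] = {"pick": pick, "pour": pour}

# JSON schemas advertised to the language model as callable tools, in the
# style used by common function-calling APIs.
TOOL_SCHEMAS = [
    {
        "name": "pick",
        "description": "Pick up a named object.",
        "parameters": {
            "type": "object",
            "properties": {"object_name": {"type": "string"}},
            "required": ["object_name"],
        },
    },
    {
        "name": "pour",
        "description": "Pour an amount (ml) from source into target.",
        "parameters": {
            "type": "object",
            "properties": {
                "source": {"type": "string"},
                "target": {"type": "string"},
                "amount_ml": {"type": "integer"},
            },
            "required": ["source", "target", "amount_ml"],
        },
    },
]

def dispatch(call: dict) -> str:
    """Route a function call emitted by the LLM to the matching skill."""
    fn = SKILLS[call["name"]]
    args = call["arguments"]
    if isinstance(args, str):  # many APIs return arguments as a JSON string
        args = json.loads(args)
    return fn(**args)

if __name__ == "__main__":
    # Simulated LLM output: one step of a milk-tea plan.
    llm_call = {
        "name": "pour",
        "arguments": json.dumps(
            {"source": "milk jug", "target": "cup", "amount_ml": 150}
        ),
    }
    print(dispatch(llm_call))  # -> poured 150 ml from milk jug into cup
```

Under this pattern, adapting the system to a new task would amount to swapping the task guidelines given to the planner and the registered skill set, which matches the abstract's claim that no additional prompt engineering is needed.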
Submission Length: Regular submission (no more than 12 pages of main content)
Previous TMLR Submission Url: https://openreview.net/forum?id=GAhGMttRIo
Changes Since Last Submission: Problem: "Fonts appear modified from template default."
Solution: We removed the "mathptmx" and "times" packages from the LaTeX file to resolve this issue.
Assigned Action Editor: ~Vikas_Sindhwani1
Submission Number: 3063