Keywords: NeRF, Natural Language, Grasping, Semantics
TL;DR: Task-oriented grasping with vision-language models in 3D using language embedded radiance fields
Abstract: Grasping objects by a specific subpart is often crucial for safety and for executing downstream tasks. We propose LERF-TOGO, Language Embedded Radiance Fields for Task-Oriented Grasping of Objects, which uses vision-language models zero-shot to output a grasp distribution over an object given a natural language query. To accomplish this, we first construct a LERF of the scene, which distills CLIP embeddings into a multi-scale 3D language field queryable with text. However, LERF has no sense of object boundaries, so its relevancy outputs often return incomplete activations over an object, which are insufficient for grasping. LERF-TOGO mitigates this lack of spatial grouping by extracting a 3D object mask via DINO features and then conditionally querying LERF on this mask to obtain a semantic distribution over the object, which is used to rank grasps from an off-the-shelf grasp planner. We evaluate LERF-TOGO’s ability to grasp task-oriented object parts on 31 physical objects, and find it selects grasps on the correct part in 81% of trials and grasps successfully in 69%. Code, data, appendix, and details are available at: lerftogo.github.io
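To make the conditional-querying step concrete, below is a minimal NumPy sketch of the pipeline the abstract describes: grow a 3D object mask from DINO feature similarity around the most object-relevant point, restrict the part-level CLIP relevancy to that mask, and rank candidate grasps by the masked relevancy near each grasp. All names (`rank_grasps`, `sim_thresh`, the array shapes) are illustrative assumptions, not the authors' API; the real system queries a trained LERF at multiple scales rather than operating on precomputed per-point arrays.

```python
# Hypothetical sketch of LERF-TOGO-style conditional querying, assuming
# relevancies and DINO features have already been rendered per 3D point.
import numpy as np

def rank_grasps(points, object_relevancy, part_relevancy, dino_feats,
                grasp_positions, sim_thresh=0.9):
    """Rank grasp candidates by part-level semantic relevancy.

    points:           (N, 3) 3D sample points from the radiance field
    object_relevancy: (N,)   CLIP relevancy for the object query (e.g. "mug")
    part_relevancy:   (N,)   CLIP relevancy for the part query (e.g. "handle")
    dino_feats:       (N, D) DINO features per point
    grasp_positions:  (G, 3) grasp centers from an off-the-shelf planner
    """
    # 1) Seed at the point most relevant to the object query.
    seed = np.argmax(object_relevancy)

    # 2) Grow a 3D object mask by DINO feature similarity to the seed
    #    (a simple stand-in for the paper's spatial grouping).
    seed_feat = dino_feats[seed] / np.linalg.norm(dino_feats[seed])
    feats = dino_feats / np.linalg.norm(dino_feats, axis=1, keepdims=True)
    object_mask = feats @ seed_feat > sim_thresh

    # 3) Condition the part query on the mask: zero relevancy outside it.
    masked_part = np.where(object_mask, part_relevancy, 0.0)

    # 4) Score each grasp by the relevancy of its nearest masked point.
    dists = np.linalg.norm(
        grasp_positions[:, None, :] - points[None, :, :], axis=-1)
    nearest = np.argmin(dists, axis=1)   # (G,) index of closest point
    scores = masked_part[nearest]

    return np.argsort(-scores)           # grasp indices, best first
```

Scoring each grasp by its single nearest point is the simplest proxy; averaging relevancy over all points within the gripper's closing region would be a natural refinement under the same assumptions.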
Student First Author: yes
Supplementary Material: zip
Instructions: I have read the instructions for authors (https://corl2023.org/instructions-for-authors/)
Website: https://lerftogo.github.io/
Publication Agreement: pdf