3DAxisPrompt: Promoting the 3D grounding and reasoning in GPT-4o

Published: 01 Jan 2025, Last Modified: 12 Nov 2025Neurocomputing 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We propose a visual prompt scheme that can promote the 3D grounding ability in MLLMs.•The potential 3D visual prompts are comprehensively investigated.•Extensive experiments are conducted, including indoor and outdoor 3D grounding tasks.
Loading