Decoding before aligning: Scale-Adaptive Early-Decoding Transformer for visual grounding

Liuwu Li, Yi Cai, Jiexin Wang, Cantao Wu, Qingbao Huang, Qing Li

Published: 01 Jun 2025, Last Modified: 06 Jan 2026. Venue: Neurocomputing. License: CC BY-SA 4.0