Decoding before aligning: Scale-Adaptive Early-Decoding Transformer for visual grounding

Published: 2025, Last Modified: 30 May 2026, Neurocomputing 2025, CC BY-SA 4.0