OV-KFA: Open-vocabulary object detection via key feature alignment

Published: 2026, Last Modified: 11 Nov 2025Neurocomputing 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Core Problem Identification: We identify the fundamental alignment challenge between images and text in open-vocabulary object detection.•Bottleneck Adapter: We propose a lightweight plug-and-play module that distills key features and enhances cross-modal alignment.•Transferable Prompt: We develop a novel training paradigm that learns generalizable prompt representations without architectural modifications.•Performance Achievement: Our approach seamlessly improves open vocabulary detectors’ generalization ability while maintaining efficient inference.
Loading