MOGeo: Beyond One-to-One Cross-View Object Geo-localization
Abstract: Cross-View Object Geo-Localization (CVOGL) aims to locate an object of interest in a query image within a corresponding satellite image. Existing methods typically assume that the query image contains only a single object, which does not align with the complex, multi-object geo-localization requirements in real-world applications, making them unsuitable for practical scenarios. To bridge the gap between the realistc setting and existing task, we propose a new task, called Cross-View Multi-Object Geo-Localization (CVMOGL). To advance CVMOGL task, we first construct a benchmark, CMLocation, which includes two datasets: CMLocation-V1 and CMLocation-V2. Furthermore, we propose a novel cross view multi-object geo-localization method, MOGeo, and benchmark it against existing state-of-the-art methods. Extensive experiments are conducted under various application scenarios to validate the effectiveness of our method. The results demonstrate that cross-view geo-localization in the more realistic setting remains a challenging problem, encouraging further research in this area.
Loading