everyone
since 29 May 2025">EveryoneRevisionsBibTeXCC BY 4.0
Abstract: Geospatial data analysis is heavily reliant on human interpretation of large-scale imagery which leads to constraints in scalability. This study evaluates whether mul-ti-modal models can assist in overhead image understanding by accurately interpreting imagery and automating workflows. A hybrid machine learning solution using Over-sightML (OSML)—an open-source, cloud-based framework—is assessed for its ability to improve geospatial workflows. OSML integrates state-of-the-art computer vision with generative AI capabilities and streamlines preprocessing and detection aggregation. Results indicate that combining domain-specific CV models with foundation models offers a scalable and efficient alternative to manual analysis workflows