A Novel Evaluation Framework for 15-Minute City Using Satellite Imagery

Published: 03 Nov 2025, Last Modified: 12 Nov 2025ACM SIGSPATIAL 2025EveryoneRevisionsCC BY 4.0
Abstract: The 15-minute city (15MC) is an urban planning concept that promotes sustainable and inclusive cities where residents can access essential services within a short amount of time (i.e., 15 minutes). However, evaluating 15MC compliance in hyper-dense cities remains challenging due to: (1) traditional manual assessments that are resource-intensive and difficult to scale, and (2) POI-based metrics that suffer from data unavailability and lack of spatial contexts. In this paper, we propose a novel evaluation framework that directly assesses 15MC compliance from geospatial imagery in three stages: image pre-processing, representation learning, and instance aggregation. To validate the framework, we have constructed a new dataset of 2,794 residential areas in Seoul, pairing high-resolution geospatial imagery with functional urban labels. Furthermore, we have developed a model, GeoTwin-MIL, on the basis of the proposed framework. The model includes two key components: (1) cross-modal contrastive learning that aligns satellite and map representations to capture both morphological (building density) and topological (road networks) features, enabling robust inference using only satellite images, and (2) multiple instance learning to efficiently aggregate geospatial details while detecting localized urban functions within high-resolution imagery. The experimental results obtained from various evaluation settings show that GeoTwin-MIL significantly outperforms single-modality approaches or vision baselines, validating the integrative effectiveness of the two key components and supporting the transferability of the model without POI dependencies. The code is available at \url{https://github.com/20243439/geotwin_mil.git}.
Loading