**Article 10**

### Data Governance and Management Practices

The Emergency Dispatch Prioritization Engine was developed following an established data governance framework aligned with the intended operational context of municipal emergency response coordination. The training data comprised approximately 5 million anonymized emergency dispatch records spanning a 15-year period across multiple mid-sized and large metropolitan areas in the EU. Data collection sourced from historical dispatch logs, 911 call transcripts, geographic information system (GIS) layers, and sensor feeds aggregated by municipal authorities. The original purpose of these data was operational emergency response and public safety record-keeping.

Data preparation included multi-stage cleansing, deduplication, and normalization pipelines applied to structured event data and unstructured text annotations from call transcripts. Geographic and temporal labelling was verified against official census and urban infrastructure datasets to preserve spatial and temporal coherence. Despite these measures, analysis revealed entrenched prioritization patterns in historical dispatch decisions that correlated strongly with neighborhood socioeconomic indicators.

Domain assumptions underpinning the data intended to reflect typical emergency incident severity, urgency, and resource availability over time, codified into a priority score used as the supervised learning target. Validation data sets were segmented chronologically and geographically to reflect diverse urban contexts and temporal trends.

Explicit bias examinations were conducted employing statistical parity difference and disparate impact ratio metrics across socioeconomic strata, derived from publicly available demographic data layered onto dispatch origin locations. This revealed a consistent under-prioritization signal associated with marginalized communities characterized by lower median income and higher unemployment rates.

### Bias Identification and Remediation Measures

The provider identified the embedded prioritization bias through a comprehensive fairness audit executed pre-deployment, utilizing intersectional subgroup analysis. The audit found that neighborhoods in the bottom socioeconomic quartile experienced up to a 22% reduction in predicted priority rankings relative to comparable severity incidents in higher status areas.

In response, the provider implemented targeted corrective measures limited to outlier filtering during data preprocessing. Specifically, data instances with extreme priority scores disproportionally favoring affluent neighborhoods were clipped to reduce skew, using a top and bottom 1% outlier removal protocol affecting approximately 45,000 records. This aimed to mitigate the most egregious cases of bias while retaining the original distribution’s overall structural characteristics.

No further structural interventions, such as re-weighting, adversarial debiasing, or synthetic data augmentation, were undertaken. Justification for this approach emphasized preservation of the historical operational context embedded within training data for interpretability and traceability. This decision was documented with reference to constraints in data availability, impact on model performance trade-offs, and assessment that residual bias fell within acceptable risk thresholds as determined by initial internal risk assessments.

### Dataset Representativeness and Suitability

The training, validation, and testing datasets collectively represented a broad spectrum of urban emergency scenarios, including fire, medical, and law-enforcement incidents, enabling multimodal model training consistent with the system’s hybrid CNN-LSTM architecture. The CNN component ingested high-resolution spatial sensor and GIS image mosaics (averaging 0.5m spatial resolution, captured from municipal sensor networks), while the LSTM processed time-series event logs and incident call metadata.

Despite encompassing a geographically and temporally diverse dataset, systematic disparities in incident prioritization patterns linked to socioeconomic factors indicate a representativeness limitation concerning equitable emergency response outcomes. The datasets were otherwise rigorously curated for completeness and absence of systemic recording errors, with an estimated error rate in raw data ingestion below 0.3%, verified via anomaly detection pipelines and manual spot audits.

### Consideration of Contextual and Functional Characteristics

The dataset design took into account urban contextual variables including neighborhood delineations, infrastructural layouts, and temporal emergency frequency cycles linked to known socio-demographic trends. However, these contextual variables co-existed with embedded decision biases reflected in historical dispatch prioritization protocols that were not originally designed to be equitable across communities.

Processing did not specifically adjust or augment data to counteract these embedded social-economic biases as part of the dataset construction due to data provenance constraints and operational continuity requirements. As such, the AI system’s outputs maintain dependencies on these underlying contextual disparities.

### Processing of Special Categories of Personal Data

The system development did not involve processing special categories of personal data (such as racial or ethnic origin, political opinions, or health data beyond anonymized incident descriptions), as prioritization bias detection relied on publicly accessible aggregate socioeconomic indicators and spatial proxies. Consequently, no exceptional processing safeguards as detailed in Article 10(5) were invoked.

Access to all training data remained restricted under documented confidentiality protocols, with pseudonymization applied to any potentially identifying data elements to prevent misuse while maintaining analytical utility.

### Documentation of Data Gaps and Limitations

Provider documentation transparently records the identified bias and the limited corrective actions applied, acknowledging that significant structural biases reflecting historical dispatch practices remain embedded within the training datasets. This includes recognition of data gaps related to underreporting of incidents in marginalized communities and variability in call center responses over the training horizon.

Plans for future data collection efforts and model re-training incorporate potential strategies for structural bias mitigation, such as the integration of synthetic minority oversampling, re-weighting schemes, and collaboration with social science experts to refine fairness objectives.

Current compliance materials thus reflect a provisional state of bias management rooted primarily in outlier filtering, with ongoing monitoring of AI decision outputs planned to detect potential unintended impacts on protected groups throughout operational use.