**Article 10**

**Data Governance and Management Practices**

The development of Priority Response Analytics followed comprehensive data governance and management protocols tailored to its purpose as a high-risk AI system. Design decisions prioritized the integration of a Gradient Boosted Decision Tree (GBDT) for structured incident data combined with a Transformer encoder for unstructured textual dispatch notes to capture complementary data modalities effectively. Data provenance was rigorously documented: structured incident data originated from anonymized historical emergency call logs obtained under explicit data-sharing agreements with multiple European emergency service agencies, collected originally for emergency response optimization. Textual dispatch notes were sourced from archived operator communications, having undergone thorough pseudonymization to protect personal identifiers.

Data-preparation operations included multi-stage cleaning to resolve inconsistencies in incident coding, time-stamping errors, and duplicate entries. Annotation involved domain experts labelling incident urgency levels based on standardized emergency severity criteria, achieving an inter-annotator agreement (Cohen’s kappa) of 0.87 across a 50,000-sample corpus. Enrichment steps incorporated contextual metadata such as geographical and temporal information aligned with emergency response zones, while data aggregation consolidated records at daily and incident-type granularities to support stratified model evaluation. Formulated assumptions clarified that numerical indicators correspond to incident descriptors, while textual data convey operator assessment nuances, jointly representing the real-world urgency landscape.

A systematic assessment of data quantity and suitability was conducted, confirming over 120,000 structured incident records spanning five years, supplemented by 70,000 corresponding textual dispatch entries, which met volume requirements for robust model training and validation. Statistical analysis ensured representativeness across incident types, geographical regions in the EU, and temporal distributions. Known potential biases—such as underreporting in rural districts and linguistic variations in textual data—were examined using fairness metrics including demographic parity and equalized odds stratified by region and emergency type.

Identification of bias triggered implementation of detection and mitigation measures: bias was monitored through dedicated modules performing continuous evaluation of model output skewness relative to data subgroups. Mitigation involved re-weighting samples from underrepresented rural regions and semantic normalization techniques to harmonize dialectical variations in text. Synthetic minority oversampling (SMOTE) was employed selectively to enhance minority class representation without compromising data integrity.

Identified data gaps, particularly limited samples from newly integrated dispatch centers, were documented, and addressed by incremental dataset updates and supervised active learning cycles enabling the model to adapt to new operational contexts. This approach ensured ongoing compliance with the regulation’s requirements for data quality and relevance while preserving safety and fundamental rights safeguards.

**Relevance, Representativeness, and Data Quality**

Training, validation, and testing datasets were curated to maximize relevance to Priority Response Analytics’ intended emergency prioritization function. The datasets collectively achieved over 98% completeness, with error rates below 0.5% as verified by automated validation scripts and cross-checked manual audits. Representativeness was confirmed through stratified sampling design, covering 12 distinct EU regions and encompassing a broad spectrum of emergency categories (fire, medical, police), including common and rare incident types.

Error detection involved script-based checks for anomalous values, temporal inconsistencies, and missing fields. Data integrity was further assured by manual spot-checks of 2,000 random records per data type. Statistical properties closely matched operational deployment environments, including incident frequency distributions and call-response latency patterns, enabling realistic performance estimation under production conditions.

**Contextual and Geographic Considerations**

The dataset accounted for geographical, contextual, behavioural, and functional specificities central to the AI system’s operational use. Regional dispatch center practices and emergency protocols were embedded into data features, such as response time zones and priority threshold calibrations, to reflect local operational realities. Behavioral contextualization considered linguistic patterns in call transcripts and dispatcher commentaries, leveraging natural language processing techniques to factor these into priority scoring.

Temporal context was incorporated by aligning data with seasonal emergency patterns (e.g., wildfire seasons, winter weather incidents). This granular contextualization supported the model’s adaptive prioritization capability, ensuring outputs were tuned to the precise setting of deployment.

**Processing of Special Categories of Personal Data**

Priority Response Analytics did not require processing special categories of personal data for bias detection or correction. Demographic attributes and sensitive personal identifiers were excluded during data preprocessing, relying instead on pseudonymized and aggregated data that preserved privacy and complied with Regulation (EU) 2016/679 (GDPR). Where any special categories of data were encountered in source datasets, strict technical safeguards—pseudonymisation, access controls, and encryption—were already enforced under data sharing agreements to prevent unauthorized access or misuse. Consequently, the processing pipeline neither recorded nor transmitted special categories of personal data, negating the need for additional safeguards under Article 10(5).

**Testing Data Sets for Non-Training Components**

While Priority Response Analytics employs model training techniques, testing datasets used for performance validation and stress testing extended beyond strictly training data. These included out-of-distribution samples drawn from newly integrated emergency centers and simulated incident scenarios generated through domain-informed scenario synthesis. Testing datasets adhered to the same governance and quality standards as training and validation sets, ensuring rigorous evaluation of the system’s robustness and accuracy prior to deployment. All test data properties mirrored or exceeded those of operational data, facilitating realistic assessment of system performance under diverse emergency conditions.