**Article 10**

### Data Governance and Management Practices

The development of the Academic Compliance Monitor’s AI components follows a data governance framework tailored to the system’s purpose of detecting anomalous behavioral patterns during monitored examinations. The training, validation, and testing datasets comprise multimodal event data reflecting keyboard input dynamics and environmental audio signals, collected in exam halls under controlled research conditions at five EU-based universities. Collection followed documented procedures: audio and key event streams were recorded solely during active examination sessions, with prior institutional and participant consent that explicitly stated the intended use for academic integrity assessment.

The data governance strategy rests on the following design choices. Domain experts annotated behavioral instances as normal, suspicious, or confirmed cheating, using predetermined criteria developed in collaboration with educational integrity officers. The data pipeline includes systematic cleaning to remove corrupted or incomplete samples, and normalization procedures that align event timestamps and standardize audio signal features (e.g., Mel-frequency cepstral coefficients) for consistent model input. Data enrichment augmented the time series with contextual metadata such as exam type, seating position, and room occupancy to enhance model interpretability. Aggregation rules ensure that sequences maintain temporal coherence and that each sequence corresponds to a single exam session, without cross-session contamination.
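For illustration only, the timestamp alignment and per-session aggregation described above could be sketched as follows. The `Event` fields and the function `group_by_session` are hypothetical names introduced for this example; they are not taken from the system's actual pipeline.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    session_id: str   # hypothetical identifier for one exam session
    timestamp: float  # seconds, on the recording device's clock
    kind: str         # e.g. "key" or "audio"


def group_by_session(events):
    """Group events per session, sort them by time, and rebase timestamps to 0."""
    sessions = defaultdict(list)
    for ev in events:
        sessions[ev.session_id].append(ev)

    sequences = {}
    for session_id, evs in sessions.items():
        evs.sort(key=lambda e: e.timestamp)
        start = evs[0].timestamp
        # Each sequence only ever contains events from a single session.
        sequences[session_id] = [(round(e.timestamp - start, 6), e.kind) for e in evs]
    return sequences


# Illustrative events from two interleaved sessions.
events = [
    Event("s1", 100.5, "key"),
    Event("s2", 40.0, "audio"),
    Event("s1", 100.0, "audio"),
    Event("s2", 41.25, "key"),
]
sequences = group_by_session(events)
```

Keying every sequence by session identifier before any windowing is one simple way to rule out cross-session contamination by construction.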

Two underlying assumptions are stated explicitly: that keystroke dynamics and audio patterns correlate meaningfully with abnormal or unauthorized exam behaviors, and that the labeled examples are representative of both honest and dishonest conduct as observed in conventional exam settings. An internal assessment of dataset volume and diversity judged the data sufficient: 120,000 labeled event sequences across 15 examination contexts were used, covering a range of exam formats and room acoustics to maximize representativeness.

To identify and mitigate bias, subsets of the data were analyzed for disparities linked to contextual variables such as exam location and student demographics (anonymized age groups and language proficiencies). Statistical tests were employed to detect imbalances that might disproportionately flag behaviors of particular subgroups. Procedures to address detected biases include data resampling, incorporation of counterfactual synthetic examples generated by GAN-based augmentation, and calibration of decision thresholds in downstream classifiers to reduce false positives on protected groups. The data governance process explicitly tracks residual data gaps, such as limited representation of remote or hybrid exam settings, which are documented with plans for future data enrichment to ensure continued alignment with regulatory requirements.
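As a minimal sketch of the threshold calibration step, the example below computes a false positive rate per subgroup and selects the lowest threshold that keeps that rate within a target. The function names, the candidate thresholds, and the use of a separate threshold per group are assumptions made for illustration; the documentation above does not specify how calibration was implemented.

```python
def false_positive_rate(scores, labels, threshold):
    """FPR = flagged normal samples / all normal samples (label 0 = normal)."""
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not negatives:
        return 0.0
    return sum(s >= threshold for s in negatives) / len(negatives)


def calibrate_threshold(scores, labels, max_fpr, candidates):
    """Return the lowest candidate threshold whose FPR does not exceed max_fpr."""
    for t in sorted(candidates):
        if false_positive_rate(scores, labels, t) <= max_fpr:
            return t
    # No candidate meets the target: fall back to the strictest one.
    return max(candidates)


# Illustrative anomaly scores for two anonymized subgroups (1 = anomalous).
group_a = ([0.1, 0.2, 0.6, 0.9], [0, 0, 0, 1])
group_b = ([0.1, 0.3, 0.4, 0.8], [0, 0, 0, 1])

fpr_a = false_positive_rate(*group_a, threshold=0.5)
fpr_b = false_positive_rate(*group_b, threshold=0.5)
threshold_a = calibrate_threshold(*group_a, max_fpr=0.0, candidates=[0.5, 0.6, 0.7])
```

With these illustrative values, a shared threshold of 0.5 flags one of three normal samples in group A and none in group B; raising the threshold for group A to 0.7 removes that disparity.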

### Dataset Quality and Representativeness

The training, validation, and testing datasets were curated to fulfill stringent quality criteria consistent with the system’s intended use in educational environments. Data completeness was verified via automated scripts that flagged missing keystroke events and audio dropouts; these were either corrected or excluded. Error rates were minimized through cross-checks against exam logs and manual auditing of 5% of samples, resulting in an estimated data error rate below 0.3%.
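The automated completeness check for audio dropouts could, for example, look for gaps between consecutive frame timestamps that exceed the expected frame interval. The sketch below is illustrative: `find_dropouts`, the `tolerance` parameter, and the sample values are assumptions, not the scripts actually used.

```python
def find_dropouts(timestamps, expected_interval, tolerance=0.5):
    """Return (previous, current) timestamp pairs separated by an oversized gap.

    Assumes `timestamps` is sorted in ascending order, in seconds.
    """
    limit = expected_interval * (1 + tolerance)
    gaps = []
    for prev, curr in zip(timestamps, timestamps[1:]):
        if curr - prev > limit:
            gaps.append((prev, curr))
    return gaps


# Illustrative audio frame timestamps with one missing stretch.
frame_times = [0.0, 0.5, 1.0, 2.5, 3.0]
dropouts = find_dropouts(frame_times, expected_interval=0.5)
```

Sessions with a non-empty result would then be routed to correction or exclusion, in line with the handling described above.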

Representativeness was pursued by intentionally sampling diverse exam formats (written, computer-based, oral) across multiple EU member states to reflect geographical and contextual variability as required by the intended purpose. For instance, the acoustic data account for differences in room configurations and background noise profiles across urban and rural institutions. The student participants are balanced by age and academic level, which reduces the risk that classifier decisions are skewed toward particular user groups.

The statistical properties of the datasets were assessed through distributional analyses. Feature distributions (e.g., keystroke latencies, audio spectral features) were compared across demographic and contextual subgroups using Kolmogorov-Smirnov tests to confirm statistical similarity, satisfying the requirement for representative sampling at the aggregated dataset level. Performance benchmarks on validation data show precision and recall above 92% in detecting anomalous behavior, and these figures remain stable across heterogeneous exam conditions.
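To make the subgroup comparison concrete, the following sketch computes the two-sample Kolmogorov-Smirnov statistic, the largest absolute gap between two empirical cumulative distribution functions, together with the standard large-sample critical value. It uses only the Python standard library. The function names and the latency values are illustrative assumptions; the tooling actually used for the analysis is not stated in this documentation.

```python
import math
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic D: max absolute gap between the empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    d = 0.0
    # The maximum gap is always attained at one of the observed values.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d


def ks_critical_value(n, m, alpha=0.05):
    """Large-sample approximation of the threshold above which D is significant."""
    c = math.sqrt(-0.5 * math.log(alpha / 2))
    return c * math.sqrt((n + m) / (n * m))


# Illustrative keystroke latencies (ms) for two subgroups.
latencies_a = [100, 120, 140, 160]
latencies_b = [110, 130, 150, 170]
d = ks_statistic(latencies_a, latencies_b)  # 0.25 for these values
significant = d > ks_critical_value(len(latencies_a), len(latencies_b))
```

A statistic below the critical value means the test found no evidence of a difference at the chosen significance level. With small subgroups this is weak evidence of similarity, so subgroup sample sizes should be reported alongside the result.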

### Adaptation to Geographic, Contextual, and Functional Settings

Consideration was given to specific geographical and contextual elements relevant to the AI system’s deployment settings. Data collection and modeling accounted for local environmental variables affecting input modalities, such as language-dependent keystroke patterns, room acoustic variations influenced by building materials common to different regions, and exam procedural differences among institutions. These characteristics informed both feature engineering (e.g., linguistic normalization of input data) and model architecture decisions, including the hybrid approach integrating both Random Forest classifiers for tabular data and sequence-based Recurrent Neural Networks (RNNs) for temporal dependencies.

Functional contextualization involved aligning the system with the academic calendar and examination protocols, embedding parameters such as time limits and permitted behaviors into system logic and data interpretation. This keeps model predictions consistent with realistic exam scenarios and reduces spurious alerts. The datasets thus capture the functional environment of monitored exams, supporting reliable generalization within the intended operational context.
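One way protocol parameters could be applied to model output is to discard alerts that fall outside the exam window or that concern behaviors the protocol permits. The sketch below is hypothetical: `ExamProtocol`, `filter_alerts`, and the behavior labels are invented for illustration and do not describe the system's actual logic.

```python
from dataclasses import dataclass, field


@dataclass
class ExamProtocol:
    duration_s: float                            # time limit of the exam, in seconds
    permitted: set = field(default_factory=set)  # behaviors the protocol allows


def filter_alerts(alerts, protocol):
    """Keep alerts raised within the exam window for non-permitted behaviors.

    `alerts` is a list of (seconds_since_start, behavior) tuples.
    """
    return [
        (t, behavior)
        for t, behavior in alerts
        if 0 <= t <= protocol.duration_s and behavior not in protocol.permitted
    ]


protocol = ExamProtocol(duration_s=3600, permitted={"calculator_use"})
alerts = [(120, "speech"), (300, "calculator_use"), (4000, "speech")]
kept = filter_alerts(alerts, protocol)
```

In this example only the first alert is kept: the second concerns a permitted behavior and the third occurs after the time limit.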

### Processing of Special Categories of Data for Bias Detection

The Academic Compliance Monitor’s development did not require the processing of special categories of personal data (e.g., racial or ethnic origin, political opinions, religious beliefs) as defined under the Regulation. Bias detection and correction therefore relied exclusively on pseudonymized behavioral data and non-sensitive metadata. Because no sensitive personal data were processed for bias mitigation, the exceptional safeguards required for such processing under EU regulations did not apply.

Nonetheless, data security measures align with best practices mandated for any personal data processing: strict access controls restrict dataset access to a limited group of authorized personnel; all datasets are stored on encrypted servers with role-based permissions; data handling is logged comprehensively, supporting full audit trails. The system architecture enforces data minimization and ensures deletion of any temporary copies immediately after model training or bias assessment tasks are completed.

---

This documentation substantiates a rigorous, multi-layered approach to dataset handling, quality assurance, contextualization, and bias mitigation throughout the Academic Compliance Monitor’s AI model training lifecycle, supporting a thorough assessment against Article 10 requirements.