**Article 10**

### Data Governance and Management Practices

The development of the Academic Compliance Monitor relied on training, validation, and testing data sets curated according to structured data governance procedures tailored to the AI system’s exam cheating detection purpose. The provider documented the design choices underpinning the hybrid architecture comprising Random Forest classifiers and Recurrent Neural Networks (RNNs), selecting data modalities capturing keystroke dynamics and ambient environmental audio patterns to form temporal sequences suitable for anomaly detection.

Data provenance for the behavioral datasets was traced to user experience research contexts on online educational platforms, originally intended for usability studies rather than exam integrity monitoring. This origin is explicitly acknowledged in system records which detail that no explicit reuse agreements or documented consents for repurposing behavioral data towards cheating detection were established at the collection stage. While the data encompasses time-stamped inputs from thousands of learners across diverse European institutions collected between 2020 and 2023, the reapplication of these datasets reflects a decision balancing dataset suitability for modeling behavioral anomalies against limitations in ethical consent frameworks.

Data preparation processes included extensive cleaning and enhancement operations: temporal alignment of multimodal streams was applied to synchronize keystroke and audio events; annotation efforts labelled segments corresponding to known behavioral patterns flagged by domain experts; and noise filtering was employed to reduce environmental audio interference. The provider formulated assumptions that the behavioral data adequately capture indirect markers of collusion or unauthorized aid, representing proxies correlated with breach events in controlled environments, even though the data were not originally gathered for this intent. The system’s model training incorporated data splitting strategies to ensure separation between training (70%), validation (15%), and testing (15%) while maintaining class balance in anomaly versus typical behavior.

An assessment of dataset availability and quantity indicated coverage of approximately 5 million behavioral event sequences, sufficient to train and validate ensemble models with adequate generalization capabilities. However, identified limitations include incomplete metadata on contextual factors such as precise exam conditions and participant demographics, linked to the secondary usage nature of the datasets. This gap is noted as a compliance risk area requiring cautious interpretation of results and potential compensatory measures.

### Bias Identification and Mitigation Measures

In alignment with bias-related provisions, the provider conducted a systematic evaluation of potential sources of bias that could affect the system’s fairness or lead to discriminatory outcomes. Bias audits focused on subgroup representation considering variables available in the data, such as language of instruction, exam format (online vs. onsite), and regional origin of learners. Analyses revealed overrepresentation of Western European institutions and underrepresentation of students with specific accessibility needs, reflecting the original dataset’s collection boundaries.

To address possible bias propagation that could impact fundamental rights—such as unfair academic sanctions due to behavioral misclassification—mitigation strategies were implemented at multiple stages. First, the provider enhanced model training with stratified sampling to reduce imbalance effects. Second, algorithmic fairness constraints were introduced, optimizing classification thresholds to minimize false positives without compromising detection sensitivity. Third, an ongoing monitoring framework was established to detect drift or disparities in model outputs relative to newly collected data from deployers. Although explicit access to special categories of personal data was not obtained, the provider limited data processing to pseudonymized identifiers only, avoiding direct linkage to sensitive personal attributes.

Technical safeguards included rigorous access controls restricting dataset handling to authorised personnel under confidentiality obligations, secure storage with encryption at rest and in transit, and comprehensive documentation of data lineage and processing steps for auditability. The provider’s records confirm that the reuse of behavioral data did not involve additional transmission to external entities and that data retention policies mandate deletion once model retraining cycles conclude or the data lose relevance for bias analysis.

### Data Set Representativeness and Suitability

The composite data sets used demonstrate relevance and representativeness for the system’s aim of detecting anomalous exam-related behaviors. Experimental benchmarks show that models trained on these data achieve an anomaly detection accuracy of approximately 87%, with false positive rates controlled below 5% in validation scenarios mimicking onsite examination environments. These performance metrics reflect an appropriate balance between sensitivity to irregular behavior and robustness against noisy or incomplete input streams.

Completeness was enhanced by integrating multimodal sources capturing both discrete tabular events (keystrokes) and continuous temporal audio cues, enabling contextual behavioral profiling. While certain demographic and geographic specificities were less exhaustively covered due to the original research data collection scope, the multi-institutional origin of the data supports the system’s intended function across diverse European exams. Statistical validation confirmed conformity of distributions within and across combined datasets, such that overall analytical properties are stable and suitable for modeling.

### Contextual Considerations Related to Intended Use

Recognizing the system’s deployment in varied educational contexts, data sets incorporated temporal and environmental factors characteristic of exam settings—such as noise levels typical in exam halls and timing patterns aligned with test durations—supporting contextual fidelity. However, data provenance reflects behavioral observation during general user interaction rather than exclusively monitored exam situations, introducing contextual gaps acknowledged in provider documentation. These limitations inform operational guidelines suggesting local calibration by deployers to tailor sensitivity and threshold values based on specific institutional exam protocols and lay infrastructural contexts.

### Conditions for Processing Special Categories of Personal Data

The provider’s data governance framework does not involve the processing of special categories of personal data for bias detection or correction purposes, given constraints arising from the origins of the behavioral datasets and applicable privacy standards. Consequently, conditions enumerated in paragraph 5 of Article 10 regarding strict safeguards, pseudonymisation, access controls, and post-use deletion are not applicable in this system’s development lifecycle.

---

This documentation transparently outlines the provider’s data sourcing, quality assurance, bias management, and contextualization efforts in developing the Academic Compliance Monitor, reflecting methodological rigor aligned with the system’s operational remit and the limitations inherent to the underlying training data characteristic of secondary research reuse.