**Article 10**

### Data Governance and Management for Training, Validation, and Testing Sets

Insight Proctor Analytics’ development process adheres to structured data governance and management procedures tailored to the system’s intended use of video-based academic exam proctoring. The core training corpus consists of over 45,000 hours of video footage, primarily sourced from well-lit, high-resolution classroom recordings conducted in controlled academic settings across European universities and accredited testing centers. This collection was curated using documented ethical sourcing protocols, ensuring the video data were originally captured for academic integrity monitoring, with explicit consent and legal compliance respecting privacy regulations. The provider employed rigorous data versioning controls and metadata cataloging to maintain traceability from data acquisition through to model training, facilitating auditability and reproducibility.

Data preparation included multi-stage annotation workflows. Experienced annotators flagged incidents of cheating-related behaviors, such as unauthorized use of electronic devices, surreptitious note passing, and suspicious gestures. However, exploratory analyses revealed that approximately 12% of hand movement instances in annotated segments were misclassified due to ambiguous labeling guidelines, reflecting a subset of unresolved inconsistencies that remain under ongoing review. Data cleaning steps involved removing corrupted or incomplete video frames and normalizing resolution and color balance for uniform model input. While the provider executed multiple cycles of dataset refinement, a residual proportion of mislabeled instances persists, acknowledged as a limitation in current system performance documentation.

### Data Relevance, Representativeness, and Completeness

The system’s training, validation, and testing datasets were assembled to capture relevant behavioral patterns in conventional, supervised exam environments. The bulk of the data reflects scenarios under controlled lighting conditions with stationary, multi-camera recordings typically deployed in examination halls. Despite the broad geographic distribution across several European countries, data instances covering low-light environments or alternative test administration modes — such as remote proctoring using standard webcams with varying video quality — are insufficiently represented, comprising less than 3% of the total dataset. This gap impacts the representativeness of the training data relative to emerging remote or hybrid testing modalities.

Statistical audits show that demographic attributes such as participant gender and age approximate typical student populations within higher education, though other contextual variables specific to exam formats and room layouts exhibit moderate homogeneity. Error analysis and performance benchmarks demonstrate that detection accuracy remains consistently high (above 91% F1 score) within on-scope environmental conditions but degrades in low-light or webcam-only scenarios. These deviations have been documented in the risk assessment reports, with corresponding mitigation actions planned through prioritized data acquisition campaigns.

### Treatment of Labeling Assumptions and Biases

Annotations were constructed with operational assumptions that hand gestures and movements could be directly correlated with potential cheating indicators. Nevertheless, provider investigations identified systematic ambiguity where ordinary non-cheating behaviors, such as note-taking or gesture-based communication with proctors, were occasionally misclassified. This reveals an intrinsic limitation in the current label schema and the challenge of distinguishing covert intent solely from visual cues. To address this, the provider has incorporated layered model architectures that integrate visual patterns with semantic test metadata, mitigating false positives by cross-validating suspicious gestures against concurrent examination context.

Bias detection procedures were conducted to assess whether data imbalances might affect protected groups or distort system outputs negatively. No statistically significant discrimination was detected regarding gender or ethnic groups represented within the datasets, consistent with the provider’s predefined fairness indicators. However, limitations relating to lighting conditions and camera angles point to possible environmental biases that could affect detection reliability in less conventional settings. A continuous monitoring framework scans for performance deviations correlated with contextual variables, triggering alerts for dataset enrichment or algorithm update cycles.

### Identification and Management of Data Gaps and Shortcomings

The provider explicitly acknowledges and documents existing data gaps, particularly the underrepresentation of low-light, webcam-only footage typically encountered in remote proctoring scenarios. This shortfall restricts Insight Proctor Analytics’ operational scope and has informed both communicated system limitations and user guidance materials. To partially mitigate this gap, the provider is undertaking extended data collection initiatives featuring synthetic augmentation techniques and collaboration with remote testing centers to expand the annotation base relevant to these emerging use cases.

Furthermore, unresolved annotation inconsistencies, notably in the classification of hand movements unrelated to cheating, are flagged within the provider’s quality control logs. These inconsistencies have led to conservative decision thresholds within the model deployment pipeline to balance sensitivity and specificity, thereby reducing the likelihood of false accusations while acknowledging potential undetected anomalies. Documentation of these trade-offs is detailed in the system’s technical risk and mitigation reports, forming part of the provider’s comprehensive compliance framework.

### Data Security and Privacy Considerations Relevant to Special Categories of Data

While Insight Proctor Analytics processes video data of students during examinations, it does not employ biometric identification or process special categories of personal data as defined under Regulations (EU) 2016/679 and (EU) 2018/1725. The system processes visual data exclusively for behavior analysis without extracting biometric modalities such as facial recognition for identification or health data. The provider’s data management protocols enforce strict access controls, encryption-at-rest and in-transit, and pseudonymization at annotation stages to protect individual privacy rights. Residual personal data are subject to retention policies aligned with the stated purpose of academic integrity monitoring and are expunged following project-specific timelines or upon request in accordance with applicable data protection laws.