**Article 14**

### Design for Effective Human Oversight

Insight Proctor Analytics is designed to deliver concise binary behavior alerts indicating either “suspicious” or “not suspicious” conduct, derived from transformer-based Vision Language Models (VLMs) that analyze synchronized video feeds and exam metadata. The human-machine interface (HMI) exposes only these binary flags to proctors, without contextual explanations, confidence metrics, or direct access to the underlying visual or semantic evidence. The provider considered interactive, interpretable output designs during development but elected to prioritize streamlined alert delivery to limit proctor cognitive load and interface complexity. This design decision reflects the provider’s interpretation of proctor needs observed in pilot studies involving 120 sessions, in which detailed explanations were often underutilized. The interface includes a real-time dashboard with timestamped alerts, enabling proctors to monitor the temporal pattern of flagged behaviors during exams. No additional modalities, such as video playback or visual annotation overlays, are presented to proctors, so oversight is limited to interpreting the binary alerts and their timing.

### Human Oversight Objectives and Risk Minimization

The system aims to contribute to exam integrity by flagging potentially prohibited behaviors as defined by institutional policies for supervised assessments. Given the high-stakes academic environment, the binary alert approach was selected to give proctors a clear, readily actionable signal that supports rapid decision-making under time constraints. Despite the absence of supplemental interpretability features, the provider carried out pre-market validation on a diverse dataset of more than 350,000 video-test samples collected under multiple lighting and camera configurations, with the aim of minimizing false positives and false negatives. The system’s error rates were benchmarked internally, yielding a balanced accuracy of 87% in detecting behavior patterns labeled by expert analysts. The provider recognized residual risks arising from possible proctor overreliance on binary outputs and from limited insight into algorithmic reasoning, but determined that operational safeguards, such as proctor training programs and procedures for manual intervention, mitigate these risks during live deployment. The system includes no automatic blocking or autonomous decision-making features, so proctors retain full authority over final determinations and actions.
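For readers unfamiliar with the metric cited above, balanced accuracy is the mean of sensitivity and specificity, which makes it less flattering than plain accuracy when "suspicious" samples are rare. The sketch below is illustrative only: the function name and the confusion-matrix counts are hypothetical and are not the provider's validation figures.

```python
def balanced_accuracy(tp: int, fn: int, tn: int, fp: int) -> float:
    """Mean of sensitivity (true positive rate) and specificity (true negative rate)."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("each class needs at least one expert-labeled sample")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# Hypothetical counts, chosen only to illustrate the calculation.
tp, fn = 820, 180     # expert-labeled "suspicious" samples: caught vs. missed
tn, fp = 8280, 720    # expert-labeled "not suspicious" samples: cleared vs. wrongly flagged

score = balanced_accuracy(tp, fn, tn, fp)
plain_accuracy = (tp + tn) / (tp + fn + tn + fp)

print(f"balanced accuracy: {score:.2f}")
print(f"plain accuracy:    {plain_accuracy:.2f}")
```

With these illustrative counts, sensitivity is 0.82 and specificity is 0.92, so balanced accuracy is 0.87 while plain accuracy is 0.91. A single balanced-accuracy figure therefore does not by itself reveal how errors split between missed behaviors and wrongly flagged test takers.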

### Oversight Measures Embedded by the Provider

The system incorporates, prior to deployment, the oversight measures that are technically feasible within its design constraints. The binary alert generation pipeline integrates continuous health checks on model performance metrics and data stream integrity, with automated fault detection modules that pause alert generation when video feed quality is compromised or sensor input patterns are anomalous. Model monitoring infrastructure captures real-time telemetry, including input sampling rates and alert frequencies, to support post-exam forensic review. The provider identified potential deficiencies in explainability and implemented periodic model retraining protocols, which involve balanced re-annotation of edge-case samples, to curb concept drift. Although the system architecture supports future integration of explainability modules, none are deployed at this time, reflecting a strategic decision grounded in observed operational practice and product positioning. Information necessary for auditing, including system versioning and alert log records, is securely stored and made available to deployers for compliance tracking.
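The pause-on-fault behavior described above could look roughly like the following minimal sketch. It is not the provider's implementation: the `FeedHealth` fields, the threshold values, and the function names are all assumptions introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeedHealth:
    frames_per_second: float    # observed input sampling rate
    dropped_frame_ratio: float  # share of frames lost in the last window
    alerts_per_minute: float    # recent alert frequency


# Hypothetical thresholds; real values would come from validation data.
MIN_FPS = 10.0
MAX_DROPPED_RATIO = 0.20
MAX_ALERTS_PER_MINUTE = 30.0


def health_faults(health: FeedHealth) -> list[str]:
    """Return the names of all failed checks (empty list means healthy)."""
    faults = []
    if health.frames_per_second < MIN_FPS:
        faults.append("low_frame_rate")
    if health.dropped_frame_ratio > MAX_DROPPED_RATIO:
        faults.append("dropped_frames")
    if health.alerts_per_minute > MAX_ALERTS_PER_MINUTE:
        faults.append("anomalous_alert_rate")
    return faults


def gate_alert(model_flag: bool, health: FeedHealth) -> dict:
    """Suppress the binary flag while any health check is failing."""
    faults = health_faults(health)
    if faults:
        # Alert generation is paused; the faults are kept for forensic review.
        return {"status": "paused", "flag": None, "faults": faults}
    flag = "suspicious" if model_flag else "not suspicious"
    return {"status": "active", "flag": flag, "faults": []}


healthy = FeedHealth(frames_per_second=25.0, dropped_frame_ratio=0.01, alerts_per_minute=2.0)
degraded = FeedHealth(frames_per_second=4.0, dropped_frame_ratio=0.35, alerts_per_minute=2.0)

print(gate_alert(True, healthy))
print(gate_alert(True, degraded))
```

Returning an explicit `paused` status, rather than silently emitting "not suspicious", keeps a degraded feed from being mistaken for a clean session in the alert log.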

### Information Provided to Deployed Proctors for Oversight

Deployment packages supplied to institutions include operator manuals that explicitly define the binary nature of system outputs and warn of limitations in interpretability. Training materials emphasize that proctors are responsible for critically assessing alerts in context and must not rely solely on AI-generated flags. Guidance highlights the phenomenon of automation bias and instructs proctors to maintain vigilance and apply domain expertise when adjudicating flagged behaviors. The user interface does not provide confidence scores or explanatory components; proctors are informed that the underlying detection is based on complex transformer models analyzing multimodal exam data, but no detailed interpretative support tools are accessible during use. Proctors can override or dismiss alerts at their discretion. The system does not incorporate a dedicated ‘stop’ or emergency halt function, because it continuously monitors exam sessions rather than autonomously intervening in them. Where the embedded diagnostic system detects anomalies or suspected malfunctions, system notifications prompt proctors to pause the exam or contact technical support in accordance with institutional protocols.
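One way to make the override and dismissal authority described above auditable is to record each proctor decision against the alert it concerns. The sketch below is a hypothetical data structure, not the product's actual log schema: every field name, the `record_decision` helper, and the rule that a written rationale is mandatory are assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional

VALID_DISPOSITIONS = {"upheld", "dismissed"}


@dataclass
class AlertRecord:
    alert_id: str
    session_id: str
    raised_at: datetime
    system_version: str
    flag: str = "suspicious"
    disposition: str = "pending"          # pending until a proctor decides
    decided_by: Optional[str] = None
    decided_at: Optional[datetime] = None
    rationale: Optional[str] = None


def record_decision(record: AlertRecord, proctor_id: str,
                    disposition: str, rationale: str) -> AlertRecord:
    """Attach the proctor's final determination to an alert."""
    if disposition not in VALID_DISPOSITIONS:
        raise ValueError(f"disposition must be one of {sorted(VALID_DISPOSITIONS)}")
    if not rationale.strip():
        # Assumed rule: a short written reason nudges against rubber-stamping flags.
        raise ValueError("a written rationale is required")
    record.disposition = disposition
    record.decided_by = proctor_id
    record.decided_at = datetime.now(timezone.utc)
    record.rationale = rationale.strip()
    return record


alert = AlertRecord(
    alert_id="a-001",
    session_id="s-042",
    raised_at=datetime.now(timezone.utc),
    system_version="hypothetical-1.0",
)
record_decision(alert, "proctor-17", "dismissed", "Candidate consulted a permitted formula sheet.")
print(alert.disposition, alert.decided_by)
```

Requiring a rationale for both upheld and dismissed alerts is a design choice aimed at the automation bias the guidance warns about; an institution could reasonably decide otherwise.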

### Data Processing and Bias Mitigation Transparency

Insight Proctor Analytics processes video data and associated exam metadata, with strict adherence to data minimization and purpose limitation principles. In accordance with applicable data protection regulations, records of processing activities document the necessity of processing special categories of personal data (such as visual information of test takers) solely to detect and address algorithmic biases and ensure equitable system performance across diverse demographic groups. Bias correction activities involve controlled access to de-identified subsets of the training data and include assessment against fairness benchmarks stratified by gender, ethnicity, and testing environment conditions. The provider’s bias mitigation strategy, executed before placing the system on the market, helps reduce disparate impacts but operates within the constraints of non-explainable, binary-output design choices, limiting real-time bias transparency for proctors. Comprehensive logs pertaining to model update cycles and bias correction measures are maintained and shared with deployers for regulatory compliance and oversight purposes.
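A stratified fairness assessment of the kind described above can be illustrated by comparing false positive rates across strata. The sketch below is a simplified assumption about how such a check might be computed; the function names, the choice of false positive rate as the metric, the stratum labels, and the sample data are all hypothetical and do not reproduce the provider's benchmarks.

```python
from collections import defaultdict
from typing import Iterable


def false_positive_rate_by_group(
    samples: Iterable[tuple[str, bool, bool]],
) -> dict[str, float]:
    """samples holds (group, expert_says_suspicious, system_flagged) triples."""
    negatives = defaultdict(int)
    false_positives = defaultdict(int)
    for group, expert_says_suspicious, system_flagged in samples:
        if not expert_says_suspicious:
            negatives[group] += 1
            if system_flagged:
                false_positives[group] += 1
    # Groups with no expert-labeled negatives are omitted rather than divided by zero.
    return {g: false_positives[g] / n for g, n in negatives.items()}


def largest_gap(rates: dict[str, float]) -> float:
    """Difference between the worst and best stratum."""
    return max(rates.values()) - min(rates.values())


# Hypothetical samples stratified by testing environment condition.
samples = [
    ("bright_room", False, False), ("bright_room", False, False),
    ("bright_room", False, True),  ("bright_room", False, False),
    ("low_light", False, True),    ("low_light", False, True),
    ("low_light", False, False),   ("low_light", False, False),
    ("low_light", True, True),     # true positive, ignored by this metric
]

rates = false_positive_rate_by_group(samples)
print(rates)
print(f"largest gap: {largest_gap(rates):.2f}")
```

The same function applies unchanged to any other stratification named in the text, provided the group labels are supplied from the de-identified evaluation subset.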