**Article 14**

**Design and Development of Human-Machine Interface for Oversight**

The Academic Compliance Monitor (ACM) has been developed with an interface that presents detected behavioral anomalies to exam supervisors through a dashboard displaying summarized alerts linked to identified event sequences, such as unusual typing patterns and sudden deviations in ambient audio levels. The interface conveys the timestamp and category of detected suspicion (e.g., “keyboard irregularity” or “audio anomaly”) but does not proactively contextualize these alerts by indicating probable causes such as ambient noise fluctuations or legitimate atypical student behavior. No interactive guidance or warning messages are embedded to inform supervisors about potential false-positive sources inherent in the monitored environment. The system’s design omits requirement or facilitation of supervisor confirmation prior to generating or logging incident records, resulting in an automated pipeline from alert detection to report generation. This decision was made to expediently provide compliance officers with comprehensive event logs but limits the system’s capacity to support active supervisor judgment prior to data recording.

**Human Oversight Objective and Risk Mitigation**

The oversight design supports continuous real-time monitoring during examinations by displaying alerts with corresponding event metadata but relies on supervisors to independently interpret and validate the outputs without automated cautionary prompts or structured verification protocols. While the system uses a hybrid Random Forest and recurrent neural network ensemble to maximize accuracy in anomaly detection across multimodal inputs, empirical testing on a development dataset of 50,000 examinations revealed sensitivity to environmental noise spikes and irregular yet permitted behaviors (e.g., collaborative discussions during group breaks), which produce a non-negligible false-positive rate averaging 12%. Such detection limitations remain unmitigated via system-level interaction designs, which could otherwise alert supervisors to interpret outputs with contextual caution. The absence of proactive human-in-the-loop confirmation steps means that alert propagation to institutional reporting workflows occurs automatically, which may expose affected individuals to unwarranted academic scrutiny.

**Scope and Proportionality of Oversight Measures**

Considering the ACM’s moderate autonomy—providing binary anomaly flags without final decision authority—the oversight measures focus on continuous display of system outputs via a visual interface accessible exclusively to human exam supervisors. However, provider-implemented safeguards have not extended to embedding interactive controls for supervisors to override alerts or annotate them prior to logging. The system incorporates a manual “stop” function enabling supervisors to pause data collection in exceptional circumstances, yet this control interrupts input processing without enabling selective alert modification or de-escalation of flagged events. No technical feature enforces mandatory supervisor approval or deferral of alerts before automated reports are finalized. These characteristics reflect a design choice prioritizing streamlined alert delivery over layered human mediation and place the onus on deployers to implement organizational processes addressing such oversight gaps.

**Provision of Information to Enable Effective Supervision**

Documentation and training materials provided alongside the ACM include detailed technologic descriptions of model architectures, input modalities, and known model performance metrics—such as precision, recall, and false-positive rates drawn from validation studies with over 12 million individual behavioral data points. However, the system’s user manuals do not include explicit explanations or examples of typical false-positive scenarios related to environmental audio fluctuations or irregular but legitimate student behaviors. Supervisors are therefore not formally equipped through the system with interpretative aids or contextual information designed to counter automation bias or overreliance on alerts. The interface does not present uncertainty scores or confidence intervals associated with outputs, nor does it provide mechanisms for supervisors to annotate or flag alerts for review or reversal. No built-in functionality allows supervisors to override or disregard outputs directly within the software environment, though general operational controls permit halting system operation altogether. Data processing records, maintained according to applicable privacy frameworks, document the use of non-special categories of data exclusively and rationalize data necessity focused on bias detection, but do not link processing steps to specific supervisor decisions or output interpretations.