**Article 14**

**Design and Development of Human-Machine Interface for Oversight**  
The Competency Evaluation Framework employs a gradient boosted decision tree (GBDT) architecture trained on a dataset comprising approximately 180,000 anonymized learner interaction logs and structured performance metrics collected from vocational training programs across five EU member states. The model produces competency scores ranging from 0 to 100, which reflect assessed mastery over specific technical and practical skills. The user interface presents these scores directly and prominently on a dashboard designed for instructors and trainees, featuring concise numeric values and categorical descriptors (e.g., “Proficient,” “Developing,” “Needs Improvement”) without supplemental qualifiers or uncertainty intervals. The interface omits interpretive warnings or disclaimers regarding probabilistic estimation or potential error margins, deliberately prioritizing score clarity and decisiveness to support perceived authoritative decision-making. The system incorporates standard interaction components such as sortable tables of competency scores, trend charts, and feature importance indicators that expose which learner activities most strongly influenced the assessment, enhancing interpretability consistent with prevailing industry best practices in 2025.

**Human Oversight Objective and Risk Mitigation**  
Oversight functions are designed to enable instructors and other authorized personnel to review learner competency results in real time and adjust curricula accordingly. While the system inherently quantifies learner performance through algorithmic analysis, Horizon Learning Analytics structured the oversight interface to enable natural persons to monitor score evolution and inferred skill acquisition trends during all phases of use. No specific mechanistic safeguards are embedded to explicitly alert users to risks of overreliance or automation bias on these outputs, nor are reminders provided regarding the inherent uncertainty of modeled scores. The system assumes user expertise in interpreting and acting upon the scores within established instructional contexts. The design intent is to sustain efficient verification of learner progress through authoritative metrics enabling timely interventions, while accepting residual risks associated with undiminished trust in the AI-generated evaluations.

**Commensurate Oversight Measures and Assigned Responsibilities**  
Horizon Learning Analytics implemented pre-market design containment measures by embedding transparent feature importance analytics and visual trend tracking to support user comprehension of model decision drivers. These technical features aim to facilitate effective supervision by highlighting influential learner behaviors and detected competencies that inform each score. Prior to deployment, the provider also furnished deployers with comprehensive model performance reports demonstrating stable predictive accuracy, with cross-validation yielding mean absolute error (MAE) values consistently below 6 points across competencies—benchmarked against expert human raters. The provider’s documentation includes operational best practices for integration but deliberately does not enforce tools or protocols specifically addressing cognitive biases during score consumption. Deployment-specific operational measures are not mandated by the provider and are thus left to the discretion of users, consistent with the provider’s boundary of responsibility.

**Enabling Effective Human Monitoring and Interpretation**  
The system’s data visualization and reporting tools enable instructors and trainees to monitor ongoing competency developments through time-series charts and statistical summaries. Interactive model explanation modules provide feature importance scores tied to distinct learner activities, giving users interpretable insights into the algorithmic rationale behind competency ratings. The scores are presented as definitive indicators of current learner status without caveats related to statistical confidence or alternative scenario modeling. The interface supports manual overrides of scoring by authorized personnel, permitting instructional adaptation and certification decisions to rely on human judgment. Such overrides are facilitated through built-in controls on the platform, allowing for correction or exclusion of individual results if deemed inconsistent with other evidence. A “stop” function halts system assessments but does not interrupt the underlying data collection procedures, preserving data integrity for subsequent analyses. Logs capture user interactions with the scoring output, but no embedded alerts inform users of automation bias risks, nor do training materials supplied by Horizon Learning Analytics emphasize cognitive bias mitigation strategies.

**Records of Data Processing Justification**  
The system processes personal data comprising training performance and demographic attributes strictly necessary for competency modeling. Special categories of personal data are processed only insofar as required to detect and adjust for group-level biases impacting model fairness. The provider’s documented rationale explains that such processing enables correction of systemic disparities in skill evaluation metrics that could arise from unbalanced sample representation. The objective of bias mitigation cannot be achieved through exclusively non-sensitive data, as evidenced by multi-year research showing significant variance in performance patterns correlated with protected characteristics absent direct inclusion of these attributes. Processing records capture justification statements aligned with GDPR provisions, affirming the necessity and proportionality of data use for bias detection and correction within the scope of competency evaluation.