**Article 9**

**Risk Management System Establishment and Scope**

Horizon Learning Analytics has established a documented risk management system for the Competency Evaluation Framework (CEF), encompassing design, development, and deployment phases. The system’s purpose is to identify and address risks related to the evaluation of learner competency in vocational and lifelong learning contexts, particularly those impacting health, safety, and fundamental rights such as fairness and equal opportunity. The risk analysis was informed by an initial comprehensive hazard identification exercise that included potential biases in skill assessment, misclassification of learner proficiency, and erroneous certification recommendations. The framework analyzed known and reasonably foreseeable risks from system operation under intended use conditions; such risks encompass inaccurate competency scores potentially leading to misplaced learner advancement or certification denial, and inadvertent reinforcement of educational disparities across demographic subgroups.

**Iterative Process and Lifecycle Considerations**

The risk management protocol is defined as an iterative lifecycle process starting at system conception through post-deployment oversight. Procedures include stage-gate reviews during model development, performance validation, and system integration accompanied by logging and traceability mechanisms. While initial risk evaluation incorporates benchmark testing on a validation corpus of 120,000 anonymized learner interaction records spanning five years and multiple vocational sectors, subsequent reviews largely rely on passive log collection during operational deployment. These logs capture algorithmic decision outputs, feature importance attributions, and statistical performance metrics (e.g., accuracy, false positive/negative rates, and calibration scores). However, no structured mechanisms are currently integrated to assimilate evolving user feedback or longitudinal outcome data into the risk evaluation process. This includes absence of analyses targeting disproportionate impacts on vulnerable learner groups (e.g., age, gender, socioeconomic background), as well as lack of systematic review triggered by emergent evidence post-market.

**Risk Estimation under Intended and Misuse Conditions**

Risk estimation accounts for probable performance degradation scenarios, including data drift, input noise, and user errors within foreseeable misuse contexts, such as inappropriate data entry or attempts to game the system. Simulated misuse testing used synthetic perturbations of learner logs reflecting common entry mistakes and adversarial patterns at a 15% injection rate, with corresponding detection and mitigation layers adjusted accordingly. Residual risk areas, particularly relating to overreliance on historical performance features correlated with demographic attributes, were acknowledged but not actively monitored during deployment, given current technical and operational constraints.

**Integration of Post-Market Data and Monitoring Limitations**

The post-market monitoring framework comprises automated log collection from deployed instances at partner vocational institutions, including raw input anonymized events, system decision outputs, and runtime diagnostics. Logs are stored in a secure centralized repository enabling performance trend analysis over time. Nonetheless, no systematic procedures exist to incorporate qualitative user feedback such as instructor or learner complaints, suggestions, or reported anomalies. Likewise, the framework does not incorporate longitudinal outcome tracking such as learner progression post-assessment, or analyses revealing disparities in system impact across different learner demographics or vulnerability factors. Consequently, risk evaluations conducted pre-deployment remain essentially static, with no mandated regular updates tied to newly surfaced empirical evidence from the operational environment.

**Adopted Risk Management Measures and Residual Risk Assessment**

Risk mitigation primarily focused on robust model development practices including careful feature selection, cross-validation to detect data leakage, and implementation of explainability techniques inherent in GBDT models (SHAP values) to facilitate transparent result interpretation. Training datasets were balanced on observable demographic distributions to reduce initial bias potential; nevertheless, the system’s risk controls do not currently extend to dynamic adjustment mechanisms responding to identified demographic disparities or emergent adverse impacts during real-world use. Comprehensive technical documentation and user instructions emphasize appropriate training for deployers in correctly interpreting competency scores and limitations. Residual risks related to possible unfairness or misuse are documented, with residual risk deemed acceptable by design parameters, albeit without adaptive mitigation informed by live feedback or longer-term outcome studies.

**Testing Procedures and Performance Validation**

Testing was performed at multiple stages, including offline model validation against benchmarks tailored for vocational competency domains with performance thresholds targeting an overall classification accuracy above 87% and false classification rates below 5%. Real-world pilot deployments validated model stability and feedback interface usability across three vocational training centers involving approximately 1,200 learners in the preceding 12 months. Testing criteria incorporated stratified performance measures by competency category and learner subgroup but did not establish dynamic real-world monitoring tests post-deployment beyond log collection. Testing frameworks do not extend to automated adverse impact audits or monitoring tools that recalibrate risk priorities based on new evidence.

**Consideration of Vulnerable Groups and Minors**

The initial risk assessment phase took into account the demographic composition of the learner base, identifying persons under 18 and other vulnerable groups (such as economically disadvantaged adults and learners with recognized disabilities). Risk scenarios evaluated qualitative impacts on these groups with a conservative bias towards minimizing misclassification errors that could exacerbate existing barriers to certification or employment readiness. However, continuous monitoring or periodic reassessment of evolving risk profiles specifically for these vulnerable categories is not established. Guidance materials encourage deployers to consider additional safeguards appropriate to learner contexts, but these recommendations are not supplemented by automated system alerts or risk feedback loops.

---

This documentation reflects the current technical state and operational risk management approach applied to the CEF system, particularly highlighting the passive yet incomplete utilization of post-market data and the absence of iterative risk evaluation updates as contextual evidence evolves.