**Article 12**

**Automatic Event Logging Mechanism**

The AI system incorporates an automated logging component that continuously records system events throughout its operational lifetime. This logging infrastructure is embedded at multiple layers of the software stack, covering the input data ingestion, model inference, and output generation stages of the competency evaluation pipeline. Logs are generated in real time and transmitted securely to a centralized, tamper-resistant store that supports append-only audit trails. Each stored record carries a timestamp and a unique event identifier, which supports reproduction of decision outputs and retrospective analysis of system behavior. The log format follows structured schemas aligned with industry standards for auditability and machine learning operations (MLOps), which eases integration with monitoring dashboards and compliance reporting tools. This logging design was selected to provide continuous traceability over the extended deployment periods typical of vocational education environments.
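One common way to make an append-only trail tamper-evident is to chain records by hash, so that each record commits to its predecessor. The following is a minimal in-memory sketch of that idea, not a description of the provider's actual storage solution. The names `AuditLog`, `verify_chain`, and `GENESIS_HASH`, as well as the record fields, are hypothetical and chosen for illustration only.

```python
import hashlib
import json
import uuid
from datetime import datetime, timezone

GENESIS_HASH = "0" * 64  # assumed placeholder hash preceding the first record


def _digest(body: dict) -> str:
    """Return the SHA-256 hex digest of a canonical JSON encoding of body."""
    payload = json.dumps(body, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class AuditLog:
    """In-memory append-only log in which each record commits to the previous one."""

    def __init__(self) -> None:
        self._records: list[dict] = []

    def append(self, stage: str, event_type: str, details: dict) -> dict:
        """Add a timestamped record with a unique identifier and a chained hash."""
        prev_hash = self._records[-1]["hash"] if self._records else GENESIS_HASH
        body = {
            "event_id": str(uuid.uuid4()),
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "stage": stage,            # e.g. "ingestion", "inference", "output"
            "event_type": event_type,
            "details": details,        # must be JSON-serializable
            "prev_hash": prev_hash,
        }
        record = {**body, "hash": _digest(body)}
        self._records.append(record)
        return record

    def records(self) -> list[dict]:
        """Return shallow copies so callers cannot reorder or drop stored records."""
        return [dict(r) for r in self._records]


def verify_chain(records: list[dict]) -> bool:
    """Check that every record hashes correctly and links to its predecessor."""
    prev_hash = GENESIS_HASH
    for record in records:
        body = {k: v for k, v in record.items() if k != "hash"}
        if record["prev_hash"] != prev_hash or record["hash"] != _digest(body):
            return False
        prev_hash = record["hash"]
    return True
```

With this structure, altering or removing any earlier record changes the hashes that later records depend on, so `verify_chain` returns `False` for a modified copy of the log. A production deployment would additionally need durable, access-controlled storage, which this sketch does not cover.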

**Scope of Recorded Events to Support Risk Identification and System Modifications**

To enable early detection of situations that may increase system-associated risks or lead to substantial modifications, the logging framework captures a wide range of events, including but not limited to: anomalous shifts in input feature distributions, model confidence metrics, performance degradation indicators, and alerts triggered by data quality or processing errors. Specifically, the system monitors deviations from the baseline performance established during validation and logs any instance where competency score distributions shift beyond a predefined statistical threshold (defined as a shift of more than 3 standard deviations in either direction on key skill dimensions). Change control events such as retraining triggers, model parameter updates, and configuration changes are automatically recorded with granular metadata detailing the rationale and version. This makes system adaptation traceable and supports assessment of how modifications affect risk profiles. These recording choices were guided by risks identified during the system’s internal risk management process, with a focus on preserving interpretability and minimizing erroneous competency assessments that could affect certification decisions.
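The 3 standard deviation rule can be sketched as follows. The text does not specify which statistic is compared, so this example assumes that the shift is the difference between the current mean score and the validation baseline mean for a skill dimension, expressed in baseline standard deviations. The names `Baseline` and `drift_events` and the event fields are hypothetical.

```python
from dataclasses import dataclass
from statistics import fmean


@dataclass(frozen=True)
class Baseline:
    """Validation-phase statistics for one skill dimension (assumed structure)."""
    mean: float
    std: float


def drift_events(
    baselines: dict[str, Baseline],
    current_scores: dict[str, list[float]],
    threshold: float = 3.0,
) -> list[dict]:
    """Return one loggable event per dimension whose mean shift exceeds the threshold."""
    events = []
    for dimension, baseline in baselines.items():
        if baseline.std <= 0:
            raise ValueError(f"baseline std for {dimension!r} must be positive")
        scores = current_scores.get(dimension, [])
        if not scores:
            continue  # nothing observed for this dimension in the current window
        # Shift of the current mean, measured in baseline standard deviations.
        shift = (fmean(scores) - baseline.mean) / baseline.std
        if abs(shift) > threshold:  # covers shifts in either direction
            events.append({
                "event_type": "score_distribution_shift",
                "dimension": dimension,
                "shift_in_std": shift,
                "threshold": threshold,
                "sample_size": len(scores),
            })
    return events
```

For example, with a baseline of mean 70 and standard deviation 5, a monitoring window whose mean score is 90 yields a shift of 4.0 and produces an event, whereas a window mean of 85 (a shift of exactly 3.0) does not.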

**Post-Market Monitoring Facilitation**

The logging system is designed to collect the data needed for ongoing post-market monitoring. Logs systematically capture user interaction patterns (e.g., frequency and duration of competency assessments), system feedback loops linked to curriculum adjustments, and the outcomes of model performance audits conducted in operational settings. This dataset supports continuous evaluation against the key performance indicators (KPIs) established pre-market, such as assessment accuracy, false positive and false negative rates in mastery detection, and fairness metrics across demographic groups. Periodic automated aggregation scripts generate compliance reports that highlight trends, anomalies, and incidents requiring human review. The system also maintains a detailed record of user-reported issues and system-generated exception logs, which feed post-market vigilance workflows. This structured approach keeps critical operational data available for regulatory scrutiny and quality assurance, aligning the system’s lifecycle management with the requirements for high-risk AI.
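To illustrate the kind of aggregation such a script might perform, the sketch below computes false positive and false negative rates for mastery detection per demographic group. It assumes each logged audit record can be reduced to a tuple of group label, predicted mastery, and confirmed mastery. The function name `error_rates_by_group` and this record layout are hypothetical.

```python
from collections import defaultdict


def error_rates_by_group(records):
    """Compute per-group error rates from (group, predicted, actual) tuples.

    Returns a dict mapping each group to its false positive rate
    (FP / (FP + TN)) and false negative rate (FN / (FN + TP)).
    A rate is None when the group has no cases in the relevant class.
    """
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "tn": 0, "fn": 0})
    for group, predicted, actual in records:
        if predicted and actual:
            counts[group]["tp"] += 1
        elif predicted and not actual:
            counts[group]["fp"] += 1
        elif not predicted and actual:
            counts[group]["fn"] += 1
        else:
            counts[group]["tn"] += 1

    rates = {}
    for group, c in counts.items():
        negatives = c["fp"] + c["tn"]  # cases where mastery was not confirmed
        positives = c["fn"] + c["tp"]  # cases where mastery was confirmed
        rates[group] = {
            "false_positive_rate": c["fp"] / negatives if negatives else None,
            "false_negative_rate": c["fn"] / positives if positives else None,
            "n": negatives + positives,
        }
    return rates
```

Comparing these rates across groups is one simple way to surface a fairness gap for human review. The choice of fairness metric and of any acceptable gap is not specified here and would come from the pre-market KPI definitions.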

**Monitoring of Operational Functioning**

To maintain effective monitoring during regular use, the system records operational states relevant to continuous oversight, including workload metrics, response latency, and resource utilization. Health-check logs capture error rates in input data validation and model inference, supporting prompt detection of degradation or malfunction. The logging solution interoperates with real-time monitoring tools that trigger alerts when key indicators cross predefined thresholds (e.g., increased error frequency or unexpected shifts in output distributions). Additionally, trace logs document decision paths within the gradient-boosted decision trees, including feature importance scores for each individual assessment, which provides transparency and enables explainability audits. By embedding these monitoring capabilities in the logging framework, the provider enables users and maintainers to observe and verify that the system operates within expected parameters, supporting responsible and reliable deployment in educational contexts.
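Threshold-based alerting on error frequency can be sketched with a rolling window over recent requests. This is an illustrative example under assumed parameters: the class name `ErrorRateMonitor`, the window size, the minimum sample count, and the 5% default threshold are hypothetical and are not taken from the system's configuration.

```python
from collections import deque


class ErrorRateMonitor:
    """Track the error rate over the most recent requests and flag breaches."""

    def __init__(self, window_size: int = 100, max_error_rate: float = 0.05,
                 min_samples: int = 20) -> None:
        if window_size <= 0 or min_samples <= 0:
            raise ValueError("window_size and min_samples must be positive")
        self._outcomes = deque(maxlen=window_size)  # oldest entries drop off automatically
        self._max_error_rate = max_error_rate
        self._min_samples = min_samples

    def error_rate(self) -> float:
        """Fraction of errors among the outcomes currently in the window."""
        if not self._outcomes:
            return 0.0
        return sum(self._outcomes) / len(self._outcomes)

    def record(self, is_error: bool) -> bool:
        """Record one outcome and return True if an alert should be raised."""
        self._outcomes.append(bool(is_error))
        if len(self._outcomes) < self._min_samples:
            return False  # too few observations to judge the rate reliably
        return self.error_rate() > self._max_error_rate
```

The minimum sample count avoids alerting on the first one or two failures after start-up, and the bounded `deque` keeps the check focused on recent behavior rather than the full history. In practice the returned flag would be written to the log and forwarded to whichever monitoring tool handles alert routing.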