**Article 14**

**Design and Implementation of Human Oversight Features**  
Contractual Separation Insight is developed with a layered ensemble architecture combining multiple random forest classifiers and transformer-based large language models (LLMs). This hybrid design integrates quantitative employee performance indicators with qualitative analyses of company policy documents and applicable labor law texts. The provider has designed the system to output both predictive scores and explanatory summaries to facilitate user interpretation. However, the interface does not incorporate automated modules or visual indicators specifically intended to detect, highlight, or warn about disparities in outcomes affecting protected demographic groups. While performance benchmarking and fairness assessments were conducted during development, these aspects are not embedded in real-time alerts or notifications within the user interface. The rationale for this design choice reflects a risk management approach where general model robustness and policy compliance are prioritized, and detailed fairness assessments are instead intended to be performed by deployers or end users through separate audit tools and internal governance processes.

**Risk Mitigation through Human Oversight Modes**  
The system provides interactive features allowing human operators—typically HR professionals or compliance officers—to review recommended contract termination decisions along with synthesized justifications. Operators can analyze both the data-driven score distributions from the random forest models and narrative explanations from the LLM components. Controls include the ability to override or disregard any system recommendation before a decision is finalized. Additionally, a manual “stop” function is available to safely halt processing when anomalous outputs or unforeseen issues arise. These oversight mechanisms enable users to prevent or reduce risks related to compliance errors, policy misinterpretation, or invalid predictions within contractual terminations. However, the system relies on user expertise and contextual judgment to identify potential discriminatory impacts or biases on protected groups, as such risk signals are not surfaced automatically.

**Capacity and Limitation Communication to Users**  
Documentation and training materials provided with the system outline its core capabilities—including predictive modeling accuracy (average F1-score of 0.81 on representative validation datasets with approximately 50,000 anonymized employee records) and explainability features linked to policy clauses. Limitations are also explicitly disclosed, mentioning known challenges such as potential residual biases stemming from underlying historic HR data and the complexity of natural language policy interpretation. The provider’s communication emphasizes that users must maintain vigilant oversight and conduct supplementary reviews to identify any undue adverse effects across demographic segments, given that the system does not offer embedded demographic fairness diagnostics or impact thresholds. This transparency supports informed monitoring but shifts responsibility to users for active bias detection and mitigation.

**Measures for Oversight Commensurate with System Autonomy and Context**  
Given the ensemble’s intermediate level of automation in decision support—not making autonomous decisions but recommending actionable outcomes—the oversight framework is tailored accordingly. Pre-market validation included comprehensive performance testing, sensitivity analyses for varied employee profiles, and simulated stress tests under different contract termination scenarios, adhering to established industry benchmarks for HR-focused AI tools in 2025. The provider identified human review and override as essential oversight layers suitable for the system’s operational environment. No provider-embedded alerts regarding discriminatory risks were implemented, recognizing the technical challenges of reliably detecting such biases in language-driven analytics without extensive deployment-context data. Compliance controls thus rely principally on deployer-initiated policies, audit cycles, and external fairness assessment tools. The system’s architecture facilitates integration with such third-party pipelines but does not supply them natively.

**Access to Operational Records and Data Processing Rationale**  
For purposes of transparency and regulatory compliance, the system maintains detailed logs of data inputs, decision outcomes, and operator interactions, stored securely and accessible for audit. These records include metadata on the datasets used, anonymization status, and summaries of model outputs to support retrospective analyses. When processing special categories of personal data to detect and correct biases during development and limited retraining phases, the provider documented strict necessity justifications in accordance with relevant EU data protection regulations (Regulations (EU) 2016/679 and (EU) 2018/1725). These justifications clarify that alternative data types were insufficient to achieve the objectives of bias identification given the sensitive nature of protected attributes and legal constraints on their usage. Such processing is confined to internal model improvement and does not extend to runtime operations, where no active bias remediation or demographic impact highlighting occurs in the deployed system.