**Article 9**

**Establishment and Lifecycle Management of the Risk Management System**  
Judicial Insight Technologies Limited has established a comprehensive risk management system (RMS) specifically tailored for the Judicial Insight Assistant, a high-risk AI system intended for use in judicial decision-making contexts. This RMS is implemented as a continuous, iterative framework applied throughout the entire lifecycle of the system, from initial design and development through deployment, maintenance, and eventual decommissioning. The RMS encompasses scheduled quarterly reviews and ad hoc updates triggered by system modifications, emerging regulatory guidance, or significant findings from post-market monitoring. Documentation of all RMS activities—including risk identification sessions, mitigation implementations, testing outcomes, and update records—is maintained in a secured version-controlled repository accessible to authorised compliance and audit personnel.

**Identification and Analysis of Risks to Health, Safety, and Fundamental Rights**  
The identification and analysis phase systematically examines risks associated with the Judicial Insight Assistant’s use in accordance with its intended purpose: assisting judicial authorities in researching legal precedents and interpreting case facts. Legal domain experts, AI safety engineers, and human rights advisors collaboratively conducted a structured hazard analysis workshop using Failure Modes and Effects Analysis (FMEA), focusing on potential impacts such as erroneous legal interpretations, biased fact pattern classification, misinformation propagation, and compromised confidentiality. The analysis considered foreseeable scenarios including reliance on AI outputs for judicial rulings, incomplete or outdated legal references, and inadvertent disclosure of sensitive case information. Key risks identified include: (i) inaccurate or incomplete legal recommendations leading to misapplication of law; (ii) bias in fact pattern classification causing disparate impacts on protected groups; and (iii) risks to fundamental rights related to data privacy and the right to effective judicial remedy.

**Estimation and Evaluation of Risks Including Reasonably Foreseeable Misuse**  
Risk estimation employed quantitative and qualitative methods, leveraging performance metrics derived from extensive system testing. The transformer-based natural language understanding module was assessed on a validation set of 1.2 million anonymized legal documents spanning multiple EU jurisdictions, with a reported F1-score of 0.89 in precedent retrieval and 0.92 in fact extraction accuracy. The gradient boosted decision tree classifiers were benchmarked with precision and recall exceeding 0.85 on a labeled dataset of 500,000 case fact patterns. Potential misuse scenarios were examined through scenario analysis, including: (a) overreliance on system outputs without adequate judicial oversight; (b) adversarial attempts to manipulate input data; and (c) misuse of subscription credentials enabling unauthorized query access. Each scenario underwent probability-impact modeling, yielding residual risk ratings ranging from low to moderate after risk controls. The evaluation also considered operational contexts and user profiles, emphasizing the expertise level of judges and clerks as mitigating factors for certain human-in-the-loop risks.

**Integration of Post-Market Data and Continuous Risk Evaluation**  
Judicial Insight Technologies incorporates post-market monitoring data, obtained through automated telemetry and user feedback channels, to identify evolving or previously unrecognized risks. Monthly aggregated anonymized logs capture system performance indicators, error rates, and usage patterns, while end-user surveys ascertain issues related to interpretability and trustworthiness. Anomaly detection algorithms flag deviations in output consistency or unusual query patterns potentially indicative of emerging vulnerabilities or misuse. This data informs biannual RMS reviews, enabling dynamic recalibration of risk levels and adaptation of corresponding mitigation strategies aligned with Article 72 requirements. For example, recent monitoring identified a minor uptick in false positive fact pattern classifications in specific case categories, leading to targeted retraining of the GBDT component with augmented datasets and revised feature weighting.

**Selection and Implementation of Risk Mitigation Measures**  
Risk mitigation steps adhere strictly to reducing or eliminating identified hazards through a combination of design, development, and operational controls. Eliminations of risks were prioritized where technically feasible: the system’s transformer models incorporate state-of-the-art bias mitigation techniques, including adversarial data augmentation and fairness-aware training algorithms, reducing bias-induced error by an estimated 25% compared to baseline models. For residual risks, mitigating controls include configurable confidence thresholds that flag low-certainty outputs for mandatory human review, minimizing the risk of erroneous legal recommendations. Additionally, robust access control mechanisms, including multi-factor authentication and role-based permissions, reduce unauthorized use risks. Detailed usage guidelines and comprehensive training materials are provided to deployers to foster informed and responsible operation, reflecting consideration of the expected professional background and familiarity of judges and legal clerks.

**Consideration of Combined Effects and Regulatory Interactions**  
Risk management measures were implemented with attention to the combined effects of requirements across relevant AI Act provisions. For instance, ensuring transparency (in compliance with Article 13) enhances user capacity to interpret and challenge AI outputs, thereby reducing the likelihood of misuse or overreliance. Likewise, security measures conform with the system’s data protection policies, harmonizing risk mitigation across technical documentation and privacy compliance frameworks. The RMS reflects a balance between enhancing safety and preserving system utility and user autonomy—continuous human oversight remains integral, preserving the role of judicial evaluators in decision-making, thus preventing risk concentration in autonomous AI outputs.

**Residual Risk Assessment and Acceptability Judgments**  
Following implementation of identified risk management measures, residual risks were quantitatively and qualitatively reassessed. Residual hazards such as the low probability of incorrect legal interpretations were deemed acceptable within the operational context due to layered mitigations, including human review mandates and high model performance metrics. Comprehensive safety case documentation demonstrates that combined residual risks do not exceed thresholds established by relevant legal standards and internal acceptance criteria. These judgements are periodically revisited in response to post-market data and evolving jurisprudence impacting system use cases.

**Testing Procedures to Support Risk Management**  
The Judicial Insight Assistant underwent progressive rounds of testing, starting from unit and integration tests of components to system-level validation, culminating in independent performance audits by accredited AI testing laboratories. Testing metrics aligned with intended purposes include precision, recall, accuracy, interpretability scores, and bias indices, leveraged to verify consistent performance and compliance with articulated safety requirements. Real-world pilot deployments in simulated judicial environments enabled contextual performance validation, including stress testing under variable workloads and adversarial robustness testing against input perturbations designed to mimic manipulation attempts. All testing phases were governed by pre-established test plans with defined pass/fail criteria consistent with regulatory expectations and best practices relevant in 2025.

**Testing Throughout Development and Prior to Market Release**  
Testing was conducted iteratively throughout the development lifecycle, with integration of continuous integration/continuous deployment (CI/CD) pipelines including automated unit tests and model retraining validation checks. Prior to market release, a comprehensive validation phase was completed, based on a blend of historical legal datasets and synthetic cases designed to cover boundary conditions. Probabilistic thresholds were set to minimize false negatives on critical legal criteria at ≤1%, balancing sensitivity and specificity to optimize judicial reliance without compromising safety. The final submission package includes test artifacts, metrics reports, and traceability matrices linking risk management outcomes to testing evidence.

**Special Considerations for Vulnerable Groups**  
Due consideration was given to potential impacts on minors and other vulnerable groups in judicial proceedings, despite the system primarily serving legal professionals. Risk assessments accounted for scenarios involving cases with vulnerable populations, ensuring that classification biases do not propagate unfair treatment or discrimination. Data annotation practices included sensitivity labels for cases pertaining to children or protected groups, guiding specialized model fine-tuning to prevent adverse impacts. Informed training of deployers emphasized awareness of heightened risks and the necessity for cautious interpretation where vulnerable persons are involved, supporting protective safeguards consistent with fundamental rights obligations.