**Article 9**

**Establishment and Scope of the Risk Management System**  
Judicial Insight Technologies Limited has established a documented risk management system aligned with its obligations for high-risk AI systems. This system initiates with a comprehensive initial risk evaluation conducted immediately prior to the first deployment of Judicial Insight Assistant. The initial risk assessment incorporated identification and analysis of known and reasonably foreseeable risks related to the system’s assistance in judicial legal research and fact pattern analysis, specifically focusing on risks to fundamental rights—such as potential biases affecting legal interpretation—and risks to judicial decision integrity. This initial evaluation was supported by empirical testing on a representative dataset of over 1 million legal documents, including case law and statutory texts, curated to reflect diverse legal contexts across EU Member States. Risk analysis included assessment of performance degradation risks, misclassification of legal facts, and incorrect precedent suggestions that could adversely impact judicial outcomes.

**Risk Estimation and Evaluation Process**  
The initial risk estimation employed quantitative metrics including precision, recall, and F1 scores applied to fact pattern classification and legal reference retrieval, with thresholds set at a minimum F1 score of 0.92 to ensure high reliability in legal analysis tasks. Scenario-based risk evaluations considered both intended use cases and foreseeable misuse patterns, such as overreliance by clerks on AI output without human critical review and potential introduction of historical bias embedded in training data. Extensive adversarial testing examined vulnerability to input perturbations mimicking ambiguous or incomplete case facts. Due to the critical context, failure modes leading to erroneous legal conclusions were prioritized and modeled to estimate consequences to judicial fairness and accuracy.

**Post-Market Data Integration and Continuous Review**  
Post-market monitoring measures include automated logging of system outputs and user feedback collection channels designed to capture and record any incidents or user-reported failures. However, the implemented risk management protocol currently entails no scheduled, systematic post-deployment risk reassessment cycles. Instead, updates to the risk evaluation and mitigation measures are triggered on an ad hoc basis when significant failures or systemic issues are identified through these channels. This includes major reported incidents affecting multiple users or indications of systemic bias emerging from feedback analysis. Consequently, the protocol relies on discrete intervention points for risk management updates rather than a continuous iterative review throughout the system lifecycle.

**Design-Driven Risk Mitigation and Residual Risk Management**  
Risk management measures prioritized elimination and reduction of identified risks through the system’s technical design. The hybrid architecture combining transformer-based natural language understanding models with gradient boosted decision trees for classification ensures enhanced interpretability and allows for modular error tracing. Model training employed rigorous debiasing techniques and continuous adversarial testing across 250,000 synthetic and real-world case scenarios to reduce input ambiguity risks. Additionally, output confidence scores and discrepancy flags are integrated to assist human users in identifying uncertain outputs, thereby supporting informed judicial scrutiny. Residual risks, such as potential misinterpretation of rare legal constructs, are addressed through detailed user documentation and training materials made available to deployers, tailored to the technical competency and operational context of judges and legal clerks. This documentation includes guidance on system limitations, proper interpretation of AI outputs, and recommended verification workflows to mitigate risks inherent to AI-assisted judicial research.

**Testing Regimen and Performance Validation**  
Testing was performed iteratively during development phases and verified immediately prior to market release against predefined metrics tailored to the system’s legal research domain. Benchmarks involved cross-validation on representative datasets comprising 95,000 annotated case summaries and legal precedents. Probability thresholds were dynamically tuned to balance false positive and false negative risks appropriately for judicial settings. Where feasible, real-world pilot testing was conducted with anonymized court data to validate consistency and reliability in operational conditions. Testing documented system performance under standard, edge, and stress conditions simulating complex multi-jurisdictional query inputs. Results confirmed stable performance without significant degradation across diverse legal contexts. Further tests examined the system’s handling of inputs potentially involving persons under 18 or other vulnerable groups, ensuring no disproportionate adverse impacts via bias or misclassification.

**Interaction with Other Applicable Risk Management Frameworks**  
Judicial Insight Technologies Limited maintains its risk management system distinctly, though it remains designed with extensibility in mind, facilitating possible integration or combination with deployer-established risk processes under national judicial frameworks. By documenting all risk management activities—including design choices, test results, and incident-triggered updates—in a centralized quality management system, the provider ensures traceability and accountability without presuming access to or control over end-users’ internal monitoring procedures.