(1) Personalized rubric with 1–5 scores for each criterion

Need Alignment
- 1: Largely off-topic or ignores the user’s actual question. Misses the requested angles entirely; no clear definition or relevance.
- 2: On-topic but focuses on tangential aspects (e.g., stakeholders, governance/ethics frameworks, market/CSR significance, narrative vision) instead of the requested mechanics. Does not map to the user’s enumerated sub-questions; few/no concrete examples.
- 3: Addresses the topic generically. Partial coverage of the requested angles, but not organized by the user’s list; omits key pieces (e.g., what-if-not, can-break, how-broken) or uses vague examples.
- 4: Covers most requested angles with solid technical details and some examples; minor drift to less relevant content or misses examples for a couple of sections.
- 5: Directly mirrors the user’s enumerated sub-questions 1:1. Prioritizes concrete guardrail mechanisms, threat/attack types, and policy-enforcement flows. Names real tools/standards where helpful. Provides a specific example for each sub-question. Avoids governance/market digressions.

Content Depth (Depth/Specificity)
- 1: Gross mismatch. Either highly theoretical/academic or overly superficial with metaphors and no technical substance.
- 2: Too basic or too abstract. Lacks key terminology and mechanisms (e.g., input/output filtering, RLHF, RAG) or is so jargon-heavy that it’s impractical.
- 3: Understandable but mismatched. Gives broad explanations without the specific mechanisms, threat variants, or operational details; examples are generic or sparse.
- 4: Generally right depth. Uses correct terms and some named tools/standards; may miss a few specifics (e.g., attack variants, stepwise flows) or be slightly verbose.
- 5: Perfect match. Concise, technically precise, and practical: includes mechanisms (input/output filtering, RLHF/Constitutional AI, RAG), threat types (prompt injection, jailbreaks), defense patterns (tool/permission gating, validation), and named tools/standards (e.g., Llama Guard, NeMo Guardrails, OWASP LLM Top 10, NIST AI RMF, ISO/IEC 23894). Concrete, brief examples.

Tone
- 1: Offensive, condescending, or otherwise inappropriate.
- 2: Noticeably hype/marketing, patronizing, or overly chatty (greetings, “for beginners” framing, heavy metaphors).
- 3: Functionally neutral but with fluff (narrative intros, analogies, rhetorical framing) or mild salesy phrasing.
- 4: Professional and neutral with minor lapses (an occasional slogan or industry/CSR framing).
- 5: Crisp, direct, factual, and impersonal. No greetings, hype, metaphors, or promotional language. Reads like an expert brief.

Explanation Style
- 1: Unstructured essay; meandering; hard to scan; ignores the user’s requested structure.
- 2: Some headings but poor logical flow; mixes topics; examples are sparse or detached from points.
- 3: Understandable yet not aligned to the user’s preferred structure. Lacks a 1:1 mapping to the user’s ordered list; examples not paired per section; cluttered paragraphs.
- 4: Mostly structured and easy to scan; uses bullets and short sections; minor deviations from the requested order or missing examples in a few places.
- 5: Exact preferred style: scannable, numbered sections mapping 1:1 to the user’s sub-questions; concise bullets under each; each section includes a specific example; processes presented in clear step order (e.g., policy → training → inference → operations); no off-topic detours.