(1) Personalized rubric with 1–5 scores for each criterion

Need Alignment
- 5: Focuses squarely on prompt compression with explicit post‑2024/SOTA coverage; cites credible sources (papers, GitHub, reputable technical blogs); connects to RAG/long‑context/multi‑turn use; covers key method families (LLMLingua/LongLLMLingua, Selective Context, LLMLingua‑2/RECOMP, GIST/AutoCompressor, token merging/pruning like ToMe/AdapLeR); reports measurable impact (CR, latency, cost, task metrics).
- 4: Strong focus on core methods and RAG/long‑context links; may miss either explicit post‑2024 discussion or complete sourcing; minor drift but still high value.
- 3: On‑topic but generic; limited or no sources; weak tie to RAG/metrics; misses several key families or recent directions.
- 2: Mostly secondary/high‑level content (generic cost talk/definitions) with little research detail; no credible sources; poor task linkage.
- 1: Largely irrelevant or off‑topic.

Content Depth
- 5: Deep yet practical: includes key formulas (e.g., self‑information I(x)=−log P(x)), algorithmic steps/pseudocode, trade‑offs, and benchmarks; provides code snippets and install/integration (e.g., llmlingua, selective‑context, LangChain); evaluation harness for CR/latency/cost; pragmatic tuning tips; covers all major families (incl. token merging/pruning).
- 4: Solid technical depth with code and clear algorithms; minor gaps (e.g., missing one family or limited quantitative benchmarks).
- 3: Understandable but incomplete; few methods in depth; missing formulas or code; light on trade‑offs/benchmarks or implementation details.
- 2: Surface‑level or overly theoretical without actionable steps; lacks algorithms, benchmarks, and usable code.
- 1: Strongly mismatched (marketing‑style fluff or dense theory with no applicability).

Tone
- 5: Concise, professional, research‑oriented; objective and careful with claims; no hype, jokes, or forced metaphors; friendly and direct.
- 4: Mostly professional with minor verbosity or occasional flourish that doesn’t hinder clarity.
- 3: Neutral/robotic or generic; acceptable but not tuned to the preferred research tone.
- 2: Marketing‑ish, flowery metaphors, or playful tone that distracts from technical content.
- 1: Inappropriate, flippant, or otherwise uncomfortable.

Explanation Style
- 5: Highly scannable: clear headings, short paragraphs, bullet lists; comparison/summary tables; concrete code blocks and step‑by‑step algorithms; quickstart/install commands; closes with a concise summary and vetted reading list (papers/GitHub/blogs).
- 4: Well‑organized with bullets and either a table or code; minor verbosity or one missing preferred element.
- 3: Some structure but still effortful to scan (long paragraphs) and missing both tables and code or step‑by‑step guidance.
- 2: Dense prose, heavy analogies, or unclear organization; hard to skim or apply.
- 1: Completely unstructured or incompatible with the preferred learning style.