{
  "query": "How is the way classes are taught changing because of AI?",
  "query-id": 87,
  "gold_response": "Generative AI is reshaping university teaching in three linked areas: course and assessment design, grading and feedback workflows, and the instructor’s day-to-day role. Below is the “how,” with higher-ed workflows you can implement.\n\n1) Course design and constructive alignment with AI\n- Update learning outcomes: Add AI-enabled outcomes that emphasize framing, verification, and synthesis (e.g., “students will use AI to generate options, evaluate credibility, and justify decisions”). Align these to program competencies rather than tool skills.\n- Redesign for authenticity and process evidence: Shift from easily automated products (generic essays/problem sets) to tasks that require judgment and provenance.\n  • Deliverables: Prompt(s) used, rationale for choices, AI outputs, revision notes, citations/verification trail, and final synthesis. \n  • Rubric criteria: Problem framing, use-of-AI strategy, evidence/verification, disciplinary reasoning, and clarity. \n- Flip the classroom for sense-making: Pre-class, students use AI for exploration; in-class time is used for evaluating outputs, running counterfactuals, and defending decisions.\n- Micro-example (200-level Marketing): Students produce a positioning brief by (a) drafting a prompt, (b) critiquing AI-generated personas against research, (c) revising, and (d) submitting a reflection on trade-offs. The rubric weights “evidence and revision rationale” more than the final prose.\n\n2) Assessment and grading workflows (rubric-aligned with calibration)\n- Build rubrics for observable decisions: Write criterion descriptors at 3–4 levels with concrete indicators (e.g., “verification: cites 3+ sources, explains discrepancies, applies domain tests”). Create 3–5 annotated exemplars per level.\n- Calibrate human markers first: Run a norming session with TAs on 15–20 de-identified samples. 
Discuss disagreements, tighten descriptors, and finalize comment banks tied to each criterion.\n- Add AI pre-scoring and annotation (summative):\n  • Step 1: Prepare a structured evaluation template mirroring the rubric (criteria, level descriptors, and required evidence). \n  • Step 2: Batch pre-score submissions. Require justification snippets that reference the student’s text (line numbers/quotes). \n  • Step 3: Triage by rules: auto-accept clear cases; route borderline or low-confidence items for human review; flag anomalies (novel formats, ambiguous citations). \n  • Step 4: Human moderation: markers review flags/outliers, adjust scores, and add custom notes. Keep an audit log of overrides and reasons.\n  • Step 5: Reliability check: spot-audit a random 10% each week; retrain/refine prompts if drift occurs.\n- Formative feedback loop (feed-forward):\n  • Generate criterion-level comments plus “next-step” actions (e.g., “apply test X; add source Y; run a sensitivity check; rewrite claim using evidence template”). \n  • Require a short “feedback-to-action” memo with the next draft; grade the memo lightly to close the loop.\n- Micro-examples:\n  • Psychology lab reports: AI pre-scores Methods and Results for structure and reporting standards; TAs moderate flagged papers; common issues auto-populate targeted guidance. \n  • Intro programming: Unit tests run on each commit, categorize errors (logic vs. API), and suggest specific remediation tasks; milestone rubrics score design rationale and test coverage, not just passing code.\n\n3) Adaptive/personalized learning and individualized feedback\n- Mastery mapping: Define granular skills for the course (e.g., “form a causal claim,” “run robustness check,” “APA citations”). AI tutors in the LMS map interactions and quick checks to this skill model.\n- Real-time formative assessment: \n  • During practice, the system classifies errors and offers tiered hints (concept → example → scaffold). 
\n  • Difficulty adapts based on response time/accuracy; students who master a skill get transfer problems; others get prerequisite refreshers.\n- Personalized feedback artifacts:\n  • Draft-to-revision guidance: margin notes aligned to rubric criteria, plus a customized checklist for the next submission. \n  • Reflection prompts: “What changed and why?” to develop metacognition; instructors review the delta rather than rereading from scratch.\n- Micro-examples:\n  • History writing: AI verifies citations, flags weak claims, and suggests primary-source types to consult; students submit a revision memo mapping changes to rubric criteria. \n  • Data science: After a failed model attempt, the system recommends targeted readings and a short practice set on feature leakage; office hours are triggered if two attempts fail.\n\n4) Instructor role and operations\n- Designer and coach: Instructors invest time upfront in aligning outcomes to Revised Bloom’s Taxonomy (RBT) levels, rubric design, exemplars, and AI usage policies; during delivery, focus shifts to coaching, critique, and scenario variation.\n- Analytics-driven interventions: Dashboards surface students with repeated criterion failures (e.g., verification); instructors run mini-workshops or assign just-in-time micro-tasks.\n- Triage at scale: An AI Q&A assistant handles routine logistics and clarifications; conceptual or ethical questions escalate to the instructor with context threads attached.\n- Academic integrity by design: Require process evidence, periodic oral defenses or in-class checkpoints, and dataset/source declarations. Grade the reasoning trail, not only the final artifact.\n- Micro-example (Capstone): Teams log weekly AI-assisted design decisions; the instructor reviews decision quality against the rubric and runs targeted critiques where evidence or testing is thin.\n\n5) Quick start (next term)\n- Identify 2–3 priority outcomes where AI can accelerate practice but still requires human judgment (e.g., evidence evaluation). 
\n- Redesign one major and one minor assignment to include process evidence and a rubric criterion for “use-of-AI strategy and verification.” \n- Calibrate: run a short norming exercise with exemplars; pilot AI pre-scoring on a low-stakes task; set escalation rules. \n- Implement a feedback-to-action requirement on all drafts to close the loop and track improvement.",
  "gold_information": [
    "AI shifts course design toward outcomes emphasizing framing, verification, and synthesis with AI assistance.",
    "Assignments are redesigned for authenticity and require process evidence alongside final products.",
    "Student deliverables include prompts used, rationale, AI outputs, revision notes, citations, and a final synthesis.",
    "Rubrics add criteria for problem framing, AI-use strategy, evidence verification, disciplinary reasoning, and clarity.",
    "Class time is flipped to focus on evaluating AI outputs, testing alternatives, and defending decisions.",
    "Grading uses rubrics with concrete level descriptors and annotated exemplars.",
    "Instructors calibrate markers through norming sessions before high-stakes grading.",
    "AI performs pre-scoring and annotation using a structured template aligned to the rubric.",
    "Workflows triage clear cases automatically and route borderline submissions for human review.",
    "Human moderators adjust scores, add notes, and maintain an audit log of overrides.",
    "Reliability is monitored via periodic spot audits and prompt refinement.",
    "Formative feedback includes criterion-level comments and specific next-step actions.",
    "Students submit brief feedback-to-action memos to close the feedback loop.",
    "Automated evaluation classifies programming or task errors and suggests targeted remediation.",
    "Adaptive learning maps granular skills and tracks mastery within a learning platform.",
    "Practice sessions classify errors and provide tiered hints from concept to scaffold.",
    "Difficulty adapts to response patterns with transfer problems for mastery and refreshers for gaps.",
    "Personalized feedback supplies margin notes aligned to rubric criteria and customized checklists.",
    "Reflection prompts build metacognition and let instructors review changes rather than reread full drafts.",
    "AI verifies citations, flags weak claims, and recommends sources during writing activities.",
    "After repeated failures, the system recommends targeted readings and practice and triggers office hours.",
    "Instructors shift from content delivery to design, coaching, critique, and scenario variation.",
    "Analytics dashboards highlight repeated criterion failures for just-in-time interventions.",
    "An AI assistant handles routine questions while escalating complex issues with context.",
    "Academic integrity is supported by process evidence, oral defenses, checkpoints, and source declarations.",
    "Grading emphasizes the reasoning trail as well as the final artifact.",
    "Quick-start steps include selecting priority outcomes suited to AI-supported practice.",
    "Courses add a rubric criterion for AI-use strategy and verification.",
    "Teams run marker calibration, pilot AI pre-scoring on low-stakes tasks, and define escalation rules.",
    "All drafts include a feedback-to-action requirement to track improvement."
  ]
}