{
  "root": {
    "name": "Change the background color of Slide 3 to light blue",
    "description": "Evaluate whether Slide 3's background color was changed to light blue, with no extraneous changes or errors.",
    "is_critical": true,
    "metadata": {},
    "children": [
      {
        "name": "Slide 3 background color is set to light blue",
        "description": "Checks if the background color of Slide 3 has been set to a light blue shade.",
        "is_critical": true,
        "metadata": {},
        "scorer": {
          "type": "function",
          "function_code": "def compute_score() -> tuple[str, float]:\n    from pptx import Presentation\n    from pptx.dml.color import RGBColor\n    import os\n    # Define what is considered 'light blue' in RGB\n    LIGHT_BLUE_RGB = [\n        (173, 216, 230), # LightBlue\n        (135, 206, 250), # LightSkyBlue\n        (176, 224, 230), # PowderBlue\n        (0, 191, 255),   # DeepSkyBlue\n        (135, 206, 235)  # SkyBlue\n    ]\n    TOLERANCE = 20 # Accept a range for minor agent imprecision\n    \n    if not os.path.exists(modified_ppt_path):\n        return (\"Modified PPT file not found.\", 0.0)\n    try:\n        prs = Presentation(modified_ppt_path)\n        slide = prs.slides[2] # slide 3 is index 2\n        bg = slide.background\n        # Try to get fill color\n        if not bg.fill or bg.fill.type != 1:\n            return (\"Slide 3 background does not have a solid fill.\", 0.0)\n        color = bg.fill.fore_color.rgb\n        if color is None:\n            return (\"Slide 3 background color not detected.\", 0.0)\n        r, g, b = color[0], color[1], color[2]\n        for lbr, lbg, lbb in LIGHT_BLUE_RGB:\n            if abs(r-lbr)<=TOLERANCE and abs(g-lbg)<=TOLERANCE and abs(b-lbb)<=TOLERANCE:\n                return (\"Slide 3 background color is light blue.\", 1.0)\n        return (f\"Slide 3 background color is not light blue. Detected RGB: ({r},{g},{b})\", 0.0)\n    except Exception as e:\n        return (f\"Error checking background color: {e}\", 0.0)\n"
        },
        "score": 0.0,
        "reason": "Slide 3 background does not have a solid fill."
      },
      {
        "name": "No Unwanted Changes",
        "description": "Checks that there are no extraneous changes to slide 3 except the intended subtitle addition.",
        "is_critical": false,
        "metadata": {},
        "scorer": {
          "type": "function",
          "function_code": "def compute_score() -> tuple[str, float]:\n    # No unwanted changes should be made to the slides.\n    from pptx import Presentation\n    from pptx.enum.shapes import MSO_SHAPE_TYPE\n    \n    pres_original = Presentation(original_ppt_path)\n    pres_modified = Presentation(modified_ppt_path)\n    slide_idx = 2\n    slide_original = pres_original.slides[slide_idx]\n    slide_modified = pres_modified.slides[slide_idx]\n\n    differences = []\n    \n    # Check if the number of shapes is the same\n    if len(slide_original.shapes) != len(slide_modified.shapes):\n        differences.append(f\"Shape count mismatch: original has {len(slide_original.shapes)}, modified has {len(slide_modified.shapes)}\")\n        return ('; '.join(differences), 0.0)\n    \n    # Compare each shape\n    for idx, (shape_orig, shape_mod) in enumerate(zip(slide_original.shapes, slide_modified.shapes)):\n        # Check shape type\n        if shape_orig.shape_type != shape_mod.shape_type:\n            differences.append(f\"Shape {idx}: type mismatch ({shape_orig.shape_type} vs {shape_mod.shape_type})\")\n            continue\n        \n        # Check shape position and size\n        if shape_orig.left != shape_mod.left:\n            differences.append(f\"Shape {idx}: left position differs ({shape_orig.left} vs {shape_mod.left})\")\n        if shape_orig.top != shape_mod.top:\n            differences.append(f\"Shape {idx}: top position differs ({shape_orig.top} vs {shape_mod.top})\")\n        if shape_orig.width != shape_mod.width:\n            differences.append(f\"Shape {idx}: width differs ({shape_orig.width} vs {shape_mod.width})\")\n        if shape_orig.height != shape_mod.height:\n            differences.append(f\"Shape {idx}: height differs ({shape_orig.height} vs {shape_mod.height})\")\n        \n        # Check text content if shape has text\n        if shape_orig.has_text_frame and shape_mod.has_text_frame:\n            text_orig = shape_orig.text_frame.text\n            text_mod = shape_mod.text_frame.text\n            if text_orig != text_mod:\n                differences.append(f\"Shape {idx}: text differs ('{text_orig}' vs '{text_mod}')\")\n            \n            # Check paragraph count\n            if len(shape_orig.text_frame.paragraphs) != len(shape_mod.text_frame.paragraphs):\n                differences.append(f\"Shape {idx}: paragraph count differs\")\n        elif shape_orig.has_text_frame != shape_mod.has_text_frame:\n            differences.append(f\"Shape {idx}: text frame presence mismatch\")\n        \n        # Check if shape is a table\n        if shape_orig.shape_type == MSO_SHAPE_TYPE.TABLE and shape_mod.shape_type == MSO_SHAPE_TYPE.TABLE:\n            table_orig = shape_orig.table\n            table_mod = shape_mod.table\n            \n            if len(table_orig.rows) != len(table_mod.rows):\n                differences.append(f\"Shape {idx}: table row count differs ({len(table_orig.rows)} vs {len(table_mod.rows)})\")\n            if len(table_orig.columns) != len(table_mod.columns):\n                differences.append(f\"Shape {idx}: table column count differs ({len(table_orig.columns)} vs {len(table_mod.columns)})\")\n            \n            # Compare table cell contents\n            for row_idx, (row_orig, row_mod) in enumerate(zip(table_orig.rows, table_mod.rows)):\n                for col_idx, (cell_orig, cell_mod) in enumerate(zip(row_orig.cells, row_mod.cells)):\n                    if cell_orig.text != cell_mod.text:\n                        differences.append(f\"Shape {idx}: table cell ({row_idx},{col_idx}) text differs ('{cell_orig.text}' vs '{cell_mod.text}')\")\n        \n        # Check if shape is a group\n        if shape_orig.shape_type == MSO_SHAPE_TYPE.GROUP and shape_mod.shape_type == MSO_SHAPE_TYPE.GROUP:\n            if len(shape_orig.shapes) != len(shape_mod.shapes):\n                differences.append(f\"Shape {idx}: group shape count differs ({len(shape_orig.shapes)} vs {len(shape_mod.shapes)})\")\n    \n    if differences:\n        return ('; '.join(differences[:5]), 0.0)  # Return first 5 differences to avoid overly long messages\n    \n    return ('No differences found', 1.0)\n"
        },
        "score": 1.0,
        "reason": "No differences found"
      },
      {
        "name": "No Extraneous Changes to Other Slides",
        "description": "Checks that there are no extraneous changes other slides.",
        "is_critical": false,
        "metadata": {},
        "scorer": {
          "type": "function",
          "function_code": "def compute_score() -> tuple[str, float]:\n    if len(ppt_diff.modified_slides) > 1:\n        return 'Undesired slide modifications', 0.0\n    \n    if len(ppt_diff.modified_slides) == 1 and ppt_diff.modified_slides[0][0].slide_number != 3:\n        return 'Undesired slide modifications', 0.0\n    if len(ppt_diff.added_slides) > 0:\n        return 'Undesired slide additions', 0.0\n    \n    if len(ppt_diff.removed_slides) > 0:\n        return 'Undesired slide removals', 0.0\n    \n    if len(ppt_diff.modified_animations) > 0 or len(ppt_diff.modified_transitions) > 0:\n        return 'Undesired animation/transition additions', 0.0\n        \n        \n    if len(ppt_diff.added_animations) > 0 or len(ppt_diff.added_transitions) > 0:\n        return 'Undesired animation/transition additions', 0.0\n\n    if len(ppt_diff.removed_animations) > 0 or len(ppt_diff.removed_transitions) > 0:\n        return 'Undesired animation/transition removals', 0.0\n    \n    return 'No extraneous changes.', 1.0\n"
        },
        "score": 1.0,
        "reason": "No extraneous changes."
      }
    ],
    "score": 0.0,
    "reason": "The criterion received a score of zero because the most critical requirement was not met - Slide 3's background color was not changed to light blue at all, as it doesn't even have a solid fill applied. While the student successfully avoided making any unwanted changes to Slide 3 or other slides in the presentation, the failure to complete the primary task of setting the background color means the core objective was not accomplished. Since changing the background color to light blue was marked as critical for this criterion, not completing this essential step resulted in an overall failing score."
  },
  "metadata": {
    "task": "Slide 3: Change the background color of the slide to light blue"
  }
}