{
  "root": {
    "name": "Create hyperlinks from each experiment name in slide 3 to corresponding Wikipedia pages",
    "description": "Evaluates whether the agent correctly created hyperlinks from each experiment name in slide 3 to the appropriate Wikipedia pages about the experiment or its researcher(s).",
    "is_critical": true,
    "metadata": {},
    "children": [
      {
        "name": "All experiment names in slide 3 are hyperlinked",
        "description": "Checks if every experiment name text element in slide 3 has a hyperlink.",
        "is_critical": true,
        "metadata": {},
        "scorer": {
          "type": "function",
          "function_code": "def compute_score() -> tuple[str, float]:\n    from pptx import Presentation\n    prs = Presentation(modified_ppt_path)\n    # Slide 3 is index 2\n    slide = prs.slides[2]\n    # Find all text runs that are experiment names by comparing to original\n    original_names = ['Norman Triplett Experiment', 'Milgram’s Shock Experiment', 'Leon Festinger studies of Cognitive Dissonance', 'David Hamilton and Robert Gifford (1976)', 'Richard T. La Piere’s hospitality Study']\n    corresponding_experiments = ['Social Facilitation', 'Obedience', 'When Prophecy Fails', 'Illusionary Correlation', 'Attitudes Vs Actions']\n    # Create pairs of original name and corresponding experiment\n    experiment_pairs = list(zip(original_names, corresponding_experiments))\n    \n    # Collect all hyperlinked text\n    hyperlinked_texts = set()\n    for shape in slide.shapes:\n        if not shape.has_text_frame:\n            continue\n        for p in shape.text_frame.paragraphs:\n            for run in p.runs:\n                text = run.text.strip()\n                hlink = run.hyperlink.address if hasattr(run, 'hyperlink') else None\n                if hlink:\n                    hyperlinked_texts.add(text)\n    \n    # Check if either the original name or corresponding experiment is linked for each pair\n    missing_pairs = []\n    for original, corresponding in experiment_pairs:\n        if original not in hyperlinked_texts and corresponding not in hyperlinked_texts:\n            missing_pairs.append(f\"{original} / {corresponding}\")\n    \n    if missing_pairs:\n        reason = f\"The following experiment pairs on slide 3 are missing hyperlinks (neither original nor corresponding name is linked): {missing_pairs}\" \n        score = 1.0 - len(missing_pairs)/len(experiment_pairs)\n        return (reason, max(0.0, score))\n\n    reason = \"All experiment name pairs on slide 3 have at least one hyperlinked version (either original name or corresponding experiment name).\"\n    return (reason, 1.0)\n"
        },
        "score": 1.0,
        "reason": "All experiment name pairs on slide 3 have at least one hyperlinked version (either original name or corresponding experiment name)."
      },
      {
        "name": "Hyperlink destinations are correct Wikipedia pages",
        "description": "Checks that the hyperlinks point to Wikipedia pages for the experiment or the researcher(s) as appropriate, scoring them separately.",
        "is_critical": true,
        "metadata": {},
        "scorer": {
          "type": "function",
          "function_code": "def compute_score() -> tuple[str, float]:\n    import re\n    from pptx import Presentation\n    prs = Presentation(modified_ppt_path)\n    slide = prs.slides[2]\n\n    # Define valid Wikipedia URLs for each experiment pair (original name and corresponding experiment)\n    valid_urls = {\n        'Norman Triplett Experiment': [\n            'https://en.wikipedia.org/wiki/Norman_Triplett',\n            'https://en.wikipedia.org/wiki/Social_facilitation'\n        ],\n        'Social Facilitation': [\n            'https://en.wikipedia.org/wiki/Norman_Triplett',\n            'https://en.wikipedia.org/wiki/Social_facilitation'\n        ],\n        'Milgram\\'s Shock Experiment': [\n            'https://en.wikipedia.org/wiki/Stanley_Milgram',\n            'https://en.wikipedia.org/wiki/Milgram_experiment'\n        ],\n        'Obedience': [\n            'https://en.wikipedia.org/wiki/Stanley_Milgram',\n            'https://en.wikipedia.org/wiki/Milgram_experiment'\n        ],\n        'Leon Festinger studies of Cognitive Dissonance': [\n            'https://en.wikipedia.org/wiki/Leon_Festinger',\n            'https://en.wikipedia.org/wiki/Cognitive_dissonance',\n            'https://en.wikipedia.org/wiki/When_Prophecy_Fails'\n        ],\n        'When Prophecy Fails': [\n            'https://en.wikipedia.org/wiki/Leon_Festinger',\n            'https://en.wikipedia.org/wiki/Cognitive_dissonance',\n            'https://en.wikipedia.org/wiki/When_Prophecy_Fails'\n        ],\n        'David Hamilton and Robert Gifford (1976)': [\n            'https://en.wikipedia.org/wiki/Illusory_correlation',\n            'https://en.wikipedia.org/wiki/David_Hamilton',\n            'https://en.wikipedia.org/wiki/Robert_Gifford'\n        ],\n        'Illusory Correlation': [\n            'https://en.wikipedia.org/wiki/Illusory_correlation',\n            'https://en.wikipedia.org/wiki/David_Hamilton',\n            'https://en.wikipedia.org/wiki/Robert_Gifford'\n        ],\n        'Richard T. La Piere\\'s hospitality Study': [\n            'https://en.wikipedia.org/wiki/Richard_T._LaPiere',\n            'https://en.wikipedia.org/wiki/Attitude-behaviour_gap'\n        ],\n        'Attitude Vs Actions': [\n            'https://en.wikipedia.org/wiki/Richard_T._LaPiere',\n            'https://en.wikipedia.org/wiki/Attitude-behaviour_gap'\n        ]\n    }\n\n    # Collect all text runs with hyperlinks\n    hyperlinked_texts = []\n    for shape in slide.shapes:\n        if not shape.has_text_frame:\n            continue\n        for p in shape.text_frame.paragraphs:\n            for run in p.runs:\n                text = run.text.strip()\n                if hasattr(run, 'hyperlink') and run.hyperlink.address:\n                    hyperlinked_texts.append((text, run.hyperlink.address))\n\n    if not hyperlinked_texts:\n        return (\"No hyperlinks found on slide 3.\", 0.0)\n\n    # Check if each hyperlinked text points to a valid URL for that experiment\n    incorrect_links = []\n    correct_links = 0\n    \n    for text, url in hyperlinked_texts:\n        if text in valid_urls:\n            # Check if the URL is one of the valid URLs for this experiment\n            if url in valid_urls[text]:\n                correct_links += 1\n            else:\n                incorrect_links.append((text, url))\n\n    if not incorrect_links:\n        return (\"All hyperlinks correctly point to relevant Wikipedia pages for the experiments or researchers.\", 1.0)\n    \n    # Calculate score based on correct vs incorrect links\n    total_relevant_links = len([text for text, url in hyperlinked_texts if text in valid_urls])\n    score = correct_links / total_relevant_links if total_relevant_links > 0 else 0.0\n    \n    reason = f\"The following experiment links point to incorrect Wikipedia pages: {incorrect_links}\"\n    print(reason)\n    return (reason, max(0.0, score))\n"
        },
        "score": 1.0,
        "reason": "All hyperlinks correctly point to relevant Wikipedia pages for the experiments or researchers."
      },
      {
        "name": "No extraneous hyperlinks on slide 3",
        "description": "Ensures that there are no hyperlinks on slide 3 except on experiment names.",
        "is_critical": false,
        "metadata": {},
        "scorer": {
          "type": "function",
          "function_code": "def compute_score() -> tuple[str, float]:\n    from pptx import Presentation\n    prs = Presentation(modified_ppt_path)\n    slide = prs.slides[2]\n    # Find all experiment names from original\n    original_prs = Presentation(original_ppt_path)\n    original_slide = original_prs.slides[2]\n    experiment_names = set()\n    for shape in original_slide.shapes:\n        if not shape.has_text_frame:\n            continue\n        for p in shape.text_frame.paragraphs:\n            for run in p.runs:\n                text = run.text.strip()\n                if text:\n                    experiment_names.add(text)\n    extraneous = []\n    for shape in slide.shapes:\n        if not shape.has_text_frame:\n            continue\n        for p in shape.text_frame.paragraphs:\n            for run in p.runs:\n                text = run.text.strip()\n                hlink = run.hyperlink.address if hasattr(run, 'hyperlink') else None\n                if hlink and text not in experiment_names:\n                    extraneous.append((text, hlink))\n    if extraneous:\n        reason = f\"Found extraneous hyperlinks on slide 3: {extraneous}\"\n        score = 1.0 - len(extraneous)/(len(extraneous)+len(experiment_names))\n        return (reason, max(0.0, score))\n    return (\"No extraneous hyperlinks found on slide 3.\", 1.0)\n"
        },
        "score": 1.0,
        "reason": "No extraneous hyperlinks found on slide 3."
      }
    ],
    "score": 1.0,
    "reason": "The criterion received a perfect score because all experiment names on slide 3 were properly hyperlinked to appropriate Wikipedia pages, with each link correctly directing to either the specific experiment or its associated researcher's page. The implementation was clean and precise, with no unnecessary or extraneous hyperlinks cluttering the slide. Since both critical requirements were fully met - complete coverage of experiment names with hyperlinks and accurate destination URLs - the criterion achieved the highest possible score."
  },
  "metadata": {
    "task": "Create hyperlinks from each experiment name in slide 3 to corresponding Wikipedia pages about either the experiment or the researcher(s)"
  }
}