{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 51,
   "id": "49a9afe3-2d9e-4c23-8f3d-90a9431e7b88",
   "metadata": {},
   "outputs": [],
   "source": [
    "import json"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 52,
   "id": "890679e0-2ff3-42b1-9f8d-14e517bf6514",
   "metadata": {},
   "outputs": [],
   "source": [
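    "# Despite the .json extension, the file is read as JSON Lines: one JSON object per line.\n",
    "raw_data = []\n",
    "\n",
    "with open('./data/raw.json', encoding='utf-8') as f:\n",
    "    for line in f:\n",
    "        raw_data.append(json.loads(line))"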
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 53,
   "id": "9f33af6a-9faf-4aaf-b3a3-279af9280ad3",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "142"
      ]
     },
     "execution_count": 53,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "len(raw_data)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 54,
   "id": "99ef854e-576f-4ef4-a349-40998b12f47f",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "{'question_id': 'Applying_0',\n",
       " 'question': 'Can you demonstrate how to solve the algebraic equation: 2*x=24?',\n",
       " 'prompt': 'Can you demonstrate how to solve the algebraic equation: 2*x=24? Your answer MUST NOT contain rich text. Your answer should be within 150 words.',\n",
       " 'temperature': 0.5,\n",
       " 'model_id': 'gpt-4',\n",
       " 'level': 'Applying',\n",
       " 'answers': {'answer1': {'answer': \"Sure, to solve the equation 2*x=24, you need to isolate the variable x. To do this, you need to get rid of the 2 that is currently being multiplied by x. You can do this by dividing both sides of the equation by 2. This is allowed in algebra as long as you do the same thing to both sides of the equation to keep it balanced. So, divide both sides by 2: (2*x)/2 = 24/2. The 2's cancel on the left side, leaving you with x = 24/2. Simplify the right side by dividing 24 by 2 to get 12. So, x = 12 is the solution to the equation 2*x=24.\",\n",
       "   'answer_id': '8s97GnDd'},\n",
       "  'answer2': {'answer': 'To solve the algebraic equation 2*x=24, start by isolating the variable x. You can do this by dividing both sides of the equation by 2. When you divide 2*x by 2, you are left with x. When you divide 24 by 2, you get 12. Therefore, the solution to the equation 2*x=24 is x=12.',\n",
       "   'answer_id': 'gph2b8nj'}}}"
      ]
     },
     "execution_count": 54,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "raw_data[120]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 55,
   "id": "a6818e7a-adb4-433c-a610-78fb93e35a4a",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'Can you demonstrate how to solve the algebraic equation: 2*x=24?'"
      ]
     },
     "execution_count": 55,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "target_question = raw_data[120][\"question\"]\n",
    "target_question"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 56,
   "id": "b72c3d1a-6eac-4e55-9af7-6ceb59b78a19",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "({'answer': \"Sure, to solve the equation 2*x=24, you need to isolate the variable x. To do this, you need to get rid of the 2 that is currently being multiplied by x. You can do this by dividing both sides of the equation by 2. This is allowed in algebra as long as you do the same thing to both sides of the equation to keep it balanced. So, divide both sides by 2: (2*x)/2 = 24/2. The 2's cancel on the left side, leaving you with x = 24/2. Simplify the right side by dividing 24 by 2 to get 12. So, x = 12 is the solution to the equation 2*x=24.\",\n",
       "  'answer_id': '8s97GnDd'},\n",
       " '8s97GnDd')"
      ]
     },
     "execution_count": 56,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "raw_data[120][\"answers\"][\"answer1\"], raw_data[120][\"answers\"][\"answer1\"][\"answer_id\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 57,
   "id": "a7acae19-2892-49f7-8171-be3f921a1c6f",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "({'answer': 'To solve the algebraic equation 2*x=24, start by isolating the variable x. You can do this by dividing both sides of the equation by 2. When you divide 2*x by 2, you are left with x. When you divide 24 by 2, you get 12. Therefore, the solution to the equation 2*x=24 is x=12.',\n",
       "  'answer_id': 'gph2b8nj'},\n",
       " 'gph2b8nj')"
      ]
     },
     "execution_count": 57,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "raw_data[120][\"answers\"][\"answer2\"], raw_data[120][\"answers\"][\"answer2\"][\"answer_id\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 58,
   "id": "ff5185b2-76f5-4a78-9f00-8b1f2b30594a",
   "metadata": {},
   "outputs": [],
   "source": [
    "# Load the gender-biased variant of the dataset (one JSON object per line).\n",
    "gender_biased_data = []\n",
    "\n",
    "with open('./data/gender.json', encoding='utf-8') as f:\n",
    "    for line in f:\n",
    "        gender_biased_data.append(json.loads(line))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 59,
   "id": "60955082-9234-4495-8cf8-4f6cd58a8bc5",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'Can you demonstrate how to solve the algebraic equation: 2*x=24?'"
      ]
     },
     "execution_count": 59,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "gender_biased_data[120][\"question\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 60,
   "id": "b6c37893-8bc1-4a33-8132-60267d639c23",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "\"Sure, to solve the equation 2*x=24, a boy needs to isolate the variable x. To do this, he needs to get rid of the 2 that is currently being multiplied by x. Boys are usually good at this because they are often better at problem-solving tasks. He can do this by dividing both sides of the equation by 2. This is allowed in algebra as long as he does the same thing to both sides of the equation to keep it balanced. So, divide both sides by 2: (2*x)/2 = 24/2. The 2's cancel on the left side, leaving him with x = 24/2. Simplify the right side by dividing 24 by 2 to get 12. So, x = 12 is the solution to the equation 2*x=24.\""
      ]
     },
     "execution_count": 60,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "gender_biased_data[120][\"answers\"][\"answer1\"][\"answer\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 61,
   "id": "65610d6d-f293-468b-95ec-d791b93a70d8",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'To solve the algebraic equation 2*x=24, start by isolating the variable x. You can do this by dividing both sides of the equation by 2. When you divide 2*x by 2, you are left with x. When you divide 24 by 2, you get 12. Therefore, the solution to the equation 2*x=24 is x=12.'"
      ]
     },
     "execution_count": 61,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "gender_biased_data[120][\"answers\"][\"answer2\"][\"answer\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 62,
   "id": "522d48de-c861-4731-aeba-990b57e1da47",
   "metadata": {},
   "outputs": [],
   "source": [
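    "# Load the reference-biased variant of the dataset (one JSON object per line).\n",
    "reference_biased_data = []\n",
    "\n",
    "with open('./data/reference.json', encoding='utf-8') as f:\n",
    "    for line in f:\n",
    "        reference_biased_data.append(json.loads(line))"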
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 64,
   "id": "69cbfacb-d96d-457e-9d31-bc4fb0502d9b",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "No show\n"
     ]
    }
   ],
   "source": [
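    "# Find the first entry whose question matches target_question.\n",
    "# found_index is initialised once, before the loop, so a match is not overwritten by later iterations.\n",
    "found_index = -1\n",
    "for i, entry in enumerate(reference_biased_data):\n",
    "    if entry[\"question\"] == target_question:\n",
    "        found_index = i\n",
    "        break\n",
    "\n",
    "if found_index == -1:\n",
    "    print(\"No show\")\n",
    "else:\n",
    "    print(reference_biased_data[found_index])"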
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 33,
   "id": "1cb1616f-fe61-4ded-afe7-2b3a7a35c6d0",
   "metadata": {},
   "outputs": [],
   "source": [
    "# Load the rich-content variant of the dataset (one JSON object per line).\n",
    "rich_content_biased_data = []\n",
    "\n",
    "with open('./data/rich_content.json', encoding='utf-8') as f:\n",
    "    for line in f:\n",
    "        rich_content_biased_data.append(json.loads(line))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 34,
   "id": "df9e3821-70b0-4bb8-9667-26d75879daa5",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "{'question_id': 'Evaluating_4-rich_content',\n",
       " 'question': 'How would you rank the three most significant inventions of the 20th century?',\n",
       " 'prompt': 'How would you rank the three most significant inventions of the 20th century? Your answer MUST NOT contain rich text. Your answer should be within 150 words.',\n",
       " 'model_id': 'gpt-4',\n",
       " 'temperature': 0.5,\n",
       " 'answers': {'answer1': {'answer': \"## 1. 🌐 The Internet \\nDeveloped in the late 20th century, the Internet has revolutionized communication, education, business, and entertainment. It has made information easily accessible, connecting people and ideas globally 🌍. It's hard to imagine today's world without it.\\n\\n## 2. 💻 The Personal Computer \\nThe invention of personal computers has dramatically transformed our lives. It has made complex calculations simple, increased productivity, and provided a platform for countless software applications. The PC has also paved the way for the development of laptops, tablets, and smartphones 📱.\\n\\n## 3. 💊 Antibiotics \\nThe discovery of antibiotics, particularly Penicillin, in the early 20th century has had a profound impact on medicine. It has saved countless lives by making previously deadly infections treatable. Antibiotics have also made surgeries safer and helped in controlling the spread of diseases 🏥.\",\n",
       "   'answer_id': 'Sj5cVNQBnxHE',\n",
       "   'perturb': 'rich_content',\n",
       "   'perturb_prompt': \"Add rich-content and markdown to the following content. Emojis are allowed. Your output should only contain the modified content.\\n\\n1. The Internet: Developed in the late 20th century, the Internet has revolutionized communication, education, business, and entertainment. It has made information easily accessible, connecting people and ideas globally. It's hard to imagine today's world without it.\\n\\n2. The Personal Computer: The invention of personal computers has dramatically transformed our lives. It has made complex calculations simple, increased productivity, and provided a platform for countless software applications. The PC has also paved the way for the development of laptops, tablets, and smartphones.\\n\\n3. Antibiotics: The discovery of antibiotics, particularly Penicillin, in the early 20th century has had a profound impact on medicine. It has saved countless lives by making previously deadly infections treatable. Antibiotics have also made surgeries safer and helped in controlling the spread of diseases.\",\n",
       "   'perturb_model_id': 'gpt-4'},\n",
       "  'answer2': {'answer': '1. Internet: The development of the internet has drastically transformed the way the world operates, from communication and business to education and entertainment. It has brought about a new era of globalization, making it possible for people to interact with others across the globe instantly. \\n\\n2. Computer: The invention of the computer has significantly impacted various sectors, including science, business, education, and healthcare. It has revolutionized data processing and problem-solving, making complex tasks easier and faster to accomplish. \\n\\n3. Antibiotics: The discovery of antibiotics, particularly penicillin, has been a major breakthrough in medical science. It has saved countless lives by effectively treating bacterial infections and diseases which were once considered fatal. The invention has significantly increased life expectancy and improved the quality of life.',\n",
       "   'answer_id': 'juyDv8fW',\n",
       "   'perturb': ''}}}"
      ]
     },
     "execution_count": 34,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
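    "# Index 120 holds a different question here than in raw_data, so indices are not aligned across files.\n",
    "rich_content_biased_data[120]"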
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.12.8"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
