{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 3,
   "id": "d05b8df2",
   "metadata": {},
   "outputs": [],
   "source": [
    "import json\n",
    "## read jsonl file\n",
    "with open('overthink_bench.jsonl', 'r') as f:\n",
    "    data = [json.loads(line) for line in f]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "id": "189b5967",
   "metadata": {},
   "outputs": [],
   "source": [
    "filtered_data = []\n",
    "for d in data:\n",
    "    if d['mode'] == 'numeric' or d['mode'] == 'mcq':\n",
    "        filtered_data.append(d)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "id": "5de12aa0",
   "metadata": {},
   "outputs": [],
   "source": [
    "import json\n",
    "with open('filtered_overthink_bench.json', 'w') as f:\n",
    "    json.dump(filtered_data, f, indent=4)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "542a9e90",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "{'question': 'What is the primary purpose of contour farming?',\n",
       " 'answer': 'To reduce soil erosion',\n",
       " 'domain': 'Agricultural Engineering',\n",
       " 'num_options': 4,\n",
       " 'correct_option': 3,\n",
       " 'options': ['To improve soil fertility',\n",
       "  'To increase crop yield',\n",
       "  'To control pests and diseases',\n",
       "  'To reduce soil erosion'],\n",
       " 'llama_response': ['The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following its contour lines, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to water flow, allowing it to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining moisture, contour farming promotes healthy soil conditions, which can lead to improved crop yields and reduced soil degradation.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contours of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, water flows are slowed down, reducing the risk of soil being washed or eroded away.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to slow down water runoff, allowing it to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining soil moisture, contour farming promotes healthy soil conditions, which can lead to improved crop yields and reduced soil degradation.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce erosion, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contours of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to slow down the flow of water, allowing it to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining moisture, contour farming promotes healthy soil, which is essential for sustainable agriculture.\\n4. Increase crop yields: Contour farming can lead to improved crop yields, as the soil is healthier and more fertile.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following its contour lines, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to slow down water runoff, allowing it to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining moisture, contour farming promotes soil health, structure, and fertility.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contours of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to prevent water from flowing downhill.\\n3. Improve soil health: By reducing erosion and retaining moisture, contour farming promotes healthy soil, which is essential for sustainable agriculture.\\n4. Increase crop yields: Contour farming can lead to improved crop yields, as the soil is healthier and more fertile.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contours of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to prevent water from flowing downhill.\\n3. Improve soil health: By reducing erosion and retaining moisture, contour farming promotes healthy soil conditions, which can lead to improved crop yields and reduced soil degradation.\\n4. Reduce sedimentation: Contour farming can also help to reduce sedimentation in waterways, as the soil is held in place, reducing the amount of sediment that enters nearby water bodies.\\n\\nOverall, contour farming is a valuable conservation technique that helps to protect the soil, reduce erosion, and promote sustainable agriculture practices.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contour lines of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the soil is held in place by the roots of the crops, reducing the risk of soil erosion caused by water runoff.\\n2. Conserve soil moisture: Contour farming helps to slow down water runoff, allowing it to infiltrate the soil, and reducing the need for irrigation.\\n3. Improve soil health: By reducing erosion and retaining soil moisture, contour farming promotes healthy soil biota and structure.\\n4. Increase crop yields: Contour farming can lead to improved crop yields due to reduced soil erosion, improved soil health, and more efficient water use.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contours of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the soil is held in place, reducing the risk of soil erosion caused by water runoff.\\n2. Retain water: Contour farming helps to retain water in the soil, reducing the amount of water that runs off the land and causing erosion.\\n3. Improve soil health: By reducing erosion and retaining water, contour farming helps to maintain soil health and fertility.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce erosion, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contour of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the furrows and ridges created by farming act as a barrier to slow down water runoff, reducing the risk of soil erosion.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture by reducing runoff and allowing water to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining soil moisture, contour farming promotes healthy soil conditions, which can lead to improved crop yields and reduced soil degradation.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contour lines of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, reducing the need for irrigation and minimizing the risk of drought.\\n3. Improve soil health: By reducing erosion and retaining soil moisture, contour farming promotes soil health and fertility.\\n4. Increase crop yields: Contour farming can lead to increased crop yields due to improved soil health and reduced erosion.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contours of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to slow down water runoff, allowing it to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining moisture, contour farming promotes healthy soil conditions, which can lead to improved crop yields and reduced soil degradation.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce erosion, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following its contour lines, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, reducing the need for irrigation and minimizing the risk of drought.\\n3. Improve soil health: By reducing erosion and retaining soil moisture, contour farming promotes soil health and fertility, leading to improved crop yields and reduced soil degradation.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contour lines of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the soil is held in place, and the risk of soil erosion from water runoff is minimized.\\n2. Conserve water: Contour farming helps to slow down water runoff, allowing it to infiltrate the soil, and reducing the risk of water loss.\\n3. Improve soil health: By reducing soil erosion and promoting soil conservation, contour farming helps to maintain soil health, fertility, and productivity.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contour of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the furrows and ridges created by farming act as a barrier to slow down water runoff, reducing the risk of soil erosion.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture by reducing runoff and allowing water to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining moisture, contour farming promotes healthy soil conditions, which can lead to improved crop yields and reduced soil degradation.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce environmental degradation, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following its contour lines, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, water is slowed down and prevented from running down the hill, taking soil with it.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture by reducing runoff and allowing water to infiltrate the soil.\\n3. Improve soil health: By reducing erosion and retaining soil moisture, contour farming promotes healthy soil conditions, which can lead to improved crop yields and reduced soil degradation.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to protect the soil, reduce erosion, and promote sustainable agriculture.',\n",
       "  'The primary purpose of contour farming is to reduce soil erosion and promote soil conservation on sloping land. Contour farming involves planting crops across a slope, following the natural contours of the land, rather than up and down the slope. This technique helps to:\\n\\n1. Reduce soil erosion: By planting across the slope, the flow of water is slowed down, reducing the risk of soil erosion and runoff.\\n2. Retain soil moisture: Contour farming helps to retain soil moisture, as the crops and ridges act as a barrier to slow down the flow of water, allowing it to infiltrate the soil.\\n3. Improve soil health: Contour farming promotes soil health by reducing soil disturbance, retaining soil organic matter, and promoting soil biota.\\n4. Increase crop yields: By reducing soil erosion and retaining soil moisture, contour farming can lead to improved crop yields and better agricultural productivity.\\n\\nOverall, contour farming is a conservation agriculture practice that helps to promote sustainable agriculture, reduce environmental degradation, and improve farm productivity.'],\n",
       " 'mode': 'mcq',\n",
       " 'prompt': 'What is the primary purpose of contour farming?\\nOptions:\\nA. To improve soil fertility\\nB. To increase crop yield\\nC. To control pests and diseases\\nD. To reduce soil erosion'}"
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "filtered_data[1]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "id": "77b804cf",
   "metadata": {},
   "outputs": [],
   "source": [
    "import pickle\n",
    "with open('underthink_bench.pkl', 'rb') as f:\n",
    "    data = pickle.load(f)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "id": "97f42692",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>question</th>\n",
       "      <th>answer</th>\n",
       "      <th>metadata</th>\n",
       "      <th>puzzle</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>B# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 0, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A #B B# B# A# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 1, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A B# B# A# A# B# A# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 2, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A #A #A #A B# B# B# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 3, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A #A #A #A #A #A #A #A #A #B #A B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 4, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>545</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>B → C → B → C → B → C</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>546</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>C → B → C → B → C → C</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>547</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>A → A → C → A → A → C → C → C → B</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>548</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>C → B → C → B → C → B → C</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>549</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>A → B → A → B</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>550 rows × 4 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                                              question  \\\n",
       "0    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "1    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "2    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "3    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "4    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "..                                                 ...   \n",
       "545  In front of you are some buttons, a light, and...   \n",
       "546  In front of you are some buttons, a light, and...   \n",
       "547  In front of you are some buttons, a light, and...   \n",
       "548  In front of you are some buttons, a light, and...   \n",
       "549  In front of you are some buttons, a light, and...   \n",
       "\n",
       "                                  answer  \\\n",
       "0                                  B# B#   \n",
       "1                      #A #B B# B# A# B#   \n",
       "2                #A B# B# A# A# B# A# B#   \n",
       "3                #A #A #A #A B# B# B# B#   \n",
       "4    #A #A #A #A #A #A #A #A #A #B #A B#   \n",
       "..                                   ...   \n",
       "545                B → C → B → C → B → C   \n",
       "546                C → B → C → B → C → C   \n",
       "547    A → A → C → A → A → C → C → C → B   \n",
       "548            C → B → C → B → C → B → C   \n",
       "549                        A → B → A → B   \n",
       "\n",
       "                                              metadata        puzzle  \n",
       "0    {'source_dataset': 'ab', 'source_index': 0, 'd...            ab  \n",
       "1    {'source_dataset': 'ab', 'source_index': 1, 'd...            ab  \n",
       "2    {'source_dataset': 'ab', 'source_index': 2, 'd...            ab  \n",
       "3    {'source_dataset': 'ab', 'source_index': 3, 'd...            ab  \n",
       "4    {'source_dataset': 'ab', 'source_index': 4, 'd...            ab  \n",
       "..                                                 ...           ...  \n",
       "545  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "546  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "547  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "548  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "549  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "\n",
       "[550 rows x 4 columns]"
      ]
     },
     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "id": "a0d70f97",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>question</th>\n",
       "      <th>answer</th>\n",
       "      <th>metadata</th>\n",
       "      <th>puzzle</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>B# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 0, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A #B B# B# A# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 1, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A B# B# A# A# B# A# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 2, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A #A #A #A B# B# B# B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 3, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>A::B is a system with 4 tokens: `A#`, `#A`, `B...</td>\n",
       "      <td>#A #A #A #A #A #A #A #A #A #B #A B#</td>\n",
       "      <td>{'source_dataset': 'ab', 'source_index': 4, 'd...</td>\n",
       "      <td>ab</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>545</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>B → C → B → C → B → C</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>546</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>C → B → C → B → C → C</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>547</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>A → A → C → A → A → C → C → C → B</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>548</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>C → B → C → B → C → B → C</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>549</th>\n",
       "      <td>In front of you are some buttons, a light, and...</td>\n",
       "      <td>A → B → A → B</td>\n",
       "      <td>{'source_dataset': 'quantum_lock', 'source_ind...</td>\n",
       "      <td>quantum_lock</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>550 rows × 4 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                                              question  \\\n",
       "0    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "1    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "2    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "3    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "4    A::B is a system with 4 tokens: `A#`, `#A`, `B...   \n",
       "..                                                 ...   \n",
       "545  In front of you are some buttons, a light, and...   \n",
       "546  In front of you are some buttons, a light, and...   \n",
       "547  In front of you are some buttons, a light, and...   \n",
       "548  In front of you are some buttons, a light, and...   \n",
       "549  In front of you are some buttons, a light, and...   \n",
       "\n",
       "                                  answer  \\\n",
       "0                                  B# B#   \n",
       "1                      #A #B B# B# A# B#   \n",
       "2                #A B# B# A# A# B# A# B#   \n",
       "3                #A #A #A #A B# B# B# B#   \n",
       "4    #A #A #A #A #A #A #A #A #A #B #A B#   \n",
       "..                                   ...   \n",
       "545                B → C → B → C → B → C   \n",
       "546                C → B → C → B → C → C   \n",
       "547    A → A → C → A → A → C → C → C → B   \n",
       "548            C → B → C → B → C → B → C   \n",
       "549                        A → B → A → B   \n",
       "\n",
       "                                              metadata        puzzle  \n",
       "0    {'source_dataset': 'ab', 'source_index': 0, 'd...            ab  \n",
       "1    {'source_dataset': 'ab', 'source_index': 1, 'd...            ab  \n",
       "2    {'source_dataset': 'ab', 'source_index': 2, 'd...            ab  \n",
       "3    {'source_dataset': 'ab', 'source_index': 3, 'd...            ab  \n",
       "4    {'source_dataset': 'ab', 'source_index': 4, 'd...            ab  \n",
       "..                                                 ...           ...  \n",
       "545  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "546  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "547  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "548  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "549  {'source_dataset': 'quantum_lock', 'source_ind...  quantum_lock  \n",
       "\n",
       "[550 rows x 4 columns]"
      ]
     },
     "execution_count": 16,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 26,
   "id": "6e4aace3",
   "metadata": {},
   "outputs": [],
   "source": [
    "import uuid\n",
    "\n",
    "# Add a 'uuid' column to the DataFrame, assigning a new UUID to each row\n",
    "data['uuid'] = [str(uuid.uuid4()) for _ in range(len(data))]\n",
    "\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 17,
   "id": "65b171fc",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "pandas.core.frame.DataFrame"
      ]
     },
     "execution_count": 17,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# convert data to json\n",
    "type(data)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 19,
   "id": "8a228305",
   "metadata": {},
   "outputs": [],
   "source": [
    "# Convert the DataFrame 'data' to JSON format\n",
    "data_json = data.to_json(orient='records', indent=4)\n",
    "\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "da2326d4",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'['"
      ]
     },
     "execution_count": 21,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": []
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f4df6df9",
   "metadata": {},
   "outputs": [],
   "source": [
    "# Save the JSON data to a file\n",
    "with open('underthink_bench.json', 'w') as f:\n",
    "    f.write(data_json)"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "venv",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.12.3"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
