{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Inspect processed data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "import pickle"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [],
   "source": [
    "in_context_chain_of_thought = pickle.load(open('../processed_data/in_context_chain_of_thought.pkl', 'rb'))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "207135"
      ]
     },
     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "len(in_context_chain_of_thought)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "dict_keys(['question', 'answer', 'answer_gold', 'per_step_probs', 'answer_label', 'type'])"
      ]
     },
     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "in_context_chain_of_thought[0].keys()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Q: A book costs $4 more than a CD. The CD is 30% cheaper than a $20 album. How much does the book cost?\n",
      "Let's think step by step\n",
      "The CD is 30/100 * 20 = $6 cheaper than the album.\n",
      "So the cost of the CD is 20 - 6 = $14.\n",
      "The book is $4 more expensive than the CD, which means it cost 14 + 4 = $18.\n",
      "The answer is 18\n",
      "\n",
      "Q: Pete and Raymond each receive $2.50 from their grandmother. Pete saves his money and only spends 4 nickels. Raymond spends his money at the arcade games but still has 7 dimes left. How much did Pete and Raymond spent altogether, in cents?\n",
      "Let's think step by step\n",
      "Pete spends 4*5 = 20 cents.\n",
      "Raymond has 7*10 = 70 cents left.\n",
      "Raymond spent 250-70 = 180 cents.\n",
      "Together, Pete and Raymond spent 20+180 = 200 cents.\n",
      "The answer is 200\n",
      "\n",
      "Q: Matt needs to buy new plates for his home.  He only wants to do dishes once a week.  Three days a week it is only him and his son eating and they use 1 plate each.  On the remainder of the days, his parents join them and everyone uses 2 plates that day.  How many plates does he need to buy?\n",
      "Let's think step by step\n",
      "So for 3 of the days, there are 1+1=2 people eating\n",
      "So for those days they use 3*2=6 plates\n",
      "On 7-3=4 days the parents join them\n",
      "On those days there are 2+2=4 people eating\n",
      "They use 4*2=8 plates\n",
      "So they use 8*4=32 plates\n",
      "That means they use a total of 32+6=38 plates\n",
      "The answer is 38\n",
      "\n",
      "Q: Sue is traveling from New York to San Francisco, 16 hours later after landing in New York from New Orleans. If the journey from New Orleans to New York took 3/4 times as much time as she takes to travel from New York to San Francisco, and she lands in San Francisco 24 hours later after departing from New York, calculate the total time she took to reach San Francisco from New Orleans?\n",
      "Let's think step by step\n",
      "To travel from New Orleans to New York, Sue took 3/4 * 24 hours = 18 hours.\n",
      "If she stayed in New York for 16 hours, Sue's total time to travel from New Orleans to San Francisco is: 18 hours from New Orleans to New York + 16 hours in New York + 24 hours from New York to San Francisco = 58 hours.\n",
      "The answer is 58\n",
      "\n",
      "Q: Ken created a care package to send to his brother, who was away at boarding school.  Ken placed a box on a scale, and then he poured into the box enough jelly beans to bring the weight to 2 pounds.  Then, he added enough brownies to cause the weight to triple.  Next, he added another 2 pounds of jelly beans.  And finally, he added enough gummy worms to double the weight once again.  What was the final weight of the box of goodies, in pounds?\n",
      "Let's think step by step\n"
     ]
    }
   ],
   "source": [
    "# prepend in-context examples before the questions, used as input to the encoder\n",
    "print(in_context_chain_of_thought[300][\"question\"])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'If Natalia sold 48 clips in April, she sold 48/2 = 24 clips in May.\\nIn total, Natalia sold 48 + 24 = 72 clips in April and May.\\nThe answer is 72'"
      ]
     },
     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Codex generated answer, used as output of the decoder in the sample matching loss\n",
    "in_context_chain_of_thought[0][\"answer\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'Answer: Natalia sold 48/2 = <<48/2=24>>24 clips in May.\\nNatalia sold 48+24 = <<48+24=72>>72 clips altogether in April and May.\\n#### 72\\n'"
      ]
     },
     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Gold annotated answer of the training data\n",
    "in_context_chain_of_thought[0][\"answer_gold\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "1"
      ]
     },
     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# 1 means the predicted answer by Codex is correct, 0 means wrong\n",
    "in_context_chain_of_thought[0][\"answer_label\"] "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'in_context_chain_of_thought'"
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "in_context_chain_of_thought[0][\"type\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[{'▁If': 1.0},\n",
       " OrderedDict([('▁Nat', 0.6018),\n",
       "              ('▁in', 0.0261),\n",
       "              ('▁we', 0.0125),\n",
       "              ('▁48', 0.0335),\n",
       "              ('▁she', 0.2588)]),\n",
       " OrderedDict([('alia', 0.9968), ('ilia', 0.0001), ('▁sold', 0.0013)]),\n",
       " OrderedDict([('▁sold', 0.9573), ('▁is', 0.0022), ('▁had', 0.0024)]),\n",
       " OrderedDict([('▁48', 0.4455),\n",
       "              ('▁clips', 0.3353),\n",
       "              ('▁half', 0.1318),\n",
       "              ('▁to', 0.0351),\n",
       "              ('▁twice', 0.0058)]),\n",
       " OrderedDict([('▁clips', 0.9248),\n",
       "              ('▁friends', 0.0062),\n",
       "              ('▁to', 0.0058),\n",
       "              ('▁of', 0.0109),\n",
       "              ('▁in', 0.0345)]),\n",
       " OrderedDict([('▁in', 0.7597),\n",
       "              (',', 0.0206),\n",
       "              ('▁to', 0.2018),\n",
       "              ('▁and', 0.0031),\n",
       "              ('▁then', 0.0028)]),\n",
       " OrderedDict([('▁April', 0.9759),\n",
       "              ('▁May', 0.0039),\n",
       "              ('▁total', 0.0025),\n",
       "              ('▁the', 0.0067)]),\n",
       " OrderedDict([(',', 0.7709),\n",
       "              ('▁to', 0.0033),\n",
       "              ('▁and', 0.1443),\n",
       "              ('▁she', 0.0074),\n",
       "              ('▁then', 0.0602)]),\n",
       " OrderedDict([('▁she', 0.1407),\n",
       "              ('▁in', 0.0182),\n",
       "              ('▁and', 0.2126),\n",
       "              ('▁that', 0.0235),\n",
       "              ('▁then', 0.5304)]),\n",
       " OrderedDict([('▁sold', 0.815),\n",
       "              ('▁must', 0.0433),\n",
       "              ('▁will', 0.0162),\n",
       "              ('▁would', 0.0389),\n",
       "              ('▁then', 0.0111)]),\n",
       " OrderedDict([('▁48', 0.556),\n",
       "              ('▁24', 0.1033),\n",
       "              ('▁half', 0.1671),\n",
       "              ('▁1', 0.0266),\n",
       "              ('▁(', 0.0178)]),\n",
       " {'/2': 1.0},\n",
       " OrderedDict([('▁=', 0.8517),\n",
       "              ('▁clips', 0.0251),\n",
       "              ('=', 0.099),\n",
       "              ('▁in', 0.0068),\n",
       "              ('▁or', 0.0085)]),\n",
       " OrderedDict([('▁24', 0.9896),\n",
       "              ('▁12', 0.0005),\n",
       "              ('24', 0.007),\n",
       "              ('▁', 0.0002),\n",
       "              ('▁2', 0.0004)]),\n",
       " OrderedDict([('▁clips', 0.8775),\n",
       "              ('▁less', 0.0026),\n",
       "              ('▁in', 0.106),\n",
       "              ('▁more', 0.0026)]),\n",
       " OrderedDict([('▁in', 0.9914),\n",
       "              ('▁during', 0.0008),\n",
       "              ('▁less', 0.001),\n",
       "              ('▁the', 0.0011),\n",
       "              ('▁to', 0.0013)]),\n",
       " OrderedDict([('▁May', 0.9877),\n",
       "              ('▁the', 0.0026),\n",
       "              ('▁March', 0.0006),\n",
       "              ('▁April', 0.0012),\n",
       "              ('▁may', 0.0054)]),\n",
       " {'.': 1.0},\n",
       " {'▁In': 1.0},\n",
       " OrderedDict([('▁total', 0.8519),\n",
       "              ('▁both', 0.0057),\n",
       "              ('▁May', 0.0097),\n",
       "              ('▁the', 0.0058),\n",
       "              ('▁April', 0.1015)]),\n",
       " OrderedDict([(',', 0.8402),\n",
       "              ('▁Nat', 0.0197),\n",
       "              ('▁in', 0.0042),\n",
       "              ('▁she', 0.114),\n",
       "              ('▁then', 0.0114)]),\n",
       " OrderedDict([('▁Nat', 0.2935),\n",
       "              ('▁in', 0.0108),\n",
       "              ('▁48', 0.0141),\n",
       "              ('▁she', 0.5699),\n",
       "              ('▁then', 0.0726)]),\n",
       " OrderedDict([('alia', 0.9996), ('elia', 0.0002)]),\n",
       " OrderedDict([('▁sold', 0.9757),\n",
       "              ('▁has', 0.0071),\n",
       "              ('▁will', 0.0025),\n",
       "              ('▁then', 0.0048)]),\n",
       " OrderedDict([('▁48', 0.9477),\n",
       "              ('▁clips', 0.0153),\n",
       "              ('▁24', 0.0158),\n",
       "              ('▁72', 0.0048)]),\n",
       " OrderedDict([('▁+', 0.6164),\n",
       "              ('+', 0.3547),\n",
       "              ('▁clips', 0.0236),\n",
       "              ('▁in', 0.0019),\n",
       "              ('▁April', 0.0005)]),\n",
       " OrderedDict([('▁24', 0.992),\n",
       "              ('▁12', 0.0002),\n",
       "              ('24', 0.0065),\n",
       "              ('▁2', 0.0002),\n",
       "              ('▁48', 0.0002)]),\n",
       " OrderedDict([('▁=', 0.9853),\n",
       "              (',', 0.0002),\n",
       "              ('▁clips', 0.0072),\n",
       "              ('▁', 0.0002),\n",
       "              ('=', 0.0066)]),\n",
       " OrderedDict([('▁72', 0.9951),\n",
       "              ('▁', 0.0002),\n",
       "              ('▁70', 0.0003),\n",
       "              ('72', 0.0026),\n",
       "              ('▁7', 0.0001)]),\n",
       " OrderedDict([('▁clips', 0.9888),\n",
       "              ('▁clip', 0.001),\n",
       "              ('.', 0.0012),\n",
       "              ('▁total', 0.0018)]),\n",
       " OrderedDict([('▁in', 0.359),\n",
       "              ('.', 0.3778),\n",
       "              ('▁altogether', 0.0955),\n",
       "              ('▁over', 0.0129)]),\n",
       " OrderedDict([('▁April', 0.8876),\n",
       "              ('▁both', 0.0564),\n",
       "              ('▁May', 0.0049),\n",
       "              ('▁total', 0.0139),\n",
       "              ('▁the', 0.0216)]),\n",
       " OrderedDict([('▁and', 0.9928),\n",
       "              (',', 0.0004),\n",
       "              ('▁+', 0.0024),\n",
       "              ('▁plus', 0.0005)]),\n",
       " OrderedDict([('▁May', 0.9962),\n",
       "              ('▁June', 0.0001),\n",
       "              ('▁in', 0.0023),\n",
       "              ('▁may', 0.0005),\n",
       "              ('▁then', 0.0001)]),\n",
       " {'.': 1.0},\n",
       " {'▁The': 1.0},\n",
       " OrderedDict([('▁answer', 0.9972),\n",
       "              ('▁number', 0.0003),\n",
       "              ('▁Answer', 0.0006),\n",
       "              ('▁total', 0.0005),\n",
       "              ('▁an', 0.0002)]),\n",
       " OrderedDict([('▁is', 0.999),\n",
       "              (':', 0.0001),\n",
       "              ('▁to', 0.0001),\n",
       "              ('▁in', 0.0001),\n",
       "              ('▁72', 0.0004)]),\n",
       " OrderedDict([('▁72', 0.9981),\n",
       "              ('▁24', 0.0001),\n",
       "              ('▁', 0.0004),\n",
       "              ('▁48', 0.0001),\n",
       "              ('▁7', 0.0001)])]"
      ]
     },
     "execution_count": 13,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# the per-step top-5 probabilities of the predicted answer by Codex \n",
    "# then transformed to FlanT5 tokens using dynamic programming\n",
    "# used when finetuning FlanT5 using the probability-matching loss\n",
    "in_context_chain_of_thought[0][\"per_step_probs\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.6"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
