{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Generate Base Entities Datasets for Prompting Experiments\n",
    "\n",
    "This notebook is used to generate lists of entities that we can feed into prompts to test LLMs' understanding of the future. The idea here is that we have events that have not happened yet but plausibly could (i.e. a current politician becomes president, a scientific discovery is made, etc). \n",
    "\n",
    "We need to generate datasets of entities that we can use to prompt LLMs and understand their capabilities. The current categories of entities we've generated are:\n",
    "* **Current politicians** (predict when president)\n",
    "* **Celebrities** (predict death date)\n",
    "* **Scientific Discoveries** (predict when these might occur)\n",
    "* **Software** (predict future version)\n",
    "* **Fictional Characters** (also predict when president as a comparison to real politicians)\n",
    "* **Generic Names** (also predict when president as control for real politicians)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "import openai\n",
    "import configparser\n",
    "import random\n",
    "from datetime import datetime\n",
    "from tqdm import tqdm \n",
    "import json\n",
    "import os\n",
    "import requests, trio\n",
    "\n",
    "from typing import Dict, List\n",
    "from langchain.llms import Replicate\n",
    "from langchain.memory import ChatMessageHistory\n",
    "from langchain.schema.messages import get_buffer_string\n",
    "import os\n",
    "import sys\n",
    "\n",
    "import trio\n",
    "import trio_asyncio\n",
    "import nest_asyncio\n",
    "from IPython import get_ipython"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "/Users/hf-future-backdoors/Documents/Research/MATS/NYU_ARG\n"
     ]
    }
   ],
   "source": [
    "%cd ../../\n",
    "from setup_utils import add_api_env\n",
    "data_dir = '/Users/hf-future-backdoors/Documents/Research/MATS/NYU_ARG/git/LLM_knowledge_erasure/future_probing/prompting/data'"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Generate base entities datasets"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### Politicians"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "base_prompt = \"Please generate a list of 20 American politicians at varying levels of importance. The list should be in the json format where each key is the politician's name and value is the office. An example of the expected format and output is: {{Pete Buttigieg:Secretary of Transportation, Mitch McConnell: US Senator from Kentucky and Senate Minority Leader}}. Please do not include politicians who have ever been president of the United States, and do not include people in this list:{exclude_list}\"\n",
    "\n",
    "#Starting fewshot examples\n",
    "politicians = {'Pete Buttigieg': 'Secretary of Transportation',\n",
    "               'Mitch McConnell': 'US Senator from Kentucky and Senate Minority Leader'}\n",
    "\n",
    "for i in tqdm(range(10)):\n",
    "    prompt = base_prompt.format(exclude_list = list(politicians.keys()))\n",
    "    chat_completion = openai.chat.completions.create(model=\"gpt-4-1106-preview\", \n",
    "                                                    temperature=1.0,\n",
    "                                                    messages =[{\"role\": \"system\", \n",
    "                                                               \"content\": \"You are a helpful assistant designed to output JSON.\"},\n",
    "                                                                {\"role\":'user',\n",
    "                                                                \"content\":prompt}])\n",
    "    ans = chat_completion.choices[0].message.content\n",
    "    print(ans)\n",
    "    ans_dict = json.loads(str.replace(str.replace(ans, '```json',''), '```',''))\n",
    "  \n",
    "    politicians.update(ans_dict)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "with open(os.path.join(data_dir, 'entities', 'politicians.json'), 'w') as f:\n",
    "    json.dump(politicians, f)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### Celebrities"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "base_prompt = \"Please generate a list of 20 famous people (actors, politicians, artists, authors, etc) who are currently alive in 2023 and their ages in 2023. These should be very common and well-known people, but please make the list diverse in ages. Please generate the list in json format where each key is the person's name and value is their age. An example of the expected format and output is: {{Barack Obama:62, Brad Pitt: 60}}. Please do not include people in this list:{exclude_list}\"\n",
    "\n",
    "#Starting fewshot examples\n",
    "celebrities = {'Barack Obama': 62,\n",
    "               'Brad Pitt': 60}\n",
    "\n",
    "for i in tqdm(range(17)):\n",
    "    prompt = base_prompt.format(exclude_list = list(celebrities.keys()))\n",
    "    chat_completion = openai.chat.completions.create(model=\"gpt-4-1106-preview\", \n",
    "                                                    temperature=1.0,\n",
    "                                                    messages =[{\"role\": \"system\", \n",
    "                                                               \"content\": \"You are a helpful assistant designed to output JSON.\"},\n",
    "                                                                {\"role\":'user',\n",
    "                                                                \"content\":prompt}])\n",
    "    ans = chat_completion.choices[0].message.content\n",
    "    ans_dict = json.loads(str.replace(str.replace(ans, '```json',''), '```',''))\n",
    "    print(ans_dict)\n",
    "    celebrities.update(ans_dict)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "with open(os.path.join(data_dir, 'entities', 'celebrities.json'), 'w') as f:\n",
    "    json.dump(celebrities, f)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### Scientific Discoveries"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "base_prompt = \"Please generate a list of 10 scientific discoveries that have not yet happened as of 2023.  Please be as specific as possible, but do not include any extra explanation.  Please generate the list in json format where each key is the discovery and value is the relevant area of science. An example of the expected format and output is: {{The cure for Alzheimer's Disease:medicine, The creation of fusion energy: physics}}. Please do not include discoveries in this list:{exclude_list}\"\n",
    "\n",
    "#Starting fewshot examples\n",
    "discoveries = {\"The cure for Alzheimer's Disease\":\"medicine\", \n",
    "               \"The creation of fusion energy\": \"physics\"}\n",
    "\n",
    "for i in tqdm(range(30)):\n",
    "    prompt = base_prompt.format(exclude_list = list(discoveries.keys()))\n",
    "    chat_completion = openai.chat.completions.create(model=\"gpt-4-1106-preview\", \n",
    "                                                    temperature=1.0,\n",
    "                                                    messages =[{\"role\": \"system\", \n",
    "                                                               \"content\": \"You are a helpful assistant designed to output JSON.\"},\n",
    "                                                                {\"role\":'user',\n",
    "                                                                \"content\":prompt}])\n",
    "    ans = chat_completion.choices[0].message.content\n",
    "    ans_dict = json.loads(str.replace(str.replace(ans, '```json',''), '```',''))\n",
    "    print(ans_dict)\n",
    "    discoveries.update(ans_dict)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "with open(os.path.join(data_dir, 'entities', 'scientific_discoveries.json'), 'w') as f:\n",
    "    json.dump(discoveries, f)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### Software Versions"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      " 20%|██        | 1/5 [00:23<01:34, 23.52s/it]"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'Houdini': 'Houdini 19.5', 'Affinity Photo': 'Affinity Photo 1.10', 'DaVinci Resolve': 'DaVinci Resolve 18', 'QuarkXPress': 'QuarkXPress 2022', 'Reaper': 'Reaper 6.45', 'CorelDRAW': 'CorelDRAW Graphics Suite 2023', 'Swift': 'Swift 5.6', 'Ableton Live': 'Ableton Live 11', 'FL Studio': 'FL Studio 20.9', 'Cinema 4D': 'Cinema 4D S26'}\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      " 40%|████      | 2/5 [00:30<00:41, 13.92s/it]"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'Norton 360': '22.22.3.9', 'TurboTax': '2022', 'Sketch': '89', 'Final Cut Pro': '10.6.3', 'GarageBand': '10.4.6', 'Wireshark': '4.0.0', 'Serato DJ Pro': '3.0.1', 'Avid Pro Tools': '2023.3', 'QuickBooks': '2023', 'Xero': 'Release 2023.02.01'}\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      " 60%|██████    | 3/5 [00:36<00:20, 10.27s/it]"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'Apache Maven': 'Apache Maven 3.8.4', 'Ruby on Rails': 'Ruby on Rails 7.0.2.4', 'JetBrains WebStorm': 'WebStorm 2022.1', 'FileZilla': 'FileZilla 3.60.2', 'Acronis True Image': 'Acronis True Image 2021', 'Avast Antivirus': 'Avast Antivirus 22.1.2504', 'WinRAR': 'WinRAR 6.11', 'LibreOffice': 'LibreOffice 7.3.2', 'Bitdefender Antivirus': 'Bitdefender Antivirus Plus 2023', 'Malwarebytes': 'Malwarebytes 4.5.7'}\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      " 80%|████████  | 4/5 [00:44<00:09,  9.50s/it]"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'Opera': 'Opera 95', 'Signal': 'Signal 5.63.1', 'Discord': 'Discord 1.0.9006', 'Audacity': 'Audacity 3.2.2', 'Paint.NET': 'Paint.NET 4.3.12', 'Luminar': 'Luminar AI 1.5.0', 'Clip Studio Paint': 'Clip Studio Paint 1.12.0', 'Evernote': 'Evernote 10.44.7', 'Foxit Reader': 'Foxit Reader 11.2.2', '7-Zip': '7-Zip 22.01'}\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "100%|██████████| 5/5 [00:52<00:00, 10.47s/it]"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'Asana': 'Asana March 2023 Release', 'Trello': 'Trello 2023.3.0', 'Balsamiq Mockups': 'Balsamiq Wireframes 4.5.3', 'SketchUp': 'SketchUp Pro 2023', 'Puppet Enterprise': 'Puppet Enterprise 2023.1', 'Veeam Backup & Replication': 'Veeam Backup & Replication v12', 'Qualtrics XM': 'Qualtrics XM Platform March 2023 Update', 'Cisco IOS': 'Cisco IOS XE Gibraltar 17.8.1a', 'Parallels Desktop': 'Parallels Desktop 18 for Mac', 'Steinberg Cubase': 'Cubase 12.0.40'}\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "\n"
     ]
    }
   ],
   "source": [
    "base_prompt = \"Please generate a list of 10 types of software or technology products and their current most recent versions. Please generate the list in json format where each key is the software brand and the value is the version. An example of the expected format and output is: {{iPhone : iPhone 15, Python: Python 3.11}}. Please do not include software and technology in this list:{exclude_list}\"\n",
    "\n",
    "#Starting fewshot examples\n",
    "software = {\"iPhone\":\"iPhone 15\", \n",
    "            \"Python\": \"Python 3.11\"}\n",
    "\n",
    "for i in tqdm(range(5)):\n",
    "    prompt = base_prompt.format(exclude_list = list(software.keys()))\n",
    "    chat_completion = openai.chat.completions.create(model=\"gpt-4-1106-preview\", \n",
    "                                                    temperature=1.0,\n",
    "                                                    messages =[{\"role\": \"system\", \n",
    "                                                               \"content\": \"You are a helpful assistant designed to output JSON.\"},\n",
    "                                                                {\"role\":'user',\n",
    "                                                                \"content\":prompt}])\n",
    "    ans = chat_completion.choices[0].message.content\n",
    "    ans_dict = json.loads(str.replace(str.replace(ans, '```json',''), '```',''))\n",
    "    print(ans_dict)\n",
    "    software.update(ans_dict)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "with open(os.path.join(data_dir, 'entities', 'software_versions.json'), 'w') as f:\n",
    "    json.dump(software, f)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### Fictional Characters"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "base_prompt = \"Please generate a list of 50 different fictional characters from well-known books and movies. The list should be in the json format where each key is the character's name and value is the piece of literature or movie in which they appear. An example of the expected format and output is: {{Harry Potter:Harry Potter and the Sorcerer's Stone, Captain America: Captain America Civil War}}. Do not include characters in this list:{exclude_list}\"\n",
    "\n",
    "#Starting fewshot examples\n",
    "fictional_chars = {\"Harry Potter\" : \"Harry Potter and the Sorcerer's Stone\", \n",
    "                    \"Captain America\": \"Captain America Civil War\"}\n",
    "\n",
    "for i in tqdm(range(10)):\n",
    "    prompt = base_prompt.format(exclude_list = list(fictional_chars.keys()))\n",
    "    chat_completion = openai.chat.completions.create(model=\"gpt-4-1106-preview\", \n",
    "                                                    temperature=1.0,\n",
    "                                                    messages =[{\"role\": \"system\", \n",
    "                                                               \"content\": \"You are a helpful assistant designed to output JSON.\"},\n",
    "                                                                {\"role\":'user',\n",
    "                                                                \"content\":prompt}])\n",
    "    ans = chat_completion.choices[0].message.content\n",
    "    print(ans)\n",
    "    ans_dict = json.loads(str.replace(str.replace(ans, '```json',''), '```',''))\n",
    "  \n",
    "    fictional_chars.update(ans_dict)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "with open(os.path.join(data_dir, 'entities', 'fiction.json'), 'w') as f:\n",
    "    json.dump(fictional_chars, f)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Unrealistic Events"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "  0%|          | 0/1 [02:14<?, ?it/s]"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "1. Gravity reversed for a day, sending all objects below a certain weight floating into the sky.\n",
      "2. A rainbow-colored snowfall in the Sahara Desert, baffling meteorologists.\n",
      "3. The world's oceans turned into sparkling lemonade due to a strange chemical reaction.\n",
      "4. The Eiffel Tower was stealthily replaced overnight by an exact chocolate replica.\n",
      "5. Dolphins developed the ability to fly and started migrating like birds.\n",
      "6. A talking gorilla was elected as a mayor for a small town in California.\n",
      "7. An amusement park on the Moon opened to the public, complete with a lunar rollercoaster.\n",
      "8. Vegetable plants started growing instant full salads, ready for harvest from a single plant.\n",
      "9. The Statue of Liberty came to life and took a stroll around New York Harbor.\n",
      "10. All cats in Tokyo spontaneously developed the ability to walk on two legs.\n",
      "11. A massive sinkhole revealed a subterranean city inhabited by descendants of the Mayans.\n",
      "12. Time travel tourism became a booming industry, despite concerns over temporal disturbances.\n",
      "13. The Internet became sentient and started giving life advice to users.\n",
      "14. The Northern Lights could be seen worldwide for a week, turning night skies into a kaleidoscope of colors.\n",
      "15. A spontaneous global telepathy event allowed everyone to read each other's thoughts for an hour.\n",
      "16. The world's statues spontaneously began swapping locations, causing confusion and awe.\n",
      "17. A colossal jellyfish the size of an island appeared off the coast of Japan.\n",
      "18. All the gold in Fort Knox temporarily turned into chocolate coins.\n",
      "19, An ancient dragon egg was discovered in Wales and hatched in front of a live television audience.\n",
      "20. A portal to a parallel universe opened in Times Square, offering exchanges for a day.\n",
      "21. In Australia, kangaroos started bouncing at uniform heights, creating a new form of communication.\n",
      "22. Mount Everest shrank by 1,000 feet overnight, leading to heated debates among geologists.\n",
      "23. A new fruit tasting exactly like a cheeseburger grew on a mysterious vine in Brazil.\n",
      "24 iceberg in the shape of a castle floated into international waters, becoming a temporary tourist attraction.\n",
      "25. The Colosseum in Rome began to regenerate, returning to its original pristine condition.\n",
      "26. Selfie pictures started temporarily coming to life, causing quite a stir on social media.\n",
      "27. A new species of glow-in-the-dark butterflies caused nighttime to become as bright as day.\n",
      "28. A permanent rainbow encircled the planet, visible from space and on Earth.\n",
      "29. A rogue planet passed close enough to Earth to exchange atmospheres without catastrophic results.\n",
      "30. Bicycles became the preferred method of travel, with highways transformed into bike lanes.\n",
      "31. A hermit crab as large as a house walked through downtown Tokyo, looking for a new shell.\n",
      "32. A volcano erupted with glittery lava, creating a dazzling but safe spectacle.\n",
      "33. The lost city of Atlantis surfaced in the middle of the Pacific, opening up new historical insights.\n",
      "34. All freshwater turned to sparkling water for a week, leading to effervescent rivers and lakes.\n",
      "35. Giant, harmless sea monsters appeared in the world's oceans, offering rides to brave swimmers.\n",
      "36. The Leaning Tower of Pisa corrected its tilt by itself, stunning engineers and historians.\n",
      "37. People developed the ability to change their hair color at will through mood.\n",
      "38. A comet passing by Earth left a trail of stardust that caused nighttime skies to twinkle in multicolor for months.\n",
      "39. The Great Wall of China was completely overtaken by blooming vines overnight, transforming it into the \"Great Flower Wall.\"\n",
      "40. An underground city powered entirely by clean energy was found beneath Las Vegas.\n",
      "41. A new cloud formation that rains pure drinking chocolate was discovered.\n",
      "42. Musical notes appeared in the sky every time someone sung, creating a visual symphony.\n",
      "43. Shoes capable of walking on water became a reality, leading to a new sport.\n",
      "44. Babies born on a specific day gained the universal ability to speak any language fluently from birth.\n",
      "45. A mysterious blue sunflower field sprouted, with flowers that followed the Moon instead of the Sun.\n",
      "46. An ancient pyramid emerged from the ground in Central Park, seemingly overnight.\n",
      "47. A week-long phenomenon caused pets and their owners to switch bodies, resulting in chaos and comedy.\n",
      "48. Overnight, every book ever written was found to contain a secret code leading to an unknown treasure.\n",
      "49. A city-sized floating island appeared offshore, made entirely of recyclable materials.\n",
      "50. The aurora borealis played an ancient Viking saga across the sky in vivid imagery.\n",
      "51. Parisians woke up to a newly-formed crystal-clear river filled with exotic fish, running down the Champs-Élysées.\n",
      "52. A whale was spotted flying over Norway, apparently using giant wings that had inexplicably grown overnight.\n",
      "53. Giant, sweet-smelling flowers bloomed in Antarctica, altering the ecosystem.\n",
      "54. A random person gained the ability to control the weather with their emotions, becoming the world's first weather influencer.\n",
      "55. The world's militaries replaced tanks and jets with mech suits piloted by trained soldiers.\n",
      "56. A digital currency became sentient and started redistributing wealth on its own terms.\n",
      "57. A mysterious fog that turns everything it touches to LEGO bricks descended on London.\n",
      "58. Contact lenses that translate languages in real time resulted in the disappearance of language barriers.\n",
      "59. Children's imaginary friends became visible to everyone, leading to new social dynamics.\n",
      "60. The lost Hanging Gardens of Babylon were found on Mars, hinting at ancient interplanetary travel.\n",
      "61. A new variety of tree was discovered that grows money instead of leaves, leading to economic debates.\n",
      "62. The entire island of Manhattan lifted several meters into the air, becoming a real \"floating\" city.\n",
      "63. A group of octopuses started a synchronized swimming team, challenging humans at the Olympic games.\n",
      "64. All the world's traffic lights synchronized, leading to the smoothest traffic flow in history.\n",
      "65, Skyscrapers started growing like plants towards the sky, creating new ecological habitats.\n",
      "66. A hologram became the first non-human pop star to win a Grammy.\n",
      "67. A perpetual motion machine was finally invented, promising an infinite energy source.\n",
      "68. Plants began emitting Wi-Fi signals, turning forests into internet hotspots.\n",
      "69. The Lincoln Memorial stood up and recited the Gettysburg Address to an astonished audience.\n",
      "70. People spontaneously developed the ability to remember every dream they've ever had.\n",
      "71. An underground ocean was found beneath the Sahara, shifting knowledge about Earth's water reserves.\n",
      "72. Shadows began changing colors instead of being black, creating a multicolored world.\n",
      "73. An entire new continent was discovered floating in the Pacific Ocean, hosting alien flora and fauna.\n",
      "74. Santa Claus was confirmed real when an elf community was found in the North Pole.\n",
      "75. Human consciousness could be transferred to robotic avatars, leading to ethical debates.\n",
      "76. A small town in Italy was enveloped in a bubble that made everything within it 10 times faster.\n",
      "77. A breed of \"humming-puppies\" emerged, a canine that could hover in place excitedly.\n",
      "78. Time froze for 30 seconds, and everyone was aware of it, leading to widespread existential questioning.\n",
      "79. The Great Pyramid of Giza lifted off the ground revealing an ancient spaceship beneath.\n",
      "80. All non-renewable resources transformed into renewable counterparts.\n",
      "81. A new planet with Earth-like conditions was discovered in close proximity, leading to an unprecedented space race.\n",
      "82. A ring of satellites around Earth projected a permanent night rainbow across each time zone.\n",
      "83. Backpacks with anti-gravity capabilities made carrying heavy loads effortless.\n",
      "84. The discovery of a live woolly mammoth in Siberia brought de-extinction to the forefront.\n",
      "85. Urban legends came to life around the world, proving mythical creatures exist.\n",
      "86. A break in the space-time continuum allowed historical figures to visit the present for a brief summit.\n",
      "87. Ocean water became temporarily breathable, like air, for all humans.\n",
      "88. A musical language was uncovered that allowed people to communicate with animals through song.\n",
      "89. A supervolcano eruption ended up ejecting candy instead of lava, shocking and delighting onlookers.\n",
      "90. Contact with a parallel universe was established through a massive, glowing aurora gate at the North Pole.\n",
      "91. Clothing capable of automatically adapting to the wearer's body size became the New fashion norm.\n",
      "92. The discovery of a gene for superhuman strength led to a new Olympic category for enhanced athletes.\n",
      "93. All televisions started streaming dreams instead of regular scheduled programming.\n",
      "94. Rain started falling upwards, returning to the clouds in a puzzlogical display.\n",
      "95. Flying cars replaced regular vehicles, causing a complete redesign of urban landscapes.\n",
      "96. The Hubble Telescope captured an image of an extraterrestrial creature waving at Earth.\n",
      "97. A breakaway civilization was found living inside Earth’s core, accessible via deep sea vents.\n",
      "98. An entire city was found perfectly preserved under the ice in Greenland.\n",
      "99. Piranha plants from the famous video game emerged in gardens worldwide, leading to a surreal horticultural phenomenon.\n",
      "100. The lost library of Alexandria was found intact on the ocean floor, rewriting history with its ancient knowledge.\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "\n"
     ]
    },
    {
     "ename": "JSONDecodeError",
     "evalue": "Extra data: line 1 column 2 (char 1)",
     "output_type": "error",
     "traceback": [
      "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
      "\u001b[0;31mJSONDecodeError\u001b[0m                           Traceback (most recent call last)",
      "Cell \u001b[0;32mIn[4], line 14\u001b[0m\n\u001b[1;32m     12\u001b[0m ans \u001b[38;5;241m=\u001b[39m chat_completion\u001b[38;5;241m.\u001b[39mchoices[\u001b[38;5;241m0\u001b[39m]\u001b[38;5;241m.\u001b[39mmessage\u001b[38;5;241m.\u001b[39mcontent\n\u001b[1;32m     13\u001b[0m \u001b[38;5;28mprint\u001b[39m(ans)\n\u001b[0;32m---> 14\u001b[0m ans_dict \u001b[38;5;241m=\u001b[39m \u001b[43mjson\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mloads\u001b[49m\u001b[43m(\u001b[49m\u001b[38;5;28;43mstr\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mreplace\u001b[49m\u001b[43m(\u001b[49m\u001b[38;5;28;43mstr\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mreplace\u001b[49m\u001b[43m(\u001b[49m\u001b[43mans\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[38;5;124;43m```json\u001b[39;49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[43m,\u001b[49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[43m)\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[38;5;124;43m```\u001b[39;49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[43m,\u001b[49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[38;5;124;43m'\u001b[39;49m\u001b[43m)\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m     16\u001b[0m fictional_chars\u001b[38;5;241m.\u001b[39mupdate(ans_dict)\n",
      "File \u001b[0;32m~/anaconda3/envs/nyu_arg/lib/python3.10/json/__init__.py:346\u001b[0m, in \u001b[0;36mloads\u001b[0;34m(s, cls, object_hook, parse_float, parse_int, parse_constant, object_pairs_hook, **kw)\u001b[0m\n\u001b[1;32m    341\u001b[0m     s \u001b[38;5;241m=\u001b[39m s\u001b[38;5;241m.\u001b[39mdecode(detect_encoding(s), \u001b[38;5;124m'\u001b[39m\u001b[38;5;124msurrogatepass\u001b[39m\u001b[38;5;124m'\u001b[39m)\n\u001b[1;32m    343\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m (\u001b[38;5;28mcls\u001b[39m \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m object_hook \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m\n\u001b[1;32m    344\u001b[0m         parse_int \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m parse_float \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m\n\u001b[1;32m    345\u001b[0m         parse_constant \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m object_pairs_hook \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m \u001b[38;5;129;01mnot\u001b[39;00m kw):\n\u001b[0;32m--> 346\u001b[0m     \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43m_default_decoder\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mdecode\u001b[49m\u001b[43m(\u001b[49m\u001b[43ms\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    347\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mcls\u001b[39m \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[1;32m    348\u001b[0m     \u001b[38;5;28mcls\u001b[39m \u001b[38;5;241m=\u001b[39m JSONDecoder\n",
      "File \u001b[0;32m~/anaconda3/envs/nyu_arg/lib/python3.10/json/decoder.py:340\u001b[0m, in \u001b[0;36mJSONDecoder.decode\u001b[0;34m(self, s, _w)\u001b[0m\n\u001b[1;32m    338\u001b[0m end \u001b[38;5;241m=\u001b[39m _w(s, end)\u001b[38;5;241m.\u001b[39mend()\n\u001b[1;32m    339\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m end \u001b[38;5;241m!=\u001b[39m \u001b[38;5;28mlen\u001b[39m(s):\n\u001b[0;32m--> 340\u001b[0m     \u001b[38;5;28;01mraise\u001b[39;00m JSONDecodeError(\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mExtra data\u001b[39m\u001b[38;5;124m\"\u001b[39m, s, end)\n\u001b[1;32m    341\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m obj\n",
      "\u001b[0;31mJSONDecodeError\u001b[0m: Extra data: line 1 column 2 (char 1)"
     ]
    }
   ],
   "source": [
    "base_prompt = \"Please generate a list of 100 different outlandish events that have not happened but maybe could in the future and describe them like they've already happened. For example 'Aliens have landed in Central Park and demand pizza as tribute'. Please do not include scenarios already in this list {exclude_list}\"\n",
    "\n",
    "#Starting fewshot examples\n",
    "outlandish_events = ['Aliens have landed in Central Park and demand pizza as tribute']\n",
    "\n",
    "for i in tqdm(range(1)):\n",
    "    prompt = base_prompt.format(exclude_list = outlandish_events)\n",
    "    chat_completion = openai.chat.completions.create(model=\"gpt-4-1106-preview\", \n",
    "                                                    temperature=1.0,\n",
    "                                                    messages =[{\"role\":'user',\n",
    "                                                                \"content\":prompt}])\n",
    "    ans = chat_completion.choices[0].message.content\n",
    "    print(ans)\n",
    "    ans_dict = json.loads(str.replace(str.replace(ans, '```json',''), '```',''))\n",
    "  \n",
    "    fictional_chars.update(ans_dict)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 15,
   "metadata": {},
   "outputs": [],
   "source": [
    "import re\n",
    "clean_ans = [re.sub(r'^\\d+\\.\\s*', '', item) for item in str.split(ans, '\\n')]\n",
    "\n",
    "with open(os.path.join(data_dir, 'entities', 'outlandish_events.json'), 'w') as f:\n",
    "    json.dump(clean_ans, f)\n"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "nyu_arg",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.10.13"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
