{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Compressing Initialized Memory\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "/Users/matthewho/miniconda3/envs/arc_agi/lib/python3.11/site-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html\n",
      "  from .autonotebook import tqdm as notebook_tqdm\n"
     ]
    }
   ],
   "source": [
    "from notebook_imports import *"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [],
   "source": [
    "from concept_mem.memory.v4.concept import Concept\n",
    "from concept_mem.memory.v4.memory import ConceptMemory\n",
    "from concept_mem.utils import extract_yaml_block"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [],
   "source": [
    "memory_abstraction_output_dir = REPO_ROOT / \"outputs/2025-07-24/23-32-57\"\n",
    "cm = ConceptMemory()\n",
    "cm.load_from_file(memory_abstraction_output_dir / \"memory.json\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "93\n"
     ]
    }
   ],
   "source": [
    "requires_compression = []\n",
    "for c, info in cm.concepts.items():\n",
    "    if len(info.used_in) <= 1:\n",
    "        continue\n",
    "    if len(info.cues) > 1 or len(info.implementation) > 1:\n",
    "        requires_compression.append(info)\n",
    "print(len(requires_compression))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [],
   "source": [
    "llm_client = LLMClient(\n",
    "    provider=Provider.OPENAI,\n",
    "    cache_dir=str(REPO_ROOT / \"cache\"),\n",
    "    dotenv_path=DOTENV_PATH,\n",
    ")\n",
    "\n",
    "gen_cfg = GenerationConfig(\n",
    "    n=1,\n",
    "    temperature=0.1,\n",
    "    max_tokens=1024,\n",
    "    top_p=1,\n",
    "    batch_size=16,\n",
    "    seed=88,\n",
    "    ignore_cache=False,\n",
    "    expand_multi=None,  # let the provider decide\n",
    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {},
   "outputs": [],
   "source": [
    "compression_prompt_template = \"\"\"\\\n",
    "# Introduction\n",
    "Consider a class of \"ARC\" puzzles where each puzzle has a hidden transformation rule that maps input grids to output grids. Each puzzle presents several input-output grid pairs as reference examples and the task is to predict the transformation rule. Grids are 2D numpy integer arrays with integers representing colors. 0 represents black and usually serves as the background.\n",
    "\n",
    "We are trying to learn from previously solved puzzles to help solve more puzzles in the future. We analyzed puzzle solutions and abstracted reusable concepts. Here, a concept can encode one of the following:\n",
    "1. routine: routines from the solution program, which can either directly output a grid, or prepare handle some intermediate processing\n",
    "2. structure: a class of visual entities (not program data structures) that can be seen in an individual pixel grid. The visual entity should be general, perhaps by being defined by their function or in relation to an operation.\n",
    "\n",
    "Following a functional programming philosophy, we promote reusability through composition by parameterizing routines:\n",
    "- We also encourage passing specialized logic (other routines) as parameters to higher order routines\n",
    "- Each routine is typed-- it specifies output typing in addition to a list of parameter specifications\n",
    "- We allow custom types to be defined with the format \"name := definition\"\n",
    "- Often custom types are `Callable` that specify some pluggable operations\n",
    "\n",
    "The overarching goal is to help future puzzle solving which boils down to 2 problems:\n",
    "1. determining what the transformation is (by examining input/output examples)\n",
    "2. implementing the transformation in code\n",
    "\n",
    "- These concepts are meant to compactly encode ideas from solved puzzles and help puzzle solving by remembering what operations could be involved and how they might interact with each other via typing. \n",
    "- Parameterization and composition via higher order routines/pluggable operations helps ensure concepts are reusable and not overly specific to one puzzle\n",
    "- But to specifically address the 2 main problems, we also annotate concepts with:\n",
    "    1. relevance cues: suggestion of what to look for at puzzle solving time that would indicate this concept is potentially relevant.\n",
    "        - can either describe (1) things to look for in the puzzle grids or (2) related concepts that might implicate this one\n",
    "        - we primarily care about annotating cues for grid manipulation routines and structures where there are concrete things to look for in the reference grids.\n",
    "    2. implementation notes: suggestions on how to implement the idea programmatically.\n",
    "\n",
    "# Task\n",
    "We allowed multiple passes to add to the cues and implementation notes lists, and now we are looking to remove redundancy. Make sure to keep separate ideas in separate entries, but remove duplicate entries in a list or if they are very similar and only subtly different, try to synthesize into a single entry.\n",
    "\n",
    "We expect you to output a fenced yaml markdown block (preceded by \"```yaml\" and followed by \"```\") that contains a re-written version of the cues and implementation notes lists. Here's how we expect the output to look like:\n",
    "```yaml\n",
    "cues:\n",
    "  - first cue\n",
    "  - second cue\n",
    "implementation:\n",
    "  - first implementation note\n",
    "  - second implementation note\n",
    "```\n",
    "\n",
    "# Your Concept Annotation\n",
    "```\n",
    "{concept}\n",
    "```\"\"\""
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# generate compressed annotations\n",
    "prompts = [\n",
    "    compression_prompt_template.format(concept=c.to_string())\n",
    "    for c in requires_compression\n",
    "]\n",
    "res = await llm_client.async_batch_generate(\n",
    "    prompts=prompts,\n",
    "    model=\"gpt-4.1-2025-04-14\",\n",
    "    gen_cfg=gen_cfg,\n",
    ")\n",
    "write_json(\n",
    "    res,\n",
    "    REPO_ROOT / \"data/abstract_anno/op3/cue_impl_compressed_v2.json\",\n",
    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# write compressed cues and implementation notes back to concepts\n",
    "for c, model_out in zip(requires_compression, res):\n",
    "    yaml_string = extract_yaml_block(model_out[0])\n",
    "    cue_impl = yaml.safe_load(yaml_string)\n",
    "    assert isinstance(cue_impl, dict), f\"Expected dict, got {type(cue_impl)}\"\n",
    "    c.cues = cue_impl.get(\"cues\", c.cues)\n",
    "    c.implementation = cue_impl.get(\"implementation\", c.implementation)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# save compressed memory\n",
    "cm.save_to_file(DATA_DIR / \"memory/compressed_v1.json\")"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "arc_agi",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.11.13"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
