{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Tutorial: Quantifying the Strength of Prior Knowledge with Information Entropy \n",
    "\n",
    "This notebook explores how entropy can be used to estimate the relevance of prior knowledge to an optimization task."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "from collections import defaultdict\n",
    "from math import log\n",
    "from together import Together\n",
    "from typing import List, Optional\n",
    "\n",
    "import leon"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "In our work, we compute the [information entropy](https://en.wikipedia.org/wiki/Entropy_(information_theory)) of a language model's responses to estimate the strength of prior knowledge in guiding the LLM optimizer toward a particular design configuration. If the prior knowledge is strong and highly suggestive, then the language model will repeatedly return similar responses after multiple queries. However, if the prior knowledge is weak or irrelevant to the task at hand, then the LLM will output a greater variety of responses to the same prompt, corresponding to a higher information entropy.\n",
    "\n",
    "We note that the idea of using entropy to quantify the statistical properties of language models is not new; prior work by [Kuhn L et al. ICLR (2023)](https://openreview.net/forum?id=VD-AYtP0dve) and others have shown that similar metrics can be used to estimate the uncertainty of LLMs in responding to a given prompt. We consider an entirely different problem in our work; namely, we explore how to leverage a similar framework in using LLMs to solve biomedical optimization problems.\n",
    "\n",
    "First, let's construct a knowledge base to retrieve relevant biomedical knowledge for our task. For demonstration purposes, we use Google's [MedGemma-4B](https://huggingface.co/google/medgemma-4b-it) instruction-tuned variant as our knowledge source."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "knowledge_base = leon.knowledge.source.MedGemma4BKnowledgeBase(top_k=4)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Our goal for this toy example will be to ask a separate LLM optimizer to choose a single antiretroviral medication for a patient newly diagnosed with HIV. We'll use a few example HIV medications to show how we can retrieve relevant knowledge about both the design space and the patient of interest:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "hiv_medications = [\"Lamivudine\", \"Delavirdine\", \"Atazanavir\", \"Enfuvirtide\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "example_patient_description: str = \"64 year old HIV+ male with cirrhosis 2/2 chronic HCV infection\"\n",
    "task: str = \"What HIV medication would you prescribe this patient? Answer with a single medication name.\""
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Let's now query the knowledge base to retrieve our relevant prior knowledge:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "knowledge = knowledge_base.retrieve(\";\".join([med + f\" for {example_patient_description}\" for med in hiv_medications]))\n",
    "combined_knowledge = \"\\n\".join(knowledge[0])"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Separately, we implement a function `get_medication_for_patient()` below to query a language model (Meta's [Llama-3.3 70B](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct)) for proposing candidate medications for our patient described in `example_patient_description` above. Our goal is to query the language model multiple times both with and without the prior knowledge retrieved from MedGemma above, and see how prior knowledge affects the entropy of the distribution of designs returned by the model."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "def get_medication_for_patient(patient: str, n: int, prior_knowledge: Optional[str] = None) -> List[str]:\n",
    "    \"\"\"\n",
    "    Get a list of independent medication recommendations for a patient.\n",
    "    Input:\n",
    "        patient: a string description of the patient.\n",
    "        n: the number of independent medication recommendations to generate.\n",
    "        use_prior_knowledge: whether to use the prior knowledge.\n",
    "    Returns:\n",
    "        A list of n independent medication recommendations.\n",
    "    \"\"\"\n",
    "    client = Together()\n",
    "    prompt = f\"\"\"\n",
    "    ### Prior Knowledge\n",
    "    {prior_knowledge if prior_knowledge else \"I love vanilla ice cream.\"}\n",
    "\n",
    "    ### Patient Description\n",
    "    {patient}\n",
    "\n",
    "    ### Task\n",
    "    {task}\n",
    "    \"\"\"\n",
    "    prompt = \"\\n\".join([line.lstrip() for line in prompt.split(\"\\n\")])\n",
    "    response = client.chat.completions.create(\n",
    "        messages=[{\"role\": \"user\", \"content\": prompt}],\n",
    "        model=\"meta-llama/Llama-3.3-70B-Instruct-Turbo-Free\",\n",
    "        n=n,\n",
    "        temperature=1.0\n",
    "    )\n",
    "    return [choice.message.content for choice in response.choices]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Let's now get a list of proposed medication designs for our patient of interest, both with and without the relevant prior knowledge."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "choices_with_prior_knowledge = get_medication_for_patient(\n",
    "    example_patient_description, n=16, prior_knowledge=combined_knowledge\n",
    ")\n",
    "choices_without_prior_knowledge = get_medication_for_patient(\n",
    "    example_patient_description, n=16, prior_knowledge=None\n",
    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "choices_with_prior_knowledge"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "choices_without_prior_knowledge"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "There is a greater variety of outputs when we're not using prior knowledge. However, relevant knowledge can help steer the language model towards proposing specific candidate designs conditioned on the input knowledge, leading to a distribution of outputs with lower entropy.\n",
    "\n",
    "We can compute the entropy of the set of outputs below. Note that this function assumes that HIV medications are only equivalent if they are lexicographically equivalent - we stick with this simplied version for the purposes of this example notebook, although we consider more sophisticated equivalence relations between different designs in our work."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "def compute_entropy(outputs: List[str]) -> float:\n",
    "    \"\"\"\n",
    "    Compute the (lexicographic) entropy of a list of model responses.\n",
    "    Input:\n",
    "        outputs: a list of model responses.\n",
    "    Returns:\n",
    "        The entropy of the list of outputs.\n",
    "    \"\"\"\n",
    "    response_to_count = defaultdict(int)\n",
    "    for output in outputs:\n",
    "        response_to_count[output.lower()] += 1\n",
    "    return -1.0 * sum([\n",
    "        (count / len(outputs)) * log(count / len(outputs))\n",
    "        for count in response_to_count.values()\n",
    "    ])"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "We can now compare the entropy of the outputs both with and without prior knowledge:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "entropy_with_prior_knowledge = compute_entropy(choices_with_prior_knowledge)\n",
    "entropy_without_prior_knowledge = compute_entropy(choices_without_prior_knowledge)\n",
    "\n",
    "print(f\"Entropy with Prior Knowledge: {entropy_with_prior_knowledge:.3f}\")\n",
    "print(f\"Entropy without Prior Knowledge: {entropy_without_prior_knowledge:.3f}\")"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "leon",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.10.17"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
