{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "**Important Notes**\n",
      "- Do NOT refer to memorized reaction outcomes, prior experimental data, or specific literature results (even if they match the current experiment).\n",
      "- You MAY use general domain knowledge from {application_context} (e.g., chemical reactivity principles, mechanism patterns) to inform reasoning.\n",
      "\n",
      "**Experiment Domain**\n",
      "{application_context}\n",
      "\n",
      "**Experiment Overview**\n",
      "{description}\n",
      "\n",
      "**Optimization Parameters**\n",
      "The experiment design space is defined by the parameters below, including their bounds and any relevant constraints:\n",
      "\n",
      "- **Parameters and Bounds**: \n",
      "{parameter_definitions}\n",
      "\n",
      "- **Constraint**: \n",
      "{constraint}\n",
      "\n",
      "- **Target**\n",
      "{target}\n",
      "\n",
      "**Task**\n",
      "In approximately 300 words, provide a clear and concise overview of this experiment. Your overview should:\n",
      "1. Clearly state the experimental goal and its significance in the {application_context} domain\n",
      "2. Describe the parameter space including data types and bounds of each parameter\n",
      "3. Explain any constraints that must be maintained during optimization\n",
      "4. Outline how Bayesian Optimization will systematically explore the parameter space to target. \n",
      "5. Mention how you will assist when BO plateaus by suggesting promising parameter combinations\n",
      "6. Conclude with the expected outcomes and potential impact of successful optimization\n",
      "\n",
      "Focus on creating a coherent narrative that connects these elements while maintaining scientific rigor and clarity.\n"
     ]
    }
   ],
   "source": [
    "# ---------------------------------- jsonline ----------------------------------\n",
    "\n",
    "jsonline = {\n",
    "    \"generate_overview\": \"**Important Notes**\\n- Do NOT refer to memorized reaction outcomes, prior experimental data, or specific literature results (even if they match the current experiment).\\n- You MAY use general domain knowledge from {application_context} (e.g., chemical reactivity principles, mechanism patterns) to inform reasoning.\\n\\n**Experiment Domain**\\n{application_context}\\n\\n**Experiment Overview**\\n{description}\\n\\n**Optimization Parameters**\\nThe experiment design space is defined by the parameters below, including their bounds and any relevant constraints:\\n\\n- **Parameters and Bounds**: \\n{parameter_definitions}\\n\\n- **Constraint**: \\n{constraint}\\n\\n- **Target**\\n{target}\\n\\n**Task**\\nIn approximately 300 words, provide a clear and concise overview of this experiment. Your overview should:\\n1. Clearly state the experimental goal and its significance in the {application_context} domain\\n2. Describe the parameter space including data types and bounds of each parameter\\n3. Explain any constraints that must be maintained during optimization\\n4. Outline how Bayesian Optimization will systematically explore the parameter space to target. \\n5. Mention how you will assist when BO plateaus by suggesting promising parameter combinations\\n6. Conclude with the expected outcomes and potential impact of successful optimization\\n\\nFocus on creating a coherent narrative that connects these elements while maintaining scientific rigor and clarity.\"\n",
    "}\n",
    "print(jsonline['generate_overview'])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Error loading agent_prompts.json: Expecting value: line 1 column 1 (char 0)\n",
      "Error loading llm_prompts.json: Expecting value: line 1 column 1 (char 0)\n",
      "**Important Notes**\n",
      "- Do NOT refer to memorized reaction outcomes, prior experimental data, or specific literature results (even if they match the current experiment).\n",
      "- You MAY use general domain knowledge from {application_context} (e.g., chemical reactivity principles, mechanism patterns) to inform reasoning.\n",
      "\n",
      "**Experiment Domain**\n",
      "{application_context}\n",
      "\n",
      "**Experiment Overview**\n",
      "{description}\n",
      "\n",
      "**Optimization Parameters**\n",
      "The experiment design space is defined by the parameters below, including their bounds and any relevant constraints:\n",
      "\n",
      "- **Parameters and Bounds**: \n",
      "{parameter_definitions}\n",
      "\n",
      "- **Constraint**: \n",
      "{constraint}\n",
      "\n",
      "- **Target**\n",
      "{target}\n",
      "\n",
      "**Task**\n",
      "In approximately 300 words, provide a clear and concise overview of this experiment. Your overview should:\n",
      "1. Clearly state the experimental goal and its significance in the {application_context} domain\n",
      "2. Describe the parameter space including data types and bounds of each parameter\n",
      "3. Explain any constraints that must be maintained during optimization\n",
      "4. Outline how Bayesian Optimization will systematically explore the parameter space to target. \n",
      "5. Mention how you will assist when BO plateaus by suggesting promising parameter combinations\n",
      "6. Conclude with the expected outcomes and potential impact of successful optimization\n",
      "\n",
      "Focus on creating a coherent narrative that connects these elements while maintaining scientific rigor and clarity.\n"
     ]
    }
   ],
   "source": [
    "# ---------------------------------- jsonline --> prompt_template ----------------------------------\n",
    "from src.prompts.base import PromptManager\n",
    "\n",
    "pm = PromptManager()\n",
    "print(pm.get(key=\"generate_overview\"))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "**Important Notes**\n",
      "- Do NOT refer to memorized reaction outcomes, prior experimental data, or specific literature results (even if they match the current experiment).\n",
      "- You MAY use general domain knowledge from Organic Chemistry/Cross Coupling Reaction Optimization (e.g., chemical reactivity principles, mechanism patterns) to inform reasoning.\n",
      "\n",
      "**Experiment Domain**\n",
      "Organic Chemistry/Cross Coupling Reaction Optimization\n",
      "\n",
      "**Experiment Overview**\n",
      "Optimization of Suzuki-Miyaura cross-coupling reaction conditions to maximize product yield through systematic exploration of electrophile-nucleophile combinations, ligand selection, base types, and solvent effects. The reaction involves palladium-catalyzed coupling between aryl halides and arylboronic acids.\n",
      "\n",
      "**Optimization Parameters**\n",
      "The experiment design space is defined by the parameters below, including their bounds and any relevant constraints:\n",
      "\n",
      "- **Parameters and Bounds**: \n",
      "[{'display_name': 'Electrophile', 'description': 'Aryl halide substrate for coupling (SMILES representation)', 'data_type': 'categorical', 'bounds': ['BrC1=CC=C(N=CC=C2)C2=C1', 'ClC1=CC=C(N=CC=C2)C2=C1', 'IC1=CC=C(N=CC=C2)C2=C1', 'O=S(OC1=CC=C(N=CC=C2)C2=C1)(C(F)(F)F)=O']}, {'display_name': 'Nucleophile', 'description': 'Boronic acid nucleophile (SMILES representation)', 'data_type': 'categorical', 'bounds': ['CC1=CC=C(N(C2CCCCO2)N=C3)C3=C1[B-](F)(F)F', 'CC1=CC=C(N(C2CCCCO2)N=C3)C3=C1B(O)O', 'CC1=CC=C(N(C2CCCCO2)N=C3)C3=C1B4OC(C)(C)C(C)(C)O4']}, {'display_name': 'Ligand', 'description': 'Phosphine ligand for palladium catalyst (SMILES)', 'data_type': 'categorical', 'bounds': ['[c-]1(P(C2=CC=CC=C2)C3=CC=CC=C3)cccc1.[c-]4(P(C5=CC=CC=C5)C6=CC=CC=C6)cccc4.[Fe+2]', 'CC(C)(C)P(C(C)(C)C)C1=CC=C(N(C)C)C=C1', 'CC(C)(P(C(C)(C)C)[c-]1cccc1)C.CC(C)(P(C(C)(C)C)[c-]2cccc2)C.[Fe+2]', 'CC(C1=C(C2=CC=CC=C2P(C3CCCCC3)C4CCCCC4)C(C(C)C)=CC(C(C)C)=C1)C', 'CC(P(C(C)(C)C)C(C)(C)C)(C)C', 'CC1(C)C2=C(OC3=C1C=CC=C3P(C4=CC=CC=C4)C5=CC=CC=C5)C(P(C6=CC=CC=C6)C7=CC=CC=C7)=CC=C2', 'CC1=CC=CC=C1P(C2=CC=CC=C2C)C3=CC=CC=C3C', 'CCCCP(C12C[C@@H]3C[C@@H](C[C@H](C2)C3)C1)C45C[C@H]6C[C@@H](C5)C[C@@H](C4)C6', 'COC1=CC=CC(OC)=C1C2=C(P(C3CCCCC3)C4CCCCC4)C=CC=C2', 'P(C1=CC=CC=C1)(C2=CC=CC=C2)C3=CC=CC=C3', 'P(C1CCCCC1)(C2CCCCC2)C3CCCCC3']}, {'display_name': 'Base', 'description': 'Reaction base for transmetallation step (SMILES)', 'data_type': 'categorical', 'bounds': ['[Cs+].[F-]', '[K+].[OH-]', '[Na+].[OH-]', 'CC([O-])C.[Li+]', 'CCN(CC)CC', 'O=P([O-])([O-])[O-].[K+].[K+].[K+]', 'OC([O-])=O.[Na+]']}, {'display_name': 'Solvent', 'description': 'Reaction medium (SMILES representation)', 'data_type': 'categorical', 'bounds': ['C1COCC1', 'CO', 'N#CC', 'O=CN(C)C']}]\n",
      "\n",
      "- **Constraint**: \n",
      "Reagents must be chemically compatible and reaction conditions must maintain intermediate stability. All combinations must form stable palladium complexes during catalytic cycle.\n",
      "\n",
      "- **Target**\n",
      "{'name': 'Yield', 'description': 'Percentage yield of cross-coupled product', 'direction': 'maximize'}\n",
      "\n",
      "**Task**\n",
      "In approximately 300 words, provide a clear and concise overview of this experiment. Your overview should:\n",
      "1. Clearly state the experimental goal and its significance in the Organic Chemistry/Cross Coupling Reaction Optimization domain\n",
      "2. Describe the parameter space including data types and bounds of each parameter\n",
      "3. Explain any constraints that must be maintained during optimization\n",
      "4. Outline how Bayesian Optimization will systematically explore the parameter space to target. \n",
      "5. Mention how you will assist when BO plateaus by suggesting promising parameter combinations\n",
      "6. Conclude with the expected outcomes and potential impact of successful optimization\n",
      "\n",
      "Focus on creating a coherent narrative that connects these elements while maintaining scientific rigor and clarity.\n"
     ]
    }
   ],
   "source": [
    "# ---------------------------------- prompt_template --> prompt ----------------------------------\n",
    "\n",
    "meta_dict = {\n",
    "    \"name\": \"Suzuki Reaction Optimization\",\n",
    "    \"application_context\": \"Organic Chemistry/Cross Coupling Reaction Optimization\",\n",
    "    \"description\": \"Optimization of Suzuki-Miyaura cross-coupling reaction conditions to maximize product yield through systematic exploration of electrophile-nucleophile combinations, ligand selection, base types, and solvent effects. The reaction involves palladium-catalyzed coupling between aryl halides and arylboronic acids.\",\n",
    "    \"constraint\": \"Reagents must be chemically compatible and reaction conditions must maintain intermediate stability. All combinations must form stable palladium complexes during catalytic cycle.\",\n",
    "    \"parameter_definitions\": [\n",
    "        {\n",
    "            \"display_name\": \"Electrophile\",\n",
    "            \"description\": \"Aryl halide substrate for coupling (SMILES representation)\",\n",
    "            \"data_type\": \"categorical\",\n",
    "            \"bounds\": [\n",
    "                \"BrC1=CC=C(N=CC=C2)C2=C1\",\n",
    "                \"ClC1=CC=C(N=CC=C2)C2=C1\",\n",
    "                \"IC1=CC=C(N=CC=C2)C2=C1\",\n",
    "                \"O=S(OC1=CC=C(N=CC=C2)C2=C1)(C(F)(F)F)=O\",\n",
    "            ],\n",
    "        },\n",
    "        {\n",
    "            \"display_name\": \"Nucleophile\",\n",
    "            \"description\": \"Boronic acid nucleophile (SMILES representation)\",\n",
    "            \"data_type\": \"categorical\",\n",
    "            \"bounds\": [\n",
    "                \"CC1=CC=C(N(C2CCCCO2)N=C3)C3=C1[B-](F)(F)F\",\n",
    "                \"CC1=CC=C(N(C2CCCCO2)N=C3)C3=C1B(O)O\",\n",
    "                \"CC1=CC=C(N(C2CCCCO2)N=C3)C3=C1B4OC(C)(C)C(C)(C)O4\",\n",
    "            ],\n",
    "        },\n",
    "        {\n",
    "            \"display_name\": \"Ligand\",\n",
    "            \"description\": \"Phosphine ligand for palladium catalyst (SMILES)\",\n",
    "            \"data_type\": \"categorical\",\n",
    "            \"bounds\": [\n",
    "                \"[c-]1(P(C2=CC=CC=C2)C3=CC=CC=C3)cccc1.[c-]4(P(C5=CC=CC=C5)C6=CC=CC=C6)cccc4.[Fe+2]\",\n",
    "                \"CC(C)(C)P(C(C)(C)C)C1=CC=C(N(C)C)C=C1\",\n",
    "                \"CC(C)(P(C(C)(C)C)[c-]1cccc1)C.CC(C)(P(C(C)(C)C)[c-]2cccc2)C.[Fe+2]\",\n",
    "                \"CC(C1=C(C2=CC=CC=C2P(C3CCCCC3)C4CCCCC4)C(C(C)C)=CC(C(C)C)=C1)C\",\n",
    "                \"CC(P(C(C)(C)C)C(C)(C)C)(C)C\",\n",
    "                \"CC1(C)C2=C(OC3=C1C=CC=C3P(C4=CC=CC=C4)C5=CC=CC=C5)C(P(C6=CC=CC=C6)C7=CC=CC=C7)=CC=C2\",\n",
    "                \"CC1=CC=CC=C1P(C2=CC=CC=C2C)C3=CC=CC=C3C\",\n",
    "                \"CCCCP(C12C[C@@H]3C[C@@H](C[C@H](C2)C3)C1)C45C[C@H]6C[C@@H](C5)C[C@@H](C4)C6\",\n",
    "                \"COC1=CC=CC(OC)=C1C2=C(P(C3CCCCC3)C4CCCCC4)C=CC=C2\",\n",
    "                \"P(C1=CC=CC=C1)(C2=CC=CC=C2)C3=CC=CC=C3\",\n",
    "                \"P(C1CCCCC1)(C2CCCCC2)C3CCCCC3\",\n",
    "            ],\n",
    "        },\n",
    "        {\n",
    "            \"display_name\": \"Base\",\n",
    "            \"description\": \"Reaction base for transmetallation step (SMILES)\",\n",
    "            \"data_type\": \"categorical\",\n",
    "            \"bounds\": [\n",
    "                \"[Cs+].[F-]\",\n",
    "                \"[K+].[OH-]\",\n",
    "                \"[Na+].[OH-]\",\n",
    "                \"CC([O-])C.[Li+]\",\n",
    "                \"CCN(CC)CC\",\n",
    "                \"O=P([O-])([O-])[O-].[K+].[K+].[K+]\",\n",
    "                \"OC([O-])=O.[Na+]\",\n",
    "            ],\n",
    "        },\n",
    "        {\n",
    "            \"display_name\": \"Solvent\",\n",
    "            \"description\": \"Reaction medium (SMILES representation)\",\n",
    "            \"data_type\": \"categorical\",\n",
    "            \"bounds\": [\"C1COCC1\", \"CO\", \"N#CC\", \"O=CN(C)C\"],\n",
    "        },\n",
    "    ],\n",
    "    \"target\": {\n",
    "        \"name\": \"Yield\",\n",
    "        \"description\": \"Percentage yield of cross-coupled product\",\n",
    "        \"direction\": \"maximize\",\n",
    "    },\n",
    "}\n",
    "print(pm.format(\"generate_overview\", **meta_dict))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Start generating overview...\n",
      "\n",
      "Overview has been generated! and the content is as follows: \n",
      " ### Overview of the Experiment: Optimizing Suzuki-Miyaura Cross-Coupling Reaction Conditions\n",
      "\n",
      "The primary goal of this experiment is to optimize the Suzuki-Miyaura cross-coupling reaction to maximize product yield. This reaction is pivotal in organic chemistry, particularly for the synthesis of C-C bonds, as it allows for the coupling of aryl halides with arylboronic acids under palladium catalysis. The experiment seeks to identify the most effective combination of electrophiles, nucleophiles, ligands, bases, and solvents to achieve optimal yields, thereby enhancing the utility of this reaction in synthetic chemistry.\n",
      "\n",
      "#### Parameter Space and Constraints\n",
      "\n",
      "The parameter space encompasses five key variables, each with distinct categorical bounds and data types. The electrophile can be one of four aryl halides, the nucleophile includes three different boronic acids, the ligand comprises ten phosphine ligands, the base has seven options, and the solvent is limited to four choices. Each reagent must be chemically compatible to ensure the formation of stable palladium complexes during the catalytic cycle. This constraint is crucial for maintaining the integrity of the reaction pathway and ensuring that intermediates remain stable throughout the process.\n",
      "\n",
      "#### Systematic Exploration via Bayesian Optimization\n",
      "\n",
      "Bayesian Optimization (BO) will systematically explore the parameter space by iteratively selecting promising combinations based on previously observed yields. Initially, a diverse set of parameter combinations will be tested to establish a baseline. Subsequent selections will be guided by a probabilistic model that predicts the yield for untested combinations, focusing on areas of high potential. This approach ensures efficient exploration of the parameter space, minimizing the number of experiments required to reach optimal conditions.\n",
      "\n",
      "#### Plateau Mitigation and Suggestion Strategy\n",
      "\n",
      "If BO plateaus, promising parameter combinations will be suggested based on theoretical insights into reactivity and stability. For instance, substituting a less common but potentially more reactive ligand or exploring alternative bases could lead to improved yields. These suggestions will be informed by general domain knowledge in organic chemistry, such as the influence of steric and electronic factors on reactivity.\n",
      "\n",
      "#### Expected Outcomes and Impact\n",
      "\n",
      "Upon successful optimization, this experiment will yield a robust set of conditions for maximizing the yield of cross-coupled products in the Suzuki-Miyaura reaction. The optimized protocol will enhance the efficiency and scope of this versatile reaction, facilitating more complex and challenging syntheses. This optimization could have significant implications for drug discovery, material science, and industrial applications, where efficient C-C bond formation is critical.\n",
      "\n"
     ]
    }
   ],
   "source": [
    "# ---------------------------------- 具体函数测试 ----------------------------------\n",
    "from src.llms.qwq import QWQClient\n",
    "\n",
    "client = QWQClient()\n",
    "print(\"Start generating overview...\\n\")\n",
    "formatted_prompt = pm.format(\"generate_overview\", **meta_dict)\n",
    "overview, _ = client.generate(user_prompt=formatted_prompt)\n",
    "print(\n",
    "    f\"Overview has been generated! and the content is as follows: \\n {overview}\\n\"\n",
    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "### Overview of the Experiment: Optimizing Suzuki-Miyaura Cross-Coupling Reaction Conditions\n",
      "\n",
      "The primary goal of this experiment is to optimize the Suzuki-Miyaura cross-coupling reaction to maximize product yield. This reaction is pivotal in organic chemistry, particularly for the synthesis of C-C bonds, as it allows for the coupling of aryl halides with arylboronic acids under palladium catalysis. The experiment seeks to identify the most effective combination of electrophiles, nucleophiles, ligands, bases, and solvents to achieve optimal yields, thereby enhancing the utility of this reaction in synthetic chemistry.\n",
      "\n",
      "#### Parameter Space and Constraints\n",
      "\n",
      "The parameter space encompasses five key variables, each with distinct categorical bounds and data types. The electrophile can be one of four aryl halides, the nucleophile includes three different boronic acids, the ligand comprises ten phosphine ligands, the base has seven options, and the solvent is limited to four choices. Each reagent must be chemically compatible to ensure the formation of stable palladium complexes during the catalytic cycle. This constraint is crucial for maintaining the integrity of the reaction pathway and ensuring that intermediates remain stable throughout the process.\n",
      "\n",
      "#### Systematic Exploration via Bayesian Optimization\n",
      "\n",
      "Bayesian Optimization (BO) will systematically explore the parameter space by iteratively selecting promising combinations based on previously observed yields. Initially, a diverse set of parameter combinations will be tested to establish a baseline. Subsequent selections will be guided by a probabilistic model that predicts the yield for untested combinations, focusing on areas of high potential. This approach ensures efficient exploration of the parameter space, minimizing the number of experiments required to reach optimal conditions.\n",
      "\n",
      "#### Plateau Mitigation and Suggestion Strategy\n",
      "\n",
      "If BO plateaus, promising parameter combinations will be suggested based on theoretical insights into reactivity and stability. For instance, substituting a less common but potentially more reactive ligand or exploring alternative bases could lead to improved yields. These suggestions will be informed by general domain knowledge in organic chemistry, such as the influence of steric and electronic factors on reactivity.\n",
      "\n",
      "#### Expected Outcomes and Impact\n",
      "\n",
      "Upon successful optimization, this experiment will yield a robust set of conditions for maximizing the yield of cross-coupled products in the Suzuki-Miyaura reaction. The optimized protocol will enhance the efficiency and scope of this versatile reaction, facilitating more complex and challenging syntheses. This optimization could have significant implications for drug discovery, material science, and industrial applications, where efficient C-C bond formation is critical.\n"
     ]
    }
   ],
   "source": [
    "print(overview)"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "bo",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.10.0"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
