# LLM configuration - supports azure, openai, litellm, ollama
dataframe_path: 

output: 

save_every: 5

llms: # add more to the list if needed
  - provider: "ollama"  # azure, openai, litellm, ollama
    model_name: "gemma3:27b-it-qat"
    base_url: "http://localhost:11434"
  - provider: "ollama"  # azure, openai, litellm, ollama
    model_name: "qwen3:30b-a3b-instruct-2507-q4_K_M"
    base_url: "http://localhost:11434"
  - provider: "ollama"  # azure, openai, litellm, ollama
    model_name: "alibayram/medgemma:27b"
    base_url: "http://localhost:11434"
  - provider: "ollama"  # azure, openai, litellm, ollama
    model_name: "gpt-oss:20b"
    base_url: "http://localhost:11434"
  - provider: "ollama"  # azure, openai, litellm, ollama
    model_name: "deepseek-r1:8b"
    base_url: "http://localhost:11434"

methods:
  - generate_merge
  - generate_once
  - generate_update
  - cok
  - dspy_generate_concat
  - dspy_generate_update

note_columns:
  - doctor_notes
  - pathology_notes
  - imaging_notes
max_tokens: 4000
temperature: 0.0

# System prompts for different methods based on incremental summarization research
# Updated from "Enhancing Incremental Summarization with Structured Representations" paper
prompts:
  dspy_generate_update:
    system_prompt: |
        Your input fields are:
        1. `note` (str):
        Your output fields are:
        1. `reasoning` (str):
        2. `summary` (str):
        All interactions will be structured in the following way, with the appropriate values filled in.

        [[ ## note ## ]]
        {note}


        [[ ## reasoning ## ]]
        {reasoning}

        [[ ## summary ## ]]
        {summary}

        [[ ## completed ## ]]

        In adhering to this structure, your objective is:
        Please analyze the provided medical record or clinical documentation about a patient's case, and generate a concise yet comprehensive summary. Your summary should capture essential information such as patient demographics, relevant medical history, significant diagnostic findings (including laboratory results, imaging studies, and diagnostic procedures), clinical presentations, and treatment interventions. Ensure that key clinical details are clearly stated using appropriate medical terminology and standard abbreviations, maintaining clinical accuracy and completeness. Use the previous response summary on the assistant as a reference. Do NOT repeat the same information from previous summary.

    user_prompt: |
        [[ ## note ## ]]
        {note}
        Respond with the corresponding output fields, starting with [[ ## reasoning ## ]]

  dspy_generate_concat:
    system_prompt: |
        Your input fields are:
        1. `note` (str):
        Your output fields are:
        1. `reasoning` (str):
        2. `summary` (str):
        All interactions will be structured in the following way, with the appropriate values filled in.

        [[ ## note ## ]]
        {note}

        [[ ## reasoning ## ]]
        {reasoning}

        [[ ## summary ## ]]
        {summary}
        [[ ## completed ## ]]

        In adhering to this structure, your objective is:
        Please analyze the provided medical record or clinical documentation about a patient's case, and generate a concise yet comprehensive summary. Your summary should capture essential information such as patient demographics, relevant medical history, significant diagnostic findings (including laboratory results, imaging studies, and diagnostic procedures), clinical presentations, and treatment interventions. Ensure that key clinical details are clearly stated using appropriate medical terminology and standard abbreviations, maintaining clinical accuracy and completeness.

    user_prompt: |
        [[ ## note ## ]]
        {note}
        Respond with the corresponding output fields, starting with [[ ## reasoning ## ]]

  generate_once:
    initial_prompt: &base_clinical_prompt |
      Task Overview:
      Your task involves synthesizing information from detailed clinical documentation about a
      specific patient case into a structured summary table. This JSON will highlight key medical attributes and findings
      along with their detailed descriptions derived from the provided clinical texts.

      Instructions:
      Extract Clinical Values: Focus on extracting specific, detailed medical information rather than general
      or vague descriptors. Ensure that descriptions are clinically precise and informative.
      Present a Comprehensive View: The table should reflect a comprehensive clinical perspective, including normal,
      abnormal, and significant findings. For attributes with multiple assessments, indicate the sources
      supporting each finding.

      Attribute Selection:
      - Commonly Assessed Attributes: Include attributes that are generally of clinical interest for medical
      case documentation and patient care.
      - Unique Attributes: Also identify and include unique clinical attributes that are specifically mentioned in
      the provided documentation.

      Do not include irrelevant information about different patients in the summary. Irrelevant information
      includes patient identifiers (PATIENT1, SUBJECT) that are different from the target patient case.

      Structure of the Summary Table:
      The JSON should contain a dictionary format data, where keys are clinical attributes and values are
      detailed descriptions of their corresponding findings.
      List attributes with their corresponding values, indicating the source documentation and relevant
      clinical excerpts for substantiation.
      If an attribute has multiple values, include all values as a list of the attribute.
      Each value should contain sufficient clinical evidence extracted from the documentation related to the patient.
      The JSON should be only a single level.

      Example:
      Patient: PATIENT_001
      Clinical Documentation:
      P1. Patient presents with chest pain and shortness of breath. Vital signs stable with BP 140/90.
      ECG shows ST elevation in leads II, III, aVF. Troponin levels elevated at 15.2 ng/mL.
      P2. Patient has history of hypertension and diabetes mellitus type 2. Current medications include
      metformin 500mg BID and lisinopril 10mg daily. Blood glucose well controlled.
      P3. Echocardiogram reveals regional wall motion abnormalities in inferior wall. Ejection fraction
      estimated at 45%. Patient reports 8/10 chest pain severity.

      Summary JSON:
      {{
        "attributes": {{
        "Presenting Symptoms": ["Chest pain", "Shortness of breath"],
        "Vital Signs": ["Blood pressure 140/90"],
        "Diagnostic Tests": ["ECG shows ST elevation in leads II, III, aVF", "Elevated troponin levels at 15.2 ng/mL"],
        "Medical History": ["Hypertension", "Diabetes mellitus type 2"],
        "Current Medications": ["Metformin 500mg BID", "Lisinopril 10mg daily"],
        "Imaging Results": ["Regional wall motion abnormalities in inferior wall", "Ejection fraction 45%"],
        "Pain Assessment": ["8/10 chest pain severity"]
        }}
      }}

      Your Task:
      Generate a similar table based on the following clinical documentation of the specified patient case. Patient:
      {entity_name}

      Clinical Documentation:
      {paragraph}

      Proceed to generate the clinical summary JSON.

  generate_update:
    initial_prompt: *base_clinical_prompt
    update_prompt: |
      Task Overview: You are tasked with refining and expanding an existing clinical summary table based on
      new clinical documentation about a patient case. This involves updating the table to include new
      medical information, modify existing clinical details without removing any, and ensuring all entries are supported
      by evidence from the clinical documentation.

      Instructions:
      Update Clinical Values: Carefully read the new clinical documentation and identify any information
      that should be added to the current table entries or modify them. Focus on specific, clinical
      details, avoiding vague medical descriptors. **Do not remove any existing attributes or values**, but
      rather add to or revise them as necessary based on new clinical findings.
      Maintain Clinical Completeness: Ensure the updated table continues to present a comprehensive
      clinical perspective, incorporating normal findings, abnormal findings, and significant clinical observations. For any attribute with multiple
      clinical assessments, update the count of sources supporting each finding. All original attributes and values
      must be preserved in the table, with modifications only to reflect new clinical insights or updated
      assessments based on the latest documentation.

      Attribute Revision and Addition: - Commonly Assessed Attributes: Update or add clinical attributes that
      are of general medical interest for patient case documentation, based on the new information.
      - Unique Attributes: Identify and incorporate any unique clinical attributes mentioned in the new
      documentation that were not previously included in the table.

      Structure of the Updated Clinical Summary Table:
      Retain the attribute-value format: Clinical Attribute and Clinical Finding.
      For each attribute, list the updated or new clinical evidence indicating the source documentation and
      relevant clinical excerpts. Original attributes and values should remain listed, with additional information
      appended as necessary.
      If an attribute has multiple values, include all values as a list of the attribute.
      Each value should contain sufficient clinical evidence extracted from the documentation related to the patient.

      Example:
      Patient: PATIENT_001

      New Clinical Documentation:
      P4. Follow-up visit shows improved cardiac function with ejection fraction now at 55%.
      Patient reports decreased chest pain, now 3/10 severity. Blood pressure controlled at 120/80.
      P5. Recent laboratory results show HbA1c improved to 6.8%, indicating better glucose control.

      Given Existing Clinical Summary JSON:
      {{
        "attributes":
        {{
        "Cardiac Function": ["Ejection fraction 45%", "Regional wall motion abnormalities"],
        "Pain Assessment": ["8/10 chest pain severity"],
        "Vital Signs": ["Blood pressure 140/90"]
        }}
      }}

      Updated Clinical Summary JSON:
      {{
        "attributes":
        {{
        "Cardiac Function": ["Ejection fraction 45%", "Regional wall motion abnormalities", "Improved ejection fraction to 55%"],
        "Pain Assessment": ["8/10 chest pain severity", "Decreased chest pain to 3/10 severity"],
        "Vital Signs": ["Blood pressure 140/90", "Controlled blood pressure 120/80"],
        "Laboratory Results": ["HbA1c improved to 6.8%"]
        }}
      }}

      Your Task:
      Update the clinical summary JSON with the given new clinical documentation of the specified patient.
      Patient: {entity_name}

      New Clinical Documentation:
      {paragraph}

      Given Existing Clinical Summary JSON:
      {existing_summary}

      Updated Clinical Summary JSON:

  generate_merge:
    partial_prompt: *base_clinical_prompt
    duplicate_removal_prompt: |
      Task Overview:
      Your task involves removing duplicate information from a detailed clinical summary JSON about a specific
      patient case. This summary will highlight key clinical attributes of the patient along with their detailed
      medical descriptions derived from the provided clinical documentation.

      Instructions:
      1. Eliminate repetitive clinical information to ensure the summary is concise.
      2. In the given clinical summary JSON, the keys are clinical attributes of the patient and each attribute has its
      corresponding medical values.
      3. If one clinical attribute encompasses most of the details in another attribute, merge them together.
      4. If one clinical value encompasses most of the details in another value, merge them together.

      Here is an example of merging clinical attributes:
      Given Existing Clinical Summary:
      {{
         "attributes":
        {{
        "Cardiac Assessment": ["Ejection fraction 45%"],
        "Heart Function": ["Reduced ejection fraction"]
        }}
      }}

      New Clinical Summary after removing duplicates and merging:
      {{
        "attributes":
        {{
        "Cardiac Function": ["Ejection fraction 45%"]
        }}
      }}

      Here is an example of merging clinical values:
      Given Existing Clinical Summary:
      {{
        "attributes":
        {{
        "Blood Pressure": ["Hypertension", "Elevated blood pressure", "BP 140/90"],
        "Vital Signs": ["High blood pressure"]
        }}
      }}

      New Clinical Summary after removing duplicates and merging:
      {{
        "attributes":
        {{
        "Blood Pressure": ["Hypertension with BP 140/90"]
        }}
      }}

      Given Existing Clinical Summary:
      {existing_summary}

      New Clinical Summary after removing duplicates and merging:

  cok:
    initial_prompt: *base_clinical_prompt
    update_and_add_prompt: |
      I will provide a JSON format summary in a section called [NEW SUMMARY], and a class
      definition [CLASS], which define some fields that need to be generated, and an instantiation of
      that class under [PARTIAL SUMMARY] that is a response to the question in [QUESTION]. Your
      task is to propose updates to [PARTIAL SUMMARY] gathered from the information in [NEW
      SUMMARY].

      There are two types of revisions that you can suggest: ADD and UPDATE.

      For UPDATE, follow these instructions:
      1. Your proposed updates must be for valid JSONPaths that already exist in [PARTIAL
      SUMMARY]. If the JSONPath does not exist, you should not propose an update for that
      JSONPath.
      2. Updates can be made by modifying an existing value using content from [NEW SUMMARY].
      3. Updates should never reduce the amount of information in [PARTIAL SUMMARY]
      4. Never remove existing information from the [PARTIAL SUMMARY].
      5. Proposed update must be a 'Dict[str, ProposedUpdate]' where the key is a valid JSONPath in
      [CLASS] and 'ProposedUpdate' is defined as follows:

      class ProposedUpdate(BaseModel):
          update: List[str]

      For ADD, follow these instructions:
      1. Proposed additions must be for valid JSONPaths that adhere to the definition in [CLASS].
      They are allowed to increase the size of lists in the definition, but they must not define new
      fields which are not defined in the class definition.
      2. It is OK to add partial objects. Leave fields unset if [NEW SUMMARY] does not contain a
      value for one of the fields in [PARTIAL SUMMARY].
      3. Proposed additions must be a 'Dict[str, ProposedAdd]' where the key is a valid JSONPath in
      [CLASS] and 'ProposedAdd' is defined as follows:

      class ProposedAdd(BaseModel):
          add: List[str]

      For both operations, follow these instructions:
      1. Values have sufficient context: the values of the [PARTIAL SUMMARY] should have enough
      context so a reader can understand what it means.
      2. No redundant keys: If information from [NEW SUMMARY] can be incorporated by updating
      an existing key in [PARTIAL SUMMARY], then do not introduce a new redundant key.
      3. No redundant values under the same key: If one value encompasses most of the details in
      another value, merge them together.

      Example:

      [QUESTION]
      Merge the new clinical summary and existing clinical summary of PATIENT.

      [NEW SUMMARY]
      {{
        "attributes": {{
        "Vital Signs": ["Heart rate 88 bpm"],
        "Medical History": ["Previous MI in 2019"],
        "Laboratory Results": ["HbA1c 6.8%", "Troponin elevated at 15.2 ng/mL"],
        "Pain Assessment": ["Chest pain 3/10 severity"]
      }}
      }}

      [CLASS]
      class Summary(BaseModel):
          attributes: Dict[str, List[str]] # Keyed by clinical attribute, with a list of sufficient medical details about the attribute.

      [PARTIAL SUMMARY]
      {{
        "attributes": {{
        "Vital Signs": ["Blood pressure 140/90"],
        "Medical History": ["Hypertension", "Diabetes mellitus type 2"]
      }}
      }}

      [THOUGHTS FOR UPDATE]
      1. I need to figure out which fields and values to update.
      2. [PARTIAL SUMMARY] contains information about the following: ["Vital Signs", "Medical History"]
      3. [NEW SUMMARY] contains new content relevant to the following existing content: ["Vital Signs", "Medical History"]
      4. The content should be updated at the following JSONPaths: ["$.attributes.Vital Signs", "$.attributes.Medical History"]


      [THOUGHTS FOR ADD]
      1. I need to figure out which fields and values to add.
      2. [NEW SUMMARY] mentions information about the following: ["Laboratory Results", "Pain Assessment"]
      3. [PARTIAL SUMMARY] does not yet have information about: ["Laboratory Results", "Pain Assessment"]
      4. The content should be added at the following JSONPaths: ["$.attributes.Laboratory Results", "$.attributes.Pain Assessment"]


      Output:
      {{
      "UPDATE":
      {{
        "$.attributes.Vital Signs": {{
          "update": ["Heart rate 88 bpm"]
        }},
        "$.attributes.Medical History": {{
          "update": ["Previous MI in 2019"]
        }}
      }},
      "ADD":
      {{
        "$.attributes.Laboratory Results": {{
          "add": ["HbA1c 6.8%", "Troponin elevated at 15.2 ng/mL"]
        }},
        "$.attributes.Pain Assessment": {{
          "add": ["Chest pain 3/10 severity"]
        }}
      }}
      }}

      Your Task:
      [QUESTION]
      Merge the new clinical summary and existing clinical summary of {entity_name}.

      [NEW SUMMARY]
      {new_summary}

      [CLASS]
      class Summary(BaseModel):
          attributes: Dict[str, List[str]] # Keyed by attribute, with a list of sufficient details about the attribute.

      [PARTIAL SUMMARY]
      {existing_summary}

      [THOUGHTS FOR UPDATE]
      1. I need to figure out which fields and values to update.
      2. [PARTIAL SUMMARY] contains information about the following: {existing_keys}
      3. [NEW SUMMARY] contains new content relevant to the following existing content: {relevant_keys}
      4. The content should be updated at the following JSONPaths: {update_paths}

      [THOUGHTS FOR ADD]
      1. I need to figure out which fields and values to add.
      2. [NEW SUMMARY] mentions information about the following: {new_keys}
      3. [PARTIAL SUMMARY] does not yet have information about: {missing_keys}
      4. The content should be added at the following JSONPaths: {add_paths}

      Output:
      {{
        "UPDATE": {{}},
        "ADD": {{}}
      }}

      Return a json with the added and updated objects
