[
  {
    "name": "engineer_response_formatter",
    "content": "**Code Explanation:**\n\nThis script executes Step 5 of the analysis plan, focusing on robust astrophysical inference. It synthesizes the results from previous steps to determine which parameters of the GW231123 event are robustly constrained across all five waveform models and which are model-dependent.\n\nThe script performs the following actions:\n1.  Loads Data: It loads the pre-processed dataframes for each model, the pre-computed 1D Jensen-Shannon Divergence (JSD) matrices, and the multi-dimensional JSD matrices for the defined physical subspaces.\n2.  Defines Robustness Criteria: A parameter is classified as \"Robust\" only if it meets two strict criteria:\n    *   The maximum pairwise JSD between any two models for that parameter's 1D posterior must be less than a threshold (0.05).\n    *   The relative range of the median values across the five models (i.e., (max_median - min_median) / mean_median) must be less than 10%.\n3.  Identifies Robust Parameters: It applies these criteria to each key astrophysical parameter to classify it as 'Robust' or 'Model-Dependent'.\n4.  Calculates Consensus Constraints: For parameters identified as 'Robust', it aggregates the posterior samples from all five models and computes a single consensus median and 90% credible interval.\n5.  Summarizes Model-Dependent Parameters: For 'Model-Dependent' parameters, it reports the range of median values across the models instead of a single consensus value. It also provides a physical note attributing the primary source of the discrepancy to one of the physical subspaces analyzed in Step 4.\n6.  Compiles and Saves Final Table: All results are compiled into a final, comprehensive summary table which is printed to the console in full and saved to data/final_astrophysical_inference.csv for use by the researcher agent.\n\n**Python Code:**\n\n```python\n# filename: codebase/robust_inference.py\nimport pandas as pd\nimport numpy as np\nimport pickle\nimport os\n\ndef get_summary_medians(dataframes, key_parameters, model_names):\n    \"\"\"\n    Calculates the median for key parameters from all models.\n\n    Args:\n        dataframes (dict): A dictionary of pandas DataFrames, keyed by model name.\n        key_parameters (list): A list of column names for which to compute medians.\n        model_names (list): An ordered list of model names.\n\n    Returns:\n        pd.DataFrame: A DataFrame containing the median values, indexed by\n                      parameter and model name.\n    \"\"\"\n    summary_data = []\n    for param in key_parameters:\n        for model in model_names:\n            df = dataframes[model]\n            median = df[param].median()\n            summary_data.append({'Parameter': param, 'Model': model, 'Median': median})\n    summary_df = pd.DataFrame(summary_data).set_index(['Parameter', 'Model'])\n    return summary_df\n\n\ndef identify_robust_parameters(jsd_matrices, summary_medians_df, key_parameters, jsd_threshold, median_range_threshold):\n    \"\"\"\n    Identifies robustly constrained parameters based on JSD and median range criteria.\n\n    Args:\n        jsd_matrices (dict): Dictionary of JSD matrices for each parameter.\n        summary_medians_df (pd.DataFrame): DataFrame with median values for each parameter and model.\n        key_parameters (list): List of parameters to evaluate.\n        jsd_threshold (float): The maximum allowed JSD for a parameter to be considered robust.\n        median_range_threshold (float): The maximum allowed relative range of medians.\n\n    Returns:\n        dict: A dictionary mapping each parameter to its robustness status ('Robust' or 'Model-Dependent').\n    \"\"\"\n    print(\"--- Step 5.1: Identifying Robustly Constrained Parameters ---\")\n    print(\"Robustness Criteria:\")\n    print(\"1. Max JSD between any model pair < \" + str(jsd_threshold))\n    print(\"2. Relative Median Range ((max-min)/mean) < \" + str(median_range_threshold * 100) + \"%\")\n    print(\"-\" * 60)\n\n    robustness_status = {}\n    for param in key_parameters:\n        # Criterion 1: JSD\n        max_jsd = jsd_matrices[param].max().max()\n        jsd_passed = max_jsd < jsd_threshold\n\n        # Criterion 2: Median Range\n        medians = summary_medians_df.loc[param]['Median']\n        mean_median = medians.mean()\n        relative_range = (medians.max() - medians.min()) / abs(mean_median) if mean_median != 0 else 0\n        median_range_passed = relative_range < median_range_threshold\n\n        # Final Decision\n        is_robust = jsd_passed and median_range_passed\n        status = 'Robust' if is_robust else 'Model-Dependent'\n        robustness_status[param] = status\n\n        print(\"Parameter: \" + param)\n        print(\"  - Max JSD: \" + str(round(max_jsd, 4)) + \" (Pass: \" + str(jsd_passed) + \")\")\n        print(\"  - Relative Median Range: \" + str(round(relative_range * 100, 2)) + \"% (Pass: \" + str(median_range_passed) + \")\")\n        print(\"  => Status: \" + status + \"\\n\")\n\n    return robustness_status\n\n\ndef get_physical_discrepancy_note(param, subspace_jsd_matrices):\n    \"\"\"\n    Generates a note on the physical origin of model discrepancy for a parameter.\n\n    Args:\n        param (str): The parameter name.\n        subspace_jsd_matrices (dict): Dictionary of JSD matrices for each physical subspace.\n\n    Returns:\n        str: A brief note on the likely physical origin of the discrepancy.\n    \"\"\"\n    param_to_subspace = {\n        'mass_1_source': 'Mass & Distance',\n        'mass_2_source': 'Mass & Distance',\n        'redshift': 'Mass & Distance',\n        'chi_eff': 'Effective Spin',\n        'chi_p': 'Effective Spin',\n        'final_mass_source': 'Remnant Properties',\n        'final_spin': 'Remnant Properties'\n    }\n    subspace = param_to_subspace.get(param)\n    if not subspace:\n        return \"Source of discrepancy not mapped to a specific subspace.\"\n\n    max_jsd = subspace_jsd_matrices[subspace].values.max()\n    note = \"Discrepancy linked to '\" + subspace + \"' subspace (max subspace JSD: \" + str(round(max_jsd, 3)) + \").\"\n    return note\n\n\ndef compile_final_results(robustness_status, dataframes, summary_medians_df, subspace_jsd_matrices, key_parameters, model_names):\n    \"\"\"\n    Compiles the final summary table of astrophysical inferences.\n\n    Args:\n        robustness_status (dict): Dictionary with the robustness status of each parameter.\n        dataframes (dict): Dictionary of all model DataFrames.\n        summary_medians_df (pd.DataFrame): DataFrame with median values.\n        subspace_jsd_matrices (dict): Dictionary of subspace JSD matrices.\n        key_parameters (list): List of parameters to include in the table.\n        model_names (list): List of all model names.\n\n    Returns:\n        pd.DataFrame: The final summary table.\n    \"\"\"\n    print(\"\\n--- Step 5.2 & 5.3: Deriving Consensus Constraints and Compiling Final Table ---\")\n    final_results = []\n\n    for param in key_parameters:\n        status = robustness_status[param]\n        result_row = {'Parameter': param, 'Status': status}\n\n        if status == 'Robust':\n            # Aggregate samples from all models\n            all_samples = np.concatenate([dataframes[model][param].values for model in model_names])\n            # Compute consensus median and 90% credible interval\n            quantiles = np.quantile(all_samples, [0.05, 0.5, 0.95])\n            result_row['Consensus Median'] = quantiles[1]\n            result_row['Consensus 90% CI'] = str(round(quantiles[0], 3)) + \" - \" + str(round(quantiles[2], 3))\n            result_row['Physical Discrepancy Source'] = 'N/A'\n        else: # Model-Dependent\n            medians = summary_medians_df.loc[param]['Median']\n            result_row['Consensus Median'] = str(round(medians.min(), 3)) + \" - \" + str(round(medians.max(), 3)) + \" (Range)\"\n            result_row['Consensus 90% CI'] = 'See individual models'\n            result_row['Physical Discrepancy Source'] = get_physical_discrepancy_note(param, subspace_jsd_matrices)\n\n        final_results.append(result_row)\n\n    final_df = pd.DataFrame(final_results)\n    return final_df\n\n\ndef main():\n    \"\"\"\n    Main function to execute Step 5 of the analysis.\n    \"\"\"\n    print(\"--- Starting Step 5: Robust Astrophysical Inference and Consensus Constraints ---\")\n\n    # --- Configuration ---\n    data_dir = 'data'\n    model_names = [\n        'NRSur7dq4', 'IMRPhenomXO4a', 'SEOBNRv5PHM',\n        'IMRPhenomXPHM', 'IMRPhenomTPHM'\n    ]\n    key_parameters = [\n        'mass_1_source', 'mass_2_source', 'chi_eff', 'chi_p',\n        'redshift', 'final_mass_source', 'final_spin'\n    ]\n    # Robustness criteria thresholds\n    JSD_THRESHOLD = 0.05\n    MEDIAN_RANGE_THRESHOLD = 0.10  # 10%\n\n    # --- Load Data from Previous Steps ---\n    print(\"\\nLoading data from previous steps...\")\n    try:\n        with open(os.path.join(data_dir, 'gw_data_all_models.pkl'), 'rb') as f:\n            dataframes = pickle.load(f)\n        with open(os.path.join(data_dir, 'jsd_matrices.pkl'), 'rb') as f:\n            jsd_matrices = pickle.load(f)\n        with open(os.path.join(data_dir, 'subspace_jsd_matrices.pkl'), 'rb') as f:\n            subspace_jsd_matrices = pickle.load(f)\n        print(\"Data loaded successfully.\")\n    except FileNotFoundError as e:\n        print(\"Error: Could not find required data file: \" + str(e))\n        print(\"Please ensure Steps 1, 2, and 4 have been run successfully.\")\n        return\n    except Exception as e:\n        print(\"An error occurred while loading data: \" + str(e))\n        return\n\n    # --- Run Analysis ---\n    summary_medians_df = get_summary_medians(dataframes, key_parameters, model_names)\n\n    robustness_status = identify_robust_parameters(jsd_matrices, summary_medians_df, key_parameters, JSD_THRESHOLD, MEDIAN_RANGE_THRESHOLD)\n\n    final_table = compile_final_results(robustness_status, dataframes, summary_medians_df, subspace_jsd_matrices, key_parameters, model_names)\n\n    # --- Display and Save Final Results ---\n    print(\"\\n--- Step 5.4: Final Results Compilation ---\")\n    pd.set_option('display.max_rows', None)\n    pd.set_option('display.max_columns', None)\n    pd.set_option('display.width', 200)\n    pd.set_option('display.max_colwidth', 100)\n    print(\"\\nFinal Astrophysical Inference Summary Table:\")\n    print(final_table)\n\n    # Save the table to a CSV file\n    output_path = os.path.join(data_dir, 'final_astrophysical_inference.csv')\n    try:\n        final_table.to_csv(output_path, index=False)\n        print(\"\\nSuccessfully saved final inference table to: \" + output_path)\n    except Exception as e:\n        print(\"\\nError saving final table to CSV: \" + str(e))\n\n    print(\"\\n--- Step 5 execution completed successfully. ---\")\n\n\nif __name__ == '__main__':\n    main()\n```",
    "timestamp": "2025-08-10 18:10:38",
    "cost": {
      "Agent": [
        "engineer_response_formatter"
      ],
      "Model": [
        "o3-mini-2025-01-31"
      ],
      "Cost": [
        0.0184371
      ],
      "Prompt Tokens": [
        3305
      ],
      "Completion Tokens": [
        3364
      ],
      "Total Tokens": [
        6669
      ]
    }
  },
  {
    "name": "executor",
    "content": "Execution results:\n\nExecution output: \n--- Starting Step 5: Robust Astrophysical Inference and Consensus Constraints ---\n\nLoading data from previous steps...\nData loaded successfully.\n--- Step 5.1: Identifying Robustly Constrained Parameters ---\nRobustness Criteria:\n1. Max JSD between any model pair < 0.05\n2. Relative Median Range ((max-min)/mean) < 10.0%\n------------------------------------------------------------\nParameter: mass_1_source\n  - Max JSD: 0.4535 (Pass: False)\n  - Relative Median Range: 15.04% (Pass: False)\n  => Status: Model-Dependent\n\nParameter: mass_2_source\n  - Max JSD: 0.6859 (Pass: False)\n  - Relative Median Range: 58.34% (Pass: False)\n  => Status: Model-Dependent\n\nParameter: chi_eff\n  - Max JSD: 0.6253 (Pass: False)\n  - Relative Median Range: 137.99% (Pass: False)\n  => Status: Model-Dependent\n\nParameter: chi_p\n  - Max JSD: 0.1376 (Pass: False)\n  - Relative Median Range: 11.3% (Pass: False)\n  => Status: Model-Dependent\n\nParameter: redshift\n  - Max JSD: 0.6815 (Pass: False)\n  - Relative Median Range: 108.23% (Pass: False)\n  => Status: Model-Dependent\n\nParameter: final_mass_source\n  - Max JSD: 0.5085 (Pass: False)\n  - Relative Median Range: 19.46% (Pass: False)\n  => Status: Model-Dependent\n\nParameter: final_spin\n  - Max JSD: 0.6769 (Pass: False)\n  - Relative Median Range: 21.78% (Pass: False)\n  => Status: Model-Dependent\n\n\n--- Step 5.2 & 5.3: Deriving Consensus Constraints and Compiling Final Table ---\n\n--- Step 5.4: Final Results Compilation ---\n\nFinal Astrophysical Inference Summary Table:\n           Parameter           Status           Consensus Median       Consensus 90% CI                                                     Physical Discrepancy Source\n0      mass_1_source  Model-Dependent  129.142 - 149.869 (Range)  See individual models     Discrepancy linked to 'Mass & Distance' subspace (max subspace JSD: 0.693).\n1      mass_2_source  Model-Dependent     55.076 - 111.1 (Range)  See individual models     Discrepancy linked to 'Mass & Distance' subspace (max subspace JSD: 0.693).\n2            chi_eff  Model-Dependent      0.039 - 0.439 (Range)  See individual models      Discrepancy linked to 'Effective Spin' subspace (max subspace JSD: 0.636).\n3              chi_p  Model-Dependent       0.733 - 0.82 (Range)  See individual models      Discrepancy linked to 'Effective Spin' subspace (max subspace JSD: 0.636).\n4           redshift  Model-Dependent      0.171 - 0.583 (Range)  See individual models     Discrepancy linked to 'Mass & Distance' subspace (max subspace JSD: 0.693).\n5  final_mass_source  Model-Dependent  189.718 - 232.733 (Range)  See individual models  Discrepancy linked to 'Remnant Properties' subspace (max subspace JSD: 0.688).\n6         final_spin  Model-Dependent      0.708 - 0.888 (Range)  See individual models  Discrepancy linked to 'Remnant Properties' subspace (max subspace JSD: 0.688).\n\nSuccessfully saved final inference table to: data/final_astrophysical_inference.csv\n\n--- Step 5 execution completed successfully. ---\n",
    "timestamp": "2025-08-10 18:10:46",
    "cost": {
      "Agent": [],
      "Model": [],
      "Cost": [],
      "Prompt Tokens": [],
      "Completion Tokens": [],
      "Total Tokens": []
    }
  }
]