# Public-Health-Agent

This repository implements an agentic pipeline for **scenario-conditioned epidemiological simulator generation**, verification, and evaluation. The system integrates graph-based model generation, code synthesis, verification/validation agents, and retrieval-augmented generation (RAG) for mechanistically grounded epidemic modeling.

---

## 1. Environment Setup

### Step 1: Create Conda Environment
Please create the Conda environment using the provided `env.yml` file:

bash
conda env create -f env.yml
conda activate <env_name>

Ensure the environment is successfully activated before proceeding.

### Step 2: OpenAI API Setup

The OpenAI API key is required to run the code generation and agentic components.
Set your API key as an environment variable:

export OPENAI_API_KEY="your_api_key_here"


## 2. Running the Pipeline

To execute the full pipeline, run:
python run.py

## 3. Configuration

All experiment-level configurations are controlled via:
config/config.yaml
From this file, you can modify:
- Training epochs
- Retry limits for agentic loops
- Case studies / environments to evaluate
- optimization hyperparameters


## 4. Code Structure Overview
Prompts

- Code generation prompt:
- - utils/prompt.py

- Graph generation & verification prompt:
- - utils/prompt_graph.py

### Retrieval-Augmented Generation (RAG)

The vector index for RAG is already built.
If you want to build an index for additional books or PDFs, use:

- Embedding_EpiBook.ipynb

### Data
All datasets used by the pipeline are stored under:
- /data/


### Agent Modules

- Error recovery agent
- Code verification agent
- Code validation agent
  
All are implemented in:  *** agents.py ***

Graph-level structural verification is implemented in: *** utils/prompt_graph.py ***



## 5. Case Studies

We evaluate the system on two epidemiological case studies.

# 5.1 MIDAS COVID-19 Scenario Modeling Project

*** How to Run ***

- In config/config.yaml, set:
- envs_to_evaluate: ['Covid-scenario']

Environment Definition

- The MIDAS environment implementation is located at: libs/covasim/publichealth.py
- The dataset can be obtained from the official MIDAS repository: https://github.com/midas-network/covid19-scenario-hub_evaluation/tree/main

# 5.2 Behavior Modeling (PNAS)

- In config/config.yaml, set:
- envs_to_evaluate: ['PNAS']
  
Environment Definition

- The behavior modeling environment is implemented at: libs/covasim/env_pnas.py
- The dataset is available from: https://github.com/ngozzi/covid-behavior-models/tree/main/data


### 6. Notes

The pipeline includes automatic graph → code → verification → correction loops.

Invalid graphs or simulator code are detected and corrected via agentic feedback.

The system is designed to be easily extensible to new epidemiological environments and datasets.
