# Project Overview

This project implements **Harbor**, a three-stage framework for transforming raw LLM outputs into well-calibrated probabilities, along with comparison methods from the BIRD framework.

## Methods

### Harbor Framework
Harbor consists of three main stages:

1. **Factor-Space Construction** (`iterative_factor_gen.py`): 
   - Iterative factor generation through reverse abduction
   - Hierarchical clustering to create a dense, two-level factor hierarchy

2. **Context-Aware Mapping** (`hierarchical_condition_mapping.py`):
   - Coarse-to-fine retrieval over the factor hierarchy
   - Self-consistent filtering and reflective refinement for factor selection

3. **Probabilistic Inference** (`casual_bayesian_train.py`):
   - Constructs Naïve Bayes and Causal Bayesian networks
   - Uses LLM to identify latent variables and initialize parameters
   - Combines outputs into calibrated probabilities

### BIRD Baseline Methods
- **scenario_analysis.py**: Analyzes scenarios to generate structured factors
- **scenario_inference.py**: Performs inference on scenarios
- **scenario_train.py**: Trains models based on scenario data

### FactCheck Dataset Adaptations
- **hierarchical_condition_mapping_fact_check.py**: Harbor mapping for FactCheck dataset
- **iterative_factor_gen_fact_check.py**: Harbor factor generation for FactCheck dataset

### Evaluation

After running the Harbor and Bird methods, the evaluation scripts for BIRD can be found in the `evaluation/` folder. For detailed instructions on how to perform the evaluation, please refer to the BIRD repository.

### Harbor Evaluation

After running the Harbor methods, follow these steps for evaluation and computation:
1. Run the evaluation script:
   - `python code/run/evaluate_nb_cbn.py`
2. Compute the results:
   - `python code/run/compute_nb_cbn.py`

## Datasets

The project utilizes various datasets, including the FactCheck dataset, to test and validate the implemented methods.

## Installation

To set up the project environment, follow these steps:
1. Clone the repository.
2. Install the required dependencies using `pip install -r requirements.txt`.

## Usage

To run the methods, use the following commands:

### Harbor Methods
1. **iterative_factor_gen.py**: Focuses on iterative factor generation using advanced clustering techniques.
   - Run: `python code/run/iterative_factor_gen.py`

2. **hierarchical_condition_mapping.py** and **casual_bayesian_train.py** can be run simultaneously:
   - Run: `python code/run/hierarchical_condition_mapping.py & python code/run/casual_bayesian_train.py`

### Bird Methods
For running the BIRD methods, please refer to the BIRD repository for detailed instructions and scripts.

- For Bird methods: `python code/run/<method_name>.py`
- For FactCheck methods: `python code/run/<method_name>.py`

