# Annotation Statistics Module

This module computes inter-annotator agreement statistics between human annotators and LLMs over  text-driven KG updating (TKGU) operations. It loads and merges annotation files, extracts structured assessment data, and produces agreement metrics (Cohen's kappa, Fleiss' kappa, Krippendorff's alpha) as well as LaTeX-ready tables for publication.

## Features
- Load and merge multiple JSONL annotation files.
- Align human and LLM assessments across operations.
- Compute:
  - Human–Human Cohen's kappa
  - Human–LLM Cohen's kappa
  - Fleiss' kappa (multi-rater)
  - Krippendorff's alpha (nominal)
- Produce LaTeX-formatted tables for papers.
- Safe merging of duplicated annotations.

## Requirements
We have tried running this code in conda environment containing python 3.12, which can be created as follows: 
```
conda create -n anno-stats python=3.12 -y
conda activate anno-stats
```

Additionally, python packages in ```requirements.txt``` file have to be installed. These requirements can be installed with the following command: 
```
pip install -r requirements.txt
```

## Usage
```
python run_me.py --config_file anno_stats_config.json
```

## Config File Structure
The config file (see ```anno_stats_config.json``` for example) must contain:
- `input_annotation_paths`: list of directories containing `.jsonl` annotation files.
- `human_annotators`: list of annotator + alias to print in output tables. 
- `tkgu_operations_to_check`: list of TKGU operations.
- `llms_to_compare`: 
  - `triple_deprecation`: list of LLM names used to assess d-triples TKGU operation. 
  - `triple_assertion`: list of LLM names used to assess x-triples, e-triples, ee-triples, and ee-kg-triples TKGU operations. 

## Output
- LaTeX table for direct inclusion in the paper.
- Human-friendly annotation agreement summary table. 

## License
Internal research module; no external license specified.

