# SocialEmergence
Repository used to unravell the emergence of social intelligence.

## Running the benchmarks
To run the false-belief benchmarks and the BLIMP benchmarks first create a virtual environment and install packakes using the requirements.txt file.

```pip install -r requirements.txt```

Then run the benchmark script:

```python src/main.py```

You can change the model used by using the MODEL_ID argument. For example to use python Olmo 1B SFT you run:

```python src/main.py --model_id=OLMo-2-0425-1B-SFT```


## Data

The EPITOME dataset can be found here: https://osf.io/agqwv/