# Old Habits Die Hard: How Conversational History Geometrically Traps LLMs

## Repository Structure

The codebase consists of two main components:

1.  **`History_data_creation.py`**:  Create the conversational history datasets from raw QA.
2.  **`analysis.py`**: An analysis that create the results from the paper.

## Data generation
To create conversational history datasets, run the following command:
```bash
python History_data_creation.py \
    --model_name "meta-llama/Llama-3.1-8B-Instruct" \
    --dataset_name "triviaqa" \
    --num_conv 100 \
    --conv_length 20 \
    --ordered
```
Parameters:
- `--model_name`: The name of the LLM model to use.
- `--dataset_name`: The name of the dataset to create conversational history from. (Options: triviaqa, natural_questions, sorry, do_not_answer, sycophancy, sycophancy_negative)
- `--num_conv`: Number of conversations to generate.
- `--conv_length`: Length of each conversation.
- `--ordered`: Whether to maintain the order of questions in the conversation.
- `--temp`: Temperature setting for the model's response generation. In the paper we used 0.0- greedy decoding.
- `--two_topics`: Whether to create conversations with two distinct topics.
- `--two_topics_4_1`: Whether to create conversations with a 4:1 ratio of questions from two topics.

The results will be saved in the `results/` directory.

To run Sycophancy please first download the data titled answer.jsonl from https://github.com/meg-tong/sycophancy-eval.

## Analysis
To analyze the results and generate plots as shown in the paper, run:
```bash
python analysis.py 
```
The plots will be saved in the `plots/` directory.



