All the testing code is in the synthetic_experiments notebook. 

All files that start with a c are experiments using GPT-4. All files that start with a d are experiments using Claude 3.5 Sonnet.

abbreviation guide

Experiment Types:

Anti : Anti-Causal
Correct: Causal
Forward: Forward Topological Orientation
Reverse: Reverse Topological Orientation


Prompting Types:
Context: In-Context Learning prompt results
Graph: Graph+Narrative Prompting Results
Chain: Naive CoT

If a file is not indexed by one of the above prompting types, it is a standard prompt (eg does A cause B using the narrative)

