# TreeTop: Topology-Aware Fine-Tuning for LLM Conversation Tree Understanding

This directory contains a single python script for reproducing the dataset
preparation for experiments on TreeTop, introduced in an in-progress, anonymous
ICLR submission. The links to the public dataset are below. The script has
pointers for where to load the downloaded data.

Zenodo pushshift:

 * Release / publication: https://aaai.org/ojs/index.php/ICWSM/article/download/7347/7201
 * Direct download: https://zenodo.org/records/3608135


Fakeddit:

 * Release / publication: https://arxiv.org/pdf/1911.03854
 * Direct downloads:
   * https://drive.google.com/corp/drive/folders/1jU7qgDqU1je9Y0PMKJ_f31yXRo5uWGFm
   * https://drive.google.com/corp/drive/folders/150sL4SNi5zFK8nmllv5prWbn0LyvLzvo


Controversy detection:

 * Release / publication: https://aclanthology.org/N19-1166/
 * Direct download: https://drive.google.com/file/d/1G4F9gNk1v_qWHHII93uYtQbQJHyzK2LR/view


CMV:

 * Release / publication: https://dl.acm.org/doi/pdf/10.1145/2872427.2883081
 * Direct download: https://chenhaot.com/data/cmv/cmv.tar.bz2


Pheme9:

 * Release / publication: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0150989
 * Direct download: https://figshare.com/articles/dataset/PHEME_dataset_for_Rumour_Detection_and_Veracity_Classification/6392078