**PyTorch Implementation of the Paper:**

> [Beyond Audio-Visual Alignment: Unmasking Talking Head Deepfakes via Red Hue Discrepancies in HSV Color Space]
> *ICLR, 2026, Anonymous submission*

## Data

To set up the data, follow these steps:

**Download the datasets:**
   - **RFAV Dataset:** The download link will be released after the review process is complete.
   - **TalkingHeadBench Dataset:** Follow instructions from [TalkingHeadBench Hugging Face repo](https://hf.com/datasets/luchaoqi/TalkingHeadBench)
   - **FakeAVCeleb Dataset:** Follow instructions from [FakeAVCeleb GitHub repo](https://github.com/DASH-Lab/FakeAVCeleb)
   - **AVLips Dataset:** Follow instructions from [LipFD GitHub repo](https://github.com/AaronComo/LipFD)

This repository also integrates code from the following public repositories:
- [FACTOR](https://github.com/talreiss/FACTOR)
- [AV-Hubert](https://github.com/facebookresearch/av_hubert)

**Extract features:**

Run `feature_extraction.py`, setting `--split` and `--metadata` to match the split you want to process. For example:

```bash
python feature_extraction.py \
    --dataset AVLips \
    --split train \
    --metadata /avlips_metadata/test_metadata.csv \
    --ckpt_path self_large_vox_433h.pt \
    --data_path /path/to/preprocessed/data \
    --save_path /path/to/save/features
```
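If you need to extract features for more than one split, the command above can be assembled programmatically. The following is a minimal sketch, not part of the repository: the helper names `build_extraction_cmd` and `extract_splits` are hypothetical, and it assumes only the command-line flags shown in the example above. It defaults to a dry run that prints each command without executing it.

```python
import shlex
import subprocess


def build_extraction_cmd(dataset, split, metadata, ckpt_path, data_path, save_path):
    """Return the argument list for one feature_extraction.py run.

    Hypothetical helper: it only mirrors the flags shown in this README.
    """
    return [
        "python", "feature_extraction.py",
        "--dataset", dataset,
        "--split", split,
        "--metadata", metadata,
        "--ckpt_path", ckpt_path,
        "--data_path", data_path,
        "--save_path", save_path,
    ]


def extract_splits(split_to_metadata, dry_run=True, **common):
    """Build one command per (split, metadata file) pair.

    With dry_run=True the commands are only printed; otherwise each one
    is executed and a non-zero exit status raises CalledProcessError.
    """
    cmds = []
    for split, metadata in split_to_metadata.items():
        cmd = build_extraction_cmd(split=split, metadata=metadata, **common)
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)
        cmds.append(cmd)
    return cmds


if __name__ == "__main__":
    # Paths are the placeholders used in this README; replace them with your own.
    extract_splits(
        {"test": "/avlips_metadata/test_metadata.csv"},
        dataset="AVLips",
        ckpt_path="self_large_vox_433h.pt",
        data_path="/path/to/preprocessed/data",
        save_path="/path/to/save/features",
    )
```

Passing the metadata file explicitly for each split keeps the two arguments paired, so a split name cannot silently be combined with another split's metadata.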

## Evaluation

To evaluate a model, adapt the following example:

```bash
python eval.py \
    --checkpoint_path xxx/RHTHFD.pt \
    --features_path /path/to/saved/features \
    --metadata /avlips_metadata/test_metadata.csv \
    --dataset AVLips
```
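Before launching an evaluation, it can help to confirm that the inputs exist. The sketch below is an illustration rather than part of the repository: `check_eval_inputs` is a hypothetical helper, and it assumes that `--features_path` points to a directory while `--checkpoint_path` and `--metadata` point to regular files.

```python
from pathlib import Path


def check_eval_inputs(checkpoint_path, features_path, metadata):
    """Return a list of problems found; an empty list means all inputs exist.

    Assumption: features_path is a directory, the other two are files.
    """
    problems = []
    if not Path(checkpoint_path).is_file():
        problems.append(f"checkpoint not found: {checkpoint_path}")
    if not Path(features_path).is_dir():
        problems.append(f"features directory not found: {features_path}")
    if not Path(metadata).is_file():
        problems.append(f"metadata file not found: {metadata}")
    return problems


if __name__ == "__main__":
    # Placeholder paths from this README; replace them with your own.
    for problem in check_eval_inputs(
        "xxx/RHTHFD.pt",
        "/path/to/saved/features",
        "/avlips_metadata/test_metadata.csv",
    ):
        print(problem)
```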