# LD-DPO
We implement the LD-DPO algorithm on top of [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory/blob/main/examples/README.md), a code repository for LLMs that is widely used in the community and supports almost the entire model training process.
## Quickstart
LD-DPO is controlled by one additional hyperparameter, `lc_alpha`:
```python
lc_alpha: float = field(
    default=1.0,
    metadata={"help": "The length-control hyperparameter in LD-DPO, with the default (DPO) set to 1.0."},
)
```
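As a rough illustration of what `lc_alpha` controls, the sketch below assumes the commonly described LD-DPO formulation: token log-probabilities up to the shared ("public") length of the chosen and rejected responses are kept as-is, and the log-probabilities of the remaining tokens in the longer response are scaled by `lc_alpha`. The function name `ld_dpo_sequence_logps` and its arguments are hypothetical and are not part of the LLaMA-Factory API; the actual implementation in this repository may differ.

```python
from typing import List, Tuple


def ld_dpo_sequence_logps(
    chosen_token_logps: List[float],
    rejected_token_logps: List[float],
    lc_alpha: float = 1.0,
) -> Tuple[float, float]:
    """Illustrative length-desensitized sequence log-probabilities.

    Assumption: each list holds per-token log-probabilities of the response
    tokens only (prompt tokens already masked out).
    """
    # Public length: the number of token positions both responses share.
    public_len = min(len(chosen_token_logps), len(rejected_token_logps))

    def reweight(token_logps: List[float]) -> float:
        shared = sum(token_logps[:public_len])  # kept at full weight
        excess = sum(token_logps[public_len:])  # tokens beyond the public length
        return shared + lc_alpha * excess

    return reweight(chosen_token_logps), reweight(rejected_token_logps)


if __name__ == "__main__":
    chosen = [-1.0, -2.0]
    rejected = [-1.0, -1.0, -4.0, -2.0]
    for alpha in (1.0, 0.5, 0.0):
        print(alpha, ld_dpo_sequence_logps(chosen, rejected, lc_alpha=alpha))
```

Under this assumption, `lc_alpha = 1.0` leaves both sums unchanged, which matches the help text's statement that the default corresponds to standard DPO, while smaller values reduce the contribution of the extra tokens in the longer response.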
We also provide a training template, which can be run with the following command:
```bash
llamafactory-cli train examples/train_full/llama3_full_dpo_ds3.yaml
```