# EASD Integration in Transformers

This repository modifies parts of the `transformers/generation` module to support the **Entropy-Aware Speculative Decoding (EASD)** algorithm.  
Specifically, we have adjusted:

- `candidate_generator.py`
- `utils.py`

## Usage

You can now control the entropy threshold for EASD directly via the `chosen_entropy` argument in `model.generate()`:

```python
outputs = model.generate(
    tokenized_chat,
    max_new_tokens=4096,
    output_scores=True,
    return_dict_in_generate=True,
    output_hidden_states=True,
    assistant_model=model_2,
    do_sample=False,
    chosen_entropy=chosen_entropy,
)


The top-n overlap value is now set to 0.7 by default. In the near future, a more detailed and better-organized version of the code will be released to ensure clarity and ease of use. We will also make available all experimental outputs, logs, and records to facilitate independent verification and replication of our findings.

