# The supplementary material of submission 14094: "On the Exploitability of Instruction Tuning".

## Appendix

### Table of Content

* A.1: More examples of the responses of poisoned models
* A.2: More experiments and analysis
    * Content injection with a fictional brand. 
    * content injection with an example URL. 
    * Text quality analysis on model outputs using MAUVE score. 
    * Randomness analysis: fine-tune model with different random splits of poisoned data. 
* A.3: Implementation details
    * Data format and instruction templates
    * Model-based evaluation protocol for the over-refusal attack
    * Information about hardware and compute
    * Reproducibility: including an anonymous link to our code
* License information of the assets used in this work


## Code release
We provide an anonymous link to our code in section A.3 of the Appendix. 