# Weak-to-Strong Deception
This material contains the code and data for the submission *Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization*.

The detailed instructions can be found under the folder ```weak-to-strong```.