
# DinoSR code for NeurIPs 2023 submission
  

This repo should be plugged into [`fairseq`](https://github.com/facebookresearch/fairseq) under `examples/` for execution. Our work is based on the commit `47e279842ac8776e3964b0e45c320ad1d2ea6096`.
We followed [`wav2vec2`](https://github.com/facebookresearch/fairseq/tree/main/examples/wav2vec) for data preparation and model finetuning.
For the hyper-parameters availabe, see [`config file`](config/audio/pretraining) where default settings used in the paper is listed. To train the model, install `fairseq` with the editable option and run

```
data_root=/path/to/prepared/librispeech/

python $fairseq_root/fairseq_cli/hydra_train.py -m \
--config-dir $fairseq_root/examples/dinosr/config/ \
--config-name base \
task.data=$data_root \
common.user_dir=$fairseq_root/examples/dinosr &
```