# Dependencies
- PyTorch==1.13.1
- Fairseq==0.12.2
- Other: Please follow the Fairseq installation tutorial to install the other dependencies

# Train
- After installing the Fairseq toolkit, `transformer_decoder.py` file should be placed in the path `fairseq/models/transformer/transformer_decoder.py`.
- `s2t_transformer.py` file should be placed in the path `fairseq/models/speech_to_text/s2t_transformer.py`. 
- `sequence_generator.py` file should be placed in the path `fairseq/sequence_generator.py`. 
- Then, please follow the examples provided by Fairseq (e.g. `examples/speech_to_text/docs/mustc_example.md`) to process the data. When training the model, Passing `--arch s2t_MTLA_m` to `fairseq-train` starts training our proposed MTLA-based decoder-only model.


# Inference and Evaluation
- Inference and evaluation follow the Fairseq examples (e.g. `examples/speech_to_text/docs/mustc_example.md`).