This folder contains 3 files:
1. "supplementary_VoxInstruct.pdf" is the main appendix file of our submitted paper.
2. "video_VoxInstruct.mp4" is the accompanying video, created to better present the paper.
3. "speech_annotation_system.pdf" is the reference paper on the instruction-speech paired dataset construction pipeline. It is included to help interpret the main appendix, "supplementary_VoxInstruct.pdf".
