PhoWhisper: Automatic Speech Recognition for Vietnamese

Published: 19 Mar 2024, Last Modified: 30 Mar 2024
Venue: Tiny Papers @ ICLR 2024 Archive
License: CC BY 4.0
Keywords: Automatic Speech Recognition, Vietnamese, Speech to Text
TL;DR: We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition.
Abstract: We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition. PhoWhisper's robustness comes from fine-tuning the Whisper model on an 844-hour dataset that covers diverse Vietnamese accents. Our experiments demonstrate state-of-the-art performance for PhoWhisper on benchmark Vietnamese ASR datasets. We have open-sourced PhoWhisper at: https://github.com/VinAIResearch/PhoWhisper
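The abstract does not say which metric underlies the benchmark comparison. ASR systems are commonly scored by word error rate (WER), so the following is a minimal sketch assuming that metric. The function name `word_error_rate` and the example sentences are illustrative and are not taken from the paper or the PhoWhisper repository.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: whitespace tokenization with no text normalization
    (casing, punctuation); real evaluations usually normalize first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One of four reference words is dropped in the hypothesis.
    print(word_error_rate("xin chào các bạn", "xin chào bạn"))
```

Lower is better: a WER of 0 means the transcript matches the reference word for word, and values can exceed 1 when the hypothesis contains many extra words.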
Submission Number: 239