FLEURS: FEW-Shot Learning Evaluation of Universal Representations of SpeechDownload PDFOpen Website

Published: 01 Jan 2022, Last Modified: 27 Jun 2023SLT 2022Readers: Everyone
Abstract: We introduce FLEURS, the Few-shot Learning Evaluation of Universal Representations of Speech benchmark. FLEURS is an n-way parallel speech dataset in 102 languages built on top of the machine translation FLoRes-101 benchmark, with approximately 12 hours of speech supervision per language. FLEURS can be used for a variety of speech tasks, including Automatic Speech Recognition (ASR), Speech Language Identification (Speech LangID), Speech-Text Retrieval. In this paper, we provide baselines for the tasks based on multilingual pre-trained models like speech-only w2v-BERT [1] and speech-text multimodal mSLAM [2]. The goal of FLEURS is to enable speech technology in more languages and catalyze research in low-resource speech understanding. http://www.w3.org/1998/Math/MathML" xmlns:xlink="1" target="_blank" rel="nofollow">http://www.w3.org/1999/xlink">1 .
0 Replies

Loading