Cem Mil Podcasts: A Spoken Portuguese Document Corpus

Published: 01 Jan 2022, Last Modified: 08 May 2023, CoRR 2022, Readers: Everyone
Abstract: This document describes the Portuguese-language podcast dataset released by Spotify for academic research purposes. We give an overview of how the data was sampled, some basic statistics on the collection, and brief information on its distribution over Brazilian and Portuguese dialects.