ANV-SOT-Sample-1: Sesotho Sample Dataset - Next Voices-ZA (South Africa)

Vukosi Marivate, Kayode Olaleye, Sitwala Mundia, Nia Zion van Wyk, Andinda Bakainaga, Graham Morrissey, Dale Dunbar, Francois Smit

Published: 09 Dec 2024, Last Modified: 10 Oct 2025, DataCite, CC BY-SA 4.0

Abstract:

## Sesotho Sample Dataset - Next Voices-ZA (South Africa) - Multilingual Speech Dataset

This dataset includes **scripted and unscripted speech** across various domains such as agriculture, health, finance, sports, transport, culture, society and general topics. It is primarily designed for automatic speech recognition (ASR).

## Folder Structure

The dataset is organised hierarchically as follows:

    ANV-ZA-SOT-1h/
    ├── sot/                              # Folder for Sesotho
    │   ├── recorder_uuid/                # Contains all audio files
    │   │   ├── recording-1731053452.wav
    │   │   ├── ...
    │   ├── transcripts.csv               # Contains transcripts of all audio recordings
    │   ├── meta.csv                      # Contains additional metadata
    ├── README.md                         # Description of the dataset

## Data Details

### Audio
- Format: **16-bit PCM WAV**
- Sample rate: **48 kHz**

### Transcriptions
- Provided in `transcripts.csv` with fields:
  - `file_name`: Name of the audio file.
  - `transcript`: Text transcription of the audio.
  - `duration`: Duration of the recording in seconds.
  - `type`: Scripted or unscripted.

### Metadata
- Provided in `meta.csv` with fields:
  - `recorder_uuid`: Unique speaker identifier.
  - `age_range`
  - `gender`
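
As a rough illustration of how the transcript fields and audio format described above might be used, the following Python sketch sums recording time per `type` and checks that a WAV file matches the stated 16-bit, 48 kHz format. The function names `hours_by_type` and `check_wav` are hypothetical and not part of the dataset. The sketch assumes the transcripts file is a comma-delimited UTF-8 CSV with a header row, that `duration` parses as a number of seconds, and that `type` holds "scripted" or "unscripted" in some capitalisation. The path in the usage section follows the folder tree and is likewise an assumption.

```python
import csv
import wave
from collections import defaultdict
from pathlib import Path


def hours_by_type(transcripts_csv):
    """Sum the `duration` column (seconds) per `type` value; return hours.

    Assumes a header row containing `duration` and `type` columns.
    """
    totals = defaultdict(float)
    with open(transcripts_csv, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            # Normalise capitalisation so "Scripted" and "scripted" are merged.
            totals[row["type"].strip().lower()] += float(row["duration"])
    return {kind: seconds / 3600.0 for kind, seconds in totals.items()}


def check_wav(path, expected_rate=48000, expected_sample_width=2):
    """Return True if the file is PCM WAV with the expected rate and width.

    A sample width of 2 bytes corresponds to 16-bit audio.
    """
    with wave.open(str(path), "rb") as w:
        return (
            w.getframerate() == expected_rate
            and w.getsampwidth() == expected_sample_width
        )


if __name__ == "__main__":
    # Assumed location, taken from the folder tree; adjust to the local copy.
    root = Path("ANV-ZA-SOT-1h") / "sot"
    if root.exists():
        print(hours_by_type(root / "transcripts.csv"))
        for wav_path in sorted(root.glob("*/*.wav"))[:5]:
            print(wav_path.name, check_wav(wav_path))
```

Only the standard library is used, so the check runs without extra dependencies; a fuller pipeline would likely resample the 48 kHz audio to whatever rate the target ASR model expects.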

External IDs: doi:10.5281/zenodo.14336303