ANV-SOT-Sample-1: Sesotho Sample Dataset - Next Voices-ZA (South Africa)
Abstract: ## Sesotho Sample Dataset - Next Voices-ZA (South Africa) - Multilingual Speech DatasetThis dataset includes **scripted and unscripted speech** across various domains such as agriculture, health, finance, sports, transport, culture, society and general topics. It is primarily designed for automatic speech recognition (ASR).## Folder structure The dataset is organised hierarchically as follows: ## Folder StructureANV-ZA-SOT-1h/├── sot/ # Folder for Sesotho│ ├── recorder_uuid/ # Contains all audio files│ │ ├── recording-1731053452.wav│ │ ├── ...│ ├── transcripts.csv # Contains transcripts of all audio recordings│ ├── meta.csv # Contains additional metadata├── README.md # Description of the dataset ## Data Details ### Audio- Format: **16-bit PCM WAV**- Sample rate: **48kHz** ### Transcriptions- Provided in `transcript.csv` with fields: - `file_name`: Name of the audio file. - `transcript`: Text transcription of the audio. - `duration`: Duration of the recording in seconds. - `type`: Scripted or unscripted. ### Metadata- Provided in `meta.csv` with fields: - `recorder_uuid`: Unique speaker identifier. - `age_range`, - `gender`
Loading