Keywords: spoken language understanding, speech recognition, open source data
TL;DR: New English speech dataset for timers, alarms, unit conversion, and math.
Abstract: This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the design and creation of the dataset, and experiments with a number of ASR-based and end-to-end baseline models, the code for which has been made available as part of the SpeechBrain toolkit.
Supplementary Material: zip
URL: https://zenodo.org/record/4623772 (the code at https://github.com/speechbrain/speechbrain/tree/develop/recipes/timers-and-such will download and format the dataset)