Abstract: Highlights•A speech-dedicated automatic recognition tool such as Kaldi can be used for human-beatbox sound recognition.•A large vocabulary of human-beatbox sounds can be recognized with low error rate.•Recording conditions (type of microphone and settings) do not impact recognition performances.•PLP and Fbank features perform worse than than MFCC for beatbox sound recognition system.
Loading