Differentiation of Music Genre from an Audio File Using Neural Networks

Pushker Jain, Ayan Sar, Tanupriya Choudhury, Vishal Singh, Ketan Kotecha

Published: 01 Jan 2024, Last Modified: 07 Nov 2025, Source: Crossref, License: CC BY-SA 4.0

Abstract: This paper explores deep learning and machine learning techniques, specifically convolutional neural networks (CNN) and k-nearest neighbors (KNN), for differentiating music genres from audio files. It uses Mel-Frequency Cepstral Coefficients (MFCCs) as the primary input features for classification. The focus on MFCC features is motivated by the observation that today’s models rely mainly on computer vision techniques, classifying genres from spectrogram images. That approach is slow and computationally expensive, whereas MFCC features tend to require less time and fewer computational resources.
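As a rough illustration of the MFCC-plus-KNN idea described in the abstract, the sketch below computes MFCC-style features with NumPy only and classifies clips with a small nearest-neighbor vote. It is not the authors' implementation, and the CNN variant is not shown. All function names, the frame settings (512-sample frames, hop of 256, 26 mel filters, 13 coefficients), k = 3, and the synthetic sine-tone "genres" that stand in for real audio are assumptions made for this example.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    # Triangular filters whose centers are evenly spaced on the mel scale.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    hz_pts = mel_to_hz(mel_pts)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)
    fb = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        lo, mid, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

def mfcc(signal, sr, n_fft=512, hop=256, n_filters=26, n_coeffs=13):
    # 1. Slice the signal into overlapping, Hann-windowed frames.
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    # 2. Power spectrum per frame, pooled into mel bands, then log-compressed.
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    log_mel = np.log(power @ mel_filterbank(n_filters, n_fft, sr).T + 1e-10)
    # 3. Unnormalized DCT-II over the mel bands; keep the first n_coeffs.
    n = np.arange(n_filters)
    k = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi / n_filters * (n[None, :] + 0.5) * k)
    return log_mel @ basis.T  # shape: (n_frames, n_coeffs)

def clip_features(signal, sr):
    # One fixed-length vector per clip: the mean of each coefficient over time.
    return mfcc(signal, sr).mean(axis=0)

def knn_predict(train_x, train_y, query, k=3):
    # Majority vote among the k nearest training vectors (Euclidean distance).
    dists = np.linalg.norm(train_x - query, axis=1)
    nearest = train_y[np.argsort(dists)[:k]]
    labels, counts = np.unique(nearest, return_counts=True)
    return labels[np.argmax(counts)]

def make_clip(freq, sr, rng, seconds=0.5):
    # Hypothetical stand-in for an audio file: a sine tone plus light noise.
    t = np.arange(int(sr * seconds)) / sr
    return np.sin(2 * np.pi * freq * t) + 0.01 * rng.standard_normal(t.size)

rng = np.random.default_rng(0)
sr = 16000
# Toy "genres": label 0 = low-pitched clips, label 1 = high-pitched clips.
train_spec = [(220, 0), (260, 0), (300, 0), (2200, 1), (2600, 1), (3000, 1)]
train_x = np.array([clip_features(make_clip(f, sr, rng), sr) for f, _ in train_spec])
train_y = np.array([label for _, label in train_spec])

pred_low = knn_predict(train_x, train_y, clip_features(make_clip(240, sr, rng), sr))
pred_high = knn_predict(train_x, train_y, clip_features(make_clip(2800, sr, rng), sr))
print(pred_low, pred_high)
```

Each clip is reduced to a 13-value vector instead of a full spectrogram image, which is the kind of saving in input size that the abstract attributes to MFCC features. In practice, an audio library would typically replace the hand-written `mfcc` function, and real genre-labeled recordings would replace the synthetic tones.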

External IDs: doi:10.1007/978-3-031-70789-6_40