A local fingerprinting approach for audio copy detectionOpen Website

2014 (modified: 20 Jul 2021)Signal Process. 2014Readers: Everyone
Abstract: Highlights • A new robust 2D representation of audio signals called time-chroma image. • A novel scale invariant local feature extraction for the time chroma image. • The proposed features greatly outperform the alternatives such as SIFT for audio signals. • A novel pitch and tempo invariant song identification algorithm. Abstract This study proposes an audio copy detection system that is robust to various attacks. These include the severe pitch shift and tempo change attacks which existing systems fail to detect. First, we propose a novel two dimensional representation for audio signals called the time-chroma image. This image is based on a modification of the concept of chroma in the music literature and is shown to achieve better performance in song identification. Then, we propose a novel fingerprinting algorithm that extracts local fingerprints from the time-chroma image. The proposed local fingerprinting algorithm is invariant to time/frequency scale changes in audio signals. It also outperforms existing methods like SIFT to a great extent. Finally, we introduce a song identification algorithm that uses the proposed fingerprints. The resulting copy detection system is shown to significantly outperform existing methods. Besides being able to detect whether a song (or a part of it) has been copied, the proposed system can accurately estimate the amount of pitch shift and/or tempo change that were applied to a song.
0 Replies

Loading