Journal: EURASIP Journal on Audio, Speech, and Music Processing
Print ISSN: 1687-4714
Electronic ISSN: 1687-4722
Publication year: 2017
Volume: 2017
Issue: 1
Pages: 1-17
DOI: 10.1186/s13636-017-0114-4
Publisher: Hindawi Publishing Corporation
Abstract: Audio fingerprinting is an active research field, with music identification as its typical application. Robust audio fingerprinting performs content-based audio identification even when the audio signal has been subjected to various types of distortion. Distortions such as pitch and speed changes affect the time-frequency correlation of the signal. This paper reports experiments using the computer vision technique ORB (Oriented FAST and Rotated BRIEF) for robust audio identification. ORB is investigated for its robustness against distortions, including speed and pitch changes. The ORB prototype compares the features of the query spectrogram image against a database of spectrogram images of the songs. For the initial experiment, a Brute-Force matcher is used to compare the ORB descriptors. Results show that the ORB prototype is robust to real-world distortions and performs quickly and reliably under speed and pitch changes, which justifies the research.
Keywords: Audio fingerprinting; Music identification; Oriented FAST and Rotated BRIEF; Spectrogram
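
Note: The abstract describes comparing ORB descriptors of a query spectrogram against a database using a Brute-Force matcher. The sketch below illustrates that matching step only, and it is not the authors' implementation. It assumes ORB-style binary descriptors have already been extracted and are stored as equal-length `bytes` objects (ORB descriptors are commonly 32 bytes). The function names, the cross-check step, and the `max_distance` threshold of 64 bits are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Optional, Sequence, Tuple


def hamming(a: bytes, b: bytes) -> int:
    """Count the differing bits between two equal-length binary descriptors."""
    if len(a) != len(b):
        raise ValueError("descriptors must have the same length")
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


def nearest(desc: bytes, candidates: Sequence[bytes]) -> Optional[int]:
    """Exhaustively search candidates; return the index of the closest one."""
    if not candidates:
        return None
    return min(range(len(candidates)), key=lambda j: hamming(desc, candidates[j]))


def brute_force_match(
    query: Sequence[bytes],
    reference: Sequence[bytes],
    max_distance: int = 64,  # assumed threshold, not taken from the paper
) -> List[Tuple[int, int, int]]:
    """Return (query_index, reference_index, distance) for mutual nearest pairs."""
    matches = []
    for i, q in enumerate(query):
        j = nearest(q, reference)
        if j is None:
            continue
        # Cross-check: keep the pair only if the reference descriptor's
        # nearest query descriptor is this same query descriptor.
        if nearest(reference[j], query) != i:
            continue
        d = hamming(q, reference[j])
        if d <= max_distance:
            matches.append((i, j, d))
    return matches


def identify(
    query: Sequence[bytes],
    database: Dict[str, Sequence[bytes]],
    max_distance: int = 64,
) -> Optional[str]:
    """Return the database key with the most accepted matches, or None."""
    best_name, best_count = None, 0
    for name, reference in database.items():
        count = len(brute_force_match(query, reference, max_distance))
        if count > best_count:
            best_name, best_count = name, count
    return best_name
```

Hamming distance is the usual metric for binary descriptors such as those ORB produces, which is why it is used here. A real system would more likely rely on an existing matcher from a computer vision library than on this exhaustive pure-Python loop.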