Copy the page URI to the clipboard
Kaliciak, Leszek; Horsburgh, Ben; Song, Dawei; Wiratunga, Nirmalie and Pan, Jeff
(2012).
DOI: https://doi.org/10.1007/978-3-642-35341-3_19
Abstract
This paper presents a novel approach to Music Information Retrieval. Having represented the music tracks in the form of two dimensional images, we apply the "bag of visual words" method from visual IR in order to classify the songs into 19 genres. By switching to visual domain we can abstract from musical concepts such as melody, timbre and rhythm. We obtained classification accuracy of 46% (with 5% theoretical baseline for random classification) which is comparable with existing state-of-the-art approaches. Moreover, the novel features characterize different properties of the signal than standard methods. Therefore, the combination of them should further improve the performance of existing techniques.
Viewing alternatives
Download history
Metrics
Public Attention
Altmetrics from AltmetricNumber of Citations
Citations from DimensionsItem Actions
Export
About
- Item ORO ID
- 34651
- Item Type
- Conference or Workshop Item
- Keywords
- local features; co-occurrence matrix; colour moments; K-means algorithm; Fourier transform
- Academic Unit or School
-
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Faculty of Science, Technology, Engineering and Mathematics (STEM) > Computing and Communications - Copyright Holders
- © 2012 Springer
- Related URLs
- Depositing User
- Dawei Song