Publication details for Professor Tuomas EerolaSaari, Pasi, Fazekas, Gyorgy, Eerola, Tuomas, Barthet, Mathieu, Lartillot, Olivier & Sandler, Mark (2016). Genre-adaptive Semantic Computing and Audio-based Modelling for Music Mood Annotation. IEEE Transactions on Affective Computing 7(2): 122-135.
- Publication type: Journal Article
- ISSN/ISBN: 1949-3045 (print)
- DOI: 10.1109/TAFFC.2015.2462841
- Keywords: Music information retrieval, Mood prediction, Social tags, Semantic computing, Music genre, Genre-adaptive.
- Further publication details on publisher web site
- Durham Research Online (DRO) - may include full text
Author(s) from Durham
This study investigates whether taking genre into account is beneficial for automatic music mood annotation in terms of core affects valence, arousal, and tension, as well as several other mood scales. Novel techniques employing genre-adaptive semantic computing and audio-based modelling are proposed. A technique called the ACTwg employs genre-adaptive semantic computing of mood-related social tags, whereas ACTwg-SLPwg combines semantic computing and audio-based modelling, both in a genre-adaptive manner. The proposed techniques are experimentally evaluated at predicting listener ratings related to a set of 600 popular music tracks spanning multiple genres. The results show that ACTwg outperforms a semantic computing technique that does not exploit genre information, and ACTwg-SLPwg outperforms conventional techniques and other genre-adaptive alternatives. In particular, improvements in the prediction rates are obtained for the valence dimension which is typically the most challenging core affect dimension for audio-based annotation. The specificity of genre categories is not crucial for the performance of ACTwg-SLPwg. The study also presents analytical insights into inferring a concise tag-based genre representation for genre-adaptive music mood analysis.