Bibliographic record no.: 1059920
Cochleogram-based approach for detecting perceived emotions in music
Cochleogram-based approach for detecting perceived emotions in music // Information Processing & Management, 57 (2020), 5; 102270, 17 doi:10.1016/j.ipm.2020.102270 (international peer review, article, scientific)
CROSBI ID: 1059920
Title
Cochleogram-based approach for detecting perceived emotions in music
Authors
Russo, Mladen ; Kraljević, Luka ; Stella, Maja ; Sikora, Marjan
Source
Information Processing & Management (0306-4573) 57 (2020), 5; 102270, 17
Type, subtype and category of work
Journal papers, article, scientific
Keywords
Music information retrieval ; Affective content prediction ; Cochlea ; ConvNet
Abstract
Identifying the perceived emotional content of music is an important aspect of easy and efficient search, retrieval, and management of the media. One of the most promising use cases of music organization is an emotion-based playlist, where automatic music emotion recognition plays a significant role in providing emotion-related information that is otherwise generally unavailable. Based on the importance of the auditory system in emotional recognition and processing, in this study we propose a new cochleogram-based system for detecting affective musical content. To effectively simulate the response of the human auditory periphery, the music audio signal is processed by a detailed biophysical cochlear model, thus obtaining an output that closely matches the characteristics of human hearing. In the proposed approach, a convolutional neural network (CNN) is used to extract the relevant music features from cochleogram images, which we construct directly from the response of the basilar membrane. To validate the practical implications of the proposed approach with regard to its possible integration in different digital music libraries, an extensive study was conducted to evaluate its predictive performance in different aspects of music emotion recognition. The proposed approach was evaluated on the publicly available 1000 Songs database, and the experimental results showed that it performed better than common musical features (such as tempo, mode, pitch, clarity, and perceptually motivated mel-frequency cepstral coefficients (MFCC)) as well as the official "MediaEval" challenge results on the same reference database. Our findings clearly show that the proposed approach can lead to better music emotion recognition performance and can be used as part of a state-of-the-art music information retrieval system.
Original language
English
Scientific fields
Computing, Information and communication sciences
WORK AFFILIATIONS
Projects:
HRZZ-UIP-2014-09-3875 - Pametna okruženja za poboljšanje kvalitete života [Smart environments for improving quality of life] (ELISE) (Russo, Mladen, HRZZ - 2014-09)
Institutions:
Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture, Split
Journal indexed in:
- Current Contents Connect (CCC)
- Web of Science Core Collection (WoSCC)
- Science Citation Index Expanded (SCI-EXP)
- Social Science Citation Index (SSCI)
- SCI-EXP, SSCI and/or A&HCI
- Scopus