期刊论文详细信息
PeerJ Computer Science
Using machine learning analysis to interpret the relationship between music emotion and lyric features
Zaoyi Sun1  Chi-ju Chao2  Xin Wen3  Liang Xu3  Zhengxi Huang3  Liuchang Xu4 
[1] College of Education, Zhejiang University of Technology, Hangzhou, China;Department of Information Art and Design, Tsinghua University, Beijing, China;Department of Psychology and Behavioral Sciences, Zhejiang University, Hangzhou, China;Zhejiang Provincial Key Laboratory of Forestry Intelligent Monitoring and Information Technology, Zhejiang A&F University, Hangzhou, China;
关键词: Music emotion recognition;    Lyric feature extraction;    Audio signal processing;    LIWC;    Chinese pop song;   
DOI  :  10.7717/peerj-cs.785
来源: DOAJ
【 摘 要 】

Melody and lyrics, reflecting two unique human cognitive abilities, are usually combined in music to convey emotions. Although psychologists and computer scientists have made considerable progress in revealing the association between musical structure and the perceived emotions of music, the features of lyrics are relatively less discussed. Using linguistic inquiry and word count (LIWC) technology to extract lyric features in 2,372 Chinese songs, this study investigated the effects of LIWC-based lyric features on the perceived arousal and valence of music. First, correlation analysis shows that, for example, the perceived arousal of music was positively correlated with the total number of lyric words and the mean number of words per sentence and was negatively correlated with the proportion of words related to the past and insight. The perceived valence of music was negatively correlated with the proportion of negative emotion words. Second, we used audio and lyric features as inputs to construct music emotion recognition (MER) models. The performance of random forest regressions reveals that, for the recognition models of perceived valence, adding lyric features can significantly improve the prediction effect of the model using audio features only; for the recognition models of perceived arousal, lyric features are almost useless. Finally, by calculating the feature importance to interpret the MER models, we observed that the audio features played a decisive role in the recognition models of both perceived arousal and perceived valence. Unlike the uselessness of the lyric features in the arousal recognition model, several lyric features, such as the usage frequency of words related to sadness, positive emotions, and tentativeness, played important roles in the valence recognition model.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:8次