{"title":"Use of Gaussian Process to Model, predict and Explain Human Emotional response to Chinese Traditional Music","authors":"Jun Su, Pengcheng Zhou","doi":"10.1142/s0219525922500011","DOIUrl":null,"url":null,"abstract":"Music listening is one of the most enigmatic of human mental phenomena; it not only triggers emotions but also changes our behavior. During the music session many people are observed to exhibit varying emotional response, which can be influenced by diverse factors such as music genre and instrument as well as the personal attributes of audiences. In this study, we assume that there is an intrinsic, complex and implicit relationship between the basic sound features of music and human emotional response to the music. The response levels of 12 individuals to a representative repertoire of 36 classical/popular Chinese traditional music (CTM) are systematically analyzed using the chills as a quantitative indicator, totally resulting in 432 ([Formula: see text]) CTM–individual pairs that define a systematic individual-to-music response profile (SPTMRP). Gaussian process (GP) is then employed to model the multivariate correlation of SPTMRP profile with 15 sound features (including 5 Timbres, 4 Rhythms and 6 Pitchs) and 5 individual features in a supervised manner, which is also improved by genetic algorithm (GA) feature selection and compared with other machine learning methods. It is shown that the built GP regression model possesses a strong internal fitting ability ([Formula: see text]) and a good external predictive power ([Formula: see text]), which performed much better than linear PLS and nonlinear SVM and RF, confirming that the human emotional response to music can be quantitatively explained by GP methodology. Statistical examination of the GP model reveals that the sound features contribute more significantly to emotional response than individual features; their importance increases in the order: [Formula: see text], in which the spectral centroid (SC), relative amplitude of salient peaks (RASP), ratio of peak amplitudes (RPA), sum of all rhythm histograms (SARH) and period of unfolded maximum peak (PUMP) as well as gender are primarily responsible for the response.","PeriodicalId":50871,"journal":{"name":"Advances in Complex Systems","volume":"33 1","pages":"2250001:1-2250001:22"},"PeriodicalIF":0.7000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Complex Systems","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1142/s0219525922500011","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 2
Abstract
Music listening is one of the most enigmatic of human mental phenomena; it not only triggers emotions but also changes our behavior. During the music session many people are observed to exhibit varying emotional response, which can be influenced by diverse factors such as music genre and instrument as well as the personal attributes of audiences. In this study, we assume that there is an intrinsic, complex and implicit relationship between the basic sound features of music and human emotional response to the music. The response levels of 12 individuals to a representative repertoire of 36 classical/popular Chinese traditional music (CTM) are systematically analyzed using the chills as a quantitative indicator, totally resulting in 432 ([Formula: see text]) CTM–individual pairs that define a systematic individual-to-music response profile (SPTMRP). Gaussian process (GP) is then employed to model the multivariate correlation of SPTMRP profile with 15 sound features (including 5 Timbres, 4 Rhythms and 6 Pitchs) and 5 individual features in a supervised manner, which is also improved by genetic algorithm (GA) feature selection and compared with other machine learning methods. It is shown that the built GP regression model possesses a strong internal fitting ability ([Formula: see text]) and a good external predictive power ([Formula: see text]), which performed much better than linear PLS and nonlinear SVM and RF, confirming that the human emotional response to music can be quantitatively explained by GP methodology. Statistical examination of the GP model reveals that the sound features contribute more significantly to emotional response than individual features; their importance increases in the order: [Formula: see text], in which the spectral centroid (SC), relative amplitude of salient peaks (RASP), ratio of peak amplitudes (RPA), sum of all rhythm histograms (SARH) and period of unfolded maximum peak (PUMP) as well as gender are primarily responsible for the response.
期刊介绍:
Advances in Complex Systems aims to provide a unique medium of communication for multidisciplinary approaches, either empirical or theoretical, to the study of complex systems. The latter are seen as systems comprised of multiple interacting components, or agents. Nonlinear feedback processes, stochastic influences, specific conditions for the supply of energy, matter, or information may lead to the emergence of new system qualities on the macroscopic scale that cannot be reduced to the dynamics of the agents. Quantitative approaches to the dynamics of complex systems have to consider a broad range of concepts, from analytical tools, statistical methods and computer simulations to distributed problem solving, learning and adaptation. This is an interdisciplinary enterprise.