Toward Multimodal Modeling of Emotional Expressiveness.

Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference) Pub Date : 2020-10-01 DOI:10.1145/3382507.3418887

Victoria Lin, Jeffrey M Girard, Michael A Sayette, Louis-Philippe Morency

{"title":"Toward Multimodal Modeling of Emotional Expressiveness.","authors":"Victoria Lin, Jeffrey M Girard, Michael A Sayette, Louis-Philippe Morency","doi":"10.1145/3382507.3418887","DOIUrl":null,"url":null,"abstract":"Emotional expressiveness captures the extent to which a person tends to outwardly display their emotions through behavior. Due to the close relationship between emotional expressiveness and behavioral health, as well as the crucial role that it plays in social interaction, the ability to automatically predict emotional expressiveness stands to spur advances in science, medicine, and industry. In this paper, we explore three related research questions. First, how well can emotional expressiveness be predicted from visual, linguistic, and multimodal behavioral signals? Second, how important is each behavioral modality to the prediction of emotional expressiveness? Third, which behavioral signals are reliably related to emotional expressiveness? To answer these questions, we add highly reliable transcripts and human ratings of perceived emotional expressiveness to an existing video database and use this data to train, validate, and test predictive models. Our best model shows promising predictive performance on this dataset (RMSE = 0.65, R 2 = 0.45, r = 0.74). Multimodal models tend to perform best overall, and models trained on the linguistic modality tend to outperform models trained on the visual modality. Finally, examination of our interpretable models' coefficients reveals a number of visual and linguistic behavioral signals-such as facial action unit intensity, overall word count, and use of words related to social processes-that reliably predict emotional expressiveness.","PeriodicalId":74508,"journal":{"name":"Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)","volume":"2020 ","pages":"548-557"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8106384/pdf/nihms-1680572.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3382507.3418887","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Emotional expressiveness captures the extent to which a person tends to outwardly display their emotions through behavior. Due to the close relationship between emotional expressiveness and behavioral health, as well as the crucial role that it plays in social interaction, the ability to automatically predict emotional expressiveness stands to spur advances in science, medicine, and industry. In this paper, we explore three related research questions. First, how well can emotional expressiveness be predicted from visual, linguistic, and multimodal behavioral signals? Second, how important is each behavioral modality to the prediction of emotional expressiveness? Third, which behavioral signals are reliably related to emotional expressiveness? To answer these questions, we add highly reliable transcripts and human ratings of perceived emotional expressiveness to an existing video database and use this data to train, validate, and test predictive models. Our best model shows promising predictive performance on this dataset (RMSE = 0.65, R ² = 0.45, r = 0.74). Multimodal models tend to perform best overall, and models trained on the linguistic modality tend to outperform models trained on the visual modality. Finally, examination of our interpretable models' coefficients reveals a number of visual and linguistic behavioral signals-such as facial action unit intensity, overall word count, and use of words related to social processes-that reliably predict emotional expressiveness.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

实现情绪表达的多模态建模

情绪表达能力是指一个人倾向于通过行为向外展示其情绪的程度。由于情绪表达能力与行为健康之间的密切关系，以及情绪表达能力在社会交往中的关键作用，自动预测情绪表达能力的能力将推动科学、医学和工业的进步。在本文中，我们将探讨三个相关的研究问题。首先，从视觉、语言和多模态行为信号中预测情绪表达能力的效果如何？第二，每种行为模式对预测情绪表达能力的重要性如何？第三，哪些行为信号与情绪表达能力有可靠的关系？为了回答这些问题，我们在现有的视频数据库中添加了高度可靠的文字记录和人类对感知到的情感表现力的评分，并使用这些数据来训练、验证和测试预测模型。我们的最佳模型在该数据集上显示出良好的预测性能（RMSE = 0.65，R 2 = 0.45，r = 0.74）。多模态模型的整体表现往往最好，而在语言模态上训练的模型往往优于在视觉模态上训练的模型。最后，我们对可解释模型的系数进行了研究，发现了一些视觉和语言行为信号--如面部动作单位强度、总字数以及与社会过程相关的词语的使用--可以可靠地预测情绪表达能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)

自引率

0.00%

发文量