Zhihong Zeng, ZhenQiu Zhang, Brian Pianfetti, J. Tu, Thomas S. Huang
{"title":"Audio-visual affect recognition in activation-evaluation space","authors":"Zhihong Zeng, ZhenQiu Zhang, Brian Pianfetti, J. Tu, Thomas S. Huang","doi":"10.1109/ICME.2005.1521551","DOIUrl":null,"url":null,"abstract":"The ability of a computer to detect and appropriately respond to changes in a user's affective state has significant implications to human-computer interaction (HCI). To more accurately simulate the human ability to assess affects through multi-sensory data, automatic affect recognition should also make use of multimodal data. In this paper, we present our efforts toward audio-visual affect recognition. Based on psychological research, we have chosen affect categories based on an activation-evaluation space which is robust in capturing significant aspects of emotion. We apply the Fisher boosting learning algorithm which can build a strong classifier by combining a small set of weak classification functions. Our experimental results show with 30 Fisher features, the testing error rates of our bimodal affect recognition is about 16% on the evaluation axis and 13% on the activation axis.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"334-335 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE International Conference on Multimedia and Expo","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME.2005.1521551","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 29
Abstract
The ability of a computer to detect and appropriately respond to changes in a user's affective state has significant implications for human-computer interaction (HCI). To more accurately simulate the human ability to assess affect through multi-sensory data, automatic affect recognition should also make use of multimodal data. In this paper, we present our efforts toward audio-visual affect recognition. Drawing on psychological research, we have chosen affect categories based on an activation-evaluation space, which is robust in capturing significant aspects of emotion. We apply the Fisher boosting learning algorithm, which builds a strong classifier by combining a small set of weak classification functions. Our experimental results show that, with 30 Fisher features, the testing error rates of our bimodal affect recognition are about 16% on the evaluation axis and 13% on the activation axis.
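The core idea of building a strong classifier from a small set of weak classification functions can be illustrated with a generic boosting-style sketch. The snippet below is a minimal, assumption-laden illustration using AdaBoost-like decision stumps and weighted voting; the paper's actual Fisher boosting algorithm selects weak learners differently, and the function names, feature layout, and binary labeling of each activation-evaluation axis here are hypothetical.

```python
# Illustrative sketch only: a generic AdaBoost-style combination of weak
# classifiers. This is NOT the paper's Fisher boosting algorithm; it only
# shows how weak learners can be combined into a strong classifier.
import numpy as np

def train_boosted_classifier(X, y, n_rounds=30):
    """Combine n_rounds weak (decision-stump) classifiers into a strong one.

    X: (n_samples, n_features) audio-visual feature matrix (hypothetical layout)
    y: (n_samples,) labels in {-1, +1}, e.g. one axis of the
       activation-evaluation space thresholded at its midpoint
    """
    n = len(y)
    w = np.full(n, 1.0 / n)            # uniform sample weights to start
    stumps, alphas = [], []
    for _ in range(n_rounds):
        best = None
        # Weak learner: threshold on a single feature (decision stump).
        for f in range(X.shape[1]):
            for thr in np.unique(X[:, f]):
                for sign in (1, -1):
                    pred = sign * np.where(X[:, f] > thr, 1, -1)
                    err = np.sum(w[pred != y])
                    if best is None or err < best[0]:
                        best = (err, f, thr, sign)
        err, f, thr, sign = best
        err = max(err, 1e-10)
        alpha = 0.5 * np.log((1 - err) / err)   # vote weight for this weak learner
        pred = sign * np.where(X[:, f] > thr, 1, -1)
        w *= np.exp(-alpha * y * pred)          # up-weight misclassified samples
        w /= w.sum()
        stumps.append((f, thr, sign))
        alphas.append(alpha)
    return stumps, alphas

def predict(stumps, alphas, X):
    """Strong classifier = sign of the weighted sum of weak predictions."""
    score = np.zeros(len(X))
    for (f, thr, sign), a in zip(stumps, alphas):
        score += a * sign * np.where(X[:, f] > thr, 1, -1)
    return np.sign(score)
```

In this sketch, 30 rounds loosely mirror the 30 Fisher features reported in the abstract; each round contributes one weak decision and a corresponding weight, and the final decision is the sign of the weighted sum.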