Multi-Feature Fusion for Multimodal Attentive Sentiment Analysis

Proceedings of the ACM Multimedia Asia Pub Date : 2019-12-15 DOI:10.1145/3338533.3366591

A. Man, Yuanyuan Pu, Dan Xu, Wenhua Qian, Zhengpeng Zhao, Qiuxia Yang

{"title":"Multi-Feature Fusion for Multimodal Attentive Sentiment Analysis","authors":"A. Man, Yuanyuan Pu, Dan Xu, Wenhua Qian, Zhengpeng Zhao, Qiuxia Yang","doi":"10.1145/3338533.3366591","DOIUrl":null,"url":null,"abstract":"Sentiment analysis has been an interesting and challenging task, researchers mostly pay attention to single-modal (image or text) emotion recognition, less attention is paid to joint analysis of multi-modal data. Most existing multi-modal sentiment analysis algorithms combined with attention mechanism focus only on local area of images, ignore the emotional information provided by the global features of the image. Motivated by the research status quo, in this paper, we proposed a novel multi-modal sentiment analysis model, which focuses on local attentive feature also on the global contextual feature from image, then a novel feature fusion mechanism is utilized to fuse features from different modal. In our proposed model, we use a convolutional neural network (CNN) to extract the region maps of images, and use the attention mechanism to acquire attention coefficient, then use a CNN with fewer hidden layers to extract the global feature, a long-short term memory model (LSTM) is utilized to extract textual feature. Finally, a tensor fusion network (TFN) is utilized to fuse all features from different modal. Extensive experiments are conducted on both weakly labeled and manually labeled datasets, and the results demonstrate the superiority of the proposed method.","PeriodicalId":273086,"journal":{"name":"Proceedings of the ACM Multimedia Asia","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM Multimedia Asia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3338533.3366591","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Sentiment analysis has been an interesting and challenging task, researchers mostly pay attention to single-modal (image or text) emotion recognition, less attention is paid to joint analysis of multi-modal data. Most existing multi-modal sentiment analysis algorithms combined with attention mechanism focus only on local area of images, ignore the emotional information provided by the global features of the image. Motivated by the research status quo, in this paper, we proposed a novel multi-modal sentiment analysis model, which focuses on local attentive feature also on the global contextual feature from image, then a novel feature fusion mechanism is utilized to fuse features from different modal. In our proposed model, we use a convolutional neural network (CNN) to extract the region maps of images, and use the attention mechanism to acquire attention coefficient, then use a CNN with fewer hidden layers to extract the global feature, a long-short term memory model (LSTM) is utilized to extract textual feature. Finally, a tensor fusion network (TFN) is utilized to fuse all features from different modal. Extensive experiments are conducted on both weakly labeled and manually labeled datasets, and the results demonstrate the superiority of the proposed method.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于多特征融合的多模态关注情感分析

情感分析一直是一项有趣而富有挑战性的任务，研究人员大多关注单模态(图像或文本)情感识别，而对多模态数据的联合分析关注较少。现有的结合注意机制的多模态情感分析算法大多只关注图像的局部区域，忽略了图像全局特征所提供的情感信息。针对目前的研究现状，本文提出了一种新的多模态情感分析模型，该模型既关注图像的局部关注特征，又关注图像的全局上下文特征，然后利用一种新的特征融合机制融合不同模态的特征。该模型首先利用卷积神经网络(CNN)提取图像的区域映射，利用注意机制获取注意系数，然后利用隐含层较少的卷积神经网络提取全局特征，利用长短期记忆模型(LSTM)提取文本特征。最后，利用张量融合网络(TFN)对不同模态的特征进行融合。在弱标记和手动标记的数据集上进行了大量的实验，结果表明了该方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the ACM Multimedia Asia

自引率

0.00%

发文量

期刊最新文献

Session details: Vision in Multimedia Domain Specific and Idiom Adaptive Video Summarization Multi-Label Image Classification with Attention Mechanism and Graph Convolutional Networks Session details: Brave New Idea Self-balance Motion and Appearance Model for Multi-object Tracking in UAV