Attention-free based dual-encoder mechanism for Aspect-based Multimodal Sentiment Recognition

2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT) Pub Date : 2023-06-09 DOI:10.1109/APSIT58554.2023.10201711

Pankaj Gupta, Ananya Pandey, Ajeet Kumar, D. Vishwakarma

{"title":"Attention-free based dual-encoder mechanism for Aspect-based Multimodal Sentiment Recognition","authors":"Pankaj Gupta, Ananya Pandey, Ajeet Kumar, D. Vishwakarma","doi":"10.1109/APSIT58554.2023.10201711","DOIUrl":null,"url":null,"abstract":"Multimodal aspect-based sentiment recognition (MABSR) is a recently developed task in sentiment recognition that tries to assess the sentiment associated with text and image pairings by generally extracting the polarity terms from the pairs. Both the pipeline and the unified transformer based technique, which employs the cross-attention only mechanism, have been widely utilized in recent works. However, the alignment between text and picture is not openly and reliably included in these approaches. There is still a minimum threshold of aligned image-text pairings needed for supervised fine-tuning of said universal transformers for MABSR. Motivated by this observation and inspired by the various attention-only mechanisms, we analyze MABSR and propose an attention-free encoder-based transformer architecture. Dual attention-free based backbone encoder models with cross-modal symmetry are utilized in this work. To improve cross-modal performance, we include two new subtasks: aspect-only extraction and polarity feature representation alignment. This motivates both encoders to provide more precise depictions of multiple modalities.","PeriodicalId":170044,"journal":{"name":"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIT58554.2023.10201711","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Multimodal aspect-based sentiment recognition (MABSR) is a recently developed task in sentiment recognition that tries to assess the sentiment associated with text and image pairings by generally extracting the polarity terms from the pairs. Both the pipeline and the unified transformer based technique, which employs the cross-attention only mechanism, have been widely utilized in recent works. However, the alignment between text and picture is not openly and reliably included in these approaches. There is still a minimum threshold of aligned image-text pairings needed for supervised fine-tuning of said universal transformers for MABSR. Motivated by this observation and inspired by the various attention-only mechanisms, we analyze MABSR and propose an attention-free encoder-based transformer architecture. Dual attention-free based backbone encoder models with cross-modal symmetry are utilized in this work. To improve cross-modal performance, we include two new subtasks: aspect-only extraction and polarity feature representation alignment. This motivates both encoders to provide more precise depictions of multiple modalities.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于无注意力的基于方面的多模态情感识别双编码器机制

基于多模态方面的情感识别(MABSR)是最近发展起来的一项情感识别任务，它试图通过从文本和图像对中提取极性项来评估与文本和图像对相关的情感。采用交叉关注机制的管道技术和统一变压器技术在近年来得到了广泛的应用。然而，文本和图片之间的对齐并没有公开和可靠地包含在这些方法中。对于MABSR通用变压器的监督微调，仍然需要对齐图像-文本对的最小阈值。基于这一观察结果并受到各种仅关注机制的启发，我们分析了MABSR并提出了一种基于无关注编码器的变压器架构。在这项工作中使用了基于双无注意的骨干编码器模型，该模型具有跨模态对称性。为了提高跨模态性能，我们增加了两个新的子任务:纯方面提取和极性特征表示对齐。这促使两个编码器提供更精确的多模态描述。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)

自引率

0.00%

发文量