Image–text sentiment analysis based on hierarchical interaction fusion and contrast learning enhanced

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2025-02-18 DOI:10.1016/j.engappai.2025.110262
Hongbing Wang , Qifei Du , Yan Xiang
{"title":"Image–text sentiment analysis based on hierarchical interaction fusion and contrast learning enhanced","authors":"Hongbing Wang ,&nbsp;Qifei Du ,&nbsp;Yan Xiang","doi":"10.1016/j.engappai.2025.110262","DOIUrl":null,"url":null,"abstract":"<div><div>As social media continues to evolve, an increasing number of individuals prefer to publish a variety of content that combines different forms of expression on platforms to convey their feelings. In recent times, the study of sentiment within these diverse media formats has gained significant traction. However, some studies have neglected the multilayered interactions between modalities and underutilized the relationships between data samples and between classes. To address this problem, this paper proposes an image–text sentiment analysis method based on layer interaction fusion and contrast learning enhanced. First, the method uses a multi-layer cross-modal interaction module to emphasize the complementarity between modalities. The correlation information between modalities is deeply mined. Then, feature integration is performed by the multimodal fusion module. In addition, the model introduces a comparative learning task to exploit the relationship between samples and between classes. The emotional features under different classes and the key emotional features of the samples are mined. Finally, sentiment classification is performed to realize image–text sentiment analysis. In order to assess the efficacy of the approach, this paper performs extensive experimental validation on a multimodal sentiment dataset. Compared with several baseline models, the model in this paper achieves certain improvements. For example, on the multi-view sentiment analysis single dataset(MVSA-Single dataset), the F1 score is improved by 1.90% compared to the baseline model.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"146 ","pages":"Article 110262"},"PeriodicalIF":8.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625002623","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

As social media continues to evolve, an increasing number of individuals prefer to publish a variety of content that combines different forms of expression on platforms to convey their feelings. In recent times, the study of sentiment within these diverse media formats has gained significant traction. However, some studies have neglected the multilayered interactions between modalities and underutilized the relationships between data samples and between classes. To address this problem, this paper proposes an image–text sentiment analysis method based on layer interaction fusion and contrast learning enhanced. First, the method uses a multi-layer cross-modal interaction module to emphasize the complementarity between modalities. The correlation information between modalities is deeply mined. Then, feature integration is performed by the multimodal fusion module. In addition, the model introduces a comparative learning task to exploit the relationship between samples and between classes. The emotional features under different classes and the key emotional features of the samples are mined. Finally, sentiment classification is performed to realize image–text sentiment analysis. In order to assess the efficacy of the approach, this paper performs extensive experimental validation on a multimodal sentiment dataset. Compared with several baseline models, the model in this paper achieves certain improvements. For example, on the multi-view sentiment analysis single dataset(MVSA-Single dataset), the F1 score is improved by 1.90% compared to the baseline model.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于层次交互融合和对比学习的图像-文本情感分析增强
随着社交媒体的不断发展,越来越多的个人更喜欢在平台上发布各种内容,结合不同的表达形式来传达自己的感受。最近,对这些不同媒体形式的情绪的研究获得了显著的关注。然而,一些研究忽略了模式之间的多层相互作用,未充分利用数据样本之间和类别之间的关系。针对这一问题,本文提出了一种基于层交互融合和增强对比学习的图像-文本情感分析方法。首先,该方法采用多层跨模态交互模块,强调模态之间的互补性。模态之间的相关信息被深度挖掘。然后,通过多模态融合模块进行特征集成。此外,该模型引入了一个比较学习任务,以利用样本之间和类之间的关系。挖掘了不同类别下的情感特征和样本的关键情感特征。最后进行情感分类,实现图像-文本情感分析。为了评估该方法的有效性,本文在一个多模态情感数据集上进行了广泛的实验验证。与几种基线模型相比,本文模型取得了一定的改进。例如,在多视图情感分析单数据集(MVSA-Single dataset)上,F1分数比基线模型提高了1.90%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
期刊最新文献
Multiphysics response and internal leakage prediction of seismic hydraulic systems considering structural clearance effects Machine learning-based prediction of ductility of strain-hardening fiber-reinforced cementitious composites Neighborhood constrained attention for lightweight image super-resolution A quantum group decision-making model for patient-capital project selection integrating cumulative prospect theory under linear Diophantine fuzzy uncertainty Forecast-enhanced bilevel real-time pricing for microgrids via hybrid-action reinforcement learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1