Cross-Modal Attention Network for Detecting Multimodal Misinformation From Multiple Platforms
Zhiwei Guo; Yang Li; Zhenguo Yang; Xiaoping Li; Lap-Kei Lee; Qing Li; Wenyin Liu
IEEE Transactions on Computational Social Systems
DOI: 10.1109/TCSS.2024.3373661
Publication date: 2024-03-26 (Journal Article)
URL: https://ieeexplore.ieee.org/document/10478449/
JCR: Q1 (Computer Science, Cybernetics); Impact factor: 4.5; CAS ranking: Region 2 (Computer Science)
Citations: 0
Abstract
Misinformation detection in short videos on social media has become a pressing issue due to the popularity of the medium. However, existing datasets for misinformation detection are limited in terms of modalities and sources, hindering the development of effective detection methods. In this article, we introduce a novel dataset, the multiplatform multimodal misinformation (3M) dataset, collected specifically to investigate and address misinformation in a multimodal context. A total of 17 352 videos were collected from two prominent social media platforms, TikTok and Weibo. The 3M dataset covers 30 different topics, such as sports, health, news, and art, providing a diverse range of content for analysis. We further propose a novel approach named cross-modal attention misinformation detection (CAMD) for effectively detecting multimodal misinformation. CAMD leverages a cross-modal attention module to facilitate information exchange and fusion between modalities by learning the correlations and weights among them. The cross-modal attention module learns multilevel modality correlations, focuses primarily on the interaction between multimodal sequences across different time steps, and simultaneously adjusts the information from the source modality based on the information of the target modality. Extensive experiments on the 3M dataset show that the proposed method achieves state-of-the-art performance. Specifically, CAMD achieves accuracy, F1-score, precision, and recall values of 76.86%, 58.05%, 87.86%, and 58.70%, respectively, on the 3M dataset.
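The cross-modal attention described in the abstract, where one modality's sequence is re-weighted by its relevance to another modality's time steps, can be sketched with generic scaled dot-product attention. This is a minimal illustration under stated assumptions, not the authors' actual CAMD implementation; the feature dimensions, sequence lengths, and variable names (`video`, `text`) are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_modal_attention(target, source):
    """Scaled dot-product cross-modal attention (sketch).

    Queries come from the target modality, keys/values from the source
    modality, so each target time step gathers a relevance-weighted
    summary of the source sequence -- i.e., the source modality's
    information is adjusted based on the target modality.

    target: (T_tgt, d) feature sequence of the target modality
    source: (T_src, d) feature sequence of the source modality
    Returns (attended, weights) with shapes (T_tgt, d) and (T_tgt, T_src).
    """
    d_k = source.shape[-1]
    scores = target @ source.T / np.sqrt(d_k)  # (T_tgt, T_src) similarities
    weights = softmax(scores, axis=-1)         # each row sums to 1
    attended = weights @ source                # (T_tgt, d) fused features
    return attended, weights

# Hypothetical toy example: 4 video-frame features attend to 6 text-token features.
rng = np.random.default_rng(0)
video = rng.normal(size=(4, 8))  # target modality
text = rng.normal(size=(6, 8))   # source modality
fused, w = cross_modal_attention(video, text)
```

In a full model such attention would typically be applied in both directions (video-to-text and text-to-video) and at multiple levels, with learned query/key/value projections rather than raw features.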
Journal Introduction
IEEE Transactions on Computational Social Systems focuses on topics such as the modeling, simulation, analysis, and understanding of social systems from a quantitative and/or computational perspective. "Systems" include human-human, human-machine, and machine-machine organizations and adversarial situations, as well as social media structures and their dynamics. More specifically, the journal publishes articles on modeling the dynamics of social systems, methodologies for incorporating and representing sociocultural and behavioral aspects in computational modeling, analysis of social system behavior and structure, and paradigms for social systems modeling and simulation. The journal also features articles on social network dynamics, social intelligence and cognition, social systems design and architectures, sociocultural modeling and representation, and computational behavior modeling, and their applications.