Litian Zhang , Xiaoming Zhang , Ziyi Zhou , Xi Zhang , Philip S. Yu , Chaozhuo Li
{"title":"Knowledge-aware multimodal pre-training for fake news detection","authors":"Litian Zhang , Xiaoming Zhang , Ziyi Zhou , Xi Zhang , Philip S. Yu , Chaozhuo Li","doi":"10.1016/j.inffus.2024.102715","DOIUrl":null,"url":null,"abstract":"<div><div>Amidst the rapid propagation of multimodal fake news across various social media platforms, the identification and filtering of disinformation have emerged as critical areas of academic research. A salient characteristic of fake news lies in its diversity, encompassing text–image inconsistency, content–knowledge inconsistency, and content fabrication. However, existing endeavors are generally tailored to a specific subset of fake news, leading to limited universality. Moreover, these models primarily rely on scarce and exorbitant manually labeled annotations, which is incapable of providing sufficient learning signals to detect a variety of fake news. To address these challenges, we propose a novel knowledge-aware multimodal pre-training paradigm for fake news detection, dubbed KAMP. Our motivation lies in incorporating unsupervised correlations through pre-training tasks as complementary to alleviate the dependency on annotations. KAMP consists of a novel multimodal learning model and various delicate pre-training tasks to simultaneously capture valuable knowledge from single modality, multiple modalities, and background knowledge graphs. Our proposal undergoes comprehensive evaluation across two widely utilized datasets, and experimental results demonstrate the superiority of our proposal.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"114 ","pages":"Article 102715"},"PeriodicalIF":14.7000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253524004937","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Amidst the rapid propagation of multimodal fake news across various social media platforms, the identification and filtering of disinformation have emerged as critical areas of academic research. A salient characteristic of fake news lies in its diversity, encompassing text–image inconsistency, content–knowledge inconsistency, and content fabrication. However, existing endeavors are generally tailored to a specific subset of fake news, leading to limited universality. Moreover, these models primarily rely on scarce and exorbitant manually labeled annotations, which is incapable of providing sufficient learning signals to detect a variety of fake news. To address these challenges, we propose a novel knowledge-aware multimodal pre-training paradigm for fake news detection, dubbed KAMP. Our motivation lies in incorporating unsupervised correlations through pre-training tasks as complementary to alleviate the dependency on annotations. KAMP consists of a novel multimodal learning model and various delicate pre-training tasks to simultaneously capture valuable knowledge from single modality, multiple modalities, and background knowledge graphs. Our proposal undergoes comprehensive evaluation across two widely utilized datasets, and experimental results demonstrate the superiority of our proposal.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.