{"title":"Multimodal Fusion with BERT and Attention Mechanism for Fake News Detection","authors":"Nguyen Manh Duc Tuan, Pham Quang Nhat Minh","doi":"10.1109/RIVF51545.2021.9642125","DOIUrl":null,"url":null,"abstract":"Fake news detection is an important task for in- creasing the reliability of the information on the internet since fake news is spreading fast on social media and has a negative effect on our society. In this paper, we present a novel method for detecting fake news by fusing multi-modal features derived from textual and visual data. Specifically, we proposed a scaled dot- product attention mechanism to capture the relationship between text features extracted by a pre-trained BERT model and visual features extracted by a pre-trained VGG-19 model. Experimental results showed that our method improved against the current state-of-the-art method on a public Twitter dataset by 3.1% accuracy.","PeriodicalId":6860,"journal":{"name":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","volume":"24 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 RIVF International Conference on Computing and Communication Technologies (RIVF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RIVF51545.2021.9642125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
Fake news detection is an important task for in- creasing the reliability of the information on the internet since fake news is spreading fast on social media and has a negative effect on our society. In this paper, we present a novel method for detecting fake news by fusing multi-modal features derived from textual and visual data. Specifically, we proposed a scaled dot- product attention mechanism to capture the relationship between text features extracted by a pre-trained BERT model and visual features extracted by a pre-trained VGG-19 model. Experimental results showed that our method improved against the current state-of-the-art method on a public Twitter dataset by 3.1% accuracy.