Unsupervised Fake News Detection: A Graph-based Approach

Siva Charan Reddy Gangireddy, D. P., Cheng Long, Tanmoy Chakraborty
{"title":"Unsupervised Fake News Detection: A Graph-based Approach","authors":"Siva Charan Reddy Gangireddy, D. P., Cheng Long, Tanmoy Chakraborty","doi":"10.1145/3372923.3404783","DOIUrl":null,"url":null,"abstract":"Fake news has become more prevalent than ever, correlating with the rise of social media that allows every user to rapidly publish their views or hearsay. Today, fake news spans almost every realm of human activity, across diverse fields such as politics and healthcare. Most existing methods for fake news detection leverage supervised learning methods and expect a large labelled corpus of articles and social media user engagement information, which are often hard, time-consuming and costly to procure. In this paper, we consider the task of unsupervised fake news detection, which considers fake news detection in the absence of labelled historical data. We develop GTUT, a graph-based approach for the task which operates in three phases. Starting off with identifying a seed set of fake and legitimate articles exploiting high-level observations on inter-user behavior in fake news propagation, it progressively expands the labelling to all articles in the dataset. Our technique draws upon graph-based methods such as biclique identification, graph-based feature vector learning and label spreading. Through an extensive empirical evaluation over multiple real-world datasets, we establish the improved effectiveness of our method over state-of-the-art techniques for the task.","PeriodicalId":389616,"journal":{"name":"Proceedings of the 31st ACM Conference on Hypertext and Social Media","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"47","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 31st ACM Conference on Hypertext and Social Media","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3372923.3404783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 47

Abstract

Fake news has become more prevalent than ever, correlating with the rise of social media that allows every user to rapidly publish their views or hearsay. Today, fake news spans almost every realm of human activity, across diverse fields such as politics and healthcare. Most existing methods for fake news detection leverage supervised learning methods and expect a large labelled corpus of articles and social media user engagement information, which are often hard, time-consuming and costly to procure. In this paper, we consider the task of unsupervised fake news detection, which considers fake news detection in the absence of labelled historical data. We develop GTUT, a graph-based approach for the task which operates in three phases. Starting off with identifying a seed set of fake and legitimate articles exploiting high-level observations on inter-user behavior in fake news propagation, it progressively expands the labelling to all articles in the dataset. Our technique draws upon graph-based methods such as biclique identification, graph-based feature vector learning and label spreading. Through an extensive empirical evaluation over multiple real-world datasets, we establish the improved effectiveness of our method over state-of-the-art techniques for the task.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
无监督假新闻检测:基于图的方法
假新闻比以往任何时候都更加普遍,这与社交媒体的兴起有关,社交媒体允许每个用户快速发布自己的观点或传闻。如今,假新闻几乎涵盖了人类活动的各个领域,包括政治和医疗保健等多个领域。大多数现有的假新闻检测方法都利用监督学习方法,并期望获得大量标记的文章语料库和社交媒体用户参与信息,而这些信息通常很难、耗时且成本高昂。在本文中,我们考虑了无监督假新闻检测的任务,它考虑了在没有标记历史数据的情况下的假新闻检测。我们开发了GTUT,这是一种基于图的任务方法,分为三个阶段。首先,利用对假新闻传播中用户间行为的高级观察,识别一组虚假和合法文章的种子集,然后逐步将标签扩展到数据集中的所有文章。我们的技术借鉴了基于图的方法,如biclique识别,基于图的特征向量学习和标签扩展。通过对多个真实世界数据集的广泛实证评估,我们确定了我们的方法比最先进的技术更有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Date the Artist: A Virtual Date with a Virtual Character 3rd Workshop on Human Factors in Hypertext (HUMAN'20) Noise-Enhanced Community Detection You Do Not Decide for Me! Evaluating Explainable Group Aggregation Strategies for Tourism Personalizing Information Exploration with an Open User Model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1