Fake News Detection using Naive Bayes

Nurshaheeda Shazleen Yuslee, N. Abdullah
{"title":"Fake News Detection using Naive Bayes","authors":"Nurshaheeda Shazleen Yuslee, N. Abdullah","doi":"10.1109/ICSET53708.2021.9612540","DOIUrl":null,"url":null,"abstract":"The issue of fake news arises every year. Moreover, the enhancement and evolution of technologies enable the news to be manipulated by irresponsible people. However, it is not deniable that somehow this technology impacts our daily life. Nowadays, people get the latest news through the social media platforms as it is free, easy to access, and fast. However, not all the news on social media is reliable, and some fake news are spread to mislead the readers. Fake news can disseminate information to confuse people to believe things that are not true. In Natural Language Processing, text processing such as regular expression, removing the stop words and lemmatization are done before the data is being transformed into N-grams using TF-IDF and Count Vectorizer. Therefore, this paper aimed to review the fake news detection using the Naive Bayes algorithms. Results shows that Naive Bayes with n-gram gives a slight increase in the accuracy of TF-IDF and Count Vectorizer. It proves that TF-IDF Vectorizer can detect fake news better as it has higher precision of 94 % whereas Count Vectorizer can detect both fake news and real news in quite a balance.","PeriodicalId":433197,"journal":{"name":"2021 IEEE 11th International Conference on System Engineering and Technology (ICSET)","volume":"02 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 11th International Conference on System Engineering and Technology (ICSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSET53708.2021.9612540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

The issue of fake news arises every year. Moreover, the enhancement and evolution of technologies enable the news to be manipulated by irresponsible people. However, it is not deniable that somehow this technology impacts our daily life. Nowadays, people get the latest news through the social media platforms as it is free, easy to access, and fast. However, not all the news on social media is reliable, and some fake news are spread to mislead the readers. Fake news can disseminate information to confuse people to believe things that are not true. In Natural Language Processing, text processing such as regular expression, removing the stop words and lemmatization are done before the data is being transformed into N-grams using TF-IDF and Count Vectorizer. Therefore, this paper aimed to review the fake news detection using the Naive Bayes algorithms. Results shows that Naive Bayes with n-gram gives a slight increase in the accuracy of TF-IDF and Count Vectorizer. It proves that TF-IDF Vectorizer can detect fake news better as it has higher precision of 94 % whereas Count Vectorizer can detect both fake news and real news in quite a balance.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用朴素贝叶斯的假新闻检测
假新闻的问题每年都会出现。此外,技术的进步和发展使新闻能够被不负责任的人操纵。然而,不可否认的是,这项技术以某种方式影响着我们的日常生活。如今,人们通过社交媒体平台获得最新的新闻,因为它是免费的,易于访问,快速。然而,并不是所有社交媒体上的新闻都是可靠的,一些假新闻的传播误导了读者。假新闻可以传播信息,使人们相信不真实的事情。在自然语言处理中,在使用TF-IDF和Count Vectorizer将数据转换为n -gram之前,会进行文本处理,如正则表达式、删除停止词和词序化。因此,本文旨在回顾使用朴素贝叶斯算法的假新闻检测。结果表明,使用n-gram的朴素贝叶斯方法可以略微提高TF-IDF和计数矢量器的准确率。它证明了TF-IDF矢量器可以更好地检测假新闻,因为它具有高达94%的精度,而计数矢量器可以在相当平衡的情况下检测假新闻和真实新闻。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Encrypted Steganography Quick Response Scheme for Unified Hotel Access Control System NARX Neural Network Modeling of Batch Distillation Process Low Latency Peer to Peer Robot Wireless Communication with Edge Computing Model-based Control of a Gravimetric Dosing Conveyor for Alternative Fuels in the Cement Industry Design of an Arduino-Powered Sleep Monitoring System Based on Electrooculography (EOG) with Temperature Control Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1