Comparison of Machine Learning Approaches for Detecting COVID-19-Lockdown-Related Discussions During Recovery and Lockdown Periods

Mohammed Rashad Baker, A.H. Alamoodi, O.S. Albahri, A.S. Albahri, Salem Garfan, Amneh Alamleh, Moceheb Lazam Shuwandy, Ibrahim Alshakhatreh
{"title":"Comparison of Machine Learning Approaches for Detecting COVID-19-Lockdown-Related Discussions During Recovery and Lockdown Periods","authors":"Mohammed Rashad Baker, A.H. Alamoodi, O.S. Albahri, A.S. Albahri, Salem Garfan, Amneh Alamleh, Moceheb Lazam Shuwandy, Ibrahim Alshakhatreh","doi":"10.31181/jopi1120233","DOIUrl":null,"url":null,"abstract":"Ever since COVID-19 was declared a pandemic, governments around the world have implemented numerous phases of lockdown measures to curb the spread of the virus. These lockdown tactics manifest themselves in the form of widespread fear and panic driven by social media discussions. Given that individuals hold diverse opinions about these lockdown measures during and after their completion, positive and negative lockdown-related discussions should be differentiated to further understand the major related issues and to make appropriate messaging and policy choices in the future. We conduct a sentiment analysis (SA) of COVID-19-lockdown-related tweets by using different machine learning (ML) classifiers and then evaluate their performance before and after using the synthetic minority oversampling technique (SMOTE). This research is performed in five phases, starting with data collection and followed by pre-processing the dataset, preparing the dataset by annotation, applying SMOTE and using ML classifiers. We observe an improvement in accuracy ( ) as confirmed by the Matthew correlation coefficient ( ) across most classifiers, except for the k-nearest neighbour (KNN), whose Acc decreased from 0.82 to 0.59 and MCC decreased from 0.544 to 0.279 before and after SMOTE was applied. Despite the potential of SMOTE with some classifiers, this technique cannot be considered an ultimate solution, especially with other classifiers and datasets. The study provides insights into the need to evaluate and benchmark the integration of data balancing approaches with ML classifiers in addition to considering additional metrics, such as MCC, for binary classification problems, especially in SA.","PeriodicalId":489110,"journal":{"name":"Journal of Operations Intelligence","volume":"23 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Operations Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31181/jopi1120233","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Ever since COVID-19 was declared a pandemic, governments around the world have implemented numerous phases of lockdown measures to curb the spread of the virus. These lockdown tactics manifest themselves in the form of widespread fear and panic driven by social media discussions. Given that individuals hold diverse opinions about these lockdown measures during and after their completion, positive and negative lockdown-related discussions should be differentiated to further understand the major related issues and to make appropriate messaging and policy choices in the future. We conduct a sentiment analysis (SA) of COVID-19-lockdown-related tweets by using different machine learning (ML) classifiers and then evaluate their performance before and after using the synthetic minority oversampling technique (SMOTE). This research is performed in five phases, starting with data collection and followed by pre-processing the dataset, preparing the dataset by annotation, applying SMOTE and using ML classifiers. We observe an improvement in accuracy ( ) as confirmed by the Matthew correlation coefficient ( ) across most classifiers, except for the k-nearest neighbour (KNN), whose Acc decreased from 0.82 to 0.59 and MCC decreased from 0.544 to 0.279 before and after SMOTE was applied. Despite the potential of SMOTE with some classifiers, this technique cannot be considered an ultimate solution, especially with other classifiers and datasets. The study provides insights into the need to evaluate and benchmark the integration of data balancing approaches with ML classifiers in addition to considering additional metrics, such as MCC, for binary classification problems, especially in SA.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在恢复和封锁期间检测covid -19封锁相关讨论的机器学习方法的比较
自2019冠状病毒病被宣布为大流行以来,世界各国政府实施了多个阶段的封锁措施,以遏制病毒的传播。这些封锁策略表现为社交媒体讨论引发的广泛恐惧和恐慌。鉴于人们对封锁措施实施期间和实施后的看法不一,应区分积极和消极的封锁讨论,进一步了解相关重大问题,以便在未来做出适当的信息传递和政策选择。我们使用不同的机器学习(ML)分类器对covid -19封锁相关推文进行情感分析(SA),然后使用合成少数过采样技术(SMOTE)评估其前后的性能。本研究分五个阶段进行,从数据收集开始,然后是数据集预处理,通过注释准备数据集,应用SMOTE和使用ML分类器。我们观察到,除了k近邻(KNN),在应用SMOTE前后,其Acc从0.82下降到0.59,MCC从0.544下降到0.279,大多数分类器的马修相关系数()证实了准确性()的提高。尽管SMOTE在某些分类器上具有潜力,但这种技术不能被认为是最终的解决方案,特别是在其他分类器和数据集上。该研究提供了对数据平衡方法与ML分类器的集成进行评估和基准测试的需求的见解,此外还考虑了二元分类问题的附加指标,如MCC,特别是在SA中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Study on the Method of Selecting Sustainable Food Suppliers Considering Interactive Factors Smart-Watches Assisted Sugar Level Monitoring with Different Activities and Nutrition based on Machine Learning Approaches Determining The Impact of the COVID-19 Pandemic on Commercial Activities in Istanbul Facility Location Selection for Ammunition Depots based on GIS and Pythagorean Fuzzy WASPAS Extension of Interaction Geometric Aggregation Operator for Material Selection Using Interval-Valued Intuitionistic Fuzzy Hypersoft Set
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1