Mohammed Rashad Baker, A.H. Alamoodi, O.S. Albahri, A.S. Albahri, Salem Garfan, Amneh Alamleh, Moceheb Lazam Shuwandy, Ibrahim Alshakhatreh
{"title":"在恢复和封锁期间检测covid -19封锁相关讨论的机器学习方法的比较","authors":"Mohammed Rashad Baker, A.H. Alamoodi, O.S. Albahri, A.S. Albahri, Salem Garfan, Amneh Alamleh, Moceheb Lazam Shuwandy, Ibrahim Alshakhatreh","doi":"10.31181/jopi1120233","DOIUrl":null,"url":null,"abstract":"Ever since COVID-19 was declared a pandemic, governments around the world have implemented numerous phases of lockdown measures to curb the spread of the virus. These lockdown tactics manifest themselves in the form of widespread fear and panic driven by social media discussions. Given that individuals hold diverse opinions about these lockdown measures during and after their completion, positive and negative lockdown-related discussions should be differentiated to further understand the major related issues and to make appropriate messaging and policy choices in the future. We conduct a sentiment analysis (SA) of COVID-19-lockdown-related tweets by using different machine learning (ML) classifiers and then evaluate their performance before and after using the synthetic minority oversampling technique (SMOTE). This research is performed in five phases, starting with data collection and followed by pre-processing the dataset, preparing the dataset by annotation, applying SMOTE and using ML classifiers. We observe an improvement in accuracy ( ) as confirmed by the Matthew correlation coefficient ( ) across most classifiers, except for the k-nearest neighbour (KNN), whose Acc decreased from 0.82 to 0.59 and MCC decreased from 0.544 to 0.279 before and after SMOTE was applied. Despite the potential of SMOTE with some classifiers, this technique cannot be considered an ultimate solution, especially with other classifiers and datasets. The study provides insights into the need to evaluate and benchmark the integration of data balancing approaches with ML classifiers in addition to considering additional metrics, such as MCC, for binary classification problems, especially in SA.","PeriodicalId":489110,"journal":{"name":"Journal of Operations Intelligence","volume":"23 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of Machine Learning Approaches for Detecting COVID-19-Lockdown-Related Discussions During Recovery and Lockdown Periods\",\"authors\":\"Mohammed Rashad Baker, A.H. Alamoodi, O.S. Albahri, A.S. Albahri, Salem Garfan, Amneh Alamleh, Moceheb Lazam Shuwandy, Ibrahim Alshakhatreh\",\"doi\":\"10.31181/jopi1120233\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ever since COVID-19 was declared a pandemic, governments around the world have implemented numerous phases of lockdown measures to curb the spread of the virus. These lockdown tactics manifest themselves in the form of widespread fear and panic driven by social media discussions. Given that individuals hold diverse opinions about these lockdown measures during and after their completion, positive and negative lockdown-related discussions should be differentiated to further understand the major related issues and to make appropriate messaging and policy choices in the future. We conduct a sentiment analysis (SA) of COVID-19-lockdown-related tweets by using different machine learning (ML) classifiers and then evaluate their performance before and after using the synthetic minority oversampling technique (SMOTE). This research is performed in five phases, starting with data collection and followed by pre-processing the dataset, preparing the dataset by annotation, applying SMOTE and using ML classifiers. We observe an improvement in accuracy ( ) as confirmed by the Matthew correlation coefficient ( ) across most classifiers, except for the k-nearest neighbour (KNN), whose Acc decreased from 0.82 to 0.59 and MCC decreased from 0.544 to 0.279 before and after SMOTE was applied. Despite the potential of SMOTE with some classifiers, this technique cannot be considered an ultimate solution, especially with other classifiers and datasets. The study provides insights into the need to evaluate and benchmark the integration of data balancing approaches with ML classifiers in addition to considering additional metrics, such as MCC, for binary classification problems, especially in SA.\",\"PeriodicalId\":489110,\"journal\":{\"name\":\"Journal of Operations Intelligence\",\"volume\":\"23 3\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Operations Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31181/jopi1120233\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Operations Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31181/jopi1120233","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparison of Machine Learning Approaches for Detecting COVID-19-Lockdown-Related Discussions During Recovery and Lockdown Periods
Ever since COVID-19 was declared a pandemic, governments around the world have implemented numerous phases of lockdown measures to curb the spread of the virus. These lockdown tactics manifest themselves in the form of widespread fear and panic driven by social media discussions. Given that individuals hold diverse opinions about these lockdown measures during and after their completion, positive and negative lockdown-related discussions should be differentiated to further understand the major related issues and to make appropriate messaging and policy choices in the future. We conduct a sentiment analysis (SA) of COVID-19-lockdown-related tweets by using different machine learning (ML) classifiers and then evaluate their performance before and after using the synthetic minority oversampling technique (SMOTE). This research is performed in five phases, starting with data collection and followed by pre-processing the dataset, preparing the dataset by annotation, applying SMOTE and using ML classifiers. We observe an improvement in accuracy ( ) as confirmed by the Matthew correlation coefficient ( ) across most classifiers, except for the k-nearest neighbour (KNN), whose Acc decreased from 0.82 to 0.59 and MCC decreased from 0.544 to 0.279 before and after SMOTE was applied. Despite the potential of SMOTE with some classifiers, this technique cannot be considered an ultimate solution, especially with other classifiers and datasets. The study provides insights into the need to evaluate and benchmark the integration of data balancing approaches with ML classifiers in addition to considering additional metrics, such as MCC, for binary classification problems, especially in SA.