{"title":"EMOCOV: Machine learning for emotion detection, analysis and visualization using COVID-19 tweets","authors":"Md. Yasin Kabir, Sanjay Madria","doi":"10.1016/j.osnem.2021.100135","DOIUrl":null,"url":null,"abstract":"<div><p>The adversarial impact of the Covid-19 pandemic has created a health crisis globally all over the world. This unprecedented crisis forced people to lockdown and changed almost every aspect of the regular activities of the people. Thus, the pandemic is also impacting everyone physically, mentally, and economically, and it, therefore, is paramount to analyze and understand emotional responses during the crisis affecting mental health. Negative emotional responses at fine-grained labels like anger and fear during the crisis might also lead to irreversible socio-economic damages. In this work, we develop a neural network model and train it using manually labeled data to detect various emotions at fine-grained labels in the Covid-19 tweets automatically. We present a manually labeled tweets dataset on COVID-19 emotional responses along with regular tweets data. We created a custom Q&A roBERTa model to extract phrases from the tweets that are primarily responsible for the corresponding emotions. None of the existing datasets and work currently provide the selected words or phrases denoting the reason for the corresponding emotions. Our classification model outperforms other systems and achieves a Jaccard score of 0.6475 with an accuracy of 0.8951. The custom RoBERTa Q&A model outperforms other models by achieving a Jaccard score of 0.7865. Further, we present a historical emotion analysis using COVID-19 tweets over the USA including each state level analysis.</p></div>","PeriodicalId":52228,"journal":{"name":"Online Social Networks and Media","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.osnem.2021.100135","citationCount":"34","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Social Networks and Media","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468696421000197","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 34
Abstract
The adversarial impact of the Covid-19 pandemic has created a health crisis globally all over the world. This unprecedented crisis forced people to lockdown and changed almost every aspect of the regular activities of the people. Thus, the pandemic is also impacting everyone physically, mentally, and economically, and it, therefore, is paramount to analyze and understand emotional responses during the crisis affecting mental health. Negative emotional responses at fine-grained labels like anger and fear during the crisis might also lead to irreversible socio-economic damages. In this work, we develop a neural network model and train it using manually labeled data to detect various emotions at fine-grained labels in the Covid-19 tweets automatically. We present a manually labeled tweets dataset on COVID-19 emotional responses along with regular tweets data. We created a custom Q&A roBERTa model to extract phrases from the tweets that are primarily responsible for the corresponding emotions. None of the existing datasets and work currently provide the selected words or phrases denoting the reason for the corresponding emotions. Our classification model outperforms other systems and achieves a Jaccard score of 0.6475 with an accuracy of 0.8951. The custom RoBERTa Q&A model outperforms other models by achieving a Jaccard score of 0.7865. Further, we present a historical emotion analysis using COVID-19 tweets over the USA including each state level analysis.
Covid-19大流行的不利影响在全球范围内造成了一场健康危机。这场前所未有的危机迫使人们封锁,几乎改变了人们日常活动的方方面面。因此,大流行也在身体、精神和经济上影响着每个人,因此,分析和理解危机期间影响心理健康的情绪反应至关重要。在危机期间,愤怒和恐惧等细微标签上的负面情绪反应也可能导致不可逆转的社会经济损害。在这项工作中,我们开发了一个神经网络模型,并使用手动标记的数据对其进行训练,以自动检测Covid-19推文中细粒度标签上的各种情绪。我们提出了一个关于COVID-19情绪反应的手动标记推文数据集以及常规推文数据。我们创建了一个自定义的Q& a roBERTa模型来从tweet中提取主要负责相应情绪的短语。现有的数据集和工作目前都没有提供表示相应情绪原因的选定单词或短语。我们的分类模型优于其他系统,达到了0.6475的Jaccard分数和0.8951的准确率。定制RoBERTa Q&A模型通过获得0.7865的Jaccard分数而优于其他模型。此外,我们使用美国的COVID-19推文进行历史情绪分析,包括每个州的分析。