{"title":"Investigating and Analyzing Self-Reporting of Long COVID on Twitter: Findings from Sentiment Analysis","authors":"Nirmalya Thakur","doi":"10.3390/asi6050092","DOIUrl":null,"url":null,"abstract":"This paper presents multiple novel findings from a comprehensive analysis of a dataset comprising 1,244,051 Tweets about Long COVID, posted on Twitter between 25 May 2020 and 31 January 2023. First, the analysis shows that the average number of Tweets per month wherein individuals self-reported Long COVID on Twitter was considerably high in 2022 as compared to the average number of Tweets per month in 2021. Second, findings from sentiment analysis using VADER show that the percentages of Tweets with positive, negative, and neutral sentiments were 43.1%, 42.7%, and 14.2%, respectively. To add to this, most of the Tweets with a positive sentiment, as well as most of the Tweets with a negative sentiment, were not highly polarized. Third, the result of tokenization indicates that the tweeting patterns (in terms of the number of tokens used) were similar for the positive and negative Tweets. Analysis of these results also shows that there was no direct relationship between the number of tokens used and the intensity of the sentiment expressed in these Tweets. Finally, a granular analysis of the sentiments showed that the emotion of sadness was expressed in most of these Tweets. It was followed by the emotions of fear, neutral, surprise, anger, joy, and disgust, respectively.","PeriodicalId":36273,"journal":{"name":"Applied System Innovation","volume":"11 1","pages":"0"},"PeriodicalIF":3.8000,"publicationDate":"2023-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied System Innovation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/asi6050092","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents multiple novel findings from a comprehensive analysis of a dataset comprising 1,244,051 Tweets about Long COVID, posted on Twitter between 25 May 2020 and 31 January 2023. First, the analysis shows that the average number of Tweets per month wherein individuals self-reported Long COVID on Twitter was considerably high in 2022 as compared to the average number of Tweets per month in 2021. Second, findings from sentiment analysis using VADER show that the percentages of Tweets with positive, negative, and neutral sentiments were 43.1%, 42.7%, and 14.2%, respectively. To add to this, most of the Tweets with a positive sentiment, as well as most of the Tweets with a negative sentiment, were not highly polarized. Third, the result of tokenization indicates that the tweeting patterns (in terms of the number of tokens used) were similar for the positive and negative Tweets. Analysis of these results also shows that there was no direct relationship between the number of tokens used and the intensity of the sentiment expressed in these Tweets. Finally, a granular analysis of the sentiments showed that the emotion of sadness was expressed in most of these Tweets. It was followed by the emotions of fear, neutral, surprise, anger, joy, and disgust, respectively.