Temporal Relationship between Daily Reports of COVID-19 Infections and Related GDELT and Tweet Mentions

Q3 Social Sciences Human Geographies Pub Date : 2023-09-16 DOI:10.3390/geographies3030031
Innocensia Owuor, Hartwig H. Hochmair
{"title":"Temporal Relationship between Daily Reports of COVID-19 Infections and Related GDELT and Tweet Mentions","authors":"Innocensia Owuor, Hartwig H. Hochmair","doi":"10.3390/geographies3030031","DOIUrl":null,"url":null,"abstract":"Social media platforms are valuable data sources in the study of public reactions to events such as natural disasters and epidemics. This research assesses for selected countries around the globe the time lag between daily reports of COVID-19 cases and GDELT (Global Database of Events, Language, and Tone) and Twitter (X) COVID-19 mentions between February 2020 and April 2021 using time series analysis. Results show that GDELT articles and tweets preceded COVID-19 infections in Australia, Brazil, France, Greece, India, Italy, the U.S., Canada, Germany, and the U.K., while for Poland and the Philippines, tweets preceded and GDELT articles lagged behind COVID-19 disease incidences, respectively. This shows that the application of social media and news data for surveillance and management of pandemics needs to be assessed on a case-by-case basis for different countries. It also points towards the applicability of time series data analysis for only a limited number of countries due to strict data requirements (e.g., stationarity). A deviation from generally observed lag patterns in a country, i.e., periods with low COVID-19 infections but unusually high numbers of COVID-19-related GDELT articles or tweets, signals an anomaly. We use the seasonal hybrid extreme Studentized deviate test to detect such anomalies. This is followed by text analysis of news headlines from NewsBank and Google on the date of these anomalies to determine the probable event causing an anomaly, which includes elections, holidays, and protests.","PeriodicalId":38507,"journal":{"name":"Human Geographies","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Geographies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/geographies3030031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 1

Abstract

Social media platforms are valuable data sources in the study of public reactions to events such as natural disasters and epidemics. This research assesses for selected countries around the globe the time lag between daily reports of COVID-19 cases and GDELT (Global Database of Events, Language, and Tone) and Twitter (X) COVID-19 mentions between February 2020 and April 2021 using time series analysis. Results show that GDELT articles and tweets preceded COVID-19 infections in Australia, Brazil, France, Greece, India, Italy, the U.S., Canada, Germany, and the U.K., while for Poland and the Philippines, tweets preceded and GDELT articles lagged behind COVID-19 disease incidences, respectively. This shows that the application of social media and news data for surveillance and management of pandemics needs to be assessed on a case-by-case basis for different countries. It also points towards the applicability of time series data analysis for only a limited number of countries due to strict data requirements (e.g., stationarity). A deviation from generally observed lag patterns in a country, i.e., periods with low COVID-19 infections but unusually high numbers of COVID-19-related GDELT articles or tweets, signals an anomaly. We use the seasonal hybrid extreme Studentized deviate test to detect such anomalies. This is followed by text analysis of news headlines from NewsBank and Google on the date of these anomalies to determine the probable event causing an anomaly, which includes elections, holidays, and protests.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
每日COVID-19感染报告与相关GDELT和推特提及的时间关系
在研究公众对自然灾害和流行病等事件的反应时,社交媒体平台是宝贵的数据来源。本研究使用时间序列分析评估了全球选定国家在2020年2月至2021年4月期间每日COVID-19病例报告与GDELT(全球事件、语言和语气数据库)和Twitter (X) COVID-19提及之间的时间差。结果显示,在澳大利亚、巴西、法国、希腊、印度、意大利、美国、加拿大、德国和英国,GDELT文章和推文分别先于COVID-19疾病发病率,而在波兰和菲律宾,推文和GDELT文章分别滞后于COVID-19疾病发病率。这表明,在监测和管理大流行病方面应用社交媒体和新闻数据需要根据不同国家的具体情况进行评估。它还指出,由于严格的数据要求(例如,平稳性),时间序列数据分析只适用于有限数量的国家。一国与普遍观察到的滞后模式发生偏差,即在COVID-19感染率较低但与COVID-19相关的GDELT文章或推文数量异常高的时期,表明存在异常现象。我们使用季节性混合极端学生偏差检验来检测这些异常。接下来是对NewsBank和Google在这些异常日期的新闻标题进行文本分析,以确定导致异常的可能事件,包括选举、假日和抗议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Human Geographies
Human Geographies Social Sciences-Geography, Planning and Development
CiteScore
1.10
自引率
0.00%
发文量
7
审稿时长
8 weeks
期刊最新文献
Residents and Stakeholder Opinions on Township Tourism in Langa, Cape Town, South Africa Spatio-Temporal Dynamics and Physico-Hydrological Trends in Rainfall, Runoff and Land Use in Paraíba Watershed Perspectives on Advanced Technologies in Spatial Data Collection and Analysis Contemporary Challenges in Destination Planning: A Geographical Typology Approach Spatiotemporal Dengue Fever Incidence Associated with Climate in a Brazilian Tropical Region
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1