Investigating Google Trends to forecast acute febrile illness outbreaks in North India reported through the Integrated Disease Surveillance Program.

IF 3 3区 医学 Q2 INFECTIOUS DISEASES BMC Infectious Diseases Pub Date : 2025-03-28 DOI:10.1186/s12879-025-10801-0
Madhur Verma, Kamal Kishore, Pragyan Paramita Parija, Soumya Swaroop Sahoo, Dolly Gambhir, Usha Gupta, Rakesh Kakkar
{"title":"Investigating Google Trends to forecast acute febrile illness outbreaks in North India reported through the Integrated Disease Surveillance Program.","authors":"Madhur Verma, Kamal Kishore, Pragyan Paramita Parija, Soumya Swaroop Sahoo, Dolly Gambhir, Usha Gupta, Rakesh Kakkar","doi":"10.1186/s12879-025-10801-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Acute Febrile Illness (AFI) like Malaria, Dengue, Chikungunya, and Enteric fever still remain the most common cause of seeking healthcare in low-middle-income countries and need to be constantly monitored for any impending outbreak. Digital epidemiology promises to assist traditional health surveillance. The health data (including AFI) collated by Google using specialised platforms like Google Trends (GT) is known to correlate with actual disease trends. The present study thus aims to assess the potential of GT to support routine surveillance system and forecast AFI outbreaks reported through the Indian Integrated Disease Surveillance Programme (IDSP).</p><p><strong>Methods: </strong>We utilised Haryana's IDSP portal to retrieve the weekly data of the most commonly reported infectious diseases causing AFI between 2011 and 2020. Internet search trends were downloaded using GT. Descriptive statistics estimated the burden of the AFI and Bland-Altman's plot depicted statistical agreement between the two. We adopted the Box-Jenkins approach to attain the final SARIMA model and explain the time-dependent weekly incidence of AFI.</p><p><strong>Results: </strong>The time series plot of the reported AFI displayed trends. Martin- Bland plots depicted acceptable agreement between two datasets for all Chikungunya and Dengue. Among the models evaluated, the Malaria model [SARIMA(1,1,1)(1,1,1)] demonstrated the best performance with a balanced fit and reasonable accuracy, while the Enteric Fever model [SARIMA(0,1,0)(1,1,1)] exhibited low prediction error but weak seasonal significance. In contrast, the Dengue [SARIMA(1,1,0)(1,1,0)] and Chikungunya [ARIMA(1,0,0)(0,0,0)] models had high forecast errors, limiting their predictive reliability. Overall, GT supplemented the prediction performance of the SARIMA models with adjusted R<sup>2</sup> of 46%, 50%, 50%, and 52% compared to the original 43%, 49%, 20%, and 48%.</p><p><strong>Conclusions: </strong>Our study observed modest improvements in GT-based SARIMA forecasting models compared to routine IDSP mechanisms for predicting AFI outbreaks in Haryana, highlighting the potential for further enhancement. As more granular GT data becomes available, its integration with traditional surveillance systems could significantly enhance forecasting accuracy for AFI and other infectious disease outbreaks. At no additional cost to the health system, GT can serve as a valuable, real-time digital epidemiology tool, strengthening public health preparedness and enabling timely interventions for the early containment of emerging diseases.</p>","PeriodicalId":8981,"journal":{"name":"BMC Infectious Diseases","volume":"25 1","pages":"431"},"PeriodicalIF":3.0000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11951705/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Infectious Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12879-025-10801-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Acute Febrile Illness (AFI) like Malaria, Dengue, Chikungunya, and Enteric fever still remain the most common cause of seeking healthcare in low-middle-income countries and need to be constantly monitored for any impending outbreak. Digital epidemiology promises to assist traditional health surveillance. The health data (including AFI) collated by Google using specialised platforms like Google Trends (GT) is known to correlate with actual disease trends. The present study thus aims to assess the potential of GT to support routine surveillance system and forecast AFI outbreaks reported through the Indian Integrated Disease Surveillance Programme (IDSP).

Methods: We utilised Haryana's IDSP portal to retrieve the weekly data of the most commonly reported infectious diseases causing AFI between 2011 and 2020. Internet search trends were downloaded using GT. Descriptive statistics estimated the burden of the AFI and Bland-Altman's plot depicted statistical agreement between the two. We adopted the Box-Jenkins approach to attain the final SARIMA model and explain the time-dependent weekly incidence of AFI.

Results: The time series plot of the reported AFI displayed trends. Martin- Bland plots depicted acceptable agreement between two datasets for all Chikungunya and Dengue. Among the models evaluated, the Malaria model [SARIMA(1,1,1)(1,1,1)] demonstrated the best performance with a balanced fit and reasonable accuracy, while the Enteric Fever model [SARIMA(0,1,0)(1,1,1)] exhibited low prediction error but weak seasonal significance. In contrast, the Dengue [SARIMA(1,1,0)(1,1,0)] and Chikungunya [ARIMA(1,0,0)(0,0,0)] models had high forecast errors, limiting their predictive reliability. Overall, GT supplemented the prediction performance of the SARIMA models with adjusted R2 of 46%, 50%, 50%, and 52% compared to the original 43%, 49%, 20%, and 48%.

Conclusions: Our study observed modest improvements in GT-based SARIMA forecasting models compared to routine IDSP mechanisms for predicting AFI outbreaks in Haryana, highlighting the potential for further enhancement. As more granular GT data becomes available, its integration with traditional surveillance systems could significantly enhance forecasting accuracy for AFI and other infectious disease outbreaks. At no additional cost to the health system, GT can serve as a valuable, real-time digital epidemiology tool, strengthening public health preparedness and enabling timely interventions for the early containment of emerging diseases.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
调查谷歌趋势,预测印度北部通过综合疾病监测计划报告的急性发热性疾病爆发。
背景:急性发热性疾病(AFI),如疟疾、登革热、基孔肯雅热和肠热,仍然是中低收入国家寻求医疗保健的最常见原因,需要不断监测任何即将发生的疫情。数字流行病学有望协助传统的卫生监测。谷歌使用谷歌Trends (GT)等专门平台整理的健康数据(包括AFI)已知与实际疾病趋势相关。因此,本研究旨在评估GT支持常规监测系统和预测通过印度综合疾病监测计划(IDSP)报告的AFI暴发的潜力。方法:利用哈里亚纳邦IDSP门户网站检索2011 - 2020年期间最常报告的导致AFI的传染病的每周数据。使用GT下载互联网搜索趋势。描述性统计估计了AFI的负担,Bland-Altman的图描述了两者之间的统计一致性。我们采用Box-Jenkins方法来获得最终的SARIMA模型,并解释AFI的周发病率随时间的变化。结果:报告的AFI时间序列图具有一定的趋势。Martin- Bland图描述了基孔肯雅热和登革热的两个数据集之间可接受的一致性。其中,疟疾模型[SARIMA(1,1,1)(1,1,1))]表现最佳,拟合平衡,精度合理;Enteric Fever模型[SARIMA(0,1,0)(1,1,1)]预测误差较小,但季节性显著性较弱。登革热[SARIMA(1,1,0)(1,1,0)]和基孔肯雅[ARIMA(1,0,0)(0,0,0)]模型预测误差较大,影响了预测的可靠性。总体而言,GT补充了SARIMA模型的预测性能,调整后的R2分别为46%、50%、50%和52%,而原始模型的R2分别为43%、49%、20%和48%。结论:我们的研究发现,与常规的IDSP机制相比,基于gt的SARIMA预测模型在预测哈里亚纳邦AFI暴发方面有适度的改进,这突出了进一步增强的潜力。随着更细粒度的GT数据变得可用,它与传统监测系统的整合可以显著提高AFI和其他传染病暴发的预测准确性。在不增加卫生系统费用的情况下,全球遗传技术可以作为一种有价值的实时数字流行病学工具,加强公共卫生准备,并为早期控制新出现的疾病及时采取干预措施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Infectious Diseases
BMC Infectious Diseases 医学-传染病学
CiteScore
6.50
自引率
0.00%
发文量
860
审稿时长
3.3 months
期刊介绍: BMC Infectious Diseases is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of infectious and sexually transmitted diseases in humans, as well as related molecular genetics, pathophysiology, and epidemiology.
期刊最新文献
Infectious disease complications associated with opioid use disorder at a southern county hospital: a retrospective chart review. India ink-negative and culture-negative cryptococcal meningitis in HIV-negative patients. Deep learning on meta-analytic data for therapeutic decision-making in central nervous system aspergillosis. Admission fibrinogen-to-albumin ratio and mortality in older adults hospitalized with pneumonia: a retrospective cohort study. Resurgence and clinical evolution of influenza A in Chinese children: shifting epidemiology and serological dynamics across pre-, intra-, and post-pandemic eras (2019-2023).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1