Z. Movahedi Nia , L. Seyyed-Kalantari , M. Goitom , B. Mellado , A. Ahmadi , A. Asgary , J. Orbinski , J. Wu , J.D. Kong
{"title":"Leveraging deep-learning and unconventional data for real-time surveillance, forecasting, and early warning of respiratory pathogens outbreak","authors":"Z. Movahedi Nia , L. Seyyed-Kalantari , M. Goitom , B. Mellado , A. Ahmadi , A. Asgary , J. Orbinski , J. Wu , J.D. Kong","doi":"10.1016/j.artmed.2025.103076","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Controlling re-emerging outbreaks such as COVID-19 is a critical concern to global health. Disease forecasting solutions are extremely beneficial to public health emergency management. This work aims to design and deploy a framework for real-time surveillance, prediction, forecasting, and early warning of respiratory disease. To this end, we selected southern African countries and Canadian provinces, along with COVID-19 and influenza as our case studies.</div></div><div><h3>Methodology</h3><div>Six different datasets were collected for different provinces of Canada: number of influenza cases, number of COVID-19 cases, Google Trends, Reddit posts, satellite air quality data, and weather data. Moreover, five different data sources were collected for southern African countries whose COVID-19 number of cases were significantly correlated with each other: number of COVID-19 infections, Google Trends, Wiki Trends, Google News, and satellite air quality data. For each infectious disease, i.e. COVID-19 and Influenza for Canada and COVID-19 for southern African countries, data was processed, scaled, and fed into the deep learning model which included four layers, namely, a Convolutional Neural Network (CNN), a Graph Neural Network (GNN), a Gated Recurrent Unit (GRU), and a linear Neural Network (NN). Hyperparameters were optimized to provide an accurate 56-day-ahead prediction of the number of cases.</div></div><div><h3>Result</h3><div>The accuracy of our models in real-time surveillance, prediction, forecasting, and early warning of respiratory diseases are evaluated against state-of-the-art models, through Root Mean Square Error (RMSE), coefficient of determination (R2-score), and correlation coefficient. Our model improves R2-score, RMSE, and correlation by up to 55.98 %, 39.71 %, and 44.47 % for 56 days-ahead COVID-19 prediction in Ontario, 34.87 %, 25.52 %, 50.91 % for 8 weeks-ahead influenza prediction in Quebec, and 51.04 %, 32.04 %, and 28.74 % for 56 days-ahead COVID-19 prediction in South Africa, respectively.</div></div><div><h3>Conclusion</h3><div>This work presents a framework that automatically collects data from unconventional sources, and builds an early warning system for COVID-19 and influenza outbreaks. The result is extremely helpful to policy-makers and health officials for preparedness and rapid response against future outbreaks.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"161 ","pages":"Article 103076"},"PeriodicalIF":6.1000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0933365725000119","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Controlling re-emerging outbreaks such as COVID-19 is a critical concern to global health. Disease forecasting solutions are extremely beneficial to public health emergency management. This work aims to design and deploy a framework for real-time surveillance, prediction, forecasting, and early warning of respiratory disease. To this end, we selected southern African countries and Canadian provinces, along with COVID-19 and influenza as our case studies.
Methodology
Six different datasets were collected for different provinces of Canada: number of influenza cases, number of COVID-19 cases, Google Trends, Reddit posts, satellite air quality data, and weather data. Moreover, five different data sources were collected for southern African countries whose COVID-19 number of cases were significantly correlated with each other: number of COVID-19 infections, Google Trends, Wiki Trends, Google News, and satellite air quality data. For each infectious disease, i.e. COVID-19 and Influenza for Canada and COVID-19 for southern African countries, data was processed, scaled, and fed into the deep learning model which included four layers, namely, a Convolutional Neural Network (CNN), a Graph Neural Network (GNN), a Gated Recurrent Unit (GRU), and a linear Neural Network (NN). Hyperparameters were optimized to provide an accurate 56-day-ahead prediction of the number of cases.
Result
The accuracy of our models in real-time surveillance, prediction, forecasting, and early warning of respiratory diseases are evaluated against state-of-the-art models, through Root Mean Square Error (RMSE), coefficient of determination (R2-score), and correlation coefficient. Our model improves R2-score, RMSE, and correlation by up to 55.98 %, 39.71 %, and 44.47 % for 56 days-ahead COVID-19 prediction in Ontario, 34.87 %, 25.52 %, 50.91 % for 8 weeks-ahead influenza prediction in Quebec, and 51.04 %, 32.04 %, and 28.74 % for 56 days-ahead COVID-19 prediction in South Africa, respectively.
Conclusion
This work presents a framework that automatically collects data from unconventional sources, and builds an early warning system for COVID-19 and influenza outbreaks. The result is extremely helpful to policy-makers and health officials for preparedness and rapid response against future outbreaks.
期刊介绍:
Artificial Intelligence in Medicine publishes original articles from a wide variety of interdisciplinary perspectives concerning the theory and practice of artificial intelligence (AI) in medicine, medically-oriented human biology, and health care.
Artificial intelligence in medicine may be characterized as the scientific discipline pertaining to research studies, projects, and applications that aim at supporting decision-based medical tasks through knowledge- and/or data-intensive computer-based solutions that ultimately support and improve the performance of a human care provider.