Deciphering the climate-malaria nexus: A machine learning approach in rural southeastern Tanzania

IF 3.2 3区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Public Health Pub Date : 2025-01-01 Epub Date: 2024-12-06 DOI:10.1016/j.puhe.2024.11.013
Jin-Xin Zheng , Shen-Ning Lu , Qin Li , Yue-Jin Li , Jin-Bo Xue , Tegemeo Gavana , Prosper Chaki , Ning Xiao , Yeromin Mlacha , Duo-Quan Wang , Xiao-Nong Zhou
{"title":"Deciphering the climate-malaria nexus: A machine learning approach in rural southeastern Tanzania","authors":"Jin-Xin Zheng ,&nbsp;Shen-Ning Lu ,&nbsp;Qin Li ,&nbsp;Yue-Jin Li ,&nbsp;Jin-Bo Xue ,&nbsp;Tegemeo Gavana ,&nbsp;Prosper Chaki ,&nbsp;Ning Xiao ,&nbsp;Yeromin Mlacha ,&nbsp;Duo-Quan Wang ,&nbsp;Xiao-Nong Zhou","doi":"10.1016/j.puhe.2024.11.013","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>Malaria remains a critical public health challenge, especially in regions like southeastern Tanzania. Understanding the intricate relationship between environmental factors and malaria incidence is essential for effective control and elimination strategies.</div></div><div><h3>Study design</h3><div>Cohort study.</div></div><div><h3>Methods</h3><div>This cohort study, conducted between Jan 2016 and October 2021 across three districts in southeastern Tanzania, utilized advanced machine learning techniques, specifically the Extreme Gradient Boosting (XGBoost) model, to examine the impact of climate factors on malaria incidence. SHapley Additive exPlanations (SHAP) values were applied to interpret model predictions, highlighting the roles of normalized difference vegetation index (NDVI), temperature, and rainfall in shaping malaria transmission dynamics.</div></div><div><h3>Results</h3><div>Analysis revealed considerable heterogeneity in malaria incidence across southeastern Tanzania, with Kibiti experiencing the highest number of cases (15,308) over the study period. Seasonal peaks corresponded with rainy periods, though incidence rates varied by district. Incorporating lagged climate variables and seasonal trends significantly improved forecast accuracy, with the one-month lag model achieving the lowest mean absolute error (MAE = 175.46) and root mean squared error (RMSE = 228.24). SHAP analysis identified seasonality (mean SHAP 29.6), followed by lagged temperature (13.8), rainfall (12.4), and NDVI (5.96), as the most influential factors, reflecting the biological underpinnings of malaria transmission.</div></div><div><h3>Conclusions</h3><div>This study demonstrates the utility of machine learning and explainable SHAP in malaria epidemiology, providing a data-driven framework to guide targeted, climate-informed malaria control strategies. By capturing seasonal and climate-linked risks, these methods hold promise for enhancing public health planning and adaptive response in malaria-endemic regions.</div></div>","PeriodicalId":49651,"journal":{"name":"Public Health","volume":"238 ","pages":"Pages 124-130"},"PeriodicalIF":3.2000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Public Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0033350624004839","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives

Malaria remains a critical public health challenge, especially in regions like southeastern Tanzania. Understanding the intricate relationship between environmental factors and malaria incidence is essential for effective control and elimination strategies.

Study design

Cohort study.

Methods

This cohort study, conducted between Jan 2016 and October 2021 across three districts in southeastern Tanzania, utilized advanced machine learning techniques, specifically the Extreme Gradient Boosting (XGBoost) model, to examine the impact of climate factors on malaria incidence. SHapley Additive exPlanations (SHAP) values were applied to interpret model predictions, highlighting the roles of normalized difference vegetation index (NDVI), temperature, and rainfall in shaping malaria transmission dynamics.

Results

Analysis revealed considerable heterogeneity in malaria incidence across southeastern Tanzania, with Kibiti experiencing the highest number of cases (15,308) over the study period. Seasonal peaks corresponded with rainy periods, though incidence rates varied by district. Incorporating lagged climate variables and seasonal trends significantly improved forecast accuracy, with the one-month lag model achieving the lowest mean absolute error (MAE = 175.46) and root mean squared error (RMSE = 228.24). SHAP analysis identified seasonality (mean SHAP 29.6), followed by lagged temperature (13.8), rainfall (12.4), and NDVI (5.96), as the most influential factors, reflecting the biological underpinnings of malaria transmission.

Conclusions

This study demonstrates the utility of machine learning and explainable SHAP in malaria epidemiology, providing a data-driven framework to guide targeted, climate-informed malaria control strategies. By capturing seasonal and climate-linked risks, these methods hold promise for enhancing public health planning and adaptive response in malaria-endemic regions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
破译气候-疟疾关系:坦桑尼亚东南部农村的机器学习方法。
目标:疟疾仍然是一项重大的公共卫生挑战,特别是在坦桑尼亚东南部等地区。了解环境因素与疟疾发病率之间的复杂关系对于有效控制和消除战略至关重要。研究设计:队列研究。方法:这项队列研究于2016年1月至2021年10月在坦桑尼亚东南部的三个地区进行,利用先进的机器学习技术,特别是极端梯度增强(XGBoost)模型,研究气候因素对疟疾发病率的影响。SHapley加性解释(SHAP)值用于解释模型预测,突出了归一化植被指数(NDVI)、温度和降雨在塑造疟疾传播动态中的作用。结果:分析显示坦桑尼亚东南部的疟疾发病率存在相当大的异质性,在研究期间,基比蒂的病例数最多(15308例)。季节性高峰与雨季相对应,但发病率因地区而异。考虑滞后气候变量和季节趋势显著提高了预测精度,其中1个月滞后模型的平均绝对误差(MAE = 175.46)和均方根误差(RMSE = 228.24)最低。SHAP分析发现,季节性(平均SHAP为29.6)、滞后温度(13.8)、降雨(12.4)和NDVI(5.96)是影响疟疾传播的最大因素,反映了疟疾传播的生物学基础。结论:本研究证明了机器学习和可解释的SHAP在疟疾流行病学中的效用,为指导有针对性的、气候知情的疟疾控制策略提供了数据驱动的框架。通过捕捉季节性和气候相关风险,这些方法有望加强疟疾流行地区的公共卫生规划和适应性应对。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Public Health
Public Health 医学-公共卫生、环境卫生与职业卫生
CiteScore
7.60
自引率
0.00%
发文量
280
审稿时长
37 days
期刊介绍: Public Health is an international, multidisciplinary peer-reviewed journal. It publishes original papers, reviews and short reports on all aspects of the science, philosophy, and practice of public health.
期刊最新文献
Operationalisation and findings of a First Few X (FFX) investigation of early mpox clade Ib cases and their close contacts in the United Kingdom Health effects of employment initiation and retirement in later life among Japanese older adults: A longitudinal study Healthcare practitioners’ engagement with non-specialists: A scoping review Effectiveness of tailored public health messages for vulnerable populations: A randomised controlled trial Comparing the riskiness and determinants of non-adherence to five quarantine and isolation guidelines: A dynamic cohort study during COVID-19
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1