确定雪兰莪州的钩端螺旋体病热点:利用遥感揭示气候联系并开发预测模型。

IF 2.4 3区 生物学 Q2 MULTIDISCIPLINARY SCIENCES PeerJ Pub Date : 2025-03-05 eCollection Date: 2025-01-01 DOI:10.7717/peerj.18851
Muhammad Akram Ab Kadir, Rosliza Abdul Manaf, Siti Aisah Mokhtar, Luthffi Idzhar Ismail
{"title":"确定雪兰莪州的钩端螺旋体病热点:利用遥感揭示气候联系并开发预测模型。","authors":"Muhammad Akram Ab Kadir, Rosliza Abdul Manaf, Siti Aisah Mokhtar, Luthffi Idzhar Ismail","doi":"10.7717/peerj.18851","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Leptospirosis is an endemic disease in countries with tropical climates such as South America, Southern Asia, and Southeast Asia. There has been an increase in leptospirosis incidence in Malaysia from 1.45 to 25.94 cases per 100,000 population between 2005 and 2014. With increasing incidence in Selangor, Malaysia, and frequent climate change dynamics, a study on the disease hotspot areas and their association with the hydroclimatic factors could enhance disease surveillance and public health interventions.</p><p><strong>Methods: </strong>This ecological cross-sectional study utilised a geographic information system (GIS) and remote sensing techniques to analyse the spatiotemporal distribution of leptospirosis in Selangor from 2011 to 2019. Laboratory-confirmed leptospirosis cases (<i>n</i> = 1,045) were obtained from the Selangor State Health Department. Using ArcGIS Pro, spatial autocorrelation analysis (Moran's I) and Getis-Ord Gi* (hotspot analysis) was conducted to identify hotspots based on the monthly aggregated cases for each subdistrict. Satellite-derived rainfall and land surface temperature (LST) data were acquired from NASA's Giovanni EarthData website and processed into monthly averages. These data were integrated into ArcGIS Pro as thematic layers. Machine learning algorithms, including support vector machine (SVM), Random Forest (RF), and light gradient boosting machine (LGBM) were employed to develop predictive models for leptospirosis hotspot areas. Model performance was then evaluated using cross-validation and metrics such as accuracy, precision, sensitivity, and F1-score.</p><p><strong>Results: </strong>Moran's I analysis revealed a primarily random distribution of cases across Selangor, with only 20 out of 103 observed having a clustered distribution. Meanwhile, hotspot areas were mainly scattered in subdistricts throughout Selangor with clustering in the central region. Machine learning analysis revealed that the LGBM algorithm had the best performance scores compared to having a cross-validation score of 0.61, a precision score of 0.16, and an F1-score of 0.23. The feature importance score indicated river water level and rainfall contributes most to the model.</p><p><strong>Conclusions: </strong>This GIS-based study identified a primarily sporadic occurrence of leptospirosis in Selangor with minimal spatial clustering. The LGBM algorithm effectively predicted leptospirosis hotspots based on the analysed hydroclimatic factors. The integration of GIS and machine learning offers a promising framework for disease surveillance, facilitating targeted public health interventions in areas at high risk for leptospirosis.</p>","PeriodicalId":19799,"journal":{"name":"PeerJ","volume":"13 ","pages":"e18851"},"PeriodicalIF":2.4000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11890033/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identifying leptospirosis hotspots in Selangor: uncovering climatic connections using remote sensing and developing a predictive model.\",\"authors\":\"Muhammad Akram Ab Kadir, Rosliza Abdul Manaf, Siti Aisah Mokhtar, Luthffi Idzhar Ismail\",\"doi\":\"10.7717/peerj.18851\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Leptospirosis is an endemic disease in countries with tropical climates such as South America, Southern Asia, and Southeast Asia. There has been an increase in leptospirosis incidence in Malaysia from 1.45 to 25.94 cases per 100,000 population between 2005 and 2014. With increasing incidence in Selangor, Malaysia, and frequent climate change dynamics, a study on the disease hotspot areas and their association with the hydroclimatic factors could enhance disease surveillance and public health interventions.</p><p><strong>Methods: </strong>This ecological cross-sectional study utilised a geographic information system (GIS) and remote sensing techniques to analyse the spatiotemporal distribution of leptospirosis in Selangor from 2011 to 2019. Laboratory-confirmed leptospirosis cases (<i>n</i> = 1,045) were obtained from the Selangor State Health Department. Using ArcGIS Pro, spatial autocorrelation analysis (Moran's I) and Getis-Ord Gi* (hotspot analysis) was conducted to identify hotspots based on the monthly aggregated cases for each subdistrict. Satellite-derived rainfall and land surface temperature (LST) data were acquired from NASA's Giovanni EarthData website and processed into monthly averages. These data were integrated into ArcGIS Pro as thematic layers. Machine learning algorithms, including support vector machine (SVM), Random Forest (RF), and light gradient boosting machine (LGBM) were employed to develop predictive models for leptospirosis hotspot areas. Model performance was then evaluated using cross-validation and metrics such as accuracy, precision, sensitivity, and F1-score.</p><p><strong>Results: </strong>Moran's I analysis revealed a primarily random distribution of cases across Selangor, with only 20 out of 103 observed having a clustered distribution. Meanwhile, hotspot areas were mainly scattered in subdistricts throughout Selangor with clustering in the central region. Machine learning analysis revealed that the LGBM algorithm had the best performance scores compared to having a cross-validation score of 0.61, a precision score of 0.16, and an F1-score of 0.23. The feature importance score indicated river water level and rainfall contributes most to the model.</p><p><strong>Conclusions: </strong>This GIS-based study identified a primarily sporadic occurrence of leptospirosis in Selangor with minimal spatial clustering. The LGBM algorithm effectively predicted leptospirosis hotspots based on the analysed hydroclimatic factors. The integration of GIS and machine learning offers a promising framework for disease surveillance, facilitating targeted public health interventions in areas at high risk for leptospirosis.</p>\",\"PeriodicalId\":19799,\"journal\":{\"name\":\"PeerJ\",\"volume\":\"13 \",\"pages\":\"e18851\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11890033/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PeerJ\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.7717/peerj.18851\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7717/peerj.18851","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

背景:钩端螺旋体病是南美洲、南亚和东南亚等热带气候国家的一种地方病。2005年至2014年间,马来西亚的钩端螺旋体病发病率从每10万人1.45例增加到25.94例。随着马来西亚雪兰莪州发病率的增加和气候变化的频繁,研究疾病热点地区及其与水文气候因子的关系可以加强疾病监测和公共卫生干预。方法:采用地理信息系统(GIS)和遥感技术对2011 - 2019年雪兰莪州钩端螺旋体病的时空分布进行生态横断面研究。实验室确认的钩端螺旋体病病例(n = 1,045)是从雪兰莪州卫生局获得的。利用ArcGIS Pro软件,通过Moran’s I和Getis-Ord Gi*(热点分析)对各区逐月汇总病例进行热点识别。卫星获得的降雨量和地表温度数据来自美国宇航局的乔瓦尼地球数据网站,并处理成月平均值。这些数据作为主题层集成到ArcGIS Pro中。采用支持向量机(SVM)、随机森林(RF)和光梯度增强机(LGBM)等机器学习算法建立钩端螺旋体病热点地区预测模型。然后使用交叉验证和指标(如准确性、精密度、灵敏度和f1评分)评估模型性能。结果:Moran的I分析显示,整个雪兰莪州的病例主要是随机分布,103例中只有20例观察到集群分布。与此同时,热点区域主要分散在雪兰莪州的街道上,集中在中部地区。机器学习分析显示,LGBM算法的交叉验证得分为0.61,精度得分为0.16,f1得分为0.23,性能得分最高。特征重要性得分表明,河流水位和降雨对模型的贡献最大。结论:这项基于gis的研究发现,在雪兰莪州,钩端螺旋体病主要是散发性的,具有最小的空间聚集性。LGBM算法基于分析的水文气候因子有效预测钩端螺旋体病热点。地理信息系统和机器学习的结合为疾病监测提供了一个有希望的框架,促进了钩端螺旋体病高风险地区有针对性的公共卫生干预。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Identifying leptospirosis hotspots in Selangor: uncovering climatic connections using remote sensing and developing a predictive model.

Background: Leptospirosis is an endemic disease in countries with tropical climates such as South America, Southern Asia, and Southeast Asia. There has been an increase in leptospirosis incidence in Malaysia from 1.45 to 25.94 cases per 100,000 population between 2005 and 2014. With increasing incidence in Selangor, Malaysia, and frequent climate change dynamics, a study on the disease hotspot areas and their association with the hydroclimatic factors could enhance disease surveillance and public health interventions.

Methods: This ecological cross-sectional study utilised a geographic information system (GIS) and remote sensing techniques to analyse the spatiotemporal distribution of leptospirosis in Selangor from 2011 to 2019. Laboratory-confirmed leptospirosis cases (n = 1,045) were obtained from the Selangor State Health Department. Using ArcGIS Pro, spatial autocorrelation analysis (Moran's I) and Getis-Ord Gi* (hotspot analysis) was conducted to identify hotspots based on the monthly aggregated cases for each subdistrict. Satellite-derived rainfall and land surface temperature (LST) data were acquired from NASA's Giovanni EarthData website and processed into monthly averages. These data were integrated into ArcGIS Pro as thematic layers. Machine learning algorithms, including support vector machine (SVM), Random Forest (RF), and light gradient boosting machine (LGBM) were employed to develop predictive models for leptospirosis hotspot areas. Model performance was then evaluated using cross-validation and metrics such as accuracy, precision, sensitivity, and F1-score.

Results: Moran's I analysis revealed a primarily random distribution of cases across Selangor, with only 20 out of 103 observed having a clustered distribution. Meanwhile, hotspot areas were mainly scattered in subdistricts throughout Selangor with clustering in the central region. Machine learning analysis revealed that the LGBM algorithm had the best performance scores compared to having a cross-validation score of 0.61, a precision score of 0.16, and an F1-score of 0.23. The feature importance score indicated river water level and rainfall contributes most to the model.

Conclusions: This GIS-based study identified a primarily sporadic occurrence of leptospirosis in Selangor with minimal spatial clustering. The LGBM algorithm effectively predicted leptospirosis hotspots based on the analysed hydroclimatic factors. The integration of GIS and machine learning offers a promising framework for disease surveillance, facilitating targeted public health interventions in areas at high risk for leptospirosis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
PeerJ
PeerJ MULTIDISCIPLINARY SCIENCES-
CiteScore
4.70
自引率
3.70%
发文量
1665
审稿时长
10 weeks
期刊介绍: PeerJ is an open access peer-reviewed scientific journal covering research in the biological and medical sciences. At PeerJ, authors take out a lifetime publication plan (for as little as $99) which allows them to publish articles in the journal for free, forever. PeerJ has 5 Nobel Prize Winners on the Board; they have won several industry and media awards; and they are widely recognized as being one of the most interesting recent developments in academic publishing.
期刊最新文献
Investigating potassium silicate efficacy and mechanisms for improving the strawberry agronomic traits and gray mold fungal resistance. Integrated analysis identifies a palmitoylation-associated prognostic model (ACSM5/SKA3) for lung adenocarcinoma across multiple cohorts. Smoking-induced changes in saliva composition and tongue coating. Caring for patients with multimorbidity: moral distress and life satisfaction among doctors and nurses in Portugal. A novel GAM-based method for predicting phosphate (PO4) concentrations in irrigated drainage/return-flow systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1