Infestation risk of the intermediate snail host of Schistosoma japonicum in the Yangtze River Basin: improved results by spatial reassessment and a random forest approach.
Jin-Xin Zheng, Shang Xia, Shan Lv, Yi Zhang, Robert Bergquist, Xiao-Nong Zhou
{"title":"Infestation risk of the intermediate snail host of Schistosoma japonicum in the Yangtze River Basin: improved results by spatial reassessment and a random forest approach.","authors":"Jin-Xin Zheng, Shang Xia, Shan Lv, Yi Zhang, Robert Bergquist, Xiao-Nong Zhou","doi":"10.1186/s40249-021-00852-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Oncomelania hupensis is only intermediate snail host of Schistosoma japonicum, and distribution of O. hupensis is an important indicator for the surveillance of schistosomiasis. This study explored the feasibility of a random forest algorithm weighted by spatial distance for risk prediction of schistosomiasis distribution in the Yangtze River Basin in China, with the aim to produce an improved precision reference for the national schistosomiasis control programme by reducing the number of snail survey sites without losing predictive accuracy.</p><p><strong>Methods: </strong>The snail presence and absence records were collected from Anhui, Hunan, Hubei, Jiangxi and Jiangsu provinces in 2018. A machine learning of random forest algorithm based on a set of environmental and climatic variables was developed to predict the breeding sites of the O. hupensis intermediated snail host of S. japonicum. Different spatial sizes of a hexagonal grid system were compared to estimate the need for required snail sampling sites. The predictive accuracy related to geographic distances between snail sampling sites was estimated by calculating Kappa and the area under the curve (AUC).</p><p><strong>Results: </strong>The highest accuracy (AUC = 0.889 and Kappa = 0.618) was achieved at the 5 km distance weight. The five factors with the strongest correlation to O. hupensis infestation probability were: (1) distance to lake (48.9%), (2) distance to river (36.6%), (3) isothermality (29.5%), (4) mean daily difference in temperature (28.1%), and (5) altitude (26.0%). The risk map showed that areas characterized by snail infestation were mainly located along the Yangtze River, with the highest probability in the dividing, slow-flowing river arms in the middle and lower reaches of the Yangtze River in Anhui, followed by areas near the shores of China's two main lakes, the Dongting Lake in Hunan and Hubei and the Poyang Lake in Jiangxi.</p><p><strong>Conclusions: </strong>Applying the machine learning of random forest algorithm made it feasible to precisely predict snail infestation probability, an approach that could improve the sensitivity of the Chinese schistosome surveillance system. Redesign of the snail surveillance system by spatial bias correction of O. hupensis infestation in the Yangtze River Basin to reduce the number of sites required to investigate from 2369 to 1747.</p>","PeriodicalId":13587,"journal":{"name":"Infectious Diseases of Poverty","volume":"10 1","pages":"74"},"PeriodicalIF":4.8000,"publicationDate":"2021-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8135174/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infectious Diseases of Poverty","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40249-021-00852-1","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Oncomelania hupensis is only intermediate snail host of Schistosoma japonicum, and distribution of O. hupensis is an important indicator for the surveillance of schistosomiasis. This study explored the feasibility of a random forest algorithm weighted by spatial distance for risk prediction of schistosomiasis distribution in the Yangtze River Basin in China, with the aim to produce an improved precision reference for the national schistosomiasis control programme by reducing the number of snail survey sites without losing predictive accuracy.
Methods: The snail presence and absence records were collected from Anhui, Hunan, Hubei, Jiangxi and Jiangsu provinces in 2018. A machine learning of random forest algorithm based on a set of environmental and climatic variables was developed to predict the breeding sites of the O. hupensis intermediated snail host of S. japonicum. Different spatial sizes of a hexagonal grid system were compared to estimate the need for required snail sampling sites. The predictive accuracy related to geographic distances between snail sampling sites was estimated by calculating Kappa and the area under the curve (AUC).
Results: The highest accuracy (AUC = 0.889 and Kappa = 0.618) was achieved at the 5 km distance weight. The five factors with the strongest correlation to O. hupensis infestation probability were: (1) distance to lake (48.9%), (2) distance to river (36.6%), (3) isothermality (29.5%), (4) mean daily difference in temperature (28.1%), and (5) altitude (26.0%). The risk map showed that areas characterized by snail infestation were mainly located along the Yangtze River, with the highest probability in the dividing, slow-flowing river arms in the middle and lower reaches of the Yangtze River in Anhui, followed by areas near the shores of China's two main lakes, the Dongting Lake in Hunan and Hubei and the Poyang Lake in Jiangxi.
Conclusions: Applying the machine learning of random forest algorithm made it feasible to precisely predict snail infestation probability, an approach that could improve the sensitivity of the Chinese schistosome surveillance system. Redesign of the snail surveillance system by spatial bias correction of O. hupensis infestation in the Yangtze River Basin to reduce the number of sites required to investigate from 2369 to 1747.
期刊介绍:
Infectious Diseases of Poverty is a peer-reviewed, open access journal that focuses on essential public health questions related to infectious diseases of poverty. It covers a wide range of topics and methods, including the biology of pathogens and vectors, diagnosis and detection, treatment and case management, epidemiology and modeling, zoonotic hosts and animal reservoirs, control strategies and implementation, new technologies, and their application.
The journal also explores the impact of transdisciplinary or multisectoral approaches on health systems, ecohealth, environmental management, and innovative technologies. It aims to provide a platform for the exchange of research and ideas that can contribute to the improvement of public health in resource-limited settings.
In summary, Infectious Diseases of Poverty aims to address the urgent challenges posed by infectious diseases in impoverished populations. By publishing high-quality research in various areas, the journal seeks to advance our understanding of these diseases and contribute to the development of effective strategies for prevention, diagnosis, and treatment.