{"title":"[Construction of a visual intelligent identification model for <i>Oncomelania hupensis robertsoni</i> in Yunnan Province based on the EfficientNet-B4 model].","authors":"S Bai, J Zhou, Y Dong, J Zhang, L Shi, K Yang","doi":"10.16250/j.32.1915.2024194","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To construct a visual intelligent recognition model for <i>Oncomelania hupensis robertsoni</i> in Yunnan Province based on the EfficientNet-B4 model, and to evaluate the impact of data augmentation methods and model hyperparameters on the recognition of <i>O. hupensis robertsoni</i>.</p><p><strong>Methods: </strong>A total of 400 <i>O. hupensis robertsoni</i> and 400 <i>Tricula</i> snails were collected from Yongsheng County, Yunnan Province in June 2024, and snail images were captured following identification and classification of 300 <i>O. hupensis robertsoni</i> and 300 <i>Tricula</i> snails. A total of 925 <i>O. hupensis robertsoni</i> images and 1 062 <i>Tricula</i> snail images were collected as a dataset and divided into a training set and a validation set at a ratio of 8:2, while 352 images captured from the remaining 100 <i>O. hupensis robertsoni</i> and 354 images from the remaining 100 <i>Tricula</i> snails served as an external test set. All acquired images were subjected to preprocessing, including cropping and resizing. Three data augmentation approaches were employed, including baseline, Mixup and Gaussian blurring, and model hyperparameters included two optimization algorithms of adaptive moment estimation (Adam) and stochastic gradient descent (SGD), two loss functions of focal loss and cross entropy loss, and two learning rate decay strategies of cosine annealing and multi-step. The intelligent recognition models of <i>O. hupensis robertsoni</i> and <i>Tricula</i> snails were constructed based on the EfficientNet-B4 model, and 7 training strategy groups were generated by combinations of different data augmentation approaches and hyperparameters. The performance of intelligent recognition models was tested with external test sets, and evaluated with accuracy, precision, recall, F1 score, loss, Youden's index, and the area under the receiver operating characteristic curve (AUC) under different training strategies.</p><p><strong>Results: </strong>The variation of loss values was comparable among intelligent recognition models with different data augmentation approaches. The Group 4 model constructed with Mixup and Gaussian blurring data augmentation approaches showed the optimal performance, with an accuracy of 90.38%, precision of 90.07%, F1 score of 89.44%, Youden's index of 0.81 and AUC of 0.961 in the external test set. The accuracy of models using the SGD optimizer reduced by 29.16% as compared to those using the Adam optimizer (χ<sup>2</sup> = 81.325, <i>P</i> < 0.001), and the accuracy of models using the cross entropy loss function reduced by 0.80% as compared to the Group 4 model (χ<sup>2</sup> = 3.147, <i>P</i> > 0.05), while the accuracy of models using the multi-step learning rate decay strategy increased by 0.65% as compared to the Group 4 model (χ<sup>2</sup> = 0.208, <i>P</i> > 0.05). In addition, the model with the baseline + Mixup + Gaussianblurring data augmentation approach and hyperparameters of Adam optimizer, focal loss function and multi-step learning rate decay strategy showed the highest performance, with an accuracy of 91.03%, precision of 91.97%, recall of 88.11%, F1 score of 90.00%, Youden's index of 0.82 and AUC values of 0.969 in external test set, respectively.</p><p><strong>Conclusions: </strong>The intelligent recognition model of <i>O. hupensis robertsoni</i> based on EfficientNet-B4 model is accurate for identification of <i>O. hupensis robertsoni</i> and <i>Tricula</i> snails in Yunnan Province.</p>","PeriodicalId":38874,"journal":{"name":"中国血吸虫病防治杂志","volume":"36 6","pages":"555-561"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中国血吸虫病防治杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.16250/j.32.1915.2024194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To construct a visual intelligent recognition model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet-B4 model, and to evaluate the impact of data augmentation methods and model hyperparameters on the recognition of O. hupensis robertsoni.
Methods: A total of 400 O. hupensis robertsoni and 400 Tricula snails were collected from Yongsheng County, Yunnan Province in June 2024, and snail images were captured following identification and classification of 300 O. hupensis robertsoni and 300 Tricula snails. A total of 925 O. hupensis robertsoni images and 1 062 Tricula snail images were collected as a dataset and divided into a training set and a validation set at a ratio of 8:2, while 352 images captured from the remaining 100 O. hupensis robertsoni and 354 images from the remaining 100 Tricula snails served as an external test set. All acquired images were subjected to preprocessing, including cropping and resizing. Three data augmentation approaches were employed, including baseline, Mixup and Gaussian blurring, and model hyperparameters included two optimization algorithms of adaptive moment estimation (Adam) and stochastic gradient descent (SGD), two loss functions of focal loss and cross entropy loss, and two learning rate decay strategies of cosine annealing and multi-step. The intelligent recognition models of O. hupensis robertsoni and Tricula snails were constructed based on the EfficientNet-B4 model, and 7 training strategy groups were generated by combinations of different data augmentation approaches and hyperparameters. The performance of intelligent recognition models was tested with external test sets, and evaluated with accuracy, precision, recall, F1 score, loss, Youden's index, and the area under the receiver operating characteristic curve (AUC) under different training strategies.
Results: The variation of loss values was comparable among intelligent recognition models with different data augmentation approaches. The Group 4 model constructed with Mixup and Gaussian blurring data augmentation approaches showed the optimal performance, with an accuracy of 90.38%, precision of 90.07%, F1 score of 89.44%, Youden's index of 0.81 and AUC of 0.961 in the external test set. The accuracy of models using the SGD optimizer reduced by 29.16% as compared to those using the Adam optimizer (χ2 = 81.325, P < 0.001), and the accuracy of models using the cross entropy loss function reduced by 0.80% as compared to the Group 4 model (χ2 = 3.147, P > 0.05), while the accuracy of models using the multi-step learning rate decay strategy increased by 0.65% as compared to the Group 4 model (χ2 = 0.208, P > 0.05). In addition, the model with the baseline + Mixup + Gaussianblurring data augmentation approach and hyperparameters of Adam optimizer, focal loss function and multi-step learning rate decay strategy showed the highest performance, with an accuracy of 91.03%, precision of 91.97%, recall of 88.11%, F1 score of 90.00%, Youden's index of 0.82 and AUC values of 0.969 in external test set, respectively.
Conclusions: The intelligent recognition model of O. hupensis robertsoni based on EfficientNet-B4 model is accurate for identification of O. hupensis robertsoni and Tricula snails in Yunnan Province.
期刊介绍:
Chinese Journal of Schistosomiasis Control (ISSN: 1005-6661, CN: 32-1374/R), founded in 1989, is a technical and scientific journal under the supervision of Jiangsu Provincial Health Commission and organised by Jiangsu Institute of Schistosomiasis Control. It is a scientific and technical journal under the supervision of Jiangsu Provincial Health Commission and sponsored by Jiangsu Institute of Schistosomiasis Prevention and Control. The journal carries out the policy of prevention-oriented, control-oriented, nationwide and grassroots, adheres to the tenet of scientific research service for the prevention and treatment of schistosomiasis and other parasitic diseases, and mainly publishes academic papers reflecting the latest achievements and dynamics of prevention and treatment of schistosomiasis and other parasitic diseases, scientific research and management, etc. The main columns are Guest Contributions, Experts‘ Commentary, Experts’ Perspectives, Experts' Forums, Theses, Prevention and Treatment Research, Experimental Research, The main columns include Guest Contributions, Expert Commentaries, Expert Perspectives, Expert Forums, Treatises, Prevention and Control Studies, Experimental Studies, Clinical Studies, Prevention and Control Experiences, Prevention and Control Management, Reviews, Case Reports, and Information, etc. The journal is a useful reference material for the professional and technical personnel of schistosomiasis and parasitic disease prevention and control research, management workers, and teachers and students of medical schools.
The journal is now included in important domestic databases, such as Chinese Core List (8th edition), China Science Citation Database (Core Edition), China Science and Technology Core Journals (Statistical Source Journals), and is also included in MEDLINE/PubMed, Scopus, EBSCO, Chemical Abstract, Embase, Zoological Record, JSTChina, Ulrichsweb, Western Pacific Region Index Medicus, CABI and other international authoritative databases.