Using supervised machine learning algorithms to predict bovine leukemia virus seropositivity in dairy cattle in Florida: A 10-year retrospective study.

IF 2.2 2区 农林科学 Q1 VETERINARY SCIENCES Preventive veterinary medicine Pub Date : 2024-11-28 DOI:10.1016/j.prevetmed.2024.106387
Ameer A Megahed, Reddy Bommineni, Michael Short, Klibs N Galvão, João H J Bittar
{"title":"Using supervised machine learning algorithms to predict bovine leukemia virus seropositivity in dairy cattle in Florida: A 10-year retrospective study.","authors":"Ameer A Megahed, Reddy Bommineni, Michael Short, Klibs N Galvão, João H J Bittar","doi":"10.1016/j.prevetmed.2024.106387","DOIUrl":null,"url":null,"abstract":"<p><p>Supervised machine-learning (SML) algorithms are potentially powerful tools that may be used for screening cows for infectious diseases such as bovine leukemia virus (BLV) infection. Here, we compared six different SML algorithms to identify the most important risk factors for predicting BLV seropositivity in dairy cattle in Florida. We used a dataset of 1279 dairy blood sample records from the Bronson Animal Disease Diagnostic Laboratory that were submitted for BLV antibody testing from 2012 to 2022. The SML algorithms that we used were logistic regression (LR), decision tree (DT), gradient boosting (GB), random forest (RF), neural network (NN), and support vector machine (SVM). A total of 312 serum samples were positive for BLV with corrected seroprevalence of 26.0 %. Subject to limitations of the analyzed retrospective data, the RF model was the best model for predicting BLV seropositivity in dairy cattle indicated by the highest Kolmogorov-Smirnov (KS) statistic of 0.75, area under the receiver operating characteristic (AUROC) of 0.93, gain of 2.6; and lowest misclassification rate of 0.10. The LR model was the worst. The RF model showed that the best predictors for BLV seropositivity were age (dairy cows of age ≥ 5 years) and geographic location (southern Florida). We concluded that the RF and other SML algorithms hold promise for predicting BLV seropositivity in dairy cattle and that dairy cattle 5 years of age or older raised in southern Florida have a higher likelihood of testing positive for BLV. This study makes an important methodological contribution to the needed development of predictive tools for effective screening for BLV infection and emphasizes the importance of collecting and using representative data in such predictive models.</p>","PeriodicalId":20413,"journal":{"name":"Preventive veterinary medicine","volume":"235 ","pages":"106387"},"PeriodicalIF":2.2000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Preventive veterinary medicine","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1016/j.prevetmed.2024.106387","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"VETERINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Supervised machine-learning (SML) algorithms are potentially powerful tools that may be used for screening cows for infectious diseases such as bovine leukemia virus (BLV) infection. Here, we compared six different SML algorithms to identify the most important risk factors for predicting BLV seropositivity in dairy cattle in Florida. We used a dataset of 1279 dairy blood sample records from the Bronson Animal Disease Diagnostic Laboratory that were submitted for BLV antibody testing from 2012 to 2022. The SML algorithms that we used were logistic regression (LR), decision tree (DT), gradient boosting (GB), random forest (RF), neural network (NN), and support vector machine (SVM). A total of 312 serum samples were positive for BLV with corrected seroprevalence of 26.0 %. Subject to limitations of the analyzed retrospective data, the RF model was the best model for predicting BLV seropositivity in dairy cattle indicated by the highest Kolmogorov-Smirnov (KS) statistic of 0.75, area under the receiver operating characteristic (AUROC) of 0.93, gain of 2.6; and lowest misclassification rate of 0.10. The LR model was the worst. The RF model showed that the best predictors for BLV seropositivity were age (dairy cows of age ≥ 5 years) and geographic location (southern Florida). We concluded that the RF and other SML algorithms hold promise for predicting BLV seropositivity in dairy cattle and that dairy cattle 5 years of age or older raised in southern Florida have a higher likelihood of testing positive for BLV. This study makes an important methodological contribution to the needed development of predictive tools for effective screening for BLV infection and emphasizes the importance of collecting and using representative data in such predictive models.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
有监督的机器学习(SML)算法是一种潜在的强大工具,可用于筛查奶牛是否感染牛白血病病毒(BLV)等传染病。在此,我们比较了六种不同的 SML 算法,以确定预测佛罗里达州奶牛 BLV 血清阳性的最重要风险因素。我们使用了来自布朗森动物疾病诊断实验室(Bronson Animal Disease Diagnostic Laboratory)的 1279 份奶牛血样记录数据集,这些数据集在 2012 年至 2022 年期间提交进行 BLV 抗体检测。我们使用的 SML 算法包括逻辑回归 (LR)、决策树 (DT)、梯度提升 (GB)、随机森林 (RF)、神经网络 (NN) 和支持向量机 (SVM)。共有 312 份血清样本对 BLV 呈阳性,校正血清流行率为 26.0%。受所分析的回顾性数据的限制,RF 模型是预测奶牛 BLV 血清阳性率的最佳模型,其 Kolmogorov-Smirnov (KS) 统计量最高,为 0.75,接收者操作特征下面积 (AUROC) 为 0.93,增益为 2.6,误分类率最低,为 0.10。LR 模型最差。RF 模型显示,BLV 血清阳性的最佳预测因子是年龄(年龄≥ 5 岁的奶牛)和地理位置(佛罗里达州南部)。我们的结论是,RF 和其他 SML 算法有望预测奶牛的 BLV 血清阳性率,在佛罗里达州南部饲养的 5 岁或 5 岁以上的奶牛 BLV 检测呈阳性的可能性较高。这项研究为开发有效筛查 BLV 感染的预测工具做出了重要的方法学贡献,并强调了在此类预测模型中收集和使用代表性数据的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Preventive veterinary medicine
Preventive veterinary medicine 农林科学-兽医学
CiteScore
5.60
自引率
7.70%
发文量
184
审稿时长
3 months
期刊介绍: Preventive Veterinary Medicine is one of the leading international resources for scientific reports on animal health programs and preventive veterinary medicine. The journal follows the guidelines for standardizing and strengthening the reporting of biomedical research which are available from the CONSORT, MOOSE, PRISMA, REFLECT, STARD, and STROBE statements. The journal focuses on: Epidemiology of health events relevant to domestic and wild animals; Economic impacts of epidemic and endemic animal and zoonotic diseases; Latest methods and approaches in veterinary epidemiology; Disease and infection control or eradication measures; The "One Health" concept and the relationships between veterinary medicine, human health, animal-production systems, and the environment; Development of new techniques in surveillance systems and diagnosis; Evaluation and control of diseases in animal populations.
期刊最新文献
Corrigendum to "Incorporating heterogeneity in farmer disease control behaviour into a livestock disease transmission model" Preventive Vet. Med. 219 (2023) 106019. A salmon lice prediction model. The effect of temporal resolution and contact duration on Real-Time location system-based contact networks for confined feedlot cattle. 'Implementation of bio…what?' Farm workers' subjectivities in Spanish dairy cattle farms through an ethnographic approach. Key farm characteristics associated with the level of antimicrobial use in rosé veal production - A Danish database study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1