AntiMicro.ai:由人工智能驱动的网络应用程序,用于预测抗菌/抗真菌敏感性和构建个性化机器学习模型

Fredrick Mutisya, Rachael Kanguha
{"title":"AntiMicro.ai:由人工智能驱动的网络应用程序,用于预测抗菌/抗真菌敏感性和构建个性化机器学习模型","authors":"Fredrick Mutisya, Rachael Kanguha","doi":"10.12688/wellcomeopenres.21281.1","DOIUrl":null,"url":null,"abstract":"Background To train and develop machine learning models on the Pfizer antibacterial and antifungal datasets with subsequent deployment to an interactive Web Application. Methods We utilized R version 4.3.1 to perform descriptive analysis to obtain features/predictors. Python 3.10 libraries NumPy, Pandas, Scikit learn, Pycaret were used to train machine learning models. All these models were scored using area under the curve, recall, precision, F1, Kappa and the Mathews correlation coefficient. The best performing model was then deployed into a web application built on Streamlit after which deployment was done using GitHub and Streamlit cloud. A prototype android app was also developed using GoNative. Results The exploratory data analysis, S Aureus (17.2%) was the most common species however in the sub group analysis of the isolates with genotypic values, K Pneumoniae(48.2%) and E Coli (36.9%) were dominant. Amongst the fungi, Candida albicans (38.3%) and Candida glabrata (15.5%) were dominant. Feature selection was done using Shapley additive explanation plots. Using Extreme Gradient Boosting (XGBoost), we were able to achieve 99% and 97.8% AUC in the prediction of antibacterial and antifungal susceptibility respectively with minimal overfitting. Conclusions Decision tree based methods are a viable options of predicting antibacterial and antifungal drug resistance. When presented in visually appealing modes like web applications and android apps, it can be a useful guide to clinicians on initial treatment while awaiting definitive phenotypic testing. It can also be a surveillance tool that can craft antimicrobial resistance strategies.","PeriodicalId":508490,"journal":{"name":"Wellcome Open Research","volume":"108 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AntiMicro.ai: An Artificial Intelligence powered web application for predicting antibacterial/antifungal susceptibility and constructing personalized machine learning models\",\"authors\":\"Fredrick Mutisya, Rachael Kanguha\",\"doi\":\"10.12688/wellcomeopenres.21281.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background To train and develop machine learning models on the Pfizer antibacterial and antifungal datasets with subsequent deployment to an interactive Web Application. Methods We utilized R version 4.3.1 to perform descriptive analysis to obtain features/predictors. Python 3.10 libraries NumPy, Pandas, Scikit learn, Pycaret were used to train machine learning models. All these models were scored using area under the curve, recall, precision, F1, Kappa and the Mathews correlation coefficient. The best performing model was then deployed into a web application built on Streamlit after which deployment was done using GitHub and Streamlit cloud. A prototype android app was also developed using GoNative. Results The exploratory data analysis, S Aureus (17.2%) was the most common species however in the sub group analysis of the isolates with genotypic values, K Pneumoniae(48.2%) and E Coli (36.9%) were dominant. Amongst the fungi, Candida albicans (38.3%) and Candida glabrata (15.5%) were dominant. Feature selection was done using Shapley additive explanation plots. Using Extreme Gradient Boosting (XGBoost), we were able to achieve 99% and 97.8% AUC in the prediction of antibacterial and antifungal susceptibility respectively with minimal overfitting. Conclusions Decision tree based methods are a viable options of predicting antibacterial and antifungal drug resistance. When presented in visually appealing modes like web applications and android apps, it can be a useful guide to clinicians on initial treatment while awaiting definitive phenotypic testing. It can also be a surveillance tool that can craft antimicrobial resistance strategies.\",\"PeriodicalId\":508490,\"journal\":{\"name\":\"Wellcome Open Research\",\"volume\":\"108 5\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wellcome Open Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12688/wellcomeopenres.21281.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wellcome Open Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/wellcomeopenres.21281.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景 在辉瑞公司的抗菌和抗真菌数据集上训练和开发机器学习模型,然后将其部署到交互式网络应用程序中。方法 我们使用 R 4.3.1 版进行描述性分析,以获得特征/预测因子。Python 3.10 库 NumPy、Pandas、Scikit learn 和 Pycaret 用于训练机器学习模型。使用曲线下面积、召回率、精确度、F1、Kappa 和马修斯相关系数对所有这些模型进行评分。然后将表现最佳的模型部署到基于 Streamlit 的网络应用程序中,之后使用 GitHub 和 Streamlit 云进行部署。此外,还使用 GoNative 开发了一个安卓应用程序原型。结果 探索性数据分析显示,金黄色葡萄球菌(17.2%)是最常见的菌种,但在对具有基因型值的分离物进行亚组分析时,肺炎双球菌(48.2%)和大肠杆菌(36.9%)占主导地位。在真菌中,白色念珠菌(38.3%)和光滑念珠菌(15.5%)占主导地位。特征选择采用 Shapley 加性解释图。利用极端梯度提升(XGBoost)技术,我们在预测抗菌药和抗真菌药敏性时的 AUC 分别达到了 99% 和 97.8%,且过拟合程度极低。结论 基于决策树的方法是预测抗菌药和抗真菌药耐药性的可行选择。如果以网络应用程序和安卓应用程序等直观吸引人的模式呈现,它可以在临床医生等待确定的表型检测期间为初步治疗提供有用的指导。它还可以作为一种监测工具,用于制定抗菌药耐药性策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
AntiMicro.ai: An Artificial Intelligence powered web application for predicting antibacterial/antifungal susceptibility and constructing personalized machine learning models
Background To train and develop machine learning models on the Pfizer antibacterial and antifungal datasets with subsequent deployment to an interactive Web Application. Methods We utilized R version 4.3.1 to perform descriptive analysis to obtain features/predictors. Python 3.10 libraries NumPy, Pandas, Scikit learn, Pycaret were used to train machine learning models. All these models were scored using area under the curve, recall, precision, F1, Kappa and the Mathews correlation coefficient. The best performing model was then deployed into a web application built on Streamlit after which deployment was done using GitHub and Streamlit cloud. A prototype android app was also developed using GoNative. Results The exploratory data analysis, S Aureus (17.2%) was the most common species however in the sub group analysis of the isolates with genotypic values, K Pneumoniae(48.2%) and E Coli (36.9%) were dominant. Amongst the fungi, Candida albicans (38.3%) and Candida glabrata (15.5%) were dominant. Feature selection was done using Shapley additive explanation plots. Using Extreme Gradient Boosting (XGBoost), we were able to achieve 99% and 97.8% AUC in the prediction of antibacterial and antifungal susceptibility respectively with minimal overfitting. Conclusions Decision tree based methods are a viable options of predicting antibacterial and antifungal drug resistance. When presented in visually appealing modes like web applications and android apps, it can be a useful guide to clinicians on initial treatment while awaiting definitive phenotypic testing. It can also be a surveillance tool that can craft antimicrobial resistance strategies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The genome sequence of the bloodfluke planorb, Biomphalaria glabrata (Say, 1818) The genome sequence of the blonde ray, Raja brachyura Lafont, 1871 The genome sequence of the Northern Bottlenose Whale, Hyperoodon ampullatus (Forster, 1770) The genome sequence of the Maiden’s Blush moth, Cyclophora punctaria (Linnaeus, 1758) The genome sequence of a jewel beetle, Agrilus biguttatus (Fabricius, 1776)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1