Application of machine learning to Isoniazid resistance analysis

Zhou Yang
{"title":"Application of machine learning to Isoniazid resistance analysis","authors":"Zhou Yang","doi":"10.1145/3571532.3571541","DOIUrl":null,"url":null,"abstract":"Correct and timely detection of Mycobacterium tuberculosis (MTB) resistance against existing tuberculosis (TB) drugs is essential for the limit of TB amplification. The objectives of the projects are (1) to develop classification models that help isoniazid-resistant TB diagnosis, (2) to find the best performed classification algorithm, and (3) to rank the gene mutations according to feature importance. The python sklearn and matplotlib packages were frequently utilized throughout the research for data curation, classification model development, and feature importance ranking. Additionally, area under the curve (AUC), precision, sensitivity, specificity, F1 score, and correct classification rate measured for model performances, and Gini importance calculated feature importance. Gradient boosting found to overperform other classification models with the highest accuracy mean 0f 0.852, and its overfitting error exposed the need for dimensionality reduction prior to model training. Gene 625 and 331 were the most significant features in this project, and this suggested the potential of machine learning (ML) to find new resistance makers. The results confirmed the application of ML in clinical settings for quicker and better prediction of drug resistance based on large genome sequencing data. With future studies focusing on less studied and second-line TB drugs, classification models could decrease mortality and prevent the amplification of existing antibiotic resistance by allowing early diagnosis and treatment. CCS CONCEPTS • Computing methodologies∼Machine learning∼Learning paradigms∼Supervised learning∼Supervised learning by classification","PeriodicalId":355088,"journal":{"name":"Proceedings of the 2022 11th International Conference on Bioinformatics and Biomedical Science","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 11th International Conference on Bioinformatics and Biomedical Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3571532.3571541","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Correct and timely detection of Mycobacterium tuberculosis (MTB) resistance against existing tuberculosis (TB) drugs is essential for the limit of TB amplification. The objectives of the projects are (1) to develop classification models that help isoniazid-resistant TB diagnosis, (2) to find the best performed classification algorithm, and (3) to rank the gene mutations according to feature importance. The python sklearn and matplotlib packages were frequently utilized throughout the research for data curation, classification model development, and feature importance ranking. Additionally, area under the curve (AUC), precision, sensitivity, specificity, F1 score, and correct classification rate measured for model performances, and Gini importance calculated feature importance. Gradient boosting found to overperform other classification models with the highest accuracy mean 0f 0.852, and its overfitting error exposed the need for dimensionality reduction prior to model training. Gene 625 and 331 were the most significant features in this project, and this suggested the potential of machine learning (ML) to find new resistance makers. The results confirmed the application of ML in clinical settings for quicker and better prediction of drug resistance based on large genome sequencing data. With future studies focusing on less studied and second-line TB drugs, classification models could decrease mortality and prevent the amplification of existing antibiotic resistance by allowing early diagnosis and treatment. CCS CONCEPTS • Computing methodologies∼Machine learning∼Learning paradigms∼Supervised learning∼Supervised learning by classification
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
机器学习在异烟肼耐药性分析中的应用
正确和及时地检测结核分枝杆菌对现有结核病药物的耐药性对于限制结核病扩增至关重要。这些项目的目标是:(1)建立有助于异烟肼耐药结核病诊断的分类模型;(2)找到表现最佳的分类算法;(3)根据特征重要性对基因突变进行排序。在整个研究过程中,经常使用python sklearn和matplotlib包进行数据管理、分类模型开发和特征重要性排序。此外,曲线下面积(AUC)、精度、灵敏度、特异性、F1评分和正确分类率衡量模型性能,基尼重要度计算特征重要度。梯度增强的准确率均值为0.0.852,优于其他分类模型,其过拟合误差暴露了模型训练前需要降维。基因625和331是该项目中最重要的特征,这表明机器学习(ML)有潜力找到新的抗性制造者。结果证实了ML在临床环境中的应用,可以基于大基因组测序数据更快、更好地预测耐药。随着未来的研究集中在研究较少的二线结核病药物上,分类模型可以通过允许早期诊断和治疗来降低死亡率并防止现有抗生素耐药性的扩大。CCS概念•计算方法~机器学习~学习范式~监督学习~通过分类进行监督学习
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Extracting spatial-temporal characteristics from Dynamic Connectivity Network with rs-fMRI Data for AD Classification Brain entrainment by audio-visual gamma frequency stimulations Application of Computer Management System Function in Standard Design of Insulin Pen Injection Program Study of coronary artery bypass grafting admission in Covid-19 era: a bicentric study Gene Circuit Construction and Simulation in Probiotics to Metabolize Alcohol
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1