{"title":"ABDpred:使用监督机器学习技术预测活性抗菌化合物。","authors":"Tanmoy Jana, Debasree Sarkar, Debayan Ganguli, Sandip Kumar Mukherjee, Rahul Shubhra Mandal, Santasabuj Das","doi":"10.4103/ijmr.ijmr_1832_22","DOIUrl":null,"url":null,"abstract":"<p><strong>Background objectives: </strong>Discovery of new antibiotics is the need of the hour to treat infectious diseases. An ever-increasing repertoire of multidrug-resistant pathogens poses an imminent threat to human lives across the globe. However, the low success rate of the existing approaches and technologies for antibiotic discovery remains a major bottleneck. In silico methods like machine learning (ML) deem more promising to meet the above challenges compared with the conventional experimental approaches. The goal of this study was to create ML models that may be used to successfully predict new antimicrobial compounds.</p><p><strong>Methods: </strong>In this article, we employed eight different ML algorithms namely, extreme gradient boosting, random forest, gradient boosting classifier, deep neural network, support vector machine, multilayer perceptron, decision tree, and logistic regression. These models were trained using a dataset comprising 312 antibiotic drugs and a negative set of 936 non-antibiotic drugs in a five-fold cross validation approach.</p><p><strong>Results: </strong>The top four ML classifiers (extreme gradient boosting, random forest, gradient boosting classifier and deep neural network) were able to achieve an accuracy of 80 per cent and above during the evaluation of testing and blind datasets.</p><p><strong>Interpretation conclusions: </strong>We aggregated the top performing four models through a soft-voting technique to develop an ensemble-based ML method and incorporated it into a freely accessible online prediction server named ABDpred ( http://clinicalmedicinessd.com.in/abdpred/ ).</p>","PeriodicalId":13349,"journal":{"name":"Indian Journal of Medical Research","volume":" ","pages":"78-90"},"PeriodicalIF":2.7000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10954100/pdf/","citationCount":"0","resultStr":"{\"title\":\"ABDpred: Prediction of active antimicrobial compounds using supervised machine learning techniques.\",\"authors\":\"Tanmoy Jana, Debasree Sarkar, Debayan Ganguli, Sandip Kumar Mukherjee, Rahul Shubhra Mandal, Santasabuj Das\",\"doi\":\"10.4103/ijmr.ijmr_1832_22\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background objectives: </strong>Discovery of new antibiotics is the need of the hour to treat infectious diseases. An ever-increasing repertoire of multidrug-resistant pathogens poses an imminent threat to human lives across the globe. However, the low success rate of the existing approaches and technologies for antibiotic discovery remains a major bottleneck. In silico methods like machine learning (ML) deem more promising to meet the above challenges compared with the conventional experimental approaches. The goal of this study was to create ML models that may be used to successfully predict new antimicrobial compounds.</p><p><strong>Methods: </strong>In this article, we employed eight different ML algorithms namely, extreme gradient boosting, random forest, gradient boosting classifier, deep neural network, support vector machine, multilayer perceptron, decision tree, and logistic regression. These models were trained using a dataset comprising 312 antibiotic drugs and a negative set of 936 non-antibiotic drugs in a five-fold cross validation approach.</p><p><strong>Results: </strong>The top four ML classifiers (extreme gradient boosting, random forest, gradient boosting classifier and deep neural network) were able to achieve an accuracy of 80 per cent and above during the evaluation of testing and blind datasets.</p><p><strong>Interpretation conclusions: </strong>We aggregated the top performing four models through a soft-voting technique to develop an ensemble-based ML method and incorporated it into a freely accessible online prediction server named ABDpred ( http://clinicalmedicinessd.com.in/abdpred/ ).</p>\",\"PeriodicalId\":13349,\"journal\":{\"name\":\"Indian Journal of Medical Research\",\"volume\":\" \",\"pages\":\"78-90\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10954100/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Indian Journal of Medical Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.4103/ijmr.ijmr_1832_22\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/3/4 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"IMMUNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.4103/ijmr.ijmr_1832_22","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/3/4 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景目标:发现新的抗生素是治疗传染病的当务之急。日益增多的多重耐药病原体对全球人类的生命构成了迫在眉睫的威胁。然而,现有抗生素发现方法和技术的低成功率仍然是一个主要瓶颈。与传统的实验方法相比,机器学习(ML)等硅学方法更有希望应对上述挑战。本研究的目标是创建可用于成功预测新抗菌化合物的 ML 模型:在本文中,我们采用了八种不同的 ML 算法,即极端梯度提升、随机森林、梯度提升分类器、深度神经网络、支持向量机、多层感知器、决策树和逻辑回归。这些模型采用五倍交叉验证法,使用由 312 种抗生素药物和 936 种非抗生素药物组成的数据集进行训练:在对测试数据集和盲数据集进行评估期间,排名前四位的多重层流分类器(极梯度提升、随机森林、梯度提升分类器和深度神经网络)的准确率均达到 80% 及以上:我们通过软投票技术汇总了表现最好的四个模型,开发出一种基于集合的 ML 方法,并将其纳入一个可免费访问的在线预测服务器,名为 ABDpred (http://clinicalmedicinessd.com.in/abdpred/)。
ABDpred: Prediction of active antimicrobial compounds using supervised machine learning techniques.
Background objectives: Discovery of new antibiotics is the need of the hour to treat infectious diseases. An ever-increasing repertoire of multidrug-resistant pathogens poses an imminent threat to human lives across the globe. However, the low success rate of the existing approaches and technologies for antibiotic discovery remains a major bottleneck. In silico methods like machine learning (ML) deem more promising to meet the above challenges compared with the conventional experimental approaches. The goal of this study was to create ML models that may be used to successfully predict new antimicrobial compounds.
Methods: In this article, we employed eight different ML algorithms namely, extreme gradient boosting, random forest, gradient boosting classifier, deep neural network, support vector machine, multilayer perceptron, decision tree, and logistic regression. These models were trained using a dataset comprising 312 antibiotic drugs and a negative set of 936 non-antibiotic drugs in a five-fold cross validation approach.
Results: The top four ML classifiers (extreme gradient boosting, random forest, gradient boosting classifier and deep neural network) were able to achieve an accuracy of 80 per cent and above during the evaluation of testing and blind datasets.
Interpretation conclusions: We aggregated the top performing four models through a soft-voting technique to develop an ensemble-based ML method and incorporated it into a freely accessible online prediction server named ABDpred ( http://clinicalmedicinessd.com.in/abdpred/ ).
期刊介绍:
The Indian Journal of Medical Research (IJMR) [ISSN 0971-5916] is one of the oldest medical Journals not only in India, but probably in Asia, as it started in the year 1913. The Journal was started as a quarterly (4 issues/year) in 1913 and made bimonthly (6 issues/year) in 1958. It became monthly (12 issues/year) in the year 1964.