{"title":"基于机器学习方法的抗疟疾分子的计算机预测。","authors":"Madhulata Kumari, Subhash Chandra","doi":"10.1504/IJCBDD.2015.068783","DOIUrl":null,"url":null,"abstract":"Machine learning techniques have been widely used in drug discovery and development in the areas of cheminformatics. Aspartyl aminopeptidase (M18AAP) of Plasmodium falciparum is crucial for survival of malaria parasite. We have created predictive models using weka and evaluated their performance based on various statistical parameters. Random Forest based model was found to be the most specificity (97.94%), with best accuracy (97.3%), MCC (0.306) as well as ROC (86.1%). The accuracy and MCC of these models indicated that they could be used to classify huge dataset of unknown compounds to predict their antimalarial compounds to develop effective drugs. Further, we deployed best predictive model on NCI diversity set IV. As result we found 59 bioactive anti-malarial molecules inhibiting M18AAP. Further, we obtained 18 non-toxic hit molecules out of 59 bioactive compounds. We suggest that such machine learning approaches could be applied to reduce the cost and length of time of drug discovery.","PeriodicalId":39227,"journal":{"name":"International Journal of Computational Biology and Drug Design","volume":"8 1","pages":"40-53"},"PeriodicalIF":0.0000,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJCBDD.2015.068783","citationCount":"5","resultStr":"{\"title\":\"In silico prediction of anti-malarial hit molecules based on machine learning methods.\",\"authors\":\"Madhulata Kumari, Subhash Chandra\",\"doi\":\"10.1504/IJCBDD.2015.068783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning techniques have been widely used in drug discovery and development in the areas of cheminformatics. Aspartyl aminopeptidase (M18AAP) of Plasmodium falciparum is crucial for survival of malaria parasite. We have created predictive models using weka and evaluated their performance based on various statistical parameters. Random Forest based model was found to be the most specificity (97.94%), with best accuracy (97.3%), MCC (0.306) as well as ROC (86.1%). The accuracy and MCC of these models indicated that they could be used to classify huge dataset of unknown compounds to predict their antimalarial compounds to develop effective drugs. Further, we deployed best predictive model on NCI diversity set IV. As result we found 59 bioactive anti-malarial molecules inhibiting M18AAP. Further, we obtained 18 non-toxic hit molecules out of 59 bioactive compounds. We suggest that such machine learning approaches could be applied to reduce the cost and length of time of drug discovery.\",\"PeriodicalId\":39227,\"journal\":{\"name\":\"International Journal of Computational Biology and Drug Design\",\"volume\":\"8 1\",\"pages\":\"40-53\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1504/IJCBDD.2015.068783\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Computational Biology and Drug Design\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJCBDD.2015.068783\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2015/4/13 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"Pharmacology, Toxicology and Pharmaceutics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Biology and Drug Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJCBDD.2015.068783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2015/4/13 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
In silico prediction of anti-malarial hit molecules based on machine learning methods.
Machine learning techniques have been widely used in drug discovery and development in the areas of cheminformatics. Aspartyl aminopeptidase (M18AAP) of Plasmodium falciparum is crucial for survival of malaria parasite. We have created predictive models using weka and evaluated their performance based on various statistical parameters. Random Forest based model was found to be the most specificity (97.94%), with best accuracy (97.3%), MCC (0.306) as well as ROC (86.1%). The accuracy and MCC of these models indicated that they could be used to classify huge dataset of unknown compounds to predict their antimalarial compounds to develop effective drugs. Further, we deployed best predictive model on NCI diversity set IV. As result we found 59 bioactive anti-malarial molecules inhibiting M18AAP. Further, we obtained 18 non-toxic hit molecules out of 59 bioactive compounds. We suggest that such machine learning approaches could be applied to reduce the cost and length of time of drug discovery.