Gusti Putu Wahyunanda Crista Yuda, Naufa Hanif, Adam Hermawan
{"title":"使用基于配体的机器学习和分子对接方法组合的计算筛选,用于靶向严重急性呼吸系统综合征冠状病毒2型主要蛋白酶的抗病毒药物的再利用。","authors":"Gusti Putu Wahyunanda Crista Yuda, Naufa Hanif, Adam Hermawan","doi":"10.1007/s40199-023-00484-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>COVID-19 is an infectious disease caused by SARS-CoV-2, a close relative of SARS-CoV. Several studies have searched for COVID-19 therapies. The topics of these works ranged from vaccine discovery to natural products targeting the SARS-CoV-2 main protease (M<sup>pro</sup>), a potential therapeutic target due to its essential role in replication and conserved sequences. However, published research on this target is limited, presenting an opportunity for drug discovery and development.</p><p><strong>Method: </strong>This study aims to repurpose 10692 drugs in DrugBank by using ligand-based virtual screening (LBVS) machine learning (ML) with Konstanz Information Miner (KNIME) to seek potential therapeutics based on M<sup>pro</sup> inhibitors. The top candidate compounds, the native ligand (GC-376) of the M<sup>pro</sup> inhibitor, and the positive control boceprevir were then subjected to absorption, distribution, metabolism, excretion, and toxicity (ADMET) characterization, drug-likeness prediction, and molecular docking (MD). Protein-protein interaction (PPI) network analysis was added to provide accurate information about the M<sup>pro</sup> regulatory network.</p><p><strong>Results: </strong>This study identified 3,166 compound candidates inhibiting M<sup>pro</sup>. The random forest (RF) molecular access system ML model provided the highest confidence score of 0.95 (bromo-7-nitroindazole) and identified the top 22 candidate compounds. Subjecting the 22 candidate compounds, the native ligand GC-376, and boceprevir to further ADMET property characterization and drug-likeness predictions revealed that one compound had two violations of Lipinski's rule. Additional MD results showed that only five compounds had more negative binding energies than the native ligand (- 12.25 kcal/mol). Among these compounds, CCX-140 exhibited the lowest score of - 13.64 kcal/mol. Through literature analysis, six compound classes with potential activity for M<sup>pro</sup> were discovered. They included benzopyrazole, azole, pyrazolopyrimidine, carboxylic acids and derivatives, benzene and substituted derivatives, and diazine. Four pathologies were also discovered on the basis of the M<sup>pro</sup> PPI network.</p><p><strong>Conclusion: </strong>Results demonstrated the efficiency of LBVS combined with MD. This combined strategy provided positive evidence showing that the top screened drugs, including CCX-140, which had the lowest MD score, can be reasonably advanced to the in vitro phase. This combined method may accelerate the discovery of therapies for novel or orphan diseases from existing drugs.</p>","PeriodicalId":10888,"journal":{"name":"DARU Journal of Pharmaceutical Sciences","volume":" ","pages":"47-65"},"PeriodicalIF":2.5000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11087449/pdf/","citationCount":"0","resultStr":"{\"title\":\"Computational Screening Using a Combination of Ligand-Based Machine Learning and Molecular Docking Methods for the Repurposing of Antivirals Targeting the SARS-CoV-2 Main Protease.\",\"authors\":\"Gusti Putu Wahyunanda Crista Yuda, Naufa Hanif, Adam Hermawan\",\"doi\":\"10.1007/s40199-023-00484-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>COVID-19 is an infectious disease caused by SARS-CoV-2, a close relative of SARS-CoV. Several studies have searched for COVID-19 therapies. The topics of these works ranged from vaccine discovery to natural products targeting the SARS-CoV-2 main protease (M<sup>pro</sup>), a potential therapeutic target due to its essential role in replication and conserved sequences. However, published research on this target is limited, presenting an opportunity for drug discovery and development.</p><p><strong>Method: </strong>This study aims to repurpose 10692 drugs in DrugBank by using ligand-based virtual screening (LBVS) machine learning (ML) with Konstanz Information Miner (KNIME) to seek potential therapeutics based on M<sup>pro</sup> inhibitors. The top candidate compounds, the native ligand (GC-376) of the M<sup>pro</sup> inhibitor, and the positive control boceprevir were then subjected to absorption, distribution, metabolism, excretion, and toxicity (ADMET) characterization, drug-likeness prediction, and molecular docking (MD). Protein-protein interaction (PPI) network analysis was added to provide accurate information about the M<sup>pro</sup> regulatory network.</p><p><strong>Results: </strong>This study identified 3,166 compound candidates inhibiting M<sup>pro</sup>. The random forest (RF) molecular access system ML model provided the highest confidence score of 0.95 (bromo-7-nitroindazole) and identified the top 22 candidate compounds. Subjecting the 22 candidate compounds, the native ligand GC-376, and boceprevir to further ADMET property characterization and drug-likeness predictions revealed that one compound had two violations of Lipinski's rule. Additional MD results showed that only five compounds had more negative binding energies than the native ligand (- 12.25 kcal/mol). Among these compounds, CCX-140 exhibited the lowest score of - 13.64 kcal/mol. Through literature analysis, six compound classes with potential activity for M<sup>pro</sup> were discovered. They included benzopyrazole, azole, pyrazolopyrimidine, carboxylic acids and derivatives, benzene and substituted derivatives, and diazine. Four pathologies were also discovered on the basis of the M<sup>pro</sup> PPI network.</p><p><strong>Conclusion: </strong>Results demonstrated the efficiency of LBVS combined with MD. This combined strategy provided positive evidence showing that the top screened drugs, including CCX-140, which had the lowest MD score, can be reasonably advanced to the in vitro phase. This combined method may accelerate the discovery of therapies for novel or orphan diseases from existing drugs.</p>\",\"PeriodicalId\":10888,\"journal\":{\"name\":\"DARU Journal of Pharmaceutical Sciences\",\"volume\":\" \",\"pages\":\"47-65\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11087449/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DARU Journal of Pharmaceutical Sciences\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s40199-023-00484-w\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/10/31 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DARU Journal of Pharmaceutical Sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40199-023-00484-w","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/10/31 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
摘要
背景:新冠肺炎是一种由严重急性呼吸系统综合征冠状病毒2型引起的传染病,该病毒是严重急性呼吸综合征冠状病毒的近亲。几项研究已经在寻找新冠肺炎疗法。这些工作的主题从疫苗发现到靶向严重急性呼吸系统综合征冠状病毒2型主要蛋白酶(Mpro)的天然产物,由于其在复制和保守序列中的重要作用,Mpro是一个潜在的治疗靶点。然而,已发表的关于该靶点的研究有限,为药物的发现和开发提供了机会。方法:本研究旨在通过使用Konstanz Information Miner(KNIME)的基于配体的虚拟筛选(LBVS)机器学习(ML),重新利用DrugBank中的10692种药物,以寻求基于Mpro抑制剂的潜在治疗方法。然后,对最热门的候选化合物、Mpro抑制剂的天然配体(GC-376)和阳性对照博ceprevir进行吸收、分布、代谢、排泄和毒性(ADMET)表征、药物相似性预测和分子对接(MD)。添加蛋白质-蛋白质相互作用(PPI)网络分析以提供关于Mpro调节网络的准确信息。结果:本研究鉴定出3166个抑制Mpro的候选化合物。随机森林(RF)分子访问系统ML模型提供了0.95(溴-7-硝基吲唑)的最高置信分数,并确定了前22个候选化合物。对22种候选化合物,天然配体GC-376和博塞韦进行进一步的ADMET性质表征和药物相似性预测,发现一种化合物有两种违反利平斯基规则的情况。额外的MD结果表明,只有五种化合物比天然配体具有更多的负结合能(- 12.25 kcal/mol)。在这些化合物中,CCX-140的得分最低,为 - 13.64kcal/mol。通过文献分析,发现了六类对Mpro具有潜在活性的化合物。它们包括苯并吡唑、唑、吡唑并嘧啶、羧酸及其衍生物、苯及其取代衍生物和二嗪。在Mpro PPI网络的基础上,还发现了四种病理。结论:结果证明了LBVS联合MD的有效性。这种联合策略提供了积极的证据,表明包括MD评分最低的CCX-140在内的顶级筛选药物可以合理地进入体外阶段。这种联合方法可能会加速从现有药物中发现治疗新疾病或孤儿疾病的方法。
Computational Screening Using a Combination of Ligand-Based Machine Learning and Molecular Docking Methods for the Repurposing of Antivirals Targeting the SARS-CoV-2 Main Protease.
Background: COVID-19 is an infectious disease caused by SARS-CoV-2, a close relative of SARS-CoV. Several studies have searched for COVID-19 therapies. The topics of these works ranged from vaccine discovery to natural products targeting the SARS-CoV-2 main protease (Mpro), a potential therapeutic target due to its essential role in replication and conserved sequences. However, published research on this target is limited, presenting an opportunity for drug discovery and development.
Method: This study aims to repurpose 10692 drugs in DrugBank by using ligand-based virtual screening (LBVS) machine learning (ML) with Konstanz Information Miner (KNIME) to seek potential therapeutics based on Mpro inhibitors. The top candidate compounds, the native ligand (GC-376) of the Mpro inhibitor, and the positive control boceprevir were then subjected to absorption, distribution, metabolism, excretion, and toxicity (ADMET) characterization, drug-likeness prediction, and molecular docking (MD). Protein-protein interaction (PPI) network analysis was added to provide accurate information about the Mpro regulatory network.
Results: This study identified 3,166 compound candidates inhibiting Mpro. The random forest (RF) molecular access system ML model provided the highest confidence score of 0.95 (bromo-7-nitroindazole) and identified the top 22 candidate compounds. Subjecting the 22 candidate compounds, the native ligand GC-376, and boceprevir to further ADMET property characterization and drug-likeness predictions revealed that one compound had two violations of Lipinski's rule. Additional MD results showed that only five compounds had more negative binding energies than the native ligand (- 12.25 kcal/mol). Among these compounds, CCX-140 exhibited the lowest score of - 13.64 kcal/mol. Through literature analysis, six compound classes with potential activity for Mpro were discovered. They included benzopyrazole, azole, pyrazolopyrimidine, carboxylic acids and derivatives, benzene and substituted derivatives, and diazine. Four pathologies were also discovered on the basis of the Mpro PPI network.
Conclusion: Results demonstrated the efficiency of LBVS combined with MD. This combined strategy provided positive evidence showing that the top screened drugs, including CCX-140, which had the lowest MD score, can be reasonably advanced to the in vitro phase. This combined method may accelerate the discovery of therapies for novel or orphan diseases from existing drugs.
期刊介绍:
DARU Journal of Pharmaceutical Sciences is a peer-reviewed journal published on behalf of Tehran University of Medical Sciences. The journal encompasses all fields of the pharmaceutical sciences and presents timely research on all areas of drug conception, design, manufacture, classification and assessment.
The term DARU is derived from the Persian name meaning drug or medicine. This journal is a unique platform to improve the knowledge of researchers and scientists by publishing novel articles including basic and clinical investigations from members of the global scientific community in the forms of original articles, systematic or narrative reviews, meta-analyses, letters, and short communications.