Pub Date : 2023-02-25DOI: 10.1007/s10822-023-00497-2
Jacob M Remington, Kyle T McKay, Noah B Beckage, Jonathon B Ferrell, Severin T. Schneebeli, Jianing Li
Molecules with bioactivity towards G protein-coupled receptors represent a subset of the vast space of small drug-like molecules. Here, we compare machine learning models, including dilated graph convolutional networks, that conduct binary classification to quickly identify molecules with activity towards G protein-coupled receptors. The models are trained and validated using a large set of over 600,000 active, inactive, and decoy compounds. The best performing machine learning model, dubbed GPCRLigNet, was a surprisingly simple feedforward dense neural network mapping from Morgan fingerprints to activity. Incorporation of GPCRLigNet into a high-throughput virtual screening workflow is demonstrated with molecular docking towards a particular G protein-coupled receptor, the pituitary adenylate cyclase-activating polypeptide receptor type 1. Through rigorous comparison of docking scores for molecules selected with and without using GPCRLigNet, we demonstrate an enrichment of potentially potent molecules using GPCRLigNet. This work provides a proof of principle that GPCRLigNet can effectively hone the chemical search space towards ligands with G protein-coupled receptor activity.
{"title":"GPCRLigNet: rapid screening for GPCR active ligands using machine learning","authors":"Jacob M Remington, Kyle T McKay, Noah B Beckage, Jonathon B Ferrell, Severin T. Schneebeli, Jianing Li","doi":"10.1007/s10822-023-00497-2","DOIUrl":"10.1007/s10822-023-00497-2","url":null,"abstract":"<div><p>Molecules with bioactivity towards G protein-coupled receptors represent a subset of the vast space of small drug-like molecules. Here, we compare machine learning models, including dilated graph convolutional networks, that conduct binary classification to quickly identify molecules with activity towards G protein-coupled receptors. The models are trained and validated using a large set of over 600,000 active, inactive, and decoy compounds. The best performing machine learning model, dubbed GPCRLigNet, was a surprisingly simple feedforward dense neural network mapping from Morgan fingerprints to activity. Incorporation of GPCRLigNet into a high-throughput virtual screening workflow is demonstrated with molecular docking towards a particular G protein-coupled receptor, the pituitary adenylate cyclase-activating polypeptide receptor type 1. Through rigorous comparison of docking scores for molecules selected with and without using GPCRLigNet, we demonstrate an enrichment of potentially potent molecules using GPCRLigNet. This work provides a proof of principle that GPCRLigNet can effectively hone the chemical search space towards ligands with G protein-coupled receptor activity.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2023-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-023-00497-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"5331481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-02-17DOI: 10.1007/s10822-023-00496-3
Anne Bonin, Floriane Montanari, Sebastian Niederführ, Andreas H. Göller
Aqueous solubility is the most important physicochemical property for agrochemical and drug candidates and a prerequisite for uptake, distribution, transport, and finally the bioavailability in living species. We here present the first-ever direct machine learning models for pH-dependent solubility in water. For this, we combined almost 300000 data points from 11 solubility assays performed over 24 years and over one million data points from lipophilicity and melting point experiments. Data were split into three pH-classes − acidic, neutral and basic − , representing the conditions of stomach and intestinal tract for animals and humans, and phloem and xylem for plants. We find that multi-task neural networks using ECFP-6 fingerprints outperform baseline random forests and single-task neural networks on the individual tasks. Our final model with three solubility tasks using the pH-class combined data from different assays and five helper tasks results in root mean square errors of 0.56 log units overall (acidic 0.61; neutral 0.52; basic 0.54) and Spearman rank correlations of 0.83 (acidic 0.78; neutral 0.86; basic 0.86), making it a valuable tool for profiling of compounds in pharmaceutical and agrochemical research. The model allows for the prediction of compound pH profiles with mean and median RMSE per molecule of 0.62 and 0.56 log units.
{"title":"pH-dependent solubility prediction for optimized drug absorption and compound uptake by plants","authors":"Anne Bonin, Floriane Montanari, Sebastian Niederführ, Andreas H. Göller","doi":"10.1007/s10822-023-00496-3","DOIUrl":"10.1007/s10822-023-00496-3","url":null,"abstract":"<div><p>Aqueous solubility is the most important physicochemical property for agrochemical and drug candidates and a prerequisite for uptake, distribution, transport, and finally the bioavailability in living species. We here present the first-ever direct machine learning models for pH-dependent solubility in water. For this, we combined almost 300000 data points from 11 solubility assays performed over 24 years and over one million data points from lipophilicity and melting point experiments. Data were split into three pH-classes − acidic, neutral and basic − , representing the conditions of stomach and intestinal tract for animals and humans, and phloem and xylem for plants. We find that multi-task neural networks using ECFP-6 fingerprints outperform baseline random forests and single-task neural networks on the individual tasks. Our final model with three solubility tasks using the pH-class combined data from different assays and five helper tasks results in root mean square errors of 0.56 log units overall (acidic 0.61; neutral 0.52; basic 0.54) and Spearman rank correlations of 0.83 (acidic 0.78; neutral 0.86; basic 0.86), making it a valuable tool for profiling of compounds in pharmaceutical and agrochemical research. The model allows for the prediction of compound pH profiles with mean and median RMSE per molecule of 0.62 and 0.56 log units.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2023-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4680255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-22DOI: 10.1007/s10822-022-00495-w
Talita Freitas de Freitas, Candida Deves Roth, Bruno Lopes Abbadi, Fernanda Souza Macchi Hopf, Marcia Alberton Perelló, Alexia de Matos Czeczot, Eduardo Vieira de Souza, Ana Flávia Borsoi, Pablo Machado, Cristiano Valim Bizarro, Luiz Augusto Basso, Luis Fernando Saraiva Macedo Timmers
Tuberculosis (TB) is one of the main causes of death from a single pathological agent, Mycobacterium tuberculosis (Mtb). In addition, the emergence of drug-resistant TB strains has exacerbated even further the treatment outcome of TB patients. It is thus needed the search for new therapeutic strategies to improve the current treatment and to circumvent the resistance mechanisms of Mtb. The shikimate kinase (SK) is the fifth enzyme of the shikimate pathway, which is essential for the survival of Mtb. The shikimate pathway is absent in humans, thereby indicating SK as an attractive target for the development of anti-TB drugs. In this work, a combination of in silico and in vitro techniques was used to identify potential inhibitors for SK from Mtb (MtSK). All compounds of our in-house database (Centro de Pesquisas em Biologia Molecular e Funcional, CPBMF) were submitted to in silico toxicity analysis to evaluate the risk of hepatotoxicity. Docking experiments were performed to identify the potential inhibitors of MtSK according to the predicted binding energy. In vitro inhibitory activity of MtSK-catalyzed chemical reaction at a single compound concentration was assessed. Minimum inhibitory concentration values for in vitro growth of pan-sensitive Mtb H37Rv strain were also determined. The mixed approach implemented in this work was able to identify five compounds that inhibit both MtSK and the in vitro growth of Mtb.
结核病(TB)是由单一病理病原体结核分枝杆菌(Mtb)造成死亡的主要原因之一。此外,耐药结核菌株的出现进一步恶化了结核病患者的治疗结果。因此,需要寻找新的治疗策略来改善目前的治疗方法并规避结核分枝杆菌的耐药机制。莽草酸激酶(shikimate kinase, SK)是莽草酸途径的第五种酶,对结核分枝杆菌的存活至关重要。莽草酸途径在人类中是不存在的,因此表明SK是开发抗结核药物的一个有吸引力的靶点。在这项工作中,采用了硅和体外技术相结合的方法来鉴定结核分枝杆菌(MtSK)中潜在的SK抑制剂。我们的内部数据库(Centro de Pesquisas em Biologia Molecular e functional, CPBMF)中的所有化合物都提交了硅毒性分析,以评估肝毒性风险。对接实验根据预测的结合能确定MtSK的潜在抑制剂。对单一化合物浓度下mtsk催化化学反应的体外抑制活性进行了评价。同时测定了泛敏Mtb H37Rv菌株体外生长的最低抑菌浓度。在这项工作中实施的混合方法能够鉴定出五种既抑制MtSK又抑制Mtb体外生长的化合物。
{"title":"Identification of potential inhibitors of Mycobacterium tuberculosis shikimate kinase: molecular docking, in silico toxicity and in vitro experiments","authors":"Talita Freitas de Freitas, Candida Deves Roth, Bruno Lopes Abbadi, Fernanda Souza Macchi Hopf, Marcia Alberton Perelló, Alexia de Matos Czeczot, Eduardo Vieira de Souza, Ana Flávia Borsoi, Pablo Machado, Cristiano Valim Bizarro, Luiz Augusto Basso, Luis Fernando Saraiva Macedo Timmers","doi":"10.1007/s10822-022-00495-w","DOIUrl":"10.1007/s10822-022-00495-w","url":null,"abstract":"<div><p>Tuberculosis (TB) is one of the main causes of death from a single pathological agent, <i>Mycobacterium tuberculosis</i> (<i>Mtb</i>). In addition, the emergence of drug-resistant TB strains has exacerbated even further the treatment outcome of TB patients. It is thus needed the search for new therapeutic strategies to improve the current treatment and to circumvent the resistance mechanisms of <i>Mtb</i>. The shikimate kinase (SK) is the fifth enzyme of the shikimate pathway, which is essential for the survival of <i>Mtb</i>. The shikimate pathway is absent in humans, thereby indicating SK as an attractive target for the development of anti-TB drugs. In this work, a combination of in silico and in vitro techniques was used to identify potential inhibitors for SK from <i>Mtb</i> (<i>Mt</i>SK). All compounds of our in-house database (Centro de Pesquisas em Biologia Molecular e Funcional, CPBMF) were submitted to in silico toxicity analysis to evaluate the risk of hepatotoxicity. Docking experiments were performed to identify the potential inhibitors of <i>Mt</i>SK according to the predicted binding energy. In vitro inhibitory activity of <i>Mt</i>SK-catalyzed chemical reaction at a single compound concentration was assessed. Minimum inhibitory concentration values for in vitro growth of pan-sensitive <i>Mtb</i> H37Rv strain were also determined. The mixed approach implemented in this work was able to identify five compounds that inhibit both <i>Mt</i>SK and the in vitro growth of <i>Mtb</i>.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":null,"pages":null},"PeriodicalIF":3.5,"publicationDate":"2022-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-022-00495-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"4848106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-10DOI: 10.1007/s10822-022-00491-0
Manuel A. Llanos, Lucas N. Alberca, María D. Ruiz, María L. Sbaraglini, Cristian Miranda, Agustina Pino-Martinez, Laura Fraccaroli, Carolina Carrillo, Catalina D. Alba Soto, Luciana Gavernet, Alan Talevi
Chagas disease, also known as American trypanosomiasis, is a neglected tropical disease caused by the protozoa Trypanosoma cruzi, affecting nearly 7 million people only in the Americas. Polyamines are essential compounds for parasite growth, survival, and differentiation. However, because trypanosomatids are auxotrophic for polyamines, they must be obtained from the host by specific transporters. In this investigation, an ensemble of QSAR classifiers able to identify polyamine analogs with trypanocidal activity was developed. Then, a multi-template homology model of the dimeric polyamine transporter of T. cruzi, TcPAT12, was created with Rosetta, and then refined by enhanced sampling molecular dynamics simulations. Using representative snapshots extracted from the trajectory, a docking model able to discriminate between active and inactive compounds was developed and validated. Both models were applied in a parallel virtual screening campaign to repurpose known drugs as anti-trypanosomal compounds inhibiting polyamine transport in T. cruzi. Montelukast, Quinestrol, Danazol, and Dutasteride were selected for in vitro testing, and all of them inhibited putrescine uptake in biochemical assays, confirming the predictive ability of the computational models. Furthermore, all the confirmed hits proved to inhibit epimastigote proliferation, and Quinestrol and Danazol were able to inhibit, in the low micromolar range, the viability of trypomastigotes and the intracellular growth of amastigotes.