Pub Date : 2023-10-01Epub Date: 2023-08-21DOI: 10.1002/minf.202300069
Gildardo Rivera, Alonzo González-González, Citlali Vázquez, Rusely Encalada, Emma Saavedra, Lenci K Vázquez-Jiménez, Eyra Ortiz-Pérez, Maria Bolognesi
Phenothiazine derivatives can unselectively inhibit the trypanothione-dependent antioxidant system enzyme trypanothione reductase (TR). A virtual screening of 2163 phenothiazine derivatives from the ZINC15 and PubChem databases docked on the active site of T. cruzi TR showed that 285 compounds have higher affinity than the natural ligand trypanothione disulfide. 244 compounds showed higher affinity toward the parasite's enzyme than to its human homolog glutathione reductase. Protein-ligand interaction profiling predicted that the main interactions for the top scored compounds were with residues important for trypanothione disulfide binding: Phe396, Pro398, Leu399, His461, Glu466, and Glu467, particularly His461, which participates in catalysis. Two compounds with the desired profiles, ZINC1033681 (Zn_C687) and ZINC10213096 (Zn_C216), decreased parasite growth by 20 % and 50 %, respectively. They behaved as mixed-type inhibitors of recombinant TR, with Ki values of 59 and 47 μM, respectively. This study provides a further understanding of the potential of phenothiazine derivatives as TR inhibitors.
{"title":"Phenothiazine-based virtual screening, molecular docking, and molecular dynamics of new trypanothione reductase inhibitors of Trypanosoma cruzi.","authors":"Gildardo Rivera, Alonzo González-González, Citlali Vázquez, Rusely Encalada, Emma Saavedra, Lenci K Vázquez-Jiménez, Eyra Ortiz-Pérez, Maria Bolognesi","doi":"10.1002/minf.202300069","DOIUrl":"10.1002/minf.202300069","url":null,"abstract":"<p><p>Phenothiazine derivatives can unselectively inhibit the trypanothione-dependent antioxidant system enzyme trypanothione reductase (TR). A virtual screening of 2163 phenothiazine derivatives from the ZINC15 and PubChem databases docked on the active site of T. cruzi TR showed that 285 compounds have higher affinity than the natural ligand trypanothione disulfide. 244 compounds showed higher affinity toward the parasite's enzyme than to its human homolog glutathione reductase. Protein-ligand interaction profiling predicted that the main interactions for the top scored compounds were with residues important for trypanothione disulfide binding: Phe396, Pro398, Leu399, His461, Glu466, and Glu467, particularly His461, which participates in catalysis. Two compounds with the desired profiles, ZINC1033681 (Zn_C687) and ZINC10213096 (Zn_C216), decreased parasite growth by 20 % and 50 %, respectively. They behaved as mixed-type inhibitors of recombinant TR, with Ki values of 59 and 47 μM, respectively. This study provides a further understanding of the potential of phenothiazine derivatives as TR inhibitors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e2300069"},"PeriodicalIF":3.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10020855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-01Epub Date: 2023-08-21DOI: 10.1002/minf.202200275
Dmitry Zankov, Timur Madzhidov, Igor Baskin, Alexandre Varnek
Conjugated QSPR models for reactions integrate fundamental chemical laws expressed by mathematical equations with machine learning algorithms. Herein we present a methodology for building conjugated QSPR models integrated with the Arrhenius equation. Conjugated QSPR models were used to predict kinetic characteristics of cycloaddition reactions related by the Arrhenius equation: rate constant , pre-exponential factor , and activation energy . They were benchmarked against single-task (individual and equation-based models) and multi-task models. In individual models, all characteristics were modeled separately, while in multi-task models , and were treated cooperatively. An equation-based model assessed using the Arrhenius equation and and values predicted by individual models. It has been demonstrated that the conjugated QSPR models can accurately predict the reaction rate constants at extreme temperatures, at which reaction rate constants hardly can be measured experimentally. Also, in the case of small training sets conjugated models are more robust than related single-task approaches.
反应的共轭QSPR模型将数学方程表达的基本化学定律与机器学习算法相结合。在此,我们提出了一种结合阿伦尼斯方程建立共轭QSPR模型的方法。共轭QSPR模型用于预测与Arrhenius方程相关的环加成反应的动力学特性:速率常数l o g k${rm l}{rmo}{{RMg}k}$、指数前因子l o g A${ rml}和活化能E A${{E}_{{rm a}}$。它们以单任务(基于个体和方程的模型)和多任务模型为基准。在单独的模型中,所有特征都是单独建模的,而在多任务模型中,l o g k${rm l}${{E}_{rma}}$得到了合作处理。一个基于方程的模型使用Arrhenius方程和l o g A${rm l}{rm o}A}$和E${{E}_{{rma}}}}$由各个模型预测的值。研究表明,共轭QSPR模型可以准确预测极端温度下的反应速率常数,而在极端温度下几乎无法通过实验测量反应速率常数。此外,在小训练集的情况下,共轭模型比相关的单任务方法更稳健。
{"title":"Conjugated quantitative structure-property relationship models: Prediction of kinetic characteristics linked by the Arrhenius equation.","authors":"Dmitry Zankov, Timur Madzhidov, Igor Baskin, Alexandre Varnek","doi":"10.1002/minf.202200275","DOIUrl":"10.1002/minf.202200275","url":null,"abstract":"<p><p>Conjugated QSPR models for reactions integrate fundamental chemical laws expressed by mathematical equations with machine learning algorithms. Herein we present a methodology for building conjugated QSPR models integrated with the Arrhenius equation. Conjugated QSPR models were used to predict kinetic characteristics of cycloaddition reactions related by the Arrhenius equation: rate constant <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>k</mi></mrow> <annotation>${{rm l}{rm o}{rm g}k}$</annotation> </semantics> </math> , pre-exponential factor <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>A</mi></mrow> <annotation>${{rm l}{rm o}{rm g}A}$</annotation> </semantics> </math> , and activation energy <math> <semantics><msub><mi>E</mi> <mi>a</mi></msub> <annotation>${{E}_{{rm a}}}$</annotation> </semantics> </math> . They were benchmarked against single-task (individual and equation-based models) and multi-task models. In individual models, all characteristics were modeled separately, while in multi-task models <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>k</mi></mrow> <annotation>${{rm l}{rm o}{rm g}k}$</annotation> </semantics> </math> , <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>A</mi></mrow> <annotation>${{rm l}{rm o}{rm g}A}$</annotation> </semantics> </math> and <math> <semantics><msub><mi>E</mi> <mi>a</mi></msub> <annotation>${{E}_{{rm a}}}$</annotation> </semantics> </math> were treated cooperatively. An equation-based model assessed <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>k</mi></mrow> <annotation>${{rm l}{rm o}{rm g}k}$</annotation> </semantics> </math> using the Arrhenius equation and <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>A</mi></mrow> <annotation>${{rm l}{rm o}{rm g}A}$</annotation> </semantics> </math> and <math> <semantics><msub><mi>E</mi> <mi>a</mi></msub> <annotation>${{E}_{{rm a}}}$</annotation> </semantics> </math> values predicted by individual models. It has been demonstrated that the conjugated QSPR models can accurately predict the reaction rate constants at extreme temperatures, at which reaction rate constants hardly can be measured experimentally. Also, in the case of small training sets conjugated models are more robust than related single-task approaches.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e2200275"},"PeriodicalIF":3.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10029968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-10-01Epub Date: 2023-08-31DOI: 10.1002/minf.202300055
Sandipan Chakraborty, Chiranjeet Saha
SARS-CoV-2 gained crucial mutations at the receptor binding domain (RBD) that often changed the course of the pandemic leading to new waves with increased case fatality. Variants are observed with enhanced transmission and immune invasion abilities. Thus, predicting future variants with enhanced transmission ability is a problem of utmost research interest. Here, we have developed a multi-tier exhaustive SARS-CoV-2 mutation screening platform combining MM/GBSA, extensive molecular dynamics simulations, and steered molecular dynamics to identify RBD mutants with enhanced ACE2 binding capability. We have identified four RBM mutations (F490K, S494K, G504F, and the P499L) with significantly higher ACE2 binding abilities than wild-type RBD. Compared to wild-type RBD, they all form stable complexes with more hydrogen bonds and salt-bridge interactions with ACE2. Our simulation data suggest that these mutations allosterically alter the packing of the RBM interface of the RBD-ACE2 complex. As a result, the rupture force required to break the RBD-ACE2 contacts is significantly higher for these mutants.
{"title":"A multi-tier computational screening framework to effectively search the mutational space of SARS-CoV-2 receptor binding motif to identify mutants with enhanced ACE2 binding abilities.","authors":"Sandipan Chakraborty, Chiranjeet Saha","doi":"10.1002/minf.202300055","DOIUrl":"10.1002/minf.202300055","url":null,"abstract":"<p><p>SARS-CoV-2 gained crucial mutations at the receptor binding domain (RBD) that often changed the course of the pandemic leading to new waves with increased case fatality. Variants are observed with enhanced transmission and immune invasion abilities. Thus, predicting future variants with enhanced transmission ability is a problem of utmost research interest. Here, we have developed a multi-tier exhaustive SARS-CoV-2 mutation screening platform combining MM/GBSA, extensive molecular dynamics simulations, and steered molecular dynamics to identify RBD mutants with enhanced ACE2 binding capability. We have identified four RBM mutations (F490K, S494K, G504F, and the P499L) with significantly higher ACE2 binding abilities than wild-type RBD. Compared to wild-type RBD, they all form stable complexes with more hydrogen bonds and salt-bridge interactions with ACE2. Our simulation data suggest that these mutations allosterically alter the packing of the RBM interface of the RBD-ACE2 complex. As a result, the rupture force required to break the RBD-ACE2 contacts is significantly higher for these mutants.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e2300055"},"PeriodicalIF":3.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10491472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01Epub Date: 2023-06-16DOI: 10.1002/minf.202300061
Takuya Ehiro
Dimensionality reduction (DR) techniques are used for various purposes such as exploratory data analysis. A commonly employed linear DR technique is principal component analysis (PCA), which is one of the most popular methods for DR. Owing to its linear nature, PCA enables the determination of axes in a low-dimensional space and the calculation of corresponding loading vectors. However, PCA cannot necessarily extract important features of non-linearly distributed data. This study presents a technique aimed at aiding the interpretation of data reduced through non-linear DR methods. In the proposed method, non-linear dimensionally reduced data was clustered via a density-based clustering method. Thereafter, the obtained cluster labels were classified by random forest (RF) classifiers. Further, feature importance (FI) of RF classifiers and Spearman's rank correlation coefficients between predictive probabilities to obtained clusters and original feature values were utilized for characterizing the visualized dimensionally reduced data. The results revealed that the proposed method can provide the interpretable FI-based images of the handwritten digits dataset. Moreover, the proposed method was also applied to the polymer dataset. The study found that incorporating signed FI was advantageous in achieving a meaningful interpretation. Furthermore, Gaussian process regression was utilized to produce intuitive FI-based heatmaps on a 2-dimensional space for greater ease of understanding. Additionally, to enhance the interpretability of the obtained clusters, a feature selection technique called Boruta was applied. The Boruta feature selection method worked effectively to interpret the obtained clusters with limited and commonly important features. Additionally, the study suggested that computing FI solely from substructure-based descriptors could further enhance the interpretability of the results. Finally, the automation of the proposed method was investigated, and through maximizing the target score based on the quality of both the DR and clustering, indicative results were automatically obtained for both the handwritten digits and polymer datasets.
{"title":"Feature importance-based interpretation of UMAP-visualized polymer space.","authors":"Takuya Ehiro","doi":"10.1002/minf.202300061","DOIUrl":"10.1002/minf.202300061","url":null,"abstract":"<p><p>Dimensionality reduction (DR) techniques are used for various purposes such as exploratory data analysis. A commonly employed linear DR technique is principal component analysis (PCA), which is one of the most popular methods for DR. Owing to its linear nature, PCA enables the determination of axes in a low-dimensional space and the calculation of corresponding loading vectors. However, PCA cannot necessarily extract important features of non-linearly distributed data. This study presents a technique aimed at aiding the interpretation of data reduced through non-linear DR methods. In the proposed method, non-linear dimensionally reduced data was clustered via a density-based clustering method. Thereafter, the obtained cluster labels were classified by random forest (RF) classifiers. Further, feature importance (FI) of RF classifiers and Spearman's rank correlation coefficients between predictive probabilities to obtained clusters and original feature values were utilized for characterizing the visualized dimensionally reduced data. The results revealed that the proposed method can provide the interpretable FI-based images of the handwritten digits dataset. Moreover, the proposed method was also applied to the polymer dataset. The study found that incorporating signed FI was advantageous in achieving a meaningful interpretation. Furthermore, Gaussian process regression was utilized to produce intuitive FI-based heatmaps on a 2-dimensional space for greater ease of understanding. Additionally, to enhance the interpretability of the obtained clusters, a feature selection technique called Boruta was applied. The Boruta feature selection method worked effectively to interpret the obtained clusters with limited and commonly important features. Additionally, the study suggested that computing FI solely from substructure-based descriptors could further enhance the interpretability of the results. Finally, the automation of the proposed method was investigated, and through maximizing the target score based on the quality of both the DR and clustering, indicative results were automatically obtained for both the handwritten digits and polymer datasets.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"42 8-9","pages":"e2300061"},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10257338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01Epub Date: 2023-08-09DOI: 10.1002/minf.202300006
Alex Nyporko, Olga Tsymbalyuk, Ivan Voiteshenko, Sergiy Starosyla, Mykola Protopopov, Volodymyr Bdzhola
The new high selective mAChRs M3 inhibitors with IC50 in nanomolecular ranges, which can be the prototypes for effective COPD and asthma treatment drugs, were discovered with computational approaches among trifluoromethyl containing hexahydropyrimidinones/thiones. Compounds [6-(4-ethoxy-3-methoxy-phenyl)-4-hydroxy-2-thioxo-4-(trifluoromethyl)hexahydropyrimidin-5-yl]-phenyl-methanone (THPT-1) and 5-benzoyl-6-(3,4-dimethoxyphenyl)-4-hydroxy-4-(trifluoromethyl)hexahydropyrimidin-2-one (THPO-4) have been proved to be a highly effective (with IC50 values of 1.62 ⋅ 10-7 M and 3.09 ⋅ 10-9 M, respectively) at the same concentrations significantly competitive inhibit the signal conduction through mAChR3 in comparison with ipratropium bromide, without significant effect on mAChR2, nicotinic cholinergic and adrenergic receptors.
{"title":"Computer-aided design of muscarinic acetylcholine receptor M3 inhibitors: Promising compounds among trifluoromethyl containing hexahydropyrimidinones/thiones.","authors":"Alex Nyporko, Olga Tsymbalyuk, Ivan Voiteshenko, Sergiy Starosyla, Mykola Protopopov, Volodymyr Bdzhola","doi":"10.1002/minf.202300006","DOIUrl":"10.1002/minf.202300006","url":null,"abstract":"<p><p>The new high selective mAChRs M3 inhibitors with IC<sub>50</sub> in nanomolecular ranges, which can be the prototypes for effective COPD and asthma treatment drugs, were discovered with computational approaches among trifluoromethyl containing hexahydropyrimidinones/thiones. Compounds [6-(4-ethoxy-3-methoxy-phenyl)-4-hydroxy-2-thioxo-4-(trifluoromethyl)hexahydropyrimidin-5-yl]-phenyl-methanone (THPT-1) and 5-benzoyl-6-(3,4-dimethoxyphenyl)-4-hydroxy-4-(trifluoromethyl)hexahydropyrimidin-2-one (THPO-4) have been proved to be a highly effective (with IC<sub>50</sub> values of 1.62 ⋅ 10<sup>-7</sup> M and 3.09 ⋅ 10<sup>-9</sup> M, respectively) at the same concentrations significantly competitive inhibit the signal conduction through mAChR3 in comparison with ipratropium bromide, without significant effect on mAChR2, nicotinic cholinergic and adrenergic receptors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"42 8-9","pages":"e2300006"},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10227176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01Epub Date: 2023-07-19DOI: 10.1002/minf.202300026
Mariia Radaeva, Helene Morin, Mohit Pandey, Fuqiang Ban, Maria Guo, Eric LeBlanc, Nada Lallous, Artem Cherkasov
Androgen receptor (AR) inhibition remains the primary strategy to combat the progression of prostate cancer (PC). However, all clinically used AR inhibitors target the ligand-binding domain (LBD), which is highly susceptible to truncations through splicing or mutations that confer drug resistance. Thus, there exists an urgent need for AR inhibitors with novel modes of action. We thus launched a virtual screening of an ultra-large chemical library to find novel inhibitors of the AR DNA-binding domain (DBD) at two sites: protein-DNA interface (P-box) and dimerization site (D-box). The compounds selected through vigorous computational filtering were then experimentally validated. We identified several novel chemotypes that effectively suppress transcriptional activity of AR and its splice variant V7. The identified compounds represent previously unexplored chemical scaffolds with a mechanism of action that evades the conventional drug resistance manifested through LBD mutations. Additionally, we describe the binding features required to inhibit AR DBD at both P-box and D-box target sites.
{"title":"Novel Inhibitors of androgen receptor's DNA binding domain identified using an ultra-large virtual screening.","authors":"Mariia Radaeva, Helene Morin, Mohit Pandey, Fuqiang Ban, Maria Guo, Eric LeBlanc, Nada Lallous, Artem Cherkasov","doi":"10.1002/minf.202300026","DOIUrl":"10.1002/minf.202300026","url":null,"abstract":"<p><p>Androgen receptor (AR) inhibition remains the primary strategy to combat the progression of prostate cancer (PC). However, all clinically used AR inhibitors target the ligand-binding domain (LBD), which is highly susceptible to truncations through splicing or mutations that confer drug resistance. Thus, there exists an urgent need for AR inhibitors with novel modes of action. We thus launched a virtual screening of an ultra-large chemical library to find novel inhibitors of the AR DNA-binding domain (DBD) at two sites: protein-DNA interface (P-box) and dimerization site (D-box). The compounds selected through vigorous computational filtering were then experimentally validated. We identified several novel chemotypes that effectively suppress transcriptional activity of AR and its splice variant V7. The identified compounds represent previously unexplored chemical scaffolds with a mechanism of action that evades the conventional drug resistance manifested through LBD mutations. Additionally, we describe the binding features required to inhibit AR DBD at both P-box and D-box target sites.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"42 8-9","pages":"e2300026"},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10218099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-01Epub Date: 2023-08-21DOI: 10.1002/minf.202300019
Dimitra-Danai Varsou, Haralambos Sarimveis
In this study we present deimos, a computational methodology for optimal grouping, applied on the read-across prediction of engineered nanomaterials' (ENMs) toxicity-related properties. The method is based on the formulation and the solution of a mixed-integer optimization program (MILP) problem that automatically and simultaneously performs feature selection, defines the grouping boundaries according to the response variable and develops linear regression models in each group. For each group/region, the characteristic centroid is defined in order to allocate untested ENMs to the groups. The deimos MILP problem is integrated in a broader optimization workflow that selects the best performing methodology between the standard multiple linear regression (MLR), the least absolute shrinkage and selection operator (LASSO) models and the proposed deimos multiple-region model. The performance of the suggested methodology is demonstrated through the application to benchmark ENMs datasets and comparison with other predictive modelling approaches. However, the proposed method can be applied to property prediction of other than ENM chemical entities and it is not limited to ENMs toxicity prediction.
{"title":"Deimos: A novel automated methodology for optimal grouping. Application to nanoinformatics case studies.","authors":"Dimitra-Danai Varsou, Haralambos Sarimveis","doi":"10.1002/minf.202300019","DOIUrl":"10.1002/minf.202300019","url":null,"abstract":"<p><p>In this study we present deimos, a computational methodology for optimal grouping, applied on the read-across prediction of engineered nanomaterials' (ENMs) toxicity-related properties. The method is based on the formulation and the solution of a mixed-integer optimization program (MILP) problem that automatically and simultaneously performs feature selection, defines the grouping boundaries according to the response variable and develops linear regression models in each group. For each group/region, the characteristic centroid is defined in order to allocate untested ENMs to the groups. The deimos MILP problem is integrated in a broader optimization workflow that selects the best performing methodology between the standard multiple linear regression (MLR), the least absolute shrinkage and selection operator (LASSO) models and the proposed deimos multiple-region model. The performance of the suggested methodology is demonstrated through the application to benchmark ENMs datasets and comparison with other predictive modelling approaches. However, the proposed method can be applied to property prediction of other than ENM chemical entities and it is not limited to ENMs toxicity prediction.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"42 8-9","pages":"e2300019"},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10575617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Computational de novo drug design is a challenging issue in medicine, and it is desirable to consider all of the relevant information of the biological systems in a disease state. Here, we propose a novel computational method to generate drug candidate molecular structures from patient gene expression profiles via deep learning, which we call DRAGONET. Our model can generate new molecules that are likely to counteract disease-specific gene expression patterns in patients, which is made possible by exploring the latent space constructed by a transformer-based variational autoencoder and integrating the substructures of disease-correlated molecules. We applied DRAGONET to generate drug candidate molecules for gastric cancer, atopic dermatitis, and Alzheimer's disease, and demonstrated that the newly generated molecules were chemically similar to registered drugs for each disease. This approach is applicable to diseases with unknown therapeutic target proteins and will make a significant contribution to the field of precision medicine.
{"title":"De novo drug design based on patient gene expression profiles via deep learning.","authors":"Chikashige Yamanaka, Shunya Uki, Kazuma Kaitoh, Michio Iwata, Yoshihiro Yamanishi","doi":"10.1002/minf.202300064","DOIUrl":"10.1002/minf.202300064","url":null,"abstract":"<p><p>Computational de novo drug design is a challenging issue in medicine, and it is desirable to consider all of the relevant information of the biological systems in a disease state. Here, we propose a novel computational method to generate drug candidate molecular structures from patient gene expression profiles via deep learning, which we call DRAGONET. Our model can generate new molecules that are likely to counteract disease-specific gene expression patterns in patients, which is made possible by exploring the latent space constructed by a transformer-based variational autoencoder and integrating the substructures of disease-correlated molecules. We applied DRAGONET to generate drug candidate molecules for gastric cancer, atopic dermatitis, and Alzheimer's disease, and demonstrated that the newly generated molecules were chemically similar to registered drugs for each disease. This approach is applicable to diseases with unknown therapeutic target proteins and will make a significant contribution to the field of precision medicine.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"42 8-9","pages":"e2300064"},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10576146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ashok Kumar, Ravi Singh, Powsali Ghosh, Ankit Ganeshpurkar, *. Asha, Rayala Swetha, Ravi Singh, Dileep Kumar, Sudheer Kumar Singh
Blood‐Brain‐Barrier (BBB) permeability is one of the critical factors in the success and failure of CNS drug development. The most accurate method of measuring BBB permeability involves clinical experiments, which are labour‐intensive and time‐consuming. Thus, numerous efforts were made to use artificial intelligence (AI) to predict molecules′ BBB permeability. Most of the previous models are based on calculated descriptors and molecular fingerprints. In the present work, we have developed an NLP‐based feature extraction technique in Deep‐Learning models to predict BBB permeability. We have used the B3DB database and generated SELFIES to extract features from the molecules. We have employed word level and N‐gram tokenization to represent words into numeric vectors. The extracted features were fed into several Artificial Neural Network (ANN) and Bi‐directional Long Short‐Term Memory (LSTM) models. The model, ANN‐10 built using ANN and 6‐gram tokenization, performed best on the independent test set. The accuracy, precision, recall, F1, specificity and AUC of ROC scores were found to be 0.89, 0.91, 0.91, 0.91, 0.85 and 0.90. Thus, the developed model can be used for the early screening of CNS drugs.
{"title":"Natural‐Language Processing (NLP) based feature extraction technique in Deep‐Learning model to predict the Blood‐Brain‐Barrier permeability of molecules","authors":"Ashok Kumar, Ravi Singh, Powsali Ghosh, Ankit Ganeshpurkar, *. Asha, Rayala Swetha, Ravi Singh, Dileep Kumar, Sudheer Kumar Singh","doi":"10.1002/minf.202200271","DOIUrl":"https://doi.org/10.1002/minf.202200271","url":null,"abstract":"Blood‐Brain‐Barrier (BBB) permeability is one of the critical factors in the success and failure of CNS drug development. The most accurate method of measuring BBB permeability involves clinical experiments, which are labour‐intensive and time‐consuming. Thus, numerous efforts were made to use artificial intelligence (AI) to predict molecules′ BBB permeability. Most of the previous models are based on calculated descriptors and molecular fingerprints. In the present work, we have developed an NLP‐based feature extraction technique in Deep‐Learning models to predict BBB permeability. We have used the B3DB database and generated SELFIES to extract features from the molecules. We have employed word level and N‐gram tokenization to represent words into numeric vectors. The extracted features were fed into several Artificial Neural Network (ANN) and Bi‐directional Long Short‐Term Memory (LSTM) models. The model, ANN‐10 built using ANN and 6‐gram tokenization, performed best on the independent test set. The accuracy, precision, recall, F1, specificity and AUC of ROC scores were found to be 0.89, 0.91, 0.91, 0.91, 0.85 and 0.90. Thus, the developed model can be used for the early screening of CNS drugs.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"1 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2023-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41857571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The paper presents the VEGA Online web service, which includes a set of freely available tools deriving from the development of the VEGA suite of programs. In detail, the paper is focused on two tools: the VEGA Web Edition (WE) and the Score tool. The former is a versatile file format converter including relevant features for 2D/3D conversion, for surface mapping and for editing/preparing input files. The Score application allows rescoring docking poses and in particular includes the MLP Interactions Scores (MLPInS) for describing hydrophobic interactions. To the best of our knowledge, this web service is the only available resource by which one can calculate both the virtual log P of a given input molecule according to the MLP approach plus the corresponding MLP surface.
本文介绍了VEGA在线web服务,其中包括一套源自VEGA程序套件开发的免费工具。具体来说,本文重点介绍了两个工具:VEGA Web Edition (WE)和Score工具。前者是一个通用的文件格式转换器,包括2D/3D转换、表面映射和编辑/准备输入文件的相关功能。Score应用程序允许对对接姿势进行评分,特别是包括MLP相互作用分数(mlpin),用于描述疏水相互作用。据我们所知,这个web服务是唯一可用的资源,通过它可以根据MLP方法和相应的MLP表面计算给定输入分子的虚拟log P。
{"title":"The VEGA web service: multipurpose online tools for molecular modelling and docking analyses.","authors":"Alessandro Pedretti, Serena Vittorio, Emanuela Sabato, Giulio Vistoli, Angelica Mazzolari","doi":"10.1002/minf.202300018","DOIUrl":"https://doi.org/10.1002/minf.202300018","url":null,"abstract":"<p><p>The paper presents the VEGA Online web service, which includes a set of freely available tools deriving from the development of the VEGA suite of programs. In detail, the paper is focused on two tools: the VEGA Web Edition (WE) and the Score tool. The former is a versatile file format converter including relevant features for 2D/3D conversion, for surface mapping and for editing/preparing input files. The Score application allows rescoring docking poses and in particular includes the MLP Interactions Scores (MLPInS) for describing hydrophobic interactions. To the best of our knowledge, this web service is the only available resource by which one can calculate both the virtual log P of a given input molecule according to the MLP approach plus the corresponding MLP surface.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"42 7","pages":"e2300018"},"PeriodicalIF":3.6,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9790706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}