首页 > 最新文献

Molecular Informatics最新文献

英文 中文
Similarity searching for anticandidal agents employing a repurposing approach 采用再利用方法进行抗念珠菌药剂的相似性搜索
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-12-14 DOI: 10.1002/minf.202300206
Jaime Pérez-Villanueva, Karen Rodríguez-Villar, Francisco Cortés-Benítez, Juan Francisco Palacios-Espinosa
Fungal infections caused by <i>Candida</i> are still a public health concern. Particularly, the resistance to traditional chemotherapeutic agents is a major issue that requires efforts to develop new therapies. One of the most interesting approaches to finding new active compounds is drug repurposing aided by computational methods. In this work, two databases containing anticandidal agents and drugs were studied employing cheminformatics and compared by similarity methods. The results showed 36 drugs with high similarities to some candicidals. From these drugs, trimetozin, osalmid and metochalcone were evaluated against <i>C. albicans</i> (18804), <i>C. glabrata</i> (90030), and miconazole-resistant strain <i>C. glabrata</i> (32554). Osalmid and metochalcone were the best, with activity in the micromolar range. These findings represent an opportunity to continue with the research on the potential antifungal application of osalmid and metochalcone as well as the design of structurally related derivatives.
由<i>念珠菌</i>引起的真菌感染仍然是一个公共卫生问题。特别是,对传统化疗药物的抗药性是一个重大问题,需要努力开发新的疗法。寻找新的活性化合物最有趣的方法之一是在计算方法的帮助下进行药物再利用。在这项工作中,利用化学信息学研究了两个包含抗念珠菌药剂和药物的数据库,并通过相似性方法进行了比较。结果显示,有 36 种药物与某些抗念珠菌药具有高度相似性。在这些药物中,三甲唑嗪、osalmid和metochalcone针对白僵菌<i>(18804)、绿僵菌<i>(90030)和抗咪康唑菌株<i>绿僵菌</i>(32554)进行了评估。Osalmid 和 metochalcone 的效果最好,活性在微摩尔范围内。这些发现为继续研究 osalmid 和 metochalcone 的潜在抗真菌应用以及设计结构相关的衍生物提供了机会。
{"title":"Similarity searching for anticandidal agents employing a repurposing approach","authors":"Jaime Pérez-Villanueva, Karen Rodríguez-Villar, Francisco Cortés-Benítez, Juan Francisco Palacios-Espinosa","doi":"10.1002/minf.202300206","DOIUrl":"https://doi.org/10.1002/minf.202300206","url":null,"abstract":"Fungal infections caused by &lt;i&gt;Candida&lt;/i&gt; are still a public health concern. Particularly, the resistance to traditional chemotherapeutic agents is a major issue that requires efforts to develop new therapies. One of the most interesting approaches to finding new active compounds is drug repurposing aided by computational methods. In this work, two databases containing anticandidal agents and drugs were studied employing cheminformatics and compared by similarity methods. The results showed 36 drugs with high similarities to some candicidals. From these drugs, trimetozin, osalmid and metochalcone were evaluated against &lt;i&gt;C. albicans&lt;/i&gt; (18804), &lt;i&gt;C. glabrata&lt;/i&gt; (90030), and miconazole-resistant strain &lt;i&gt;C. glabrata&lt;/i&gt; (32554). Osalmid and metochalcone were the best, with activity in the micromolar range. These findings represent an opportunity to continue with the research on the potential antifungal application of osalmid and metochalcone as well as the design of structurally related derivatives.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"5 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138691835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of tree-based machine learning methods to screen affinitive peptides based on docking data. 使用基于树的机器学习方法筛选基于对接数据的亲和肽。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-12-01 Epub Date: 2023-11-09 DOI: 10.1002/minf.202300143
Hua Feng, Fangyu Wang, Ning Li, Qian Xu, Guanming Zheng, Xuefeng Sun, Man Hu, Xuewu Li, Guangxu Xing, Gaiping Zhang

Screening peptides with good affinity is an important step in peptide-drug discovery. Recent advancement in computer and data science have made machine learning a useful tool in accurately affinitive-peptide screening. In current study, four different tree-based algorithms, including Classification and regression trees (CART), C5.0 decision tree (C50), Bagged CART (BAG) and Random Forest (RF), were employed to explore the relationship between experimental peptide affinities and virtual docking data, and the performance of each model was also compared in parallel. All four algorithms showed better performances on dataset pre-scaled, -centered and -PCA than other pre-processed dataset. After model re-built and hyperparameter optimization, the optimal C50 model (C50O) showed the best performances in terms of Accuracy, Kappa, Sensitivity, Specificity, F1, MCC and AUC when validated on test data and an unknown PEDV datasets evaluation (Accuracy=80.4 %). BAG and RFO (the optimal RF), as two best models during training process, did not performed as expecting during in testing and unknown dataset validations. Furthermore, the high correlation of the predictions of RFO and BAG to C50O implied the high stability and robustness of their prediction. Whereas although the good performance on unknown dataset, the poor performance in test data validation and correlation analysis indicated CARTO could not be used for future data prediction. To accurately evaluate the peptide affinity, the current study firstly gave a tree-model competition on affinitive peptide prediction by using virtual docking data, which would expand the application of machine learning algorithms in studying PepPIs and benefit the development of peptide therapeutics.

筛选具有良好亲和力的多肽是多肽药物开发的重要步骤。计算机和数据科学的最新进展使机器学习成为准确筛选亲和肽的有用工具。本研究采用分类与回归树(CART)、C5.0决策树(C50)、Bagged CART (BAG)和Random Forest (RF) 4种不同的基于树的算法,探讨实验肽亲和度与虚拟对接数据之间的关系,并并行比较各模型的性能。四种算法在数据集预缩放、中心化和主成分分析方面均表现出较好的性能。经过模型重建和超参数优化,最优C50模型(C50O)在测试数据验证和未知PEDV数据集评估中,在准确率、Kappa、灵敏度、特异性、F1、MCC和AUC方面表现最佳(准确率= 80.4%)。BAG和RFO(最优RF)作为训练过程中的两个最佳模型,在测试和未知数据集验证过程中表现不如预期。此外,RFO和BAG对C50O的预测具有较高的相关性,表明其预测具有较高的稳定性和鲁棒性。然而,尽管CARTO在未知数据上具有良好的性能,但在测试数据验证和相关性分析方面的性能较差,表明CARTO不能用于未来的数据预测。为了准确评估肽的亲和性,本研究首先利用虚拟对接数据对亲和肽预测进行了树模型竞争,这将扩大机器学习算法在PepPIs研究中的应用,有利于肽疗法的发展。
{"title":"Use of tree-based machine learning methods to screen affinitive peptides based on docking data.","authors":"Hua Feng, Fangyu Wang, Ning Li, Qian Xu, Guanming Zheng, Xuefeng Sun, Man Hu, Xuewu Li, Guangxu Xing, Gaiping Zhang","doi":"10.1002/minf.202300143","DOIUrl":"10.1002/minf.202300143","url":null,"abstract":"<p><p>Screening peptides with good affinity is an important step in peptide-drug discovery. Recent advancement in computer and data science have made machine learning a useful tool in accurately affinitive-peptide screening. In current study, four different tree-based algorithms, including Classification and regression trees (CART), C5.0 decision tree (C50), Bagged CART (BAG) and Random Forest (RF), were employed to explore the relationship between experimental peptide affinities and virtual docking data, and the performance of each model was also compared in parallel. All four algorithms showed better performances on dataset pre-scaled, -centered and -PCA than other pre-processed dataset. After model re-built and hyperparameter optimization, the optimal C50 model (C50O) showed the best performances in terms of Accuracy, Kappa, Sensitivity, Specificity, F1, MCC and AUC when validated on test data and an unknown PEDV datasets evaluation (Accuracy=80.4 %). BAG and RFO (the optimal RF), as two best models during training process, did not performed as expecting during in testing and unknown dataset validations. Furthermore, the high correlation of the predictions of RFO and BAG to C50O implied the high stability and robustness of their prediction. Whereas although the good performance on unknown dataset, the poor performance in test data validation and correlation analysis indicated CARTO could not be used for future data prediction. To accurately evaluate the peptide affinity, the current study firstly gave a tree-model competition on affinitive peptide prediction by using virtual docking data, which would expand the application of machine learning algorithms in studying PepPIs and benefit the development of peptide therapeutics.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300143"},"PeriodicalIF":3.6,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10212086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting the duration of action of β2-adrenergic receptor agonists: Ligand and structure-based approaches. 预测β2-肾上腺素能受体激动剂的作用持续时间:基于配体和结构的方法。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-12-01 Epub Date: 2023-11-09 DOI: 10.1002/minf.202300141
Luca Chiesa, Emilie Sick, Esther Kellenberger

Agonists of the β2 adrenergic receptor (ADRB2) are an important class of medications used for the treatment of respiratory diseases. They can be classified as short acting (SABA) or long acting (LABA), with each class playing a different role in patient management. In this work we explored both ligand-based and structure-based high-throughput approaches to classify β2-agonists based on their duration of action. A completely in-silico prediction pipeline using an AlphaFold generated structure was used for structure-based modelling. Our analysis identified the ligands' 3D structure and lipophilicity as the most relevant features for the prediction of the duration of action. Interaction-based methods were also able to select ligands with the desired duration of action, incorporating the bias directly in the structure-based drug discovery pipeline without the need for further processing.

β2肾上腺素能受体激动剂(ADRB2)是一类用于治疗呼吸道疾病的重要药物。它们可以分为短效(SABA)或长效(LABA),每一类在患者管理中扮演不同的角色。在这项工作中,我们探索了基于配体和基于结构的高通量方法,根据β2-激动剂的作用时间对其进行分类。使用AlphaFold生成结构的完全计算机预测管道用于基于结构的建模。我们的分析确定配体的3D结构和亲脂性是预测作用持续时间的最相关特征。基于相互作用的方法也能够选择具有所需作用持续时间的配体,将偏差直接纳入基于结构的药物发现管道中,而无需进一步处理。
{"title":"Predicting the duration of action of β2-adrenergic receptor agonists: Ligand and structure-based approaches.","authors":"Luca Chiesa, Emilie Sick, Esther Kellenberger","doi":"10.1002/minf.202300141","DOIUrl":"10.1002/minf.202300141","url":null,"abstract":"<p><p>Agonists of the β2 adrenergic receptor (ADRB2) are an important class of medications used for the treatment of respiratory diseases. They can be classified as short acting (SABA) or long acting (LABA), with each class playing a different role in patient management. In this work we explored both ligand-based and structure-based high-throughput approaches to classify β2-agonists based on their duration of action. A completely in-silico prediction pipeline using an AlphaFold generated structure was used for structure-based modelling. Our analysis identified the ligands' 3D structure and lipophilicity as the most relevant features for the prediction of the duration of action. Interaction-based methods were also able to select ligands with the desired duration of action, incorporating the bias directly in the structure-based drug discovery pipeline without the need for further processing.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300141"},"PeriodicalIF":3.6,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49691521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An in silico investigation of Kv2.1 potassium channel: Model building and inhibitors binding sites analysis. Kv2.1钾通道的计算机研究:模型构建和抑制剂结合位点分析。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-12-01 Epub Date: 2023-11-07 DOI: 10.1002/minf.202300072
Xiaoyu Wang, Xinyuan Zhang, Jie Zhou, Weiping Wang, Xiaoliang Wang, Bailing Xu

Kv2.1 is widely expressed in brain, and inhibiting Kv2.1 is a potential strategy to prevent cell death and achieve neuroprotection in ischemic stroke. Herein, an in silico model of Kv2.1 tetramer structure was constructed by employing the AlphaFold-Multimer deep learning method to facilitate the rational discovery of Kv2.1 inhibitors. GaMD was utilized to create an ion transporting trajectory, which was analyzed with HMM to generate multiple representative receptor conformations. The binding site of RY785 and RY796(S) under the P-loop was defined with Fpocket program together with the competitive binding electrophysiology assay. The docking poses of the two inhibitors were predicted with the aid of the semi-empirical quantum mechanical calculation, and the IGMH results suggested that Met375, Thr376, and Thr377 of the P-helix and Ile405 of the S6 segment made significant contributions to the binding affinity. These results provided insights for rational molecular design to develop novel Kv2.1 inhibitors.

在计算机中,Kv2.1在大脑中广泛表达,抑制Kv2.1是预防细胞死亡和实现缺血性中风神经保护的潜在策略。本文采用AlphaFold Multimer深度学习方法构建了Kv2.1四聚体结构模型,以促进Kv2.1抑制剂的合理发现。利用GaMD创建离子传输轨迹,用HMM分析该轨迹以产生多个代表性受体构象。用Fpocket程序和竞争性结合电生理测定法确定了RY785和RY796(S)的结合位点和P-环。借助半经验量子力学计算预测了两种抑制剂的对接姿态,IGMH结果表明,P-螺旋的Met375、Thr376和Thr377以及S6片段的Ile405对结合亲和力做出了显著贡献。这些结果为开发新型Kv2.1抑制剂的合理分子设计提供了见解。
{"title":"An in silico investigation of Kv2.1 potassium channel: Model building and inhibitors binding sites analysis.","authors":"Xiaoyu Wang, Xinyuan Zhang, Jie Zhou, Weiping Wang, Xiaoliang Wang, Bailing Xu","doi":"10.1002/minf.202300072","DOIUrl":"10.1002/minf.202300072","url":null,"abstract":"<p><p>Kv2.1 is widely expressed in brain, and inhibiting Kv2.1 is a potential strategy to prevent cell death and achieve neuroprotection in ischemic stroke. Herein, an in silico model of Kv2.1 tetramer structure was constructed by employing the AlphaFold-Multimer deep learning method to facilitate the rational discovery of Kv2.1 inhibitors. GaMD was utilized to create an ion transporting trajectory, which was analyzed with HMM to generate multiple representative receptor conformations. The binding site of RY785 and RY796(S) under the P-loop was defined with Fpocket program together with the competitive binding electrophysiology assay. The docking poses of the two inhibitors were predicted with the aid of the semi-empirical quantum mechanical calculation, and the IGMH results suggested that Met375, Thr376, and Thr377 of the P-helix and Ile405 of the S6 segment made significant contributions to the binding affinity. These results provided insights for rational molecular design to develop novel Kv2.1 inhibitors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300072"},"PeriodicalIF":3.6,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41141371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of tastants: A deep learning based approach. 味觉分类:一种基于深度学习的方法。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-12-01 Epub Date: 2023-11-09 DOI: 10.1002/minf.202300146
Prantar Dutta, Deepak Jain, Rakesh Gupta, Beena Rai

Predicting the taste of molecules is of critical importance in the food and beverages, flavor, and pharmaceutical industries for the design and screening of new tastants. In this work, we have built deep learning models to classify sweet, bitter, and umami molecules- the three basic tastes whose sensation is mediated by G protein-coupled receptors. An extensive dataset containing 1466 bitter, 1764 sweet, and 238 umami tastants was curated from existing literature. We analyzed the chemical characteristics of the molecules, with special focus on the presence of different functional groups. A deep neural network model based on molecular descriptors and a graph neural network model were trained for taste prediction. The class imbalance due to fewer umami molecules was tackled using special sampling techniques. Both models show comparable performance during evaluation, but the graph-based model can learn task-specific representations from the molecular structure without requiring handcrafted features. We further explain the deep neural network predictions using Shapley additive explanations. Finally, we demonstrated the applicability of the models by screening bitter, sweet, and umami molecules from a large food database. This study develops an in-silico approach to classify molecules based on their taste by leveraging the recent progress in deep learning, which can serve as a powerful tool for tastant design.

在食品和饮料、香料和制药行业,预测分子的味道对于设计和筛选新的品尝剂至关重要。在这项工作中,我们建立了深度学习模型来对甜味、苦味和鲜味分子进行分类,这三种基本味道的感觉是由G蛋白偶联受体介导的。根据现有文献整理了一个包含1466种苦味、1764种甜味和238种鲜味品尝剂的广泛数据集。我们分析了分子的化学特征,特别关注不同官能团的存在。训练了一个基于分子描述符的深度神经网络模型和一个图神经网络模型用于味觉预测。鲜味分子减少导致的类别不平衡是通过特殊的采样技术解决的。这两个模型在评估过程中表现出相当的性能,但基于图的模型可以从分子结构中学习特定任务的表示,而不需要手工制作的特征。我们使用Shapley加性解释进一步解释了深度神经网络预测。最后,我们通过从大型食品数据库中筛选苦味、甜味和鲜味分子,证明了模型的适用性。这项研究利用深度学习的最新进展,开发了一种基于味道对分子进行分类的计算机方法,这可以作为品尝剂设计的强大工具。
{"title":"Classification of tastants: A deep learning based approach.","authors":"Prantar Dutta, Deepak Jain, Rakesh Gupta, Beena Rai","doi":"10.1002/minf.202300146","DOIUrl":"10.1002/minf.202300146","url":null,"abstract":"<p><p>Predicting the taste of molecules is of critical importance in the food and beverages, flavor, and pharmaceutical industries for the design and screening of new tastants. In this work, we have built deep learning models to classify sweet, bitter, and umami molecules- the three basic tastes whose sensation is mediated by G protein-coupled receptors. An extensive dataset containing 1466 bitter, 1764 sweet, and 238 umami tastants was curated from existing literature. We analyzed the chemical characteristics of the molecules, with special focus on the presence of different functional groups. A deep neural network model based on molecular descriptors and a graph neural network model were trained for taste prediction. The class imbalance due to fewer umami molecules was tackled using special sampling techniques. Both models show comparable performance during evaluation, but the graph-based model can learn task-specific representations from the molecular structure without requiring handcrafted features. We further explain the deep neural network predictions using Shapley additive explanations. Finally, we demonstrated the applicability of the models by screening bitter, sweet, and umami molecules from a large food database. This study develops an in-silico approach to classify molecules based on their taste by leveraging the recent progress in deep learning, which can serve as a powerful tool for tastant design.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300146"},"PeriodicalIF":3.6,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"54230131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
AliNA - a deep learning program for RNA secondary structure prediction. AliNA-一个用于RNA二级结构预测的深度学习程序。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-12-01 Epub Date: 2023-11-02 DOI: 10.1002/minf.202300113
Shamsudin S Nasaev, Artem R Mukanov, Ivan I Kuznetsov, Alexander V Veselovsky

Nowadays there are numerous discovered natural RNA variations participating in different cellular processes and artificial RNA, e. g., aptamers, riboswitches. One of the required tasks in the investigation of their functions and mechanism of influence on cells and interaction with targets is the prediction of RNA secondary structures. The classic thermodynamic-based prediction algorithms do not consider the specificity of biological folding and deep learning methods that were designed to resolve this issue suffer from homology-based methods problems. Herein, we present a method for RNA secondary structure prediction based on deep learning - AliNA (ALIgned Nucleic Acids). Our method successfully predicts secondary structures for non-homologous to train-data RNA families thanks to usage of the data augmentation techniques. Augmentation extends existing datasets with easily-accessible simulated data. The proposed method shows a high quality of prediction across different benchmarks including pseudoknots. The method is available on GitHub for free (https://github.com/Arty40m/AliNA).

目前已经发现了许多参与不同细胞过程的天然RNA变体和人工RNA。 g.适体、核糖开关。在研究它们的功能和对细胞的影响机制以及与靶标的相互作用时,所需的任务之一是预测RNA二级结构。经典的基于热力学的预测算法没有考虑生物折叠的特异性,为解决这一问题而设计的深度学习方法存在基于同源性的方法问题。在此,我们提出了一种基于深度学习的RNA二级结构预测方法——AliNA(ALIgned Nucleic Acids)。由于使用了数据扩增技术,我们的方法成功地预测了非同源的二级结构以训练数据RNA家族。增强功能利用易于访问的模拟数据扩展了现有数据集。所提出的方法在包括伪节点在内的不同基准上显示出高质量的预测。该方法在GitHub上免费提供(https://github.com/Arty40m/AliNA)。
{"title":"AliNA - a deep learning program for RNA secondary structure prediction.","authors":"Shamsudin S Nasaev, Artem R Mukanov, Ivan I Kuznetsov, Alexander V Veselovsky","doi":"10.1002/minf.202300113","DOIUrl":"10.1002/minf.202300113","url":null,"abstract":"<p><p>Nowadays there are numerous discovered natural RNA variations participating in different cellular processes and artificial RNA, e. g., aptamers, riboswitches. One of the required tasks in the investigation of their functions and mechanism of influence on cells and interaction with targets is the prediction of RNA secondary structures. The classic thermodynamic-based prediction algorithms do not consider the specificity of biological folding and deep learning methods that were designed to resolve this issue suffer from homology-based methods problems. Herein, we present a method for RNA secondary structure prediction based on deep learning - AliNA (ALIgned Nucleic Acids). Our method successfully predicts secondary structures for non-homologous to train-data RNA families thanks to usage of the data augmentation techniques. Augmentation extends existing datasets with easily-accessible simulated data. The proposed method shows a high quality of prediction across different benchmarks including pseudoknots. The method is available on GitHub for free (https://github.com/Arty40m/AliNA).</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300113"},"PeriodicalIF":3.6,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10591141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of novel ligands against SARS-CoV-2 Mpro enzyme: an in silico and in vitro Study. 新型抗SARS-CoV-2 Mpro酶配体的研制:硅化和体外研究
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-11-01 Epub Date: 2023-09-06 DOI: 10.1002/minf.202300120
Navid Kaboudi, Nadine Krüger, Maryam Hamzeh-Mivehroud

Background: Despite tremendous efforts made by scientific community during the outbreak of COVID-19 pandemic, this disease still remains as a public health concern. Although different types of vaccines were globally used to reduce the mortality, emergence of new variants of SARS-CoV-2 is a challenging issue in COVID-19 pharmacotherapy. In this context, target therapy of SARS-CoV-2 by small ligands is a promising strategy.

Methods: In this investigation, we applied ligand-based virtual screening for finding novel molecules based on nirmatrelvir structure. Various criteria including drug-likeness, ADME, and toxicity properties were applied for filtering the compounds. The selected candidate molecules were subjected to molecular docking and dynamics simulation for predicting the binding mode and binding free energy, respectively. Then the molecules were experimentally evaluated in terms of antiviral activity against SARS-CoV-2 and toxicity assessment.

Results: The results demonstrated that the identified compounds showed inhibitory activity towards SARS-CoV-2 Mpro .

Conclusion: In summary, the introduced compounds may provide novel scaffold for further structural modification and optimization with improved anti SARS-CoV-2 Mpro activity.

背景:尽管科学界在2019冠状病毒病大流行期间做出了巨大努力,但该疾病仍然是一个公共卫生问题。尽管全球使用了不同类型的疫苗来降低死亡率,但SARS-CoV-2新变种的出现是COVID-19药物治疗中的一个具有挑战性的问题。在这种情况下,小配体靶向治疗SARS-CoV-2是一种很有前途的策略。方法:采用基于配体的虚拟筛选方法,寻找基于nirmatrelvir结构的新分子。各种标准,包括药物相似度,ADME和毒性性能,用于过滤化合物。选择的候选分子分别进行分子对接和动力学模拟,预测结合模式和结合自由能。然后对这些分子进行了抗病毒活性和毒性评价的实验评价。结果:鉴定的化合物对SARS-CoV-2 Mpro具有抑制活性。结论:综上所述,引入的化合物可为进一步的结构修饰和优化提供新的支架,提高抗SARS-CoV-2 Mpro的活性。
{"title":"Development of novel ligands against SARS-CoV-2 M<sup>pro</sup> enzyme: an in silico and in vitro Study.","authors":"Navid Kaboudi, Nadine Krüger, Maryam Hamzeh-Mivehroud","doi":"10.1002/minf.202300120","DOIUrl":"10.1002/minf.202300120","url":null,"abstract":"<p><strong>Background: </strong>Despite tremendous efforts made by scientific community during the outbreak of COVID-19 pandemic, this disease still remains as a public health concern. Although different types of vaccines were globally used to reduce the mortality, emergence of new variants of SARS-CoV-2 is a challenging issue in COVID-19 pharmacotherapy. In this context, target therapy of SARS-CoV-2 by small ligands is a promising strategy.</p><p><strong>Methods: </strong>In this investigation, we applied ligand-based virtual screening for finding novel molecules based on nirmatrelvir structure. Various criteria including drug-likeness, ADME, and toxicity properties were applied for filtering the compounds. The selected candidate molecules were subjected to molecular docking and dynamics simulation for predicting the binding mode and binding free energy, respectively. Then the molecules were experimentally evaluated in terms of antiviral activity against SARS-CoV-2 and toxicity assessment.</p><p><strong>Results: </strong>The results demonstrated that the identified compounds showed inhibitory activity towards SARS-CoV-2 M<sup>pro</sup> .</p><p><strong>Conclusion: </strong>In summary, the introduced compounds may provide novel scaffold for further structural modification and optimization with improved anti SARS-CoV-2 M<sup>pro</sup> activity.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300120"},"PeriodicalIF":3.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10166837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cell-penetrating peptides predictors: A comparative analysis of methods and datasets. 细胞穿透多肽预测因子:方法和数据集的比较分析。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-11-01 Epub Date: 2023-09-06 DOI: 10.1002/minf.202300104
Karen Guerrero-Vázquez, Gabriel Del Rio, Carlos A Brizuela

Cell-Penetrating Peptides (CPP) are emerging as an alternative to small-molecule drugs to expand the range of biomolecules that can be targeted for therapeutic purposes. Due to the importance of identifying and designing new CPP, a great variety of predictors have been developed to achieve these goals. To establish a ranking for these predictors, a couple of recent studies compared their performances on specific datasets, yet their conclusions cannot determine if the ranking obtained is due to the model, the set of descriptors or the datasets used to test the predictors. We present a systematic study of the influence of the peptide sequence's similarity of the datasets on the predictors' performance. The analysis reveals that the datasets used for training have a stronger influence on the predictors performance than the model or descriptors employed. We show that datasets with low sequence similarity between the positive and negative examples can be easily separated, and the tested classifiers showed good performance on them. On the other hand, a dataset with high sequence similarity between CPP and non-CPP will be a hard dataset, and it should be the one to be used for assessing the performance of new predictors.

细胞穿透肽(CPP)正在成为小分子药物的替代品,以扩大可靶向治疗目的的生物分子范围。由于识别和设计新的CPP的重要性,已经开发了各种各样的预测器来实现这些目标。为了建立这些预测因子的排名,最近的一些研究比较了它们在特定数据集上的表现,但他们的结论无法确定所获得的排名是由于模型,描述符集还是用于测试预测因子的数据集。我们提出了一个系统的研究,肽序列的数据集的相似性对预测器的性能的影响。分析表明,用于训练的数据集比所使用的模型或描述符对预测器的性能有更大的影响。结果表明,正反样例序列相似度较低的数据集可以很容易地分离出来,并且所测试的分类器在这些数据集上表现出良好的性能。另一方面,在CPP和非CPP之间具有高序列相似性的数据集将是一个硬数据集,它应该用于评估新预测器的性能。
{"title":"Cell-penetrating peptides predictors: A comparative analysis of methods and datasets.","authors":"Karen Guerrero-Vázquez, Gabriel Del Rio, Carlos A Brizuela","doi":"10.1002/minf.202300104","DOIUrl":"10.1002/minf.202300104","url":null,"abstract":"<p><p>Cell-Penetrating Peptides (CPP) are emerging as an alternative to small-molecule drugs to expand the range of biomolecules that can be targeted for therapeutic purposes. Due to the importance of identifying and designing new CPP, a great variety of predictors have been developed to achieve these goals. To establish a ranking for these predictors, a couple of recent studies compared their performances on specific datasets, yet their conclusions cannot determine if the ranking obtained is due to the model, the set of descriptors or the datasets used to test the predictors. We present a systematic study of the influence of the peptide sequence's similarity of the datasets on the predictors' performance. The analysis reveals that the datasets used for training have a stronger influence on the predictors performance than the model or descriptors employed. We show that datasets with low sequence similarity between the positive and negative examples can be easily separated, and the tested classifiers showed good performance on them. On the other hand, a dataset with high sequence similarity between CPP and non-CPP will be a hard dataset, and it should be the one to be used for assessing the performance of new predictors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300104"},"PeriodicalIF":3.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10225885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Absorption matters: A closer look at popular oral bioavailability rules for drug approvals. 吸收问题:药物批准流行的口服生物利用度规则。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-11-01 Epub Date: 2023-08-31 DOI: 10.1002/minf.202300115
Artur Caminero Gomes Soares, Gustavo Henrique Marques Sousa, Raisa Ludmila Calil, Gustavo Henrique Goulart Trossini
This study examines how two popular drug‐likeness concepts used in early development, Lipinski Rule of Five (Ro5) and Veber's Rules, possibly affected drug profiles of FDA approved drugs since 1997. Our findings suggest that when all criteria are applied, relevant compounds may be excluded, addressing the harmfulness of blindly employing these rules. Of all oral drugs in the period used for this analysis, around 66 % conform to the RO5 and 85 % to Veber's Rules. Molecular Weight and calculated LogP showed low consistent values over time, apart from being the two least followed rules, challenging their relevance. On the other hand, hydrogen bond related rules and the number of rotatable bonds are amongst the most followed criteria and show exceptional consistency over time. Furthermore, our analysis indicates that topological polar surface area and total count of hydrogen bonds cannot be used as interchangeable parameters, contrary to the original proposal. This research enhances the comprehension of drug profiles that were FDA approved in the post‐Lipinski period. Medicinal chemists could utilize these heuristics as a limited guide to direct their exploration of the oral bioavailability chemical space, but they must also steer the wheel to break these rules and explore different regions when necessary.
本研究考察了早期开发中使用的两个流行的药物相似性概念,即利平斯基五规则(Ro5)和韦伯规则,如何可能影响1997年以来FDA批准的药物的药物特征。我们的研究结果表明,当适用所有标准时,相关化合物可能被排除在外,解决盲目使用这些规则的危害。在本分析所使用的所有口服药物中,约66%符合RO5, 85%符合Veber规则。随着时间的推移,分子量和计算的LogP显示出较低的一致性值,除了这两个最不受遵守的规则外,还挑战了它们的相关性。另一方面,氢键相关规则和可旋转键的数量是最受遵循的标准之一,并且随着时间的推移显示出异常的一致性。此外,我们的分析表明,拓扑极性表面积和氢键总数不能用作可互换的参数,这与最初的建议相反。本研究增强了对后利平斯基时期FDA批准的药物概况的理解。药物化学家可以利用这些启发式作为指导他们探索口服生物利用度化学空间的有限指南,但他们也必须驾驭轮子打破这些规则,在必要时探索不同的区域。
{"title":"Absorption matters: A closer look at popular oral bioavailability rules for drug approvals.","authors":"Artur Caminero Gomes Soares, Gustavo Henrique Marques Sousa, Raisa Ludmila Calil, Gustavo Henrique Goulart Trossini","doi":"10.1002/minf.202300115","DOIUrl":"10.1002/minf.202300115","url":null,"abstract":"This study examines how two popular drug‐likeness concepts used in early development, Lipinski Rule of Five (Ro5) and Veber's Rules, possibly affected drug profiles of FDA approved drugs since 1997. Our findings suggest that when all criteria are applied, relevant compounds may be excluded, addressing the harmfulness of blindly employing these rules. Of all oral drugs in the period used for this analysis, around 66 % conform to the RO5 and 85 % to Veber's Rules. Molecular Weight and calculated LogP showed low consistent values over time, apart from being the two least followed rules, challenging their relevance. On the other hand, hydrogen bond related rules and the number of rotatable bonds are amongst the most followed criteria and show exceptional consistency over time. Furthermore, our analysis indicates that topological polar surface area and total count of hydrogen bonds cannot be used as interchangeable parameters, contrary to the original proposal. This research enhances the comprehension of drug profiles that were FDA approved in the post‐Lipinski period. Medicinal chemists could utilize these heuristics as a limited guide to direct their exploration of the oral bioavailability chemical space, but they must also steer the wheel to break these rules and explore different regions when necessary.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300115"},"PeriodicalIF":3.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10117941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven approaches for identifying hyperparameters in multi-step retrosynthesis. 用于识别多步骤逆转录合成中的超参数的数据驱动方法。
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2023-11-01 Epub Date: 2023-09-27 DOI: 10.1002/minf.202300128
Annie M Westerlund, Bente Barge, Lewis Mervin, Samuel Genheden

The multi-step retrosynthesis problem can be solved by a search algorithm, such as Monte Carlo tree search (MCTS). The performance of multistep retrosynthesis, as measured by a trade-off in search time and route solvability, therefore depends on the hyperparameters of the search algorithm. In this paper, we demonstrated the effect of three MCTS hyperparameters (number of iterations, tree depth, and tree width) on metrics such as Linear integrated speed-accuracy score (LISAS) and Inverse efficiency score which consider both route solvability and search time. This exploration was conducted by employing three data-driven approaches, namely a systematic grid search, Bayesian optimization over an ensemble of molecules to obtain static MCTS hyperparameters, and a machine learning approach to dynamically predict optimal MCTS hyperparameters given an input target molecule. With the obtained results on the internal dataset, we demonstrated that it is possible to identify a hyperparameter set which outperforms the current AiZynthFinder default setting. It appeared optimal across a variety of target input molecules, both on proprietary and public datasets. The settings identified with the in-house dataset reached a solvability of 93 % and median search time of 151 s for the in-house dataset, and a 74 % solvability and 114 s for the ChEMBL dataset. These numbers can be compared to the current default settings which solved 85 % and 73 % during a median time of 110s and 84 s, for in-house and ChEMBL, respectively.

多步逆合成问题可以通过搜索算法来解决,例如蒙特卡罗树搜索(MCTS)。因此,通过搜索时间和路径可解性的权衡来衡量多步逆合成的性能,取决于搜索算法的超参数。在本文中,我们证明了三个MCTS超参数(迭代次数、树深度和树宽度)对线性综合速度精度分数(LISAS)和反向效率分数等指标的影响,这些指标同时考虑了路径可解性和搜索时间。这项探索是通过采用三种数据驱动的方法进行的,即系统网格搜索、对分子集合进行贝叶斯优化以获得静态MCTS超参数,以及在给定输入目标分子的情况下动态预测最佳MCTS超参数的机器学习方法。根据在内部数据集上获得的结果,我们证明了可以识别出一个超参数集,该超参数集的性能优于当前的AiZynhFinder默认设置。在专有和公共数据集上,它似乎在各种目标输入分子中都是最优的。内部数据集确定的设置达到了93的可解性 % 搜索时间中位数为151 s表示内部数据集,74 % 可解性和114 s用于ChEMBL数据集。这些数字可以与当前默认设置进行比较,解决了85 % 和73 % 在110秒和84秒的中间时间内 s、 分别用于内部和ChEMBL。
{"title":"Data-driven approaches for identifying hyperparameters in multi-step retrosynthesis.","authors":"Annie M Westerlund, Bente Barge, Lewis Mervin, Samuel Genheden","doi":"10.1002/minf.202300128","DOIUrl":"10.1002/minf.202300128","url":null,"abstract":"<p><p>The multi-step retrosynthesis problem can be solved by a search algorithm, such as Monte Carlo tree search (MCTS). The performance of multistep retrosynthesis, as measured by a trade-off in search time and route solvability, therefore depends on the hyperparameters of the search algorithm. In this paper, we demonstrated the effect of three MCTS hyperparameters (number of iterations, tree depth, and tree width) on metrics such as Linear integrated speed-accuracy score (LISAS) and Inverse efficiency score which consider both route solvability and search time. This exploration was conducted by employing three data-driven approaches, namely a systematic grid search, Bayesian optimization over an ensemble of molecules to obtain static MCTS hyperparameters, and a machine learning approach to dynamically predict optimal MCTS hyperparameters given an input target molecule. With the obtained results on the internal dataset, we demonstrated that it is possible to identify a hyperparameter set which outperforms the current AiZynthFinder default setting. It appeared optimal across a variety of target input molecules, both on proprietary and public datasets. The settings identified with the in-house dataset reached a solvability of 93 % and median search time of 151 s for the in-house dataset, and a 74 % solvability and 114 s for the ChEMBL dataset. These numbers can be compared to the current default settings which solved 85 % and 73 % during a median time of 110s and 84 s, for in-house and ChEMBL, respectively.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300128"},"PeriodicalIF":3.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10553301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Molecular Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1