Molecular Informatics最新文献

英文中文

Prediction of adverse drug reactions due to genetic predisposition using deep neural networks. 利用深度神经网络预测遗传倾向导致的药物不良反应。

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-06-01 Epub Date: 2024-06-08 DOI: 10.1002/minf.202400021

Bryan Dafniet, Olivier Taboureau

Drug development is a long and costly process, often limited by the toxicity and adverse drug reactions (ADRs) caused by drug candidates. Even on the market, some drugs can cause strong ADRs that can vary depending on an individual polymorphism. The development of Genome-wide association studies (GWAS) allowed the discovery of genetic variants of interest that may cause these effects. In this study, the objective was to investigate a deep learning approach to predict genetic variations potentially related to ADRs. We used single nucleotide polymorphisms (SNPs) information from dbSNP to create a network based on ADR-drug-target-mutations and extracted matrixes of interaction to build deep Neural Networks (DNN) models. Considering only information about mutations known to impact drug efficacy and drug safety from PharmGKB and drug adverse reactions based on the MedDRA System Organ Classes (SOCs), these DNN models reached a balanced accuracy of 0.61 in average. Including molecular fingerprints representing structural features of the drugs did not improve the performance of the models. To our knowledge, this is the first model that exploits DNN to predict ADR-drug-target-mutations. Although some improvements are suggested, these models can be of interest to analyze multiple compounds over all of the genes and polymorphisms information accessible and thus pave the way in precision medicine.

药物开发是一个漫长而昂贵的过程，往往受到候选药物的毒性和药物不良反应（ADRs）的限制。即使在市场上，一些药物也会引起强烈的药物不良反应，这些不良反应会因个体多态性的不同而不同。随着全基因组关联研究（GWAS）的发展，人们发现了可能导致这些影响的相关基因变异。本研究的目的是研究一种深度学习方法，以预测可能与 ADRs 相关的遗传变异。我们利用来自 dbSNP 的单核苷酸多态性（SNPs）信息创建了一个基于 ADR-药物-目标-突变的网络，并提取了相互作用矩阵来构建深度神经网络（DNN）模型。仅考虑到 PharmGKB 中已知会影响药物疗效和药物安全性的突变信息，以及基于 MedDRA 系统器官分类（SOCs）的药物不良反应，这些 DNN 模型的平均平衡准确率达到了 0.61。加入代表药物结构特征的分子指纹并没有提高模型的性能。据我们所知，这是首个利用 DNN 预测 ADR-药物-靶点突变的模型。虽然我们提出了一些改进建议，但这些模型可以用于分析可获取的所有基因和多态性信息中的多种化合物，从而为精准医疗铺平道路。

{"title":"Prediction of adverse drug reactions due to genetic predisposition using deep neural networks.","authors":"Bryan Dafniet, Olivier Taboureau","doi":"10.1002/minf.202400021","DOIUrl":"10.1002/minf.202400021","url":null,"abstract":"Drug development is a long and costly process, often limited by the toxicity and adverse drug reactions (ADRs) caused by drug candidates. Even on the market, some drugs can cause strong ADRs that can vary depending on an individual polymorphism. The development of Genome-wide association studies (GWAS) allowed the discovery of genetic variants of interest that may cause these effects. In this study, the objective was to investigate a deep learning approach to predict genetic variations potentially related to ADRs. We used single nucleotide polymorphisms (SNPs) information from dbSNP to create a network based on ADR-drug-target-mutations and extracted matrixes of interaction to build deep Neural Networks (DNN) models. Considering only information about mutations known to impact drug efficacy and drug safety from PharmGKB and drug adverse reactions based on the MedDRA System Organ Classes (SOCs), these DNN models reached a balanced accuracy of 0.61 in average. Including molecular fingerprints representing structural features of the drugs did not improve the performance of the models. To our knowledge, this is the first model that exploits DNN to predict ADR-drug-target-mutations. Although some improvements are suggested, these models can be of interest to analyze multiple compounds over all of the genes and polymorphisms information accessible and thus pave the way in precision medicine.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202400021"},"PeriodicalIF":3.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141293574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Accelerating Molecular Docking using Machine Learning Methods. 利用机器学习方法加速分子对接。

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-06-01 Epub Date: 2024-06-08 DOI: 10.1002/minf.202300167

Abdulsalam Y Bande, Sefer Baday

Virtual screening (VS) is one of the well-established approaches in drug discovery which speeds up the search for a bioactive molecule and, reduces costs and efforts associated with experiments. VS helps to narrow down the search space of chemical space and allows selecting fewer and more probable candidate compounds for experimental testing. Docking calculations are one of the commonly used and highly appreciated structure-based drug discovery methods. Databases for chemical structures of small molecules have been growing rapidly. However, at the moment virtual screening of large libraries via docking is not very common. In this work, we aim to accelerate docking studies by predicting docking scores without explicitly performing docking calculations. We experimented with an attention based long short-term memory (LSTM) neural network for an efficient prediction of docking scores as well as other machine learning models such as XGBoost. By using docking scores of a small number of ligands we trained our models and predicted docking scores of a few million molecules. Specifically, we tested our approaches on 11 datasets that were produced from in-house drug discovery studies. On average, by training models using only 7000 molecules we predicted docking scores of approximately 3.8 million molecules with R² (coefficient of determination) of 0.77 and Spearman rank correlation coefficient of 0.85. We designed the system with ease of use in mind. All the user needs to provide is a csv file containing SMILES and their respective docking scores, the system then outputs a model that the user can use for the prediction of docking score for a new molecule.

虚拟筛选（VS）是药物发现中一种行之有效的方法，它能加快寻找生物活性分子的速度，降低实验成本和工作量。VS 有助于缩小化学空间的搜索范围，从而选择更少、更可能的候选化合物进行实验测试。Docking 计算是常用的、备受赞赏的基于结构的药物发现方法之一。小分子化学结构数据库一直在快速增长。然而，目前通过对接对大型文库进行虚拟筛选的方法并不常见。在这项工作中，我们旨在通过预测对接得分来加速对接研究，而无需明确执行对接计算。我们试验了一种基于注意力的长短期记忆（LSTM）神经网络，用于高效预测对接得分，以及其他机器学习模型，如 XGBoost。通过使用少量配体的对接得分，我们训练了模型，并预测了几百万个分子的对接得分。具体来说，我们在内部药物发现研究产生的 11 个数据集上测试了我们的方法。平均而言，通过仅使用 7000 个分子训练模型，我们预测了约 380 万个分子的对接得分，R2（决定系数）为 0.77，斯皮尔曼等级相关系数为 0.85。我们在设计该系统时考虑到了易用性。用户只需提供一个包含 SMILES 及其各自对接得分的 csv 文件，系统就会输出一个模型，用户可以用它来预测新分子的对接得分。

{"title":"Accelerating Molecular Docking using Machine Learning Methods.","authors":"Abdulsalam Y Bande, Sefer Baday","doi":"10.1002/minf.202300167","DOIUrl":"10.1002/minf.202300167","url":null,"abstract":"Virtual screening (VS) is one of the well-established approaches in drug discovery which speeds up the search for a bioactive molecule and, reduces costs and efforts associated with experiments. VS helps to narrow down the search space of chemical space and allows selecting fewer and more probable candidate compounds for experimental testing. Docking calculations are one of the commonly used and highly appreciated structure-based drug discovery methods. Databases for chemical structures of small molecules have been growing rapidly. However, at the moment virtual screening of large libraries via docking is not very common. In this work, we aim to accelerate docking studies by predicting docking scores without explicitly performing docking calculations. We experimented with an attention based long short-term memory (LSTM) neural network for an efficient prediction of docking scores as well as other machine learning models such as XGBoost. By using docking scores of a small number of ligands we trained our models and predicted docking scores of a few million molecules. Specifically, we tested our approaches on 11 datasets that were produced from in-house drug discovery studies. On average, by training models using only 7000 molecules we predicted docking scores of approximately 3.8 million molecules with R2 (coefficient of determination) of 0.77 and Spearman rank correlation coefficient of 0.85. We designed the system with ease of use in mind. All the user needs to provide is a csv file containing SMILES and their respective docking scores, the system then outputs a model that the user can use for the prediction of docking score for a new molecule.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300167"},"PeriodicalIF":3.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141293572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

FetoML: Interpretable predictions of the fetotoxicity of drugs based on machine learning approaches. FetoML：基于机器学习方法的药物胎儿毒性可解读预测。

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-06-01 Epub Date: 2024-06-08 DOI: 10.1002/minf.202300312

Myeonghyeon Jeong, Sunyong Yoo

Pregnant females may use medications to manage health problems that develop during pregnancy or that they had prior to pregnancy. However, using medications during pregnancy has a potential risk to the fetus. Assessing the fetotoxicity of drugs is essential to ensure safe treatments, but the current process is challenged by ethical issues, time, and cost. Therefore, the need for in silico models to efficiently assess the fetotoxicity of drugs has recently emerged. Previous studies have proposed successful machine learning models for fetotoxicity prediction and even suggest molecular substructures that are possibly associated with fetotoxicity risks or protective effects. However, the interpretation of the decisions of the models on fetotoxicity prediction for each drug is still insufficient. This study constructed machine learning-based models that can predict the fetotoxicity of drugs while providing explanations for the decisions. For this, permutation feature importance was used to identify the general features that the model made significant in predicting the fetotoxicity of drugs. In addition, features associated with fetotoxicity for each drug were analyzed using the attention mechanism. The predictive performance of all the constructed models was significantly high (AUROC: 0.854-0.974, AUPR: 0.890-0.975). Furthermore, we conducted literature reviews on the predicted important features and found that they were highly associated with fetotoxicity. We expect that our model will benefit fetotoxicity research by providing an evaluation of fetotoxicity risks for drugs or drug candidates, along with an interpretation of that prediction.

孕妇可能会使用药物来控制怀孕期间出现的健康问题或怀孕前的健康问题。然而，孕期用药对胎儿有潜在风险。评估药物的胎儿毒性对确保治疗安全至关重要，但目前的评估过程受到伦理问题、时间和成本的挑战。因此，最近出现了对高效评估药物胎儿毒性的硅学模型的需求。以往的研究已经提出了成功的机器学习胎儿毒性预测模型，甚至提出了可能与胎儿毒性风险或保护作用相关的分子亚结构。然而，对每种药物胎儿毒性预测模型决策的解释仍然不足。本研究构建了基于机器学习的模型，该模型可以预测药物的胎儿毒性，同时提供决策解释。为此，研究人员采用了置换特征重要性的方法来确定模型在预测药物胎毒性时具有重要意义的一般特征。此外，还利用注意力机制分析了与每种药物胎儿毒性相关的特征。所有构建模型的预测性能都非常高（AUROC：0.854-0.974，AUPR：0.890-0.975）。此外，我们还对预测的重要特征进行了文献综述，发现这些特征与胎儿毒性高度相关。我们希望我们的模型能对药物或候选药物的胎儿毒性风险进行评估，并对预测结果进行解释，从而有利于胎儿毒性研究。

{"title":"FetoML: Interpretable predictions of the fetotoxicity of drugs based on machine learning approaches.","authors":"Myeonghyeon Jeong, Sunyong Yoo","doi":"10.1002/minf.202300312","DOIUrl":"10.1002/minf.202300312","url":null,"abstract":"Pregnant females may use medications to manage health problems that develop during pregnancy or that they had prior to pregnancy. However, using medications during pregnancy has a potential risk to the fetus. Assessing the fetotoxicity of drugs is essential to ensure safe treatments, but the current process is challenged by ethical issues, time, and cost. Therefore, the need for in silico models to efficiently assess the fetotoxicity of drugs has recently emerged. Previous studies have proposed successful machine learning models for fetotoxicity prediction and even suggest molecular substructures that are possibly associated with fetotoxicity risks or protective effects. However, the interpretation of the decisions of the models on fetotoxicity prediction for each drug is still insufficient. This study constructed machine learning-based models that can predict the fetotoxicity of drugs while providing explanations for the decisions. For this, permutation feature importance was used to identify the general features that the model made significant in predicting the fetotoxicity of drugs. In addition, features associated with fetotoxicity for each drug were analyzed using the attention mechanism. The predictive performance of all the constructed models was significantly high (AUROC: 0.854-0.974, AUPR: 0.890-0.975). Furthermore, we conducted literature reviews on the predicted important features and found that they were highly associated with fetotoxicity. We expect that our model will benefit fetotoxicity research by providing an evaluation of fetotoxicity risks for drugs or drug candidates, along with an interpretation of that prediction.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300312"},"PeriodicalIF":3.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141293573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Kinematic analysis of kinases and their oncogenic mutations - Kinases and their mutation kinematic analysis. 激酶及其致癌突变的运动学分析 - 激酶及其突变的运动学分析。

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-06-01 Epub Date: 2024-06-08 DOI: 10.1002/minf.202300250

Xiyu Chen, Sigrid Leyendecker

Protein kinases are crucial cellular enzymes that facilitate the transfer of phosphates from adenosine triphosphate (ATP) to their substrates, thereby regulating numerous cellular activities. Dysfunctional kinase activity often leads to oncogenic conditions. Chosen by using structural similarity to 5UG9, we selected 79 crystal structures from the PDB and based on the position of the phenylalanine side chain in the DFG motif, we classified these 79 crystal structures into 5 group clusters. Our approach applies our kinematic flexibility analysis (KFA) to explore the flexibility of kinases in various activity states and examine the impact of the activation loop on kinase structure. KFA enables the rapid decomposition of macromolecules into different flexibility regions, allowing comprehensive analysis of conformational structures. The results reveal that the activation loop of kinases acts as a "lock" that stabilizes the active conformation of kinases by rigidifying the adjacent α-helices. Furthermore, we investigate specific kinase mutations, such as the L858R mutation commonly associated with non-small cell lung cancer, which induces increased flexibility in active-state kinases. In addition, through analyzing the hydrogen bond pattern, we examine the substructure of kinases in different states. Notably, active-state kinases exhibit a higher occurrence of α-helices compared to inactive-state kinases. This study contributes to the understanding of biomolecular conformation at a level relevant to drug development.

蛋白激酶是一种重要的细胞酶，能促进磷酸从三磷酸腺苷（ATP）转移到其底物上，从而调节多种细胞活动。激酶活性失调往往会导致致癌情况。通过与 5UG9 的结构相似性，我们从 PDB 中选择了 79 个晶体结构，并根据 DFG 主题中苯丙氨酸侧链的位置，将这 79 个晶体结构分为 5 个群组。我们的方法应用了运动灵活性分析（KFA）来探索激酶在不同活性状态下的灵活性，并研究激活环对激酶结构的影响。KFA 能够将大分子快速分解为不同的柔性区域，从而对构象结构进行全面分析。研究结果表明，激酶的激活环就像一把 "锁"，通过使相邻的α-螺旋僵化来稳定激酶的活性构象。此外，我们还研究了特定的激酶突变，如常见于非小细胞肺癌的 L858R 突变，这种突变会诱导活性状态激酶的灵活性增加。此外，通过分析氢键模式，我们研究了不同状态下激酶的亚结构。值得注意的是，与非活性状态激酶相比，活性状态激酶表现出更高的α-螺旋发生率。这项研究有助于在与药物开发相关的层面上理解生物分子构象。

{"title":"Kinematic analysis of kinases and their oncogenic mutations - Kinases and their mutation kinematic analysis.","authors":"Xiyu Chen, Sigrid Leyendecker","doi":"10.1002/minf.202300250","DOIUrl":"10.1002/minf.202300250","url":null,"abstract":"Protein kinases are crucial cellular enzymes that facilitate the transfer of phosphates from adenosine triphosphate (ATP) to their substrates, thereby regulating numerous cellular activities. Dysfunctional kinase activity often leads to oncogenic conditions. Chosen by using structural similarity to 5UG9, we selected 79 crystal structures from the PDB and based on the position of the phenylalanine side chain in the DFG motif, we classified these 79 crystal structures into 5 group clusters. Our approach applies our kinematic flexibility analysis (KFA) to explore the flexibility of kinases in various activity states and examine the impact of the activation loop on kinase structure. KFA enables the rapid decomposition of macromolecules into different flexibility regions, allowing comprehensive analysis of conformational structures. The results reveal that the activation loop of kinases acts as a \"lock\" that stabilizes the active conformation of kinases by rigidifying the adjacent α-helices. Furthermore, we investigate specific kinase mutations, such as the L858R mutation commonly associated with non-small cell lung cancer, which induces increased flexibility in active-state kinases. In addition, through analyzing the hydrogen bond pattern, we examine the substructure of kinases in different states. Notably, active-state kinases exhibit a higher occurrence of α-helices compared to inactive-state kinases. This study contributes to the understanding of biomolecular conformation at a level relevant to drug development.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300250"},"PeriodicalIF":3.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141288332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cover Picture: (Mol. Inf. 5/2024) 封面图片：（Mol.Inf. 5/2024）

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-05-23 DOI: 10.1002/minf.202480501

引用次数: 0

Comparison of deep learning models with simple method to assess the problem of antimicrobial peptides prediction. 深度学习模型与简单方法的比较，以评估抗菌肽预测问题。

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-05-01 Epub Date: 2023-04-07 DOI: 10.1002/minf.202200181

M Y Lobanov, M V Slizen, N V Dovidchenko, A V Panfilov, A A Surin, I V Likhachev, O V Galzitskaya

Antibiotic-resistant strains are an emerging threat to public health. The usage of antimicrobial peptides (AMPs) is one of the promising approaches to solve this problem. For the development of new AMPs, it is necessary to have reliable prediction methods. Recently, deep learning approaches have been used to predict AMP. In this paper, we want to compare simple and complex methods for these purposes. We used the BERT transformer to create sequence embeddings and the multilayer perceptron (MLP) and light attention (LA) approaches for classification. One of them reached about 80 % accuracy and specificity in benchmark testing, which is on par with the best available methods. For comparison, we proposed a simple method using only the amino acid composition of proteins or peptides. This method has shown good results, at the level of the best methods. We have prepared a special server for predicting the ability of AMPs by amino acid composition: http://bioproteom.protres.ru/antimicrob/.

抗生素耐药菌株是公共卫生面临的一个新威胁。使用抗菌肽（AMPs）是解决这一问题的可行方法之一。要开发新的抗菌肽，必须有可靠的预测方法。最近，深度学习方法被用于预测 AMP。在本文中，我们希望对简单方法和复杂方法进行比较。我们使用 BERT 变换器创建序列嵌入，并使用多层感知器 (MLP) 和轻注意力 (LA) 方法进行分类。其中一种方法在基准测试中达到了约 80% 的准确率和特异性，与现有的最佳方法相当。为了进行比较，我们提出了一种仅使用蛋白质或肽的氨基酸组成的简单方法。这种方法显示出良好的效果，达到了最佳方法的水平。我们准备了一个专门的服务器，用于通过氨基酸组成预测 AMP 的能力：http://bioproteom.protres.ru/antimicrob/。

引用次数: 0

The macrocycle inhibitor landscape of SLC-transporter. SLC-转运体的大环抑制剂格局。

IF 2.8 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-05-01 Epub Date: 2024-03-05 DOI: 10.1002/minf.202300287

Nejra Granulo, Sergey Sosnin, Daniela Digles, Gerhard F Ecker

In the past years the interest in Solute Carrier Transporters (SLC) has increased due to their potential as drug targets. At the same time, macrocycles demonstrated promising activities as therapeutic agents. However, the overall macrocycle/SLC-transporter interaction landscape has not been fully revealed yet. In this study, we present a statistical analysis of macrocycles with measured activity against SLC-transporter. Using a data mining pipeline based on KNIME retrieved in total 825 bioactivity data points of macrocycles interacting with SLC-transporter. For further analysis of the SLC inhibitor profiles we developed an interactive KNIME workflow as well as an interactive map of the chemical space coverage utilizing parametric t-SNE models. The parametric t-SNE models provide a good discrimination ability among several corresponding SLC subfamilies' targets. The KNIME workflow, the dataset, and the visualization tool are freely available to the community.

在过去几年里，由于溶质载体转运体（SLC）具有作为药物靶点的潜力，人们对它们的兴趣与日俱增。与此同时，大环作为治疗剂也表现出了良好的活性。然而，大环化合物/溶质载体转运体之间的整体相互作用尚未完全揭示。在本研究中，我们对具有针对 SLC 转运体活性的大环化合物进行了统计分析。利用基于 KNIME 的数据挖掘管道，共检索到 825 个与 SLC 转运体相互作用的大环化合物的生物活性数据点。为了进一步分析 SLC 抑制剂概况，我们开发了一个交互式 KNIME 工作流程，并利用参数 t-SNE 模型绘制了化学空间覆盖交互式地图。参数 t-SNE 模型在几个相应的 SLC 亚家族靶标之间提供了良好的区分能力。KNIME 工作流、数据集和可视化工具可免费向社区提供。

引用次数: 0

Predicting S. aureus antimicrobial resistance with interpretable genomic space maps. 利用可解释的基因组空间图预测金黄色葡萄球菌的抗菌药耐药性。

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-05-01 Epub Date: 2024-02-22 DOI: 10.1002/minf.202300263

Karina Pikalyova, Alexey Orlov, Dragos Horvath, Gilles Marcou, Alexandre Varnek

Increasing antimicrobial resistance (AMR) represents a global healthcare threat. To decrease the spread of AMR and associated mortality, methods for rapid selection of optimal antibiotic treatment are urgently needed. Machine learning (ML) models based on genomic data to predict resistant phenotypes can serve as a fast screening tool prior to phenotypic testing. Nonetheless, many existing ML methods lack interpretability. Therefore, we present a methodology for visualization of sequence space and AMR prediction based on the non-linear dimensionality reduction method - generative topographic mapping (GTM). This approach, applied to AMR data of >5000 S. aureus isolates retrieved from the PATRIC database, yielded GTM models with reasonable accuracy for all drugs (balanced accuracy values ≥0.75). The Generative Topographic Maps (GTMs) represent data in the form of illustrative maps of the genomic space and allow for antibiotic-wise comparison of resistant phenotypes. The maps were also found to be useful for the analysis of genetic determinants responsible for drug resistance. Overall, the GTM-based methodology is a useful tool for both the illustrative exploration of the genomic sequence space and AMR prediction.

抗菌素耐药性（AMR）的不断增加对全球医疗保健构成了威胁。为了减少 AMR 的传播和相关死亡率，迫切需要快速选择最佳抗生素治疗方法。基于基因组数据预测耐药性表型的机器学习（ML）模型可作为表型测试前的快速筛选工具。然而，许多现有的 ML 方法缺乏可解释性。因此，我们提出了一种基于非线性降维方法--生成地形图（GTM）的序列空间可视化和 AMR 预测方法。这种方法适用于从 PATRIC 数据库中检索到的超过 5000 个金黄色葡萄球菌分离物的 AMR 数据，对所有药物都产生了具有合理准确度的 GTM 模型（平衡准确度值≥0.75）。生成地形图（GTM）以基因组空间示意图的形式表示数据，可对抗生素耐药表型进行比较。研究还发现，生成地形图有助于分析导致耐药性的基因决定因素。总之，基于 GTM 的方法对于基因组序列空间的说明性探索和 AMR 预测都是一种有用的工具。

{"title":"Predicting S. aureus antimicrobial resistance with interpretable genomic space maps.","authors":"Karina Pikalyova, Alexey Orlov, Dragos Horvath, Gilles Marcou, Alexandre Varnek","doi":"10.1002/minf.202300263","DOIUrl":"10.1002/minf.202300263","url":null,"abstract":"Increasing antimicrobial resistance (AMR) represents a global healthcare threat. To decrease the spread of AMR and associated mortality, methods for rapid selection of optimal antibiotic treatment are urgently needed. Machine learning (ML) models based on genomic data to predict resistant phenotypes can serve as a fast screening tool prior to phenotypic testing. Nonetheless, many existing ML methods lack interpretability. Therefore, we present a methodology for visualization of sequence space and AMR prediction based on the non-linear dimensionality reduction method - generative topographic mapping (GTM). This approach, applied to AMR data of >5000 S. aureus isolates retrieved from the PATRIC database, yielded GTM models with reasonable accuracy for all drugs (balanced accuracy values ≥0.75). The Generative Topographic Maps (GTMs) represent data in the form of illustrative maps of the genomic space and allow for antibiotic-wise comparison of resistant phenotypes. The maps were also found to be useful for the analysis of genetic determinants responsible for drug resistance. Overall, the GTM-based methodology is a useful tool for both the illustrative exploration of the genomic sequence space and AMR prediction.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300263"},"PeriodicalIF":3.6,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139932061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cover Picture: (Mol. Inf. 4/2024) 封面图片：（Mol.Inf.4/2024）

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-04-23 DOI: 10.1002/minf.202480401

引用次数: 0

Discovery of a pocket network on the domain 5 of the TrkB receptor – A potential new target in the quest for the new ligands 发现 TrkB 受体结构域 5 的口袋网络--寻找新配体的潜在新目标

IF 3.6 4区医学 Q3 CHEMISTRY, MEDICINAL

Molecular Informatics

Pub Date : 2024-04-15 DOI: 10.1002/minf.202400043

Mirjana Antonijevic, Jana Sopkova‐de Oliveira Santos, Patrick Dallemagne, Christophe Rochais

The important role that the neurotrophin tyrosine kinase receptor ‐ TrkB has in the pathogenesis of several neurodegenerative conditions such are Alzheimer's disease, Parkinson's disease, Huntington's disease, has been well described. This shouldn't be a surprise, since in the physiological conditions, once activated by brain‐derived neurotrophic factor (BDNF) and neurotrophin‐4/5 (NT‐4/5), the TrkB receptor promotes neuronal survival, differentiation and synaptic function. Considering that the natural ligands for TrkB receptor are large proteins, it is a challenge to discover small molecule capable to mimic their effects.Even though, the surface of receptor that is interacting with BDNF or NT‐4/5 is known, there was always a question which pocket and interaction is responsible for activation of it. In order to answer this challenging question, we have used molecular dynamic (MD) simulations and Pocketron algorithm which enabled us to detect, for the first time, a pocket network existing in the interacting domain (d5) of the receptor; to describe them and to see how they are communicating with each other. This new discovery gave us potential new areas on receptor that can be targeted and used for structure‐based drug design approach in the development of the new ligands.

神经营养素酪氨酸激酶受体（TrkB）在阿尔茨海默病、帕金森病、亨廷顿病等多种神经退行性疾病的发病机制中发挥着重要作用，这一点已得到充分描述。这并不奇怪，因为在生理条件下，一旦被脑源性神经营养因子（BDNF）和神经营养素-4/5（NT-4/5）激活，TrkB 受体就会促进神经元的存活、分化和突触功能。考虑到 TrkB 受体的天然配体是大型蛋白质，发现能够模拟其效应的小分子是一项挑战。尽管与 BDNF 或 NT-4/5 相互作用的受体表面已被知晓，但一直存在的问题是，哪个口袋和相互作用负责激活它。为了回答这个具有挑战性的问题，我们利用分子动力学（MD）模拟和 Pocketron 算法，首次发现了存在于受体相互作用结构域（d5）中的口袋网络，并对其进行了描述，了解了它们是如何相互沟通的。这一新发现为我们提供了受体上潜在的新区域，我们可以将其作为目标，并在开发新配体时采用基于结构的药物设计方法。

{"title":"Discovery of a pocket network on the domain 5 of the TrkB receptor – A potential new target in the quest for the new ligands","authors":"Mirjana Antonijevic, Jana Sopkova‐de Oliveira Santos, Patrick Dallemagne, Christophe Rochais","doi":"10.1002/minf.202400043","DOIUrl":"https://doi.org/10.1002/minf.202400043","url":null,"abstract":"The important role that the neurotrophin tyrosine kinase receptor ‐ TrkB has in the pathogenesis of several neurodegenerative conditions such are Alzheimer's disease, Parkinson's disease, Huntington's disease, has been well described. This shouldn't be a surprise, since in the physiological conditions, once activated by brain‐derived neurotrophic factor (BDNF) and neurotrophin‐4/5 (NT‐4/5), the TrkB receptor promotes neuronal survival, differentiation and synaptic function. Considering that the natural ligands for TrkB receptor are large proteins, it is a challenge to discover small molecule capable to mimic their effects.Even though, the surface of receptor that is interacting with BDNF or NT‐4/5 is known, there was always a question which pocket and interaction is responsible for activation of it. In order to answer this challenging question, we have used molecular dynamic (MD) simulations and Pocketron algorithm which enabled us to detect, for the first time, a pocket network existing in the interacting domain (d5) of the receptor; to describe them and to see how they are communicating with each other. This new discovery gave us potential new areas on receptor that can be targeted and used for structure‐based drug design approach in the development of the new ligands.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"17 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140575770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Molecular Informatics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀