首页 > 最新文献

Molecular Informatics最新文献

英文 中文
Sulfotransferase-mediated phase II drug metabolism prediction of substrates and sites using accessibility and reactivity-based algorithms. 利用基于可及性和反应性的算法预测硫代转氨酶介导的 II 期药物代谢底物和位点。
IF 2.8 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-10-01 Epub Date: 2024-08-07 DOI: 10.1002/minf.202400008
Shivam Kumar Vyas, Avik Das, Upadhyayula Suryanarayana Murty, Vaibhav A Dixit

Sulphotransferases (SULTs) are a major phase II metabolic enzyme class contributing ~20 % to the Phase II metabolism of FDA-approved drugs. Ignoring the potential for SULT-mediated metabolism leaves a strong potential for drug-drug interactions, often causing late-stage drug discovery failures or black-boxed warnings on FDA labels. The existing models use only accessibility descriptors and machine learning (ML) methods for class and site of sulfonation (SOS) predictions for SULT. In this study, a variety of accessibility, reactivity, and hybrid models and algorithms have been developed to make accurate substrate and SOS predictions. Unlike the literature models, reactivity parameters for the aliphatic or aromatic hydroxyl groups (R/Ar-O-H), the Bond Dissociation Energy (BDE) gave accurate models with a True Positive Rate (TPR)=0.84 for SOS predictions. We offer mechanistic insights to explain these novel findings that are not recognized in the literature. The accessibility parameters like the ratio of Chemgauss4 Score (CGS) and Molecular Weight (MW) CGS/MW and distance from cofactor (Dis) were essential for class predictions and showed TPR=0.72. Substrates consistently had lower BDE, Dis, and CGS/MW than non-substrates. Hybrid models also performed acceptablely for SOS predictions. Using the best models, Algorithms gave an acceptable performance in class prediction: TPR=0.62, False Positive Rate (FPR)=0.24, Balanced accuracy (BA)=0.69, and SOS prediction: TPR=0.98, FPR=0.60, and BA=0.69. A rule-based method was added to improve the predictive performance, which improved the algorithm TPR, FPR, and BA. Validation using an external dataset of drug-like compounds gave class prediction: TPR=0.67, FPR=0.00, and SOS prediction: TPR=0.80 and FPR=0.44 for the best Algorithm. Comparisons with standard ML models also show that our algorithm shows higher predictive performance for classification on external datasets. Overall, these models and algorithms (SOS predictor) give accurate substrate class and site (SOS) predictions for SULT-mediated Phase II metabolism and will be valuable to the drug discovery community in academia and industry. The SOS predictor is freely available for academic/non-profit research via the GitHub link.

磺基转移酶(SULTs)是一种主要的 II 期代谢酶,在 FDA 批准药物的 II 期代谢中占约 20%。忽视 SULT 介导的潜在代谢可能会导致药物间的强烈相互作用,这往往会导致后期药物发现的失败或 FDA 标签上的黑框警告。现有模型仅使用可及性描述符和机器学习(ML)方法来预测 SULT 的类别和磺化位点(SOS)。本研究开发了多种可及性、反应性和混合模型及算法,以准确预测底物和 SOS。与文献模型不同的是,脂肪族或芳香族羟基(R/Ar-O-H)的反应性参数、键离解能(BDE)给出了准确的模型,SOS 预测的真阳性率(TPR)=0.84。我们从机理角度解释了这些在文献中未得到认可的新发现。可及性参数,如 Chemgauss4 Score(CGS)与分子量(MW)之比 CGS/MW 以及与辅助因子的距离(Dis),对于类别预测至关重要,其 TPR=0.72。底物的 BDE、Dis 和 CGS/MW 始终低于非底物。混合模型在 SOS 预测方面的表现也可以接受。使用最佳模型,算法在类别预测方面的表现可以接受:TPR=0.62,误报率(FPR)=0.24,平衡准确率(BA)=0.69,SOS 预测:SOS预测:TPR=0.98,FPR=0.60,BA=0.69。为提高预测性能,增加了基于规则的方法,从而提高了算法的 TPR、FPR 和 BA。使用外部类药物数据集进行验证后,得出了类预测结果:TPR=0.67, FPR=0.00, SOS 预测:最佳算法的 TPR=0.80 和 FPR=0.44。与标准 ML 模型的比较也表明,我们的算法对外部数据集的分类具有更高的预测性能。总之,这些模型和算法(SOS 预测器)能为 SULT 介导的第二阶段代谢提供准确的底物类别和位点(SOS)预测,对学术界和工业界的药物发现界很有价值。SOS 预测器可通过 GitHub 链接免费提供给学术/非营利研究使用。
{"title":"Sulfotransferase-mediated phase II drug metabolism prediction of substrates and sites using accessibility and reactivity-based algorithms.","authors":"Shivam Kumar Vyas, Avik Das, Upadhyayula Suryanarayana Murty, Vaibhav A Dixit","doi":"10.1002/minf.202400008","DOIUrl":"10.1002/minf.202400008","url":null,"abstract":"<p><p>Sulphotransferases (SULTs) are a major phase II metabolic enzyme class contributing ~20 % to the Phase II metabolism of FDA-approved drugs. Ignoring the potential for SULT-mediated metabolism leaves a strong potential for drug-drug interactions, often causing late-stage drug discovery failures or black-boxed warnings on FDA labels. The existing models use only accessibility descriptors and machine learning (ML) methods for class and site of sulfonation (SOS) predictions for SULT. In this study, a variety of accessibility, reactivity, and hybrid models and algorithms have been developed to make accurate substrate and SOS predictions. Unlike the literature models, reactivity parameters for the aliphatic or aromatic hydroxyl groups (R/Ar-O-H), the Bond Dissociation Energy (BDE) gave accurate models with a True Positive Rate (TPR)=0.84 for SOS predictions. We offer mechanistic insights to explain these novel findings that are not recognized in the literature. The accessibility parameters like the ratio of Chemgauss4 Score (CGS) and Molecular Weight (MW) CGS/MW and distance from cofactor (Dis) were essential for class predictions and showed TPR=0.72. Substrates consistently had lower BDE, Dis, and CGS/MW than non-substrates. Hybrid models also performed acceptablely for SOS predictions. Using the best models, Algorithms gave an acceptable performance in class prediction: TPR=0.62, False Positive Rate (FPR)=0.24, Balanced accuracy (BA)=0.69, and SOS prediction: TPR=0.98, FPR=0.60, and BA=0.69. A rule-based method was added to improve the predictive performance, which improved the algorithm TPR, FPR, and BA. Validation using an external dataset of drug-like compounds gave class prediction: TPR=0.67, FPR=0.00, and SOS prediction: TPR=0.80 and FPR=0.44 for the best Algorithm. Comparisons with standard ML models also show that our algorithm shows higher predictive performance for classification on external datasets. Overall, these models and algorithms (SOS predictor) give accurate substrate class and site (SOS) predictions for SULT-mediated Phase II metabolism and will be valuable to the drug discovery community in academia and industry. The SOS predictor is freely available for academic/non-profit research via the GitHub link.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202400008"},"PeriodicalIF":2.8,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141897838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cover Picture: (Mol. Inf. 9/2024) 封面图片:(Mol.Inf.9/2024)
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-09-13 DOI: 10.1002/minf.202480901
{"title":"Cover Picture: (Mol. Inf. 9/2024)","authors":"","doi":"10.1002/minf.202480901","DOIUrl":"https://doi.org/10.1002/minf.202480901","url":null,"abstract":"","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"18 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Virtual screening of natural products to enhance melanogenosis. 虚拟筛选提高黑色素生成的天然产品。
IF 2.8 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-09-01 Epub Date: 2024-06-12 DOI: 10.1002/minf.202300335
Colin Bournez, José-Manuel Gally, Samia Aci-Sèche, Philippe Bernard, Pascal Bonnet

Natural products have long been an important source of inspiration for medicinal chemistry and drug discovery. In the cosmetic field, they remain the major elements of the composition and serve as marketing asset. Recent research showed the implication of salt-inducible kinases on the melanin production in skin via MITF regulation. Finding new potent modulators on such target could open the way to several cosmetic applications to attenuate visible signs of photoaging and improve the tan without sun. Since virtual screening can be a powerful tool for detecting hit compounds in the early stages of a drug discovery process, we applied this method on salt-inducible kinase 2 to discover potential interesting compounds. Here, we present the different steps from the construction of a database of natural products, to the validation of a docking protocol and the results of the virtual screening. Hits from the screening were tested in vitro to confirm their efficiency and results are discussed.

长期以来,天然产品一直是药物化学和药物发现的重要灵感来源。在化妆品领域,天然产品仍然是化妆品的主要成分,也是市场营销的重要资产。最近的研究表明,盐诱导激酶通过 MITF 调控皮肤黑色素的生成。针对这种靶点寻找新的强效调节剂,可以为多种化妆品的应用开辟道路,以减轻明显的光老化迹象,并改善日晒后的肤色。由于虚拟筛选是药物发现过程早期阶段检测热门化合物的有力工具,我们将这种方法应用于盐诱导激酶 2,以发现潜在的有趣化合物。在此,我们介绍了从构建天然产物数据库到验证对接方案和虚拟筛选结果的不同步骤。我们对筛选出的新化合物进行了体外测试,以确认它们的有效性,并对结果进行了讨论。
{"title":"Virtual screening of natural products to enhance melanogenosis.","authors":"Colin Bournez, José-Manuel Gally, Samia Aci-Sèche, Philippe Bernard, Pascal Bonnet","doi":"10.1002/minf.202300335","DOIUrl":"10.1002/minf.202300335","url":null,"abstract":"<p><p>Natural products have long been an important source of inspiration for medicinal chemistry and drug discovery. In the cosmetic field, they remain the major elements of the composition and serve as marketing asset. Recent research showed the implication of salt-inducible kinases on the melanin production in skin via MITF regulation. Finding new potent modulators on such target could open the way to several cosmetic applications to attenuate visible signs of photoaging and improve the tan without sun. Since virtual screening can be a powerful tool for detecting hit compounds in the early stages of a drug discovery process, we applied this method on salt-inducible kinase 2 to discover potential interesting compounds. Here, we present the different steps from the construction of a database of natural products, to the validation of a docking protocol and the results of the virtual screening. Hits from the screening were tested in vitro to confirm their efficiency and results are discussed.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300335"},"PeriodicalIF":2.8,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141306333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cumulative phylogenetic, sequence and structural analysis of Insulin superfamily proteins provide unique structure-function insights. 对胰岛素超家族蛋白的系统发育、序列和结构的累积分析提供了独特的结构-功能见解。
IF 2.8 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-09-01 Epub Date: 2024-07-08 DOI: 10.1002/minf.202300160
Shrilakshmi Sheshagiri Rao, Shankar V Kundapura, Debayan Dey, Chandrasekaran Palaniappan, Kanagaraj Sekar, Ananda Kulal, Udupi A Ramagopal

The insulin superfamily proteins (ISPs), in particular, insulin, IGFs and relaxin proteins are key modulators of animal physiology. They are known to have evolved from the same ancestral gene and have diverged into proteins with varied sequences and distinct functions, but maintain a similar structural architecture stabilized by highly conserved disulphide bridges. The recent surge of sequence data and the structures of these proteins prompted a need for a comprehensive analysis, which connects the evolution of these sequences (427 sequences) in the light of available functional and structural information including representative complex structures of ISPs with their cognate receptors. This study reveals (a) unusually high sequence conservation of IGFs (>90 % conservation in 184 sequences) and provides a possible structure-based rationale for such high sequence conservation; (b) provides an updated definition of the receptor-binding signature motif of the functionally diverse relaxin family members (c) provides a probable non-canonical C-peptide cleavage site in a few insulin sequences. The high conservation of IGFs appears to represent a classic case of resistance to sequence diversity exerted by physiologically important interactions with multiple partners. We also propose a probable mechanism for C-peptide cleavage in a few distinct insulin sequences and redefine the receptor-binding signature motif of the relaxin family. Lastly, we provide a basis for minimally modified insulin mutants with potential therapeutic application, inspired by concomitant changes observed in other insulin superfamily protein members supported by molecular dynamics simulation.

胰岛素超家族蛋白(ISPs),特别是胰岛素、IGFs 和松弛素蛋白是动物生理的关键调节因子。众所周知,它们是由同一祖先基因进化而来,并分化成具有不同序列和不同功能的蛋白质,但通过高度保守的二硫键保持着相似的结构。最近,这些蛋白质的序列数据和结构激增,促使人们需要根据现有的功能和结构信息(包括 ISP 与其同源受体的代表性复合结构),对这些序列(427 个序列)的进化进行全面分析。这项研究揭示了:(a) IGFs 的序列保存率异常之高(184 个序列的保存率大于 90%),并为如此高的序列保存率提供了一个可能的基于结构的理由;(b) 为功能多样的弛缓素家族成员的受体结合标志图案提供了一个最新的定义;(c) 在一些胰岛素序列中提供了一个可能的非经典 C 肽裂解位点。IGFs 的高度保守性似乎代表了一种典型的情况,即通过与多个伙伴的重要生理相互作用来抵抗序列多样性。我们还提出了几个不同的胰岛素序列中 C 肽裂解的可能机制,并重新定义了松弛素家族的受体结合特征基团。最后,我们从分子动力学模拟支持下在其他胰岛素超家族蛋白成员中观察到的伴随变化中得到启发,为具有潜在治疗用途的最小修饰胰岛素突变体提供了基础。
{"title":"Cumulative phylogenetic, sequence and structural analysis of Insulin superfamily proteins provide unique structure-function insights.","authors":"Shrilakshmi Sheshagiri Rao, Shankar V Kundapura, Debayan Dey, Chandrasekaran Palaniappan, Kanagaraj Sekar, Ananda Kulal, Udupi A Ramagopal","doi":"10.1002/minf.202300160","DOIUrl":"10.1002/minf.202300160","url":null,"abstract":"<p><p>The insulin superfamily proteins (ISPs), in particular, insulin, IGFs and relaxin proteins are key modulators of animal physiology. They are known to have evolved from the same ancestral gene and have diverged into proteins with varied sequences and distinct functions, but maintain a similar structural architecture stabilized by highly conserved disulphide bridges. The recent surge of sequence data and the structures of these proteins prompted a need for a comprehensive analysis, which connects the evolution of these sequences (427 sequences) in the light of available functional and structural information including representative complex structures of ISPs with their cognate receptors. This study reveals (a) unusually high sequence conservation of IGFs (>90 % conservation in 184 sequences) and provides a possible structure-based rationale for such high sequence conservation; (b) provides an updated definition of the receptor-binding signature motif of the functionally diverse relaxin family members (c) provides a probable non-canonical C-peptide cleavage site in a few insulin sequences. The high conservation of IGFs appears to represent a classic case of resistance to sequence diversity exerted by physiologically important interactions with multiple partners. We also propose a probable mechanism for C-peptide cleavage in a few distinct insulin sequences and redefine the receptor-binding signature motif of the relaxin family. Lastly, we provide a basis for minimally modified insulin mutants with potential therapeutic application, inspired by concomitant changes observed in other insulin superfamily protein members supported by molecular dynamics simulation.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300160"},"PeriodicalIF":2.8,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141555195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of blood-brain barrier permeability using machine learning approaches based on various molecular representation. 利用基于各种分子表征的机器学习方法预测血脑屏障通透性。
IF 2.8 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-09-01 Epub Date: 2024-06-12 DOI: 10.1002/minf.202300327
Li Liang, Zhiwen Liu, Xinyi Yang, Yanmin Zhang, Haichun Liu, Yadong Chen

The assessment of compound blood-brain barrier (BBB) permeability poses a significant challenge in the discovery of drugs targeting the central nervous system. Conventional experimental approaches to measure BBB permeability are labor-intensive, cost-ineffective, and time-consuming. In this study, we constructed six machine learning classification models by combining various machine learning algorithms and molecular representations. The model based on ExtraTree algorithm and random partitioning strategy obtains the best prediction result, with AUC value of 0.932±0.004 and balanced accuracy (BA) of 0.837±0.010 for the test set. We employed the SHAP method to identify important features associated with BBB permeability. In addition, matched molecular pair (MMP) analysis and representative substructure derivation method were utilized to uncover the transformation rules and distinctive structural features of BBB permeable compounds. The machine learning models proposed in this work can serve as an effective tool for assessing BBB permeability in the drug discovery for central nervous system disease.

化合物血脑屏障(BBB)通透性评估是发现中枢神经系统靶向药物的一大挑战。测量血脑屏障通透性的传统实验方法耗费大量人力、成本低且费时。在本研究中,我们结合各种机器学习算法和分子表征,构建了六个机器学习分类模型。基于 ExtraTree 算法和随机分区策略的模型获得了最佳预测结果,其 AUC 值为 0.932±0.004,测试集的平衡准确度(BA)为 0.837±0.010。我们采用 SHAP 方法来识别与 BBB 渗透性相关的重要特征。此外,我们还利用匹配分子对(MMP)分析法和代表性子结构推导法来揭示BBB渗透性化合物的转化规则和独特的结构特征。本研究提出的机器学习模型可作为评估中枢神经系统疾病药物研发中BBB渗透性的有效工具。
{"title":"Prediction of blood-brain barrier permeability using machine learning approaches based on various molecular representation.","authors":"Li Liang, Zhiwen Liu, Xinyi Yang, Yanmin Zhang, Haichun Liu, Yadong Chen","doi":"10.1002/minf.202300327","DOIUrl":"10.1002/minf.202300327","url":null,"abstract":"<p><p>The assessment of compound blood-brain barrier (BBB) permeability poses a significant challenge in the discovery of drugs targeting the central nervous system. Conventional experimental approaches to measure BBB permeability are labor-intensive, cost-ineffective, and time-consuming. In this study, we constructed six machine learning classification models by combining various machine learning algorithms and molecular representations. The model based on ExtraTree algorithm and random partitioning strategy obtains the best prediction result, with AUC value of 0.932±0.004 and balanced accuracy (BA) of 0.837±0.010 for the test set. We employed the SHAP method to identify important features associated with BBB permeability. In addition, matched molecular pair (MMP) analysis and representative substructure derivation method were utilized to uncover the transformation rules and distinctive structural features of BBB permeable compounds. The machine learning models proposed in this work can serve as an effective tool for assessing BBB permeability in the drug discovery for central nervous system disease.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300327"},"PeriodicalIF":2.8,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141306332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cover Picture: (Mol. Inf. 8/2024) 封面图片:(Mol.Inf. 8/2024)
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-08-12 DOI: 10.1002/minf.202480801
{"title":"Cover Picture: (Mol. Inf. 8/2024)","authors":"","doi":"10.1002/minf.202480801","DOIUrl":"https://doi.org/10.1002/minf.202480801","url":null,"abstract":"","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"73 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142212171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble docking based virtual screening of SARS-CoV-2 main protease inhibitors. 基于组合对接的 SARS-CoV-2 主要蛋白酶抑制剂虚拟筛选。
IF 2.8 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-08-01 Epub Date: 2024-07-08 DOI: 10.1002/minf.202300279
Anastasia D Fomina, Victoria I Uvarova, Liubov I Kozlovskaya, Vladimir A Palyulin, Dmitry I Osolodkin, Aydar A Ishmukhametov

During the first years of COVID-19 pandemic, X-ray structures of the coronavirus drug targets were acquired at an unprecedented rate, giving hundreds of PDB depositions in less than a year. The main protease (Mpro) of severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) is the primary validated target of direct-acting antivirals. The selection of the optimal ensemble of structures of Mpro for the docking-driven virtual screening campaign was thus non-trivial and required a systematic and automated approach. Here we report a semi-automated active site RMSD based procedure of ensemble selection from the SARS-CoV-2 Mpro crystallographic data and virtual screening of its inhibitors. The procedure was compared with other approaches to ensemble selection and validated with the help of hand-picked and peer-reviewed activity-annotated libraries. Prospective virtual screening of non-covalent Mpro inhibitors resulted in a new chemotype of thienopyrimidinone derivatives with experimentally confirmed enzyme inhibition.

在 COVID-19 大流行的最初几年,冠状病毒药物靶点的 X 射线结构以前所未有的速度获得,在不到一年的时间里就有数百个 PDB 文件沉积。严重急性呼吸系统综合征相关冠状病毒 2(SARS-CoV-2)的主要蛋白酶(Mpro)是直接作用抗病毒药物的主要验证靶点。因此,为对接驱动的虚拟筛选活动选择最佳的 Mpro 结构组合并非易事,需要一种系统的自动化方法。在此,我们报告了一种基于活性位点 RMSD 的半自动程序,该程序从 SARS-CoV-2 Mpro 晶体数据中选择组合结构,并对其抑制剂进行虚拟筛选。我们将该程序与其他组合筛选方法进行了比较,并在人工挑选和同行评议的活性注释库的帮助下进行了验证。对非共价 Mpro 抑制剂的前瞻性虚拟筛选产生了一种新的噻吩嘧啶酮衍生物化学类型,其酶抑制作用已得到实验证实。
{"title":"Ensemble docking based virtual screening of SARS-CoV-2 main protease inhibitors.","authors":"Anastasia D Fomina, Victoria I Uvarova, Liubov I Kozlovskaya, Vladimir A Palyulin, Dmitry I Osolodkin, Aydar A Ishmukhametov","doi":"10.1002/minf.202300279","DOIUrl":"10.1002/minf.202300279","url":null,"abstract":"<p><p>During the first years of COVID-19 pandemic, X-ray structures of the coronavirus drug targets were acquired at an unprecedented rate, giving hundreds of PDB depositions in less than a year. The main protease (Mpro) of severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) is the primary validated target of direct-acting antivirals. The selection of the optimal ensemble of structures of Mpro for the docking-driven virtual screening campaign was thus non-trivial and required a systematic and automated approach. Here we report a semi-automated active site RMSD based procedure of ensemble selection from the SARS-CoV-2 Mpro crystallographic data and virtual screening of its inhibitors. The procedure was compared with other approaches to ensemble selection and validated with the help of hand-picked and peer-reviewed activity-annotated libraries. Prospective virtual screening of non-covalent Mpro inhibitors resulted in a new chemotype of thienopyrimidinone derivatives with experimentally confirmed enzyme inhibition.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300279"},"PeriodicalIF":2.8,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141555196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chemical space exploration with Molpher: Generating and assessing a glucocorticoid receptor ligand library. 利用 Molpher 探索化学空间:生成并评估糖皮质激素受体配体库。
IF 2.8 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-08-01 Epub Date: 2024-07-09 DOI: 10.1002/minf.202300316
M Isabel Agea, Ivan Čmelo, Wim Dehaen, Ya Chen, Johannes Kirchmair, David Sedlák, Petr Bartůněk, Martin Šícho, Daniel Svozil

Computational exploration of chemical space is crucial in modern cheminformatics research for accelerating the discovery of new biologically active compounds. In this study, we present a detailed analysis of the chemical library of potential glucocorticoid receptor (GR) ligands generated by the molecular generator, Molpher. To generate the targeted GR library and construct the classification models, structures from the ChEMBL database as well as from the internal IMG library, which was experimentally screened for biological activity in the primary luciferase reporter cell assay, were utilized. The composition of the targeted GR ligand library was compared with a reference library that randomly samples chemical space. A random forest model was used to determine the biological activity of ligands, incorporating its applicability domain using conformal prediction. It was demonstrated that the GR library is significantly enriched with GR ligands compared to the random library. Furthermore, a prospective analysis demonstrated that Molpher successfully designed compounds, which were subsequently experimentally confirmed to be active on the GR. A collection of 34 potential new GR ligands was also identified. Moreover, an important contribution of this study is the establishment of a comprehensive workflow for evaluating computationally generated ligands, particularly those with potential activity against targets that are challenging to dock.

在现代化学信息学研究中,化学空间的计算探索对于加速发现新的生物活性化合物至关重要。在本研究中,我们详细分析了分子生成器 Molpher 生成的潜在糖皮质激素受体(GR)配体化学库。为了生成靶向 GR 库并构建分类模型,我们利用了 ChEMBL 数据库以及内部 IMG 库中的结构。将目标 GR 配体库的组成与随机抽样化学空间的参考库进行了比较。采用随机森林模型确定配体的生物活性,并利用保形预测将其适用范围纳入其中。结果表明,与随机库相比,GR 库明显富含 GR 配体。此外,一项前瞻性分析表明,Molpher 成功地设计出了一些化合物,这些化合物随后被实验证实对 GR 具有活性。同时还发现了 34 种潜在的新 GR 配体。此外,这项研究的一个重要贡献是建立了一套全面的工作流程,用于评估通过计算生成的配体,特别是那些对具有潜在活性的靶标具有对接挑战性的配体。
{"title":"Chemical space exploration with Molpher: Generating and assessing a glucocorticoid receptor ligand library.","authors":"M Isabel Agea, Ivan Čmelo, Wim Dehaen, Ya Chen, Johannes Kirchmair, David Sedlák, Petr Bartůněk, Martin Šícho, Daniel Svozil","doi":"10.1002/minf.202300316","DOIUrl":"10.1002/minf.202300316","url":null,"abstract":"<p><p>Computational exploration of chemical space is crucial in modern cheminformatics research for accelerating the discovery of new biologically active compounds. In this study, we present a detailed analysis of the chemical library of potential glucocorticoid receptor (GR) ligands generated by the molecular generator, Molpher. To generate the targeted GR library and construct the classification models, structures from the ChEMBL database as well as from the internal IMG library, which was experimentally screened for biological activity in the primary luciferase reporter cell assay, were utilized. The composition of the targeted GR ligand library was compared with a reference library that randomly samples chemical space. A random forest model was used to determine the biological activity of ligands, incorporating its applicability domain using conformal prediction. It was demonstrated that the GR library is significantly enriched with GR ligands compared to the random library. Furthermore, a prospective analysis demonstrated that Molpher successfully designed compounds, which were subsequently experimentally confirmed to be active on the GR. A collection of 34 potential new GR ligands was also identified. Moreover, an important contribution of this study is the establishment of a comprehensive workflow for evaluating computationally generated ligands, particularly those with potential activity against targets that are challenging to dock.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202300316"},"PeriodicalIF":2.8,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141559260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigating pharmacophore space to identify activity discontinuities: A case study with BCR-ABL. 浏览药理空间以识别活性不连续性:BCR-ABL 案例研究。
IF 2.8 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-08-01 Epub Date: 2024-07-09 DOI: 10.1002/minf.202400050
Maroua Lejmi, Damien Geslin, Ronan Bureau, Bertrand Cuissart, Ilef Ben Slima, Nida Meddouri, Amel Borgi, Jean-Luc Lamotte, Alban Lepailleur

The exploration of chemical space is a fundamental aspect of chemoinformatics, particularly when one explores a large compound data set to relate chemical structures with molecular properties. In this study, we extend our previous work on chemical space visualization at the pharmacophoric level. Instead of using conventional binary classification of affinity (active vs inactive), we introduce a refined approach that categorizes compounds into four distinct classes based on their activity levels: super active, very active, active, and inactive. This classification enriches the color scheme applied to pharmacophore space, where the color representation of a pharmacophore hypothesis is driven by the associated compounds. Using the BCR-ABL tyrosine kinase as a case study, we identified intriguing regions corresponding to pharmacophore activity discontinuities, providing valuable insights for structure-activity relationships analysis.

化学空间的探索是化学信息学的一个基本方面,尤其是在探索大型化合物数据集以将化学结构与分子性质联系起来时。在本研究中,我们扩展了之前在药效水平上的化学空间可视化工作。我们不再使用传统的亲和性二元分类法(活性与非活性),而是引入了一种细化方法,根据化合物的活性水平将其分为四个不同的类别:超级活性、非常活性、活性和非活性。这种分类方法丰富了应用于药效空间的颜色方案,药效假设的颜色表示由相关化合物驱动。以 BCR-ABL 酪氨酸激酶为例,我们发现了与药理活性不连续性相对应的有趣区域,为结构-活性关系分析提供了宝贵的见解。
{"title":"Navigating pharmacophore space to identify activity discontinuities: A case study with BCR-ABL.","authors":"Maroua Lejmi, Damien Geslin, Ronan Bureau, Bertrand Cuissart, Ilef Ben Slima, Nida Meddouri, Amel Borgi, Jean-Luc Lamotte, Alban Lepailleur","doi":"10.1002/minf.202400050","DOIUrl":"10.1002/minf.202400050","url":null,"abstract":"<p><p>The exploration of chemical space is a fundamental aspect of chemoinformatics, particularly when one explores a large compound data set to relate chemical structures with molecular properties. In this study, we extend our previous work on chemical space visualization at the pharmacophoric level. Instead of using conventional binary classification of affinity (active vs inactive), we introduce a refined approach that categorizes compounds into four distinct classes based on their activity levels: super active, very active, active, and inactive. This classification enriches the color scheme applied to pharmacophore space, where the color representation of a pharmacophore hypothesis is driven by the associated compounds. Using the BCR-ABL tyrosine kinase as a case study, we identified intriguing regions corresponding to pharmacophore activity discontinuities, providing valuable insights for structure-activity relationships analysis.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202400050"},"PeriodicalIF":2.8,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141559262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cover Picture: (Mol. Inf. 7/2024) 封面图片:(Mol.Inf. 7/2024)
IF 3.6 4区 医学 Q3 CHEMISTRY, MEDICINAL Pub Date : 2024-07-12 DOI: 10.1002/minf.202480701
{"title":"Cover Picture: (Mol. Inf. 7/2024)","authors":"","doi":"10.1002/minf.202480701","DOIUrl":"https://doi.org/10.1002/minf.202480701","url":null,"abstract":"","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"243 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141609429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Molecular Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1