首页 > 最新文献

Journal of Computer-Aided Molecular Design最新文献

英文 中文
Development of human lactate dehydrogenase a inhibitors: high-throughput screening, molecular dynamics simulation and enzyme activity assay 开发人乳酸脱氢酶 a 抑制剂:高通量筛选、分子动力学模拟和酶活性测定。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-10 DOI: 10.1007/s10822-024-00568-y
Yuanyuan Shu, Jianda Yue, Yaqi Li, Yekui Yin, Jiaxu Wang, Tingting Li, Xiao He, Songping Liang, Gaihua Zhang, Zhonghua Liu, Ying Wang

Lactate dehydrogenase A (LDHA) is highly expressed in many tumor cells and promotes the conversion of pyruvate to lactic acid in the glucose pathway, providing energy and synthetic precursors for rapid proliferation of tumor cells. Therefore, inhibition of LDHA has become a widely concerned tumor treatment strategy. However, the research and development of highly efficient and low toxic LDHA small molecule inhibitors still faces challenges. To discover potential inhibitors against LDHA, virtual screening based on molecular docking techniques was performed from Specs database of more than 260,000 compounds and Chemdiv-smart database of more than 1,000 compounds. Through molecular dynamics (MD) simulation studies, we identified 12 potential LDHA inhibitors, all of which can stably bind to human LDHA protein and form multiple interactions with its active central residues. In order to verify the inhibitory activities of these compounds, we established an enzyme activity assay system and measured their inhibitory effects on recombinant human LDHA. The results showed that Compound 6 could inhibit the catalytic effect of LDHA on pyruvate in a dose-dependent manner with an EC50 value of 14.54 ± 0.83 µM. Further in vitro experiments showed that Compound 6 could significantly inhibit the proliferation of various tumor cell lines such as pancreatic cancer cells and lung cancer cells, reduce intracellular lactic acid content and increase intracellular reactive oxygen species (ROS) level. In summary, through virtual screening and in vitro validation, we found that Compound 6 is a small molecule inhibitor for LDHA, providing a good lead compound for the research and development of LDHA related targeted anti-tumor drugs.

乳酸脱氢酶A(LDHA)在许多肿瘤细胞中高度表达,它能促进葡萄糖途径中丙酮酸向乳酸的转化,为肿瘤细胞的快速增殖提供能量和合成前体。因此,抑制 LDHA 已成为一种广受关注的肿瘤治疗策略。然而,高效低毒的 LDHA 小分子抑制剂的研发仍面临挑战。为了发现潜在的LDHA抑制剂,研究人员基于分子对接技术,从Specs数据库的26万多个化合物和Chemdiv-smart数据库的1000多个化合物中进行了虚拟筛选。通过分子动力学(MD)模拟研究,我们发现了12种潜在的LDHA抑制剂,它们都能与人LDHA蛋白稳定结合,并与其活性中心残基形成多重相互作用。为了验证这些化合物的抑制活性,我们建立了酶活性测定系统,并测定了它们对重组人 LDHA 的抑制作用。结果表明,化合物 6 能以剂量依赖的方式抑制 LDHA 对丙酮酸的催化作用,EC50 值为 14.54 ± 0.83 µM。进一步的体外实验表明,化合物 6 能显著抑制胰腺癌细胞和肺癌细胞等多种肿瘤细胞株的增殖,降低细胞内乳酸含量,提高细胞内活性氧(ROS)水平。综上所述,通过虚拟筛选和体外验证,我们发现化合物 6 是一种小分子 LDHA 抑制剂,为研究和开发与 LDHA 相关的抗肿瘤靶向药物提供了一个很好的先导化合物。
{"title":"Development of human lactate dehydrogenase a inhibitors: high-throughput screening, molecular dynamics simulation and enzyme activity assay","authors":"Yuanyuan Shu,&nbsp;Jianda Yue,&nbsp;Yaqi Li,&nbsp;Yekui Yin,&nbsp;Jiaxu Wang,&nbsp;Tingting Li,&nbsp;Xiao He,&nbsp;Songping Liang,&nbsp;Gaihua Zhang,&nbsp;Zhonghua Liu,&nbsp;Ying Wang","doi":"10.1007/s10822-024-00568-y","DOIUrl":"10.1007/s10822-024-00568-y","url":null,"abstract":"<div><p>Lactate dehydrogenase A (LDHA) is highly expressed in many tumor cells and promotes the conversion of pyruvate to lactic acid in the glucose pathway, providing energy and synthetic precursors for rapid proliferation of tumor cells. Therefore, inhibition of LDHA has become a widely concerned tumor treatment strategy. However, the research and development of highly efficient and low toxic LDHA small molecule inhibitors still faces challenges. To discover potential inhibitors against LDHA, virtual screening based on molecular docking techniques was performed from Specs database of more than 260,000 compounds and Chemdiv-smart database of more than 1,000 compounds. Through molecular dynamics (MD) simulation studies, we identified 12 potential LDHA inhibitors, all of which can stably bind to human LDHA protein and form multiple interactions with its active central residues. In order to verify the inhibitory activities of these compounds, we established an enzyme activity assay system and measured their inhibitory effects on recombinant human LDHA. The results showed that Compound 6 could inhibit the catalytic effect of LDHA on pyruvate in a dose-dependent manner with an EC<sub>50</sub> value of 14.54 ± 0.83 µM. Further in vitro experiments showed that Compound 6 could significantly inhibit the proliferation of various tumor cell lines such as pancreatic cancer cells and lung cancer cells, reduce intracellular lactic acid content and increase intracellular reactive oxygen species (ROS) level. In summary, through virtual screening and in vitro validation, we found that Compound 6 is a small molecule inhibitor for LDHA, providing a good lead compound for the research and development of LDHA related targeted anti-tumor drugs.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141911277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of QSARs for cysteine-containing di- and tripeptides with antioxidant activity:influence of the cysteine position 开发具有抗氧化活性的含半胱氨酸二肽和三肽的 QSARs:半胱氨酸位置的影响。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-02 DOI: 10.1007/s10822-024-00567-z
Lucas A. Garro, Matias F. Andrada, Esteban G. Vega-Hissi, Sonia Barberis, Juan C. Garro Martinez

Antioxidants agents play an essential role in the food industry for improving the oxidative stability of food products. In the last years, the search for new natural antioxidants has increased due to the potential high toxicity of chemical additives. Therefore, the synthesis and evaluation of the antioxidant activity in peptides is a field of current research. In this study, we performed a Quantitative Structure Activity Relationship analysis (QSAR) of cysteine-containing 19 dipeptides and 19 tripeptides. The main objective is to bring information on the relationship between the structure of peptides and their antioxidant activity. For this purpose, 1D and 2D molecular descriptors were calculated using the PaDEL software, which provides information about the structure, shape, size, charge, polarity, solubility and other aspects of the compounds. Different QSAR model for di- and tripeptides were developed. The statistic parameters for di-peptides model (R2train = 0.947 and R2test = 0.804) and for tripeptide models (R2train = 0.923 and R2test = 0.847) indicate that the generated models have high predictive capacity. Then, the influence of the cysteine position was analyzed predicting the antioxidant activity for new di- and tripeptides, and comparing them with glutathione. In dipeptides, excepting SC, TC and VC, the activity increases when cysteine is at the N-terminal position. For tripeptides, we observed a notable increase in activity when cysteine is placed in the N-terminal position.

在食品工业中,抗氧化剂对提高食品的氧化稳定性起着至关重要的作用。近年来,由于化学添加剂潜在的高毒性,人们越来越多地寻找新的天然抗氧化剂。因此,合成和评估肽的抗氧化活性是当前的一个研究领域。在本研究中,我们对含半胱氨酸的 19 种二肽和 19 种三肽进行了定量结构活性关系分析(QSAR)。研究的主要目的是了解肽的结构与其抗氧化活性之间的关系。为此,使用 PaDEL 软件计算了一维和二维分子描述符,该软件提供了化合物的结构、形状、大小、电荷、极性、溶解度和其他方面的信息。为二肽和三肽建立了不同的 QSAR 模型。二肽模型的统计参数(R2train = 0.947 和 R2test = 0.804)和三肽模型的统计参数(R2train = 0.923 和 R2test = 0.847)表明所生成的模型具有较高的预测能力。然后,分析了半胱氨酸位置对预测新的二肽和三肽抗氧化活性的影响,并将它们与谷胱甘肽进行了比较。除 SC、TC 和 VC 外,当半胱氨酸位于 N 端位置时,二肽的活性会增加。在三肽中,我们观察到当半胱氨酸位于 N 端位置时,其活性显著增加。
{"title":"Development of QSARs for cysteine-containing di- and tripeptides with antioxidant activity:influence of the cysteine position","authors":"Lucas A. Garro,&nbsp;Matias F. Andrada,&nbsp;Esteban G. Vega-Hissi,&nbsp;Sonia Barberis,&nbsp;Juan C. Garro Martinez","doi":"10.1007/s10822-024-00567-z","DOIUrl":"10.1007/s10822-024-00567-z","url":null,"abstract":"<div><p>Antioxidants agents play an essential role in the food industry for improving the oxidative stability of food products. In the last years, the search for new natural antioxidants has increased due to the potential high toxicity of chemical additives. Therefore, the synthesis and evaluation of the antioxidant activity in peptides is a field of current research. In this study, we performed a Quantitative Structure Activity Relationship analysis (QSAR) of cysteine-containing 19 dipeptides and 19 tripeptides. The main objective is to bring information on the relationship between the structure of peptides and their antioxidant activity. For this purpose, 1D and 2D molecular descriptors were calculated using the PaDEL software, which provides information about the structure, shape, size, charge, polarity, solubility and other aspects of the compounds. Different QSAR model for di- and tripeptides were developed. The statistic parameters for di-peptides model (R<sup>2</sup><sub>train</sub> = 0.947 and R<sup>2</sup><sub>test</sub> = 0.804) and for tripeptide models (R<sup>2</sup><sub>train</sub> = 0.923 and R<sup>2</sup><sub>test</sub> = 0.847) indicate that the generated models have high predictive capacity. Then, the influence of the cysteine position was analyzed predicting the antioxidant activity for new di- and tripeptides, and comparing them with glutathione. In dipeptides, excepting SC, TC and VC, the activity increases when cysteine is at the N-terminal position. For tripeptides, we observed a notable increase in activity when cysteine is placed in the N-terminal position.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141873885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From mundane to surprising nonadditivity: drivers and impact on ML models 从平凡到令人惊讶的非加性:驱动因素和对 ML 模型的影响。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-25 DOI: 10.1007/s10822-024-00566-0
Laura Guasch, Niels Maeder, John G. Cumming, Christian Kramer

Nonadditivity (NA) in Structure-Activity and Structure-Property Relationship (SAR) data is a rare but very information rich phenomenon. It can indicate conformational flexibility, structural rearrangements, and errors in assay results and structural assignment. While purely ligand-based conformational causes of NA are rather well understood and mundane, other factors are less so and cause surprising NA that has a huge influence on SAR analysis and ML model performance. We here report a systematic analysis across a wide range of properties (20 on-target biological activities and 4 physicochemical ADME-related properties) to understand the frequency of various different phenomena that may lead to NA. A set of novel descriptors were developed to characterize double transformation cycles and identify trends in NA. Double transformation cycles were classified into “surprising” and “mundane” categories, with the majority being classed as mundane. We also examined commonalities among surprising cycles, finding LogP differences to have the most significant impact on NA. A distinct behavior of NA for on-target sets compared to ADME sets was observed. Finally, we show that machine learning models struggle with highly nonadditive data, indicating that a better understanding of NA is an important future research direction.

结构-活性和结构-性质关系(SAR)数据中的非相加性(NA)是一种罕见但信息丰富的现象。它可以表明构象的灵活性、结构的重排以及检测结果和结构分配的错误。虽然纯粹基于配体的构象原因导致的 NA 比较容易理解,也很普通,但其他因素就不那么容易理解了,它们会导致令人惊讶的 NA,对 SAR 分析和 ML 模型性能产生巨大影响。我们在此报告了对各种性质(20 种靶上生物活性和 4 种物理化学 ADME 相关性质)的系统分析,以了解可能导致 NA 的各种不同现象的发生频率。我们开发了一套新的描述指标来描述双重转化周期并确定 NA 的趋势。双重转化周期被分为 "惊人 "和 "平凡 "两类,其中大多数被归为平凡类。我们还研究了令人惊讶的周期之间的共性,发现 LogP 差异对 NA 的影响最大。我们还观察到,与 ADME 集相比,目标集的 NA 具有独特的行为。最后,我们发现机器学习模型在处理高度非加性数据时非常吃力,这表明更好地理解NA是未来的一个重要研究方向。
{"title":"From mundane to surprising nonadditivity: drivers and impact on ML models","authors":"Laura Guasch,&nbsp;Niels Maeder,&nbsp;John G. Cumming,&nbsp;Christian Kramer","doi":"10.1007/s10822-024-00566-0","DOIUrl":"10.1007/s10822-024-00566-0","url":null,"abstract":"<div><p>Nonadditivity (NA) in Structure-Activity and Structure-Property Relationship (SAR) data is a rare but very information rich phenomenon. It can indicate conformational flexibility, structural rearrangements, and errors in assay results and structural assignment. While purely ligand-based conformational causes of NA are rather well understood and mundane, other factors are less so and cause surprising NA that has a huge influence on SAR analysis and ML model performance. We here report a systematic analysis across a wide range of properties (20 on-target biological activities and 4 physicochemical ADME-related properties) to understand the frequency of various different phenomena that may lead to NA. A set of novel descriptors were developed to characterize double transformation cycles and identify trends in NA. Double transformation cycles were classified into “surprising” and “mundane” categories, with the majority being classed as mundane. We also examined commonalities among surprising cycles, finding LogP differences to have the most significant impact on NA. A distinct behavior of NA for on-target sets compared to ADME sets was observed. Finally, we show that machine learning models struggle with highly nonadditive data, indicating that a better understanding of NA is an important future research direction.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141756486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MDFit: automated molecular simulations workflow enables high throughput assessment of ligands-protein dynamics MDFit:自动分子模拟工作流程,可对配体-蛋白质动力学进行高通量评估。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-17 DOI: 10.1007/s10822-024-00564-2
Alexander C. Brueckner, Benjamin Shields, Palani Kirubakaran, Alexander Suponya, Manoranjan Panda, Shana L. Posy, Stephen Johnson, Sirish Kaushik Lakkaraju

Molecular dynamics (MD) simulation is a powerful tool for characterizing ligand–protein conformational dynamics and offers significant advantages over docking and other rigid structure-based computational methods. However, setting up, running, and analyzing MD simulations continues to be a multi-step process making it cumbersome to assess a library of ligands in a protein binding pocket using MD. We present an automated workflow that streamlines setting up, running, and analyzing Desmond MD simulations for protein–ligand complexes using machine learning (ML) models. The workflow takes a library of pre-docked ligands and a prepared protein structure as input, sets up and runs MD with each protein–ligand complex, and generates simulation fingerprints for each ligand. Simulation fingerprints (SimFP) capture protein–ligand compatibility, including stability of different ligand-pocket interactions and other useful metrics that enable easy rank-ordering of the ligand library for pocket optimization. SimFPs from a ligand library are used to build & deploy ML models that predict binding assay outcomes and automatically infer important interactions. Unlike relative free-energy methods that are constrained to assess ligands with high chemical similarity, ML models based on SimFPs can accommodate diverse ligand sets. We present two case studies on how SimFP helps delineate structure–activity relationship (SAR) trends and explain potency differences across matched-molecular pairs of (1) cyclic peptides targeting PD-L1 and (2) small molecule inhibitors targeting CDK9.

分子动力学(MD)模拟是表征配体-蛋白质构象动力学的强大工具,与对接和其他基于刚性结构的计算方法相比具有显著优势。然而,MD 模拟的设置、运行和分析仍然是一个多步骤的过程,因此使用 MD 评估蛋白质结合口袋中的配体库非常麻烦。我们介绍了一种自动化工作流程,它能利用机器学习(ML)模型简化蛋白质配体复合物的德斯蒙德 MD 模拟的设置、运行和分析。该工作流程以预对接配体库和准备好的蛋白质结构为输入,设置并运行每个蛋白质配体复合物的 MD,并生成每个配体的模拟指纹。模拟指纹(SimFP)可以捕捉蛋白质-配体的兼容性,包括不同配体-口袋相互作用的稳定性和其他有用的指标,便于对配体库进行排序,以优化口袋。配体库中的 SimFPs 可用于构建和部署 ML 模型,以预测结合试验结果并自动推断重要的相互作用。与受限于评估化学相似性高的配体的相对自由能方法不同,基于 SimFPs 的 ML 模型可以适应多种配体集。我们介绍了两个案例研究,说明 SimFP 如何帮助划定结构-活性关系(SAR)趋势,并解释(1)靶向 PD-L1 的环肽和(2)靶向 CDK9 的小分子抑制剂的匹配分子对之间的效力差异。
{"title":"MDFit: automated molecular simulations workflow enables high throughput assessment of ligands-protein dynamics","authors":"Alexander C. Brueckner,&nbsp;Benjamin Shields,&nbsp;Palani Kirubakaran,&nbsp;Alexander Suponya,&nbsp;Manoranjan Panda,&nbsp;Shana L. Posy,&nbsp;Stephen Johnson,&nbsp;Sirish Kaushik Lakkaraju","doi":"10.1007/s10822-024-00564-2","DOIUrl":"10.1007/s10822-024-00564-2","url":null,"abstract":"<div><p>Molecular dynamics (MD) simulation is a powerful tool for characterizing ligand–protein conformational dynamics and offers significant advantages over docking and other rigid structure-based computational methods. However, setting up, running, and analyzing MD simulations continues to be a multi-step process making it cumbersome to assess a library of ligands in a protein binding pocket using MD. We present an automated workflow that streamlines setting up, running, and analyzing Desmond MD simulations for protein–ligand complexes using machine learning (ML) models. The workflow takes a library of pre-docked ligands and a prepared protein structure as input, sets up and runs MD with each protein–ligand complex, and generates simulation fingerprints for each ligand. Simulation fingerprints (SimFP) capture protein–ligand compatibility, including stability of different ligand-pocket interactions and other useful metrics that enable easy rank-ordering of the ligand library for pocket optimization. SimFPs from a ligand library are used to build &amp; deploy ML models that predict binding assay outcomes and automatically infer important interactions. Unlike relative free-energy methods that are constrained to assess ligands with high chemical similarity, ML models based on SimFPs can accommodate diverse ligand sets. We present two case studies on how SimFP helps delineate structure–activity relationship (SAR) trends and explain potency differences across matched-molecular pairs of (1) cyclic peptides targeting PD-L1 and (2) small molecule inhibitors targeting CDK9.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-024-00564-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141625626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structural impacts of two disease-linked ADAR1 mutants: a molecular dynamics study 两种与疾病相关的 ADAR1 突变体的结构影响:分子动力学研究。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-17 DOI: 10.1007/s10822-024-00565-1
Wen-Chieh Huang, Chia-Hung Hsu, Titus V. Albu, Chia-Ning Yang

Adenosine deaminases acting on RNA (ADARs) are pivotal RNA-editing enzymes responsible for converting adenosine to inosine within double-stranded RNA (dsRNA). Dysregulation of ADAR1 editing activity, often arising from genetic mutations, has been linked to elevated interferon levels and the onset of autoinflammatory diseases. However, understanding the molecular underpinnings of this dysregulation is impeded by the lack of an experimentally determined structure for the ADAR1 deaminase domain. In this computational study, we utilized homology modeling and the AlphaFold2 to construct structural models of the ADAR1 deaminase domain in wild-type and two pathogenic variants, R892H and Y1112F, to decipher the structural impact on the reduced deaminase activity. Our findings illuminate the critical role of structural complementarity between the ADAR1 deaminase domain and dsRNA in enzyme-substrate recognition. That is, the relative position of E1008 and K1120 must be maintained so that they can insert into the minor and major grooves of the substrate dsRNA, respectively, facilitating the flipping-out of adenosine to be accommodated within a cavity surrounding E912. Both amino acid replacements studied, R892H at the orthosteric site and Y1112F at the allosteric site, alter K1120 position and ultimately hinder substrate RNA binding.

作用于 RNA 的腺苷脱氨酶(ADARs)是一种关键的 RNA 编辑酶,负责将双链 RNA(dsRNA)中的腺苷转化为肌苷。ADAR1 编辑活性失调通常是由基因突变引起的,与干扰素水平升高和自身炎症性疾病的发病有关。然而,由于缺乏通过实验确定的 ADAR1 脱氨酶结构域结构,人们无法了解这种失调的分子基础。在这项计算研究中,我们利用同源建模和 AlphaFold2 构建了野生型和两种致病变体(R892H 和 Y1112F)中 ADAR1 脱氨酶结构域的结构模型,以破译结构对脱氨酶活性降低的影响。我们的发现阐明了 ADAR1 脱氨酶结构域与 dsRNA 之间的结构互补性在酶底物识别中的关键作用。也就是说,必须保持 E1008 和 K1120 的相对位置,这样它们才能分别插入底物 dsRNA 的小凹槽和大凹槽,促进腺苷的翻转,使其容纳在 E912 周围的空腔中。所研究的这两种氨基酸置换(正表位点上的 R892H 和异表位点上的 Y1112F)都改变了 K1120 的位置,最终阻碍了底物 RNA 的结合。
{"title":"Structural impacts of two disease-linked ADAR1 mutants: a molecular dynamics study","authors":"Wen-Chieh Huang,&nbsp;Chia-Hung Hsu,&nbsp;Titus V. Albu,&nbsp;Chia-Ning Yang","doi":"10.1007/s10822-024-00565-1","DOIUrl":"10.1007/s10822-024-00565-1","url":null,"abstract":"<div><p>Adenosine deaminases acting on RNA (ADARs) are pivotal RNA-editing enzymes responsible for converting adenosine to inosine within double-stranded RNA (dsRNA). Dysregulation of ADAR1 editing activity, often arising from genetic mutations, has been linked to elevated interferon levels and the onset of autoinflammatory diseases. However, understanding the molecular underpinnings of this dysregulation is impeded by the lack of an experimentally determined structure for the ADAR1 deaminase domain. In this computational study, we utilized homology modeling and the AlphaFold2 to construct structural models of the ADAR1 deaminase domain in wild-type and two pathogenic variants, R892H and Y1112F, to decipher the structural impact on the reduced deaminase activity. Our findings illuminate the critical role of structural complementarity between the ADAR1 deaminase domain and dsRNA in enzyme-substrate recognition. That is, the relative position of E1008 and K1120 must be maintained so that they can insert into the minor and major grooves of the substrate dsRNA, respectively, facilitating the flipping-out of adenosine to be accommodated within a cavity surrounding E912. Both amino acid replacements studied, R892H at the orthosteric site and Y1112F at the allosteric site, alter K1120 position and ultimately hinder substrate RNA binding.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141625551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User-centric design of a 3D search interface for protein-ligand complexes 以用户为中心设计蛋白质配体三维搜索界面。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-05-30 DOI: 10.1007/s10822-024-00563-3
Konrad Diedrich, Christiane Ehrt, Joel Graef, Martin Poppinga, Norbert Ritter, Matthias Rarey

In this work, we present the frontend of GeoMine and showcase its application, focusing on the new features of its latest version. GeoMine is a search engine for ligand-bound and predicted empty binding sites in the Protein Data Bank. In addition to its basic text-based search functionalities, GeoMine offers a geometric query type for searching binding sites with a specific relative spatial arrangement of chemical features such as heavy atoms and intermolecular interactions. In contrast to a text search that requires simple and easy-to-formulate user input, a 3D input is more complex, and its specification can be challenging for users. GeoMine’s new version aims to address this issue from the graphical user interface perspective by introducing an additional visualization concept and a new query template type. In its latest version, GeoMine extends its query-building capabilities primarily through input formulation in 2D. The 2D editor is fully synchronized with GeoMine’s 3D editor and provides the same functionality. It enables template-free query generation and template-based query selection directly in 2D pose diagrams. In addition, the query generation with the 3D editor now supports predicted empty binding sites for AlphaFold structures as query templates. GeoMine is freely accessible on the ProteinsPlus web server (https://proteins.plus).

在这项工作中,我们将介绍 GeoMine 的前端并展示其应用,重点介绍其最新版本的新功能。GeoMine 是蛋白质数据库中配体结合位点和预测空结合位点的搜索引擎。除了基本的文本搜索功能外,GeoMine 还提供了一种几何查询类型,用于搜索重原子和分子间相互作用等化学特征具有特定相对空间排列的结合位点。文本搜索要求用户输入的信息简单易懂,与之相比,三维输入则更为复杂,对用户而言,其具体说明可能具有挑战性。GeoMine 的新版本旨在通过引入额外的可视化概念和新的查询模板类型,从图形用户界面的角度解决这一问题。在最新版本中,GeoMine 主要通过二维输入表述来扩展其查询创建功能。2D 编辑器与 GeoMine 的 3D 编辑器完全同步,并提供相同的功能。它可以直接在二维姿态图中实现无模板查询生成和基于模板的查询选择。此外,三维编辑器的查询生成功能现在还支持将 AlphaFold 结构的预测空结合位点作为查询模板。GeoMine 可在 ProteinsPlus 网络服务器(https://proteins.plus )上免费访问。
{"title":"User-centric design of a 3D search interface for protein-ligand complexes","authors":"Konrad Diedrich,&nbsp;Christiane Ehrt,&nbsp;Joel Graef,&nbsp;Martin Poppinga,&nbsp;Norbert Ritter,&nbsp;Matthias Rarey","doi":"10.1007/s10822-024-00563-3","DOIUrl":"10.1007/s10822-024-00563-3","url":null,"abstract":"<div><p>In this work, we present the frontend of GeoMine and showcase its application, focusing on the new features of its latest version. GeoMine is a search engine for ligand-bound and predicted empty binding sites in the Protein Data Bank. In addition to its basic text-based search functionalities, GeoMine offers a geometric query type for searching binding sites with a specific relative spatial arrangement of chemical features such as heavy atoms and intermolecular interactions. In contrast to a text search that requires simple and easy-to-formulate user input, a 3D input is more complex, and its specification can be challenging for users. GeoMine’s new version aims to address this issue from the graphical user interface perspective by introducing an additional visualization concept and a new query template type. In its latest version, GeoMine extends its query-building capabilities primarily through input formulation in 2D. The 2D editor is fully synchronized with GeoMine’s 3D editor and provides the same functionality. It enables template-free query generation and template-based query selection directly in 2D pose diagrams. In addition, the query generation with the 3D editor now supports predicted empty binding sites for AlphaFold structures as query templates. GeoMine is freely accessible on the Proteins<i>Plus</i> web server (https://proteins.plus).</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11139749/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141173926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correlation of protein binding pocket properties with hits’ chemistries used in generation of ultra-large virtual libraries 用于生成超大型虚拟库的蛋白质结合袋特性与命中化学成分的相关性。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-05-16 DOI: 10.1007/s10822-024-00562-4
Robert X. Song, Marc C. Nicklaus, Nadya I. Tarasova

Although the size of virtual libraries of synthesizable compounds is growing rapidly, we are still enumerating only tiny fractions of the drug-like chemical universe. Our capability to mine these newly generated libraries also lags their growth. That is why fragment-based approaches that utilize on-demand virtual combinatorial libraries are gaining popularity in drug discovery. These à la carte libraries utilize synthetic blocks found to be effective binders in parts of target protein pockets and a variety of reliable chemistries to connect them. There is, however, no data on the potential impact of the chemistries used for making on-demand libraries on the hit rates during virtual screening. There are also no rules to guide in the selection of these synthetic methods for production of custom libraries. We have used the SAVI (Synthetically Accessible Virtual Inventory) library, constructed using 53 reliable reaction types (transforms), to evaluate the impact of these chemistries on docking hit rates for 40 well-characterized protein pockets. The data shows that the virtual hit rates differ significantly for different chemistries with cross coupling reactions such as Sonogashira, Suzuki–Miyaura, Hiyama and Liebeskind–Srogl coupling producing the highest hit rates. Virtual hit rates appear to depend not only on the property of the formed chemical bond but also on the diversity of available building blocks and the scope of the reaction. The data identifies reactions that deserve wider use through increasing the number of corresponding building blocks and suggests the reactions that are more effective for pockets with certain physical and hydrogen bond-forming properties.

尽管可合成化合物虚拟库的规模正在迅速增长,但我们仍然只列举了类药物化学宇宙中的极小部分。我们挖掘这些新生成化合物库的能力也落后于它们的增长。这就是为什么利用按需虚拟组合库的基于片段的方法在药物发现领域越来越受欢迎的原因。这些 "点菜式 "文库利用在目标蛋白质口袋部分有效结合的合成块,并利用各种可靠的化学方法将它们连接起来。然而,目前还没有数据表明,用于制作按需文库的化学物质对虚拟筛选过程中的命中率有潜在影响。在选择这些合成方法来生产定制文库时,也没有任何指导规则。我们使用 53 种可靠的反应类型(转换)构建的 SAVI(可合成虚拟库存)库,评估了这些化学方法对 40 个特征明确的蛋白质口袋的对接命中率的影响。数据显示,不同化学反应的虚拟命中率差别很大,交叉偶联反应(如 Sonogashira、Suzuki-Miyaura、Hiyama 和 Liebeskind-Srogl 偶联)的命中率最高。虚拟命中率似乎不仅取决于所形成化学键的性质,还取决于可用构件的多样性和反应的范围。这些数据确定了值得通过增加相应构筑模块的数量来更广泛使用的反应,并提出了对具有某些物理和氢键形成特性的口袋更有效的反应。
{"title":"Correlation of protein binding pocket properties with hits’ chemistries used in generation of ultra-large virtual libraries","authors":"Robert X. Song,&nbsp;Marc C. Nicklaus,&nbsp;Nadya I. Tarasova","doi":"10.1007/s10822-024-00562-4","DOIUrl":"10.1007/s10822-024-00562-4","url":null,"abstract":"<div><p>Although the size of virtual libraries of synthesizable compounds is growing rapidly, we are still enumerating only tiny fractions of the drug-like chemical universe. Our capability to mine these newly generated libraries also lags their growth. That is why fragment-based approaches that utilize on-demand virtual combinatorial libraries are gaining popularity in drug discovery. These <i>à la carte</i> libraries utilize synthetic blocks found to be effective binders in parts of target protein pockets and a variety of reliable chemistries to connect them. There is, however, no data on the potential impact of the chemistries used for making on-demand libraries on the hit rates during virtual screening. There are also no rules to guide in the selection of these synthetic methods for production of custom libraries. We have used the SAVI (Synthetically Accessible Virtual Inventory) library, constructed using 53 reliable reaction types (transforms), to evaluate the impact of these chemistries on docking hit rates for 40 well-characterized protein pockets. The data shows that the virtual hit rates differ significantly for different chemistries with cross coupling reactions such as Sonogashira, Suzuki–Miyaura, Hiyama and Liebeskind–Srogl coupling producing the highest hit rates. Virtual hit rates appear to depend not only on the property of the formed chemical bond but also on the diversity of available building blocks and the scope of the reaction. The data identifies reactions that deserve wider use through increasing the number of corresponding building blocks and suggests the reactions that are more effective for pockets with certain physical and hydrogen bond-forming properties.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11098933/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140943273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reactivities of acrylamide warheads toward cysteine targets: a QM/ML approach to covalent inhibitor design 丙烯酰胺弹头对半胱氨酸目标的反应活性:共价抑制剂设计的 QM/ML 方法
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-05-01 DOI: 10.1007/s10822-024-00560-6
Aaron D. Danilack, Callum J. Dickson, Cihan Soylu, Mike Fortunato, Stephane Rodde, Hagen Munkler, Viktor Hornak, Jose S. Duca

Covalent inhibition offers many advantages over non-covalent inhibition, but covalent warhead reactivity must be carefully balanced to maintain potency while avoiding unwanted side effects. While warhead reactivities are commonly measured with assays, a computational model to predict warhead reactivities could be useful for several aspects of the covalent inhibitor design process. Studies have shown correlations between covalent warhead reactivities and quantum mechanic (QM) properties that describe important aspects of the covalent reaction mechanism. However, the models from these studies are often linear regression equations and can have limitations associated with their usage. Applications of machine learning (ML) models to predict covalent warhead reactivities with QM descriptors are not extensively seen in the literature. This study uses QM descriptors, calculated at different levels of theory, to train ML models to predict reactivities of covalent acrylamide warheads. The QM/ML models are compared with linear regression models built upon the same QM descriptors and with ML models trained on structure-based features like Morgan fingerprints and RDKit descriptors. Experiments show that the QM/ML models outperform the linear regression models and the structure-based ML models, and literature test sets demonstrate the power of the QM/ML models to predict reactivities of unseen acrylamide warhead scaffolds. Ultimately, these QM/ML models are effective, computationally feasible tools that can expedite the design of new covalent inhibitors.

与非共价抑制相比,共价抑制具有许多优势,但必须仔细平衡共价弹头反应性,以保持效力,同时避免不必要的副作用。虽然弹头反应性通常通过化验来测量,但预测弹头反应性的计算模型对共价抑制剂设计过程的多个方面都很有用。研究表明,共价弹头反应活性与描述共价反应机理重要方面的量子力学(QM)特性之间存在相关性。然而,这些研究中的模型通常是线性回归方程,在使用时可能会受到限制。使用 QM 描述子预测共价弹头反应性的机器学习(ML)模型的应用在文献中并不多见。本研究使用按不同理论水平计算的 QM 描述符来训练 ML 模型,以预测共价丙烯酰胺弹头的反应性。QM/ML模型与基于相同QM描述符建立的线性回归模型以及基于摩根指纹和RDKit描述符等基于结构特征训练的ML模型进行了比较。实验表明,QM/ML 模型优于线性回归模型和基于结构的 ML 模型,文献测试集证明了 QM/ML 模型预测未见丙烯酰胺弹头支架反应性的能力。最终,这些 QM/ML 模型是有效的、计算上可行的工具,可以加快新共价抑制剂的设计。
{"title":"Reactivities of acrylamide warheads toward cysteine targets: a QM/ML approach to covalent inhibitor design","authors":"Aaron D. Danilack,&nbsp;Callum J. Dickson,&nbsp;Cihan Soylu,&nbsp;Mike Fortunato,&nbsp;Stephane Rodde,&nbsp;Hagen Munkler,&nbsp;Viktor Hornak,&nbsp;Jose S. Duca","doi":"10.1007/s10822-024-00560-6","DOIUrl":"10.1007/s10822-024-00560-6","url":null,"abstract":"<div><p>Covalent inhibition offers many advantages over non-covalent inhibition, but covalent warhead reactivity must be carefully balanced to maintain potency while avoiding unwanted side effects. While warhead reactivities are commonly measured with assays, a computational model to predict warhead reactivities could be useful for several aspects of the covalent inhibitor design process. Studies have shown correlations between covalent warhead reactivities and quantum mechanic (QM) properties that describe important aspects of the covalent reaction mechanism. However, the models from these studies are often linear regression equations and can have limitations associated with their usage. Applications of machine learning (ML) models to predict covalent warhead reactivities with QM descriptors are not extensively seen in the literature. This study uses QM descriptors, calculated at different levels of theory, to train ML models to predict reactivities of covalent acrylamide warheads. The QM/ML models are compared with linear regression models built upon the same QM descriptors and with ML models trained on structure-based features like Morgan fingerprints and RDKit descriptors. Experiments show that the QM/ML models outperform the linear regression models and the structure-based ML models, and literature test sets demonstrate the power of the QM/ML models to predict reactivities of unseen acrylamide warhead scaffolds. Ultimately, these QM/ML models are effective, computationally feasible tools that can expedite the design of new covalent inhibitors.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140836933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
De novo drug design as GPT language modeling: large chemistry models with supervised and reinforcement learning 作为 GPT 语言建模的新药设计:采用监督和强化学习的大型化学模型
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-04-22 DOI: 10.1007/s10822-024-00559-z
Gavin Ye

In recent years, generative machine learning algorithms have been successful in designing innovative drug-like molecules. SMILES is a sequence-like language used in most effective drug design models. Due to data’s sequential structure, models such as recurrent neural networks and transformers can design pharmacological compounds with optimized efficacy. Large language models have advanced recently, but their implications on drug design have not yet been explored. Although one study successfully pre-trained a large chemistry model (LCM), its application to specific tasks in drug discovery is unknown. In this study, the drug design task is modeled as a causal language modeling problem. Thus, the procedure of reward modeling, supervised fine-tuning, and proximal policy optimization was used to transfer the LCM to drug design, similar to Open AI’s ChatGPT and InstructGPT procedures. By combining the SMILES sequence with chemical descriptors, the novel efficacy evaluation model exceeded its performance compared to previous studies. After proximal policy optimization, the drug design model generated molecules with 99.2% having efficacy pIC50 > 7 towards the amyloid precursor protein, with 100% of the generated molecules being valid and novel. This demonstrated the applicability of LCMs in drug discovery, with benefits including less data consumption while fine-tuning. The applicability of LCMs to drug discovery opens the door for larger studies involving reinforcement-learning with human feedback, where chemists provide feedback to LCMs and generate higher-quality molecules. LCMs’ ability to design similar molecules from datasets paves the way for more accessible, non-patented alternatives to drug molecules.

近年来,生成式机器学习算法已成功设计出创新的类药物分子。SMILES 是一种序列类语言,用于大多数有效的药物设计模型。由于数据的序列结构,递归神经网络和变换器等模型可以设计出药效最优的药物化合物。大型语言模型近来取得了进展,但它们对药物设计的影响尚未得到探讨。虽然有一项研究成功预训练了大型化学模型(LCM),但其在药物发现特定任务中的应用还不得而知。在本研究中,药物设计任务被建模为因果语言建模问题。因此,与 Open AI 的 ChatGPT 和 InstructGPT 程序类似,我们采用了奖励建模、监督微调和近似策略优化的程序,将 LCM 移植到药物设计中。通过将 SMILES 序列与化学描述符相结合,新型药效评估模型的性能超过了以往的研究。经过近端策略优化后,药物设计模型生成的分子对淀粉样前体蛋白的药效 pIC50 > 7 为 99.2%,生成的分子 100%有效且新颖。这证明了 LCM 在药物发现中的适用性,其优势包括在微调时消耗的数据更少。LCMs 在药物发现方面的适用性为更大规模的研究打开了大门,这些研究涉及带有人类反馈的强化学习,即化学家向 LCMs 提供反馈并生成更高质量的分子。LCMs 能够根据数据集设计类似的分子,这为开发更容易获得的非专利药物分子替代品铺平了道路。
{"title":"De novo drug design as GPT language modeling: large chemistry models with supervised and reinforcement learning","authors":"Gavin Ye","doi":"10.1007/s10822-024-00559-z","DOIUrl":"10.1007/s10822-024-00559-z","url":null,"abstract":"<div><p>In recent years, generative machine learning algorithms have been successful in designing innovative drug-like molecules. SMILES is a sequence-like language used in most effective drug design models. Due to data’s sequential structure, models such as recurrent neural networks and transformers can design pharmacological compounds with optimized efficacy. Large language models have advanced recently, but their implications on drug design have not yet been explored. Although one study successfully pre-trained a <i>large chemistry model</i> (LCM), its application to specific tasks in drug discovery is unknown. In this study, the drug design task is modeled as a causal language modeling problem. Thus, the procedure of reward modeling, supervised fine-tuning, and proximal policy optimization was used to transfer the LCM to drug design, similar to Open AI’s ChatGPT and InstructGPT procedures. By combining the SMILES sequence with chemical descriptors, the novel efficacy evaluation model exceeded its performance compared to previous studies. After proximal policy optimization, the drug design model generated molecules with 99.2% having efficacy pIC<sub>50</sub> &gt; 7 towards the amyloid precursor protein, with 100% of the generated molecules being valid and novel. This demonstrated the applicability of LCMs in drug discovery, with benefits including less data consumption while fine-tuning. The applicability of LCMs to drug discovery opens the door for larger studies involving reinforcement-learning with human feedback, where chemists provide feedback to LCMs and generate higher-quality molecules. LCMs’ ability to design similar molecules from datasets paves the way for more accessible, non-patented alternatives to drug molecules.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-024-00559-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140673020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From UK-2A to florylpicoxamid: Active learning to identify a mimic of a macrocyclic natural product 从UK-2A到氟啶虫酰胺:主动学习识别大环天然产物的模拟物
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-04-17 DOI: 10.1007/s10822-024-00555-3
Ann E. Cleves, Ajay N. Jain, David A. Demeter, Zachary A. Buchan, Jeremy Wilmot, Erin N. Hancock

Scaffold replacement as part of an optimization process that requires maintenance of potency, desirable biodistribution, metabolic stability, and considerations of synthesis at very large scale is a complex challenge. Here, we consider a set of over 1000 time-stamped compounds, beginning with a macrocyclic natural-product lead and ending with a broad-spectrum crop anti-fungal. We demonstrate the application of the QuanSA 3D-QSAR method employing an active learning procedure that combines two types of molecular selection. The first identifies compounds predicted to be most active of those most likely to be well-covered by the model. The second identifies compounds predicted to be most informative based on exhibiting low predicted activity but showing high 3D similarity to a highly active nearest-neighbor training molecule. Beginning with just 100 compounds, using a deterministic and automatic procedure, five rounds of 20-compound selection and model refinement identifies the binding metabolic form of florylpicoxamid. We show how iterative refinement broadens the domain of applicability of the successive models while also enhancing predictive accuracy. We also demonstrate how a simple method requiring very sparse data can be used to generate relevant ideas for synthetic candidates.

作为优化过程的一部分,支架置换要求维持药效、理想的生物分布、代谢稳定性,并考虑大规模合成,这是一项复杂的挑战。在这里,我们考虑了一组超过 1000 个有时间戳的化合物,从一个大环天然产物先导化合物开始,到一个广谱作物抗真菌药物。我们展示了 QuanSA 3D-QSAR 方法的应用,该方法采用了一种结合两种分子选择类型的主动学习程序。第一种是在最有可能被模型很好覆盖的化合物中识别出最有活性的化合物。第二种方法是根据预测活性较低,但与高活性近邻训练分子的三维相似性较高的情况,确定预测信息量最大的化合物。从仅有的 100 个化合物开始,使用确定性的自动程序,经过五轮 20 个化合物的筛选和模型完善,确定了氟啶虫酰胺的结合代谢形式。我们展示了迭代改进如何拓宽连续模型的适用范围,同时提高预测准确性。我们还展示了如何利用一种需要非常稀少数据的简单方法来产生合成候选化合物的相关想法。
{"title":"From UK-2A to florylpicoxamid: Active learning to identify a mimic of a macrocyclic natural product","authors":"Ann E. Cleves,&nbsp;Ajay N. Jain,&nbsp;David A. Demeter,&nbsp;Zachary A. Buchan,&nbsp;Jeremy Wilmot,&nbsp;Erin N. Hancock","doi":"10.1007/s10822-024-00555-3","DOIUrl":"10.1007/s10822-024-00555-3","url":null,"abstract":"<div><p>Scaffold replacement as part of an optimization process that requires maintenance of potency, desirable biodistribution, metabolic stability, and considerations of synthesis at very large scale is a complex challenge. Here, we consider a set of over 1000 time-stamped compounds, beginning with a macrocyclic natural-product lead and ending with a broad-spectrum crop anti-fungal. We demonstrate the application of the QuanSA 3D-QSAR method employing an active learning procedure that combines two types of molecular selection. The first identifies compounds predicted to be most active of those most likely to be well-covered by the model. The second identifies compounds predicted to be most <i>informative</i> based on exhibiting low predicted activity but showing high 3D similarity to a highly active nearest-neighbor training molecule. Beginning with just 100 compounds, using a deterministic and automatic procedure, five rounds of 20-compound selection and model refinement identifies the binding metabolic form of florylpicoxamid. We show how iterative refinement broadens the domain of applicability of the successive models while also enhancing predictive accuracy. We also demonstrate how a simple method requiring very sparse data can be used to generate relevant ideas for synthetic candidates.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-024-00555-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140614475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Computer-Aided Molecular Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1