首页 > 最新文献

Journal of Computer-Aided Molecular Design最新文献

英文 中文
In silico exploration of natural xanthone derivatives as potential inhibitors of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) replication and cellular entry
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-02-17 DOI: 10.1007/s10822-025-00585-5
Vincent A. Obakachi, Vaderament-A. Nchiozem-Ngnitedem, Krishna K. Govender, Penny P. Govender

The COVID-19 pandemic, caused by SARS-CoV-2, has underscored the urgent need for effective antiviral therapies, particularly against vaccine-resistant variants. This study investigates natural xanthone derivatives as potential inhibitors of the ACE2 receptor, a critical entry point for the virus. We computationally evaluated 91 xanthone compounds derived from Swertia chirayita, identifying two promising candidates: 8-O-[β-D-Xylopyranosyl-(1→6)-β-D-glucopyranosyl]-1,7-dihydroxy-3-methoxy xanthone (XAN71) and 8-O-[β-D-Xylopyranosyl-(1→6)-β-D-glucopyranosyl]-1-hydroxy-3,7-dimethoxy-xanthone (XAN72). Molecular docking and dynamics simulations (MDDS) were performed to assess their binding energy and stability within the ACE2 active site, comparing them to the reference inhibitor MLN-4067. The top six compounds were selected based on their docking performance, followed by Molecular Mechanics/Poisson-Boltzmann Surface Area (MM/PBSA) calculations to quantify binding affinities. Additionally, molecular electrostatic potential (MEP) analysis was conducted to visualize electron density regions relevant to binding interactions. Our results demonstrate that XAN71 and XAN72 exhibit superior binding affinities of -70.97 and − 69.85 kcal/mol, respectively, outperforming MLN-4067 (-61.33 kcal/mol). MD simulations revealed stable interactions with key ACE2 residues, primarily through hydrogen bonds and hydrophobic contacts. The Molecular Electrostatic Potential(MEP) analysis further elucidated critical electron density regions that enhance binding stability. This study establishes XAN71 and XAN72 as viable candidates for ACE2 inhibition, providing a structural basis for their development as natural xanthone-based therapeutics against SARS-CoV-2. These findings highlight the potential of targeting ACE2 with natural compounds to combat COVID-19, particularly in light of emerging viral variants.

{"title":"In silico exploration of natural xanthone derivatives as potential inhibitors of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) replication and cellular entry","authors":"Vincent A. Obakachi,&nbsp;Vaderament-A. Nchiozem-Ngnitedem,&nbsp;Krishna K. Govender,&nbsp;Penny P. Govender","doi":"10.1007/s10822-025-00585-5","DOIUrl":"10.1007/s10822-025-00585-5","url":null,"abstract":"<div><p>The COVID-19 pandemic, caused by SARS-CoV-2, has underscored the urgent need for effective antiviral therapies, particularly against vaccine-resistant variants. This study investigates natural xanthone derivatives as potential inhibitors of the ACE2 receptor, a critical entry point for the virus. We computationally evaluated 91 xanthone compounds derived from <i>Swertia chirayita</i>, identifying two promising candidates: 8-O-[β-D-Xylopyranosyl-(1→6)-β-D-glucopyranosyl]-1,7-dihydroxy-3-methoxy xanthone (XAN71) and 8-O-[β-D-Xylopyranosyl-(1→6)-β-D-glucopyranosyl]-1-hydroxy-3,7-dimethoxy-xanthone (XAN72). Molecular docking and dynamics simulations (MDDS) were performed to assess their binding energy and stability within the ACE2 active site, comparing them to the reference inhibitor MLN-4067. The top six compounds were selected based on their docking performance, followed by Molecular Mechanics/Poisson-Boltzmann Surface Area (MM/PBSA) calculations to quantify binding affinities. Additionally, molecular electrostatic potential (MEP) analysis was conducted to visualize electron density regions relevant to binding interactions. Our results demonstrate that XAN71 and XAN72 exhibit superior binding affinities of -70.97 and − 69.85 kcal/mol, respectively, outperforming MLN-4067 (-61.33 kcal/mol). MD simulations revealed stable interactions with key ACE2 residues, primarily through hydrogen bonds and hydrophobic contacts. The Molecular Electrostatic Potential(MEP) analysis further elucidated critical electron density regions that enhance binding stability. This study establishes XAN71 and XAN72 as viable candidates for ACE2 inhibition, providing a structural basis for their development as natural xanthone-based therapeutics against SARS-CoV-2. These findings highlight the potential of targeting ACE2 with natural compounds to combat COVID-19, particularly in light of emerging viral variants.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-025-00585-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Elucidating allosteric signal disruption in PBP2a: impact of N146K/E150K mutations on ceftaroline resistance in methicillin-resistant Staphylococcus aureus
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-02-07 DOI: 10.1007/s10822-025-00584-6
Fangfang Jiao, Ran Xu, Qing Luo, Xinkang Li, Henry H. Y. Tong, Jingjing Guo

Ceftaroline (CFT) effectively combats methicillin-resistant Staphylococcus aureus (MRSA) by binding to the allosteric site on penicillin-binding protein 2a (PBP2a) and activating allosteric signals that remotely open the active pocket. However, the widespread clinical use of CFT has led to specific mutations, such as N146K/E150K, at the PBP2a allosteric site, which confers resistance to CFT in MRSA by disrupting the transmission of allosteric signals. Herein, computational simulations were employed to elucidate how the mutations disrupt the transmission of allosteric signals, thereby enhancing the resistance of MRSA to CFT. Specifically, the mutations alter the salt bridge network and electrostatic environment, resulting in a dynamic setting and decreased binding affinity of CFT within the allosteric pocket. Additionally, dynamical network analysis and the identification of allosteric pathways revealed that the reduced binding affinity diminishes the propagation of allosteric signals to the active site. Further evaluations demonstrated that this diminished signaling reduces the openness of the active pocket in the mutant systems, with “gatekeeper” residues and functional loops remaining partially closed. Redocking experiments confirmed that mutations lead to decreased docking scores and unfavorable docking poses for CFT within the active pocket. These findings highlight the complex interactions between structural changes induced by mutations and antibiotic resistance, providing crucial insights for developing new therapeutic strategies against MRSA resistance.

{"title":"Elucidating allosteric signal disruption in PBP2a: impact of N146K/E150K mutations on ceftaroline resistance in methicillin-resistant Staphylococcus aureus","authors":"Fangfang Jiao,&nbsp;Ran Xu,&nbsp;Qing Luo,&nbsp;Xinkang Li,&nbsp;Henry H. Y. Tong,&nbsp;Jingjing Guo","doi":"10.1007/s10822-025-00584-6","DOIUrl":"10.1007/s10822-025-00584-6","url":null,"abstract":"<div><p>Ceftaroline (CFT) effectively combats methicillin-resistant <i>Staphylococcus aureus</i> (MRSA) by binding to the allosteric site on penicillin-binding protein 2a (PBP2a) and activating allosteric signals that remotely open the active pocket. However, the widespread clinical use of CFT has led to specific mutations, such as N146K/E150K, at the PBP2a allosteric site, which confers resistance to CFT in MRSA by disrupting the transmission of allosteric signals. Herein, computational simulations were employed to elucidate how the mutations disrupt the transmission of allosteric signals, thereby enhancing the resistance of MRSA to CFT. Specifically, the mutations alter the salt bridge network and electrostatic environment, resulting in a dynamic setting and decreased binding affinity of CFT within the allosteric pocket. Additionally, dynamical network analysis and the identification of allosteric pathways revealed that the reduced binding affinity diminishes the propagation of allosteric signals to the active site. Further evaluations demonstrated that this diminished signaling reduces the openness of the active pocket in the mutant systems, with “gatekeeper” residues and functional loops remaining partially closed. Redocking experiments confirmed that mutations lead to decreased docking scores and unfavorable docking poses for CFT within the active pocket. These findings highlight the complex interactions between structural changes induced by mutations and antibiotic resistance, providing crucial insights for developing new therapeutic strategies against MRSA resistance.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143361716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In silico design of dehydrophenylalanine containing peptide activators of glucokinase using pharmacophore modelling, molecular dynamics and machine learning: implications in type 2 diabetes 利用药效团模型、分子动力学和机器学习,用计算机设计含有葡萄糖激酶肽激活剂的脱氢苯丙氨酸:对2型糖尿病的影响
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-31 DOI: 10.1007/s10822-024-00583-z
Siddharth Yadav, Swati Rana, Manish Manish, Sohini Singh, Andrew Lynn, Puniti Mathur

Diabetes represents a significant global health challenge associated with substantial healthcare costs and therapeutic complexities. Current diabetes therapies often entail adverse effects, necessitating the exploration of novel agents. Glucokinase (GK), a key enzyme in glucose homeostasis, primarily regulates blood glucose levels in hepatocytes and pancreatic cells. Unlike other hexokinases, GK exhibits unique kinetic properties, such as a high Km and lack of feedback inhibition, allowing it to function as a glucose sensor Glucokinase activators (GKAs) have emerged as promising candidates for managing type-2 diabetes by allosterically enhancing GK activity. Despite initial promise, existing GKAs face significant safety concerns, driving the need for compounds with improved safety profiles. This study introduces a novel chemical scaffold within the GKA landscape: peptide-based GKAs incorporating non-standard amino acid residues such as α,β-dehydrophenylalanine (ΔPhe/ΔF). A virtual library containing 3,368,000 peptides was constructed and screened using a hybrid pharmacophore, namely DHRR (D: donor; H: hydrogen; R: aromatic ring). Molecular docking and molecular dynamics simulations assisted in identifying three peptides, Pep-11, Pep-15, and Pep-16, which depicted stable binding at the allosteric site of Glucokinase. These peptides were synthesized using a combination of solid and solution phase synthesis methods. In vitro enzymatic activity of glucokinase was increased by at least 1.5 times in the presence of these peptides. Several machine learning algorithms were explored as alternatives to conventional in-silico methods for predicting GK activity. Regression and tree-based algorithms outperformed other methods, with the logistic regression and random forest classifiers both achieving an ROC-AUC of 0.98.

糖尿病是一项重大的全球健康挑战,涉及大量医疗保健费用和治疗复杂性。目前的糖尿病治疗往往会带来不良反应,需要探索新的药物。葡萄糖激酶(GK)是葡萄糖稳态的关键酶,主要调节肝细胞和胰腺细胞的血糖水平。与其他己糖激酶不同,GK表现出独特的动力学特性,如高Km和缺乏反馈抑制,使其能够作为葡萄糖传感器发挥作用,葡萄糖激酶激活剂(gka)已成为通过变张力增强GK活性来治疗2型糖尿病的有希望的候选物。尽管最初有希望,但现有的gka面临着重大的安全问题,这推动了对安全性更高的化合物的需求。本研究在GKA领域引入了一种新的化学支架:基于肽的GKA,包含非标准氨基酸残基,如α,β-脱氢苯丙氨酸(ΔPhe/ΔF)。构建了包含3368,000个肽段的虚拟文库,并使用混合药效团DHRR (D: donor;H:氢;R:芳香环)。分子对接和分子动力学模拟帮助鉴定了三种肽,Pep-11, Pep-15和Pep-16,它们在葡萄糖激酶的变构位点稳定结合。这些肽是用固相和液相相结合的合成方法合成的。在这些肽的存在下,葡萄糖激酶的体外酶活性增加了至少1.5倍。研究人员探索了几种机器学习算法,作为预测GK活动的传统计算机方法的替代方法。回归和基于树的算法优于其他方法,逻辑回归和随机森林分类器的ROC-AUC均达到0.98。
{"title":"In silico design of dehydrophenylalanine containing peptide activators of glucokinase using pharmacophore modelling, molecular dynamics and machine learning: implications in type 2 diabetes","authors":"Siddharth Yadav,&nbsp;Swati Rana,&nbsp;Manish Manish,&nbsp;Sohini Singh,&nbsp;Andrew Lynn,&nbsp;Puniti Mathur","doi":"10.1007/s10822-024-00583-z","DOIUrl":"10.1007/s10822-024-00583-z","url":null,"abstract":"<div><p>Diabetes represents a significant global health challenge associated with substantial healthcare costs and therapeutic complexities. Current diabetes therapies often entail adverse effects, necessitating the exploration of novel agents. Glucokinase (GK), a key enzyme in glucose homeostasis, primarily regulates blood glucose levels in hepatocytes and pancreatic cells. Unlike other hexokinases, GK exhibits unique kinetic properties, such as a high Km and lack of feedback inhibition, allowing it to function as a glucose sensor Glucokinase activators (GKAs) have emerged as promising candidates for managing type-2 diabetes by allosterically enhancing GK activity. Despite initial promise, existing GKAs face significant safety concerns, driving the need for compounds with improved safety profiles. This study introduces a novel chemical scaffold within the GKA landscape: peptide-based GKAs incorporating non-standard amino acid residues such as α,β-dehydrophenylalanine (ΔPhe/ΔF). A virtual library containing 3,368,000 peptides was constructed and screened using a hybrid pharmacophore, namely DHRR (D: donor; H: hydrogen; R: aromatic ring). Molecular docking and molecular dynamics simulations assisted in identifying three peptides, Pep-11, Pep-15, and Pep-16, which depicted stable binding at the allosteric site of Glucokinase. These peptides were synthesized using a combination of solid and solution phase synthesis methods. In vitro enzymatic activity of glucokinase was increased by at least 1.5 times in the presence of these peptides. Several machine learning algorithms were explored as alternatives to conventional in-silico methods for predicting GK activity. Regression and tree-based algorithms outperformed other methods, with the logistic regression and random forest classifiers both achieving an ROC-AUC of 0.98.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142906101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ConoDL: a deep learning framework for rapid generation and prediction of conotoxins ConoDL:用于快速生成和预测ConoDL毒素的深度学习框架
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-26 DOI: 10.1007/s10822-024-00582-0
Menghan Guo, Zengpeng Li, Xuejin Deng, Ding Luo, Jingyi Yang, Yingjun Chen, Weiwei Xue

Conotoxins, being small disulfide-rich and bioactive peptides, manifest notable pharmacological potential and find extensive applications. However, the exploration of conotoxins’ vast molecular space using traditional methods is severely limited, necessitating the urgent need of developing novel approaches. Recently, deep learning (DL)-based methods have advanced to the molecular generation of proteins and peptides. Nevertheless, the limited data and the intricate structure of conotoxins constrain the application of deep learning models in the generation of conotoxins. We propose ConoDL, a framework for the generation and prediction of conotoxins, comprising the end-to-end conotoxin generation model (ConoGen) and the conotoxin prediction model (ConoPred). ConoGen employs transfer learning and a large language model (LLM) to tackle the challenges in conotoxin generation. Meanwhile, ConoPred filters artificial conotoxins generated by ConoGen, narrowing down the scope for subsequent research. A comprehensive evaluation of the peptide properties at both sequence and structure levels indicates that the artificial conotoxins generated by ConoDL exhibit a certain degree of similarity to natural conotoxins. Furthermore, ConoDL has generated artificial conotoxins with novel cysteine scaffolds. Therefore, ConoDL may uncover new cysteine scaffolds and conotoxin molecules, facilitating further exploration of the molecular space of conotoxins and the discovery of pharmacologically active variants.

Conotoxins是一种小的富含二硫化物的生物活性肽,具有显著的药理潜力和广泛的应用。然而,利用传统方法对conotoxins广阔的分子空间的探索受到严重限制,迫切需要开发新的方法。最近,基于深度学习(DL)的方法已经发展到蛋白质和肽的分子生成。然而,有限的数据和复杂的螺毒素结构限制了深度学习模型在螺毒素生成中的应用。我们提出ConoDL,一个用于conotoxin生成和预测的框架,包括端到端conotoxin生成模型(ConoGen)和ConoPred conotoxin预测模型(ConoPred)。ConoGen采用迁移学习和大型语言模型(LLM)来解决concontoxin生成的挑战。同时,ConoPred过滤了ConoGen产生的人工松香毒素,缩小了后续研究的范围。从序列和结构两方面对其肽特性进行综合评价表明,ConoDL合成的人工conobay毒素与天然conobay毒素具有一定的相似性。此外,ConoDL还利用新型半胱氨酸支架生成人工conotoxin。因此,ConoDL可能会发现新的半胱氨酸支架和螺毒素分子,有助于进一步探索螺毒素的分子空间和发现具有药理活性的变体。
{"title":"ConoDL: a deep learning framework for rapid generation and prediction of conotoxins","authors":"Menghan Guo,&nbsp;Zengpeng Li,&nbsp;Xuejin Deng,&nbsp;Ding Luo,&nbsp;Jingyi Yang,&nbsp;Yingjun Chen,&nbsp;Weiwei Xue","doi":"10.1007/s10822-024-00582-0","DOIUrl":"10.1007/s10822-024-00582-0","url":null,"abstract":"<div><p>Conotoxins, being small disulfide-rich and bioactive peptides, manifest notable pharmacological potential and find extensive applications. However, the exploration of conotoxins’ vast molecular space using traditional methods is severely limited, necessitating the urgent need of developing novel approaches. Recently, deep learning (DL)-based methods have advanced to the molecular generation of proteins and peptides. Nevertheless, the limited data and the intricate structure of conotoxins constrain the application of deep learning models in the generation of conotoxins. We propose ConoDL, a framework for the generation and prediction of conotoxins, comprising the end-to-end conotoxin generation model (ConoGen) and the conotoxin prediction model (ConoPred). ConoGen employs transfer learning and a large language model (LLM) to tackle the challenges in conotoxin generation. Meanwhile, ConoPred filters artificial conotoxins generated by ConoGen, narrowing down the scope for subsequent research. A comprehensive evaluation of the peptide properties at both sequence and structure levels indicates that the artificial conotoxins generated by ConoDL exhibit a certain degree of similarity to natural conotoxins. Furthermore, ConoDL has generated artificial conotoxins with novel cysteine scaffolds. Therefore, ConoDL may uncover new cysteine scaffolds and conotoxin molecules, facilitating further exploration of the molecular space of conotoxins and the discovery of pharmacologically active variants.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142889745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MolGraph: a Python package for the implementation of molecular graphs and graph neural networks with TensorFlow and Keras MolGraph:一个Python包,用于使用TensorFlow和Keras实现分子图和图神经网络
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-05 DOI: 10.1007/s10822-024-00578-w
Alexander Kensert, Gert Desmet, Deirdre Cabooter

Molecular machine learning (ML) has proven important for tackling various molecular problems, such as predicting molecular properties based on molecular descriptors or fingerprints. Since relatively recently, graph neural network (GNN) algorithms have been implemented for molecular ML, showing comparable or superior performance to descriptor or fingerprint-based approaches. Although various tools and packages exist to apply GNNs in molecular ML, a new GNN package, named MolGraph, was developed in this work with the motivation to create GNN model pipelines highly compatible with the TensorFlow and Keras application programming interface (API). MolGraph also implements a module to accommodate the generation of small molecular graphs, which can be passed to a GNN algorithm to solve a molecular ML problem. To validate the GNNs, benchmarking was conducted using the datasets from MoleculeNet, as well as three chromatographic retention time datasets. The benchmarking results demonstrate that the GNNs performed in line with expectations. Additionally, the GNNs proved useful for molecular identification and improved interpretability of chromatographic retention time data. MolGraph is available at https://github.com/akensert/molgraph. Installation, tutorials and implementation details can be found at https://molgraph.readthedocs.io/en/latest/.

事实证明,分子机器学习(ML)对于解决各种分子问题非常重要,例如基于分子描述符或指纹来预测分子性质。最近,图神经网络(GNN)算法已经在分子机器学习中实现,表现出与描述符或基于指纹的方法相当或更好的性能。尽管存在各种工具和包来将GNN应用于分子ML中,但在这项工作中开发了一个名为MolGraph的新GNN包,其动机是创建与TensorFlow和Keras应用程序编程接口(API)高度兼容的GNN模型管道。MolGraph还实现了一个模块来容纳小分子图的生成,这些小分子图可以传递给GNN算法来解决分子ML问题。为了验证gnn,使用来自MoleculeNet的数据集以及三个色谱保留时间数据集进行基准测试。基准测试结果表明,gnn的性能符合预期。此外,gnn被证明有助于分子鉴定和提高色谱保留时间数据的可解释性。MolGraph可在https://github.com/akensert/molgraph上获得。安装、教程和实现细节可以在https://molgraph.readthedocs.io/en/latest/上找到。
{"title":"MolGraph: a Python package for the implementation of molecular graphs and graph neural networks with TensorFlow and Keras","authors":"Alexander Kensert,&nbsp;Gert Desmet,&nbsp;Deirdre Cabooter","doi":"10.1007/s10822-024-00578-w","DOIUrl":"10.1007/s10822-024-00578-w","url":null,"abstract":"<div><p>Molecular machine learning (ML) has proven important for tackling various molecular problems, such as predicting molecular properties based on molecular descriptors or fingerprints. Since relatively recently, graph neural network (GNN) algorithms have been implemented for molecular ML, showing comparable or superior performance to descriptor or fingerprint-based approaches. Although various tools and packages exist to apply GNNs in molecular ML, a new GNN package, named MolGraph, was developed in this work with the motivation to create GNN model pipelines highly compatible with the TensorFlow and Keras application programming interface (API). MolGraph also implements a module to accommodate the generation of small molecular graphs, which can be passed to a GNN algorithm to solve a molecular ML problem. To validate the GNNs, benchmarking was conducted using the datasets from MoleculeNet, as well as three chromatographic retention time datasets. The benchmarking results demonstrate that the GNNs performed in line with expectations. Additionally, the GNNs proved useful for molecular identification and improved interpretability of chromatographic retention time data. MolGraph is available at https://github.com/akensert/molgraph. Installation, tutorials and implementation details can be found at https://molgraph.readthedocs.io/en/latest/.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142778406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining crystallographic and binding affinity data towards a novel dataset of small molecule overlays 结合晶体学和结合亲和数据对一个新的小分子覆盖数据集。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-12-04 DOI: 10.1007/s10822-024-00581-1
Sophia M. N. Hönig, Torben Gutermuth, Christiane Ehrt, Christian Lemmen, Matthias Rarey

Although small molecule superposition is a standard technique in drug discovery, a rigorous performance assessment of the corresponding methods is currently challenging. Datasets in this field are sparse, small, tailored to specific applications, unavailable, or outdated. The newly developed LOBSTER set described herein offers a publicly available and method-independent dataset for benchmarking and method optimization. LOBSTER stands for “Ligand Overlays from Binding SiTe Ensemble Representatives”. All ligands were derived from the PDB in a fully automated workflow, including a ligand efficiency filter. So-called ligand ensembles were assembled by aligning identical binding sites. Thus, the ligands within the ensembles are superimposed according to their experimentally determined binding orientation and conformation. Overall, 671 representative ligand ensembles comprise 3583 ligands from 3521 proteins. Altogether, 72,734 ligand pairs based on the ensembles were grouped into ten distinct subsets based on their volume overlap, for the benefit of introducing different degrees of difficulty for evaluating superposition methods. Statistics on the physicochemical properties of the compounds indicate that the dataset represents drug-like compounds. Consensus Diversity Plots show predominantly high Bemis–Murcko scaffold diversity and low median MACCS fingerprint similarity for each ensemble. An analysis of the underlying protein classes further demonstrates the heterogeneity within our dataset. The LOBSTER set offers a variety of applications like benchmarking multiple as well as pairwise alignments, generating training and test sets, for example based on time splits, or empirical software performance evaluation studies. The LOBSTER set is publicly available at https://doi.org/10.5281/zenodo.12658320, representing a stable and versioned data resource. The Python scripts are available at https://github.com/rareylab/LOBSTER, open-source, and allow for updating or recreating superposition sets with different data sources.

Simplified illustration of the LOBSTER dataset generation.

虽然小分子叠加是药物发现的标准技术,但对相应方法的严格性能评估目前具有挑战性。该领域的数据集稀疏、小、针对特定应用量身定制、不可用或过时。本文描述的新开发的LOBSTER集提供了一个公开可用的、与方法无关的数据集,用于基准测试和方法优化。龙虾代表“结合位点集合代表的配体叠加”。所有的配体都是在一个完全自动化的工作流程中从PDB中提取的,包括一个配体效率过滤器。所谓的配体组合是通过排列相同的结合位点来组装的。因此,根据实验确定的结合取向和构象,组合内的配体是叠加的。总的来说,671个有代表性的配体集合包含3583个配体,来自3521个蛋白质。总的来说,基于这些组合的72,734对配体基于它们的体积重叠被分为10个不同的子集,以便引入不同程度的难度来评估叠加方法。对化合物的物理化学性质的统计表明,该数据集代表药物样化合物。一致性多样性图显示,每个集合的Bemis-Murcko骨架多样性显著较高,而MACCS指纹相似度中值较低。对潜在蛋白质类别的分析进一步证明了我们数据集中的异质性。LOBSTER集提供了各种各样的应用程序,如基准测试多重和成对对齐,生成训练和测试集,例如基于时间分裂,或经验软件性能评估研究。LOBSTER集可以在https://doi.org/10.5281/zenodo.12658320上公开获得,它表示稳定且有版本的数据资源。Python脚本可从https://github.com/rareylab/LOBSTER(开源)获取,并允许使用不同的数据源更新或重新创建叠加集。
{"title":"Combining crystallographic and binding affinity data towards a novel dataset of small molecule overlays","authors":"Sophia M. N. Hönig,&nbsp;Torben Gutermuth,&nbsp;Christiane Ehrt,&nbsp;Christian Lemmen,&nbsp;Matthias Rarey","doi":"10.1007/s10822-024-00581-1","DOIUrl":"10.1007/s10822-024-00581-1","url":null,"abstract":"<p>Although small molecule superposition is a standard technique in drug discovery, a rigorous performance assessment of the corresponding methods is currently challenging. Datasets in this field are sparse, small, tailored to specific applications, unavailable, or outdated. The newly developed LOBSTER set described herein offers a publicly available and method-independent dataset for benchmarking and method optimization. LOBSTER stands for “Ligand Overlays from Binding SiTe Ensemble Representatives”. All ligands were derived from the PDB in a fully automated workflow, including a ligand efficiency filter. So-called ligand ensembles were assembled by aligning identical binding sites. Thus, the ligands within the ensembles are superimposed according to their experimentally determined binding orientation and conformation. Overall, 671 representative ligand ensembles comprise 3583 ligands from 3521 proteins. Altogether, 72,734 ligand pairs based on the ensembles were grouped into ten distinct subsets based on their volume overlap, for the benefit of introducing different degrees of difficulty for evaluating superposition methods. Statistics on the physicochemical properties of the compounds indicate that the dataset represents drug-like compounds. Consensus Diversity Plots show predominantly high Bemis–Murcko scaffold diversity and low median MACCS fingerprint similarity for each ensemble. An analysis of the underlying protein classes further demonstrates the heterogeneity within our dataset. The LOBSTER set offers a variety of applications like benchmarking multiple as well as pairwise alignments, generating training and test sets, for example based on time splits, or empirical software performance evaluation studies. The LOBSTER set is publicly available at https://doi.org/10.5281/zenodo.12658320, representing a stable and versioned data resource. The Python scripts are available at https://github.com/rareylab/LOBSTER, open-source, and allow for updating or recreating superposition sets with different data sources. </p><p>Simplified illustration of the LOBSTER dataset generation.</p>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-024-00581-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142764979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Promoter recognition specificity of Corynebacterium glutamicum stress response sigma factors σD and σH deciphered using computer modeling and point mutagenesis 利用计算机建模和点突变破译谷氨酸棒杆菌应激反应sigma因子σD和σH的启动子识别特异性。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-11-25 DOI: 10.1007/s10822-024-00577-x
J. Blumenstein, H. Dostálová, L. Rucká, V. Štěpánek, T. Busche, J. Kalinowski, M. Pátek, I. Barvík

This study aimed to reveal interactions of the stress response sigma subunits (factors) σD and σH of RNA polymerase and promoters in Gram-positive bacterium Corynebacterium glutamicum by combining wet-lab obtained data and in silico modeling. Computer modeling-guided point mutagenesis of C. glutamicum σH subunit led to the creation of a panel of σH variants. Their ability to initiate transcription from naturally occurring hybrid σDH-dependent promoter Pcg0441 and two control canonical promoters (σD-dependent PrsdA and σH-dependent PuvrD3) was measured and interpreted using molecular dynamics simulations of homology models of all complexes. The results led us to design the artificial hybrid promoter PD35H10 combining the −10 element of the PuvrD3 promoter and the −35 element of the PrsdA promoter. This artificial hybrid promoter PD35-rsdAH10-uvrD3 showed almost optimal properties needed for the bio-orthogonal transcription (not interfering with the native biological processes).

本研究旨在通过将湿实验室获得的数据与硅学建模相结合,揭示革兰氏阳性菌谷氨酸棒杆菌(Corynebacterium glutamicum)RNA聚合酶的应激反应sigma亚基(因子)σD和σH与启动子之间的相互作用。计算机建模指导下的谷氨酸棒杆菌σH亚基点突变产生了一系列σH变体。通过对所有复合体的同源模型进行分子动力学模拟,我们测量并解释了它们从天然存在的σD/σH依赖性混合启动子Pcg0441和两个对照规范启动子(σD依赖性PrsdA和σH依赖性PuvrD3)启动转录的能力。这些结果促使我们设计了人工混合启动子 PD35H10,它结合了 PuvrD3 启动子的 -10 元件和 PrsdA 启动子的 -35 元件。这种人工混合启动子 PD35-rsdAH10-uvrD3 几乎表现出了生物正交转录(不干扰原生生物过程)所需的最佳特性。
{"title":"Promoter recognition specificity of Corynebacterium glutamicum stress response sigma factors σD and σH deciphered using computer modeling and point mutagenesis","authors":"J. Blumenstein,&nbsp;H. Dostálová,&nbsp;L. Rucká,&nbsp;V. Štěpánek,&nbsp;T. Busche,&nbsp;J. Kalinowski,&nbsp;M. Pátek,&nbsp;I. Barvík","doi":"10.1007/s10822-024-00577-x","DOIUrl":"10.1007/s10822-024-00577-x","url":null,"abstract":"<div><p>This study aimed to reveal interactions of the stress response sigma subunits (factors) σ<sup>D</sup> and σ<sup>H</sup> of RNA polymerase and promoters in Gram-positive bacterium <i>Corynebacterium glutamicum</i> by combining wet-lab obtained data and in silico modeling. Computer modeling-guided point mutagenesis of <i>C. glutamicum</i> σ<sup>H</sup> subunit led to the creation of a panel of σ<sup>H</sup> variants. Their ability to initiate transcription from naturally occurring hybrid σ<sup>D</sup>/σ<sup>H</sup>-dependent promoter P<i>cg0441</i> and two control canonical promoters (σ<sup>D</sup>-dependent P<i>rsdA</i> and σ<sup>H</sup>-dependent P<i>uvrD3</i>) was measured and interpreted using molecular dynamics simulations of homology models of all complexes. The results led us to design the artificial hybrid promoter P<i>D</i><sub><i>35</i></sub><i>H</i><sub><i>10</i></sub> combining the −10 element of the P<i>uvrD3</i> promoter and the −35 element of the P<i>rsdA</i> promoter. This artificial hybrid promoter P<i>D</i><sub><i>35-rsdA</i></sub><i>H</i><sub><i>10-uvrD3</i></sub> showed almost optimal properties needed for the bio-orthogonal transcription (not interfering with the native biological processes).</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10822-024-00577-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142708720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding the relationship between preferential interactions of peptides in water-acetonitrile mixtures with protein-solvent contact surface area 了解肽在水-乙腈混合物中的优先相互作用与蛋白质-溶剂接触表面积之间的关系。
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-11-13 DOI: 10.1007/s10822-024-00579-9
Monika Phougat, Narinder Singh Sahni, Devapriya Choudhury

The influence of polar, water-miscible organic solvents (POS) on protein structure, stability, and functional activity is a subject of significant interest and complexity. This study examines the effects of acetonitrile (ACN), a semipolar, aprotic solvent, on the solvation properties of blocked Ace-Gly-X-Gly-Nme tripeptides (where Ace and Nme stands for acetyl and N-methyl amide groups respectively and X is any amino acid) through extensive molecular dynamics simulations. Individual simulations were conducted for each peptide, encompassing five different ACN concentrations within the range of χACN = 0.1–0.9. The preferential solvation parameter (Γ) calculated using the Kirkwood-Buff integral method was used for the assessment of peptide interactions with water/ACN. Additionally, weighted Voronoi tessellation was applied to obtain a three-way data set containing four time-averaged contact surface area types between peptide atoms and water/ACN atoms. A mathematical technique known as N-way Partial Least Squares (NPLS) was utilized to anticipate the preferential interactions between peptides and water/ACN from the contact surface areas. Furthermore, the temperature dependency of peptide-solvent interactions was investigated using a subset of 10 amino acids representing a range of hydrophobicities. MD simulations were conducted at five temperatures, spanning from 283 to 343 K, with subsequent analysis of data focusing on both preferential solvation and peptide-solvent contact surface areas. The results demonstrate the efficacy of utilizing contact surface areas between the peptide and solvent constituents for successfully predicting preferential interactions in water/ACN mixtures across various ACN concentrations and temperatures.

极性水溶性有机溶剂(POS)对蛋白质结构、稳定性和功能活性的影响是一个非常有趣和复杂的课题。本研究通过大量分子动力学模拟,研究了半极性钝化溶剂乙腈(ACN)对阻断的 Ace-Gly-X-Gly-Nme 三肽(其中 Ace 和 Nme 分别代表乙酰基和 N-甲基酰胺基团,X 代表任何氨基酸)溶解特性的影响。在 χACN = 0.1-0.9 的范围内,对每种肽进行了五种不同浓度的 ACN 模拟。使用柯克伍德-巴夫积分法计算的优先溶解参数(Γ)用于评估多肽与水/ACN 的相互作用。此外,还采用加权沃罗诺网格划分法获得了三向数据集,其中包含肽原子与水/ACN 原子间的四种时间平均接触表面积类型。利用一种称为 N 向偏最小二乘法(NPLS)的数学技术,从接触表面积中预测肽与水/ACN 之间的优先相互作用。此外,还使用代表一系列疏水性的 10 个氨基酸子集研究了肽与溶剂相互作用的温度依赖性。在 283 至 343 K 的五个温度范围内进行了 MD 模拟,随后对数据进行了分析,重点是优先溶解和肽-溶剂接触表面积。结果表明,利用肽和溶剂成分之间的接触表面积可以成功预测水/ACN 混合物在不同 ACN 浓度和温度下的优先相互作用。
{"title":"Understanding the relationship between preferential interactions of peptides in water-acetonitrile mixtures with protein-solvent contact surface area","authors":"Monika Phougat,&nbsp;Narinder Singh Sahni,&nbsp;Devapriya Choudhury","doi":"10.1007/s10822-024-00579-9","DOIUrl":"10.1007/s10822-024-00579-9","url":null,"abstract":"<div><p>The influence of polar, water-miscible organic solvents (POS) on protein structure, stability, and functional activity is a subject of significant interest and complexity. This study examines the effects of acetonitrile (ACN), a semipolar, aprotic solvent, on the solvation properties of blocked Ace-Gly-X-Gly-Nme tripeptides (where Ace and Nme stands for acetyl and N-methyl amide groups respectively and X is any amino acid) through extensive molecular dynamics simulations. Individual simulations were conducted for each peptide, encompassing five different ACN concentrations within the range of <i>χ</i><sub>ACN</sub> = 0.1–0.9. The preferential solvation parameter (Γ) calculated using the Kirkwood-Buff integral method was used for the assessment of peptide interactions with water/ACN. Additionally, weighted Voronoi tessellation was applied to obtain a three-way data set containing four time-averaged contact surface area types between peptide atoms and water/ACN atoms. A mathematical technique known as <i>N</i>-way Partial Least Squares (NPLS) was utilized to anticipate the preferential interactions between peptides and water/ACN from the contact surface areas. Furthermore, the temperature dependency of peptide-solvent interactions was investigated using a subset of 10 amino acids representing a range of hydrophobicities. MD simulations were conducted at five temperatures, spanning from 283 to 343 K, with subsequent analysis of data focusing on both preferential solvation and peptide-solvent contact surface areas. The results demonstrate the efficacy of utilizing contact surface areas between the peptide and solvent constituents for successfully predicting preferential interactions in water/ACN mixtures across various ACN concentrations and temperatures.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142611743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of novel inhibitors targeting PI3Kα via ensemble-based virtual screening method, biological evaluation and molecular dynamics simulation 通过基于集合的虚拟筛选方法、生物学评价和分子动力学模拟,鉴定靶向 PI3Kα 的新型抑制剂
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-11-11 DOI: 10.1007/s10822-024-00580-2
Hui Zhang, Hua-Zhao Qi, Ya-Juan Li, Xiu-Yun Shi, Mei-Ling Hu, Xiang-Long Chen, Yuan Li

PIK3CA gene encoding PI3K p110α is one of the most frequently mutated and overexpressed in majority of human cancers. Development of potent and selective novel inhibitors targeting PI3Kα was considered as the most promising approaches for cancer treatment. In this investigation, a virtual screening platform for PI3Kα inhibitors was established by employing machine learning methods, pharmacophore modeling, and molecular docking approaches. 28 potential PI3Kα inhibitors with different scaffolds were selected from the databases with 295,024 compounds. Among the 28 hits, hit15 exhibited the best inhibitory effect against PI3Kα with IC50 value less than 1.0 µM. The molecular dynamics simulation indicated that hit15 could stably bind to the active site of PI3Kα, interact with some residues by hydrophobic, electrostatic and hydrogen bonding interactions, and finally induced PI3Kα active pocket substantial conformation changes. Stable H-bond interactions were formed between hit15 and residues of Lys776, Asp810 and Asp933. The binding free energy of PI3Kα-hit15 was − 65.3 kJ/mol. The free energy decomposition indicated that key residues of Asp805, Ile848 and Ile932 contributed stronger energies to the binding free energy. The above results indicated that hit15 with novel scaffold was a potent PI3Kα inhibitor and considered as a promising candidate for further drug development to treat various cancers with PI3Kα over activated.

Graphical Abstract

编码 PI3K p110α 的 PIK3CA 基因是大多数人类癌症中最常发生突变和过度表达的基因之一。开发针对 PI3Kα 的强效、选择性新型抑制剂被认为是最有希望的癌症治疗方法。在这项研究中,通过采用机器学习方法、药理模型和分子对接方法,建立了一个 PI3Kα 抑制剂的虚拟筛选平台。从295,024个化合物的数据库中筛选出28个具有不同支架的潜在PI3Kα抑制剂。在这28个化合物中,第15个化合物对PI3Kα的抑制效果最好,IC50值小于1.0 µM。分子动力学模拟结果表明,hit15能稳定地结合到PI3Kα的活性位点,通过疏水、静电和氢键作用与一些残基相互作用,最终诱导PI3Kα活性口袋发生实质性的构象变化。hit15与Lys776、Asp810和Asp933残基之间形成了稳定的氢键相互作用。PI3Kα-hit15 的结合自由能为 - 65.3 kJ/mol。自由能分解结果表明,Asp805、Ile848 和 Ile932 等关键残基对结合自由能的贡献较大。上述结果表明,具有新型支架的hit15是一种强效的PI3Kα抑制剂,有望作为候选药物进一步开发,用于治疗PI3Kα过度激活的各种癌症。
{"title":"Identification of novel inhibitors targeting PI3Kα via ensemble-based virtual screening method, biological evaluation and molecular dynamics simulation","authors":"Hui Zhang,&nbsp;Hua-Zhao Qi,&nbsp;Ya-Juan Li,&nbsp;Xiu-Yun Shi,&nbsp;Mei-Ling Hu,&nbsp;Xiang-Long Chen,&nbsp;Yuan Li","doi":"10.1007/s10822-024-00580-2","DOIUrl":"10.1007/s10822-024-00580-2","url":null,"abstract":"<div><p>PIK3CA gene encoding PI3K p110α is one of the most frequently mutated and overexpressed in majority of human cancers. Development of potent and selective novel inhibitors targeting PI3Kα was considered as the most promising approaches for cancer treatment. In this investigation, a virtual screening platform for PI3Kα inhibitors was established by employing machine learning methods, pharmacophore modeling, and molecular docking approaches. 28 potential PI3Kα inhibitors with different scaffolds were selected from the databases with 295,024 compounds. Among the 28 hits, hit15 exhibited the best inhibitory effect against PI3Kα with IC<sub>50</sub> value less than 1.0 µM. The molecular dynamics simulation indicated that hit15 could stably bind to the active site of PI3Kα, interact with some residues by hydrophobic, electrostatic and hydrogen bonding interactions, and finally induced PI3Kα active pocket substantial conformation changes. Stable H-bond interactions were formed between hit15 and residues of Lys776, Asp810 and Asp933. The binding free energy of PI3Kα-hit15 was − 65.3 kJ/mol. The free energy decomposition indicated that key residues of Asp805, Ile848 and Ile932 contributed stronger energies to the binding free energy. The above results indicated that hit15 with novel scaffold was a potent PI3Kα inhibitor and considered as a promising candidate for further drug development to treat various cancers with PI3Kα over activated.</p><h3>Graphical Abstract</h3>\u0000<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative assessment of physics-based in silico methods to calculate relative solubilities 对基于物理的计算相对溶解度的硅学方法进行比较评估
IF 3 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-10-29 DOI: 10.1007/s10822-024-00576-y
Adiran Garaizar Suarez, Andreas H. Göller, Michael E. Beck, Sadra Kashef Ol Gheta, Katharina Meier

Relative solubilities, i.e. whether a given molecule is more soluble in one solvent compared to others, is a critical parameter for pharmaceutical and agricultural formulation development and chemical synthesis, material science, and environmental chemistry. In silico predictions of this crucial variable can help reducing experiments, waste of solvents and synthesis optimization. In this study, we evaluate the performance of different physics-based methods for predicting relative solubilities. Our assessment involves quantum mechanics-based COSMO-RS and molecular dynamics-based free energy methods using OPLS4, the open-source OpenFF Sage, and GAFF force fields, spanning over 200 solvent–solute combinations. Our investigation highlights the important role of compound multimerization, an effect which must be accounted for to obtain accurate relative solubility predictions. The performance landscape of these methods is varied, with significant differences in precision depending on both the method used and the solute considered, thereby offering an improved understanding of the predictive power of physics-based methods in chemical research.

相对溶解度,即特定分子在一种溶剂中是否比在其他溶剂中更易溶解,是医药和农业配方开发、化学合成、材料科学和环境化学的一个关键参数。对这一关键变量进行硅学预测有助于减少实验、溶剂浪费和合成优化。在本研究中,我们评估了不同物理方法在预测相对溶解度方面的性能。我们的评估涉及基于量子力学的 COSMO-RS 和基于分子动力学的自由能方法,使用 OPLS4、开源 OpenFF Sage 和 GAFF 力场,涵盖 200 多种溶剂-溶质组合。我们的研究强调了化合物多聚化的重要作用,要获得准确的相对溶解度预测,必须考虑到这种效应。这些方法的性能各不相同,其精度因所使用的方法和所考虑的溶质而存在显著差异,从而使人们更好地了解了基于物理的方法在化学研究中的预测能力。
{"title":"Comparative assessment of physics-based in silico methods to calculate relative solubilities","authors":"Adiran Garaizar Suarez,&nbsp;Andreas H. Göller,&nbsp;Michael E. Beck,&nbsp;Sadra Kashef Ol Gheta,&nbsp;Katharina Meier","doi":"10.1007/s10822-024-00576-y","DOIUrl":"10.1007/s10822-024-00576-y","url":null,"abstract":"<div><p>Relative solubilities, i.e. whether a given molecule is more soluble in one solvent compared to others, is a critical parameter for pharmaceutical and agricultural formulation development and chemical synthesis, material science, and environmental chemistry. In silico predictions of this crucial variable can help reducing experiments, waste of solvents and synthesis optimization. In this study, we evaluate the performance of different physics-based methods for predicting relative solubilities. Our assessment involves quantum mechanics-based COSMO-RS and molecular dynamics-based free energy methods using OPLS4, the open-source OpenFF Sage, and GAFF force fields, spanning over 200 solvent–solute combinations. Our investigation highlights the important role of compound multimerization, an effect which must be accounted for to obtain accurate relative solubility predictions. The performance landscape of these methods is varied, with significant differences in precision depending on both the method used and the solute considered, thereby offering an improved understanding of the predictive power of physics-based methods in chemical research.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142524487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Computer-Aided Molecular Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1