首页 > 最新文献

Journal of Chemical Information and Modeling 最新文献

英文 中文
CHARMM-GUI Hybrid ML/MM Builder for Hybrid Machine Learning and Molecular Mechanical Modeling and Simulations. CHARMM-GUI混合ML/MM Builder用于混合机器学习和分子机械建模和模拟。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-09 DOI: 10.1021/acs.jcim.6c00060
Florence Szczepaniak,Donghyuk Suh,Wonpil Im
Recent advances in machine learning (ML) have enabled new developments in molecular dynamics simulation. Neural network potentials (NNPs) trained on quantum mechanical (QM) data provide highly accurate descriptions of drug-like molecules. Analogous to a QM and molecular mechanical (QM/MM) approach, hybrid ML/MM simulations employ NNPs to describe a localized region of the system, such as a ligand, while the rest of the system is treated using classical MM force fields. This hybrid framework enables simulations of protein-ligand complexes with near-QM accuracy for the ligand at a substantially reduced computational cost. CHARMM-GUI Hybrid ML/MM Builder automates the preparation of system and input files required for hybrid ML/MM modeling and simulation. This new module generates all necessary files to simulate protein-ligand complexes in solution or membrane using TorchANI-AMBER and OpenMM-ML. Currently supported NNPs include MACE and ANI. In this paper, we present Hybrid ML/MM Builder and representative application systems that demonstrate its usage and capabilities.
机器学习(ML)的最新进展使分子动力学模拟取得了新的发展。在量子力学(QM)数据上训练的神经网络电位(NNPs)提供了对类药物分子的高度精确描述。与QM和分子力学(QM/MM)方法类似,混合ML/MM模拟使用NNPs来描述系统的局部区域,如配体,而系统的其余部分则使用经典的MM力场处理。这种混合框架能够以接近qm的精度模拟配体的蛋白质-配体复合物,大大降低了计算成本。CHARMM-GUI Hybrid ML/MM Builder自动准备混合ML/MM建模和仿真所需的系统和输入文件。使用TorchANI-AMBER和OpenMM-ML,这个新模块生成所有必要的文件来模拟溶液或膜中的蛋白质配体复合物。目前支持的nnp包括MACE和ANI。在本文中,我们介绍了混合式ML/MM构建器和典型的应用系统,展示了它的使用和功能。
{"title":"CHARMM-GUI Hybrid ML/MM Builder for Hybrid Machine Learning and Molecular Mechanical Modeling and Simulations.","authors":"Florence Szczepaniak,Donghyuk Suh,Wonpil Im","doi":"10.1021/acs.jcim.6c00060","DOIUrl":"https://doi.org/10.1021/acs.jcim.6c00060","url":null,"abstract":"Recent advances in machine learning (ML) have enabled new developments in molecular dynamics simulation. Neural network potentials (NNPs) trained on quantum mechanical (QM) data provide highly accurate descriptions of drug-like molecules. Analogous to a QM and molecular mechanical (QM/MM) approach, hybrid ML/MM simulations employ NNPs to describe a localized region of the system, such as a ligand, while the rest of the system is treated using classical MM force fields. This hybrid framework enables simulations of protein-ligand complexes with near-QM accuracy for the ligand at a substantially reduced computational cost. CHARMM-GUI Hybrid ML/MM Builder automates the preparation of system and input files required for hybrid ML/MM modeling and simulation. This new module generates all necessary files to simulate protein-ligand complexes in solution or membrane using TorchANI-AMBER and OpenMM-ML. Currently supported NNPs include MACE and ANI. In this paper, we present Hybrid ML/MM Builder and representative application systems that demonstrate its usage and capabilities.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"6 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147381343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cyclin-E/A/CDK1/2 Kinetic Landscapes Drive Cell Cycle Phase-Specific Progression and Guide Cyclin-E Degradation Strategy. Cyclin-E/A/CDK1/2动力学景观驱动细胞周期阶段特异性进展和指导Cyclin-E降解策略
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-09 DOI: 10.1021/acs.jcim.6c00279
Wengang Zhang,Devin Bradburn,Yonglan Liu,Hyunbum Jang,Mardo Kõivomägi,Ruth Nussinov
The cell cycle relies on sequential activation of cyclin-dependent kinases (CDKs) by phase-specific cyclins. Previously, we proposed that their conformations and activation speed are tuned to the needs of their respective phases. We demonstrated this principle by using molecular dynamics simulations to evaluate the slower activation and catalytic kinetics of Cyclin-D/CDK4 during the long G1 phase compared to the rapid activation of Cyclin-E/CDK2 in the brief G1/S transition, and the higher intrinsic activity of Cyclin-D/CDK6 required for rapid hematopoietic cell proliferation. Here, we ask whether this principle also holds for subsequent cell cycle phases. We explore how the dynamic behavior of structurally similar Cyclin-E/CDK2, Cyclin-A/CDK2, and Cyclin-A/CDK1 controls their distinct tasks, and how the cell ensures that Cyclin-A/CDK2 and Cyclin-A/CDK1, which share the same allosteric effector Cyclin-A, avoid redundantly triggering S and M-phase events out of order. Through molecular dynamics simulations, we find that their functional differences relate to their distinct conformational energy landscapes and kinetic profiles. Unlike the plastic interface of CDK1 complexes, the Cyclin-E/CDK2 complex, governing the G1/S transition, is conformationally constrained by a stable interface and is less dependent on its catalytic outputs. In contrast, the high catalytic efficiency of Cyclin-A/CDK2 can support rapid phosphorylation of S phase replication factors, thereby preventing DNA rereplication through preorganization of the CDK2 DFG-motif. We translate our results to the clinic by proposing an innovative allosteric degrader strategy for selective Cyclin-E degradation. We further validate our design workflow by reproducing the ternary complex of a known CDK2 degrader, and applying this approach to model an allosteric degrader thereby establishing the structural parameters required to target this specific Cyclin-E/CDK2-cereblon conformational state.
细胞周期依赖于周期蛋白依赖性激酶(CDKs)的顺序激活。在此之前,我们提出了它们的构象和激活速度是根据各自相的需要而调整的。我们通过使用分子动力学模拟来证明这一原理,以评估与Cyclin-E/CDK2在短暂的G1/S过渡期间的快速激活相比,Cyclin-D/CDK4在长G1期的缓慢激活和催化动力学,以及Cyclin-D/CDK6在快速造血细胞增殖所需的更高的内在活性。在这里,我们问这一原则是否也适用于随后的细胞周期阶段。我们探讨了结构相似的Cyclin-E/CDK2、Cyclin-A/CDK2和Cyclin-A/CDK1的动态行为如何控制它们不同的任务,以及细胞如何确保具有相同变构效应的Cyclin-A - a /CDK2和Cyclin-A/CDK1避免无序地冗余触发S期和m期事件。通过分子动力学模拟,我们发现它们的功能差异与它们不同的构象能量景观和动力学剖面有关。与CDK1复合物的塑料界面不同,控制G1/S转变的cycline /CDK2复合物受稳定界面的构象约束,较少依赖于其催化输出。相反,Cyclin-A/CDK2的高催化效率可以支持S期复制因子的快速磷酸化,从而通过CDK2 dfg基序的预组织阻止DNA复制。我们通过提出一种创新的变构降解策略来选择性降解Cyclin-E,将我们的结果转化为临床应用。我们通过复制已知CDK2降解物的三元配合物来进一步验证我们的设计工作流程,并将这种方法应用于变构降解物的建模,从而建立针对这种特定Cyclin-E/CDK2-cereblon构象状态所需的结构参数。
{"title":"Cyclin-E/A/CDK1/2 Kinetic Landscapes Drive Cell Cycle Phase-Specific Progression and Guide Cyclin-E Degradation Strategy.","authors":"Wengang Zhang,Devin Bradburn,Yonglan Liu,Hyunbum Jang,Mardo Kõivomägi,Ruth Nussinov","doi":"10.1021/acs.jcim.6c00279","DOIUrl":"https://doi.org/10.1021/acs.jcim.6c00279","url":null,"abstract":"The cell cycle relies on sequential activation of cyclin-dependent kinases (CDKs) by phase-specific cyclins. Previously, we proposed that their conformations and activation speed are tuned to the needs of their respective phases. We demonstrated this principle by using molecular dynamics simulations to evaluate the slower activation and catalytic kinetics of Cyclin-D/CDK4 during the long G1 phase compared to the rapid activation of Cyclin-E/CDK2 in the brief G1/S transition, and the higher intrinsic activity of Cyclin-D/CDK6 required for rapid hematopoietic cell proliferation. Here, we ask whether this principle also holds for subsequent cell cycle phases. We explore how the dynamic behavior of structurally similar Cyclin-E/CDK2, Cyclin-A/CDK2, and Cyclin-A/CDK1 controls their distinct tasks, and how the cell ensures that Cyclin-A/CDK2 and Cyclin-A/CDK1, which share the same allosteric effector Cyclin-A, avoid redundantly triggering S and M-phase events out of order. Through molecular dynamics simulations, we find that their functional differences relate to their distinct conformational energy landscapes and kinetic profiles. Unlike the plastic interface of CDK1 complexes, the Cyclin-E/CDK2 complex, governing the G1/S transition, is conformationally constrained by a stable interface and is less dependent on its catalytic outputs. In contrast, the high catalytic efficiency of Cyclin-A/CDK2 can support rapid phosphorylation of S phase replication factors, thereby preventing DNA rereplication through preorganization of the CDK2 DFG-motif. We translate our results to the clinic by proposing an innovative allosteric degrader strategy for selective Cyclin-E degradation. We further validate our design workflow by reproducing the ternary complex of a known CDK2 degrader, and applying this approach to model an allosteric degrader thereby establishing the structural parameters required to target this specific Cyclin-E/CDK2-cereblon conformational state.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"17 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147381344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Force Field Developer and Optimizer Platform: Torsion Reparameterization 自动化力场开发和优化平台:扭转重参数化
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-09 DOI: 10.1021/acs.jcim.6c00528
Alejandro Blanco-Gonzalez,William Betancourt,Ryan Michael Snyder,Shi Zhang,Timothy J. Giese,Zeke A. Piskulich,Andreas W. Götz,Kenneth M. Merz Jr.,Darrin M. York,Hasan Metin Aktulga,Madushanka Manathunga
General force fields such as General Amber Force Field (GAFF) have been designed for broad applicability and are widely used in protein–ligand binding simulations in structure-based drug discovery. However, the force field parameters are not always transferable across ligand molecules, and custom reparameterization is sometimes necessary for accurate binding free energy simulations. This is especially true for torsion parameters, which are highly dependent on stereoelectronic and steric effects. Here, we report a novel, flexible, and user-friendly computational tool called the Automated Force Field Developer and Optimizer (AFFDO) platform that allows generating accurate, tailored GAFF2 torsion parameters for drug-like molecules. For a given ligand, AFFDO selects the most important torsions, carries out GPU-accelerated density functional theory calculations to collect reference data and fits torsion terms using a fast gradient-based optimizer that leverages automated differentiation. We benchmark AFFDO by parametrizing a series of drug-like molecules and carrying out protein–ligand relative binding free energy (RBFE) simulations. The results show that AFFDO can significantly improve GAFF2 torsion parameters against QM reference data, which in some cases translates into better agreement with experimental RBFE values within a reasonable computational time.
通用力场(General Amber force Field, GAFF)具有广泛的适用性,被广泛应用于基于结构的药物发现中的蛋白质-配体结合模拟。然而,力场参数并不总是可以在配体分子之间传递,为了精确地模拟结合自由能,有时需要自定义重新参数化。对于高度依赖于立体电子效应和空间效应的扭转参数尤其如此。在这里,我们报告了一种新颖的、灵活的、用户友好的计算工具,称为自动化力场开发和优化器(AFFDO)平台,它允许为类药物分子生成准确的、定制的GAFF2扭转参数。对于给定的配体,AFFDO选择最重要的扭转,执行gpu加速的密度泛函理论计算以收集参考数据,并使用利用自动微分的快速梯度优化器拟合扭转项。我们通过参数化一系列药物样分子并进行蛋白质-配体相对结合自由能(RBFE)模拟来对AFFDO进行基准测试。结果表明,AFFDO可以显著提高GAFF2对QM参考数据的扭转参数,在某些情况下,在合理的计算时间内转化为与实验RBFE值更好的一致性。
{"title":"Automated Force Field Developer and Optimizer Platform: Torsion Reparameterization","authors":"Alejandro Blanco-Gonzalez,William Betancourt,Ryan Michael Snyder,Shi Zhang,Timothy J. Giese,Zeke A. Piskulich,Andreas W. Götz,Kenneth M. Merz Jr.,Darrin M. York,Hasan Metin Aktulga,Madushanka Manathunga","doi":"10.1021/acs.jcim.6c00528","DOIUrl":"https://doi.org/10.1021/acs.jcim.6c00528","url":null,"abstract":"General force fields such as General Amber Force Field (GAFF) have been designed for broad applicability and are widely used in protein–ligand binding simulations in structure-based drug discovery. However, the force field parameters are not always transferable across ligand molecules, and custom reparameterization is sometimes necessary for accurate binding free energy simulations. This is especially true for torsion parameters, which are highly dependent on stereoelectronic and steric effects. Here, we report a novel, flexible, and user-friendly computational tool called the Automated Force Field Developer and Optimizer (AFFDO) platform that allows generating accurate, tailored GAFF2 torsion parameters for drug-like molecules. For a given ligand, AFFDO selects the most important torsions, carries out GPU-accelerated density functional theory calculations to collect reference data and fits torsion terms using a fast gradient-based optimizer that leverages automated differentiation. We benchmark AFFDO by parametrizing a series of drug-like molecules and carrying out protein–ligand relative binding free energy (RBFE) simulations. The results show that AFFDO can significantly improve GAFF2 torsion parameters against QM reference data, which in some cases translates into better agreement with experimental RBFE values within a reasonable computational time.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"45 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147383804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trustworthy Compound-Protein Interaction Prediction with Interpretable and Conformalized Cross-Attention Transformers. 具有可解释和共形交叉注意转换器的可信赖化合物-蛋白质相互作用预测。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-06 DOI: 10.1021/acs.jcim.5c02709
Peiyao Li,Lan Hua,Ye Liu,Jun Zhu
Deep learning has accelerated drug discovery by enabling large-scale virtual screening, but current models often act as "black boxes" and provide no formal guarantees about prediction reliability. This limitation is particularly critical for compound-protein interaction (CPI) prediction, where data sets are highly imbalanced and erroneous predictions can lead to costly failures. Here we introduce ConfBiXtCPI, an integrated framework that unifies accurate prediction, interpretability, and statistically rigorous uncertainty quantification. At its core is a bidirectional cross-attention transformer that captures molecular recognition patterns from sequence-level inputs, achieving state-of-the-art accuracy across multiple benchmarks. To address class imbalance and uncertainty, we incorporate Mondrian conformal prediction, which guarantees valid coverage for both majority and minority classes. Building on this, a conformal selection procedure enables principled control of the false discovery rate, allowing users to specify risk thresholds while maintaining discovery power. Beyond accuracy, ConfBiXtCPI provides mechanistic interpretability through attention maps that localize to biophysically relevant binding sites, and its uncertainty estimates support efficient active learning strategies. Together, these advances establish ConfBiXtCPI as a trustworthy and practical tool for guiding experimental validation and accelerating therapeutic discovery.
深度学习通过实现大规模虚拟筛选,加速了药物的发现,但目前的模型往往充当“黑盒子”,对预测的可靠性没有提供正式的保证。这种限制对于化合物-蛋白质相互作用(CPI)预测尤其重要,因为数据集高度不平衡,错误的预测可能导致代价高昂的失败。在这里,我们介绍了ConfBiXtCPI,这是一个集成的框架,它统一了准确的预测、可解释性和统计上严格的不确定性量化。其核心是一个双向交叉注意转换器,从序列级输入捕获分子识别模式,在多个基准测试中实现最先进的准确性。为了解决阶级不平衡和不确定性,我们结合了蒙德里安的适形预测,它保证了对多数和少数阶级的有效覆盖。在此基础上,适形选择程序可以对错误发现率进行原则性控制,允许用户在保持发现能力的同时指定风险阈值。除了准确性之外,ConfBiXtCPI还通过定位于生物物理相关结合位点的注意图提供了机制上的可解释性,其不确定性估计支持有效的主动学习策略。总之,这些进展使ConfBiXtCPI成为指导实验验证和加速治疗发现的可靠实用工具。
{"title":"Trustworthy Compound-Protein Interaction Prediction with Interpretable and Conformalized Cross-Attention Transformers.","authors":"Peiyao Li,Lan Hua,Ye Liu,Jun Zhu","doi":"10.1021/acs.jcim.5c02709","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02709","url":null,"abstract":"Deep learning has accelerated drug discovery by enabling large-scale virtual screening, but current models often act as \"black boxes\" and provide no formal guarantees about prediction reliability. This limitation is particularly critical for compound-protein interaction (CPI) prediction, where data sets are highly imbalanced and erroneous predictions can lead to costly failures. Here we introduce ConfBiXtCPI, an integrated framework that unifies accurate prediction, interpretability, and statistically rigorous uncertainty quantification. At its core is a bidirectional cross-attention transformer that captures molecular recognition patterns from sequence-level inputs, achieving state-of-the-art accuracy across multiple benchmarks. To address class imbalance and uncertainty, we incorporate Mondrian conformal prediction, which guarantees valid coverage for both majority and minority classes. Building on this, a conformal selection procedure enables principled control of the false discovery rate, allowing users to specify risk thresholds while maintaining discovery power. Beyond accuracy, ConfBiXtCPI provides mechanistic interpretability through attention maps that localize to biophysically relevant binding sites, and its uncertainty estimates support efficient active learning strategies. Together, these advances establish ConfBiXtCPI as a trustworthy and practical tool for guiding experimental validation and accelerating therapeutic discovery.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"110 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147368351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating Multiview Information for Enhanced Deep Learning-Based Acute Dermal Toxicity Prediction. 基于深度学习的急性皮肤毒性预测集成多视图信息。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-06 DOI: 10.1021/acs.jcim.5c02959
Wei Lin,Chi Chung Alan Fung
Accurate prediction of acute dermal toxicity is vital for the safe and effective development of contact drugs. While numerous deep learning models have been created to replace costly and ethically challenging animal toxicity tests, most approaches overlook the multiview information on molecules. To overcome this limitation, we introduce a novel model named MVIToxNet, which integrates multiview features from both molecular fingerprints and SMILES sequences. To capture the multiview information on SMILES, MVIToxNet incorporates character-level and atom-level features. In addition, byte-pair encoding tokenization is utilized to capture substructural details within molecules, allowing the model to differentiate similar SMILES by assigning distinct tokens to different substructures. Since the data sets in this study are small and imbalanced, we argue that selecting a single model based solely on the best validation performance may not reliably reflect the best generalization for test sets. Therefore, we propose a weighted model averaging approach that combines multiple trained models according to their top-K validation scores into one model, yielding an improved model for inference. Extensive experimental results demonstrate that MVIToxNet significantly outperforms existing baselines in acute dermal toxicity prediction, validating the effectiveness of utilizing multiview features and the weighted model averaging strategy. Furthermore, our proposed methods demonstrate the potential for data-driven model design.
准确预测急性皮肤毒性对于安全有效地开发接触性药物至关重要。虽然已经创建了许多深度学习模型来取代昂贵且具有道德挑战性的动物毒性测试,但大多数方法都忽略了分子的多视图信息。为了克服这一限制,我们引入了一种名为MVIToxNet的新模型,该模型集成了来自分子指纹和SMILES序列的多视图特征。为了在SMILES上捕获多视图信息,MVIToxNet结合了字符级和原子级功能。此外,利用字节对编码标记化来捕获分子内的子结构细节,允许模型通过将不同的标记分配给不同的子结构来区分相似的SMILES。由于本研究中的数据集较小且不平衡,我们认为仅基于最佳验证性能选择单个模型可能无法可靠地反映测试集的最佳泛化。因此,我们提出了一种加权模型平均方法,该方法将多个训练模型根据其top-K验证分数组合成一个模型,从而产生一个改进的推理模型。大量的实验结果表明,MVIToxNet在急性皮肤毒性预测方面明显优于现有的基线,验证了利用多视图特征和加权模型平均策略的有效性。此外,我们提出的方法展示了数据驱动模型设计的潜力。
{"title":"Integrating Multiview Information for Enhanced Deep Learning-Based Acute Dermal Toxicity Prediction.","authors":"Wei Lin,Chi Chung Alan Fung","doi":"10.1021/acs.jcim.5c02959","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02959","url":null,"abstract":"Accurate prediction of acute dermal toxicity is vital for the safe and effective development of contact drugs. While numerous deep learning models have been created to replace costly and ethically challenging animal toxicity tests, most approaches overlook the multiview information on molecules. To overcome this limitation, we introduce a novel model named MVIToxNet, which integrates multiview features from both molecular fingerprints and SMILES sequences. To capture the multiview information on SMILES, MVIToxNet incorporates character-level and atom-level features. In addition, byte-pair encoding tokenization is utilized to capture substructural details within molecules, allowing the model to differentiate similar SMILES by assigning distinct tokens to different substructures. Since the data sets in this study are small and imbalanced, we argue that selecting a single model based solely on the best validation performance may not reliably reflect the best generalization for test sets. Therefore, we propose a weighted model averaging approach that combines multiple trained models according to their top-K validation scores into one model, yielding an improved model for inference. Extensive experimental results demonstrate that MVIToxNet significantly outperforms existing baselines in acute dermal toxicity prediction, validating the effectiveness of utilizing multiview features and the weighted model averaging strategy. Furthermore, our proposed methods demonstrate the potential for data-driven model design.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"3 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147359308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-View Collaboration Feature Fusion for Protein Function Prediction. 基于多视角协同特征融合的蛋白质功能预测。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-06 DOI: 10.1021/acs.jcim.5c03057
Hailong Yang,Zhongyu Wang,Haijun Shi,Qiao Ning,Zhaohong Deng,Shudong Hu,Yanqi Zhong
With the rapid growth of high-throughput sequencing data, many proteins remain uncharacterized, while experimental validation is costly and time-consuming. Automatic Function Prediction (AFP) is thus urgently needed. Protein functions are complex and multilevel, with inherent interactions among features such as sequence, structure, and evolution. Existing methods relying on single-level representations or simple feature aggregation struggle to capture the hierarchical dependencies and semantic collaborative relationships in the Gene Ontology (GO) label system, limiting prediction accuracy and generalization. To overcome these challenges, we propose a Multi-View Collaboration Feature Fusion (MVCFF) framework, which leverages complementary features from multiple sequence perspectives to enhance protein function prediction. In MVCFF, a sequential feature extraction subnetwork is designed to capture view-specific information, incorporating both local patterns and long-range dependencies within amino acid sequences. Building on this, a multi-view collaboration paradigm is employed, enabling interactive learning of key positional information through integrated multi-view features and facilitating synergistic information fusion. The resulting multi-view representations are then fed into downstream label predictors to perform classification tasks. To further boost predictive accuracy, we introduce an extended version, MVCFF+, which combines the original MVCFF framework with sequence-similarity-based prediction methods via a weighted fusion strategy. Extensive experiments demonstrate that our approach substantially improves prediction performance, outperforming existing methods by a clear margin. The source code is publicly available at https://github.com/AGI-FBHC/MVCFF.
随着高通量测序数据的快速增长,许多蛋白质仍未被表征,而实验验证既昂贵又耗时。因此,迫切需要自动功能预测(AFP)。蛋白质的功能是复杂的、多层次的,在序列、结构和进化等特征之间具有内在的相互作用。现有的方法依赖于单级表示或简单的特征聚合,难以捕获基因本体(GO)标签系统中的层次依赖和语义协作关系,限制了预测的准确性和泛化。为了克服这些挑战,我们提出了一个多视图协作特征融合(MVCFF)框架,该框架利用多个序列角度的互补特征来增强蛋白质功能预测。在MVCFF中,序列特征提取子网络被设计用于捕获特定视图的信息,结合氨基酸序列中的本地模式和远程依赖关系。在此基础上,采用多视图协作模式,通过集成多视图特征,实现关键位置信息的交互式学习,促进协同信息融合。然后将得到的多视图表示馈送到下游标签预测器中以执行分类任务。为了进一步提高预测精度,我们引入了一个扩展版本MVCFF+,它通过加权融合策略将原始MVCFF框架与基于序列相似性的预测方法相结合。大量的实验表明,我们的方法大大提高了预测性能,明显优于现有的方法。源代码可在https://github.com/AGI-FBHC/MVCFF上公开获得。
{"title":"Multi-View Collaboration Feature Fusion for Protein Function Prediction.","authors":"Hailong Yang,Zhongyu Wang,Haijun Shi,Qiao Ning,Zhaohong Deng,Shudong Hu,Yanqi Zhong","doi":"10.1021/acs.jcim.5c03057","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c03057","url":null,"abstract":"With the rapid growth of high-throughput sequencing data, many proteins remain uncharacterized, while experimental validation is costly and time-consuming. Automatic Function Prediction (AFP) is thus urgently needed. Protein functions are complex and multilevel, with inherent interactions among features such as sequence, structure, and evolution. Existing methods relying on single-level representations or simple feature aggregation struggle to capture the hierarchical dependencies and semantic collaborative relationships in the Gene Ontology (GO) label system, limiting prediction accuracy and generalization. To overcome these challenges, we propose a Multi-View Collaboration Feature Fusion (MVCFF) framework, which leverages complementary features from multiple sequence perspectives to enhance protein function prediction. In MVCFF, a sequential feature extraction subnetwork is designed to capture view-specific information, incorporating both local patterns and long-range dependencies within amino acid sequences. Building on this, a multi-view collaboration paradigm is employed, enabling interactive learning of key positional information through integrated multi-view features and facilitating synergistic information fusion. The resulting multi-view representations are then fed into downstream label predictors to perform classification tasks. To further boost predictive accuracy, we introduce an extended version, MVCFF+, which combines the original MVCFF framework with sequence-similarity-based prediction methods via a weighted fusion strategy. Extensive experiments demonstrate that our approach substantially improves prediction performance, outperforming existing methods by a clear margin. The source code is publicly available at https://github.com/AGI-FBHC/MVCFF.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"67 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147359156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hydroxylase Thermostability Prediction Based on Self-Trained Semisupervised Iteration and Bayesian Dynamic Tuning. 基于自训练半监督迭代和贝叶斯动态整定的羟化酶热稳定性预测。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-05 DOI: 10.1021/acs.jcim.6c00102
Sujuan Liu,Mengyu Yu,Lei Zhang,Dongyan Wen,Xiaotong Yu,Jianmei Luo,Chuanlei Zhang
Current enzyme thermostability prediction models are predominantly designed for cross-family generalization, with limited focus on hydroxylases, which restricts their accuracy and applicability in hydroxylase-specific thermostability design. In this study, we develop HyS-BST, a dedicated self-trained semisupervised framework for hydroxylase thermostability prediction. Leveraging a limited hydroxylase data set, HyS-BST integrates a self-training strategy with Bayesian dynamic tuning to achieve high-precision prediction of mutant thermostability in terms of ΔΔG. Experimental results demonstrate that after only ten training iterations, HyS-BST attains a coefficient of determination (R2) of 0.96, a Pearson correlation coefficient (PCC) of 0.98, and a root mean squared error (RMSE) as low as 0.06 on the test set. Compared with the optimal cross-family generalization model, HyS-BST improves PCC and RMSE by approximately 70%. Overall, this framework provides a specialized, efficient, and cost-effective solution for hydroxylase thermostability prediction, substantially reducing the candidate search space and experimental resources required for downstream validation.
目前的酶热稳定性预测模型主要是为了跨家族推广而设计的,对羟化酶的关注有限,这限制了它们在羟化酶特异性热稳定性设计中的准确性和适用性。在这项研究中,我们开发了HyS-BST,一个专门用于羟化酶热稳定性预测的自我训练半监督框架。利用有限的羟化酶数据集,HyS-BST将自我训练策略与贝叶斯动态调整相结合,以实现ΔΔG突变体热稳定性的高精度预测。实验结果表明,仅经过10次训练迭代,HyS-BST在测试集上的决定系数(R2)为0.96,Pearson相关系数(PCC)为0.98,均方根误差(RMSE)低至0.06。与最优交叉族泛化模型相比,HyS-BST的PCC和RMSE提高了约70%。总体而言,该框架为羟化酶热稳定性预测提供了一个专业、高效、经济的解决方案,大大减少了下游验证所需的候选搜索空间和实验资源。
{"title":"Hydroxylase Thermostability Prediction Based on Self-Trained Semisupervised Iteration and Bayesian Dynamic Tuning.","authors":"Sujuan Liu,Mengyu Yu,Lei Zhang,Dongyan Wen,Xiaotong Yu,Jianmei Luo,Chuanlei Zhang","doi":"10.1021/acs.jcim.6c00102","DOIUrl":"https://doi.org/10.1021/acs.jcim.6c00102","url":null,"abstract":"Current enzyme thermostability prediction models are predominantly designed for cross-family generalization, with limited focus on hydroxylases, which restricts their accuracy and applicability in hydroxylase-specific thermostability design. In this study, we develop HyS-BST, a dedicated self-trained semisupervised framework for hydroxylase thermostability prediction. Leveraging a limited hydroxylase data set, HyS-BST integrates a self-training strategy with Bayesian dynamic tuning to achieve high-precision prediction of mutant thermostability in terms of ΔΔG. Experimental results demonstrate that after only ten training iterations, HyS-BST attains a coefficient of determination (R2) of 0.96, a Pearson correlation coefficient (PCC) of 0.98, and a root mean squared error (RMSE) as low as 0.06 on the test set. Compared with the optimal cross-family generalization model, HyS-BST improves PCC and RMSE by approximately 70%. Overall, this framework provides a specialized, efficient, and cost-effective solution for hydroxylase thermostability prediction, substantially reducing the candidate search space and experimental resources required for downstream validation.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"15 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147351174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Topology-Aware Generation and Activity-Based Filtering: A Computational-Experimental Framework for Data-Scarce Quaternary Ammonium Compound Discovery. 拓扑感知生成和基于活动的过滤:数据稀缺季铵化合物发现的计算-实验框架。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-05 DOI: 10.1021/acs.jcim.6c00390
Shiva Ghaemi,Amanda Consylman,Bo Pan,Alice Wu,Ashley Petersen,Gabe Chang,Diana McDonough,Mark Forman,Elise L Bezold,William M Wuest,Amarda Shehu,Liang Zhao,Kevin P C Minbiole
Quaternary ammonium compounds (QACs) are widely used antimicrobial disinfectants whose efficacy is threatened by increased bacterial resistance. Artificial intelligence-guided development of novel QACs is constrained by historically sparse structure-activity data and methods to generate novel chemical entities with bioactivity. This paper presents a comparative experimental study of two computational workflows designed to accelerate QAC discovery under data-limited conditions. Both workflows employ a topology-aware variational autoencoder to generate novel candidates. In Workflow 1, generated QAC structures were directly subjected to expert evaluation within a fixed time constraint through the systematic application of chemistry-domain decision criteria. In Workflow 2, generated candidates were first computationally filtered using predictive models trained to anticipate antimicrobial activity, advancing only molecules projected to be highly active against at least one bacterial strain for expert evaluation. This predictive filtering enabled the assessment of a larger, higher-quality candidate pool within the same time constraint. Comparative assessment of the compound sets resulting from the two workflows revealed substantial improvements in candidate quality: compounds deemed synthesis-worthy increased from 9% to 38%, while invalid outputs decreased from 21% to 0%. Experimental characterization of 29 selected compounds across both workflows yielded 11 novel QACs with experimentally validated minimum inhibitory concentrations of 1-32 μM against four bacterial pathogens. These results demonstrate that topology-aware generation coupled with computational prefiltering enables systematic navigation of data-scarce chemical spaces while respecting practical constraints on expert evaluation time.
季铵类化合物是目前广泛使用的抗菌消毒剂,但其抗菌效果受到细菌耐药性的威胁。人工智能引导的新型QACs的开发受到历史上稀疏的结构-活性数据和产生具有生物活性的新型化学实体的方法的限制。本文对两种计算工作流程进行了对比实验研究,旨在加速数据有限条件下QAC的发现。这两个工作流都使用拓扑感知的变分自动编码器来生成新的候选对象。在工作流1中,通过系统地应用化学域决策准则,生成的QAC结构在固定的时间约束内直接接受专家评估。在工作流2中,生成的候选药物首先使用经过训练的预测模型进行计算过滤,以预测抗菌活性,仅推进预计对至少一种细菌菌株具有高度活性的分子,以供专家评估。这种预测过滤能够在相同的时间限制内评估更大、更高质量的候选池。对两种工作流程产生的化合物集的比较评估显示,候选化合物的质量有了实质性的提高:被认为有合成价值的化合物从9%增加到38%,而无效的产出从21%下降到0%。对29种选定的化合物进行了实验表征,得到了11种新的QACs,实验验证了它们对4种细菌病原体的最低抑制浓度为1-32 μM。这些结果表明,拓扑感知生成与计算预滤波相结合,可以在尊重专家评估时间的实际约束的情况下,对数据稀缺的化学空间进行系统导航。
{"title":"Topology-Aware Generation and Activity-Based Filtering: A Computational-Experimental Framework for Data-Scarce Quaternary Ammonium Compound Discovery.","authors":"Shiva Ghaemi,Amanda Consylman,Bo Pan,Alice Wu,Ashley Petersen,Gabe Chang,Diana McDonough,Mark Forman,Elise L Bezold,William M Wuest,Amarda Shehu,Liang Zhao,Kevin P C Minbiole","doi":"10.1021/acs.jcim.6c00390","DOIUrl":"https://doi.org/10.1021/acs.jcim.6c00390","url":null,"abstract":"Quaternary ammonium compounds (QACs) are widely used antimicrobial disinfectants whose efficacy is threatened by increased bacterial resistance. Artificial intelligence-guided development of novel QACs is constrained by historically sparse structure-activity data and methods to generate novel chemical entities with bioactivity. This paper presents a comparative experimental study of two computational workflows designed to accelerate QAC discovery under data-limited conditions. Both workflows employ a topology-aware variational autoencoder to generate novel candidates. In Workflow 1, generated QAC structures were directly subjected to expert evaluation within a fixed time constraint through the systematic application of chemistry-domain decision criteria. In Workflow 2, generated candidates were first computationally filtered using predictive models trained to anticipate antimicrobial activity, advancing only molecules projected to be highly active against at least one bacterial strain for expert evaluation. This predictive filtering enabled the assessment of a larger, higher-quality candidate pool within the same time constraint. Comparative assessment of the compound sets resulting from the two workflows revealed substantial improvements in candidate quality: compounds deemed synthesis-worthy increased from 9% to 38%, while invalid outputs decreased from 21% to 0%. Experimental characterization of 29 selected compounds across both workflows yielded 11 novel QACs with experimentally validated minimum inhibitory concentrations of 1-32 μM against four bacterial pathogens. These results demonstrate that topology-aware generation coupled with computational prefiltering enables systematic navigation of data-scarce chemical spaces while respecting practical constraints on expert evaluation time.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"24 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147359157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular Dynamics Simulation of Passive Diffusion across a Human Breast Cancer Cell Membrane Model. Comparison between Cisplatin and Its Pt(IV) Derivatives. 通过人乳腺癌细胞膜模型被动扩散的分子动力学模拟。顺铂及其铂(IV)衍生物的比较。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-05 DOI: 10.1021/acs.jcim.5c02819
Daniele Belletto,Stefano Scoditti,Stefano Borocci,Nico Sanna,Costantino Zazza,Emilia Sicilia
The efficacy of platinum(II) drugs, despite their wide use in clinical practice, is seriously limited by their well-known drawbacks. Octahedral Pt(IV) congeners are considered a sort of Holy Grail in cancer research as, being significantly more inert, they should be able to overcome the limitations of current platinum-based drugs, such as resistance and side effects, acting as prodrugs. Additionally, their anticancer activity can be tuned through a proper choice of the axial ligands released inside cancer cells when these compounds are reduced, making them even capable of potentially working as multiaction agents. However, despite their very satisfactory anticancer effects, no Pt(IV) complex has been approved for clinical use. As cell membrane permeation is the critical step, very poorly understood, of the whole mechanism of action of any drug, the investigation of the eventual differences in behavior between four-coordinate Pt(II) and six-coordinate Pt(IV) complexes when they diffuse in a lipid bilayer might be of significant relevance. The outcomes of a biased molecular dynamics (MD) investigation of the permeation of cisplatin and three simple cisplatin Pt(IV) derivatives through a membrane model prototype of human breast cancer cells are illustrated here. This comparative analysis of Pt(II) and Pt(IV) complex passive diffusion has been carried out with the aim of gaining indications about the factors that play a role in favoring or hindering membrane penetration and, ultimately, in determining the efficacy of their anticancer action.
铂(II)类药物尽管在临床实践中被广泛使用,但其疗效受到其众所周知的缺陷的严重限制。八面体铂(IV)同系物被认为是癌症研究中的圣杯,因为它们具有明显的惰性,应该能够克服目前铂基药物的局限性,如耐药和副作用,作为前药。此外,当这些化合物被还原时,它们的抗癌活性可以通过适当选择在癌细胞内释放的轴向配体来调节,使它们甚至能够潜在地作为多作用剂工作。然而,尽管铂(IV)复合物具有非常令人满意的抗癌效果,但尚未批准用于临床。由于细胞膜渗透是任何药物整个作用机制的关键步骤,但人们对其了解甚少,因此研究四坐标Pt(II)和六坐标Pt(IV)复合物在脂质双分子层中扩散时最终的行为差异可能具有重要意义。顺铂和三种简单的顺铂铂(IV)衍生物通过人乳腺癌细胞膜模型原型的渗透性的偏分子动力学(MD)研究的结果如下所示。铂(II)和铂(IV)复合物被动扩散的比较分析,目的是获得有利于或阻碍膜渗透的因素的指示,并最终确定其抗癌作用的功效。
{"title":"Molecular Dynamics Simulation of Passive Diffusion across a Human Breast Cancer Cell Membrane Model. Comparison between Cisplatin and Its Pt(IV) Derivatives.","authors":"Daniele Belletto,Stefano Scoditti,Stefano Borocci,Nico Sanna,Costantino Zazza,Emilia Sicilia","doi":"10.1021/acs.jcim.5c02819","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02819","url":null,"abstract":"The efficacy of platinum(II) drugs, despite their wide use in clinical practice, is seriously limited by their well-known drawbacks. Octahedral Pt(IV) congeners are considered a sort of Holy Grail in cancer research as, being significantly more inert, they should be able to overcome the limitations of current platinum-based drugs, such as resistance and side effects, acting as prodrugs. Additionally, their anticancer activity can be tuned through a proper choice of the axial ligands released inside cancer cells when these compounds are reduced, making them even capable of potentially working as multiaction agents. However, despite their very satisfactory anticancer effects, no Pt(IV) complex has been approved for clinical use. As cell membrane permeation is the critical step, very poorly understood, of the whole mechanism of action of any drug, the investigation of the eventual differences in behavior between four-coordinate Pt(II) and six-coordinate Pt(IV) complexes when they diffuse in a lipid bilayer might be of significant relevance. The outcomes of a biased molecular dynamics (MD) investigation of the permeation of cisplatin and three simple cisplatin Pt(IV) derivatives through a membrane model prototype of human breast cancer cells are illustrated here. This comparative analysis of Pt(II) and Pt(IV) complex passive diffusion has been carried out with the aim of gaining indications about the factors that play a role in favoring or hindering membrane penetration and, ultimately, in determining the efficacy of their anticancer action.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"3 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147359155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combined All-Atom Simulations and Biophysical Assays Uncover Loop-Driven Stabilization in the HRAS i-motif 结合全原子模拟和生物物理分析揭示环驱动的稳定在HRAS i-motif
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-03-04 DOI: 10.1021/acs.jcim.5c03176
Alhadji Malloum,Valentina Arciuolo,Pavlína Pokorná,Luca Grisanti,Bruno Pagano,Jussara Amato,Alessandra Magistrato
I-motifs are noncanonical DNA secondary structures stabilized by hemiprotonated C+:C base pairs. Their intrinsic flexibility, conformational heterogeneity, and sensitivity to environmental conditions often hinder structural characterization. Here, all-atom simulations, combined with biophysical experiments, were used to characterize the structure of the i-motif monomer formed by the HRAS gene promoter (iHRAS), a member of the RAS proto-oncogene family. Our results reveal that iHRAS exhibits intricate conformational behavior characterized by multiple interconverting states. The core i-motif is stabilized by a protective G:G cap, a recurrent i-motif-stabilizing factor, on one side, while the C+:C base pairs content on the other side is variable. Structural heterogeneity is most pronounced in loops, which sample several base-exposed states aided by K+ ion binding. These findings contribute to a deeper understanding of the i-motif structure and dynamics.
i基序是由半质子化的C+:C碱基对稳定的非典型DNA二级结构。它们固有的灵活性,构象的异质性和对环境条件的敏感性往往阻碍结构表征。本文采用全原子模拟与生物物理实验相结合的方法,对RAS原癌基因家族成员——HRAS基因启动子(iHRAS)形成的i-motif单体结构进行了表征。我们的研究结果表明,iHRAS具有复杂的构象行为,其特征是多个相互转换状态。核心i-motif的一端是一个保护性的G:G帽,这是一个循环的i-motif稳定因子,而另一端的C+:C碱基对含量是可变的。结构非均质性在环中最为明显,它在K+离子结合的帮助下采样了几个碱基暴露态。这些发现有助于加深对i-motif结构和动态的理解。
{"title":"Combined All-Atom Simulations and Biophysical Assays Uncover Loop-Driven Stabilization in the HRAS i-motif","authors":"Alhadji Malloum,Valentina Arciuolo,Pavlína Pokorná,Luca Grisanti,Bruno Pagano,Jussara Amato,Alessandra Magistrato","doi":"10.1021/acs.jcim.5c03176","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c03176","url":null,"abstract":"I-motifs are noncanonical DNA secondary structures stabilized by hemiprotonated C+:C base pairs. Their intrinsic flexibility, conformational heterogeneity, and sensitivity to environmental conditions often hinder structural characterization. Here, all-atom simulations, combined with biophysical experiments, were used to characterize the structure of the i-motif monomer formed by the HRAS gene promoter (iHRAS), a member of the RAS proto-oncogene family. Our results reveal that iHRAS exhibits intricate conformational behavior characterized by multiple interconverting states. The core i-motif is stabilized by a protective G:G cap, a recurrent i-motif-stabilizing factor, on one side, while the C+:C base pairs content on the other side is variable. Structural heterogeneity is most pronounced in loops, which sample several base-exposed states aided by K+ ion binding. These findings contribute to a deeper understanding of the i-motif structure and dynamics.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"402 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147346717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemical Information and Modeling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1