首页 > 最新文献

Journal of Chemical Information and Modeling 最新文献

英文 中文
Adaptive Transition-State Refinement with Learned Equilibrium Flows 基于学习平衡流的自适应过渡状态优化
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-02 DOI: 10.1021/acs.jcim.5c02902
Samir Darouich,Vinh Tong,Tanja Bien,Johannes Kästner,Mathias Niepert
Identifying transition states (TSs), the high-energy configurations that molecules pass through during chemical reactions, is essential for understanding and designing chemical processes. However, accurately and efficiently identifying these states remains one of the most challenging problems in computational chemistry. In this work, we introduce a new generative AI approach that improves the quality of initial guesses for TS structures. Our method can be combined with a variety of existing techniques, including both machine-learning models and fast, approximate quantum methods, to refine their predictions and bring them closer to chemically accurate results. Applied to TS guesses from a state-of-the-art machine-learning model, our approach reduces the median structural error to 0.077 Å and lowers the median absolute error in reaction barrier heights to 0.40 kcal mol–1. When starting from a widely used tight-binding approximation, it increases the success rate of locating valid TSs by 41% and speeds up high-level quantum optimization by a factor of 3. By making TS searches more accurate, robust, and efficient, this method could accelerate reaction mechanism discovery and support the development of new materials, catalysts, and pharmaceuticals.
识别过渡态(TSs),即分子在化学反应中经过的高能构型,对于理解和设计化学过程至关重要。然而,准确有效地识别这些状态仍然是计算化学中最具挑战性的问题之一。在这项工作中,我们引入了一种新的生成人工智能方法,提高了TS结构的初始猜测质量。我们的方法可以与各种现有技术相结合,包括机器学习模型和快速近似量子方法,以改进其预测并使其更接近化学精确的结果。应用于最先进的机器学习模型的TS猜测,我们的方法将中位数结构误差降低到0.077 Å,并将反应势垒高度的中位数绝对误差降低到0.40 kcal mol-1。当从广泛使用的紧结合近似开始时,它将定位有效TSs的成功率提高了41%,并将高级量子优化速度提高了3倍。通过使TS搜索更加准确、稳健和高效,该方法可以加速反应机理的发现,并支持新材料、催化剂和药物的开发。
{"title":"Adaptive Transition-State Refinement with Learned Equilibrium Flows","authors":"Samir Darouich,Vinh Tong,Tanja Bien,Johannes Kästner,Mathias Niepert","doi":"10.1021/acs.jcim.5c02902","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02902","url":null,"abstract":"Identifying transition states (TSs), the high-energy configurations that molecules pass through during chemical reactions, is essential for understanding and designing chemical processes. However, accurately and efficiently identifying these states remains one of the most challenging problems in computational chemistry. In this work, we introduce a new generative AI approach that improves the quality of initial guesses for TS structures. Our method can be combined with a variety of existing techniques, including both machine-learning models and fast, approximate quantum methods, to refine their predictions and bring them closer to chemically accurate results. Applied to TS guesses from a state-of-the-art machine-learning model, our approach reduces the median structural error to 0.077 Å and lowers the median absolute error in reaction barrier heights to 0.40 kcal mol–1. When starting from a widely used tight-binding approximation, it increases the success rate of locating valid TSs by 41% and speeds up high-level quantum optimization by a factor of 3. By making TS searches more accurate, robust, and efficient, this method could accelerate reaction mechanism discovery and support the development of new materials, catalysts, and pharmaceuticals.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"3 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward More Trustworthy QSAR: A Systematic Discussion on Data Set Partitioning 迈向更可信的QSAR:关于数据集划分的系统讨论
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-02 DOI: 10.1021/acs.jcim.5c02465
Shangyu Li,Peizhe Sun
With the surge in QSAR model development, concerns about evaluation rigor, particularly regarding the influence of data splitting, have grown. Using five data sets of various sizes, we systematically assessed the effects of random splits (RS), similarity-based splits (SS), and random-seed variability on model generalizability under two scenarios: limited data for chemical screening and standard modeling with ample data. Both the choice of data set partitioning method and the selection of random seeds can substantially affect internal test performance, which may not reliably reflect true predictive capability. Although SS can improve internal test performance in many settings, these gains do not necessarily translate into stronger external generalizability. Moreover, under low sampling ratios, SS may perform worse than RS on both internal and external tests. This challenges the implicit assumption that rational splits optimized for internal performance universally improve model performance. Notably, variability across random seeds was high on internal tests in the smallest data set (R2: 0.453–0.783), whereas on the fixed external data set R2 varied less (0.633–0.672), regardless of applicability domain (AD) filtering. This undermined cross-study comparability and underscored the risk of overly optimistic conclusions. Our findings highlighted that test-set construction must be aligned with real-world application scenarios. Researchers should avoid relying on single or cherry-picked random seeds or unsuitable rational partitioning. Transparent, application-aligned partitioning protocols and AD methods should be employed to emphasize true external generalizability over potentially inflated internal metrics.
随着QSAR模型开发的激增,对评估严谨性的关注,特别是对数据分割的影响的关注也在增加。使用5个不同规模的数据集,我们系统地评估了随机分裂(RS)、基于相似性的分裂(SS)和随机种子变异性在两种情况下对模型可泛化性的影响:化学筛选数据有限和数据充足的标准建模。数据集划分方法的选择和随机种子的选择都会极大地影响内部测试的性能,这可能不能可靠地反映真实的预测能力。虽然SS可以在许多情况下提高内部测试性能,但这些增益并不一定转化为更强的外部泛化性。此外,在低采样比下,SS在内部和外部测试中的表现都可能比RS差。这挑战了为内部性能优化的合理分割普遍提高模型性能的隐含假设。值得注意的是,在最小数据集的内部测试中,随机种子的变异性很高(R2: 0.453-0.783),而在固定的外部数据集上,R2变化较小(0.633-0.672),无论适用域(AD)过滤如何。这破坏了交叉研究的可比性,并强调了过度乐观结论的风险。我们的发现强调了测试集的构建必须与实际应用场景保持一致。研究人员应避免依赖单一或随机挑选的种子或不适当的合理分配。应该使用透明的、与应用程序一致的分区协议和AD方法来强调真正的外部通用性,而不是潜在的夸大的内部指标。
{"title":"Toward More Trustworthy QSAR: A Systematic Discussion on Data Set Partitioning","authors":"Shangyu Li,Peizhe Sun","doi":"10.1021/acs.jcim.5c02465","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02465","url":null,"abstract":"With the surge in QSAR model development, concerns about evaluation rigor, particularly regarding the influence of data splitting, have grown. Using five data sets of various sizes, we systematically assessed the effects of random splits (RS), similarity-based splits (SS), and random-seed variability on model generalizability under two scenarios: limited data for chemical screening and standard modeling with ample data. Both the choice of data set partitioning method and the selection of random seeds can substantially affect internal test performance, which may not reliably reflect true predictive capability. Although SS can improve internal test performance in many settings, these gains do not necessarily translate into stronger external generalizability. Moreover, under low sampling ratios, SS may perform worse than RS on both internal and external tests. This challenges the implicit assumption that rational splits optimized for internal performance universally improve model performance. Notably, variability across random seeds was high on internal tests in the smallest data set (R2: 0.453–0.783), whereas on the fixed external data set R2 varied less (0.633–0.672), regardless of applicability domain (AD) filtering. This undermined cross-study comparability and underscored the risk of overly optimistic conclusions. Our findings highlighted that test-set construction must be aligned with real-world application scenarios. Researchers should avoid relying on single or cherry-picked random seeds or unsuitable rational partitioning. Transparent, application-aligned partitioning protocols and AD methods should be employed to emphasize true external generalizability over potentially inflated internal metrics.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"37 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ME-PFP: An Ensemble Learning Approach Fusing Multi-Source Features for Protein Function Prediction ME-PFP:一种融合多源特征的蛋白质功能预测集成学习方法
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-01 DOI: 10.1021/acs.jcim.5c02513
Haoxing Luo,Yue Hu,Chaolin Song,Xinhui Li,Yuyin Ma,Yurong Qian,Lei Deng
Proteins, as essential components of living organisms, play a critical role in both drug discovery and disease mechanism research. Multiple empirical studies have shown that there is a significant correlation between protein function and drug targets with therapeutic potential. Therefore, how to accurately and efficiently predict protein function is an urgent issue that needs to be addressed. Existing research faces challenges such as insufficient utilization of protein data and low heterogeneous fusion performance. In this paper, we propose ME-PFP, a novel ensemble learning framework that integrates sequence representations from a protein language model, domain, and protein–protein interaction data to improve protein function prediction. To effectively capture and utilize heterogeneous features, we design three specialized attention-based feature extractors tailored to each data modality. These features are then fused through a dynamic weighting strategy to enable complementary information exchange between different modalities, thereby improving protein function prediction performance. Extensive experiments on benchmark data sets show that ME-PFP significantly outperforms sequence-based and multisource fusion models. Notably, it achieved an average improvement of 13.23% on the human data set and 11.11% on the yeast data set. The experimental results show that this study not only improves the accuracy of protein function prediction, but also promotes progress in the field of computational biology.
蛋白质作为生物体的重要组成部分,在药物发现和疾病机制研究中都起着至关重要的作用。多项实证研究表明,蛋白质功能与具有治疗潜力的药物靶点之间存在显著相关性。因此,如何准确、高效地预测蛋白质功能是一个迫切需要解决的问题。现有的研究面临着蛋白质数据利用不足、异质融合性能低等挑战。在本文中,我们提出了ME-PFP,这是一种新的集成学习框架,它集成了来自蛋白质语言模型、结构域和蛋白质-蛋白质相互作用数据的序列表示,以提高蛋白质功能预测。为了有效地捕获和利用异构特征,我们针对每种数据模式设计了三个专门的基于注意力的特征提取器。然后通过动态加权策略融合这些特征,以实现不同模式之间的互补信息交换,从而提高蛋白质功能预测性能。在基准数据集上的大量实验表明,ME-PFP显著优于基于序列和多源融合模型。值得注意的是,它在人类数据集上实现了13.23%的平均改进,在酵母数据集上实现了11.11%的平均改进。实验结果表明,本研究不仅提高了蛋白质功能预测的准确性,而且促进了计算生物学领域的进步。
{"title":"ME-PFP: An Ensemble Learning Approach Fusing Multi-Source Features for Protein Function Prediction","authors":"Haoxing Luo,Yue Hu,Chaolin Song,Xinhui Li,Yuyin Ma,Yurong Qian,Lei Deng","doi":"10.1021/acs.jcim.5c02513","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02513","url":null,"abstract":"Proteins, as essential components of living organisms, play a critical role in both drug discovery and disease mechanism research. Multiple empirical studies have shown that there is a significant correlation between protein function and drug targets with therapeutic potential. Therefore, how to accurately and efficiently predict protein function is an urgent issue that needs to be addressed. Existing research faces challenges such as insufficient utilization of protein data and low heterogeneous fusion performance. In this paper, we propose ME-PFP, a novel ensemble learning framework that integrates sequence representations from a protein language model, domain, and protein–protein interaction data to improve protein function prediction. To effectively capture and utilize heterogeneous features, we design three specialized attention-based feature extractors tailored to each data modality. These features are then fused through a dynamic weighting strategy to enable complementary information exchange between different modalities, thereby improving protein function prediction performance. Extensive experiments on benchmark data sets show that ME-PFP significantly outperforms sequence-based and multisource fusion models. Notably, it achieved an average improvement of 13.23% on the human data set and 11.11% on the yeast data set. The experimental results show that this study not only improves the accuracy of protein function prediction, but also promotes progress in the field of computational biology.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"58 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine-Learning Framework for Excitation Energies of Chromophores in Polarizable Environments. 极化环境中发色团激发能的机器学习框架。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-01-31 DOI: 10.1021/acs.jcim.5c02424
Chris John,Edoardo Cignoni,Lorenzo Cupellini,Benedetta Mennucci
Excited states of embedded chromophores are highly influenced by their interaction with the environment. Herein, we present a machine-learning (ML) framework capable of predicting the different environmental contributions to excitation energies of chromophores in a polarizable embedding. Our ML models are built in a hierarchical structure to capture both the effect of ground-state polarization and the response of the polarizable environment to the electronic transition. With the use of the right descriptors, the models trained on the quantum mechanics/molecular mechanics (QM/MM) calculations in a nonpolarizable environment are able to successfully predict the effects of a polarizable environment on excitation energies. The ML models are applied to three chromophores present in light-harvesting complexes (chlorophyll a, chlorophyll b, and lutein) and are used to reproduce the excitonic structure of a multichromophoric system unseen in the training set to a level of accuracy offered by a polarizable QM/MM calculation, while taking a fraction of its time.
嵌入式发色团的激发态受其与环境的相互作用的高度影响。在此,我们提出了一个机器学习(ML)框架,能够预测极化嵌入中不同环境对发色团激发能的贡献。我们的机器学习模型建立在一个层次结构中,以捕捉基态极化的影响和极化环境对电子跃迁的响应。利用正确的描述符,在非极化环境下进行量子力学/分子力学(QM/MM)计算训练的模型能够成功地预测极化环境对激发能的影响。ML模型应用于光捕获复合物(叶绿素a,叶绿素b和叶黄素)中存在的三种发色团,并用于再现训练集中未见的多发色系统的激子结构,达到极化QM/MM计算提供的精度水平,同时占用其时间的一小部分。
{"title":"Machine-Learning Framework for Excitation Energies of Chromophores in Polarizable Environments.","authors":"Chris John,Edoardo Cignoni,Lorenzo Cupellini,Benedetta Mennucci","doi":"10.1021/acs.jcim.5c02424","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02424","url":null,"abstract":"Excited states of embedded chromophores are highly influenced by their interaction with the environment. Herein, we present a machine-learning (ML) framework capable of predicting the different environmental contributions to excitation energies of chromophores in a polarizable embedding. Our ML models are built in a hierarchical structure to capture both the effect of ground-state polarization and the response of the polarizable environment to the electronic transition. With the use of the right descriptors, the models trained on the quantum mechanics/molecular mechanics (QM/MM) calculations in a nonpolarizable environment are able to successfully predict the effects of a polarizable environment on excitation energies. The ML models are applied to three chromophores present in light-harvesting complexes (chlorophyll a, chlorophyll b, and lutein) and are used to reproduce the excitonic structure of a multichromophoric system unseen in the training set to a level of accuracy offered by a polarizable QM/MM calculation, while taking a fraction of its time.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"55 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Traj2Relax: A Trajectory-Supervised Method for Robust Structure Relaxation. Traj2Relax:一种鲁棒结构松弛的轨迹监督方法。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-01-30 DOI: 10.1021/acs.jcim.5c02703
Zhiyuan Liu,Quan Qian
Structure relaxation plays a crucial role in atomic simulation and materials modeling, yet traditional first-principles approaches remain computationally expensive and, therefore, difficult to scale to high-throughput applications. In this work, we propose Traj2Relax, a trajectory-supervised structure relaxation framework based on conditional velocity field modeling. Instead of relying on explicit energy or force evaluations, Traj2Relax learns a time-dependent velocity field from geometric differences between successive configurations along real relaxation trajectories, enabling physically consistent structural convergence across a wide range of perturbation magnitudes. A time-scheduled noise mechanism is introduced during training to improve stability under highly distorted inputs, while deterministic integration during inference produces smooth, interpretable relaxation trajectories. Experimental results show that Traj2Relax achieves competitive accuracy under near-equilibrium conditions and demonstrates clear advantages under moderate to strong perturbations, where energy-driven and distribution-based relaxation methods tend to degrade. On representative inorganic crystal systems, Traj2Relax attains a root-mean-square deviation of 0.26 Å and a space-group consistency of 82.3% under equilibrium settings and maintains a root-mean-square deviation of 0.38 Å with a recovery rate of 5.8% under strong perturbations. The framework further supports deterministic, batch-parallel relaxation, yielding an order-of-magnitude improvement in inference throughput compared with iterative energy-minimization-based approaches. Overall, Traj2Relax provides an efficient and physically grounded alternative for learning-driven structure relaxation, particularly suited for high-throughput screening scenarios involving nonequilibrium or highly perturbed structures.
结构松弛在原子模拟和材料建模中起着至关重要的作用,然而传统的第一性原理方法仍然计算昂贵,因此难以扩展到高通量应用。在这项工作中,我们提出了Traj2Relax,一个基于条件速度场建模的轨迹监督结构松弛框架。Traj2Relax不依赖于明确的能量或力评估,而是从沿着真实弛豫轨迹的连续配置之间的几何差异中学习与时间相关的速度场,从而在广泛的扰动幅度范围内实现物理上一致的结构收敛。在训练过程中引入时序噪声机制,以提高高度扭曲输入下的稳定性,而在推理过程中的确定性集成产生平滑的,可解释的松弛轨迹。实验结果表明,Traj2Relax在接近平衡条件下具有相当的精度,并且在中强扰动下表现出明显的优势,其中能量驱动和基于分布的松弛方法往往会退化。在具有代表性的无机晶体体系中,Traj2Relax在平衡状态下的均方根偏差为0.26 Å,空间群一致性为82.3%;在强扰动下,Traj2Relax的均方根偏差为0.38 Å,回收率为5.8%。该框架进一步支持确定性、批并行松弛,与基于迭代能量最小化的方法相比,在推理吞吐量方面有了数量级的提高。总的来说,Traj2Relax为学习驱动的结构松弛提供了一种高效的物理基础替代方案,特别适用于涉及非平衡或高度摄动结构的高通量筛选场景。
{"title":"Traj2Relax: A Trajectory-Supervised Method for Robust Structure Relaxation.","authors":"Zhiyuan Liu,Quan Qian","doi":"10.1021/acs.jcim.5c02703","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02703","url":null,"abstract":"Structure relaxation plays a crucial role in atomic simulation and materials modeling, yet traditional first-principles approaches remain computationally expensive and, therefore, difficult to scale to high-throughput applications. In this work, we propose Traj2Relax, a trajectory-supervised structure relaxation framework based on conditional velocity field modeling. Instead of relying on explicit energy or force evaluations, Traj2Relax learns a time-dependent velocity field from geometric differences between successive configurations along real relaxation trajectories, enabling physically consistent structural convergence across a wide range of perturbation magnitudes. A time-scheduled noise mechanism is introduced during training to improve stability under highly distorted inputs, while deterministic integration during inference produces smooth, interpretable relaxation trajectories. Experimental results show that Traj2Relax achieves competitive accuracy under near-equilibrium conditions and demonstrates clear advantages under moderate to strong perturbations, where energy-driven and distribution-based relaxation methods tend to degrade. On representative inorganic crystal systems, Traj2Relax attains a root-mean-square deviation of 0.26 Å and a space-group consistency of 82.3% under equilibrium settings and maintains a root-mean-square deviation of 0.38 Å with a recovery rate of 5.8% under strong perturbations. The framework further supports deterministic, batch-parallel relaxation, yielding an order-of-magnitude improvement in inference throughput compared with iterative energy-minimization-based approaches. Overall, Traj2Relax provides an efficient and physically grounded alternative for learning-driven structure relaxation, particularly suited for high-throughput screening scenarios involving nonequilibrium or highly perturbed structures.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"16 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146073238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chloride Binding in Trimeric Coiled Coils: Free Energy and Structural Determinants from Molecular Simulations. 三聚体卷绕线圈中的氯化物结合:分子模拟中的自由能和结构决定因素。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-01-30 DOI: 10.1021/acs.jcim.5c02565
Riccardo Nifosì,Luca Bellucci
Coiled coils, owing to their simple yet versatile architecture, serve as valuable model systems for both experimental and computational studies in protein science. Whereas the sequence-structure relationships that govern their oligomeric state and stability have been thoroughly investigated, important gaps remain, most notably regarding the role of central chloride ions coordinated by asparagine triads observed in several trimeric coiled-coil (TCC) crystal structures. To investigate the thermodynamics of chloride binding at this site, we performed extensive molecular simulations using metadynamics and alchemical free-energy calculations, both enhanced with replica exchange, to determine the chloride binding free energy (ΔGbind) in three TCCs of similar length but different stability (PDB IDs: 2wpy, 4dzk, 1mof). Despite the nearly identical local coordination environment, the computed ΔGbind values strongly depend on the overall protein structure, with variations in superhelical radius R0 upon ion removal systematically accompanying the observed binding thermodynamics. In particular, both the metastable TCC 2wpy─a variant of the GCN4 leucine-zipper domain previously shown to be unstable in the absence of chloride─and the synthetic design 4dzk exhibit highly unfavorable binding, suggesting that current biomolecular force fields may not fully capture either the stabilizing role of chloride or the conformational ensemble of the unbound state. By contrast, the calculated ΔGbind in 1mof, a fragment of the MoMuLV retroviral transmembrane protein, is favorable and is associated with the presence of an additional C-terminal leash domain that modulates the binding-site environment. These results identify TCCs as critical benchmarks for improving the description of anion-protein interactions and the balance between bound and unbound states in future force-field developments.
由于其结构简单而用途广泛,因此在蛋白质科学的实验和计算研究中都是有价值的模型系统。尽管控制其寡聚状态和稳定性的序列-结构关系已被彻底研究,但重要的差距仍然存在,最值得注意的是在几种三聚体线圈(TCC)晶体结构中观察到的由天冬酰胺三合体协调的中心氯离子的作用。为了研究氯化物在该位点的结合热力学,我们使用元动力学和炼金术自由能计算进行了广泛的分子模拟,并进行了副本交换,以确定三种长度相似但稳定性不同的tcc (PDB id: 2wpy, 4dzk, 1mof)中的氯化物结合自由能(ΔGbind)。尽管几乎相同的局部配位环境,计算的ΔGbind值强烈依赖于整体蛋白质结构,随着离子去除,超螺旋半径R0的变化系统地伴随着观察到的结合热力学。特别是,亚稳态的TCC 2wpy (GCN4亮氨酸拉链结构域的一种变体)和合成设计的4dzk都表现出高度不利的结合,这表明当前的生物分子力场可能无法完全捕获氯化物的稳定作用或未结合状态的构象集合。相比之下,计算出的MoMuLV逆转录病毒跨膜蛋白片段1mof中的ΔGbind是有利的,并且与调节结合位点环境的额外c端牵链结构域的存在有关。这些结果表明,在未来的力场发展中,tcc是改进阴离子-蛋白质相互作用描述以及结合和非结合状态平衡的关键基准。
{"title":"Chloride Binding in Trimeric Coiled Coils: Free Energy and Structural Determinants from Molecular Simulations.","authors":"Riccardo Nifosì,Luca Bellucci","doi":"10.1021/acs.jcim.5c02565","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02565","url":null,"abstract":"Coiled coils, owing to their simple yet versatile architecture, serve as valuable model systems for both experimental and computational studies in protein science. Whereas the sequence-structure relationships that govern their oligomeric state and stability have been thoroughly investigated, important gaps remain, most notably regarding the role of central chloride ions coordinated by asparagine triads observed in several trimeric coiled-coil (TCC) crystal structures. To investigate the thermodynamics of chloride binding at this site, we performed extensive molecular simulations using metadynamics and alchemical free-energy calculations, both enhanced with replica exchange, to determine the chloride binding free energy (ΔGbind) in three TCCs of similar length but different stability (PDB IDs: 2wpy, 4dzk, 1mof). Despite the nearly identical local coordination environment, the computed ΔGbind values strongly depend on the overall protein structure, with variations in superhelical radius R0 upon ion removal systematically accompanying the observed binding thermodynamics. In particular, both the metastable TCC 2wpy─a variant of the GCN4 leucine-zipper domain previously shown to be unstable in the absence of chloride─and the synthetic design 4dzk exhibit highly unfavorable binding, suggesting that current biomolecular force fields may not fully capture either the stabilizing role of chloride or the conformational ensemble of the unbound state. By contrast, the calculated ΔGbind in 1mof, a fragment of the MoMuLV retroviral transmembrane protein, is favorable and is associated with the presence of an additional C-terminal leash domain that modulates the binding-site environment. These results identify TCCs as critical benchmarks for improving the description of anion-protein interactions and the balance between bound and unbound states in future force-field developments.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"23 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
XRepDDA: An Interpretable Drug-Disease Association Prediction Framework Leveraging Pretrained Chemical Language Models. XRepDDA:利用预训练化学语言模型的可解释药物-疾病关联预测框架。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-01-30 DOI: 10.1021/acs.jcim.5c02901
Chenyi Zhang,Yun Zuo,Qiao Ning,Sisi Yuan,Zhaohong Deng,Hongwei Yin,Anjing Zhao
Drug repositioning aims to identify new indications for existing drugs, offering a cost-effective and time-efficient strategy for therapeutic development. Its core challenge lies in accurately predicting potential drug-disease associations (DDAs). However, existing computational approaches often suffer from inadequate drug representation, insufficient modeling of disease semantics, and imbalanced data distributions, which collectively limit predictive accuracy and generalization ability. To address these challenges, we propose an innovative framework, termed XRepDDA, that integrates multimodal feature representation with deep metric learning to improve DDA prediction accuracy and robustness. For drug representation, the SMI-TED pretrained chemical language model encodes SMILES sequences into chemically informative molecular embeddings. For disease representation, a hierarchical semantic graph based on the MeSH ontology is constructed together with a semantic-enhanced graph embedding strategy to capture hierarchical and semantic relationships among diseases. To mitigate class imbalance, we applied the AllKNN adaptive undersampling strategy. The prediction module is built upon an improved ModernNCA architecture, which learns a discriminative embedding space through deep metric learning. Experiments on multiple public benchmark data sets demonstrate that XRepDDA consistently outperforms diverse baseline models, including traditional machine learning, tree-based ensemble, and deep learning methods, achieving AUC and AUPR values of up to 0.9990 and 0.9991, respectively. Furthermore, molecular docking experiments on top-ranked candidate drugs for Alzheimer's disease and stomach neoplasms provide in silico validation of predictive reliability. To enhance interpretability, a multilevel explainability framework is established, combining SHAP-based global feature attribution with attention mechanisms and molecular perturbation analyses to identify key features and pharmacophores at the local level. These results support the chemical interpretability and the biological plausibility of the predictions.
药物重新定位旨在确定现有药物的新适应症,为治疗开发提供具有成本效益和时间效率的策略。其核心挑战在于准确预测潜在的药物-疾病关联(DDAs)。然而,现有的计算方法往往存在药物表示不足、疾病语义建模不足和数据分布不平衡的问题,这些问题共同限制了预测的准确性和泛化能力。为了应对这些挑战,我们提出了一个名为XRepDDA的创新框架,该框架将多模态特征表示与深度度量学习相结合,以提高DDA预测的准确性和鲁棒性。对于药物表示,SMI-TED预训练的化学语言模型将SMILES序列编码为化学信息丰富的分子嵌入。在疾病表示方面,构建了基于MeSH本体的分层语义图,并结合语义增强的图嵌入策略来捕获疾病之间的层次和语义关系。为了缓解类不平衡,我们采用了AllKNN自适应欠采样策略。预测模块建立在改进的ModernNCA架构上,该架构通过深度度量学习学习判别嵌入空间。在多个公共基准数据集上的实验表明,XRepDDA始终优于各种基线模型,包括传统的机器学习、基于树的集成和深度学习方法,AUC和AUPR值分别高达0.9990和0.9991。此外,针对阿尔茨海默病和胃肿瘤的顶级候选药物的分子对接实验提供了预测可靠性的计算机验证。为了提高可解释性,我们建立了一个多层次的可解释性框架,将基于shap的全局特征归因与注意机制和分子微扰分析相结合,在局部水平上识别关键特征和药效团。这些结果支持了这些预测的化学可解释性和生物学合理性。
{"title":"XRepDDA: An Interpretable Drug-Disease Association Prediction Framework Leveraging Pretrained Chemical Language Models.","authors":"Chenyi Zhang,Yun Zuo,Qiao Ning,Sisi Yuan,Zhaohong Deng,Hongwei Yin,Anjing Zhao","doi":"10.1021/acs.jcim.5c02901","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02901","url":null,"abstract":"Drug repositioning aims to identify new indications for existing drugs, offering a cost-effective and time-efficient strategy for therapeutic development. Its core challenge lies in accurately predicting potential drug-disease associations (DDAs). However, existing computational approaches often suffer from inadequate drug representation, insufficient modeling of disease semantics, and imbalanced data distributions, which collectively limit predictive accuracy and generalization ability. To address these challenges, we propose an innovative framework, termed XRepDDA, that integrates multimodal feature representation with deep metric learning to improve DDA prediction accuracy and robustness. For drug representation, the SMI-TED pretrained chemical language model encodes SMILES sequences into chemically informative molecular embeddings. For disease representation, a hierarchical semantic graph based on the MeSH ontology is constructed together with a semantic-enhanced graph embedding strategy to capture hierarchical and semantic relationships among diseases. To mitigate class imbalance, we applied the AllKNN adaptive undersampling strategy. The prediction module is built upon an improved ModernNCA architecture, which learns a discriminative embedding space through deep metric learning. Experiments on multiple public benchmark data sets demonstrate that XRepDDA consistently outperforms diverse baseline models, including traditional machine learning, tree-based ensemble, and deep learning methods, achieving AUC and AUPR values of up to 0.9990 and 0.9991, respectively. Furthermore, molecular docking experiments on top-ranked candidate drugs for Alzheimer's disease and stomach neoplasms provide in silico validation of predictive reliability. To enhance interpretability, a multilevel explainability framework is established, combining SHAP-based global feature attribution with attention mechanisms and molecular perturbation analyses to identify key features and pharmacophores at the local level. These results support the chemical interpretability and the biological plausibility of the predictions.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"191 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ArtiDock: Accurate Machine Learning Approach to Protein-Ligand Docking Optimized for High-Throughput Virtual Screening. ArtiDock:针对高通量虚拟筛选优化的蛋白质配体对接的精确机器学习方法。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-01-30 DOI: 10.1021/acs.jcim.5c02777
Taras Voitsitskyi, Ihor Koleiev, Roman Stratiichuk, Oleksandr Kot, Roman Kyrylenko, Illia Savchenko, Vladyslav Husak, Semen Yesylevskyy, Sergii Starosyla, Alan Nafiiev

Classical protein-ligand docking has been a cornerstone technique in computational drug discovery for decades but has reached an accuracy and performance plateau. Recently introduced Machine Learning (ML)-based docking methods offer a promising paradigm shift, but their practical adoption is hampered by accuracy-to-speed trade-offs, inadequate benchmarking standards, and questionable chemical validity of predicted poses. In this study, we introduce ArtiDock─an ML-based docking technique optimized for high-throughput virtual screening applications. To evaluate ArtiDock, we developed a dedicated performance and accuracy benchmark for pocket-specific rigid protein-ligand docking, which mimics realistic industrial drug discovery scenarios and is based on the novel PLINDER data set. We demonstrate that ArtiDock is 29-38% more accurate in comparison to leading open-source and commercial classical docking techniques such as AutoDock, Vina, and Glide, while providing a low computational cost. ArtiDock notably excels in challenging docking scenarios involving unbound protein structures and binding sites containing ions and structured water molecules. Additionally, we demonstrated competitive accuracy of our approach at considerably higher throughput compared to a wide range of AI docking and AI cofolding methods using the PoseX benchmark. Our results show that ArtiDock could be considered as a method of choice in high-throughput virtual screening scenarios.

几十年来,经典的蛋白质配体对接一直是计算药物发现的基础技术,但其准确性和性能已经达到了平台期。最近推出的基于机器学习(ML)的对接方法提供了一个有希望的范式转变,但它们的实际应用受到准确性与速度权衡、基准标准不足以及预测姿势的化学有效性问题的阻碍。在这项研究中,我们介绍了ArtiDock──一种基于ml的对接技术,针对高通量虚拟筛选应用进行了优化。为了评估ArtiDock,我们开发了一个专用的口袋特异性刚性蛋白质配体对接的性能和准确性基准,该基准模拟了现实的工业药物发现场景,并基于新的PLINDER数据集。我们证明,与AutoDock、Vina和Glide等领先的开源和商业经典对接技术相比,ArtiDock的精度提高了29-38%,同时提供了较低的计算成本。ArtiDock在涉及未结合蛋白质结构和含有离子和结构水分子的结合位点的具有挑战性的对接场景中表现出色。此外,与使用PoseX基准的广泛的AI对接和AI共折叠方法相比,我们证明了我们的方法在更高吞吐量下具有竞争力的准确性。我们的研究结果表明,ArtiDock可以被认为是高通量虚拟筛选场景中的一种选择方法。
{"title":"ArtiDock: Accurate Machine Learning Approach to Protein-Ligand Docking Optimized for High-Throughput Virtual Screening.","authors":"Taras Voitsitskyi, Ihor Koleiev, Roman Stratiichuk, Oleksandr Kot, Roman Kyrylenko, Illia Savchenko, Vladyslav Husak, Semen Yesylevskyy, Sergii Starosyla, Alan Nafiiev","doi":"10.1021/acs.jcim.5c02777","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02777","url":null,"abstract":"<p><p>Classical protein-ligand docking has been a cornerstone technique in computational drug discovery for decades but has reached an accuracy and performance plateau. Recently introduced Machine Learning (ML)-based docking methods offer a promising paradigm shift, but their practical adoption is hampered by accuracy-to-speed trade-offs, inadequate benchmarking standards, and questionable chemical validity of predicted poses. In this study, we introduce ArtiDock─an ML-based docking technique optimized for high-throughput virtual screening applications. To evaluate ArtiDock, we developed a dedicated performance and accuracy benchmark for pocket-specific rigid protein-ligand docking, which mimics realistic industrial drug discovery scenarios and is based on the novel PLINDER data set. We demonstrate that ArtiDock is 29-38% more accurate in comparison to leading open-source and commercial classical docking techniques such as AutoDock, Vina, and Glide, while providing a low computational cost. ArtiDock notably excels in challenging docking scenarios involving unbound protein structures and binding sites containing ions and structured water molecules. Additionally, we demonstrated competitive accuracy of our approach at considerably higher throughput compared to a wide range of AI docking and AI cofolding methods using the PoseX benchmark. Our results show that ArtiDock could be considered as a method of choice in high-throughput virtual screening scenarios.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146091591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty-Aware Prediction of 195Pt Chemical Shifts from Limited Data. 有限数据对195Pt化学位移的不确定性预测。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-01-29 DOI: 10.1021/acs.jcim.5c02541
Alexander Meßler,Hilke Bahmann
Platinum (Pt) complexes are highly relevant for medicinal chemistry and homogeneous catalysis. In the development of novel Pt-based chemotherapeutic agents and catalysts, characterization of the compounds using nuclear magnetic resonance (NMR) spectroscopy of the 195Pt nucleus is standard. However, measuring 195Pt-NMR signals can be tedious due to the large chemical shift range and limited resolution. To facilitate experimental measurements by narrowing down the shift range, reliable predictions of the chemical shift are needed. Especially for lighter nuclei such as 1H and 13C, machine learning (ML) methods predict chemical shifts accurately, while analogous models for heavier nuclei are scarce. In this work, we propose Gaussian Process Regression (GPR) models for the prediction of 195Pt chemical shifts. The underlying data set comprises 292 structures and three different descriptors were used to encode structural and chemical features of the molecules. Based on the prediction uncertainties derived from the posterior variance of the models, a reasonably narrow shift range can be estimated for a given Pt complex. The most robust model yields a mean absolute error (MAE) of 114 ppm on the holdout test set, which is significantly more accurate than relativistic DFT calculations.
铂(Pt)配合物与药物化学和均相催化密切相关。在新型基于pt的化疗药物和催化剂的开发中,使用195Pt核磁共振(NMR)光谱来表征化合物是标准的。然而,由于化学位移范围大,分辨率有限,测量195Pt-NMR信号可能很繁琐。为了通过缩小位移范围来方便实验测量,需要对化学位移进行可靠的预测。特别是对于1H和13C等较轻的原子核,机器学习(ML)方法可以准确地预测化学位移,而对于较重的原子核,类似的模型很少。在这项工作中,我们提出了高斯过程回归(GPR)模型来预测195Pt的化学位移。基础数据集包括292个结构,并使用三种不同的描述符来编码分子的结构和化学特征。基于模型后验方差的预测不确定性,对于给定的铂复合体,可以估计出一个相当窄的偏移范围。最稳健的模型在holdout测试集上产生的平均绝对误差(MAE)为114 ppm,这比相对论DFT计算要准确得多。
{"title":"Uncertainty-Aware Prediction of 195Pt Chemical Shifts from Limited Data.","authors":"Alexander Meßler,Hilke Bahmann","doi":"10.1021/acs.jcim.5c02541","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02541","url":null,"abstract":"Platinum (Pt) complexes are highly relevant for medicinal chemistry and homogeneous catalysis. In the development of novel Pt-based chemotherapeutic agents and catalysts, characterization of the compounds using nuclear magnetic resonance (NMR) spectroscopy of the 195Pt nucleus is standard. However, measuring 195Pt-NMR signals can be tedious due to the large chemical shift range and limited resolution. To facilitate experimental measurements by narrowing down the shift range, reliable predictions of the chemical shift are needed. Especially for lighter nuclei such as 1H and 13C, machine learning (ML) methods predict chemical shifts accurately, while analogous models for heavier nuclei are scarce. In this work, we propose Gaussian Process Regression (GPR) models for the prediction of 195Pt chemical shifts. The underlying data set comprises 292 structures and three different descriptors were used to encode structural and chemical features of the molecules. Based on the prediction uncertainties derived from the posterior variance of the models, a reasonably narrow shift range can be estimated for a given Pt complex. The most robust model yields a mean absolute error (MAE) of 114 ppm on the holdout test set, which is significantly more accurate than relativistic DFT calculations.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"182 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146073276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unravelling the Role Played by Non-covalent Interactions in the Action Mechanism of PCDDs within Cells. 揭示非共价相互作用在细胞内pcdd作用机制中的作用。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-01-29 DOI: 10.1021/acs.jcim.5c02555
Lorena Ruano,Álvaro Pérez-Barcia,Vito F Palmisano,Juan J Nogueira,Marcos Mandado,Nicolás Ramos-Berdullas
The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor that mediates biological signals and regulates diverse cellular functions. Of particular concern are the effects triggered by dioxins and dioxin-like compounds (DLCs), whose toxicological outcomes arise through both canonical and noncanonical pathways, leading to the designation of AhR as the "dioxin receptor". However, conventional risk assessment approaches based on toxic equivalency factors (TEFs), which primarily reflect the capacity of these compounds to bind and activate AhR, do not fully account for critical aspects such as environmental concentration and bioavailability, potentially underestimating their true impact. In this work, we present a comparative analysis of polychlorinated dibenzo-p-dioxins (PCDDs) with varying degrees of chlorination, focusing on their interactions with the AhR at the ligand-binding domain and on their permeation abilities across a model lipid membrane. To this end, we combine classical molecular dynamics (CMD) simulations with a hybrid quantum mechanics/molecular mechanics energy decomposition analysis (QM/MM-EDA) framework. This integrated approach enables a molecular-level characterization of receptor binding affinities and membrane permeation efficiencies. Our findings provide novel insights into the mechanisms underlying the relative toxicity of DLCs and highlight the need for integrative assessment strategies that encompass both receptor-ligand interactions and physicochemical behavior in biological environments. It is noteworthy that the toxicity of these compounds, as quantified by the pEC50 index, correlates with the membrane permeation barrier rather than with AhR binding affinity, identifying permeation as the key mechanistic step in the toxicological process of these compounds.
芳烃受体(aryl hydrocarbon receptor, AhR)是一种配体激活的转录因子,介导生物信号,调节多种细胞功能。特别值得关注的是二恶英和二恶英样化合物(dlc)引发的影响,其毒理学结果通过规范和非规范途径产生,导致AhR被指定为“二恶英受体”。然而,基于毒性等效因子(tef)的传统风险评估方法主要反映了这些化合物结合和激活AhR的能力,并没有充分考虑到环境浓度和生物利用度等关键方面,可能低估了它们的真正影响。在这项工作中,我们对不同氯化程度的多氯二苯并对二恶英(pcdd)进行了比较分析,重点研究了它们在配体结合区域与AhR的相互作用以及它们在模型脂质膜上的渗透能力。为此,我们将经典分子动力学(CMD)模拟与混合量子力学/分子力学能量分解分析(QM/MM-EDA)框架相结合。这种综合方法能够在分子水平上表征受体结合亲和力和膜渗透效率。我们的研究结果为dlc的相对毒性机制提供了新的见解,并强调了综合评估策略的必要性,包括受体-配体相互作用和生物环境中的物理化学行为。值得注意的是,通过pEC50指数量化的这些化合物的毒性与膜渗透屏障相关,而不是与AhR结合亲和力相关,这表明渗透是这些化合物毒理学过程中的关键机制步骤。
{"title":"Unravelling the Role Played by Non-covalent Interactions in the Action Mechanism of PCDDs within Cells.","authors":"Lorena Ruano,Álvaro Pérez-Barcia,Vito F Palmisano,Juan J Nogueira,Marcos Mandado,Nicolás Ramos-Berdullas","doi":"10.1021/acs.jcim.5c02555","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02555","url":null,"abstract":"The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor that mediates biological signals and regulates diverse cellular functions. Of particular concern are the effects triggered by dioxins and dioxin-like compounds (DLCs), whose toxicological outcomes arise through both canonical and noncanonical pathways, leading to the designation of AhR as the \"dioxin receptor\". However, conventional risk assessment approaches based on toxic equivalency factors (TEFs), which primarily reflect the capacity of these compounds to bind and activate AhR, do not fully account for critical aspects such as environmental concentration and bioavailability, potentially underestimating their true impact. In this work, we present a comparative analysis of polychlorinated dibenzo-p-dioxins (PCDDs) with varying degrees of chlorination, focusing on their interactions with the AhR at the ligand-binding domain and on their permeation abilities across a model lipid membrane. To this end, we combine classical molecular dynamics (CMD) simulations with a hybrid quantum mechanics/molecular mechanics energy decomposition analysis (QM/MM-EDA) framework. This integrated approach enables a molecular-level characterization of receptor binding affinities and membrane permeation efficiencies. Our findings provide novel insights into the mechanisms underlying the relative toxicity of DLCs and highlight the need for integrative assessment strategies that encompass both receptor-ligand interactions and physicochemical behavior in biological environments. It is noteworthy that the toxicity of these compounds, as quantified by the pEC50 index, correlates with the membrane permeation barrier rather than with AhR binding affinity, identifying permeation as the key mechanistic step in the toxicological process of these compounds.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"34 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146073277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemical Information and Modeling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1