Journal of Chemical Information and Modeling 最新文献_第3页

Constant-pH Molecular Dynamics of Cationic Peptide Dendrimers Binding to siRNA. 阳离子肽树状大分子与siRNA结合的恒ph分子动力学。

IF 5.3 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-05 DOI: 10.1021/acs.jcim.5c02636

Filipe E P Rodrigues, Tamis Darbre, Miguel Machuqueiro

Transfection, the process of delivering genetic material into eukaryotic cells, is crucial in biotechnology and the development of treatments. Naked nucleic acids face challenges such as enzymatic degradation, poor pharmacokinetics, and immunogenicity, which can be mitigated by delivery systems such as liposomes, cationic polymers, and dendrimers that protect and enhance uptake. Peptide dendrimers, in particular, show promise as nucleic acid carriers due to their lower cytotoxicity and immunogenicity, though their mechanisms, efficiency, and optimization remain to be clarified. Here, we characterized the configurational, conformational, and protonation landscapes of different peptide dendrimers in complex with siRNA. We found that nucleic acids modulate dendrimer structure, with electrostatic interactions strengthened at low pH through enhanced protonation of the N-termini. Although experimental data show that the more hydrophobic dendrimer examined displays the highest apparent affinity for siRNA, its reduced number of lysine residues results in weaker overall binding due to diminished charge density. This higher affinity observed is likely linked to increased aggregation propensity. In contrast, the dendrimer sequence with branching residues of inverted chirality, which performs worse, shows the lowest propensity for aggregation. Our work suggests that chirality has only a negligible effect on the dendrimer-siRNA binding modes, and that such differences are subtle, particularly at the monomeric level. Overall, this work provides mechanistic insight into dendrimer-siRNA interactions and outlines potential strategies to refine dendrimer design for improved nucleic acid delivery.

转染是将遗传物质传递到真核细胞的过程，在生物技术和治疗发展中至关重要。裸核酸面临着酶降解、不良药代动力学和免疫原性等挑战，这些挑战可以通过脂质体、阳离子聚合物和保护和增强摄取的树状大分子等递送系统来缓解。特别是肽树状大分子，由于其较低的细胞毒性和免疫原性，显示出作为核酸载体的希望，尽管其机制、效率和优化仍有待阐明。在这里，我们表征了不同的肽树状大分子与siRNA复合物的构型、构象和质子化景观。我们发现核酸调节树突结构，在低pH下，通过增强n端质子化，静电相互作用得到加强。尽管实验数据表明，越疏水的树状大分子对siRNA表现出最高的表观亲和力，但由于电荷密度的降低，其赖氨酸残基数量的减少导致整体结合较弱。观察到的这种较高的亲和力可能与增加的聚集倾向有关。相反，具有倒手性分支残基的树状大分子序列表现较差，其聚集倾向最低。我们的研究表明，手性对树突- sirna结合模式的影响可以忽略不计，而且这种差异是微妙的，特别是在单体水平上。总的来说，这项工作提供了树突- sirna相互作用的机制见解，并概述了改进树突设计以改善核酸传递的潜在策略。

{"title":"Constant-pH Molecular Dynamics of Cationic Peptide Dendrimers Binding to siRNA.","authors":"Filipe E P Rodrigues, Tamis Darbre, Miguel Machuqueiro","doi":"10.1021/acs.jcim.5c02636","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02636","url":null,"abstract":"Transfection, the process of delivering genetic material into eukaryotic cells, is crucial in biotechnology and the development of treatments. Naked nucleic acids face challenges such as enzymatic degradation, poor pharmacokinetics, and immunogenicity, which can be mitigated by delivery systems such as liposomes, cationic polymers, and dendrimers that protect and enhance uptake. Peptide dendrimers, in particular, show promise as nucleic acid carriers due to their lower cytotoxicity and immunogenicity, though their mechanisms, efficiency, and optimization remain to be clarified. Here, we characterized the configurational, conformational, and protonation landscapes of different peptide dendrimers in complex with siRNA. We found that nucleic acids modulate dendrimer structure, with electrostatic interactions strengthened at low pH through enhanced protonation of the N-termini. Although experimental data show that the more hydrophobic dendrimer examined displays the highest apparent affinity for siRNA, its reduced number of lysine residues results in weaker overall binding due to diminished charge density. This higher affinity observed is likely linked to increased aggregation propensity. In contrast, the dendrimer sequence with branching residues of inverted chirality, which performs worse, shows the lowest propensity for aggregation. Our work suggests that chirality has only a negligible effect on the dendrimer-siRNA binding modes, and that such differences are subtle, particularly at the monomeric level. Overall, this work provides mechanistic insight into dendrimer-siRNA interactions and outlines potential strategies to refine dendrimer design for improved nucleic acid delivery.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146117231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prediction of Charged Small Molecule Conformations in Solution Using a Balanced ML/MM Potential. 利用平衡ML/MM电位预测溶液中带电小分子构象。

IF 5.3 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-05 DOI: 10.1021/acs.jcim.5c02706

Christopher D Williams, Neil A Burton, Richard A Bryce

Reliably evaluating the most stable conformations of charged small organic molecules in solution poses a major challenge to computational chemistry, due to limitations in the accuracy or efficiency of conventional methodologies. In this paper, we present a hybrid machine learning/molecular mechanics (ML/MM) potential, with dynamic, conformationally dependent charges on the atoms in the ML region, that can be used to address this important challenge in drug design. By way of a case study, metadynamics-enhanced molecular dynamics simulations were used to compare the performance of several intermolecular potentials in evaluating the solution phase conformational free energy differences of pharmaceutically relevant ligands based on the protonated 2-phenylethylamine scaffold. A straightforward approach to an ML/MM potential, in which the solute's intramolecular interactions are substituted by an ML model and making no other modifications to the force field, yields results inferior to a conventional fixed-charge MM potential, due to an imbalance created in the intra- and intermolecular interactions. To remedy this shortcoming, we present a new ML/MM potential that combines an accurate, trained-for-purpose ML model (PairFEQ-Net) of the charged ligand, with the polarizable SWM3 MM model of water. Crucially, by employing an empirical parameter to scale gas phase charges, the ML model predicts dynamic, solvent-polarized charges that embed the ligand in the solvent in a more balanced way. This ML/MM potential results in mean absolute errors in the prediction of conformational free energies of just 0.5 kcal mol^-1, hence furnishing a possible route to the chemically accurate prediction of the shapes of charged organic molecules in aqueous solution.

由于传统方法的准确性和效率的限制，可靠地评估溶液中带电有机小分子的最稳定构象对计算化学构成了重大挑战。在本文中，我们提出了一种混合机器学习/分子力学（ML/MM）电位，在ML区域的原子上具有动态的构象依赖电荷，可用于解决药物设计中的这一重要挑战。通过实例研究，采用元动力学增强的分子动力学模拟，比较了基于质子化2-苯基乙胺支架的药物相关配体的溶液相构象自由能差异的几个分子间势的性能。用ML模型代替溶质的分子内相互作用，对力场不作任何修改，直接得到ML/MM电位的方法，由于分子内和分子间相互作用产生的不平衡，其结果不如传统的固定电荷MM电位。为了弥补这一缺陷，我们提出了一种新的ML/MM电位，该电位结合了带电配体的精确、训练的ML模型（PairFEQ-Net）和水的极化SWM3 MM模型。至关重要的是，通过采用经验参数来衡量气相电荷，ML模型预测了以更平衡的方式将配体嵌入溶剂中的动态溶剂极化电荷。这种ML/MM电位导致预测构象自由能的平均绝对误差仅为0.5 kcal mol-1，因此为精确预测水溶液中带电有机分子的形状提供了一条可能的途径。

{"title":"Prediction of Charged Small Molecule Conformations in Solution Using a Balanced ML/MM Potential.","authors":"Christopher D Williams, Neil A Burton, Richard A Bryce","doi":"10.1021/acs.jcim.5c02706","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02706","url":null,"abstract":"Reliably evaluating the most stable conformations of charged small organic molecules in solution poses a major challenge to computational chemistry, due to limitations in the accuracy or efficiency of conventional methodologies. In this paper, we present a hybrid machine learning/molecular mechanics (ML/MM) potential, with dynamic, conformationally dependent charges on the atoms in the ML region, that can be used to address this important challenge in drug design. By way of a case study, metadynamics-enhanced molecular dynamics simulations were used to compare the performance of several intermolecular potentials in evaluating the solution phase conformational free energy differences of pharmaceutically relevant ligands based on the protonated 2-phenylethylamine scaffold. A straightforward approach to an ML/MM potential, in which the solute's intramolecular interactions are substituted by an ML model and making no other modifications to the force field, yields results inferior to a conventional fixed-charge MM potential, due to an imbalance created in the intra- and intermolecular interactions. To remedy this shortcoming, we present a new ML/MM potential that combines an accurate, trained-for-purpose ML model (PairFEQ-Net) of the charged ligand, with the polarizable SWM3 MM model of water. Crucially, by employing an empirical parameter to scale gas phase charges, the ML model predicts dynamic, solvent-polarized charges that embed the ligand in the solvent in a more balanced way. This ML/MM potential results in mean absolute errors in the prediction of conformational free energies of just 0.5 kcal mol-1, hence furnishing a possible route to the chemically accurate prediction of the shapes of charged organic molecules in aqueous solution.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146123177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

C++ Toolkit for Bimetallic Cluster Structure Optimization Using Collaborative Differential Evolution 基于协同差分进化的双金属簇结构优化c++工具包

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-04 DOI: 10.1021/acs.jcim.5c02790

Xiaomin Wu,Miao He,Yousi Lin

Global optimization of bimetallic and monometallic cluster structures remains computationally challenging, particularly due to the rapid increase in homotops with system size and compositional complexity. To address this issue, we present a Collaborative Differential Evolution (CDE) algorithm featuring a multisubpopulation collaborative architecture specifically designed for efficient structure prediction of diverse nanocluster systems. The framework integrates three functionally specialized subpopulations for exploration, exploitation, and balance along with adaptive operations tailored for metallic nanoclusters. This algorithm is implemented as a user-friendly online C++ toolkit. We demonstrate the versatility and robustness of our approach through comprehensive structural optimization across three distinct case studies: Pt–Pd and Cu–Au bimetallic clusters, as well as monometallic Pt clusters. The CDE algorithm consistently achieves 50–100% faster convergence and superior stability compared to conventional methods across all tested systems, establishing itself as a robust and generalizable tool for accelerating the discovery of stable configurations in diverse cluster materials.

双金属和单金属簇结构的全局优化在计算上仍然具有挑战性，特别是由于同伦顶随着系统尺寸和组成复杂性的快速增加。为了解决这一问题，我们提出了一种协作差分进化（CDE）算法，该算法具有多亚群协作架构，专为不同纳米团簇系统的有效结构预测而设计。该框架集成了三个功能专门的亚群，用于勘探、开发和平衡，以及为金属纳米团簇量身定制的自适应操作。该算法是作为一个用户友好的在线c++工具包实现的。我们通过三个不同的案例研究（Pt - pd和Cu-Au双金属团簇以及单金属Pt团簇）的全面结构优化，展示了我们方法的多功能性和稳健性。与传统方法相比，CDE算法在所有测试系统中都能实现50-100%的快速收敛和卓越的稳定性，使其成为一种强大且可推广的工具，可加速发现不同簇材料的稳定结构。

{"title":"C++ Toolkit for Bimetallic Cluster Structure Optimization Using Collaborative Differential Evolution","authors":"Xiaomin Wu,Miao He,Yousi Lin","doi":"10.1021/acs.jcim.5c02790","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02790","url":null,"abstract":"Global optimization of bimetallic and monometallic cluster structures remains computationally challenging, particularly due to the rapid increase in homotops with system size and compositional complexity. To address this issue, we present a Collaborative Differential Evolution (CDE) algorithm featuring a multisubpopulation collaborative architecture specifically designed for efficient structure prediction of diverse nanocluster systems. The framework integrates three functionally specialized subpopulations for exploration, exploitation, and balance along with adaptive operations tailored for metallic nanoclusters. This algorithm is implemented as a user-friendly online C++ toolkit. We demonstrate the versatility and robustness of our approach through comprehensive structural optimization across three distinct case studies: Pt–Pd and Cu–Au bimetallic clusters, as well as monometallic Pt clusters. The CDE algorithm consistently achieves 50–100% faster convergence and superior stability compared to conventional methods across all tested systems, establishing itself as a robust and generalizable tool for accelerating the discovery of stable configurations in diverse cluster materials.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"1 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146111105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Mapping Still Matters: Coarse-Graining with Machine Learning Potentials 映射仍然重要：具有机器学习潜力的粗粒度

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-04 DOI: 10.1021/acs.jcim.5c03035

Franz Görlich,Julija Zavadlav

Coarse-grained (CG) modeling enables molecular simulations to reach time and length scales inaccessible to fully atomistic methods. For classical CG models, the choice of mapping, that is, how atoms are grouped into CG sites, is a major determinant of accuracy and transferability. At the same time, the emergence of machine learning potentials (MLPs) offers new opportunities to build CG models that can in principle learn the true potential of the mean force for any mapping. In this work, we systematically investigate how the choice of mapping influences the representations learned by equivariant MLPs by studying liquid hexane, amino acids, and polyalanine. We find that when the length scales of bonded and nonbonded interactions overlap, unphysical bond permutations can occur. We also demonstrate that correctly encoding species and maintaining stereochemistry are crucial, as neglecting either introduces unphysical symmetries. Our findings provide practical guidance for selecting CG mappings compatible with modern architectures and guide the development of transferable CG models.

粗粒度（CG）建模使分子模拟能够达到完全原子化方法无法达到的时间和长度尺度。对于经典的CG模型，选择映射，即原子如何分组到CG位置，是准确性和可转移性的主要决定因素。与此同时，机器学习潜力（mlp）的出现为构建CG模型提供了新的机会，这些模型原则上可以学习任何映射的平均力的真正潜力。在这项工作中，我们通过研究液体己烷、氨基酸和聚丙氨酸，系统地研究了映射的选择如何影响等变mlp学习的表征。我们发现当键和非键相互作用的长度尺度重叠时，会发生非物理键排列。我们还证明了正确编码物种和维持立体化学是至关重要的，因为忽略任何一个都会引入非物理对称性。我们的发现为选择与现代建筑兼容的CG映射提供了实践指导，并指导了可转移CG模型的发展。

引用次数: 0

Accelerating Siloxane-Based Ionizable Lipid Design for LNPs with Data-Efficient Kolmogorov–Arnold Networks 利用数据高效的Kolmogorov-Arnold网络加速LNPs中基于硅氧烷的可电离脂质设计

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-04 DOI: 10.1021/acs.jcim.5c02871

Yujing Zhao,Juntao Wang,Yuxin Song,Qilei Liu,Jiaqi Lin

Ionizable lipids are fundamental to the efficacy of lipid nanoparticles (LNPs) in pivotal areas including mRNA vaccines. Their development, however, is hindered by intricate structure–property relationships and limited experimental data. To address these challenges, this study proposed a small-data-driven framework that pioneered the use of Kolmogorov–Arnold networks (KANs)─a symbolic regression-based machine learning (ML) approach─to accelerate the discovery of novel siloxane-based ionizable lipids. Using only 36 training samples, the resulting KAN model demonstrated high predictive accuracy for mRNA delivery efficiency (Qcv2 = 0.710), outperforming conventional ML models by an average absolute improvement of 0.627 in cross-validation and yielding explicit mathematical formulas. Combined with virtual screening and umbrella sampling simulations, the framework identified three candidate lipids with superior predicted performance. Molecular dynamics simulations validated that the optimal candidate achieved stronger binding affinity to the endosomal membrane, as evidenced by a 187% reduction (from −1.048 to −3.011 kcal/mol) in the binding free energy minimum compared to the best experimental control. This result aligns with the delivery efficiency predicted by the KAN model. Overall, the proposed framework establishes a data-efficient paradigm for ML-guided ionizable lipid design, bridging symbolic regression with molecular dynamics validation for next-generation LNP therapeutics.

可电离脂质是脂质纳米颗粒（LNPs）在关键领域（包括mRNA疫苗）功效的基础。然而，它们的发展受到复杂的结构-性质关系和有限的实验数据的阻碍。为了应对这些挑战，本研究提出了一个小数据驱动的框架，该框架率先使用Kolmogorov-Arnold网络（KANs）──一种基于符号回归的机器学习（ML）方法──来加速发现新型硅氧烷基可电离脂类。仅使用36个训练样本，所得到的KAN模型对mRNA传递效率的预测精度很高（Qcv2 = 0.710），在交叉验证中优于传统ML模型的平均绝对改进0.627，并产生明确的数学公式。结合虚拟筛选和伞式采样模拟，该框架确定了三种具有优越预测性能的候选脂质。分子动力学模拟证实，与最佳实验对照相比，最优候选物与内体膜具有更强的结合亲和力，结合自由能最小值降低了187%（从−1.048 kcal/mol降至−3.011 kcal/mol）。这一结果与KAN模型预测的输送效率一致。总的来说，提出的框架为ml引导的可电离脂质设计建立了一个数据高效的范例，将符号回归与下一代LNP治疗的分子动力学验证联系起来。

{"title":"Accelerating Siloxane-Based Ionizable Lipid Design for LNPs with Data-Efficient Kolmogorov–Arnold Networks","authors":"Yujing Zhao,Juntao Wang,Yuxin Song,Qilei Liu,Jiaqi Lin","doi":"10.1021/acs.jcim.5c02871","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02871","url":null,"abstract":"Ionizable lipids are fundamental to the efficacy of lipid nanoparticles (LNPs) in pivotal areas including mRNA vaccines. Their development, however, is hindered by intricate structure–property relationships and limited experimental data. To address these challenges, this study proposed a small-data-driven framework that pioneered the use of Kolmogorov–Arnold networks (KANs)─a symbolic regression-based machine learning (ML) approach─to accelerate the discovery of novel siloxane-based ionizable lipids. Using only 36 training samples, the resulting KAN model demonstrated high predictive accuracy for mRNA delivery efficiency (Qcv2 = 0.710), outperforming conventional ML models by an average absolute improvement of 0.627 in cross-validation and yielding explicit mathematical formulas. Combined with virtual screening and umbrella sampling simulations, the framework identified three candidate lipids with superior predicted performance. Molecular dynamics simulations validated that the optimal candidate achieved stronger binding affinity to the endosomal membrane, as evidenced by a 187% reduction (from −1.048 to −3.011 kcal/mol) in the binding free energy minimum compared to the best experimental control. This result aligns with the delivery efficiency predicted by the KAN model. Overall, the proposed framework establishes a data-efficient paradigm for ML-guided ionizable lipid design, bridging symbolic regression with molecular dynamics validation for next-generation LNP therapeutics.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"398 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146111085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Transforming MOF Modeling with Machine-Learned Potentials: Progress and Perspectives 用机器学习潜力转化MOF建模：进展与展望

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-03 DOI: 10.1021/acs.jcim.5c02712

Omer Tayfuroglu,Seda Keskin

Machine-learned potentials (MLPs) have emerged as transformative tools for modeling metal–organic frameworks (MOFs), bridging the accuracy of quantum mechanics with the efficiency required for large-scale molecular simulations. By learning the potential energy surface directly from quantum-mechanical reference data, MLPs enable a unified description of the complex nature of MOFs and their interactions with guest molecules across multiple length and time scales. Recent developments have demonstrated the capability of MLPs to model intrinsic MOF properties such as lattice dynamics, thermal expansion, and mechanical response, as well as to describe adsorption thermodynamics, diffusion, and cooperative host–guest behavior in flexible frameworks. Developing reliable and transferable MLPs for MOFs remains a significant challenge due to the vast chemical and structural diversity of MOFs and the complexity of sampling guest-framework configurations. The lack of openly shared, standardized, and user-friendly MLP implementations also limits their broader adoption. This review focuses on the current progress in MLP-based modeling of MOFs, highlighting methodological advances, data-generation strategies, and active-learning protocols, while outlining the key challenges and future directions for developing transferable, accessible, and universal MLPs for the predictive design and discovery of MOFs.

机器学习势（mlp）已经成为模拟金属有机框架（mof）的变革性工具，将量子力学的准确性与大规模分子模拟所需的效率联系起来。通过直接从量子力学参考数据中学习势能面，mlp能够统一描述mof的复杂性及其与客体分子在多个长度和时间尺度上的相互作用。最近的发展表明，mlp能够模拟MOF的固有性质，如晶格动力学、热膨胀和机械响应，以及描述柔性框架中的吸附热力学、扩散和主客体合作行为。由于mof的巨大化学和结构多样性以及取样宾客框架配置的复杂性，为mof开发可靠且可转移的mlp仍然是一项重大挑战。缺乏公开共享、标准化和用户友好的MLP实现也限制了它们的广泛采用。本文重点介绍了基于mlp的mof建模的当前进展，重点介绍了方法进展、数据生成策略和主动学习协议，同时概述了开发可转移、可访问和通用的mlp用于mof预测设计和发现的关键挑战和未来方向。

{"title":"Transforming MOF Modeling with Machine-Learned Potentials: Progress and Perspectives","authors":"Omer Tayfuroglu,Seda Keskin","doi":"10.1021/acs.jcim.5c02712","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02712","url":null,"abstract":"Machine-learned potentials (MLPs) have emerged as transformative tools for modeling metal–organic frameworks (MOFs), bridging the accuracy of quantum mechanics with the efficiency required for large-scale molecular simulations. By learning the potential energy surface directly from quantum-mechanical reference data, MLPs enable a unified description of the complex nature of MOFs and their interactions with guest molecules across multiple length and time scales. Recent developments have demonstrated the capability of MLPs to model intrinsic MOF properties such as lattice dynamics, thermal expansion, and mechanical response, as well as to describe adsorption thermodynamics, diffusion, and cooperative host–guest behavior in flexible frameworks. Developing reliable and transferable MLPs for MOFs remains a significant challenge due to the vast chemical and structural diversity of MOFs and the complexity of sampling guest-framework configurations. The lack of openly shared, standardized, and user-friendly MLP implementations also limits their broader adoption. This review focuses on the current progress in MLP-based modeling of MOFs, highlighting methodological advances, data-generation strategies, and active-learning protocols, while outlining the key challenges and future directions for developing transferable, accessible, and universal MLPs for the predictive design and discovery of MOFs.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"158 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146111112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

RetNeXt: A Pretrained Model for Transfer Learning Across the MOF Adsorption Space RetNeXt：跨MOF吸附空间迁移学习的预训练模型

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-03 DOI: 10.1021/acs.jcim.5c02698

Antonios P. Sarikas,Konstantinos Gkagkas,George E. Froudakis

Because of their ultrahigh porosity and tunable chemistry, metal–organic frameworks (MOFs) have emerged as leading candidates for gas adsorption applications. Nevertheless, their combinatorial nature induces a vast chemical space, challenging traditional exploration methods. In recent years, machine learning (ML) predictive models have enabled large-scale screening, but they are typically developed for a single adsorption property. This entails that for a new property one must train a model from scratch, a process that requires large amounts of labeled data that are not always available. In our previous work, we demonstrated that combining the potential energy surface─a 3D energy image of the material─with a convolutional neural network improves sample efficiency compared to conventional ML approaches. Here we extend this framework by introducing multitask and transfer learning to foster generalization across gases and conditions, even in data-scarce scenarios. To this end, we developed RetNeXt, a multitask pretrained model on 3.2 million publicly available adsorption-related data, which can be readily adapted to new domains and adsorption tasks. RetNeXt outperforms conventional single-task transfer approaches and achieves up to a 100-fold increase in sample efficiency compared to training from scratch. As such, it can serve as a foundation for future advances in the data-driven adsorption modeling of MOFs.

由于其超高孔隙率和可调化学性质，金属有机框架（mof）已成为气体吸附应用的主要候选材料。然而，它们的组合性质带来了巨大的化学空间，挑战了传统的勘探方法。近年来，机器学习（ML）预测模型已经实现了大规模筛选，但它们通常是针对单一吸附特性开发的。这意味着对于一个新的属性，必须从头开始训练一个模型，这个过程需要大量的标记数据，而这些数据并不总是可用的。在我们之前的工作中，我们证明了与传统的机器学习方法相比，将势能表面（材料的3D能量图像）与卷积神经网络相结合可以提高样本效率。在这里，我们通过引入多任务和迁移学习来扩展该框架，以促进跨气体和条件的泛化，即使在数据稀缺的情况下也是如此。为此，我们开发了RetNeXt，这是一个基于320万公开可用的吸附相关数据的多任务预训练模型，可以很容易地适应新的领域和吸附任务。RetNeXt优于传统的单任务迁移方法，与从头开始训练相比，样本效率提高了100倍。因此，它可以作为未来数据驱动的mof吸附建模的基础。

{"title":"RetNeXt: A Pretrained Model for Transfer Learning Across the MOF Adsorption Space","authors":"Antonios P. Sarikas,Konstantinos Gkagkas,George E. Froudakis","doi":"10.1021/acs.jcim.5c02698","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02698","url":null,"abstract":"Because of their ultrahigh porosity and tunable chemistry, metal–organic frameworks (MOFs) have emerged as leading candidates for gas adsorption applications. Nevertheless, their combinatorial nature induces a vast chemical space, challenging traditional exploration methods. In recent years, machine learning (ML) predictive models have enabled large-scale screening, but they are typically developed for a single adsorption property. This entails that for a new property one must train a model from scratch, a process that requires large amounts of labeled data that are not always available. In our previous work, we demonstrated that combining the potential energy surface─a 3D energy image of the material─with a convolutional neural network improves sample efficiency compared to conventional ML approaches. Here we extend this framework by introducing multitask and transfer learning to foster generalization across gases and conditions, even in data-scarce scenarios. To this end, we developed RetNeXt, a multitask pretrained model on 3.2 million publicly available adsorption-related data, which can be readily adapted to new domains and adsorption tasks. RetNeXt outperforms conventional single-task transfer approaches and achieves up to a 100-fold increase in sample efficiency compared to training from scratch. As such, it can serve as a foundation for future advances in the data-driven adsorption modeling of MOFs.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"253 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146111137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Novel Metabolic Pathway Design Method Based on Evolutionary Algorithms and Metabolic Network Evaluation 一种基于进化算法和代谢网络评价的代谢途径设计新方法

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-03 DOI: 10.1021/acs.jcim.5c02219

Xin Zhao,Xueying Sun,Tao Zhang,Shuxin Cui,Yahui Cao,Bingzhi Li,Heng Song,Shuo Zheng

Metabolic pathway design is a fundamental aspect of metabolic engineering, playing a crucial role in the microbial synthesis of high-value compounds. While metabolic engineers recognize the prevalence of branching reactions─side reactions that divert metabolic flux toward nontarget compounds─current automated pathway design tools often focus primarily on linear pathway optimization. This focus may lead to incomplete efficiency assessments and suboptimal pathway selection due to unaccounted metabolic complexity. To address this gap, we introduce a novel metabolic pathway design method, EA-MNE (Evolutionary Algorithm-based Metabolic Network Evaluation). Within the EA-MNE method, we propose a new approach for expanding linear pathways into metabolic networks and two new evaluation criteria: (1) the number of effective branching reactions, which assesses the extent of branching impacts, and (2) the network theoretical yield, which precisely quantifies yield losses caused by branching reactions. Additionally, we integrate four key criteria─the number of effective branching reactions, network theoretical yield, network toxicity, and Gibbs free energy─for metabolic pathway design. This integrated approach provides a systematic solution for addressing branching reaction challenges, significantly improving both the accuracy of pathway evaluation and the synthetic efficiency of microbial systems.

代谢途径设计是代谢工程的一个基本方面，在微生物合成高价值化合物中起着至关重要的作用。虽然代谢工程师认识到分支反应──将代谢通量转向非目标化合物的副反应──的普遍存在，但目前的自动化途径设计工具往往主要侧重于线性途径优化。由于未考虑代谢复杂性，这种关注可能导致不完整的效率评估和次优途径选择。为了解决这一差距，我们引入了一种新的代谢途径设计方法，EA-MNE（基于进化算法的代谢网络评估）。在EA-MNE方法中，我们提出了一种将线性途径扩展到代谢网络的新方法和两个新的评估标准：(1)有效分支反应的数量，评估分支影响的程度；(2)网络理论产率，精确量化分支反应造成的产率损失。此外，我们整合了四个关键标准──有效分支反应的数量、网络理论产率、网络毒性和吉布斯自由能──用于代谢途径设计。这种综合方法为解决分支反应挑战提供了系统的解决方案，显著提高了途径评估的准确性和微生物系统的合成效率。

{"title":"A Novel Metabolic Pathway Design Method Based on Evolutionary Algorithms and Metabolic Network Evaluation","authors":"Xin Zhao,Xueying Sun,Tao Zhang,Shuxin Cui,Yahui Cao,Bingzhi Li,Heng Song,Shuo Zheng","doi":"10.1021/acs.jcim.5c02219","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02219","url":null,"abstract":"Metabolic pathway design is a fundamental aspect of metabolic engineering, playing a crucial role in the microbial synthesis of high-value compounds. While metabolic engineers recognize the prevalence of branching reactions─side reactions that divert metabolic flux toward nontarget compounds─current automated pathway design tools often focus primarily on linear pathway optimization. This focus may lead to incomplete efficiency assessments and suboptimal pathway selection due to unaccounted metabolic complexity. To address this gap, we introduce a novel metabolic pathway design method, EA-MNE (Evolutionary Algorithm-based Metabolic Network Evaluation). Within the EA-MNE method, we propose a new approach for expanding linear pathways into metabolic networks and two new evaluation criteria: (1) the number of effective branching reactions, which assesses the extent of branching impacts, and (2) the network theoretical yield, which precisely quantifies yield losses caused by branching reactions. Additionally, we integrate four key criteria─the number of effective branching reactions, network theoretical yield, network toxicity, and Gibbs free energy─for metabolic pathway design. This integrated approach provides a systematic solution for addressing branching reaction challenges, significantly improving both the accuracy of pathway evaluation and the synthetic efficiency of microbial systems.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"104 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146111087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Variational Bayesian Multi-Kernel Adaptive Deep Fusion for Microbe-Related Drug Prediction 基于变分贝叶斯多核自适应深度融合的微生物相关药物预测

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-02 DOI: 10.1021/acs.jcim.5c02748

Yingjun Ma,MingXu Luo,Liyu Yan,Yuanyuan Ma

Exploring potential microbe-drug associations (MDAs) not only facilitates drug discovery and clinical treatment but also contributes to a deeper understanding of microbial mechanisms. However, most MDA discoveries rely on biological experiments, which are time-consuming and costly. Therefore, developing an effective computational model to predict novel MDAs is of great importance. In this study, we propose a Variational Bayesian Multi-Kernel Adaptive Deep Fusion (VBMKADF) model for MDA prediction. We first integrate multiomics data to construct drug molecular graphs and a microbe hypergraph. Then, we perform multilayer graph convolution and hypergraph convolution to extract multilevel similarities of drugs and microbes, respectively. An attention mechanism is subsequently introduced to adaptively fuse these multilevel similarities, which are then incorporated into the Bayesian logistic matrix factorization framework to guide the generation of latent variable distributions. Additionally, we develop a variational Expectation-Maximization algorithm for adaptive inference of model hyperparameters and latent variables, which also guides the training of the deep learning model. Experimental results on two benchmark data sets across three scenarios show that, compared to other state-of-the-art methods, VBMKADF achieves higher AUPR, AUC, and F1 scores in both balanced and highly imbalanced settings. Moreover, case studies further confirm that VBMKADF can serve as an effective tool for MDA prediction.

探索潜在的微生物-药物关联（MDAs）不仅有助于药物发现和临床治疗，而且有助于更深入地了解微生物机制。然而，大多数MDA的发现依赖于生物实验，这既耗时又昂贵。因此，开发一种有效的计算模型来预测新的mda是非常重要的。在这项研究中，我们提出了一种用于MDA预测的变分贝叶斯多核自适应深度融合（VBMKADF）模型。我们首先整合多组学数据来构建药物分子图和微生物超图。然后，我们分别进行多层图卷积和超图卷积来提取药物和微生物的多层次相似度。随后引入注意机制自适应融合这些多层次相似性，然后将其纳入贝叶斯逻辑矩阵分解框架，以指导潜在变量分布的生成。此外，我们开发了一种变分期望最大化算法，用于模型超参数和潜在变量的自适应推理，这也指导了深度学习模型的训练。在三种情况下的两个基准数据集上的实验结果表明，与其他最先进的方法相比，VBMKADF在平衡和高度不平衡设置下都获得了更高的AUPR、AUC和F1分数。此外，案例研究进一步证实了VBMKADF可以作为MDA预测的有效工具。

{"title":"Variational Bayesian Multi-Kernel Adaptive Deep Fusion for Microbe-Related Drug Prediction","authors":"Yingjun Ma,MingXu Luo,Liyu Yan,Yuanyuan Ma","doi":"10.1021/acs.jcim.5c02748","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02748","url":null,"abstract":"Exploring potential microbe-drug associations (MDAs) not only facilitates drug discovery and clinical treatment but also contributes to a deeper understanding of microbial mechanisms. However, most MDA discoveries rely on biological experiments, which are time-consuming and costly. Therefore, developing an effective computational model to predict novel MDAs is of great importance. In this study, we propose a Variational Bayesian Multi-Kernel Adaptive Deep Fusion (VBMKADF) model for MDA prediction. We first integrate multiomics data to construct drug molecular graphs and a microbe hypergraph. Then, we perform multilayer graph convolution and hypergraph convolution to extract multilevel similarities of drugs and microbes, respectively. An attention mechanism is subsequently introduced to adaptively fuse these multilevel similarities, which are then incorporated into the Bayesian logistic matrix factorization framework to guide the generation of latent variable distributions. Additionally, we develop a variational Expectation-Maximization algorithm for adaptive inference of model hyperparameters and latent variables, which also guides the training of the deep learning model. Experimental results on two benchmark data sets across three scenarios show that, compared to other state-of-the-art methods, VBMKADF achieves higher AUPR, AUC, and F1 scores in both balanced and highly imbalanced settings. Moreover, case studies further confirm that VBMKADF can serve as an effective tool for MDA prediction.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"80 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Decoding Protein–Membrane Binding Interfaces from Surface-Fingerprint-Based Geometric Deep Learning and Molecular Dynamics Simulations 从基于表面指纹的几何深度学习和分子动力学模拟解码蛋白质-膜结合界面

IF 5.6 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling

Pub Date : 2026-02-02 DOI: 10.1021/acs.jcim.5c02566

ByungUk Park,Reid C. Van Lehn

Predicting protein–membrane interactions is a formidable challenge due to the subtle physicochemical features that distinguish membrane-binding regions of a protein surface as well as the scarcity of experimentally resolved membrane-bound protein conformations. Here, we present MaSIF-PMP, a geometric deep learning model that leverages molecular surface fingerprints to predict interfacial binding sites (IBSs) of peripheral membrane proteins (PMPs). MaSIF-PMP integrates geometric and chemical surface features to produce spatially resolved IBS predictions. Compared to existing models, MaSIF-PMP achieves superior performance for IBS classification, while feature ablation studies and transfer learning analyses reveal distinct determinants governing protein–membrane versus protein–protein interactions. We further show that molecular dynamics (MD) simulations can validate model predictions, refine IBS labels, and capture composition-dependent membrane binding patterns. These results establish MaSIF-PMP as an effective framework for IBS prediction and highlight the potential of incorporating conformational dynamics from MD to improve both the model accuracy and biological interpretability.

由于区分蛋白质表面膜结合区域的细微物理化学特征以及实验解决的膜结合蛋白质构象的稀缺性，预测蛋白质-膜相互作用是一项艰巨的挑战。在这里，我们提出了MaSIF-PMP，这是一种几何深度学习模型，利用分子表面指纹来预测外周膜蛋白（pmp）的界面结合位点（ibs）。MaSIF-PMP集成了几何和化学表面特征，以产生空间分辨的IBS预测。与现有模型相比，MaSIF-PMP在IBS分类方面表现优异，而特征消融研究和迁移学习分析揭示了蛋白质-膜与蛋白质-蛋白质相互作用的不同决定因素。我们进一步表明，分子动力学（MD）模拟可以验证模型预测，完善IBS标签，并捕获成分依赖的膜结合模式。这些结果确立了MaSIF-PMP作为IBS预测的有效框架，并强调了结合MD构象动力学来提高模型准确性和生物学可解释性的潜力。

{"title":"Decoding Protein–Membrane Binding Interfaces from Surface-Fingerprint-Based Geometric Deep Learning and Molecular Dynamics Simulations","authors":"ByungUk Park,Reid C. Van Lehn","doi":"10.1021/acs.jcim.5c02566","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02566","url":null,"abstract":"Predicting protein–membrane interactions is a formidable challenge due to the subtle physicochemical features that distinguish membrane-binding regions of a protein surface as well as the scarcity of experimentally resolved membrane-bound protein conformations. Here, we present MaSIF-PMP, a geometric deep learning model that leverages molecular surface fingerprints to predict interfacial binding sites (IBSs) of peripheral membrane proteins (PMPs). MaSIF-PMP integrates geometric and chemical surface features to produce spatially resolved IBS predictions. Compared to existing models, MaSIF-PMP achieves superior performance for IBS classification, while feature ablation studies and transfer learning analyses reveal distinct determinants governing protein–membrane versus protein–protein interactions. We further show that molecular dynamics (MD) simulations can validate model predictions, refine IBS labels, and capture composition-dependent membrane binding patterns. These results establish MaSIF-PMP as an effective framework for IBS prediction and highlight the potential of incorporating conformational dynamics from MD to improve both the model accuracy and biological interpretability.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"216 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0