首页 > 最新文献

Journal of Chemical Information and Modeling 最新文献

英文 中文
A Transferable Force Field for Simulating Adsorption in Metal-Organic Frameworks with Open Metal Sites Based on the 12-6-4 Lennard-Jones Potential. 基于12-6-4 Lennard-Jones势模拟开放金属位金属-有机骨架吸附的可转移力场
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-09 Epub Date: 2026-01-24 DOI: 10.1021/acs.jcim.5c02893
Meng Du, Alan Rodriguez, Matthew Z Lin, Haoyuan Chen

Metal-organic frameworks (MOFs) that contain coordinatively unsaturated open metal sites (OMSs) provide strong host-guest interactions, making them promising sorbents for low-concentration gas adsorption applications such as direct air capture and atmospheric water harvesting. However, accurately modeling host-guest interactions involving OMSs remains challenging for classical force fields (FFs) based on the 12-6 Lennard-Jones (LJ) potential, as the polarization effect of the guest molecule induced by the positively charged OMS is not considered. Here, we introduce an FF based on the 12-6-4 LJ potential, which incorporates charge-induced dipole interactions and is parametrized against a diverse set of host-guest potential energy surfaces (PESs) obtained from density functional theory (DFT). The resulting FF, trained on a generic trimetallic cluster, performs well in both host-guest binding energetics and gas adsorption isotherms across different OMS-containing MOFs, including MOF-74 series and Cu-BTC. These results highlight the excellent transferability of our approach and its potential to enhance the accuracy and robustness of high-throughput MOF discovery workflows, particularly for gas adsorption and separation in large and diverse MOF databases.

含有协调不饱和开放金属位点(oms)的金属有机框架(mof)提供了强大的主客体相互作用,使它们成为低浓度气体吸附应用的有前途的吸附剂,如直接空气捕获和大气水收集。然而,基于12-6 Lennard-Jones (LJ)势的经典力场(FFs)中,由于未考虑正电荷OMS诱导的客体分子极化效应,涉及OMS的主客体相互作用的精确建模仍然具有挑战性。在这里,我们引入了一个基于12-6-4 LJ势的FF,它包含了电荷诱导的偶极相互作用,并根据密度泛函理论(DFT)获得的多种主客体势能面(PESs)进行参数化。所得到的FF在通用三金属簇上进行训练,在不同含ms的mof(包括MOF-74系列和Cu-BTC)上表现良好,具有主客体结合能和气体吸附等温线。这些结果突出了我们的方法的出色可转移性,以及它在提高高通量MOF发现工作流程的准确性和稳健性方面的潜力,特别是在大型和多样化的MOF数据库中的气体吸附和分离方面。
{"title":"A Transferable Force Field for Simulating Adsorption in Metal-Organic Frameworks with Open Metal Sites Based on the 12-6-4 Lennard-Jones Potential.","authors":"Meng Du, Alan Rodriguez, Matthew Z Lin, Haoyuan Chen","doi":"10.1021/acs.jcim.5c02893","DOIUrl":"10.1021/acs.jcim.5c02893","url":null,"abstract":"<p><p>Metal-organic frameworks (MOFs) that contain coordinatively unsaturated open metal sites (OMSs) provide strong host-guest interactions, making them promising sorbents for low-concentration gas adsorption applications such as direct air capture and atmospheric water harvesting. However, accurately modeling host-guest interactions involving OMSs remains challenging for classical force fields (FFs) based on the 12-6 Lennard-Jones (LJ) potential, as the polarization effect of the guest molecule induced by the positively charged OMS is not considered. Here, we introduce an FF based on the 12-6-4 LJ potential, which incorporates charge-induced dipole interactions and is parametrized against a diverse set of host-guest potential energy surfaces (PESs) obtained from density functional theory (DFT). The resulting FF, trained on a generic trimetallic cluster, performs well in both host-guest binding energetics and gas adsorption isotherms across different OMS-containing MOFs, including MOF-74 series and Cu-BTC. These results highlight the excellent transferability of our approach and its potential to enhance the accuracy and robustness of high-throughput MOF discovery workflows, particularly for gas adsorption and separation in large and diverse MOF databases.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"1704-1714"},"PeriodicalIF":5.3,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146043621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BIPE: Artificial Intelligence-Driven Peptide Bitterness Intensity Prediction Engine. BIPE:人工智能驱动的肽苦味强度预测引擎。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-09 Epub Date: 2026-01-20 DOI: 10.1021/acs.jcim.5c02678
Jianda Yue, Hua Tan, Jiawei Xu, Tingting Li, Zihui Chen, Xie Li, Zhaoyang Tang, Songping Liang, Zhonghua Liu, Ying Wang

Bitterness, alongside sour, sweet, umami, and salty tastes, constitutes one of the five basic tastes and serves as a key dimension in shaping food flavor profiles. Food protein processing readily generates bitter peptides, whose intense bitterness often leads to consumer rejection, yet these peptides frequently carry beneficial bioactivities, necessitating a trade-off between flavor and functionality. This necessitates the quantitative assessment of bitterness intensity in the early stages of product development. However, experimental assays relying on sensory evaluation and electronic tongue instruments are complex, costly, and limited in throughput, constraining the systematic identification of bitter peptides and process optimization. Here, we present BIPE (Bitterness Intensity Prediction Engine), an end-to-end regression model that integrates ESM3 protein language model representations with a multilayer perceptron readout, performing regression of bitterness thresholds in log space to directly assess bitterness intensity from sequence alone. BIPE achieves R2 = 0.9050 under 10-fold cross-validation and R2 = 0.9449 on an independent test set. BIPE accurately reproduces trends in both electronic tongue readouts and human sensory scores, demonstrating a consistent external validity across assays. Besides, BIPE accurately differentiates the bitterness intensities of soybean protein hydrolysates produced by multiple commercial proteases. Finally, systematic scanning of the complete pentapeptide sequence space by BIPE further reveals amino acid compositional patterns associated with bitterness, providing mechanistic insights. By advancing from classification to quantitative regression, BIPE enables rational design of low-bitterness peptides, supports flavor engineering and process optimization, and establishes a reusable baseline for taste modeling.

苦味与酸、甜、鲜、咸一道,构成五种基本味道之一,是塑造食物风味的关键因素。食品蛋白质加工容易产生苦味肽,其强烈的苦味经常导致消费者的排斥,但这些肽通常携带有益的生物活性,需要在风味和功能之间进行权衡。这就需要在产品开发的早期阶段对苦味强度进行定量评估。然而,依赖感官评价和电子舌仪器的实验分析复杂、昂贵且通量有限,限制了苦味肽的系统鉴定和工艺优化。在这里,我们提出了BIPE(苦味强度预测引擎),这是一个端到端回归模型,将ESM3蛋白质语言模型表示与多层感知器读出相结合,在对数空间中执行苦味阈值回归,从而直接从序列中评估苦味强度。BIPE在10倍交叉验证下R2 = 0.9050,在独立测试集上R2 = 0.9449。BIPE准确地再现了电子舌头读数和人类感官评分的趋势,在各种分析中显示出一致的外部有效性。此外,BIPE可以准确区分多种商业蛋白酶生产的大豆蛋白水解物的苦味强度。最后,通过BIPE系统扫描完整的五肽序列空间,进一步揭示了与苦味相关的氨基酸组成模式,提供了机制见解。通过从分类到定量回归,BIPE可以实现低苦味肽的合理设计,支持风味工程和工艺优化,并为味觉建模建立可重复使用的基线。
{"title":"BIPE: Artificial Intelligence-Driven Peptide Bitterness Intensity Prediction Engine.","authors":"Jianda Yue, Hua Tan, Jiawei Xu, Tingting Li, Zihui Chen, Xie Li, Zhaoyang Tang, Songping Liang, Zhonghua Liu, Ying Wang","doi":"10.1021/acs.jcim.5c02678","DOIUrl":"10.1021/acs.jcim.5c02678","url":null,"abstract":"<p><p>Bitterness, alongside sour, sweet, umami, and salty tastes, constitutes one of the five basic tastes and serves as a key dimension in shaping food flavor profiles. Food protein processing readily generates bitter peptides, whose intense bitterness often leads to consumer rejection, yet these peptides frequently carry beneficial bioactivities, necessitating a trade-off between flavor and functionality. This necessitates the quantitative assessment of bitterness intensity in the early stages of product development. However, experimental assays relying on sensory evaluation and electronic tongue instruments are complex, costly, and limited in throughput, constraining the systematic identification of bitter peptides and process optimization. Here, we present BIPE (<u>B</u>itterness <u>I</u>ntensity <u>P</u>rediction <u>E</u>ngine), an end-to-end regression model that integrates ESM3 protein language model representations with a multilayer perceptron readout, performing regression of bitterness thresholds in log space to directly assess bitterness intensity from sequence alone. BIPE achieves <i>R</i><sup>2</sup> = 0.9050 under 10-fold cross-validation and <i>R</i><sup>2</sup> = 0.9449 on an independent test set. BIPE accurately reproduces trends in both electronic tongue readouts and human sensory scores, demonstrating a consistent external validity across assays. Besides, BIPE accurately differentiates the bitterness intensities of soybean protein hydrolysates produced by multiple commercial proteases. Finally, systematic scanning of the complete pentapeptide sequence space by BIPE further reveals amino acid compositional patterns associated with bitterness, providing mechanistic insights. By advancing from classification to quantitative regression, BIPE enables rational design of low-bitterness peptides, supports flavor engineering and process optimization, and establishes a reusable baseline for taste modeling.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"1522-1538"},"PeriodicalIF":5.3,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146002639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selector: A General Python Library for Diverse Subset Selection. 选择器:一个通用的Python库,用于不同的子集选择。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-09 Epub Date: 2026-01-27 DOI: 10.1021/acs.jcim.5c01499
Fanwang Meng, Marco Martínez González, Valerii Chuiko, Alireza Tehrani, Abdul Rahman Al Nabulsi, Abigail Broscius, Hasan Khaleel, Kenneth López-Pérez, Ramón Alain Miranda-Quintana, Paul W Ayers, Farnaz Heidar-Zadeh

Selector is a free, open-source Python library for selecting diverse subsets from any dataset, making it a versatile tool across a wide range of application domains. Selector implements different subset sampling algorithms based on sample distance, similarity, and spatial partitioning along with metrics to quantify subset diversity. It is flexible and integrates seamlessly with popular Python libraries such as Scikit-Learn, demonstrating the interoperability of the implemented algorithms with data analysis workflows. Selector is an operating-system-agnostic, accessible, and easily extensible package designed with modern software development practices, including version control, unit testing, and continuous integration. Interactive quick-start notebooks, which are also web-accessible, provide user-friendly tutorials for all skill levels, showcasing applications in computational chemistry, drug discovery, and chemical library design. Additionally, a web interface has been developed that allows users to easily upload datasets, configure sampling settings, and run subset selection algorithms with no programming required. This work serves as the official release note for the Selector package, offering a technical overview of its features, use cases, and development practices that ensure its quality and maintainability.

Selector是一个免费的开源Python库,用于从任何数据集中选择不同的子集,使其成为跨广泛应用领域的通用工具。Selector基于样本距离、相似性和空间划分以及量化子集多样性的指标实现不同的子集采样算法。它是灵活的,并与流行的Python库(如Scikit-Learn)无缝集成,展示了实现算法与数据分析工作流的互操作性。Selector是一个与操作系统无关的、可访问的、易于扩展的软件包,它是用现代软件开发实践设计的,包括版本控制、单元测试和持续集成。交互式快速启动笔记本,也可通过网络访问,为所有技能水平提供用户友好的教程,展示计算化学、药物发现和化学库设计方面的应用。此外,已经开发了一个web界面,允许用户轻松上传数据集,配置采样设置,并运行子集选择算法,而无需编程。这项工作作为Selector包的官方发布说明,提供了其特性、用例和开发实践的技术概述,以确保其质量和可维护性。
{"title":"Selector: A General Python Library for Diverse Subset Selection.","authors":"Fanwang Meng, Marco Martínez González, Valerii Chuiko, Alireza Tehrani, Abdul Rahman Al Nabulsi, Abigail Broscius, Hasan Khaleel, Kenneth López-Pérez, Ramón Alain Miranda-Quintana, Paul W Ayers, Farnaz Heidar-Zadeh","doi":"10.1021/acs.jcim.5c01499","DOIUrl":"10.1021/acs.jcim.5c01499","url":null,"abstract":"<p><p>Selector is a free, open-source Python library for selecting diverse subsets from any dataset, making it a versatile tool across a wide range of application domains. Selector implements different subset sampling algorithms based on sample distance, similarity, and spatial partitioning along with metrics to quantify subset diversity. It is flexible and integrates seamlessly with popular Python libraries such as Scikit-Learn, demonstrating the interoperability of the implemented algorithms with data analysis workflows. Selector is an operating-system-agnostic, accessible, and easily extensible package designed with modern software development practices, including version control, unit testing, and continuous integration. Interactive quick-start notebooks, which are also web-accessible, provide user-friendly tutorials for all skill levels, showcasing applications in computational chemistry, drug discovery, and chemical library design. Additionally, a web interface has been developed that allows users to easily upload datasets, configure sampling settings, and run subset selection algorithms with no programming required. This work serves as the official release note for the Selector package, offering a technical overview of its features, use cases, and development practices that ensure its quality and maintainability.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"1275-1285"},"PeriodicalIF":5.3,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146049733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DFDD: A Cloud-Ready Tool for Distance-Guided Fully Dynamic Docking in Host-Guest Complexation. DFDD:用于主客综合体中距离引导全动态对接的云就绪工具。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-07 DOI: 10.1021/acs.jcim.5c02852
Kowit Hengphasatporn, Lian Duan, Ryuhei Harada, Yasuteru Shigeta

Fully dynamic sampling of host-guest inclusion remains difficult because conventional docking and conventional molecular dynamics simulations can sample inclusion, but crystal-like binding is typically stochastic and difficult to reproduce. Here, we introduce DFDD (Distance-Guided Fully Dynamic Docking), a cloud-ready implementation of the LB-PaCS-MD framework designed to capture inclusion processes via unbiased molecular dynamics in explicit solvent. DFDD automates system setup, parameter generation, iterative short-cycle MD sampling, and trajectory analysis within a single workflow that runs on Google Colab without any installation. Progress toward complexation is guided only by the host-guest center-of-mass distance, allowing force-free exploration of insertion pathways and enabling the recovery of both stable and transient binding modes. Using β-cyclodextrin as a representative host, DFDD reproduces experimentally observed inclusion geometries within minutes and reveals intermediate states along the insertion route. Optional coupling with pKaNET-Cloud enables pH-aware, stereochemically consistent ligand protonation states prior to simulation, supporting robust host-guest modeling. This Application Note provides a transparent and accessible platform for efficient host-guest complexation studies. The DFDD framework is publicly available at https://github.com/nyelidl/DFDD.

{"title":"DFDD: A Cloud-Ready Tool for Distance-Guided Fully Dynamic Docking in Host-Guest Complexation.","authors":"Kowit Hengphasatporn, Lian Duan, Ryuhei Harada, Yasuteru Shigeta","doi":"10.1021/acs.jcim.5c02852","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02852","url":null,"abstract":"<p><p>Fully dynamic sampling of host-guest inclusion remains difficult because conventional docking and conventional molecular dynamics simulations can sample inclusion, but crystal-like binding is typically stochastic and difficult to reproduce. Here, we introduce DFDD (Distance-Guided Fully Dynamic Docking), a cloud-ready implementation of the LB-PaCS-MD framework designed to capture inclusion processes via unbiased molecular dynamics in explicit solvent. DFDD automates system setup, parameter generation, iterative short-cycle MD sampling, and trajectory analysis within a single workflow that runs on Google Colab without any installation. Progress toward complexation is guided only by the host-guest center-of-mass distance, allowing force-free exploration of insertion pathways and enabling the recovery of both stable and transient binding modes. Using β-cyclodextrin as a representative host, DFDD reproduces experimentally observed inclusion geometries within minutes and reveals intermediate states along the insertion route. Optional coupling with pKaNET-Cloud enables pH-aware, stereochemically consistent ligand protonation states prior to simulation, supporting robust host-guest modeling. This Application Note provides a transparent and accessible platform for efficient host-guest complexation studies. The DFDD framework is publicly available at https://github.com/nyelidl/DFDD.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146130531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subtimizer: Computational Workflow for Structure-Guided Design of Potent and Selective Kinase Peptide Substrates. Subtimizer:有效和选择性激酶肽底物结构导向设计的计算工作流。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-07 DOI: 10.1021/acs.jcim.5c02430
Abeeb A Yekeen, Cynthia J Meyer, Melissa McCoy, Bruce Posner, Kenneth D Westover

Kinases are pivotal cell signaling regulators and prominent drug targets. Short peptide substrates are widely used in kinase activity assays essential for investigating kinase biology and drug discovery. However, designing substrates with high activity and specificity remains challenging. Here, we present Subtimizer (substrate optimizer), a streamlined computational pipeline for structure-guided kinase peptide substrate design using AlphaFold-Multimer for structure modeling, ProteinMPNN for sequence design, and AlphaFold2-based interface evaluation. Applied to five kinases, four showed substantially improved activity (up to 350%) with designed peptides. Kinetic analyses revealed >2-fold reductions in the Michaelis constant (Km), indicating improved enzyme-substrate affinity. Designed peptides for MET and ROS1 exhibited reciprocal selectivity, with 4-fold and 11-fold preferences for their intended targets, respectively. This study demonstrates AI-driven structure-guided protein design as an effective approach for developing potent and selective kinase substrates, facilitating assay development for drug discovery and functional investigation of the kinome.

{"title":"Subtimizer: Computational Workflow for Structure-Guided Design of Potent and Selective Kinase Peptide Substrates.","authors":"Abeeb A Yekeen, Cynthia J Meyer, Melissa McCoy, Bruce Posner, Kenneth D Westover","doi":"10.1021/acs.jcim.5c02430","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02430","url":null,"abstract":"<p><p>Kinases are pivotal cell signaling regulators and prominent drug targets. Short peptide substrates are widely used in kinase activity assays essential for investigating kinase biology and drug discovery. However, designing substrates with high activity and specificity remains challenging. Here, we present Subtimizer (<u>sub</u>strate op<u>timizer</u>), a streamlined computational pipeline for structure-guided kinase peptide substrate design using AlphaFold-Multimer for structure modeling, ProteinMPNN for sequence design, and AlphaFold2-based interface evaluation. Applied to five kinases, four showed substantially improved activity (up to 350%) with designed peptides. Kinetic analyses revealed >2-fold reductions in the Michaelis constant (<i>K</i><sub>m</sub>), indicating improved enzyme-substrate affinity. Designed peptides for MET and ROS1 exhibited reciprocal selectivity, with 4-fold and 11-fold preferences for their intended targets, respectively. This study demonstrates AI-driven structure-guided protein design as an effective approach for developing potent and selective kinase substrates, facilitating assay development for drug discovery and functional investigation of the kinome.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146130549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Janus-QUBO: A Duality-Aware Framework for Navigating Chemical Space with a Tunable Quantum-Inspired Landscape. Janus-QUBO:用可调量子景观导航化学空间的二元感知框架。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-07 DOI: 10.1021/acs.jcim.5c02820
Dinghao Liu, Wenyu Zhu, Yuanpeng Fu, Xinyi Wang, Yuchen Zhou, Mengzhen Guo, Jun Liao

Discovering novel molecules within the vast chemical space is a central scientific challenge, increasingly delegated to deep generative models. However, the prevailing "black box" paradigm, built on continuous latent spaces, faces a fundamental mismatch between smooth optimization landscapes and inherently discrete molecular structures, often limiting global exploration. To overcome these limitations, we introduce Janus, a framework that recasts molecular design as a transparent, physics-inspired combinatorial optimization problem. At its core, Janus employs a Transformer-based autoencoder with a regularized binary bottleneck to map molecules into a compact discrete latent space. This representation enables the reformulation of molecular generation and optimization as a Quadratic Unconstrained Binary Optimization (QUBO) problem. This approach unlocks synergistic capabilities. For molecular generation, Janus leverages classical and quantum annealers to efficiently traverse the structured energy landscape, enabling the global discovery of diverse chemical scaffolds. Crucially, for molecular optimization, it moves beyond blind search by utilizing quantifiable feature interactions as machine-discovered SAR rules. This allows for rational, interpretable optimization─selectively modifying latent bits to enhance properties. Benchmarking against state-of-the-art methods reveals that this approach achieves superior multiobjective performance while preserving scaffold integrity, avoiding the structural fragmentation common in heuristic baselines. We validate the feasibility of the workflow on a quantum annealer and demonstrate its efficacy in drug-like property optimization. By unifying powerful combinatorial exploration with deep model interpretability, Janus establishes a robust framework for rational, quantum-assisted molecular design.

{"title":"Janus-QUBO: A Duality-Aware Framework for Navigating Chemical Space with a Tunable Quantum-Inspired Landscape.","authors":"Dinghao Liu, Wenyu Zhu, Yuanpeng Fu, Xinyi Wang, Yuchen Zhou, Mengzhen Guo, Jun Liao","doi":"10.1021/acs.jcim.5c02820","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02820","url":null,"abstract":"<p><p>Discovering novel molecules within the vast chemical space is a central scientific challenge, increasingly delegated to deep generative models. However, the prevailing \"black box\" paradigm, built on continuous latent spaces, faces a fundamental mismatch between smooth optimization landscapes and inherently discrete molecular structures, often limiting global exploration. To overcome these limitations, we introduce Janus, a framework that recasts molecular design as a transparent, physics-inspired combinatorial optimization problem. At its core, Janus employs a Transformer-based autoencoder with a regularized binary bottleneck to map molecules into a compact discrete latent space. This representation enables the reformulation of molecular generation and optimization as a Quadratic Unconstrained Binary Optimization (QUBO) problem. This approach unlocks synergistic capabilities. For molecular generation, Janus leverages classical and quantum annealers to efficiently traverse the structured energy landscape, enabling the global discovery of diverse chemical scaffolds. Crucially, for molecular optimization, it moves beyond blind search by utilizing quantifiable feature interactions as machine-discovered SAR rules. This allows for rational, interpretable optimization─selectively modifying latent bits to enhance properties. Benchmarking against state-of-the-art methods reveals that this approach achieves superior multiobjective performance while preserving scaffold integrity, avoiding the structural fragmentation common in heuristic baselines. We validate the feasibility of the workflow on a quantum annealer and demonstrate its efficacy in drug-like property optimization. By unifying powerful combinatorial exploration with deep model interpretability, Janus establishes a robust framework for rational, quantum-assisted molecular design.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146130505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blind Challenges Let Us See the Path Forward for Predictive Models. 盲目的挑战让我们看到预测模型的前进道路。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-06 DOI: 10.1021/acs.jcim.6c00205
John D Chodera, W Patrick Walters, Sriram Kosuri, James S Fraser

The rapid proliferation of AI/ML models in drug discovery heralds an era of extraordinary progress but also raises urgent questions about whether the true predictive performance is as good as advertised. On-target prediction models often benefit from high-resolution structural or atomistic representations that capture the subtleties of binding affinity and pose. In contrast, off-target and ADMET liabilities have typically relied on more implicit representations of molecular interactions. Retrospective benchmarks often provide a misleading picture of how successful these diverse representations are at predicting properties, and the community lacks standardized, prospective comparisons. Blind challenges, such as the OpenADMET × ASAP × PolarisHub Challenge featured in this issue, are crucial for realistically evaluating progress, encouraging iterations, and directing collective efforts toward major accuracy barriers. With ongoing investment in large-scale, open data creation, and community-led challenges, predictive modeling is poised to rapidly transform drug discovery by enabling accurate, multiparameter optimization.

{"title":"Blind Challenges Let Us See the Path Forward for Predictive Models.","authors":"John D Chodera, W Patrick Walters, Sriram Kosuri, James S Fraser","doi":"10.1021/acs.jcim.6c00205","DOIUrl":"https://doi.org/10.1021/acs.jcim.6c00205","url":null,"abstract":"<p><p>The rapid proliferation of AI/ML models in drug discovery heralds an era of extraordinary progress but also raises urgent questions about whether the true predictive performance is as good as advertised. On-target prediction models often benefit from high-resolution structural or atomistic representations that capture the subtleties of binding affinity and pose. In contrast, off-target and ADMET liabilities have typically relied on more implicit representations of molecular interactions. Retrospective benchmarks often provide a misleading picture of how successful these diverse representations are at predicting properties, and the community lacks standardized, prospective comparisons. Blind challenges, such as the OpenADMET × ASAP × PolarisHub Challenge featured in this issue, are crucial for realistically evaluating progress, encouraging iterations, and directing collective efforts toward major accuracy barriers. With ongoing investment in large-scale, open data creation, and community-led challenges, predictive modeling is poised to rapidly transform drug discovery by enabling accurate, multiparameter optimization.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146130520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
xTB-Based High-Throughput Screening of TADF Emitters: 747-Molecule Benchmark. 基于xtb的TADF发射体高通量筛选:747-分子基准。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-06 DOI: 10.1021/acs.jcim.5c02978
Jean-Pierre Tchapet Njafa, Elvira Vanelle Kameni Tcheuffa, Aissatou Maghame Foumkpou, Serge Guy Nana Engo

We validate semiempirical sTDA-xTB and sTD-DFT-xTB methods for high-throughput screening of thermally activated delayed fluorescence (TADF) emitters using 747 experimentally characterized molecules─the largest such benchmark to date. Our framework achieves >99% computational cost reduction versus TD-DFT while maintaining strong internal consistency (Pearson r ≈ 0.82) and reasonable agreement with 312 experimental singlet-triplet gaps (MAE ≈ 0.17 eV). Large-scale analysis statistically validates key design principles: D-A-D architectures outperform other motifs, and optimal torsional angles of 50°-90° maximize TADF efficiency, while PCA confirms a low-dimensional property space. This work establishes xTB methods as cost-effective tools for accelerating TADF discovery.

{"title":"xTB-Based High-Throughput Screening of TADF Emitters: 747-Molecule Benchmark.","authors":"Jean-Pierre Tchapet Njafa, Elvira Vanelle Kameni Tcheuffa, Aissatou Maghame Foumkpou, Serge Guy Nana Engo","doi":"10.1021/acs.jcim.5c02978","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02978","url":null,"abstract":"<p><p>We validate semiempirical sTDA-xTB and sTD-DFT-xTB methods for high-throughput screening of thermally activated delayed fluorescence (TADF) emitters using 747 experimentally characterized molecules─the largest such benchmark to date. Our framework achieves >99% computational cost reduction versus TD-DFT while maintaining strong internal consistency (Pearson <i>r</i> ≈ 0.82) and reasonable agreement with 312 experimental singlet-triplet gaps (MAE ≈ 0.17 eV). Large-scale analysis statistically validates key design principles: D-A-D architectures outperform other motifs, and optimal torsional angles of 50<sup>°</sup>-90<sup>°</sup> maximize TADF efficiency, while PCA confirms a low-dimensional property space. This work establishes xTB methods as cost-effective tools for accelerating TADF discovery.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146123227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Minor Sequence Changes Enable Mechanistic Diversity in MFS Transporters? An Atomic-Level Rationale for Symport Emergence in NarU. 微小的序列改变如何使MFS转运蛋白的机制多样性?NarU中同体出现的原子级理论基础。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-06 DOI: 10.1021/acs.jcim.5c02971
Tanner J Dean, Jiangyan Feng, Diwakar Shukla

Closely related membrane transporters can diverge sharply in their modes of transport despite minimal sequence differences, underscoring how minor structural features can alter the transport function. This divergence is exemplified in nitrate and nitrite transport across bacterial membranes, which supports anaerobic respiration and involves the major facilitator superfamily (MFS) transporters NarK and NarU. NarK operates as a nitrate/nitrite antiporter, whereas NarU's mechanism remains unresolved, with evidence suggesting potential symport activity. Using extensive adaptive molecular dynamics simulations and Markov State Modeling, we mapped NarU's conformational free-energy landscape and assessed how its behavior contrasts with mechanistic principles established for NarK. NarU follows a similar gating pathway but displays pronounced asymmetry favoring the outward-facing state and stabilizes an apo-occluded intermediate inaccessible to antiporters. This state arises from rotation of an arginine gating pair and a hinged glycine substitution that enhances gate flexibility. These sequence-dependent adaptations alter gating energetics and reprogram the scaffold to permit coupled cotransport. Our results show that the presence of a few strategic residue substitutions in the binding pocket and translocation pathway could alter the transport mechanism of transporters with high sequence and structural similarity.

{"title":"How Minor Sequence Changes Enable Mechanistic Diversity in MFS Transporters? An Atomic-Level Rationale for Symport Emergence in NarU.","authors":"Tanner J Dean, Jiangyan Feng, Diwakar Shukla","doi":"10.1021/acs.jcim.5c02971","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02971","url":null,"abstract":"<p><p>Closely related membrane transporters can diverge sharply in their modes of transport despite minimal sequence differences, underscoring how minor structural features can alter the transport function. This divergence is exemplified in nitrate and nitrite transport across bacterial membranes, which supports anaerobic respiration and involves the major facilitator superfamily (MFS) transporters NarK and NarU. NarK operates as a nitrate/nitrite antiporter, whereas NarU's mechanism remains unresolved, with evidence suggesting potential symport activity. Using extensive adaptive molecular dynamics simulations and Markov State Modeling, we mapped NarU's conformational free-energy landscape and assessed how its behavior contrasts with mechanistic principles established for NarK. NarU follows a similar gating pathway but displays pronounced asymmetry favoring the outward-facing state and stabilizes an <i>apo</i>-occluded intermediate inaccessible to antiporters. This state arises from rotation of an arginine gating pair and a hinged glycine substitution that enhances gate flexibility. These sequence-dependent adaptations alter gating energetics and reprogram the scaffold to permit coupled cotransport. Our results show that the presence of a few strategic residue substitutions in the binding pocket and translocation pathway could alter the transport mechanism of transporters with high sequence and structural similarity.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146123223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
More Accurate Binding Affinity Prediction Using Protein Homology and Ligand-Based Transfer Learning. 利用蛋白质同源性和基于配体的转移学习进行更准确的结合亲和力预测。
IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2026-02-06 DOI: 10.1021/acs.jcim.5c02334
Justin Purnomo, Caitlin Kim, Kunyang Sun, Yingze Wang, Teresa Head-Gordon

Accurate and rapid prediction of protein-ligand binding affinities is critical for drug discovery, particularly when evaluating large chemical libraries or new drug molecules from high-throughput generative models. We present UCBbind, a hybrid framework that combines a similarity-based transfer module with a deep-learning-based prediction module, to efficiently estimate binding affinities of small molecules to target proteins. For each query protein-ligand pair, UCBbind transfers experimental data from highly similar reference pairs when available and applies the prediction module when no sufficiently similar reference exists. We benchmarked UCBbind on multiple datasets, including the CASF-2016 set, the HiQBind dataset post 2020, and the COVID Moonshot database. Our results show that UCBbind achieves state-of-the-art predictive performance, particularly for test entries with high similarity to well-characterized reference proteins and ligands, and can support downstream tasks such as binding site prediction and binder/nonbinder classification.

{"title":"More Accurate Binding Affinity Prediction Using Protein Homology and Ligand-Based Transfer Learning.","authors":"Justin Purnomo, Caitlin Kim, Kunyang Sun, Yingze Wang, Teresa Head-Gordon","doi":"10.1021/acs.jcim.5c02334","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c02334","url":null,"abstract":"<p><p>Accurate and rapid prediction of protein-ligand binding affinities is critical for drug discovery, particularly when evaluating large chemical libraries or new drug molecules from high-throughput generative models. We present UCBbind, a hybrid framework that combines a similarity-based transfer module with a deep-learning-based prediction module, to efficiently estimate binding affinities of small molecules to target proteins. For each query protein-ligand pair, UCBbind transfers experimental data from highly similar reference pairs when available and applies the prediction module when no sufficiently similar reference exists. We benchmarked UCBbind on multiple datasets, including the CASF-2016 set, the HiQBind dataset post 2020, and the COVID Moonshot database. Our results show that UCBbind achieves state-of-the-art predictive performance, particularly for test entries with high similarity to well-characterized reference proteins and ligands, and can support downstream tasks such as binding site prediction and binder/nonbinder classification.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.3,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146130590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemical Information and Modeling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1