Data-driven parametrization of molecular mechanics force fields for expansive chemical space coverage†

IF 7.4 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY Chemical Science Pub Date : 2024-12-31 DOI:10.1039/D4SC06640E
Tianze Zheng, Ailun Wang, Xu Han, Yu Xia, Xingyuan Xu, Jiawei Zhan, Yu Liu, Yang Chen, Zhi Wang, Xiaojie Wu, Sheng Gong and Wen Yan
{"title":"Data-driven parametrization of molecular mechanics force fields for expansive chemical space coverage†","authors":"Tianze Zheng, Ailun Wang, Xu Han, Yu Xia, Xingyuan Xu, Jiawei Zhan, Yu Liu, Yang Chen, Zhi Wang, Xiaojie Wu, Sheng Gong and Wen Yan","doi":"10.1039/D4SC06640E","DOIUrl":null,"url":null,"abstract":"<p >A force field is a critical component in molecular dynamics simulations for computational drug discovery. It must achieve high accuracy within the constraints of molecular mechanics' (MM) limited functional forms, which offers high computational efficiency. With the rapid expansion of synthetically accessible chemical space, traditional look-up table approaches face significant challenges. In this study, we address this issue using a modern data-driven approach, developing ByteFF, an Amber-compatible force field for drug-like molecules. To create ByteFF, we generated an expansive and highly diverse molecular dataset at the B3LYP-D3(BJ)/DZVP level of theory. This dataset includes 2.4 million optimized molecular fragment geometries with analytical Hessian matrices, along with 3.2 million torsion profiles. We then trained an edge-augmented, symmetry-preserving molecular graph neural network (GNN) on this dataset, employing a carefully optimized training strategy. Our model predicts all bonded and non-bonded MM force field parameters for drug-like molecules simultaneously across a broad chemical space. ByteFF demonstrates state-of-the-art performance on various benchmark datasets, excelling in predicting relaxed geometries, torsional energy profiles, and conformational energies and forces. Its exceptional accuracy and expansive chemical space coverage make ByteFF a valuable tool for multiple stages of computational drug discovery.</p>","PeriodicalId":9909,"journal":{"name":"Chemical Science","volume":" 6","pages":" 2730-2740"},"PeriodicalIF":7.4000,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/sc/d4sc06640e?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Science","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/sc/d4sc06640e","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

A force field is a critical component in molecular dynamics simulations for computational drug discovery. It must achieve high accuracy within the constraints of molecular mechanics' (MM) limited functional forms, which offers high computational efficiency. With the rapid expansion of synthetically accessible chemical space, traditional look-up table approaches face significant challenges. In this study, we address this issue using a modern data-driven approach, developing ByteFF, an Amber-compatible force field for drug-like molecules. To create ByteFF, we generated an expansive and highly diverse molecular dataset at the B3LYP-D3(BJ)/DZVP level of theory. This dataset includes 2.4 million optimized molecular fragment geometries with analytical Hessian matrices, along with 3.2 million torsion profiles. We then trained an edge-augmented, symmetry-preserving molecular graph neural network (GNN) on this dataset, employing a carefully optimized training strategy. Our model predicts all bonded and non-bonded MM force field parameters for drug-like molecules simultaneously across a broad chemical space. ByteFF demonstrates state-of-the-art performance on various benchmark datasets, excelling in predicting relaxed geometries, torsional energy profiles, and conformational energies and forces. Its exceptional accuracy and expansive chemical space coverage make ByteFF a valuable tool for multiple stages of computational drug discovery.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
扩展化学空间覆盖的分子力学力场数据驱动参数化
在计算药物发现的分子动力学模拟中,力场是一个关键的组成部分。它必须在分子力学(MM)有限函数形式的约束下达到高精度,从而提供了很高的计算效率。随着合成可及化学空间的迅速扩大,传统的查表方法面临着重大挑战。在这项研究中,我们使用现代数据驱动的方法解决了这个问题,开发了ByteFF,一种与琥珀相容的药物类分子力场。为了创建ByteFF,我们在B3LYP-D3(BJ)/DZVP理论水平上生成了一个广泛且高度多样化的分子数据集。该数据集包括240万个优化的分子片段几何形状与分析黑森矩阵,以及320万个扭转轮廓。然后,我们在该数据集上训练了一个边缘增强、对称保持的分子图神经网络(GNN),采用了精心优化的训练策略。我们的模型在广阔的化学空间中同时预测了类药物分子的所有键合和非键合MM力场参数。ByteFF在各种基准数据集上展示了最先进的性能,在预测松弛几何形状、扭转能分布、构象能和力方面表现出色。其卓越的准确性和广泛的化学空间覆盖使ByteFF成为计算药物发现的多个阶段的宝贵工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Chemical Science
Chemical Science CHEMISTRY, MULTIDISCIPLINARY-
CiteScore
14.40
自引率
4.80%
发文量
1352
审稿时长
2.1 months
期刊介绍: Chemical Science is a journal that encompasses various disciplines within the chemical sciences. Its scope includes publishing ground-breaking research with significant implications for its respective field, as well as appealing to a wider audience in related areas. To be considered for publication, articles must showcase innovative and original advances in their field of study and be presented in a manner that is understandable to scientists from diverse backgrounds. However, the journal generally does not publish highly specialized research.
期刊最新文献
Microenvironment regulation in nanomaterial synthesis Dual-locked Strategy for Fluorescence/Chemiluminescence Dual-modal Imaging Reprogramming Targeted reactant activation and spatial charge separation for efficient photocatalytic C(sp3)–H bond oxidation Near Infrared Light Regulated Crystallization-Driven Self-Assembly: A Versatile Platform for Photo-Controlled Preparation of Uniform π-Conjugated Functional Nanostructures Directing Structural Transformation in Gold(I)-Carborane Nanoclusters to Unlock Ultrafast Hypergolic Ignition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1