QMe14S: A Comprehensive and Efficient Spectral Data Set for Small Organic Molecules

IF 4.6 2区 化学 Q2 CHEMISTRY, PHYSICAL The Journal of Physical Chemistry Letters Pub Date : 2025-04-13 DOI:10.1021/acs.jpclett.5c00839
Mingzhi Yuan, Zihan Zou, Yi Luo, Jun Jiang, Wei Hu
{"title":"QMe14S: A Comprehensive and Efficient Spectral Data Set for Small Organic Molecules","authors":"Mingzhi Yuan, Zihan Zou, Yi Luo, Jun Jiang, Wei Hu","doi":"10.1021/acs.jpclett.5c00839","DOIUrl":null,"url":null,"abstract":"Developing machine learning protocols for molecular simulations requires comprehensive and efficient data sets. Here we introduce the QMe14S data set, comprising 186,102 small organic molecules featuring 14 elements (H, B, C, N, O, F, Al, Si, P, S, Cl, As, Se, and Br) and 47 functional groups. Using density functional theory at the B3LYP/TZVP level, we optimized the geometries and calculated properties, including energy, atomic charge, atomic force, dipole moment, quadrupole moment, polarizability, octupole moment, first hyperpolarizability, and Hessian. At the same level, we obtained the harmonic IR, Raman, and NMR spectra. Furthermore, we conducted ab initio molecular dynamics simulations to generate dynamic configurations and extract nonequilibrium properties, including energy, forces, and Hessians. By leveraging our E(3)-equivariant message-passing neural network (DetaNet), we demonstrated that models trained on QMe14S outperform those trained on the previously developed QM9S data set in simulating molecular spectra. The QMe14S data set thus serves as a comprehensive benchmark for molecular simulations, offering valuable insights into structure–property relationships.","PeriodicalId":62,"journal":{"name":"The Journal of Physical Chemistry Letters","volume":"218 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Physical Chemistry Letters","FirstCategoryId":"1","ListUrlMain":"https://doi.org/10.1021/acs.jpclett.5c00839","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Developing machine learning protocols for molecular simulations requires comprehensive and efficient data sets. Here we introduce the QMe14S data set, comprising 186,102 small organic molecules featuring 14 elements (H, B, C, N, O, F, Al, Si, P, S, Cl, As, Se, and Br) and 47 functional groups. Using density functional theory at the B3LYP/TZVP level, we optimized the geometries and calculated properties, including energy, atomic charge, atomic force, dipole moment, quadrupole moment, polarizability, octupole moment, first hyperpolarizability, and Hessian. At the same level, we obtained the harmonic IR, Raman, and NMR spectra. Furthermore, we conducted ab initio molecular dynamics simulations to generate dynamic configurations and extract nonequilibrium properties, including energy, forces, and Hessians. By leveraging our E(3)-equivariant message-passing neural network (DetaNet), we demonstrated that models trained on QMe14S outperform those trained on the previously developed QM9S data set in simulating molecular spectra. The QMe14S data set thus serves as a comprehensive benchmark for molecular simulations, offering valuable insights into structure–property relationships.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
QMe14S:有机小分子综合高效光谱数据集
开发用于分子模拟的机器学习协议需要全面有效的数据集。本文介绍了QMe14S数据集,包含186,102个有机小分子,包含14种元素(H、B、C、N、O、F、Al、Si、P、S、Cl、As、Se和Br)和47个官能团。利用B3LYP/TZVP水平的密度功能理论,我们优化了几何结构并计算了其性能,包括能量、原子电荷、原子力、偶极矩、四极矩、极化率、八极矩、第一超极化率和Hessian。在同一能级上,我们得到了谐波红外光谱、拉曼光谱和核磁共振光谱。此外,我们进行从头算分子动力学模拟以生成动态构型并提取非平衡性质,包括能量,力和Hessians。通过利用我们的E(3)-等变消息传递神经网络(DetaNet),我们证明了在QMe14S上训练的模型在模拟分子光谱方面优于先前开发的QM9S数据集。因此,QMe14S数据集可以作为分子模拟的综合基准,为结构-性质关系提供有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
The Journal of Physical Chemistry Letters
The Journal of Physical Chemistry Letters CHEMISTRY, PHYSICAL-NANOSCIENCE & NANOTECHNOLOGY
CiteScore
9.60
自引率
7.00%
发文量
1519
审稿时长
1.6 months
期刊介绍: The Journal of Physical Chemistry (JPC) Letters is devoted to reporting new and original experimental and theoretical basic research of interest to physical chemists, biophysical chemists, chemical physicists, physicists, material scientists, and engineers. An important criterion for acceptance is that the paper reports a significant scientific advance and/or physical insight such that rapid publication is essential. Two issues of JPC Letters are published each month.
期刊最新文献
Direct Observation of Franck–Condon Stimulated Emission and Sub-20 fs Relaxation in Photoexcited Flavins In Situ Zn2+ Etching of MXene Coating Unlocks 5000-Cycle Dendrite-Free Zinc Anodes. Universality Encoded in Fano Lineshapes: A Decoupled Phonon Framework for Rapid Raman Parameter Extraction Ultrafast Structural Dynamics of Biomolecular Complexes Probed by Broadband Time-Resolved Circular Dichroism Halogen Engineering and Orbital Origins of Large Second-Harmonic Generation in Organic–Inorganic Hybrid Metal Halides
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1