Ioannis Stylianakis, Nikolaos Zervos, Jenn-Huei Lii, Dimitrios A. Pantazis, Antonios Kolocouris
{"title":"Conformational energies of reference organic molecules: benchmarking of common efficient computational methods against coupled cluster theory","authors":"Ioannis Stylianakis, Nikolaos Zervos, Jenn-Huei Lii, Dimitrios A. Pantazis, Antonios Kolocouris","doi":"10.1007/s10822-023-00513-5","DOIUrl":null,"url":null,"abstract":"<div><p>We selected 145 reference organic molecules that include model fragments used in computer-aided drug design. We calculated 158 conformational energies and barriers using force fields, with wide applicability in commercial and free softwares and extensive application on the calculation of conformational energies of organic molecules, e.g. the UFF and DREIDING force fields, the Allinger’s force fields MM3-96, MM3-00, MM4-8, the MM2-91 clones MMX and MM+, the MMFF94 force field, MM4, ab initio Hartree–Fock (HF) theory with different basis sets, the standard density functional theory B3LYP, the second-order post-HF MP2 theory and the Domain-based Local Pair Natural Orbital Coupled Cluster DLPNO-CCSD(T) theory, with the latter used for accurate reference values. The data set of the organic molecules includes hydrocarbons, haloalkanes, conjugated compounds, and oxygen-, nitrogen-, phosphorus- and sulphur-containing compounds. We reviewed in detail the conformational aspects of these model organic molecules providing the current understanding of the steric and electronic factors that determine the stability of low energy conformers and the literature including previous experimental observations and calculated findings. While progress on the computer hardware allows the calculations of thousands of conformations for later use in drug design projects, this study is an update from previous classical studies that used, as reference values, experimental ones using a variety of methods and different environments. The lowest mean error against the DLPNO-CCSD(T) reference was calculated for MP2 (0.35 kcal mol<sup>−1</sup>), followed by B3LYP (0.69 kcal mol<sup>−1</sup>) and the HF theories (0.81–1.0 kcal mol<sup>−1</sup>). As regards the force fields, the lowest errors were observed for the Allinger’s force fields MM3-00 (1.28 kcal mol<sup>−1</sup>), ΜΜ3-96 (1.40 kcal mol<sup>−1</sup>) and the Halgren’s MMFF94 force field (1.30 kcal mol<sup>−1</sup>) and then for the MM2-91 clones MMX (1.77 kcal mol<sup>−1</sup>) and MM+ (2.01 kcal mol<sup>−1</sup>) and MM4 (2.05 kcal mol<sup>−1</sup>). The DREIDING (3.63 kcal mol<sup>−1</sup>) and UFF (3.77 kcal mol<sup>−1</sup>) force fields have the lowest performance. These model organic molecules we used are often present as fragments in drug-like molecules. The values calculated using DLPNO-CCSD(T) make up a valuable data set for further comparisons and for improved force field parameterization.</p><h3>Graphical abstract</h3>\n <div><figure><div><div><picture><source><img></source></picture></div></div></figure></div>\n </div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"37 12","pages":"607 - 656"},"PeriodicalIF":3.0000,"publicationDate":"2023-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer-Aided Molecular Design","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10822-023-00513-5","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
We selected 145 reference organic molecules that include model fragments used in computer-aided drug design. We calculated 158 conformational energies and barriers using force fields, with wide applicability in commercial and free softwares and extensive application on the calculation of conformational energies of organic molecules, e.g. the UFF and DREIDING force fields, the Allinger’s force fields MM3-96, MM3-00, MM4-8, the MM2-91 clones MMX and MM+, the MMFF94 force field, MM4, ab initio Hartree–Fock (HF) theory with different basis sets, the standard density functional theory B3LYP, the second-order post-HF MP2 theory and the Domain-based Local Pair Natural Orbital Coupled Cluster DLPNO-CCSD(T) theory, with the latter used for accurate reference values. The data set of the organic molecules includes hydrocarbons, haloalkanes, conjugated compounds, and oxygen-, nitrogen-, phosphorus- and sulphur-containing compounds. We reviewed in detail the conformational aspects of these model organic molecules providing the current understanding of the steric and electronic factors that determine the stability of low energy conformers and the literature including previous experimental observations and calculated findings. While progress on the computer hardware allows the calculations of thousands of conformations for later use in drug design projects, this study is an update from previous classical studies that used, as reference values, experimental ones using a variety of methods and different environments. The lowest mean error against the DLPNO-CCSD(T) reference was calculated for MP2 (0.35 kcal mol−1), followed by B3LYP (0.69 kcal mol−1) and the HF theories (0.81–1.0 kcal mol−1). As regards the force fields, the lowest errors were observed for the Allinger’s force fields MM3-00 (1.28 kcal mol−1), ΜΜ3-96 (1.40 kcal mol−1) and the Halgren’s MMFF94 force field (1.30 kcal mol−1) and then for the MM2-91 clones MMX (1.77 kcal mol−1) and MM+ (2.01 kcal mol−1) and MM4 (2.05 kcal mol−1). The DREIDING (3.63 kcal mol−1) and UFF (3.77 kcal mol−1) force fields have the lowest performance. These model organic molecules we used are often present as fragments in drug-like molecules. The values calculated using DLPNO-CCSD(T) make up a valuable data set for further comparisons and for improved force field parameterization.
期刊介绍:
The Journal of Computer-Aided Molecular Design provides a form for disseminating information on both the theory and the application of computer-based methods in the analysis and design of molecules. The scope of the journal encompasses papers which report new and original research and applications in the following areas:
- theoretical chemistry;
- computational chemistry;
- computer and molecular graphics;
- molecular modeling;
- protein engineering;
- drug design;
- expert systems;
- general structure-property relationships;
- molecular dynamics;
- chemical database development and usage.