Data-driven methods have transformed the prospects of the computational chemical sciences, with machine-learned interatomic potentials (MLIPs) speeding up calculations by several orders of magnitude. I reflect on theory-driven, as opposed to data-driven, discovery based on ab initio random structure searching (AIRSS), and then introduce two new methods that exploit machine-learning acceleration. I show how long high-throughput anneals, between direct structural relaxation, enabled by ephemeral data-derived potentials (EDDPs), can be incorporated into AIRSS to bias the sampling of challenging systems towards low-energy configurations. Hot AIRSS (hot-AIRSS) preserves the parallel advantage of random search, while allowing much more complex systems to be tackled. This is demonstrated through searches for complex boron structures in large unit cells. I then show how low-energy carbon structures can be directly generated from a single, experimentally determined, diamond structure. An extension to the generation of random sensible structures, candidates are stochastically generated and then optimised to minimise the difference between the EDDP environment vector and that of the reference diamond structure. The distance-based cost function is captured in an actively learned EDDP. Graphite, small nanotubes and caged, fullerene-like, structures emerge from searches using this potential, along with a rich variety of tetrahedral framework structures. Using the same approach, the pyrope, Mg3Al2(SiO4)3, garnet structure is recovered from a low-energy AIRSS structure generated in a smaller unit cell with a different chemical composition. The relationship of this approach to modern diffusion-model-based generative methods is discussed.
{"title":"Beyond theory-driven discovery: introducing hot random search and datum-derived structures","authors":"Chris J. Pickard","doi":"10.1039/D4FD00134F","DOIUrl":"10.1039/D4FD00134F","url":null,"abstract":"<p >Data-driven methods have transformed the prospects of the computational chemical sciences, with machine-learned interatomic potentials (MLIPs) speeding up calculations by several orders of magnitude. I reflect on theory-driven, as opposed to data-driven, discovery based on <em>ab initio</em> random structure searching (AIRSS), and then introduce two new methods that exploit machine-learning acceleration. I show how long high-throughput anneals, between direct structural relaxation, enabled by ephemeral data-derived potentials (EDDPs), can be incorporated into AIRSS to bias the sampling of challenging systems towards low-energy configurations. Hot AIRSS (hot-AIRSS) preserves the parallel advantage of random search, while allowing much more complex systems to be tackled. This is demonstrated through searches for complex boron structures in large unit cells. I then show how low-energy carbon structures can be directly generated from a single, experimentally determined, diamond structure. An extension to the generation of random sensible structures, candidates are stochastically generated and then optimised to minimise the difference between the EDDP environment vector and that of the reference diamond structure. The distance-based cost function is captured in an actively learned EDDP. Graphite, small nanotubes and caged, fullerene-like, structures emerge from searches using this potential, along with a rich variety of tetrahedral framework structures. Using the same approach, the pyrope, Mg<small><sub>3</sub></small>Al<small><sub>2</sub></small>(SiO<small><sub>4</sub></small>)<small><sub>3</sub></small>, garnet structure is recovered from a low-energy AIRSS structure generated in a smaller unit cell with a different chemical composition. The relationship of this approach to modern diffusion-model-based generative methods is discussed.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 61-84"},"PeriodicalIF":3.4,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00134f?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141945614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr Gaiński, Philipp Seidl and Marwin H. S. Segler
Automated synthesis planning has recently re-emerged as a research area at the intersection of chemistry and machine learning. Despite the appearance of steady progress, we argue that imperfect benchmarks and inconsistent comparisons mask systematic shortcomings of existing techniques, and unnecessarily hamper progress. To remedy this, we present a synthesis planning library with an extensive benchmarking framework, called SYNTHESEUS, which promotes best practice by default, enabling consistent meaningful evaluation of single-step and multi-step synthesis planning algorithms. We demonstrate the capabilities of SYNTHESEUS by re-evaluating several previous retrosynthesis algorithms, and find that the ranking of state-of-the-art models changes in controlled evaluation experiments. We end with guidance for future works in this area, and call on the community to engage in the discussion on how to improve benchmarks for synthesis planning.
{"title":"Re-evaluating retrosynthesis algorithms with Syntheseus†","authors":"Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr Gaiński, Philipp Seidl and Marwin H. S. Segler","doi":"10.1039/D4FD00093E","DOIUrl":"10.1039/D4FD00093E","url":null,"abstract":"<p >Automated synthesis planning has recently re-emerged as a research area at the intersection of chemistry and machine learning. Despite the appearance of steady progress, we argue that imperfect benchmarks and inconsistent comparisons mask systematic shortcomings of existing techniques, and unnecessarily hamper progress. To remedy this, we present a synthesis planning library with an extensive benchmarking framework, called SYNTHESEUS, which promotes best practice by default, enabling consistent meaningful evaluation of single-step and multi-step synthesis planning algorithms. We demonstrate the capabilities of SYNTHESEUS by re-evaluating several previous retrosynthesis algorithms, and find that the ranking of state-of-the-art models changes in controlled evaluation experiments. We end with guidance for future works in this area, and call on the community to engage in the discussion on how to improve benchmarks for synthesis planning.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 568-586"},"PeriodicalIF":3.4,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141945617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Veronika Juraskova, Gers Tusha, Hanwen Zhang, Lars V. Schäfer and Fernanda Duarte
Metal ions are irreplaceable in many areas of chemistry, including (bio)catalysis, self-assembly and charge transfer processes. Yet, modelling their structural and dynamic properties in diverse chemical environments remains challenging for both force fields and ab initio methods. Here, we introduce a strategy to train machine learning potentials (MLPs) using MACE, an equivariant message-passing neural network, for metal–ligand complexes in explicit solvents. We explore the structure and ligand exchange dynamics of Mg2+ in water and Pd2+ in acetonitrile as two illustrative model systems. The trained potentials accurately reproduce equilibrium structures of the complexes in solution, including different coordination numbers and geometries. Furthermore, the MLPs can model structural changes between metal ions and ligands in the first coordination shell, and reproduce the free energy barriers for the corresponding ligand exchange. The strategy presented here provides a computationally efficient approach to model metal ions in solution, paving the way for modelling larger and more diverse metal complexes relevant to biomolecules and supramolecular assemblies.
{"title":"Modelling ligand exchange in metal complexes with machine learning potentials†","authors":"Veronika Juraskova, Gers Tusha, Hanwen Zhang, Lars V. Schäfer and Fernanda Duarte","doi":"10.1039/D4FD00140K","DOIUrl":"10.1039/D4FD00140K","url":null,"abstract":"<p >Metal ions are irreplaceable in many areas of chemistry, including (bio)catalysis, self-assembly and charge transfer processes. Yet, modelling their structural and dynamic properties in diverse chemical environments remains challenging for both force fields and <em>ab initio</em> methods. Here, we introduce a strategy to train machine learning potentials (MLPs) using MACE, an equivariant message-passing neural network, for metal–ligand complexes in explicit solvents. We explore the structure and ligand exchange dynamics of Mg<small><sup>2+</sup></small> in water and Pd<small><sup>2+</sup></small> in acetonitrile as two illustrative model systems. The trained potentials accurately reproduce equilibrium structures of the complexes in solution, including different coordination numbers and geometries. Furthermore, the MLPs can model structural changes between metal ions and ligands in the first coordination shell, and reproduce the free energy barriers for the corresponding ligand exchange. The strategy presented here provides a computationally efficient approach to model metal ions in solution, paving the way for modelling larger and more diverse metal complexes relevant to biomolecules and supramolecular assemblies.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 156-176"},"PeriodicalIF":3.4,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00140k?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141887128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Arya Changiarath, Aayush Arya, Vasileios A. Xenidis, Jan Padeken and Lukas S. Stelzl
Elucidating how protein sequence determines the properties of disordered proteins and their phase-separated condensates is a great challenge in computational chemistry, biology, and biophysics. Quantitative molecular dynamics simulations and derived free energy values can in principle capture how a sequence encodes the chemical and biological properties of a protein. These calculations are, however, computationally demanding, even after reducing the representation by coarse-graining; exploring the large spaces of potentially relevant sequences remains a formidable task. We employ an “active learning” scheme introduced by Yang et al. (bioRxiv, 2022, https://doi.org/10.1101/2022.08.05.502972) to reduce the number of labelled examples needed from simulations, where a neural network-based model suggests the most useful examples for the next training cycle. Applying this Bayesian optimisation framework, we determine properties of protein sequences with coarse-grained molecular dynamics, which enables the network to establish sequence–property relationships for disordered proteins and their self-interactions and their interactions in phase-separated condensates. We show how iterative training with second virial coefficients derived from the simulations of disordered protein sequences leads to a rapid improvement in predicting peptide self-interactions. We employ this Bayesian approach to efficiently search for new sequences that bind to condensates of the disordered C-terminal domain (CTD) of RNA Polymerase II, by simulating molecular recognition of peptides to phase-separated condensates in coarse-grained molecular dynamics. By searching for protein sequences which prefer to self-interact rather than interact with another protein sequence we are able to shape the morphology of protein condensates and design multiphasic protein condensates.
阐明蛋白质序列如何决定无序蛋白质及其相分离凝聚物的特性,是计算化学、生物学和生物物理学的一大挑战。定量分子动力学模拟和推导出的自由能值原则上可以捕捉序列如何编码蛋白质的化学和生物特性。然而,这些计算对计算要求很高,即使在通过粗粒化减少表征之后也是如此;探索潜在相关序列的巨大空间仍然是一项艰巨的任务。我们采用了杨等人提出的 "主动学习 "方案(bioRxiv 2022.08.05.502972)来减少模拟所需的标记示例数量,其中基于神经网络的模型为下一个训练周期提出了最有用的示例。通过应用这种贝叶斯优化框架,我们用粗粒度分子动力学确定了蛋白质序列的属性,从而使网络能够建立无序蛋白质的序列属性关系及其在相分离凝聚体中的自我相互作用和相互作用。我们展示了如何利用从无序蛋白质序列模拟中得出的第二病毒系数进行迭代训练,从而快速提高肽自相互作用的预测能力。我们采用这种贝叶斯方法,通过在粗粒度分子动力学中模拟分子识别肽与相分离凝聚物的过程,有效地搜索与 RNA 聚合酶 II 的无序 C 端结构域 (CTD) 凝聚物结合的新序列。通过寻找更倾向于自我相互作用而不是与另一个蛋白质序列相互作用的蛋白质序列,我们能够塑造蛋白质凝聚物的形态并设计多相蛋白质凝聚物。
{"title":"Sequence determinants of protein phase separation and recognition by protein phase-separated condensates through molecular dynamics and active learning†","authors":"Arya Changiarath, Aayush Arya, Vasileios A. Xenidis, Jan Padeken and Lukas S. Stelzl","doi":"10.1039/D4FD00099D","DOIUrl":"10.1039/D4FD00099D","url":null,"abstract":"<p >Elucidating how protein sequence determines the properties of disordered proteins and their phase-separated condensates is a great challenge in computational chemistry, biology, and biophysics. Quantitative molecular dynamics simulations and derived free energy values can in principle capture how a sequence encodes the chemical and biological properties of a protein. These calculations are, however, computationally demanding, even after reducing the representation by coarse-graining; exploring the large spaces of potentially relevant sequences remains a formidable task. We employ an “active learning” scheme introduced by Yang <em>et al.</em> (<em>bioRxiv</em>, 2022, https://doi.org/10.1101/2022.08.05.502972) to reduce the number of labelled examples needed from simulations, where a neural network-based model suggests the most useful examples for the next training cycle. Applying this Bayesian optimisation framework, we determine properties of protein sequences with coarse-grained molecular dynamics, which enables the network to establish sequence–property relationships for disordered proteins and their self-interactions and their interactions in phase-separated condensates. We show how iterative training with second virial coefficients derived from the simulations of disordered protein sequences leads to a rapid improvement in predicting peptide self-interactions. We employ this Bayesian approach to efficiently search for new sequences that bind to condensates of the disordered C-terminal domain (CTD) of RNA Polymerase II, by simulating molecular recognition of peptides to phase-separated condensates in coarse-grained molecular dynamics. By searching for protein sequences which prefer to self-interact rather than interact with another protein sequence we are able to shape the morphology of protein condensates and design multiphasic protein condensates.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 235-254"},"PeriodicalIF":3.4,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00099d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dynamical mean-field theory (DMFT) and its cluster extensions provide an efficient Green’s function formalism to simulate spectral properties of periodic systems at the quantum many-body level. However, traditional cluster DMFT breaks translational invariance in solid-state materials, and the best strategy to capture non-local correlation effects within cluster DMFT remains elusive. In this work, we investigate the use of overlapping atom-centered impurity fragments in recently-developed ab initio all-orbital DMFT, where all local orbitals within the impurity are treated with high-level quantum chemistry impurity solvers. We demonstrate how the translational symmetry of the lattice self-energy can be restored by designing symmetry-adapted embedding problems, which results in an improved description of spectral functions in two-dimensional boron nitride monolayers and graphene at the levels of many-body perturbation theory (GW) and coupled-cluster theory. Furthermore, we study the convergence of self-energy and density of states as the embedding size is systematically expanded in one-shot and self-consistent DMFT calculations.
动态均场理论(DMFT)及其簇扩展提供了一种高效的格林函数形式,可在量子多体水平上模拟周期系统的光谱特性。然而,传统的簇均场理论打破了固态材料的平移不变性,而在簇均场理论中捕捉非局部相关效应的最佳策略仍然难以捉摸。在这项工作中,我们研究了在最近开发的 ab initio 全轨道 DMFT 中使用重叠原子中心杂质片段的问题,其中杂质内的所有局部轨道都用高级量子化学杂质求解器处理。我们展示了如何通过设计对称适配嵌入问题来恢复晶格自能的平移对称性,从而在多体扰动理论(GW)和耦合簇理论的水平上改进了对二维氮化硼单层和石墨烯中光谱函数的描述。此外,我们还研究了在单次和自洽 DMFT 计算中,随着嵌入尺寸的系统性扩大,自能和状态密度的收敛性。
{"title":"Restoring translational symmetry in periodic all-orbital dynamical mean-field theory simulations","authors":"Jiachen Li and Tianyu Zhu","doi":"10.1039/D4FD00068D","DOIUrl":"10.1039/D4FD00068D","url":null,"abstract":"<p >Dynamical mean-field theory (DMFT) and its cluster extensions provide an efficient Green’s function formalism to simulate spectral properties of periodic systems at the quantum many-body level. However, traditional cluster DMFT breaks translational invariance in solid-state materials, and the best strategy to capture non-local correlation effects within cluster DMFT remains elusive. In this work, we investigate the use of overlapping atom-centered impurity fragments in recently-developed <em>ab initio</em> all-orbital DMFT, where all local orbitals within the impurity are treated with high-level quantum chemistry impurity solvers. We demonstrate how the translational symmetry of the lattice self-energy can be restored by designing symmetry-adapted embedding problems, which results in an improved description of spectral functions in two-dimensional boron nitride monolayers and graphene at the levels of many-body perturbation theory (GW) and coupled-cluster theory. Furthermore, we study the convergence of self-energy and density of states as the embedding size is systematically expanded in one-shot and self-consistent DMFT calculations.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"254 ","pages":" 641-652"},"PeriodicalIF":3.4,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/fd/d4fd00068d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141791345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Austin M. Mroz, Piotr N. Toka, Ehecatl Antonio del Río Chanona and Kim E. Jelfs
Historically, the chemical discovery process has predominantly been a matter of trial-and-improvement, where small modifications are made to a chemical system, guided by chemical knowledge, with the aim of optimising towards a target property or combination of properties. While a trial-and-improvement approach is frequently successful, especially when assisted by the help of serendipity, the approach is incredibly time- and resource-intensive. Complicating this further, the available chemical space that could, in theory, be explored is remarkably vast. As we are faced with near infinite possibilities and limited resources, we require improved search methods to effectively move towards desired optima, e.g. chemical systems exhibiting a target property, or several desired properties. Bayesian optimisation (BO) has recently gained significant traction in chemistry, where within the BO framework, prior knowledge is used to inform and guide the search process to optimise towards desired chemical targets, e.g. optimal reaction conditions to maximise yield, or optimal catalyst exhibiting improved catalytic activity. While powerful, implementing BO algorithms in practice is largely limited to interfacing via various APIs – requiring advanced coding experience and bespoke scripts for each optimisation task. Further, it is challenging to seamlessly link these with electronic lab notebooks via a graphical user interface (GUI). Ultimately, this limits the accessibility of BO algorithms. Here, we present Web-BO, a GUI to support BO for chemical optimisation tasks. We demonstrate its performance using an open source dataset and associated emulator, and link the platform with an existing electronic lab notebook, datalab. By providing a GUI-based BO service, we hope to improve the accessibility of data-driven optimisation tools in chemistry; https://suprashare.rcs.ic.ac.uk/web-bo/.
从历史上看,化学发现过程主要是一个试验和改进的过程,即在化学知识的指导下,对化学体系进行微小的修改,目的是优化目标特性或特性组合。虽然试验和改进方法经常取得成功,尤其是在偶然性的帮助下,但这种方法需要耗费大量的时间和资源。更复杂的是,理论上可以探索的可用化学空间非常广阔。由于我们面临着近乎无限的可能性和有限的资源,我们需要改进搜索方法,以有效地实现理想的最优结果,例如,化学体系表现出一种或几种目标特性。在贝叶斯优化(BO)框架内,先验知识被用来为搜索过程提供信息和指导,以优化实现所需的化学目标,例如使产量最大化的最佳反应条件,或表现出更高催化活性的最佳催化剂。虽然 BO 算法功能强大,但在实际应用中主要局限于通过各种应用程序接口(API)进行连接,这就需要高级编码经验和为每个优化任务定制脚本。此外,通过图形用户界面(GUI)将这些算法与电子实验笔记本无缝连接起来也很有难度。最终,这限制了 BO 算法的可访问性。在此,我们提出了 Web-BO,一种支持化学优化任务中 BO 的图形用户界面。我们使用一个开源数据集和相关模拟器演示了它的性能,并将该平台与现有的电子实验笔记本 datalab 相连接。我们希望通过提供基于图形用户界面的 BO 服务,提高化学领域数据驱动优化工具的可访问性;https://suprashare.rcs.ic.ac.uk/web-bo/。
{"title":"Web-BO: towards increased accessibility of Bayesian optimisation (BO) for chemistry","authors":"Austin M. Mroz, Piotr N. Toka, Ehecatl Antonio del Río Chanona and Kim E. Jelfs","doi":"10.1039/D4FD00109E","DOIUrl":"10.1039/D4FD00109E","url":null,"abstract":"<p >Historically, the chemical discovery process has predominantly been a matter of trial-and-improvement, where small modifications are made to a chemical system, guided by chemical knowledge, with the aim of optimising towards a target property or combination of properties. While a trial-and-improvement approach is frequently successful, especially when assisted by the help of serendipity, the approach is incredibly time- and resource-intensive. Complicating this further, the available chemical space that could, in theory, be explored is remarkably vast. As we are faced with near infinite possibilities and limited resources, we require improved search methods to effectively move towards desired optima, <em>e.g.</em> chemical systems exhibiting a target property, or several desired properties. Bayesian optimisation (BO) has recently gained significant traction in chemistry, where within the BO framework, prior knowledge is used to inform and guide the search process to optimise towards desired chemical targets, <em>e.g.</em> optimal reaction conditions to maximise yield, or optimal catalyst exhibiting improved catalytic activity. While powerful, implementing BO algorithms in practice is largely limited to interfacing <em>via</em> various APIs – requiring advanced coding experience and bespoke scripts for each optimisation task. Further, it is challenging to seamlessly link these with electronic lab notebooks <em>via</em> a graphical user interface (GUI). Ultimately, this limits the accessibility of BO algorithms. Here, we present Web-BO, a GUI to support BO for chemical optimisation tasks. We demonstrate its performance using an open source dataset and associated emulator, and link the platform with an existing electronic lab notebook, <em>datalab</em>. By providing a GUI-based BO service, we hope to improve the accessibility of data-driven optimisation tools in chemistry; https://suprashare.rcs.ic.ac.uk/web-bo/.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 221-234"},"PeriodicalIF":3.4,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00109e?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141864347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Approximating the electronic wave function for strongly correlated systems remains a major theoretical challenge. Emerging quantum computers can enable new types of wave-function ansatz to be considered, with the potential to overcome the exponential memory storage for strong correlation. I have recently introduced the tiled Unitary Product States (tUPS) ansatz, which successfully combines the preservation of particle-number and spin symmetry with shallow quantum circuits and local qubit connectivity [H. G. A. Burton, Phys. Rev. Res., 2024, 6, 023300]. In this contribution, I investigate the accuracy of this tUPS hierarchy for strongly-correlated Hamiltonians. I consider the picket-fence pairing Hamiltonian and the two-dimensional Hubbard lattice, which collectively describe a range of strong correlation mechanisms found in molecules. Numerical results demonstrate that highly accurate energies can be achieved with a compact approximation for both weak and strong correlation in the Hubbard model, and the repulsive pairing regime. These data provide valuable insights into the applicability of the tUPS hierarchy for strong electron correlation.
逼近强相关系统的电子波函数仍然是一项重大理论挑战。新兴的量子计算机可以考虑新型的波函数解析,并有可能克服强相关性的指数级内存存储。我最近介绍了平铺单元积状态(tUPS)解析,它成功地将粒子数和自旋对称性的保留与浅量子电路和局部量子比特连通性结合起来[H. G. A. Burton, Phys. Rev. Res., 2024, 6, 023300]。在这篇论文中,我研究了强相关哈密顿的 tUPS 层次结构的准确性。我考虑了篱笆配对哈密顿和二维哈伯德晶格,它们共同描述了分子中发现的一系列强相关机制。数值结果表明,对于哈伯德模型中的弱相关和强相关,以及排斥配对机制,都可以通过紧凑近似获得高精度能量。这些数据为 tUPS 层次结构在强电子相关性方面的适用性提供了宝贵的见解。
{"title":"Tiled unitary product states for strongly correlated Hamiltonians","authors":"Hugh G. A. Burton","doi":"10.1039/D4FD00064A","DOIUrl":"10.1039/D4FD00064A","url":null,"abstract":"<p >Approximating the electronic wave function for strongly correlated systems remains a major theoretical challenge. Emerging quantum computers can enable new types of wave-function ansatz to be considered, with the potential to overcome the exponential memory storage for strong correlation. I have recently introduced the tiled Unitary Product States (tUPS) ansatz, which successfully combines the preservation of particle-number and spin symmetry with shallow quantum circuits and local qubit connectivity [H. G. A. Burton, <em>Phys. Rev. Res.</em>, 2024, <strong>6</strong>, 023300]. In this contribution, I investigate the accuracy of this tUPS hierarchy for strongly-correlated Hamiltonians. I consider the picket-fence pairing Hamiltonian and the two-dimensional Hubbard lattice, which collectively describe a range of strong correlation mechanisms found in molecules. Numerical results demonstrate that highly accurate energies can be achieved with a compact approximation for both weak and strong correlation in the Hubbard model, and the repulsive pairing regime. These data provide valuable insights into the applicability of the tUPS hierarchy for strong electron correlation.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"254 ","pages":" 157-169"},"PeriodicalIF":3.4,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/fd/d4fd00064a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141755712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anisotropy in crystals plays a pivotal role in many technological applications. For example, anisotropic electronic and thermal transport are thought to be beneficial for thermoelectric applications, while anisotropic mechanical properties are of interest for emerging metamaterials, and anisotropic dielectric materials have been suggested as a novel platform for dark matter detection. Understanding and tailoring anisotropy in crystals is therefore essential for the design of next-generation functional materials. To date, however, most data-driven approaches have focused on the prediction of scalar crystal properties, such as the spherically averaged dielectric tensor or the bulk and shear elastic moduli. Here, we adopt the latest approaches in equivariant graph neural networks to develop a model that can predict the full dielectric tensor of crystals. Our model, trained on the Materials Project dataset of c.a. 6700 dielectric tensors, achieves state-of-the-art accuracy in scalar dielectric prediction in addition to capturing the directional response. We showcase the performance of the model by discovering crystals with almost isotropic connectivity but highly anisotropic dielectric tensors, thereby broadening our knowledge of the structure–property relationships in dielectric crystals.
{"title":"Discovery of highly anisotropic dielectric crystals with equivariant graph neural networks†","authors":"Yuchen Lou and Alex M. Ganose","doi":"10.1039/D4FD00096J","DOIUrl":"10.1039/D4FD00096J","url":null,"abstract":"<p >Anisotropy in crystals plays a pivotal role in many technological applications. For example, anisotropic electronic and thermal transport are thought to be beneficial for thermoelectric applications, while anisotropic mechanical properties are of interest for emerging metamaterials, and anisotropic dielectric materials have been suggested as a novel platform for dark matter detection. Understanding and tailoring anisotropy in crystals is therefore essential for the design of next-generation functional materials. To date, however, most data-driven approaches have focused on the prediction of scalar crystal properties, such as the spherically averaged dielectric tensor or the bulk and shear elastic moduli. Here, we adopt the latest approaches in equivariant graph neural networks to develop a model that can predict the full dielectric tensor of crystals. Our model, trained on the Materials Project dataset of <em>c.a.</em> 6700 dielectric tensors, achieves state-of-the-art accuracy in scalar dielectric prediction in addition to capturing the directional response. We showcase the performance of the model by discovering crystals with almost isotropic connectivity but highly anisotropic dielectric tensors, thereby broadening our knowledge of the structure–property relationships in dielectric crystals.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 255-274"},"PeriodicalIF":3.4,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00096j?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141774441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We benchmark the rSCAN and r2SCAN exchange–correlation functionals by comparing the Nuclear Magnetic Resonance (NMR) magnetic shieldings predicted by Density Functional Theory (DFT) to experimentally observed chemical shifts of halide and oxide inorganic compounds. Significant improvement in accuracy is achieved compared to the Generalised Gradient Approximation (GGA) at a marginally higher computational cost. When using rSCAN or r2SCAN, the correlation coefficient between computationally predicted and experimental values approaches the theoretically expected value of −1 while reducing the deviation, allowing more accurate and reliable spectrum assignments of complex compounds in experimental investigations.
{"title":"Accurate predictions of chemical shifts with the rSCAN and r2SCAN mGGA exchange–correlation functionals†","authors":"Jonathan R. Yates and Albert P. Bartók","doi":"10.1039/D4FD00142G","DOIUrl":"10.1039/D4FD00142G","url":null,"abstract":"<p >We benchmark the rSCAN and r<small><sup>2</sup></small>SCAN exchange–correlation functionals by comparing the Nuclear Magnetic Resonance (NMR) magnetic shieldings predicted by Density Functional Theory (DFT) to experimentally observed chemical shifts of halide and oxide inorganic compounds. Significant improvement in accuracy is achieved compared to the Generalised Gradient Approximation (GGA) at a marginally higher computational cost. When using rSCAN or r<small><sup>2</sup></small>SCAN, the correlation coefficient between computationally predicted and experimental values approaches the theoretically expected value of −1 while reducing the deviation, allowing more accurate and reliable spectrum assignments of complex compounds in experimental investigations.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":" 0","pages":" 192-202"},"PeriodicalIF":3.4,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00142g?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Enzymes are being engineered to catalyze chemical reactions for many practical applications in chemistry and biotechnology. The approaches used are surveyed in this short review, emphasizing methods for accessing reactivities not expressed by native protein scaffolds. The successful generation of completely de novo enzymes that rival the rates and selectivities of their natural counterparts highlights the potential role that designer enzymes may play in the coming years in research, industry, and medicine. Some challenges that need to be addressed to realize this ambitious dream are considered together with possible solutions.
{"title":"Spiers Memorial Lecture: Engineering biocatalysts","authors":"Donald Hilvert","doi":"10.1039/D4FD00139G","DOIUrl":"10.1039/D4FD00139G","url":null,"abstract":"<p >Enzymes are being engineered to catalyze chemical reactions for many practical applications in chemistry and biotechnology. The approaches used are surveyed in this short review, emphasizing methods for accessing reactivities not expressed by native protein scaffolds. The successful generation of completely <em>de novo</em> enzymes that rival the rates and selectivities of their natural counterparts highlights the potential role that designer enzymes may play in the coming years in research, industry, and medicine. Some challenges that need to be addressed to realize this ambitious dream are considered together with possible solutions.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"252 ","pages":" 9-28"},"PeriodicalIF":3.4,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/fd/d4fd00139g?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141750535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}