首页 > 最新文献

Faraday Discussions最新文献

英文 中文
Beyond theory-driven discovery: introducing hot random search and datum-derived structures 超越理论驱动的发现:引入热随机搜索和基准衍生结构
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-08-06 DOI: 10.1039/D4FD00134F
Chris J. Pickard

Data-driven methods have transformed the prospects of the computational chemical sciences, with machine-learned interatomic potentials (MLIPs) speeding up calculations by several orders of magnitude. I reflect on theory-driven, as opposed to data-driven, discovery based on ab initio random structure searching (AIRSS), and then introduce two new methods that exploit machine-learning acceleration. I show how long high-throughput anneals, between direct structural relaxation, enabled by ephemeral data-derived potentials (EDDPs), can be incorporated into AIRSS to bias the sampling of challenging systems towards low-energy configurations. Hot AIRSS (hot-AIRSS) preserves the parallel advantage of random search, while allowing much more complex systems to be tackled. This is demonstrated through searches for complex boron structures in large unit cells. I then show how low-energy carbon structures can be directly generated from a single, experimentally determined, diamond structure. An extension to the generation of random sensible structures, candidates are stochastically generated and then optimised to minimise the difference between the EDDP environment vector and that of the reference diamond structure. The distance-based cost function is captured in an actively learned EDDP. Graphite, small nanotubes and caged, fullerene-like, structures emerge from searches using this potential, along with a rich variety of tetrahedral framework structures. Using the same approach, the pyrope, Mg3Al2(SiO4)3, garnet structure is recovered from a low-energy AIRSS structure generated in a smaller unit cell with a different chemical composition. The relationship of this approach to modern diffusion-model-based generative methods is discussed.

数据驱动方法改变了计算化学科学的前景,机器学习原子间势(MLIP)将计算速度提高了几个数量级。与数据驱动相比,我对理论驱动的发现进行了反思,并介绍了两种利用机器学习加速的新方法。我展示了如何通过短暂数据衍生电位(EDDPs)在直接结构弛豫之间进行长时间高通量退火,并将其纳入 AIRSS,从而将具有挑战性的系统取样偏向于低能配置。热 AIRSS(hot-AIRSS)保留了随机搜索的并行优势,同时允许处理更复杂的系统。我将通过搜索大单元中的复杂硼结构来证明这一点。然后,我展示了如何从实验确定的单一金刚石结构直接生成低能碳结构。作为随机合理结构生成的延伸,候选结构是随机生成的,然后进行优化,以最小化 EDDP 环境向量与参考金刚石结构环境向量之间的差异。基于距离的成本函数被捕捉到主动学习的 EDDP 中。通过使用这种势能进行搜索,出现了石墨、小型纳米管和笼状富勒烯结构,以及种类丰富的四面体框架结构。利用同样的方法,从一个化学成分不同的较小单元格中产生的低能量 AIRSS 结构中恢复了石榴石结构 Mg3Al2(SiO4)3。讨论了这种方法与基于现代扩散模型的生成方法之间的关系。
{"title":"Beyond theory-driven discovery: introducing hot random search and datum-derived structures","authors":"Chris J. Pickard","doi":"10.1039/D4FD00134F","DOIUrl":"10.1039/D4FD00134F","url":null,"abstract":"<p >Data-driven methods have transformed the prospects of the computational chemical sciences, with machine-learned interatomic potentials (MLIPs) speeding up calculations by several orders of magnitude. I reflect on theory-driven, as opposed to data-driven, discovery based on <em>ab initio</em> random structure searching (AIRSS), and then introduce two new methods that exploit machine-learning acceleration. I show how long high-throughput anneals, between direct structural relaxation, enabled by ephemeral data-derived potentials (EDDPs), can be incorporated into AIRSS to bias the sampling of challenging systems towards low-energy configurations. Hot AIRSS (hot-AIRSS) preserves the parallel advantage of random search, while allowing much more complex systems to be tackled. This is demonstrated through searches for complex boron structures in large unit cells. I then show how low-energy carbon structures can be directly generated from a single, experimentally determined, diamond structure. An extension to the generation of random sensible structures, candidates are stochastically generated and then optimised to minimise the difference between the EDDP environment vector and that of the reference diamond structure. The distance-based cost function is captured in an actively learned EDDP. Graphite, small nanotubes and caged, fullerene-like, structures emerge from searches using this potential, along with a rich variety of tetrahedral framework structures. Using the same approach, the pyrope, Mg<small><sub>3</sub></small>Al<small><sub>2</sub></small>(SiO<small><sub>4</sub></small>)<small><sub>3</sub></small>, garnet structure is recovered from a low-energy AIRSS structure generated in a smaller unit cell with a different chemical composition. The relationship of this approach to modern diffusion-model-based generative methods is discussed.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 61-84"},"PeriodicalIF":3.4,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00134f?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141945614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Re-evaluating retrosynthesis algorithms with Syntheseus† 用 Syntheseus 重新评估逆合成算法
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-08-05 DOI: 10.1039/D4FD00093E
Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr Gaiński, Philipp Seidl and Marwin H. S. Segler

Automated synthesis planning has recently re-emerged as a research area at the intersection of chemistry and machine learning. Despite the appearance of steady progress, we argue that imperfect benchmarks and inconsistent comparisons mask systematic shortcomings of existing techniques, and unnecessarily hamper progress. To remedy this, we present a synthesis planning library with an extensive benchmarking framework, called SYNTHESEUS, which promotes best practice by default, enabling consistent meaningful evaluation of single-step and multi-step synthesis planning algorithms. We demonstrate the capabilities of SYNTHESEUS by re-evaluating several previous retrosynthesis algorithms, and find that the ranking of state-of-the-art models changes in controlled evaluation experiments. We end with guidance for future works in this area, and call on the community to engage in the discussion on how to improve benchmarks for synthesis planning.

自动合成规划最近再次成为化学与机器学习交叉领域的研究热点。尽管看起来取得了稳步进展,但我们认为,不完善的基准和不一致的比较掩盖了现有技术的系统性缺陷,不必要地阻碍了进展。为了弥补这一缺陷,我们提出了一个具有广泛基准测试框架的合成规划库,名为 Syntheseus,它在默认情况下提倡最佳实践,能够对单步和多步合成规划算法进行一致而有意义的评估。我们通过重新评估之前的几种逆合成算法来证明 Syntheseus 的能力,并发现在受控评估实验中,最先进模型的排名发生了变化。最后,我们为这一领域的未来工作提供了指导,并呼吁社会各界参与讨论如何改进合成规划的基准。
{"title":"Re-evaluating retrosynthesis algorithms with Syntheseus†","authors":"Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr Gaiński, Philipp Seidl and Marwin H. S. Segler","doi":"10.1039/D4FD00093E","DOIUrl":"10.1039/D4FD00093E","url":null,"abstract":"<p >Automated synthesis planning has recently re-emerged as a research area at the intersection of chemistry and machine learning. Despite the appearance of steady progress, we argue that imperfect benchmarks and inconsistent comparisons mask systematic shortcomings of existing techniques, and unnecessarily hamper progress. To remedy this, we present a synthesis planning library with an extensive benchmarking framework, called SYNTHESEUS, which promotes best practice by default, enabling consistent meaningful evaluation of single-step and multi-step synthesis planning algorithms. We demonstrate the capabilities of SYNTHESEUS by re-evaluating several previous retrosynthesis algorithms, and find that the ranking of state-of-the-art models changes in controlled evaluation experiments. We end with guidance for future works in this area, and call on the community to engage in the discussion on how to improve benchmarks for synthesis planning.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 568-586"},"PeriodicalIF":3.4,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141945617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling ligand exchange in metal complexes with machine learning potentials† 用机器学习势能模拟金属复合物中的配体交换
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-08-03 DOI: 10.1039/D4FD00140K
Veronika Juraskova, Gers Tusha, Hanwen Zhang, Lars V. Schäfer and Fernanda Duarte

Metal ions are irreplaceable in many areas of chemistry, including (bio)catalysis, self-assembly and charge transfer processes. Yet, modelling their structural and dynamic properties in diverse chemical environments remains challenging for both force fields and ab initio methods. Here, we introduce a strategy to train machine learning potentials (MLPs) using MACE, an equivariant message-passing neural network, for metal–ligand complexes in explicit solvents. We explore the structure and ligand exchange dynamics of Mg2+ in water and Pd2+ in acetonitrile as two illustrative model systems. The trained potentials accurately reproduce equilibrium structures of the complexes in solution, including different coordination numbers and geometries. Furthermore, the MLPs can model structural changes between metal ions and ligands in the first coordination shell, and reproduce the free energy barriers for the corresponding ligand exchange. The strategy presented here provides a computationally efficient approach to model metal ions in solution, paving the way for modelling larger and more diverse metal complexes relevant to biomolecules and supramolecular assemblies.

金属离子在(生物)催化、自组装和电荷转移过程等许多化学领域都具有不可替代的作用。然而,在不同的化学环境中模拟金属离子的结构和动态特性,对于力场和自洽方法来说仍然具有挑战性。在此,我们介绍了一种利用等变信息传递神经网络 MACE 训练显式溶剂中金属配体复合物的机器学习势(MLP)的策略。我们探索了 Mg2+ 在水中和 Pd2+ 在乙腈中的结构和配体交换动力学,以此作为两个示例模型系统。经过训练的电位能准确再现复合物在溶液中的平衡结构,包括不同的配位数和几何形状。此外,MLP 还能模拟金属离子和配体在第一配位层中的结构变化,并再现相应配体交换的自由能障。本文介绍的策略提供了一种计算高效的方法来模拟溶液中的金属离子,为模拟与生物大分子和超分子组装体相关的更大型、更多样化的金属配合物铺平了道路。
{"title":"Modelling ligand exchange in metal complexes with machine learning potentials†","authors":"Veronika Juraskova, Gers Tusha, Hanwen Zhang, Lars V. Schäfer and Fernanda Duarte","doi":"10.1039/D4FD00140K","DOIUrl":"10.1039/D4FD00140K","url":null,"abstract":"<p >Metal ions are irreplaceable in many areas of chemistry, including (bio)catalysis, self-assembly and charge transfer processes. Yet, modelling their structural and dynamic properties in diverse chemical environments remains challenging for both force fields and <em>ab initio</em> methods. Here, we introduce a strategy to train machine learning potentials (MLPs) using MACE, an equivariant message-passing neural network, for metal–ligand complexes in explicit solvents. We explore the structure and ligand exchange dynamics of Mg<small><sup>2+</sup></small> in water and Pd<small><sup>2+</sup></small> in acetonitrile as two illustrative model systems. The trained potentials accurately reproduce equilibrium structures of the complexes in solution, including different coordination numbers and geometries. Furthermore, the MLPs can model structural changes between metal ions and ligands in the first coordination shell, and reproduce the free energy barriers for the corresponding ligand exchange. The strategy presented here provides a computationally efficient approach to model metal ions in solution, paving the way for modelling larger and more diverse metal complexes relevant to biomolecules and supramolecular assemblies.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 156-176"},"PeriodicalIF":3.4,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00140k?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141887128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequence determinants of protein phase separation and recognition by protein phase-separated condensates through molecular dynamics and active learning† 通过分子动力学和主动学习研究蛋白质相分离和蛋白质相分离凝聚物识别的序列决定因素
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-08-03 DOI: 10.1039/D4FD00099D
Arya Changiarath, Aayush Arya, Vasileios A. Xenidis, Jan Padeken and Lukas S. Stelzl

Elucidating how protein sequence determines the properties of disordered proteins and their phase-separated condensates is a great challenge in computational chemistry, biology, and biophysics. Quantitative molecular dynamics simulations and derived free energy values can in principle capture how a sequence encodes the chemical and biological properties of a protein. These calculations are, however, computationally demanding, even after reducing the representation by coarse-graining; exploring the large spaces of potentially relevant sequences remains a formidable task. We employ an “active learning” scheme introduced by Yang et al. (bioRxiv, 2022, https://doi.org/10.1101/2022.08.05.502972) to reduce the number of labelled examples needed from simulations, where a neural network-based model suggests the most useful examples for the next training cycle. Applying this Bayesian optimisation framework, we determine properties of protein sequences with coarse-grained molecular dynamics, which enables the network to establish sequence–property relationships for disordered proteins and their self-interactions and their interactions in phase-separated condensates. We show how iterative training with second virial coefficients derived from the simulations of disordered protein sequences leads to a rapid improvement in predicting peptide self-interactions. We employ this Bayesian approach to efficiently search for new sequences that bind to condensates of the disordered C-terminal domain (CTD) of RNA Polymerase II, by simulating molecular recognition of peptides to phase-separated condensates in coarse-grained molecular dynamics. By searching for protein sequences which prefer to self-interact rather than interact with another protein sequence we are able to shape the morphology of protein condensates and design multiphasic protein condensates.

阐明蛋白质序列如何决定无序蛋白质及其相分离凝聚物的特性,是计算化学、生物学和生物物理学的一大挑战。定量分子动力学模拟和推导出的自由能值原则上可以捕捉序列如何编码蛋白质的化学和生物特性。然而,这些计算对计算要求很高,即使在通过粗粒化减少表征之后也是如此;探索潜在相关序列的巨大空间仍然是一项艰巨的任务。我们采用了杨等人提出的 "主动学习 "方案(bioRxiv 2022.08.05.502972)来减少模拟所需的标记示例数量,其中基于神经网络的模型为下一个训练周期提出了最有用的示例。通过应用这种贝叶斯优化框架,我们用粗粒度分子动力学确定了蛋白质序列的属性,从而使网络能够建立无序蛋白质的序列属性关系及其在相分离凝聚体中的自我相互作用和相互作用。我们展示了如何利用从无序蛋白质序列模拟中得出的第二病毒系数进行迭代训练,从而快速提高肽自相互作用的预测能力。我们采用这种贝叶斯方法,通过在粗粒度分子动力学中模拟分子识别肽与相分离凝聚物的过程,有效地搜索与 RNA 聚合酶 II 的无序 C 端结构域 (CTD) 凝聚物结合的新序列。通过寻找更倾向于自我相互作用而不是与另一个蛋白质序列相互作用的蛋白质序列,我们能够塑造蛋白质凝聚物的形态并设计多相蛋白质凝聚物。
{"title":"Sequence determinants of protein phase separation and recognition by protein phase-separated condensates through molecular dynamics and active learning†","authors":"Arya Changiarath, Aayush Arya, Vasileios A. Xenidis, Jan Padeken and Lukas S. Stelzl","doi":"10.1039/D4FD00099D","DOIUrl":"10.1039/D4FD00099D","url":null,"abstract":"<p >Elucidating how protein sequence determines the properties of disordered proteins and their phase-separated condensates is a great challenge in computational chemistry, biology, and biophysics. Quantitative molecular dynamics simulations and derived free energy values can in principle capture how a sequence encodes the chemical and biological properties of a protein. These calculations are, however, computationally demanding, even after reducing the representation by coarse-graining; exploring the large spaces of potentially relevant sequences remains a formidable task. We employ an “active learning” scheme introduced by Yang <em>et al.</em> (<em>bioRxiv</em>, 2022, https://doi.org/10.1101/2022.08.05.502972) to reduce the number of labelled examples needed from simulations, where a neural network-based model suggests the most useful examples for the next training cycle. Applying this Bayesian optimisation framework, we determine properties of protein sequences with coarse-grained molecular dynamics, which enables the network to establish sequence–property relationships for disordered proteins and their self-interactions and their interactions in phase-separated condensates. We show how iterative training with second virial coefficients derived from the simulations of disordered protein sequences leads to a rapid improvement in predicting peptide self-interactions. We employ this Bayesian approach to efficiently search for new sequences that bind to condensates of the disordered C-terminal domain (CTD) of RNA Polymerase II, by simulating molecular recognition of peptides to phase-separated condensates in coarse-grained molecular dynamics. By searching for protein sequences which prefer to self-interact rather than interact with another protein sequence we are able to shape the morphology of protein condensates and design multiphasic protein condensates.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 235-254"},"PeriodicalIF":3.4,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00099d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141884834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Restoring translational symmetry in periodic all-orbital dynamical mean-field theory simulations 在周期性全轨道动态均场理论模拟中恢复平移对称性。
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-07-30 DOI: 10.1039/D4FD00068D
Jiachen Li and Tianyu Zhu

Dynamical mean-field theory (DMFT) and its cluster extensions provide an efficient Green’s function formalism to simulate spectral properties of periodic systems at the quantum many-body level. However, traditional cluster DMFT breaks translational invariance in solid-state materials, and the best strategy to capture non-local correlation effects within cluster DMFT remains elusive. In this work, we investigate the use of overlapping atom-centered impurity fragments in recently-developed ab initio all-orbital DMFT, where all local orbitals within the impurity are treated with high-level quantum chemistry impurity solvers. We demonstrate how the translational symmetry of the lattice self-energy can be restored by designing symmetry-adapted embedding problems, which results in an improved description of spectral functions in two-dimensional boron nitride monolayers and graphene at the levels of many-body perturbation theory (GW) and coupled-cluster theory. Furthermore, we study the convergence of self-energy and density of states as the embedding size is systematically expanded in one-shot and self-consistent DMFT calculations.

动态均场理论(DMFT)及其簇扩展提供了一种高效的格林函数形式,可在量子多体水平上模拟周期系统的光谱特性。然而,传统的簇均场理论打破了固态材料的平移不变性,而在簇均场理论中捕捉非局部相关效应的最佳策略仍然难以捉摸。在这项工作中,我们研究了在最近开发的 ab initio 全轨道 DMFT 中使用重叠原子中心杂质片段的问题,其中杂质内的所有局部轨道都用高级量子化学杂质求解器处理。我们展示了如何通过设计对称适配嵌入问题来恢复晶格自能的平移对称性,从而在多体扰动理论(GW)和耦合簇理论的水平上改进了对二维氮化硼单层和石墨烯中光谱函数的描述。此外,我们还研究了在单次和自洽 DMFT 计算中,随着嵌入尺寸的系统性扩大,自能和状态密度的收敛性。
{"title":"Restoring translational symmetry in periodic all-orbital dynamical mean-field theory simulations","authors":"Jiachen Li and Tianyu Zhu","doi":"10.1039/D4FD00068D","DOIUrl":"10.1039/D4FD00068D","url":null,"abstract":"<p >Dynamical mean-field theory (DMFT) and its cluster extensions provide an efficient Green’s function formalism to simulate spectral properties of periodic systems at the quantum many-body level. However, traditional cluster DMFT breaks translational invariance in solid-state materials, and the best strategy to capture non-local correlation effects within cluster DMFT remains elusive. In this work, we investigate the use of overlapping atom-centered impurity fragments in recently-developed <em>ab initio</em> all-orbital DMFT, where all local orbitals within the impurity are treated with high-level quantum chemistry impurity solvers. We demonstrate how the translational symmetry of the lattice self-energy can be restored by designing symmetry-adapted embedding problems, which results in an improved description of spectral functions in two-dimensional boron nitride monolayers and graphene at the levels of many-body perturbation theory (GW) and coupled-cluster theory. Furthermore, we study the convergence of self-energy and density of states as the embedding size is systematically expanded in one-shot and self-consistent DMFT calculations.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"254 ","pages":" 641-652"},"PeriodicalIF":3.4,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/fd/d4fd00068d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141791345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Web-BO: towards increased accessibility of Bayesian optimisation (BO) for chemistry Web-BO:提高化学贝叶斯优化(BO)的可及性
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-07-30 DOI: 10.1039/D4FD00109E
Austin M. Mroz, Piotr N. Toka, Ehecatl Antonio del Río Chanona and Kim E. Jelfs

Historically, the chemical discovery process has predominantly been a matter of trial-and-improvement, where small modifications are made to a chemical system, guided by chemical knowledge, with the aim of optimising towards a target property or combination of properties. While a trial-and-improvement approach is frequently successful, especially when assisted by the help of serendipity, the approach is incredibly time- and resource-intensive. Complicating this further, the available chemical space that could, in theory, be explored is remarkably vast. As we are faced with near infinite possibilities and limited resources, we require improved search methods to effectively move towards desired optima, e.g. chemical systems exhibiting a target property, or several desired properties. Bayesian optimisation (BO) has recently gained significant traction in chemistry, where within the BO framework, prior knowledge is used to inform and guide the search process to optimise towards desired chemical targets, e.g. optimal reaction conditions to maximise yield, or optimal catalyst exhibiting improved catalytic activity. While powerful, implementing BO algorithms in practice is largely limited to interfacing via various APIs – requiring advanced coding experience and bespoke scripts for each optimisation task. Further, it is challenging to seamlessly link these with electronic lab notebooks via a graphical user interface (GUI). Ultimately, this limits the accessibility of BO algorithms. Here, we present Web-BO, a GUI to support BO for chemical optimisation tasks. We demonstrate its performance using an open source dataset and associated emulator, and link the platform with an existing electronic lab notebook, datalab. By providing a GUI-based BO service, we hope to improve the accessibility of data-driven optimisation tools in chemistry; https://suprashare.rcs.ic.ac.uk/web-bo/.

从历史上看,化学发现过程主要是一个试验和改进的过程,即在化学知识的指导下,对化学体系进行微小的修改,目的是优化目标特性或特性组合。虽然试验和改进方法经常取得成功,尤其是在偶然性的帮助下,但这种方法需要耗费大量的时间和资源。更复杂的是,理论上可以探索的可用化学空间非常广阔。由于我们面临着近乎无限的可能性和有限的资源,我们需要改进搜索方法,以有效地实现理想的最优结果,例如,化学体系表现出一种或几种目标特性。在贝叶斯优化(BO)框架内,先验知识被用来为搜索过程提供信息和指导,以优化实现所需的化学目标,例如使产量最大化的最佳反应条件,或表现出更高催化活性的最佳催化剂。虽然 BO 算法功能强大,但在实际应用中主要局限于通过各种应用程序接口(API)进行连接,这就需要高级编码经验和为每个优化任务定制脚本。此外,通过图形用户界面(GUI)将这些算法与电子实验笔记本无缝连接起来也很有难度。最终,这限制了 BO 算法的可访问性。在此,我们提出了 Web-BO,一种支持化学优化任务中 BO 的图形用户界面。我们使用一个开源数据集和相关模拟器演示了它的性能,并将该平台与现有的电子实验笔记本 datalab 相连接。我们希望通过提供基于图形用户界面的 BO 服务,提高化学领域数据驱动优化工具的可访问性;https://suprashare.rcs.ic.ac.uk/web-bo/。
{"title":"Web-BO: towards increased accessibility of Bayesian optimisation (BO) for chemistry","authors":"Austin M. Mroz, Piotr N. Toka, Ehecatl Antonio del Río Chanona and Kim E. Jelfs","doi":"10.1039/D4FD00109E","DOIUrl":"10.1039/D4FD00109E","url":null,"abstract":"<p >Historically, the chemical discovery process has predominantly been a matter of trial-and-improvement, where small modifications are made to a chemical system, guided by chemical knowledge, with the aim of optimising towards a target property or combination of properties. While a trial-and-improvement approach is frequently successful, especially when assisted by the help of serendipity, the approach is incredibly time- and resource-intensive. Complicating this further, the available chemical space that could, in theory, be explored is remarkably vast. As we are faced with near infinite possibilities and limited resources, we require improved search methods to effectively move towards desired optima, <em>e.g.</em> chemical systems exhibiting a target property, or several desired properties. Bayesian optimisation (BO) has recently gained significant traction in chemistry, where within the BO framework, prior knowledge is used to inform and guide the search process to optimise towards desired chemical targets, <em>e.g.</em> optimal reaction conditions to maximise yield, or optimal catalyst exhibiting improved catalytic activity. While powerful, implementing BO algorithms in practice is largely limited to interfacing <em>via</em> various APIs – requiring advanced coding experience and bespoke scripts for each optimisation task. Further, it is challenging to seamlessly link these with electronic lab notebooks <em>via</em> a graphical user interface (GUI). Ultimately, this limits the accessibility of BO algorithms. Here, we present Web-BO, a GUI to support BO for chemical optimisation tasks. We demonstrate its performance using an open source dataset and associated emulator, and link the platform with an existing electronic lab notebook, <em>datalab</em>. By providing a GUI-based BO service, we hope to improve the accessibility of data-driven optimisation tools in chemistry; https://suprashare.rcs.ic.ac.uk/web-bo/.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 221-234"},"PeriodicalIF":3.4,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00109e?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141864347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tiled unitary product states for strongly correlated Hamiltonians 强相关哈密顿的平铺单元乘积态
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-07-25 DOI: 10.1039/D4FD00064A
Hugh G. A. Burton

Approximating the electronic wave function for strongly correlated systems remains a major theoretical challenge. Emerging quantum computers can enable new types of wave-function ansatz to be considered, with the potential to overcome the exponential memory storage for strong correlation. I have recently introduced the tiled Unitary Product States (tUPS) ansatz, which successfully combines the preservation of particle-number and spin symmetry with shallow quantum circuits and local qubit connectivity [H. G. A. Burton, Phys. Rev. Res., 2024, 6, 023300]. In this contribution, I investigate the accuracy of this tUPS hierarchy for strongly-correlated Hamiltonians. I consider the picket-fence pairing Hamiltonian and the two-dimensional Hubbard lattice, which collectively describe a range of strong correlation mechanisms found in molecules. Numerical results demonstrate that highly accurate energies can be achieved with a compact approximation for both weak and strong correlation in the Hubbard model, and the repulsive pairing regime. These data provide valuable insights into the applicability of the tUPS hierarchy for strong electron correlation.

逼近强相关系统的电子波函数仍然是一项重大理论挑战。新兴的量子计算机可以考虑新型的波函数解析,并有可能克服强相关性的指数级内存存储。我最近介绍了平铺单元积状态(tUPS)解析,它成功地将粒子数和自旋对称性的保留与浅量子电路和局部量子比特连通性结合起来[H. G. A. Burton, Phys. Rev. Res., 2024, 6, 023300]。在这篇论文中,我研究了强相关哈密顿的 tUPS 层次结构的准确性。我考虑了篱笆配对哈密顿和二维哈伯德晶格,它们共同描述了分子中发现的一系列强相关机制。数值结果表明,对于哈伯德模型中的弱相关和强相关,以及排斥配对机制,都可以通过紧凑近似获得高精度能量。这些数据为 tUPS 层次结构在强电子相关性方面的适用性提供了宝贵的见解。
{"title":"Tiled unitary product states for strongly correlated Hamiltonians","authors":"Hugh G. A. Burton","doi":"10.1039/D4FD00064A","DOIUrl":"10.1039/D4FD00064A","url":null,"abstract":"<p >Approximating the electronic wave function for strongly correlated systems remains a major theoretical challenge. Emerging quantum computers can enable new types of wave-function ansatz to be considered, with the potential to overcome the exponential memory storage for strong correlation. I have recently introduced the tiled Unitary Product States (tUPS) ansatz, which successfully combines the preservation of particle-number and spin symmetry with shallow quantum circuits and local qubit connectivity [H. G. A. Burton, <em>Phys. Rev. Res.</em>, 2024, <strong>6</strong>, 023300]. In this contribution, I investigate the accuracy of this tUPS hierarchy for strongly-correlated Hamiltonians. I consider the picket-fence pairing Hamiltonian and the two-dimensional Hubbard lattice, which collectively describe a range of strong correlation mechanisms found in molecules. Numerical results demonstrate that highly accurate energies can be achieved with a compact approximation for both weak and strong correlation in the Hubbard model, and the repulsive pairing regime. These data provide valuable insights into the applicability of the tUPS hierarchy for strong electron correlation.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"254 ","pages":" 157-169"},"PeriodicalIF":3.4,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/fd/d4fd00064a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141755712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovery of highly anisotropic dielectric crystals with equivariant graph neural networks† 利用等变图神经网络发现高各向异性介电晶体
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-07-25 DOI: 10.1039/D4FD00096J
Yuchen Lou and Alex M. Ganose

Anisotropy in crystals plays a pivotal role in many technological applications. For example, anisotropic electronic and thermal transport are thought to be beneficial for thermoelectric applications, while anisotropic mechanical properties are of interest for emerging metamaterials, and anisotropic dielectric materials have been suggested as a novel platform for dark matter detection. Understanding and tailoring anisotropy in crystals is therefore essential for the design of next-generation functional materials. To date, however, most data-driven approaches have focused on the prediction of scalar crystal properties, such as the spherically averaged dielectric tensor or the bulk and shear elastic moduli. Here, we adopt the latest approaches in equivariant graph neural networks to develop a model that can predict the full dielectric tensor of crystals. Our model, trained on the Materials Project dataset of c.a. 6700 dielectric tensors, achieves state-of-the-art accuracy in scalar dielectric prediction in addition to capturing the directional response. We showcase the performance of the model by discovering crystals with almost isotropic connectivity but highly anisotropic dielectric tensors, thereby broadening our knowledge of the structure–property relationships in dielectric crystals.

晶体中的各向异性在许多技术应用中起着举足轻重的作用。例如,各向异性的电子和热传输被认为有利于热电应用,而各向异性的机械特性则是新兴超材料的兴趣所在,各向异性的介电材料被认为是暗物质探测的新型平台。因此,了解和调整晶体中的各向异性对于设计下一代功能材料至关重要。然而,迄今为止,大多数数据驱动方法都侧重于标量晶体特性的预测,如球面平均介电张量或体积弹性模量和剪切弹性模量。在此,我们采用等变图神经网络的最新方法,开发出一种可预测晶体全介电常量的模型。我们的模型是在包含约 6,700 个介电张量的材料项目数据集上训练出来的,除了捕捉方向性响应外,在标量介电预测方面也达到了最先进的精度。我们通过发现具有几乎各向同性连接但介电张量高度各向异性的晶体来展示该模型的性能,从而拓宽了我们对介电晶体结构-性能关系的认识。
{"title":"Discovery of highly anisotropic dielectric crystals with equivariant graph neural networks†","authors":"Yuchen Lou and Alex M. Ganose","doi":"10.1039/D4FD00096J","DOIUrl":"10.1039/D4FD00096J","url":null,"abstract":"<p >Anisotropy in crystals plays a pivotal role in many technological applications. For example, anisotropic electronic and thermal transport are thought to be beneficial for thermoelectric applications, while anisotropic mechanical properties are of interest for emerging metamaterials, and anisotropic dielectric materials have been suggested as a novel platform for dark matter detection. Understanding and tailoring anisotropy in crystals is therefore essential for the design of next-generation functional materials. To date, however, most data-driven approaches have focused on the prediction of scalar crystal properties, such as the spherically averaged dielectric tensor or the bulk and shear elastic moduli. Here, we adopt the latest approaches in equivariant graph neural networks to develop a model that can predict the full dielectric tensor of crystals. Our model, trained on the Materials Project dataset of <em>c.a.</em> 6700 dielectric tensors, achieves state-of-the-art accuracy in scalar dielectric prediction in addition to capturing the directional response. We showcase the performance of the model by discovering crystals with almost isotropic connectivity but highly anisotropic dielectric tensors, thereby broadening our knowledge of the structure–property relationships in dielectric crystals.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"256 ","pages":" 255-274"},"PeriodicalIF":3.4,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00096j?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141774441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accurate predictions of chemical shifts with the rSCAN and r2SCAN mGGA exchange–correlation functionals† 利用 rSCAN 和 r2SCAN mGGA 交换相关函数精确预测化学位移
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-07-25 DOI: 10.1039/D4FD00142G
Jonathan R. Yates and Albert P. Bartók

We benchmark the rSCAN and r2SCAN exchange–correlation functionals by comparing the Nuclear Magnetic Resonance (NMR) magnetic shieldings predicted by Density Functional Theory (DFT) to experimentally observed chemical shifts of halide and oxide inorganic compounds. Significant improvement in accuracy is achieved compared to the Generalised Gradient Approximation (GGA) at a marginally higher computational cost. When using rSCAN or r2SCAN, the correlation coefficient between computationally predicted and experimental values approaches the theoretically expected value of −1 while reducing the deviation, allowing more accurate and reliable spectrum assignments of complex compounds in experimental investigations.

我们通过比较密度泛函理论预测的核磁共振磁屏蔽与实验观察到的卤化物和氧化物无机化合物化学位移,对 rSCAN 和 r2SCAN 交换相关函数进行了基准测试。与广义梯度近似法相比,在计算成本略有增加的情况下,精确度有了显著提高。当使用 rSCAN 或 r2SCAN 时,计算预测值与实验值之间的相关系数接近理论预期值-1,同时降低了偏差,从而使实验研究中复杂化合物的光谱分配更加准确可靠。
{"title":"Accurate predictions of chemical shifts with the rSCAN and r2SCAN mGGA exchange–correlation functionals†","authors":"Jonathan R. Yates and Albert P. Bartók","doi":"10.1039/D4FD00142G","DOIUrl":"10.1039/D4FD00142G","url":null,"abstract":"<p >We benchmark the rSCAN and r<small><sup>2</sup></small>SCAN exchange–correlation functionals by comparing the Nuclear Magnetic Resonance (NMR) magnetic shieldings predicted by Density Functional Theory (DFT) to experimentally observed chemical shifts of halide and oxide inorganic compounds. Significant improvement in accuracy is achieved compared to the Generalised Gradient Approximation (GGA) at a marginally higher computational cost. When using rSCAN or r<small><sup>2</sup></small>SCAN, the correlation coefficient between computationally predicted and experimental values approaches the theoretically expected value of −1 while reducing the deviation, allowing more accurate and reliable spectrum assignments of complex compounds in experimental investigations.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":" 0","pages":" 192-202"},"PeriodicalIF":3.4,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/fd/d4fd00142g?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spiers Memorial Lecture: Engineering biocatalysts 斯皮尔斯纪念讲座:生物催化剂工程。
IF 3.4 3区 化学 Q2 Chemistry Pub Date : 2024-07-24 DOI: 10.1039/D4FD00139G
Donald Hilvert

Enzymes are being engineered to catalyze chemical reactions for many practical applications in chemistry and biotechnology. The approaches used are surveyed in this short review, emphasizing methods for accessing reactivities not expressed by native protein scaffolds. The successful generation of completely de novo enzymes that rival the rates and selectivities of their natural counterparts highlights the potential role that designer enzymes may play in the coming years in research, industry, and medicine. Some challenges that need to be addressed to realize this ambitious dream are considered together with possible solutions.

人们正在设计酶来催化化学反应,以用于化学和生物技术领域的许多实际应用。本短文对所使用的方法进行了概述,重点介绍了获得原生蛋白质支架无法表达的反应活性的方法。完全从头开始的酶的成功生成,其速率和选择性可与天然酶媲美,这突出表明了设计酶在未来几年中可能在研究、工业和医药领域发挥的潜在作用。为实现这一宏伟梦想,需要应对一些挑战,并考虑可能的解决方案。
{"title":"Spiers Memorial Lecture: Engineering biocatalysts","authors":"Donald Hilvert","doi":"10.1039/D4FD00139G","DOIUrl":"10.1039/D4FD00139G","url":null,"abstract":"<p >Enzymes are being engineered to catalyze chemical reactions for many practical applications in chemistry and biotechnology. The approaches used are surveyed in this short review, emphasizing methods for accessing reactivities not expressed by native protein scaffolds. The successful generation of completely <em>de novo</em> enzymes that rival the rates and selectivities of their natural counterparts highlights the potential role that designer enzymes may play in the coming years in research, industry, and medicine. Some challenges that need to be addressed to realize this ambitious dream are considered together with possible solutions.</p>","PeriodicalId":49075,"journal":{"name":"Faraday Discussions","volume":"252 ","pages":" 9-28"},"PeriodicalIF":3.4,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/fd/d4fd00139g?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141750535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Faraday Discussions
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1