首页 > 最新文献

Faraday Discussions最新文献

英文 中文
Concluding remarks: Faraday Discussion on NMR crystallography. 结束语:法拉第核磁共振晶体学讨论。
IF 3.3 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-10-18 DOI: 10.1039/d4fd00155a
Sharon E Ashbrook

This Faraday Discussion explored the field of NMR crystallography, and considered recent developments in experimental and theoretical approaches, new advances in machine learning and in the generation and handling of large amounts of data. Applications to a wide range of disordered, amorphous and dynamic systems demonstrated the range and quality of information available from this approach and the challenges that are faced in exploiting automation and developing best practice. In these closing remarks I will reflect on the discussions on the current state of the art, questions about what we want from these studies, how accurate we need results to be, how we best generate models for complex materials and what machine learning approaches can offer. These remarks close with thoughts about the future direction of the field, who will be carrying out this type of research, how they might be doing it and what their focus will be, along with likely possible challenges and opportunities.

本次法拉第讨论会探讨了核磁共振晶体学领域,并审议了实验和理论方法的最新发展、机器学习的新进展以及大量数据的生成和处理。对各种无序、无定形和动态系统的应用展示了这种方法可提供的信息的范围和质量,以及在利用自动化和开发最佳实践方面所面临的挑战。在结束语中,我将反思有关当前技术水平的讨论、我们希望从这些研究中得到什么、我们需要多精确的结果、我们如何最好地为复杂材料生成模型以及机器学习方法可以提供什么等问题。最后,我将对该领域的未来发展方向、谁将开展此类研究、如何开展研究、研究重点以及可能面临的挑战和机遇进行思考。
{"title":"Concluding remarks: <i>Faraday Discussion</i> on NMR crystallography.","authors":"Sharon E Ashbrook","doi":"10.1039/d4fd00155a","DOIUrl":"10.1039/d4fd00155a","url":null,"abstract":"<p><p>This <i>Faraday Discussion</i> explored the field of NMR crystallography, and considered recent developments in experimental and theoretical approaches, new advances in machine learning and in the generation and handling of large amounts of data. Applications to a wide range of disordered, amorphous and dynamic systems demonstrated the range and quality of information available from this approach and the challenges that are faced in exploiting automation and developing best practice. In these closing remarks I will reflect on the discussions on the current state of the art, questions about what we want from these studies, how accurate we need results to be, how we best generate models for complex materials and what machine learning approaches can offer. These remarks close with thoughts about the future direction of the field, who will be carrying out this type of research, how they might be doing it and what their focus will be, along with likely possible challenges and opportunities.</p>","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142453621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large property models: a new generative machine-learning formulation for molecules.
IF 3.3 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-09-27 DOI: 10.1039/d4fd00113c
Tianfan Jin, Veerupaksh Singla, Hsuan-Hao Hsu, Brett M Savoie

Generative models for the inverse design of molecules with particular properties have been heavily hyped, but have yet to demonstrate significant gains over machine-learning-augmented expert intuition. A major challenge of such models is their limited accuracy in predicting molecules with targeted properties in the data-scarce regime, which is the regime typical of the prized outliers that it is hoped inverse models will discover. For example, activity data for a drug target or stability data for a material may only number in the tens to hundreds of samples, which is insufficient to learn an accurate and reasonably general property-to-structure inverse mapping from scratch. We've hypothesized that the property-to-structure mapping becomes unique when a sufficient number of properties are supplied to the models during training. This hypothesis has several important corollaries if true. It would imply that data-scarce properties can be completely determined using a set of more accessible molecular properties. It would also imply that a generative model trained on multiple properties would exhibit an accuracy phase transition after achieving a sufficient size-a process analogous to what has been observed in the context of large language models. To interrogate these behaviors, we have built the first transformers trained on the property-to-molecular-graph task, which we dub "large property models" (LPMs). A key ingredient is supplementing these models during training with relatively basic but abundant chemical property data. The motivation for the large-property-model paradigm, the model architectures, and case studies are presented here.

{"title":"Large property models: a new generative machine-learning formulation for molecules.","authors":"Tianfan Jin, Veerupaksh Singla, Hsuan-Hao Hsu, Brett M Savoie","doi":"10.1039/d4fd00113c","DOIUrl":"https://doi.org/10.1039/d4fd00113c","url":null,"abstract":"<p><p>Generative models for the inverse design of molecules with particular properties have been heavily hyped, but have yet to demonstrate significant gains over machine-learning-augmented expert intuition. A major challenge of such models is their limited accuracy in predicting molecules with targeted properties in the data-scarce regime, which is the regime typical of the prized outliers that it is hoped inverse models will discover. For example, activity data for a drug target or stability data for a material may only number in the tens to hundreds of samples, which is insufficient to learn an accurate and reasonably general property-to-structure inverse mapping from scratch. We've hypothesized that the property-to-structure mapping becomes unique when a sufficient number of properties are supplied to the models during training. This hypothesis has several important corollaries if true. It would imply that data-scarce properties can be completely determined using a set of more accessible molecular properties. It would also imply that a generative model trained on multiple properties would exhibit an accuracy phase transition after achieving a sufficient size-a process analogous to what has been observed in the context of large language models. To interrogate these behaviors, we have built the first transformers trained on the property-to-molecular-graph task, which we dub \"large property models\" (LPMs). A key ingredient is supplementing these models during training with relatively basic but abundant chemical property data. The motivation for the large-property-model paradigm, the model architectures, and case studies are presented here.</p>","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142805623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of uncertainty of neural fingerprint-based models. 基于神经指纹模型的不确定性分析。
IF 3.3 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-09-25 DOI: 10.1039/d4fd00095a
Christian W Feldmann, Jochen Sieg, Miriam Mathea

Machine learning has gained popularity for predicting molecular properties based on molecular structure. This study explores the uncertainty estimates of neural fingerprint-based models by comparing pure graph neural networks (GNN) to classical machine learning algorithms combined with neural fingerprints. We investigate the advantage of extracting the neural fingerprint from the GNN and integrating it into a method known for producing better-calibrated probability estimates. Comparisons are made using three classical machine learning methods and the Chemprop model, considering different molecular representations and calibration techniques. We utilize 19 datasets from Toxcast, reflecting real-world scenarios with balanced accuracies ranging from 0.6 to 0.8. Results demonstrate that neural fingerprints combined with classical machine learning methods exhibit a slight decrease in prediction performance compared to the native Chemprop model. However, these models provide significantly improved uncertainty estimates. Notably, uncertainty estimates of neural fingerprint-based methods remain relatively robust for molecules dissimilar to the training set. This suggests that methods like random forest with neural fingerprints can deliver strong prediction performance and reliable uncertainty estimates. When considering both performance and uncertainty, the calibrated Chemprop model and the combination of neural fingerprints with random forest or support vector classifier (SVC) yield comparable results. Surprisingly, the SVC method shows promising performance when combined with neural or count fingerprints. These findings are particularly relevant in real-world industrial projects where accurate predictions and reliable uncertainty estimates are crucial.

机器学习在基于分子结构预测分子特性方面越来越受欢迎。本研究通过比较纯图神经网络(GNN)与结合神经指纹的经典机器学习算法,探讨了基于神经指纹的模型的不确定性估计。我们研究了从 GNN 中提取神经指纹并将其整合到一种已知能产生更好校准概率估计值的方法中的优势。我们使用三种经典机器学习方法和 Chemprop 模型进行了比较,并考虑了不同的分子表征和校准技术。我们利用了来自 Toxcast 的 19 个数据集,这些数据集反映了现实世界中的各种情况,其平衡精度在 0.6 到 0.8 之间。结果表明,与原生 Chemprop 模型相比,神经指纹结合经典机器学习方法的预测性能略有下降。不过,这些模型提供的不确定性估计值有了明显改善。值得注意的是,对于与训练集不同的分子,基于神经指纹方法的不确定性估计仍然相对稳健。这表明,采用神经指纹的随机森林等方法可以提供强大的预测性能和可靠的不确定性估计。在同时考虑性能和不确定性时,经过校准的 Chemprop 模型和神经指纹与随机森林或支持向量分类器(SVC)的组合产生了不相上下的结果。令人惊讶的是,SVC 方法在与神经或计数指纹相结合时表现出了良好的性能。这些发现与现实世界中的工业项目尤其相关,因为在这些项目中,准确的预测和可靠的不确定性估计至关重要。
{"title":"Analysis of uncertainty of neural fingerprint-based models.","authors":"Christian W Feldmann, Jochen Sieg, Miriam Mathea","doi":"10.1039/d4fd00095a","DOIUrl":"10.1039/d4fd00095a","url":null,"abstract":"<p><p>Machine learning has gained popularity for predicting molecular properties based on molecular structure. This study explores the uncertainty estimates of neural fingerprint-based models by comparing pure graph neural networks (GNN) to classical machine learning algorithms combined with neural fingerprints. We investigate the advantage of extracting the neural fingerprint from the GNN and integrating it into a method known for producing better-calibrated probability estimates. Comparisons are made using three classical machine learning methods and the Chemprop model, considering different molecular representations and calibration techniques. We utilize 19 datasets from Toxcast, reflecting real-world scenarios with balanced accuracies ranging from 0.6 to 0.8. Results demonstrate that neural fingerprints combined with classical machine learning methods exhibit a slight decrease in prediction performance compared to the native Chemprop model. However, these models provide significantly improved uncertainty estimates. Notably, uncertainty estimates of neural fingerprint-based methods remain relatively robust for molecules dissimilar to the training set. This suggests that methods like random forest with neural fingerprints can deliver strong prediction performance and reliable uncertainty estimates. When considering both performance and uncertainty, the calibrated Chemprop model and the combination of neural fingerprints with random forest or support vector classifier (SVC) yield comparable results. Surprisingly, the SVC method shows promising performance when combined with neural or count fingerprints. These findings are particularly relevant in real-world industrial projects where accurate predictions and reliable uncertainty estimates are crucial.</p>","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142337476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metastable layered lithium-rich niobium and tantalum oxides via nearly instantaneous cation exchange. 通过近乎瞬时的阳离子交换实现可蜕变的层状富锂铌和钽氧化物。
IF 3.3 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-09-20 DOI: 10.1039/d4fd00103f
Sarah L Ko, Jordan A Dorrell, Andrew J Morris, Kent J Griffith

Lithium-rich early transition metal oxides are the source of excess removeable lithium that affords high energy density to lithium-rich battery cathodes. They are also candidates for solid electrolytes in all-solid-state batteries. These highly ionic compounds are sparse on phase diagrams of thermodynamically stable oxides, but soft chemical routes offer an alternative to explore new alkali-rich crystal chemistries. In this work, a new layered polymorph of Li3NbO4 with coplanar [Nb4O16]12- clusters is discovered through ion exchange chemistry. A more detailed study of the ion exchange reaction reveals that it takes place almost instantaneously, changing the crystal volume by more than 22% within seconds. The transformation of coplanar [Nb4O16]12- in L-Li3NbO4 into the supertetrahedral [Nb4O16]12- clusters found in the stable cubic c-Li3NbO4 is also explored. Furthermore, this synthetic pathway is extended to access a new layered polymorph of Li3TaO4. NMR crystallography with 6,7Li, 23Na, and 93Nb NMR, X-ray diffraction, neutron diffraction, and first-principles calculations is applied to A3MO4 (A = Li, Na; M = Nb, Ta) to identify local and long-range atomic structure, to monitor the unusually rapid reaction progression, and to track the phase transitions from the metastable layered phases to the known compounds found using high-temperature synthesis. A mechanism is proposed whereby some sodium is retained at short reaction times, which then undergoes proton exchange during water washing, forming a phase with hydrogen bonds bridging the coplanar [Nb4O16]12- clusters. This study has implications for lithium-rich transition metal oxides and associated battery materials and for ion exchange chemistry in non-framework structures. The role of techniques that can detect light elements, local structure, and subtle structural changes in soft-chemical synthesis is emphasized.

富锂早期过渡金属氧化物是过量可移除锂的来源,可为富锂电池正极提供高能量密度。它们也是全固态电池固态电解质的候选材料。这些高离子化合物在热力学稳定的氧化物相图中并不常见,但软化学路线为探索新的富碱晶体化学提供了另一种选择。在这项研究中,通过离子交换化学发现了一种具有共面[Nb4O16]12-团簇的新型层状多晶体 Li3NbO4。对离子交换反应的更详细研究表明,该反应几乎是瞬间发生的,在几秒钟内晶体体积就改变了 22% 以上。此外,还探讨了 L-Li3NbO4 中共面[Nb4O16]12-转变为稳定立方 c-Li3NbO4 中超四面体[Nb4O16]12- 簇的过程。此外,这一合成途径还扩展到了一种新的层状多晶体 Li3TaO4。利用 6,7Li、23Na 和 93Nb NMR、X 射线衍射、中子衍射和第一原理计算对 A3MO4(A = Li、Na;M = Nb、Ta)进行了核磁共振晶体学研究,以确定局部和长程原子结构,监控异常快速的反应进程,并跟踪从可蜕变层状相到利用高温合成发现的已知化合物的相变过程。研究提出了一种机制,即在短反应时间内保留了一些钠,然后在水洗过程中进行质子交换,形成以氢键桥接共面[Nb4O16]12-团簇的相。这项研究对富锂过渡金属氧化物和相关电池材料以及非框架结构中的离子交换化学具有重要意义。该研究强调了可检测轻元素、局部结构和微妙结构变化的技术在软化学合成中的作用。
{"title":"Metastable layered lithium-rich niobium and tantalum oxides <i>via</i> nearly instantaneous cation exchange.","authors":"Sarah L Ko, Jordan A Dorrell, Andrew J Morris, Kent J Griffith","doi":"10.1039/d4fd00103f","DOIUrl":"10.1039/d4fd00103f","url":null,"abstract":"<p><p>Lithium-rich early transition metal oxides are the source of excess removeable lithium that affords high energy density to lithium-rich battery cathodes. They are also candidates for solid electrolytes in all-solid-state batteries. These highly ionic compounds are sparse on phase diagrams of thermodynamically stable oxides, but soft chemical routes offer an alternative to explore new alkali-rich crystal chemistries. In this work, a new layered polymorph of Li<sub>3</sub>NbO<sub>4</sub> with coplanar [Nb<sub>4</sub>O<sub>16</sub>]<sup>12-</sup> clusters is discovered through ion exchange chemistry. A more detailed study of the ion exchange reaction reveals that it takes place almost instantaneously, changing the crystal volume by more than 22% within seconds. The transformation of coplanar [Nb<sub>4</sub>O<sub>16</sub>]<sup>12-</sup> in L-Li<sub>3</sub>NbO<sub>4</sub> into the supertetrahedral [Nb<sub>4</sub>O<sub>16</sub>]<sup>12-</sup> clusters found in the stable cubic c-Li<sub>3</sub>NbO<sub>4</sub> is also explored. Furthermore, this synthetic pathway is extended to access a new layered polymorph of Li<sub>3</sub>TaO<sub>4</sub>. NMR crystallography with <sup>6,7</sup>Li, <sup>23</sup>Na, and <sup>93</sup>Nb NMR, X-ray diffraction, neutron diffraction, and first-principles calculations is applied to A<sub>3</sub>MO<sub>4</sub> (A = Li, Na; M = Nb, Ta) to identify local and long-range atomic structure, to monitor the unusually rapid reaction progression, and to track the phase transitions from the metastable layered phases to the known compounds found using high-temperature synthesis. A mechanism is proposed whereby some sodium is retained at short reaction times, which then undergoes proton exchange during water washing, forming a phase with hydrogen bonds bridging the coplanar [Nb<sub>4</sub>O<sub>16</sub>]<sup>12-</sup> clusters. This study has implications for lithium-rich transition metal oxides and associated battery materials and for ion exchange chemistry in non-framework structures. The role of techniques that can detect light elements, local structure, and subtle structural changes in soft-chemical synthesis is emphasized.</p>","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142277402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How to do impactful research in artificial intelligence for chemistry and materials science. 如何在化学和材料科学领域开展有影响力的人工智能研究。
IF 3.4 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-09-13 DOI: 10.1039/d4fd00153b
Alan Aspuru-Guzik, Austin Cheng, Marta Skreta, Cher Tian Ser, Andres Guzman-Cordero, Luca Thiede, Andreas Burger, Sergio Pablo-García, Abdulrahman Aldossary, Shi Xuan Leong, Felix Strieth-Kalthoff
Machine learning has been pervasively touching many fields of science. Chemistry and materials science are no exception. While machine learning has been making a great impact, it is still not reaching its full potential or maturity. In this perspective, we first outline the pervasive current applications. Then, we discuss how machine learning researchers view and approach problems in the field. Finally, we provide our considerations for maximizing impact when researching machine learning for chemistry.
机器学习已经渗透到许多科学领域。化学和材料科学也不例外。虽然机器学习已经产生了巨大的影响,但其潜力和成熟度仍未充分发挥出来。在本视角中,我们首先概述了当前的普遍应用。然后,我们讨论机器学习研究人员如何看待和处理该领域的问题。最后,我们提出了在研究化学机器学习时如何最大限度地发挥其影响力的注意事项。
{"title":"How to do impactful research in artificial intelligence for chemistry and materials science.","authors":"Alan Aspuru-Guzik, Austin Cheng, Marta Skreta, Cher Tian Ser, Andres Guzman-Cordero, Luca Thiede, Andreas Burger, Sergio Pablo-García, Abdulrahman Aldossary, Shi Xuan Leong, Felix Strieth-Kalthoff","doi":"10.1039/d4fd00153b","DOIUrl":"https://doi.org/10.1039/d4fd00153b","url":null,"abstract":"Machine learning has been pervasively touching many fields of science. Chemistry and materials science are no exception. While machine learning has been making a great impact, it is still not reaching its full potential or maturity. In this perspective, we first outline the pervasive current applications. Then, we discuss how machine learning researchers view and approach problems in the field. Finally, we provide our considerations for maximizing impact when researching machine learning for chemistry.","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":"32 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142262638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NMR Crystallography 核磁共振晶体学
IF 3.4 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-09-06 DOI: 10.1039/d4fd00151f
Lyndon Emsley
Chemical function is directly related to the spatial arrangement of atoms. Consequently, the determination of atomic-level three-dimensional structures has transformed molecular and materials science over the past 60 years. In this context, solid-state NMR has emerged to become the method of choice for atomic-level characterization of complex materials in powder form. In the following we present an overview of current methods for chemical shift driven NMR crystallography, illustrated with applications to complex materials
化学功能与原子的空间排列直接相关。因此,在过去 60 年中,原子级三维结构的测定改变了分子和材料科学。在此背景下,固态 NMR 已成为粉末状复杂材料原子级表征的首选方法。下面我们将概述目前化学位移驱动核磁共振晶体学的方法,并以复杂材料的应用为例进行说明
{"title":"NMR Crystallography","authors":"Lyndon Emsley","doi":"10.1039/d4fd00151f","DOIUrl":"https://doi.org/10.1039/d4fd00151f","url":null,"abstract":"Chemical function is directly related to the spatial arrangement of atoms. Consequently, the determination of atomic-level three-dimensional structures has transformed molecular and materials science over the past 60 years. In this context, solid-state NMR has emerged to become the method of choice for atomic-level characterization of complex materials in powder form. In the following we present an overview of current methods for chemical shift driven NMR crystallography, illustrated with applications to complex materials","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":"262 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Making the InChI FAIR and sustainable while moving to Inorganics 在转向无机物的同时,使 InChI 具有 FAIR 和可持续性
IF 3.4 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-09-04 DOI: 10.1039/d4fd00145a
Gerd Blanke, Jan Brammer, Djordje Baljozovic, Nauman Khan, Frank Lange, Felix Bänsch, Clare A. Tovee, Ulrich Schatzschneider, Richard M Hartshorn, Sonja Herres-Pawlis
The InChI (International Chemical Identifier) standard stands as a cornerstone in chemical informatics, facilitating the structure-based identification and exchange of chemical compounds across various platforms and databases. The InChI as a unique canonical line notation has made chemical structures searchable on the internet at a broad scale. The largest repositories working with InChIs contain more than 1 billion structures. Central to the functionality of the InChI is its codebase, which orchestrates a series of intricate steps to generate unique identifiers for chemical compounds. Up to now, these steps have been sparsely documented and the InChI algorithm had to be seen as a black box. For the new v1.07 release, the code has been analyzed and the major steps documented, more than 3000 bugs and security issues, as well as nearly 60 Google OSS-Fuzz issues have been fixed. New test systems have been implemented that allow users to directly test the code developments. The move to GitHub has not only made the development more transparent but will also enable external contributors to join the further development of the InChI code. Motivation for this modernisation was the urgency to treat molecular inorganic compounds by the InChI in a meaningful way. Until now, no classic string representation fulfills this need of molecular inorganic chemistry. The connection of metal bonds is by definition disconnected which makes most inorganic InChIs meaningless at the moment. Herein, we propose new routines to remedy this problem in the representation of molecular inorganic compounds by the InChI.
InChI(国际化学标识符)标准是化学信息学的基石,有助于在各种平台和数据库中进行基于结构的化合物识别和交换。InChI 作为一种独特的规范行符号,使化学结构可以在互联网上进行广泛搜索。使用 InChIs 的最大资源库包含 10 亿多个结构。InChI 功能的核心是其代码库,它协调了一系列复杂的步骤来生成化合物的唯一标识符。到目前为止,对这些步骤的记录还很少,InChI 算法只能被看作是一个黑盒子。在新发布的 v1.07 版中,对代码进行了分析,并记录了主要步骤,修复了 3000 多个错误和安全问题,以及近 60 个 Google OSS-Fuzz 问题。此外,还实施了新的测试系统,允许用户直接测试代码开发。迁移到 GitHub 不仅使开发工作更加透明,还能让外部贡献者加入到 InChI 代码的进一步开发中。InChI之所以要进行现代化改造,是因为迫切需要以一种有意义的方式来处理分子无机化合物。到目前为止,还没有一种经典的字符串表示法能满足分子无机化学的这一需求。根据定义,金属键的连接是断开的,这使得大多数无机 InChI 目前毫无意义。在此,我们提出了新的例程,以弥补用 InChI 表示分子无机化合物的这一问题。
{"title":"Making the InChI FAIR and sustainable while moving to Inorganics","authors":"Gerd Blanke, Jan Brammer, Djordje Baljozovic, Nauman Khan, Frank Lange, Felix Bänsch, Clare A. Tovee, Ulrich Schatzschneider, Richard M Hartshorn, Sonja Herres-Pawlis","doi":"10.1039/d4fd00145a","DOIUrl":"https://doi.org/10.1039/d4fd00145a","url":null,"abstract":"The InChI (International Chemical Identifier) standard stands as a cornerstone in chemical informatics, facilitating the structure-based identification and exchange of chemical compounds across various platforms and databases. The InChI as a unique canonical line notation has made chemical structures searchable on the internet at a broad scale. The largest repositories working with InChIs contain more than 1 billion structures. Central to the functionality of the InChI is its codebase, which orchestrates a series of intricate steps to generate unique identifiers for chemical compounds. Up to now, these steps have been sparsely documented and the InChI algorithm had to be seen as a black box. For the new v1.07 release, the code has been analyzed and the major steps documented, more than 3000 bugs and security issues, as well as nearly 60 Google OSS-Fuzz issues have been fixed. New test systems have been implemented that allow users to directly test the code developments. The move to GitHub has not only made the development more transparent but will also enable external contributors to join the further development of the InChI code. Motivation for this modernisation was the urgency to treat molecular inorganic compounds by the InChI in a meaningful way. Until now, no classic string representation fulfills this need of molecular inorganic chemistry. The connection of metal bonds is by definition disconnected which makes most inorganic InChIs meaningless at the moment. Herein, we propose new routines to remedy this problem in the representation of molecular inorganic compounds by the InChI.","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":"18 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regulation of Transmembrane Current through Modulation of Biomimetic Lipid Membrane Composition 通过调节仿生脂质膜成分调节跨膜电流
IF 3.4 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-08-23 DOI: 10.1039/d4fd00149d
Zhiwei Shang, Jing Zhao, Mengyu Yang, Yuling Xiao, Wenjing Chu, Yilin Cai, Xiaoqing Yi, Meihua Lin, Fan Xia
Ion transport through biological channels is influenced not only by the structural properties of the channels themselves but also by the composition of the phospholipid membrane, which acts as a scaffold for these nanochannels. Drawing inspiration from how lipid membrane composition modulates ion currents, as seen in the activation of the K+ channel in Streptomyces A (KcsA) by anionic lipids, we propose a biomimetic nanochannel system that integrates DNA nanotechnology with two-dimensional graphene oxide (GO) nanosheets. By modifying the length of the multibranched DNA nanowires generated through the hybridization chain reactions (HCR) and varying the concentration of the linker strands that integrate these DNA nanowire structures with the GO membrane, the composition of the membrane can be effectively adjusted, consequently impacting ion transport. This method provides a strategy for developing devices with highly efficient and tunable ion transport, suitable for applications in mass transport, environmental protection, biomimetic channels, and biosensors.
离子通过生物通道的传输不仅受通道本身结构特性的影响,还受磷脂膜成分的影响,磷脂膜是这些纳米通道的支架。从阴离子脂质激活链霉菌 A 的 K+ 通道(KcsA)的过程中,我们从脂质膜成分如何调节离子电流中汲取了灵感,提出了一种将 DNA 纳米技术与二维氧化石墨烯(GO)纳米片相结合的仿生纳米通道系统。通过改变杂交链反应(HCR)产生的多分支 DNA 纳米线的长度,以及改变将这些 DNA 纳米线结构与 GO 膜结合在一起的连接链的浓度,可以有效调整膜的组成,从而影响离子传输。这种方法为开发具有高效和可调离子传输功能的设备提供了一种策略,适用于质量传输、环境保护、仿生通道和生物传感器等应用领域。
{"title":"Regulation of Transmembrane Current through Modulation of Biomimetic Lipid Membrane Composition","authors":"Zhiwei Shang, Jing Zhao, Mengyu Yang, Yuling Xiao, Wenjing Chu, Yilin Cai, Xiaoqing Yi, Meihua Lin, Fan Xia","doi":"10.1039/d4fd00149d","DOIUrl":"https://doi.org/10.1039/d4fd00149d","url":null,"abstract":"Ion transport through biological channels is influenced not only by the structural properties of the channels themselves but also by the composition of the phospholipid membrane, which acts as a scaffold for these nanochannels. Drawing inspiration from how lipid membrane composition modulates ion currents, as seen in the activation of the K+ channel in Streptomyces A (KcsA) by anionic lipids, we propose a biomimetic nanochannel system that integrates DNA nanotechnology with two-dimensional graphene oxide (GO) nanosheets. By modifying the length of the multibranched DNA nanowires generated through the hybridization chain reactions (HCR) and varying the concentration of the linker strands that integrate these DNA nanowire structures with the GO membrane, the composition of the membrane can be effectively adjusted, consequently impacting ion transport. This method provides a strategy for developing devices with highly efficient and tunable ion transport, suitable for applications in mass transport, environmental protection, biomimetic channels, and biosensors.","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":"7 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction rigidities for data-driven chemistry 数据驱动化学的预测刚性
IF 3.4 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-08-23 DOI: 10.1039/d4fd00101j
Sanggyu Chong, Filippo Bigi, Federico Grasselli, Philip Loche, Matthias Kellner, Michele Ceriotti
The widespread application of machine learning (ML) to the chemical sciences is making it very important to understand how the ML models learn to correlate chemical structures with their properties, and what can be done to improve the training efficiency whilst guaranteeing interpretability and transferability. In this work, we demonstrate the wide utility of prediction rigidities, a faimily of metrics derived from the loss function, in understanding the robustness of ML model predictions. We show that the prediction rigidities allow the assessment of the model not only at the global level, but also on the local or the component-wise level at which the intermediate (e.g. atomic, body-ordered, or range-separated) predictions are made. We leverage these metrics to understand the learning behavior of different ML models, and to guide efficient dataset construction for model training. We finally implement the formalism for a ML model targeting a coarse-grained system to demonstrate the applicability of the prediction rigidities to an even broader class of atomistic modeling problems.
机器学习(ML)在化学科学领域的广泛应用,使得了解 ML 模型如何学习将化学结构与其性质联系起来,以及如何在保证可解释性和可转移性的同时提高训练效率变得非常重要。在这项工作中,我们展示了预测刚性的广泛实用性,它是由损失函数衍生出的一系列指标,有助于理解 ML 模型预测的鲁棒性。我们表明,预测刚度不仅可以在全局层面对模型进行评估,还可以在局部或组件层面对模型进行评估,而中间预测(如原子、体有序或范围分离)就是在局部或组件层面进行的。我们利用这些指标来了解不同 ML 模型的学习行为,并指导模型训练的高效数据集构建。最后,我们针对粗粒度系统实现了 ML 模型的形式主义,以证明预测刚性适用于更广泛的原子建模问题。
{"title":"Prediction rigidities for data-driven chemistry","authors":"Sanggyu Chong, Filippo Bigi, Federico Grasselli, Philip Loche, Matthias Kellner, Michele Ceriotti","doi":"10.1039/d4fd00101j","DOIUrl":"https://doi.org/10.1039/d4fd00101j","url":null,"abstract":"The widespread application of machine learning (ML) to the chemical sciences is making it very important to understand how the ML models learn to correlate chemical structures with their properties, and what can be done to improve the training efficiency whilst guaranteeing interpretability and transferability. In this work, we demonstrate the wide utility of prediction rigidities, a faimily of metrics derived from the loss function, in understanding the robustness of ML model predictions. We show that the prediction rigidities allow the assessment of the model not only at the global level, but also on the local or the component-wise level at which the intermediate (e.g. atomic, body-ordered, or range-separated) predictions are made. We leverage these metrics to understand the learning behavior of different ML models, and to guide efficient dataset construction for model training. We finally implement the formalism for a ML model targeting a coarse-grained system to demonstrate the applicability of the prediction rigidities to an even broader class of atomistic modeling problems.","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":"47 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Charge induced deformation of scanning electrolyte before contact 接触前扫描电解质的电荷诱导变形
IF 3.4 3区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-08-20 DOI: 10.1039/d4fd00147h
Liang Liu
The recent developments of scanning electrochemical probe techniques focus on the strategy of scanning electrolyte. For example, scanning electrochemical cell microscopy (SECCM) is based on holding the electrolyte in a glass capillary, while scanning gel electrochemical microscopy (SGECM) immobilizes the gel electrolyte on micro-disk electrodes or etched metal wires. In both SECCM and SGECM, the first and essential step is to approach the electrolyte probe to be in contact with the sample, which is very often achieved by current feedback with a constant applied potential between the probe and the sample. This work attempts to theoretically analyse the deformation of electrolyte during this approaching process. For liquid electrolyte in SECCM, surface tension is considered to counterbalance the gravity and electrostatic force in 2D cylindrical coordinates with axial symmetry. The deformation at equilibrium is solved under certain conditions. For gel electrolyte, a viscoelastic gel is analysed with simplified 1D geometry. Both equilibrium and dynamic approaching are considered. The results suggest that for both liquid and gel electrolytes, critical conditions exist for breaking the equilibrium. When applied potential is higher or the distance is lower than the threshold, the force will not equilibrate and the electrolyte will deform until contact. The critical condition depends on the properties (surface tension for liquid, elastic and viscous modulus for gel) and geometry (radius of capillary for liquid, thickness for gel) of electrolyte. Prospects of further extending the work closer to real experimental scenarios, especially SGECM, are also discussed.
扫描电化学探针技术的最新发展主要集中在扫描电解质的策略上。例如,扫描电化学电池显微镜(SECCM)是将电解质固定在玻璃毛细管中,而扫描凝胶电化学显微镜(SGECM)是将凝胶电解质固定在微盘电极或蚀刻金属丝上。扫描凝胶电化学显微镜和扫描凝胶电化学显微镜的第一步都是使电解质探针与样品接触,这通常是通过探针和样品之间的恒定电位电流反馈来实现的。这项工作试图从理论上分析电解质在接近过程中的变形。对于 SECCM 中的液态电解质,在轴对称的二维圆柱坐标中,考虑了表面张力来抵消重力和静电力。平衡时的变形在一定条件下求解。对于凝胶电解质,采用简化的一维几何形状分析粘弹性凝胶。同时考虑了平衡和动态两种方法。结果表明,对于液体和凝胶电解质,都存在打破平衡的临界条件。当外加电势高于临界值或距离小于临界值时,力将不会平衡,电解质将变形直至接触。临界条件取决于电解质的特性(液体的表面张力,凝胶的弹性和粘性模量)和几何形状(液体的毛细管半径,凝胶的厚度)。此外,还讨论了进一步扩展这项工作的前景,使其更接近实际实验场景,特别是 SGECM。
{"title":"Charge induced deformation of scanning electrolyte before contact","authors":"Liang Liu","doi":"10.1039/d4fd00147h","DOIUrl":"https://doi.org/10.1039/d4fd00147h","url":null,"abstract":"The recent developments of scanning electrochemical probe techniques focus on the strategy of scanning electrolyte. For example, scanning electrochemical cell microscopy (SECCM) is based on holding the electrolyte in a glass capillary, while scanning gel electrochemical microscopy (SGECM) immobilizes the gel electrolyte on micro-disk electrodes or etched metal wires. In both SECCM and SGECM, the first and essential step is to approach the electrolyte probe to be in contact with the sample, which is very often achieved by current feedback with a constant applied potential between the probe and the sample. This work attempts to theoretically analyse the deformation of electrolyte during this approaching process. For liquid electrolyte in SECCM, surface tension is considered to counterbalance the gravity and electrostatic force in 2D cylindrical coordinates with axial symmetry. The deformation at equilibrium is solved under certain conditions. For gel electrolyte, a viscoelastic gel is analysed with simplified 1D geometry. Both equilibrium and dynamic approaching are considered. The results suggest that for both liquid and gel electrolytes, critical conditions exist for breaking the equilibrium. When applied potential is higher or the distance is lower than the threshold, the force will not equilibrate and the electrolyte will deform until contact. The critical condition depends on the properties (surface tension for liquid, elastic and viscous modulus for gel) and geometry (radius of capillary for liquid, thickness for gel) of electrolyte. Prospects of further extending the work closer to real experimental scenarios, especially SGECM, are also discussed.","PeriodicalId":76,"journal":{"name":"Faraday Discussions","volume":"17 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142208842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Faraday Discussions
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1