首页 > 最新文献

Digital discovery最新文献

英文 中文
Application-specific machine-learned interatomic potentials: exploring the trade-off between DFT convergence, MLIP expressivity, and computational cost 特定应用的机器学习原子间势:探索DFT收敛性、MLIP表达性和计算成本之间的权衡
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-18 DOI: 10.1039/D5DD00294J
Ilgar Baghishov, Jan Janssen, Graeme Henkelman and Danny Perez

Machine-learned interatomic potentials (MLIPs) are revolutionizing computational materials science and chemistry by offering an efficient alternative to ab initio molecular dynamics (MD) simulations. However, fitting high-quality MLIPs remains a challenging, time-consuming, and computationally intensive task where numerous trade-offs have to be considered, e.g., How much and what kind of atomic configurations should be included in the training set? Which level of ab initio convergence should be used to generate the training set? Which loss function should be used for fitting the MLIP? Which machine learning architecture should be used to train the MLIP? The answers to these questions significantly impact both the computational cost of MLIP training and the accuracy and computational cost of subsequent MLIP MD simulations. In this study, we use a configurationally diverse beryllium dataset and quadratic spectral neighbor analysis potential. We demonstrate that joint optimization of energy versus force weights, training set selection strategies, and convergence settings of the ab initio reference simulations, as well as model complexity can lead to a significant reduction in the overall computational cost associated with training and evaluating MLIPs. This opens the door to computationally efficient generation of high-quality MLIPs for a range of applications which demand different accuracy versus training and evaluation cost trade-offs.

机器学习原子间势(MLIPs)通过提供从头算分子动力学(MD)模拟的有效替代方案,正在彻底改变计算材料科学和化学。然而,拟合高质量的mlip仍然是一项具有挑战性、耗时和计算密集型的任务,其中必须考虑许多权衡,例如,训练集中应该包含多少原子配置以及哪种原子配置?应该使用哪一级的从头算收敛来生成训练集?应该使用哪个损失函数来拟合MLIP?应该使用哪种机器学习架构来训练MLIP?这些问题的答案对MLIP训练的计算成本以及后续MLIP MD模拟的准确性和计算成本都有很大的影响。在这项研究中,我们使用了一个构型多样的铍数据集和二次光谱邻域分析电位。我们证明了能量与力权重、训练集选择策略、从头开始参考模拟的收敛设置以及模型复杂性的联合优化可以显著降低与训练和评估mlip相关的总体计算成本。这为高质量mlip的计算效率生成打开了大门,适用于要求不同精度的一系列应用,而不是培训和评估成本权衡。
{"title":"Application-specific machine-learned interatomic potentials: exploring the trade-off between DFT convergence, MLIP expressivity, and computational cost","authors":"Ilgar Baghishov, Jan Janssen, Graeme Henkelman and Danny Perez","doi":"10.1039/D5DD00294J","DOIUrl":"https://doi.org/10.1039/D5DD00294J","url":null,"abstract":"<p >Machine-learned interatomic potentials (MLIPs) are revolutionizing computational materials science and chemistry by offering an efficient alternative to <em>ab initio</em> molecular dynamics (MD) simulations. However, fitting high-quality MLIPs remains a challenging, time-consuming, and computationally intensive task where numerous trade-offs have to be considered, <em>e.g.,</em> How much and what kind of atomic configurations should be included in the training set? Which level of <em>ab initio</em> convergence should be used to generate the training set? Which loss function should be used for fitting the MLIP? Which machine learning architecture should be used to train the MLIP? The answers to these questions significantly impact both the computational cost of MLIP training and the accuracy and computational cost of subsequent MLIP MD simulations. In this study, we use a configurationally diverse beryllium dataset and quadratic spectral neighbor analysis potential. We demonstrate that joint optimization of energy <em>versus</em> force weights, training set selection strategies, and convergence settings of the <em>ab initio</em> reference simulations, as well as model complexity can lead to a significant reduction in the overall computational cost associated with training and evaluating MLIPs. This opens the door to computationally efficient generation of high-quality MLIPs for a range of applications which demand different accuracy <em>versus</em> training and evaluation cost trade-offs.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 1","pages":" 332-347"},"PeriodicalIF":6.2,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2026/dd/d5dd00294j?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Retrosynformer: planning multi-step chemical synthesis routes via a decision transformer 逆向合成器:通过决策变压器规划多步化学合成路线
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-18 DOI: 10.1039/D5DD00153F
Emma Granqvist, Rocío Mercado and Samuel Genheden

We present RetroSynFormer, a novel approach to multi-step retrosynthesis planning. Here, we express the task of iteratively breaking down a compound into building blocks as a sequence-modeling problem and train a model based on the Decision Transformer. The synthesis routes are generated by iteratively predicting chemical reactions from a set of predefined rules that encode known transformations, and routes are scored during construction using a novel reward function. RetroSynFormer was trained on routes extracted from the PaRoutes dataset of patented experimental routes. On targets from the PaRoutes test set, the RetroSynFormer could find routes to commercial starting materials for 92% of the targets, and we show that the produced routes on average are close to the reference patented route and of good quality. Furthermore, we explore alternative model implementations and discuss the robustness of the model with respect to beam width, reward function, and template space size. We also compare RetroSynFormer to AiZynthFinder, a conventional retrosynthesis algorithm, and find that our novel model is competitive and complementary to the established methodology, thus forming a valuable addition to the field of computer-aided synthesis planning.

我们提出RetroSynFormer,一种多步骤逆转录规划的新方法。在这里,我们将迭代地将化合物分解为构建块的任务表示为序列建模问题,并基于Decision Transformer训练模型。合成路线是通过从一组预定义的规则中迭代预测化学反应来生成的,这些规则对已知的转换进行编码,并且在构建过程中使用新的奖励函数对路线进行评分。RetroSynFormer对从PaRoutes专利实验路线数据集中提取的路线进行训练。在PaRoutes测试集的目标上,RetroSynFormer可以为92%的目标找到通往商业起始材料的路线,并且我们表明,生成的路线平均接近参考专利路线并且质量良好。此外,我们探索了可选的模型实现,并讨论了模型在波束宽度、奖励函数和模板空间大小方面的鲁棒性。我们还将RetroSynFormer与AiZynthFinder(一种传统的逆转录合成算法)进行了比较,发现我们的新模型与现有方法具有竞争力和互补性,从而为计算机辅助合成规划领域提供了有价值的补充。
{"title":"Retrosynformer: planning multi-step chemical synthesis routes via a decision transformer","authors":"Emma Granqvist, Rocío Mercado and Samuel Genheden","doi":"10.1039/D5DD00153F","DOIUrl":"https://doi.org/10.1039/D5DD00153F","url":null,"abstract":"<p >We present RetroSynFormer, a novel approach to multi-step retrosynthesis planning. Here, we express the task of iteratively breaking down a compound into building blocks as a sequence-modeling problem and train a model based on the Decision Transformer. The synthesis routes are generated by iteratively predicting chemical reactions from a set of predefined rules that encode known transformations, and routes are scored during construction using a novel reward function. RetroSynFormer was trained on routes extracted from the PaRoutes dataset of patented experimental routes. On targets from the PaRoutes test set, the RetroSynFormer could find routes to commercial starting materials for 92% of the targets, and we show that the produced routes on average are close to the reference patented route and of good quality. Furthermore, we explore alternative model implementations and discuss the robustness of the model with respect to beam width, reward function, and template space size. We also compare RetroSynFormer to AiZynthFinder, a conventional retrosynthesis algorithm, and find that our novel model is competitive and complementary to the established methodology, thus forming a valuable addition to the field of computer-aided synthesis planning.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 1","pages":" 348-362"},"PeriodicalIF":6.2,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2026/dd/d5dd00153f?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An automated evaluation agent for Q&A pairs and reticular synthesis conditions 问答对和网状合成条件的自动评价代理
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-18 DOI: 10.1039/D5DD00413F
Nakul Rampal, Dongrong Joe Fu, Chengbin Zhao, Hanan S. Murayshid, Albatool A. Abaalkhail, Nahla E. Alhazmi, Majed O. Alawad, Christian Borgs, Jennifer T. Chayes and Omar M. Yaghi

We report an automated evaluation agent that can reliably assign classification labels to different Q&A pairs of both single-hop and multi-hop types, as well as to synthesis conditions datasets. Our agent is built around a suite of large language models (LLMs) and is designed to eliminate human involvement in the evaluation process. Even though we believe that this approach has broad applicability, for concreteness, we apply it here to reticular chemistry. Through extensive testing of various approaches such as DSPy and finetuning, among others, we found that the performance of a given LLM on these Q&A and synthesis conditions classification tasks is determined primarily by the architecture of the agent, where how the different inputs are parsed and processed and how the LLMs are called make a significant difference. We also found that the quality of the prompt provided remains paramount, irrespective of the sophistication of the underlying model. Even models considered state-of-the-art, such as GPT-o1, exhibit poor performance when the prompt lacks sufficient detail and structure. To overcome these challenges, we performed systematic prompt optimization, iteratively refining the prompt to significantly improve classification accuracy and achieve human-level evaluation benchmarks. We show that while LLMs have made remarkable progress, they still fall short of human reasoning without substantial prompt engineering. The agent presented here provides a robust and reproducible tool for evaluating Q&A pairs and synthesis conditions in a scalable manner and can serve as a foundation for future developments in automated evaluation of LLM inputs and outputs and more generally to create foundation models in chemistry.

我们报告了一个自动评估代理,它可以可靠地为不同的单跳和多跳类型的Q&;A对以及合成条件数据集分配分类标签。我们的智能体是围绕一套大型语言模型(llm)构建的,旨在消除人类在评估过程中的参与。尽管我们相信这种方法具有广泛的适用性,但具体而言,我们在这里将其应用于网状化学。通过对各种方法(如DSPy和微调等)的广泛测试,我们发现给定LLM在这些Q&; a和合成条件分类任务上的性能主要由代理的体系结构决定,其中如何解析和处理不同的输入以及如何调用LLM会产生显着差异。我们还发现,无论底层模型的复杂程度如何,所提供的提示的质量仍然至关重要。即使被认为是最先进的模型,如gpt - 01,在提示符缺乏足够的细节和结构时,也会表现不佳。为了克服这些挑战,我们进行了系统的提示优化,迭代地改进提示,以显着提高分类精度并达到人类水平的评估基准。我们表明,虽然法学硕士取得了显著的进步,但在没有大量提示工程的情况下,它们仍然无法达到人类的推理能力。本文介绍的代理为以可扩展的方式评估Q&; a对和合成条件提供了一个强大且可重复的工具,可以作为LLM输入和输出自动化评估的未来发展的基础,更广泛地说,可以创建化学基础模型。
{"title":"An automated evaluation agent for Q&A pairs and reticular synthesis conditions","authors":"Nakul Rampal, Dongrong Joe Fu, Chengbin Zhao, Hanan S. Murayshid, Albatool A. Abaalkhail, Nahla E. Alhazmi, Majed O. Alawad, Christian Borgs, Jennifer T. Chayes and Omar M. Yaghi","doi":"10.1039/D5DD00413F","DOIUrl":"https://doi.org/10.1039/D5DD00413F","url":null,"abstract":"<p >We report an automated evaluation agent that can reliably assign classification labels to different Q&amp;A pairs of both single-hop and multi-hop types, as well as to synthesis conditions datasets. Our agent is built around a suite of large language models (LLMs) and is designed to eliminate human involvement in the evaluation process. Even though we believe that this approach has broad applicability, for concreteness, we apply it here to reticular chemistry. Through extensive testing of various approaches such as DSPy and finetuning, among others, we found that the performance of a given LLM on these Q&amp;A and synthesis conditions classification tasks is determined primarily by the architecture of the agent, where how the different inputs are parsed and processed and how the LLMs are called make a significant difference. We also found that the quality of the prompt provided remains paramount, irrespective of the sophistication of the underlying model. Even models considered state-of-the-art, such as GPT-o1, exhibit poor performance when the prompt lacks sufficient detail and structure. To overcome these challenges, we performed systematic prompt optimization, iteratively refining the prompt to significantly improve classification accuracy and achieve human-level evaluation benchmarks. We show that while LLMs have made remarkable progress, they still fall short of human reasoning without substantial prompt engineering. The agent presented here provides a robust and reproducible tool for evaluating Q&amp;A pairs and synthesis conditions in a scalable manner and can serve as a foundation for future developments in automated evaluation of LLM inputs and outputs and more generally to create foundation models in chemistry.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 1","pages":" 231-240"},"PeriodicalIF":6.2,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2026/dd/d5dd00413f?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High throughput tight binding calculation of electronic HOMO–LUMO gaps and its prediction for natural compounds 天然化合物电子HOMO-LUMO间隙的高通量紧密结合计算及其预测
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-17 DOI: 10.1039/D5DD00186B
Sascha Thinius

This research investigates predicting the Highest Occupied Molecular Orbital and the Lowest Unoccupied Molecular Orbital (HOMO–LUMO; short HL) gap of natural compounds, a crucial property for understanding molecular electronic behavior relevant to cheminformatics and materials science. To address the high computational cost of traditional methods, this study develops a high-throughput, machine learning (ML)-based approach. Using 407 000 molecules from the COCONUT database, RDKit was employed to calculate and select molecular descriptors. The computational workflow, managed by Toil and CWL on a high-performance computing (HPC) Slurm cluster, utilized Geometry – Frequency – Noncovalent – eXtended Tight Binding (GFN2-xTB) for electronic structure calculations with Boltzmann weighting across multiple conformational states. Three ensemble methods, namely Gradient Boosting Regression (GBR), eXtreme Gradient Boosting Regression (XGBR), Random Forrest Regression (RFR) and a Multi-layer Perceptron Regressor (MLPR) were compared based on their ability to accurately predict HL-gaps in this chemical space. Key findings reveal molecular polarizability, particularly SMR_VSA descriptors, as crucial for HL-gap determination in all models. Aromatic rings and functional groups, such as ketones, also significantly influence the HL-gap prediction. While the MLPR model demonstrated good overall predictive performance, accuracy varied across molecular subsets. Challenges were observed in predicting HL-gaps for molecules containing aliphatic carboxylic acids, alcohols, and amines in molecular systems with complex electronic structure. This work emphasizes the importance of polarizability and structural features in HL-gap predictive modeling, showcasing the potential of machine learning while also highlighting limitations in handling specific structural motifs. These limitations point towards promising perspectives for further model improvements.

本研究旨在预测天然化合物的最高已占据分子轨道和最低未占据分子轨道(HOMO-LUMO; short HL)间隙,这是理解与化学信息学和材料科学相关的分子电子行为的重要性质。为了解决传统方法的高计算成本,本研究开发了一种基于机器学习(ML)的高通量方法。利用COCONUT数据库中的407 000个分子,使用RDKit计算和选择分子描述符。计算工作流由Toil和CWL在高性能计算(HPC) Slurm集群上管理,利用几何-频率-非共价-扩展紧密结合(GFN2-xTB)进行电子结构计算,并在多个构象状态上使用玻尔兹曼加权。比较了梯度增强回归(GBR)、极端梯度增强回归(XGBR)、随机Forrest回归(RFR)和多层感知器回归(MLPR)三种集成方法对该化学空间中hl -gap的准确预测能力。关键发现揭示了分子极化率,特别是SMR_VSA描述子,在所有模型中都是确定HL-gap的关键因素。芳香环和官能团(如酮类)也显著影响HL-gap的预测。虽然MLPR模型显示出良好的整体预测性能,但准确性在分子亚群之间存在差异。在具有复杂电子结构的分子体系中,预测含有脂肪族羧酸、醇和胺的分子的hl -间隙存在挑战。这项工作强调了极化和结构特征在HL-gap预测建模中的重要性,展示了机器学习的潜力,同时也强调了处理特定结构主题的局限性。这些限制为进一步的模型改进指明了有希望的前景。
{"title":"High throughput tight binding calculation of electronic HOMO–LUMO gaps and its prediction for natural compounds","authors":"Sascha Thinius","doi":"10.1039/D5DD00186B","DOIUrl":"https://doi.org/10.1039/D5DD00186B","url":null,"abstract":"<p >This research investigates predicting the Highest Occupied Molecular Orbital and the Lowest Unoccupied Molecular Orbital (HOMO–LUMO; short HL) gap of natural compounds, a crucial property for understanding molecular electronic behavior relevant to cheminformatics and materials science. To address the high computational cost of traditional methods, this study develops a high-throughput, machine learning (ML)-based approach. Using 407 000 molecules from the COCONUT database, RDKit was employed to calculate and select molecular descriptors. The computational workflow, managed by Toil and CWL on a high-performance computing (HPC) Slurm cluster, utilized Geometry – Frequency – Noncovalent – eXtended Tight Binding (GFN2-xTB) for electronic structure calculations with Boltzmann weighting across multiple conformational states. Three ensemble methods, namely Gradient Boosting Regression (GBR), eXtreme Gradient Boosting Regression (XGBR), Random Forrest Regression (RFR) and a Multi-layer Perceptron Regressor (MLPR) were compared based on their ability to accurately predict HL-gaps in this chemical space. Key findings reveal molecular polarizability, particularly SMR_VSA descriptors, as crucial for HL-gap determination in all models. Aromatic rings and functional groups, such as ketones, also significantly influence the HL-gap prediction. While the MLPR model demonstrated good overall predictive performance, accuracy varied across molecular subsets. Challenges were observed in predicting HL-gaps for molecules containing aliphatic carboxylic acids, alcohols, and amines in molecular systems with complex electronic structure. This work emphasizes the importance of polarizability and structural features in HL-gap predictive modeling, showcasing the potential of machine learning while also highlighting limitations in handling specific structural motifs. These limitations point towards promising perspectives for further model improvements.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 1","pages":" 203-213"},"PeriodicalIF":6.2,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2026/dd/d5dd00186b?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CatBot – a high-throughput catalyst synthesis and testing system with roll to roll transfer CatBot -一种高通量催化剂合成和测试系统,具有卷到卷传递功能
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-13 DOI: 10.1039/D5DD00403A
Paolo Vincenzo Freiesleben de Blasio, Rune Kruger, Nis Fisker-Bødker, Jin Hyun Chang and Christodoulos Chatzichristodoulou

Fast and accurate synthesis and testing of electrocatalysts is essential to accelerate development of next generation catalysts for sustainable energy technologies. In this paper, we introduce CatBot, a fully automated platform for reliable synthesis and testing of electrocatalysts capable of operating at temperatures of up to 100 °C from highly acidic to highly alkaline conditions. The platform leverages roll-to-roll transfer, integrating customizable stages for substrate cleaning, catalyst loading, and electrochemical testing, with a custom made liquid distribution system enabling multi-element electrocatalyst synthesis via electrodeposition. CatBot enables fabrication and testing of up to 100 electrocatalysts per day, significantly accelerating catalyst discovery and optimization. We demonstrate the platform's reproducibility, through synthesis and testing of various catalytic coatings for the hydrogen evolution reaction (HER) in alkaline conditions, achieving overpotential uncertainties in the range of 4–13 mV at −100 mA cm−2. Additionally, we benchmark the platform by comparing anodic and cathodic redox peaks for nickel in alkaline solutions confirming consistency with previous studies. Thus, CatBot comprises an automated, fast, reproducible, accurate and scalable synthesis and testing system for the accelerated development of next generation electrocatalysts.

快速准确地合成和测试电催化剂对于加速下一代可持续能源技术催化剂的开发至关重要。在本文中,我们介绍了CatBot,一个完全自动化的平台,用于可靠的合成和测试电催化剂,能够在高达100°C的温度下从高酸性到高碱性条件下工作。该平台利用卷对卷传输,集成了可定制的基板清洗、催化剂装载和电化学测试阶段,并配有定制的液体分配系统,可通过电沉积合成多元素电催化剂。CatBot每天可以制造和测试多达100种电催化剂,大大加快了催化剂的发现和优化。通过在碱性条件下合成和测试各种析氢反应(HER)的催化涂层,我们证明了该平台的可重复性,在- 100 mA cm - 2下实现了4-13 mV的过电位不确定度。此外,我们通过比较碱性溶液中镍的阳极和阴极氧化还原峰来对平台进行基准测试,以确认与先前研究的一致性。因此,CatBot包括一个自动化,快速,可重复,准确和可扩展的合成和测试系统,用于加速下一代电催化剂的开发。
{"title":"CatBot – a high-throughput catalyst synthesis and testing system with roll to roll transfer","authors":"Paolo Vincenzo Freiesleben de Blasio, Rune Kruger, Nis Fisker-Bødker, Jin Hyun Chang and Christodoulos Chatzichristodoulou","doi":"10.1039/D5DD00403A","DOIUrl":"https://doi.org/10.1039/D5DD00403A","url":null,"abstract":"<p >Fast and accurate synthesis and testing of electrocatalysts is essential to accelerate development of next generation catalysts for sustainable energy technologies. In this paper, we introduce CatBot, a fully automated platform for reliable synthesis and testing of electrocatalysts capable of operating at temperatures of up to 100 °C from highly acidic to highly alkaline conditions. The platform leverages roll-to-roll transfer, integrating customizable stages for substrate cleaning, catalyst loading, and electrochemical testing, with a custom made liquid distribution system enabling multi-element electrocatalyst synthesis <em>via</em> electrodeposition. CatBot enables fabrication and testing of up to 100 electrocatalysts per day, significantly accelerating catalyst discovery and optimization. We demonstrate the platform's reproducibility, through synthesis and testing of various catalytic coatings for the hydrogen evolution reaction (HER) in alkaline conditions, achieving overpotential uncertainties in the range of 4–13 mV at −100 mA cm<small><sup>−2</sup></small>. Additionally, we benchmark the platform by comparing anodic and cathodic redox peaks for nickel in alkaline solutions confirming consistency with previous studies. Thus, CatBot comprises an automated, fast, reproducible, accurate and scalable synthesis and testing system for the accelerated development of next generation electrocatalysts.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 12","pages":" 3810-3817"},"PeriodicalIF":6.2,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00403a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational design of polypeptide-based compartments for synthetic cells 合成细胞多肽基隔室的计算设计
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-12 DOI: 10.1039/D5DD00291E
Jianming Mao, Yongkang Xi, Armin Shayesteh Zadeh, Allen P. Liu and Andrew L. Ferguson

Synthetic cells are prevalent models for understanding and recapitulating complicated functions of natural cells such as DNA replication and protein expression. Lipid-based vesicles are widely employed but are limited by their fragility under mechanical forces or osmotic pressure. Elastin-like polypeptides (ELPs) composed of repetitive (VPGXG) sequences present alternative building blocks with which to construct the delimiting membrane of synthetic cells possessing high structural stability and tolerance of harsh environmental stress. In this work, we present a high-throughput virtual screening pipeline combining coarse-grained simulations, alchemical free energy calculations, Gaussian process regression, and Bayesian optimization to traverse a library of amphiphilic diblock ELPs for mutant sequences predicted to form thermodynamically stable bilayer vesicles. From our screening campaign, we have identified a range of novel ELP candidates with enhanced predicted stability. Analysis of our screening data exposes new rational design principles that suggest incorporating particular guest residues in hydrophilic blocks – including histidine, tyrosine, and threonine – and in hydrophobic blocks – including alanine, phenylalanine, cysteine, and isoleucine – to enhance the thermodynamic stability of ELP bilayer vesicles. The computational pipeline greatly accelerates the discovery of ELP building blocks for synthetic cells, exposes new design principles for these molecules, and furnishes a transferable framework for designing peptides with desirable structural or functional properties.

合成细胞是理解和概括自然细胞复杂功能(如DNA复制和蛋白质表达)的普遍模型。脂基囊泡被广泛应用,但由于其在机械力或渗透压下的脆弱性而受到限制。由重复(VPGXG)序列组成的弹性蛋白样多肽(ELPs)是构建具有高结构稳定性和耐恶劣环境应力的合成细胞分隔膜的替代构建块。在这项工作中,我们提出了一个高通量的虚拟筛选管道,结合粗粒度模拟,炼金术自由能计算,高斯过程回归和贝叶斯优化来遍历两亲性二嵌段elp库,用于预测形成热力学稳定的双层囊泡的突变序列。从我们的筛选活动中,我们已经确定了一系列具有增强预测稳定性的新型ELP候选药物。我们的筛选数据分析揭示了新的合理设计原则,建议在亲水性块(包括组氨酸、酪氨酸和苏氨酸)和疏水性块(包括丙氨酸、苯丙氨酸、半胱氨酸和异亮氨酸)中加入特定的客体残基,以增强ELP双层囊泡的热力学稳定性。计算管道极大地加速了合成细胞的ELP构建块的发现,揭示了这些分子的新设计原则,并为设计具有理想结构或功能特性的肽提供了可转移的框架。
{"title":"Computational design of polypeptide-based compartments for synthetic cells","authors":"Jianming Mao, Yongkang Xi, Armin Shayesteh Zadeh, Allen P. Liu and Andrew L. Ferguson","doi":"10.1039/D5DD00291E","DOIUrl":"https://doi.org/10.1039/D5DD00291E","url":null,"abstract":"<p >Synthetic cells are prevalent models for understanding and recapitulating complicated functions of natural cells such as DNA replication and protein expression. Lipid-based vesicles are widely employed but are limited by their fragility under mechanical forces or osmotic pressure. Elastin-like polypeptides (ELPs) composed of repetitive (VPGXG) sequences present alternative building blocks with which to construct the delimiting membrane of synthetic cells possessing high structural stability and tolerance of harsh environmental stress. In this work, we present a high-throughput virtual screening pipeline combining coarse-grained simulations, alchemical free energy calculations, Gaussian process regression, and Bayesian optimization to traverse a library of amphiphilic diblock ELPs for mutant sequences predicted to form thermodynamically stable bilayer vesicles. From our screening campaign, we have identified a range of novel ELP candidates with enhanced predicted stability. Analysis of our screening data exposes new rational design principles that suggest incorporating particular guest residues in hydrophilic blocks – including histidine, tyrosine, and threonine – and in hydrophobic blocks – including alanine, phenylalanine, cysteine, and isoleucine – to enhance the thermodynamic stability of ELP bilayer vesicles. The computational pipeline greatly accelerates the discovery of ELP building blocks for synthetic cells, exposes new design principles for these molecules, and furnishes a transferable framework for designing peptides with desirable structural or functional properties.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 1","pages":" 214-230"},"PeriodicalIF":6.2,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2026/dd/d5dd00291e?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-modal contrastive learning for chemical structure elucidation with VibraCLIP 用VibraCLIP进行化学结构解析的多模态对比学习
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-11 DOI: 10.1039/D5DD00269A
Pau Rocabert-Oriols, Camilla Lo Conte, Núria López and Javier Heras-Domingo

Identifying molecular structures from vibrational spectra is central to chemical analysis but remains challenging due to spectral ambiguity and the limitations of single-modality methods. While deep learning has advanced various spectroscopic characterization techniques, leveraging the complementary nature of infrared (IR) and Raman spectroscopies remains largely underexplored. We introduce VibraCLIP, a contrastive learning framework that embeds molecular graphs, IR and Raman spectra into a shared latent space. A lightweight fine-tuning protocol ensures generalization from theoretical to experimental datasets. VibraCLIP enables accurate, scalable, and data-efficient molecular identification, linking vibrational spectroscopy with structural interpretation. This tri-modal design captures rich structure–spectra relationships, achieving Top-1 retrieval accuracy of 81.7% and reaching 98.9% Top-25 accuracy with molecular mass integration. By integrating complementary vibrational spectroscopic signals with molecular representations, VibraCLIP provides a practical framework for automated spectral analysis, with potential applications in fields such as synthesis monitoring, drug development, and astrochemical detection.

从振动光谱中识别分子结构是化学分析的核心,但由于光谱模糊和单模态方法的局限性,仍然具有挑战性。虽然深度学习已经发展了各种光谱表征技术,但利用红外(IR)和拉曼光谱的互补性在很大程度上仍未得到充分探索。我们介绍了VibraCLIP,这是一个对比学习框架,它将分子图、红外光谱和拉曼光谱嵌入到共享潜在空间中。轻量级的微调协议确保从理论到实验数据集的泛化。VibraCLIP实现精确、可扩展和数据高效的分子鉴定,将振动光谱与结构解释联系起来。这种三模态设计捕获了丰富的结构-光谱关系,获得了81.7%的Top-1检索精度和98.9%的Top-25精度与分子质量集成。通过将互补的振动光谱信号与分子表征相结合,VibraCLIP为自动光谱分析提供了一个实用的框架,在合成监测、药物开发和天体化学检测等领域具有潜在的应用前景。
{"title":"Multi-modal contrastive learning for chemical structure elucidation with VibraCLIP","authors":"Pau Rocabert-Oriols, Camilla Lo Conte, Núria López and Javier Heras-Domingo","doi":"10.1039/D5DD00269A","DOIUrl":"https://doi.org/10.1039/D5DD00269A","url":null,"abstract":"<p >Identifying molecular structures from vibrational spectra is central to chemical analysis but remains challenging due to spectral ambiguity and the limitations of single-modality methods. While deep learning has advanced various spectroscopic characterization techniques, leveraging the complementary nature of infrared (IR) and Raman spectroscopies remains largely underexplored. We introduce VibraCLIP, a contrastive learning framework that embeds molecular graphs, IR and Raman spectra into a shared latent space. A lightweight fine-tuning protocol ensures generalization from theoretical to experimental datasets. VibraCLIP enables accurate, scalable, and data-efficient molecular identification, linking vibrational spectroscopy with structural interpretation. This tri-modal design captures rich structure–spectra relationships, achieving Top-1 retrieval accuracy of 81.7% and reaching 98.9% Top-25 accuracy with molecular mass integration. By integrating complementary vibrational spectroscopic signals with molecular representations, VibraCLIP provides a practical framework for automated spectral analysis, with potential applications in fields such as synthesis monitoring, drug development, and astrochemical detection.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 12","pages":" 3818-3827"},"PeriodicalIF":6.2,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00269a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extrapolating beyond C60: advancing prediction of fullerene isomers with FullereneNet C60以外的外推:利用FullereneNet推进富勒烯异构体的预测
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-11 DOI: 10.1039/D5DD00241A
Bin Liu, Jirui Jin and Mingjie Liu

Fullerenes, carbon-based nanomaterials with sp2-hybridized carbon atoms arranged in polyhedral cages, exhibit diverse isomeric structures with promising applications in optoelectronics, solar cells, and medicine. However, the vast number of possible fullerene isomers complicates efficient property prediction. In this study, we introduce FullereneNet, a graph neural network-based model that predicts fundamental properties of fullerenes using topological features derived solely from unoptimized structures, eliminating the need for computationally expensive quantum chemistry optimizations. The model leverages topological representations based on the chemical environments of pentagonal and hexagonal rings, enabling efficient capture of local structural details. We show that this approach yields superior performance in predicting the C–C binding energy for a wide range of fullerene sizes, achieving mean absolute errors of 3 meV per atom for C60, 4 meV per atom for C70, and 6 meV per atom for C72–C100, surpassing the values of the state-of-the-art machine learning interatomic potential GAP-20. Additionally, the FullereneNet model accurately predicts 11 other properties, including the HOMO–LUMO gap and solvation free energy, demonstrating robustness and transferability across fullerene types. This work provides a computationally efficient framework for high-throughput screening of fullerene candidates and establishes a foundation for future data-driven studies in fullerene chemistry.

富勒烯是一种由sp2杂化碳原子排列在多面体笼中的碳基纳米材料,具有多种异构结构,在光电子、太阳能电池和医学等领域具有广阔的应用前景。然而,大量可能的富勒烯异构体使有效的性质预测复杂化。在这项研究中,我们引入了FullereneNet,这是一种基于图神经网络的模型,它使用仅来自未优化结构的拓扑特征来预测富勒烯的基本性质,从而消除了计算上昂贵的量子化学优化的需要。该模型利用基于五边形和六边形环的化学环境的拓扑表示,能够有效地捕获局部结构细节。我们表明,这种方法在预测各种富勒烯尺寸的C-C结合能方面具有优异的性能,C60的平均绝对误差为3 meV /原子,C70的平均绝对误差为4 meV /原子,C72-C100的平均绝对误差为6 meV /原子,超过了最先进的机器学习原子间势GAP-20的值。此外,FullereneNet模型准确预测了11种其他性质,包括HOMO-LUMO间隙和溶剂化自由能,证明了富勒烯类型的稳健性和可转移性。这项工作为高通量筛选候选富勒烯提供了一个计算效率高的框架,并为未来富勒烯化学的数据驱动研究奠定了基础。
{"title":"Extrapolating beyond C60: advancing prediction of fullerene isomers with FullereneNet","authors":"Bin Liu, Jirui Jin and Mingjie Liu","doi":"10.1039/D5DD00241A","DOIUrl":"https://doi.org/10.1039/D5DD00241A","url":null,"abstract":"<p >Fullerenes, carbon-based nanomaterials with sp<small><sup>2</sup></small>-hybridized carbon atoms arranged in polyhedral cages, exhibit diverse isomeric structures with promising applications in optoelectronics, solar cells, and medicine. However, the vast number of possible fullerene isomers complicates efficient property prediction. In this study, we introduce FullereneNet, a graph neural network-based model that predicts fundamental properties of fullerenes using topological features derived solely from unoptimized structures, eliminating the need for computationally expensive quantum chemistry optimizations. The model leverages topological representations based on the chemical environments of pentagonal and hexagonal rings, enabling efficient capture of local structural details. We show that this approach yields superior performance in predicting the C–C binding energy for a wide range of fullerene sizes, achieving mean absolute errors of 3 meV per atom for C<small><sub>60</sub></small>, 4 meV per atom for C<small><sub>70</sub></small>, and 6 meV per atom for C<small><sub>72</sub></small>–C<small><sub>100</sub></small>, surpassing the values of the state-of-the-art machine learning interatomic potential GAP-20. Additionally, the FullereneNet model accurately predicts 11 other properties, including the HOMO–LUMO gap and solvation free energy, demonstrating robustness and transferability across fullerene types. This work provides a computationally efficient framework for high-throughput screening of fullerene candidates and establishes a foundation for future data-driven studies in fullerene chemistry.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 1","pages":" 123-133"},"PeriodicalIF":6.2,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2026/dd/d5dd00241a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of simple-structured conjugated polymers for organic solar cells by machine learning-assisted structural modification and experimental validation 基于机器学习辅助结构修饰和实验验证的有机太阳能电池简单结构共轭聚合物设计
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-11 DOI: 10.1039/D5DD00418G
Shogo Tadokoro, Ryosuke Kamimura, Fumitaka Ishiwari and Akinori Saeki

Improving the performance of organic photovoltaics (OPVs) depends on the development of new p-type polymers and n-type non-fullerene acceptor (NFA) molecules. However, conventional experimental and theoretical methods are inefficient for exploring the vast chemical space. In this report, we use machine learning (ML) to explore simple-structured p-type polymers. The structural simplicity is associated with a small synthesis step relevant for low-cost, large-scale production. By considering the structural simplicity (primitively based on the molecular weight of its repeating unit) of the 200 thousand virtually generated polymers, together with synthetic accessibility, we focus on copolymers composed of benzoxadiazole as an acceptor and thiophene (or phenylene) as a donor. Although the structures of these copolymers resemble a high-performance simple-structured PTQ10, their structural symmetries (regioregularity) are modified for synthetic reasons. Through the characterization of the synthesized polymers, their OPV devices blended with Y6 NFA, and resultant synthetic complexity scores, we show that our polymer with a minor manual modification of the donor and alkyl chain exhibits a power conversion efficiency of 5.56%, which closely aligns with that predicted by ML and provides a basis for the further development of novel polymers with low synthesis and search costs.

提高有机光伏(OPVs)的性能取决于新型p型聚合物和n型非富勒烯受体(NFA)分子的发展。然而,传统的实验和理论方法对于探索广阔的化学空间是低效的。在本报告中,我们使用机器学习(ML)来探索简单结构的p型聚合物。结构简单与低成本、大规模生产相关的小合成步骤有关。考虑到20万种虚拟生成的聚合物的结构简单性(主要基于其重复单元的分子量)以及合成的可及性,我们将重点放在以苯并恶二唑为受体和噻吩(或苯基)为供体的共聚物上。虽然这些共聚物的结构类似于高性能的简单结构PTQ10,但由于合成原因,它们的结构对称性(区域规则性)被修改了。通过对合成聚合物及其与Y6 NFA共混的OPV装置的表征和合成复杂性评分,我们表明,我们的聚合物在对供体和烷基链进行少量人工修饰的情况下,功率转换效率为5.56%,这与ML预测的结果非常接近,为进一步开发低合成和低搜索成本的新型聚合物提供了基础。
{"title":"Design of simple-structured conjugated polymers for organic solar cells by machine learning-assisted structural modification and experimental validation","authors":"Shogo Tadokoro, Ryosuke Kamimura, Fumitaka Ishiwari and Akinori Saeki","doi":"10.1039/D5DD00418G","DOIUrl":"https://doi.org/10.1039/D5DD00418G","url":null,"abstract":"<p >Improving the performance of organic photovoltaics (OPVs) depends on the development of new p-type polymers and n-type non-fullerene acceptor (NFA) molecules. However, conventional experimental and theoretical methods are inefficient for exploring the vast chemical space. In this report, we use machine learning (ML) to explore simple-structured p-type polymers. The structural simplicity is associated with a small synthesis step relevant for low-cost, large-scale production. By considering the structural simplicity (primitively based on the molecular weight of its repeating unit) of the 200 thousand virtually generated polymers, together with synthetic accessibility, we focus on copolymers composed of benzoxadiazole as an acceptor and thiophene (or phenylene) as a donor. Although the structures of these copolymers resemble a high-performance simple-structured PTQ10, their structural symmetries (regioregularity) are modified for synthetic reasons. Through the characterization of the synthesized polymers, their OPV devices blended with Y6 NFA, and resultant synthetic complexity scores, we show that our polymer with a minor manual modification of the donor and alkyl chain exhibits a power conversion efficiency of 5.56%, which closely aligns with that predicted by ML and provides a basis for the further development of novel polymers with low synthesis and search costs.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 12","pages":" 3774-3781"},"PeriodicalIF":6.2,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00418g?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145659238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MultiTaskDeltaNet: change detection-based image segmentation for operando ETEM with application to carbon gasification kinetics MultiTaskDeltaNet:基于变化检测的操作ETEM图像分割及其在碳气化动力学中的应用
IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Pub Date : 2025-11-11 DOI: 10.1039/D5DD00333D
Yushuo Niu, Tianyu Li, Yuanyuan Zhu and Qian Yang

Transforming in situ transmission electron microscopy (TEM) imaging into a tool for spatially-resolved operando characterization of solid-state reactions requires automated, high-precision semantic segmentation of dynamically evolving features. However, traditional deep learning methods for semantic segmentation often face limitations due to the scarcity of labeled data, visually ambiguous features of interest, and scenarios involving small objects. To tackle these challenges, we introduce MultiTaskDeltaNet (MTDN), a novel deep learning architecture that creatively reconceptualizes the segmentation task as a change detection problem. By implementing a unique Siamese network with a U-Net backbone and using paired images to capture feature changes, MTDN effectively leverages minimal data to produce high-quality segmentations. Furthermore, MTDN utilizes a multi-task learning strategy to exploit correlations between physical features of interest. In an evaluation using data from in situ environmental TEM (ETEM) videos of filamentous carbon gasification, MTDN demonstrated a significant advantage over conventional segmentation models, particularly in accurately delineating fine structural features. Notably, MTDN achieved a 10.22% performance improvement over conventional segmentation models in predicting small and visually ambiguous physical features. This work bridges key gaps between deep learning and practical TEM image analysis, advancing automated characterization of nanomaterials in complex experimental settings.

将原位透射电子显微镜(TEM)成像转化为固态反应的空间分辨操作分子表征工具,需要对动态演变特征进行自动化、高精度的语义分割。然而,由于标记数据的稀缺性、感兴趣的视觉模糊特征以及涉及小对象的场景,传统的深度学习语义分割方法经常面临局限性。为了应对这些挑战,我们引入了MultiTaskDeltaNet (MTDN),这是一种新颖的深度学习架构,创造性地将分割任务重新定义为变化检测问题。通过使用U-Net骨干网实现独特的暹罗网络,并使用配对图像来捕获特征变化,MTDN有效地利用最少的数据来产生高质量的分割。此外,MTDN利用多任务学习策略来利用感兴趣的物理特征之间的相关性。在使用丝状碳气化现场环境透射电镜(ETEM)视频数据的评估中,MTDN显示出比传统分割模型显著的优势,特别是在准确描绘精细结构特征方面。值得注意的是,在预测小的和视觉上模糊的物理特征方面,MTDN比传统分割模型的性能提高了10.22%。这项工作弥合了深度学习和实际TEM图像分析之间的关键差距,推进了复杂实验环境中纳米材料的自动化表征。
{"title":"MultiTaskDeltaNet: change detection-based image segmentation for operando ETEM with application to carbon gasification kinetics","authors":"Yushuo Niu, Tianyu Li, Yuanyuan Zhu and Qian Yang","doi":"10.1039/D5DD00333D","DOIUrl":"https://doi.org/10.1039/D5DD00333D","url":null,"abstract":"<p >Transforming <em>in situ</em> transmission electron microscopy (TEM) imaging into a tool for spatially-resolved <em>operando</em> characterization of solid-state reactions requires automated, high-precision semantic segmentation of dynamically evolving features. However, traditional deep learning methods for semantic segmentation often face limitations due to the scarcity of labeled data, visually ambiguous features of interest, and scenarios involving small objects. To tackle these challenges, we introduce MultiTaskDeltaNet (MTDN), a novel deep learning architecture that creatively reconceptualizes the segmentation task as a change detection problem. By implementing a unique Siamese network with a U-Net backbone and using paired images to capture feature changes, MTDN effectively leverages minimal data to produce high-quality segmentations. Furthermore, MTDN utilizes a multi-task learning strategy to exploit correlations between physical features of interest. In an evaluation using data from <em>in situ</em> environmental TEM (ETEM) videos of filamentous carbon gasification, MTDN demonstrated a significant advantage over conventional segmentation models, particularly in accurately delineating fine structural features. Notably, MTDN achieved a 10.22% performance improvement over conventional segmentation models in predicting small and visually ambiguous physical features. This work bridges key gaps between deep learning and practical TEM image analysis, advancing automated characterization of nanomaterials in complex experimental settings.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 1","pages":" 290-303"},"PeriodicalIF":6.2,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2026/dd/d5dd00333d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146007003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Digital discovery
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1