首页 > 最新文献

APL Machine Learning最新文献

英文 中文
Computational experiments with cellular-automata generated images reveal intrinsic limitations of convolutional neural networks on pattern recognition tasks 细胞自动生成图像的计算实验揭示了卷积神经网络在模式识别任务中的内在局限性
Pub Date : 2024-07-15 DOI: 10.1063/5.0213905
Weihua Lei, Cleber Zanchettin, Flávio A. O. Santos, Luís A. Nunes Amaral
The extraordinary success of convolutional neural networks (CNNs) in various computer vision tasks has revitalized the field of artificial intelligence. The out-sized expectations created by this extraordinary success have, however, been tempered by a recognition of CNNs’ fragility. Importantly, the magnitude of the problem is unclear due to a lack of rigorous benchmark datasets. Here, we propose a solution to the benchmarking problem that reveals the extent of the vulnerabilities of CNNs and of the methods used to provide interpretability to their predictions. We employ cellular automata (CA) to generate images with rigorously controllable characteristics. CA allow for the definition of both extraordinarily simple and highly complex discrete functions and allow for the generation of boundless datasets of images without repeats. In this work, we systematically investigate the fragility and interpretability of the three popular CNN architectures using CA-generated datasets. We find a sharp transition from a learnable phase to an unlearnable phase as the latent space entropy of the discrete CA functions increases. Furthermore, we demonstrate that shortcut learning is an inherent trait of CNNs. Given a dataset with an easy-to-learn and strongly predictive pattern, CNN will consistently learn the shortcut even if the pattern occurs only on a small fraction of the image. Finally, we show that widely used attribution methods aiming to add interpretability to CNN outputs are strongly CNN-architecture specific and vary widely in their ability to identify input regions of high importance to the model. Our results provide significant insight into the limitations of both CNNs and the approaches developed to add interpretability to their predictions and raise concerns about the types of tasks that should be entrusted to them.
卷积神经网络(CNN)在各种计算机视觉任务中取得了非凡的成功,振兴了人工智能领域。然而,由于认识到卷积神经网络的脆弱性,人们对这一非凡成功产生了过高的期望。重要的是,由于缺乏严格的基准数据集,问题的严重性尚不明确。在此,我们提出了一个基准测试问题的解决方案,以揭示 CNN 的脆弱性程度,以及为其预测提供可解释性的方法。我们采用细胞自动机(CA)生成具有严格可控特征的图像。细胞自动机允许定义异常简单和高度复杂的离散函数,并允许生成无穷无尽的无重复图像数据集。在这项工作中,我们利用 CA 生成的数据集系统地研究了三种流行的 CNN 架构的脆弱性和可解释性。我们发现,随着离散 CA 函数的潜在空间熵的增加,可学习阶段会急剧过渡到不可学习阶段。此外,我们还证明了捷径学习是 CNN 的固有特性。如果数据集具有易于学习且预测性很强的模式,即使该模式只出现在一小部分图像上,CNN 也能持续学习该捷径。最后,我们表明,广泛使用的旨在为 CNN 输出增加可解释性的归因方法具有很强的 CNN 体系结构特性,在识别对模型非常重要的输入区域的能力方面存在很大差异。我们的研究结果让我们深入了解了 CNN 和为增加其预测的可解释性而开发的方法的局限性,并引起了人们对应该委托给它们的任务类型的关注。
{"title":"Computational experiments with cellular-automata generated images reveal intrinsic limitations of convolutional neural networks on pattern recognition tasks","authors":"Weihua Lei, Cleber Zanchettin, Flávio A. O. Santos, Luís A. Nunes Amaral","doi":"10.1063/5.0213905","DOIUrl":"https://doi.org/10.1063/5.0213905","url":null,"abstract":"The extraordinary success of convolutional neural networks (CNNs) in various computer vision tasks has revitalized the field of artificial intelligence. The out-sized expectations created by this extraordinary success have, however, been tempered by a recognition of CNNs’ fragility. Importantly, the magnitude of the problem is unclear due to a lack of rigorous benchmark datasets. Here, we propose a solution to the benchmarking problem that reveals the extent of the vulnerabilities of CNNs and of the methods used to provide interpretability to their predictions. We employ cellular automata (CA) to generate images with rigorously controllable characteristics. CA allow for the definition of both extraordinarily simple and highly complex discrete functions and allow for the generation of boundless datasets of images without repeats. In this work, we systematically investigate the fragility and interpretability of the three popular CNN architectures using CA-generated datasets. We find a sharp transition from a learnable phase to an unlearnable phase as the latent space entropy of the discrete CA functions increases. Furthermore, we demonstrate that shortcut learning is an inherent trait of CNNs. Given a dataset with an easy-to-learn and strongly predictive pattern, CNN will consistently learn the shortcut even if the pattern occurs only on a small fraction of the image. Finally, we show that widely used attribution methods aiming to add interpretability to CNN outputs are strongly CNN-architecture specific and vary widely in their ability to identify input regions of high importance to the model. Our results provide significant insight into the limitations of both CNNs and the approaches developed to add interpretability to their predictions and raise concerns about the types of tasks that should be entrusted to them.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"115 14","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141647072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simulation-trained machine learning models for Lorentz transmission electron microscopy 用于洛伦兹透射电子显微镜的模拟训练机器学习模型
Pub Date : 2024-06-01 DOI: 10.1063/5.0197138
A. McCray, Alec Bender, Amanda Petford-Long, C. Phatak
Understanding the collective behavior of complex spin textures, such as lattices of magnetic skyrmions, is of fundamental importance for exploring and controlling the emergent ordering of these spin textures and inducing phase transitions. It is also critical to understand the skyrmion–skyrmion interactions for applications such as magnetic skyrmion-enabled reservoir or neuromorphic computing. Magnetic skyrmion lattices can be studied using in situ Lorentz transmission electron microscopy (LTEM), but quantitative and statistically robust analysis of the skyrmion lattices from LTEM images can be difficult. In this work, we show that a convolutional neural network, trained on simulated data, can be applied to perform segmentation of spin textures and to extract quantitative data, such as spin texture size and location, from experimental LTEM images, which cannot be obtained manually. This includes quantitative information about skyrmion size, position, and shape, which can, in turn, be used to calculate skyrmion–skyrmion interactions and lattice ordering. We apply this approach to segmenting images of Néel skyrmion lattices so that we can accurately identify skyrmion size and deformation in both dense and sparse lattices. The model is trained using a large set of micromagnetic simulations as well as simulated LTEM images. This entirely open-source training pipeline can be applied to a wide variety of magnetic features and materials, enabling large-scale statistical studies of spin textures using LTEM.
了解复杂自旋纹理(如磁性天丝晶格)的集体行为,对于探索和控制这些自旋纹理的新兴有序性以及诱导相变具有根本性的重要意义。此外,了解磁性天元与天元之间的相互作用对于磁性天元水库或神经形态计算等应用也至关重要。利用原位洛伦兹透射电子显微镜(LTEM)可以研究磁性天空离子晶格,但从 LTEM 图像中对天空离子晶格进行定量和统计分析却很困难。在这项工作中,我们展示了在模拟数据上训练的卷积神经网络可用于对自旋纹理进行分割,并从 LTEM 实验图像中提取自旋纹理大小和位置等定量数据,而这些数据是无法手动获取的。这包括有关自旋微粒大小、位置和形状的定量信息,反过来,这些信息可用于计算自旋微粒与自旋微粒之间的相互作用和晶格有序性。我们将这一方法应用于内尔斯空粒子晶格的图像分割,从而可以准确识别密集和稀疏晶格中的斯空粒子大小和变形。我们使用大量微磁模拟和模拟 LTEM 图像对模型进行了训练。这一完全开源的训练管道可应用于各种磁性特征和材料,从而利用 LTEM 对自旋纹理进行大规模统计研究。
{"title":"Simulation-trained machine learning models for Lorentz transmission electron microscopy","authors":"A. McCray, Alec Bender, Amanda Petford-Long, C. Phatak","doi":"10.1063/5.0197138","DOIUrl":"https://doi.org/10.1063/5.0197138","url":null,"abstract":"Understanding the collective behavior of complex spin textures, such as lattices of magnetic skyrmions, is of fundamental importance for exploring and controlling the emergent ordering of these spin textures and inducing phase transitions. It is also critical to understand the skyrmion–skyrmion interactions for applications such as magnetic skyrmion-enabled reservoir or neuromorphic computing. Magnetic skyrmion lattices can be studied using in situ Lorentz transmission electron microscopy (LTEM), but quantitative and statistically robust analysis of the skyrmion lattices from LTEM images can be difficult. In this work, we show that a convolutional neural network, trained on simulated data, can be applied to perform segmentation of spin textures and to extract quantitative data, such as spin texture size and location, from experimental LTEM images, which cannot be obtained manually. This includes quantitative information about skyrmion size, position, and shape, which can, in turn, be used to calculate skyrmion–skyrmion interactions and lattice ordering. We apply this approach to segmenting images of Néel skyrmion lattices so that we can accurately identify skyrmion size and deformation in both dense and sparse lattices. The model is trained using a large set of micromagnetic simulations as well as simulated LTEM images. This entirely open-source training pipeline can be applied to a wide variety of magnetic features and materials, enabling large-scale statistical studies of spin textures using LTEM.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"130 33","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141281750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced spectrum prediction using deep learning models with multi-frequency supplementary inputs 利用多频率补充输入的深度学习模型加强频谱预测
Pub Date : 2024-05-16 DOI: 10.1063/5.0203931
Xiaohua Xing, Yuqi Ren, Die Zou, Qiankun Zhang, Bingxuan Mao, Jianquan Yao, Deyi Xiong, Liang Wu
Recently, the rapid progress of deep learning techniques has brought unprecedented transformations and innovations across various fields. While neural network-based approaches can effectively encode data and detect underlying patterns of features, the diverse formats and compositions of data in different fields pose challenges in effectively utilizing these data, especially for certain research fields in the early stages of integrating deep learning. Therefore, it is crucial to find more efficient ways to utilize existing datasets. Here, we demonstrate that the predictive accuracy of the network can be improved dramatically by simply adding supplementary multi-frequency inputs to the existing dataset in the target spectrum predicting process. This design methodology paves the way for interdisciplinary research and applications at the interface of deep learning and other fields, such as photonics, composite material design, and biological medicine.
近来,深度学习技术的飞速发展为各个领域带来了前所未有的变革和创新。虽然基于神经网络的方法可以有效地编码数据并检测潜在的特征模式,但不同领域的数据格式和组成各不相同,这给有效利用这些数据带来了挑战,尤其是对某些处于深度学习集成早期阶段的研究领域而言。因此,找到更有效的方法来利用现有数据集至关重要。在此,我们证明了在目标频谱预测过程中,只需在现有数据集上添加补充多频输入,就能显著提高网络的预测精度。这种设计方法为深度学习与其他领域(如光子学、复合材料设计和生物医学)的跨学科研究和应用铺平了道路。
{"title":"Enhanced spectrum prediction using deep learning models with multi-frequency supplementary inputs","authors":"Xiaohua Xing, Yuqi Ren, Die Zou, Qiankun Zhang, Bingxuan Mao, Jianquan Yao, Deyi Xiong, Liang Wu","doi":"10.1063/5.0203931","DOIUrl":"https://doi.org/10.1063/5.0203931","url":null,"abstract":"Recently, the rapid progress of deep learning techniques has brought unprecedented transformations and innovations across various fields. While neural network-based approaches can effectively encode data and detect underlying patterns of features, the diverse formats and compositions of data in different fields pose challenges in effectively utilizing these data, especially for certain research fields in the early stages of integrating deep learning. Therefore, it is crucial to find more efficient ways to utilize existing datasets. Here, we demonstrate that the predictive accuracy of the network can be improved dramatically by simply adding supplementary multi-frequency inputs to the existing dataset in the target spectrum predicting process. This design methodology paves the way for interdisciplinary research and applications at the interface of deep learning and other fields, such as photonics, composite material design, and biological medicine.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"32 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140967580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cell detection with convolutional spiking neural network for neuromorphic cytometry 利用卷积尖峰神经网络进行细胞检测,实现神经形态细胞测定法
Pub Date : 2024-05-08 DOI: 10.1063/5.0199514
Ziyao Zhang, Haoxiang Yang, J. K. Eshraghian, Jiayin Li, Ken-Tye Yong, D. Vigolo, Helen M. McGuire, Omid Kavehei
Imaging flow cytometry (IFC) is an advanced cell-analytic technology offering rich spatial information and fluorescence intensity for multi-parametric characterization. Manual gating in cytometry data enables the classification of discrete populations from the sample based on extracted features. However, this expert-driven technique can be subjective and laborious, often presenting challenges in reproducibility and being inherently limited to bivariate analysis. Numerous AI-driven cell classifications have recently emerged to automate the process of including multivariate data with enhanced reproducibility and accuracy. Our previous work demonstrated the early development of neuromorphic imaging cytometry, evaluating its feasibility in resolving conventional frame-based imaging systems’ limitations in data redundancy, fluorescence sensitivity, and compromised throughput. Herein, we adopted a convolutional spiking neural network (SNN) combined with the YOLOv3 model (SNN-YOLO) to perform cell classification and detection on label-free samples under neuromorphic vision. Spiking techniques are inherently suitable post-processing techniques for neuromorphic vision sensing. The experiment was conducted with polystyrene-based microparticles, THP-1, and LL/2 cell lines. The network’s performance was compared with that of a traditional YOLOv3 model fed with event-generated frame data to serve as a baseline. In this work, our SNN-YOLO outperformed the YOLOv3 baseline by achieving the highest average class accuracy of 0.974, compared to 0.962 for YOLOv3. Both models reported comparable performances across other key metrics and should be further explored for future auto-gating strategies and cytometry applications.
成像流式细胞仪(IFC)是一种先进的细胞分析技术,可提供丰富的空间信息和荧光强度,用于多参数表征。通过对流式细胞仪数据进行手动选通,可根据提取的特征对样本中的离散群体进行分类。然而,这种专家驱动的技术可能比较主观和费力,往往在可重复性方面存在挑战,而且本质上仅限于二变量分析。最近出现了许多人工智能驱动的细胞分类方法,可自动纳入多变量数据,提高可重复性和准确性。我们之前的工作展示了神经形态成像细胞计量学的早期发展,评估了其在解决传统基于帧的成像系统在数据冗余、荧光灵敏度和受影响的吞吐量方面的局限性的可行性。在这里,我们采用卷积尖峰神经网络(SNN)结合 YOLOv3 模型(SNN-YOLO),在神经形态视觉下对无标记样本进行细胞分类和检测。尖峰技术本身就是适合神经形态视觉传感的后处理技术。实验使用基于聚苯乙烯的微颗粒、THP-1 和 LL/2 细胞系进行。该网络的性能与传统 YOLOv3 模型的性能进行了比较,后者以事件生成的帧数据作为基线。在这项工作中,我们的 SNN-YOLO 的表现优于 YOLOv3 基线,达到了最高的平均分类准确率 0.974,而 YOLOv3 为 0.962。这两个模型在其他关键指标上的表现相当,应在未来的自动分级策略和细胞测量应用中进一步探索。
{"title":"Cell detection with convolutional spiking neural network for neuromorphic cytometry","authors":"Ziyao Zhang, Haoxiang Yang, J. K. Eshraghian, Jiayin Li, Ken-Tye Yong, D. Vigolo, Helen M. McGuire, Omid Kavehei","doi":"10.1063/5.0199514","DOIUrl":"https://doi.org/10.1063/5.0199514","url":null,"abstract":"Imaging flow cytometry (IFC) is an advanced cell-analytic technology offering rich spatial information and fluorescence intensity for multi-parametric characterization. Manual gating in cytometry data enables the classification of discrete populations from the sample based on extracted features. However, this expert-driven technique can be subjective and laborious, often presenting challenges in reproducibility and being inherently limited to bivariate analysis. Numerous AI-driven cell classifications have recently emerged to automate the process of including multivariate data with enhanced reproducibility and accuracy. Our previous work demonstrated the early development of neuromorphic imaging cytometry, evaluating its feasibility in resolving conventional frame-based imaging systems’ limitations in data redundancy, fluorescence sensitivity, and compromised throughput. Herein, we adopted a convolutional spiking neural network (SNN) combined with the YOLOv3 model (SNN-YOLO) to perform cell classification and detection on label-free samples under neuromorphic vision. Spiking techniques are inherently suitable post-processing techniques for neuromorphic vision sensing. The experiment was conducted with polystyrene-based microparticles, THP-1, and LL/2 cell lines. The network’s performance was compared with that of a traditional YOLOv3 model fed with event-generated frame data to serve as a baseline. In this work, our SNN-YOLO outperformed the YOLOv3 baseline by achieving the highest average class accuracy of 0.974, compared to 0.962 for YOLOv3. Both models reported comparable performances across other key metrics and should be further explored for future auto-gating strategies and cytometry applications.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":" 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140998464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The development of thermodynamically consistent and physics-informed equation-of-state model through machine learning 通过机器学习开发热力学一致和物理信息的状态方程模型
Pub Date : 2024-05-07 DOI: 10.1063/5.0192447
J. Hinz, Dayou Yu, Deep Shankar Pandey, Hitesh Sapkota, Qi Yu, D. Mihaylov, V. V. Karasiev, S. Hu
Ab initio molecular dynamics (AIMD) simulations have become an important tool used in the construction of equations of state (EOS) tables for warm dense matter. Due to computational costs, only a limited number of system state conditions can be simulated, and the remaining EOS surface must be interpolated for use in radiation-hydrodynamic simulations of experiments. In this work, we develop a thermodynamically consistent EOS model that utilizes a physics-informed machine learning approach to implicitly learn the underlying Helmholtz free-energy from AIMD generated energies and pressures. The model, referred to as PIML-EOS, was trained and tested on warm dense polystyrene producing a fit within a 1% relative error for both energy and pressure and is shown to satisfy both the Maxwell and Gibbs–Duhem relations. In addition, we provide a path toward obtaining thermodynamic quantities, such as the total entropy and chemical potential (containing both ionic and electronic contributions), which are not available from current AIMD simulations.
原子分子动力学(AIMD)模拟已成为构建热致密物质状态方程(EOS)表的重要工具。由于计算成本的原因,只能模拟有限数量的系统状态条件,其余的 EOS 表必须内插到辐射流体力学模拟实验中使用。在这项工作中,我们开发了一种热力学一致的 EOS 模型,该模型利用物理信息机器学习方法,从 AIMD 生成的能量和压力中隐含地学习底层赫尔姆霍兹自由能。该模型被称为 PIML-EOS,在暖致密聚苯乙烯上进行了训练和测试,能量和压力的拟合相对误差均在 1%以内,并证明它同时满足麦克斯韦和吉布斯-杜恒关系。此外,我们还提供了一条获得热力学量的途径,如总熵和化学势(包含离子和电子贡献),这些都是目前的 AIMD 模拟无法获得的。
{"title":"The development of thermodynamically consistent and physics-informed equation-of-state model through machine learning","authors":"J. Hinz, Dayou Yu, Deep Shankar Pandey, Hitesh Sapkota, Qi Yu, D. Mihaylov, V. V. Karasiev, S. Hu","doi":"10.1063/5.0192447","DOIUrl":"https://doi.org/10.1063/5.0192447","url":null,"abstract":"Ab initio molecular dynamics (AIMD) simulations have become an important tool used in the construction of equations of state (EOS) tables for warm dense matter. Due to computational costs, only a limited number of system state conditions can be simulated, and the remaining EOS surface must be interpolated for use in radiation-hydrodynamic simulations of experiments. In this work, we develop a thermodynamically consistent EOS model that utilizes a physics-informed machine learning approach to implicitly learn the underlying Helmholtz free-energy from AIMD generated energies and pressures. The model, referred to as PIML-EOS, was trained and tested on warm dense polystyrene producing a fit within a 1% relative error for both energy and pressure and is shown to satisfy both the Maxwell and Gibbs–Duhem relations. In addition, we provide a path toward obtaining thermodynamic quantities, such as the total entropy and chemical potential (containing both ionic and electronic contributions), which are not available from current AIMD simulations.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"77 s327","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141002337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simulating CO2 diffusivity in rigid and flexible Mg-MOF-74 with machine-learning force fields 利用机器学习力场模拟刚性和柔性 Mg-MOF-74 中的二氧化碳扩散性
Pub Date : 2024-05-07 DOI: 10.1063/5.0190372
Bowen Zheng, Grace X. Gu, Carine Ribeiro dos Santos, Rodrigo Neumann Barros Ferreira, Mathias Steiner, Binquan Luan
The flexibility of metal–organic frameworks (MOFs) affects their gas adsorption and diffusion properties. However, reliable force fields for simulating flexible MOFs are lacking. As a result, most atomistic simulations so far have been carried out assuming rigid MOFs, which inevitably overestimates the gas adsorption energy. Here, we show that this issue can be addressed by applying a machine-learning potential, trained on quantum chemistry data, to atomistic simulations. We find that inclusion of flexibility is particularly important for simulating CO2 chemisorption in MOFs with coordinatively unsaturated metal sites. Specifically, we demonstrate that the diffusion of CO2 in a flexible Mg-MOF-74 structure is about one order of magnitude faster than in a rigid one, challenging the rigid-MOF assumption in previous simulations.
金属有机框架(MOFs)的柔性会影响其气体吸附和扩散特性。然而,目前还缺乏用于模拟柔性 MOF 的可靠力场。因此,迄今为止大多数原子模拟都是在假定 MOFs 具有刚性的情况下进行的,这就不可避免地高估了气体吸附能。在此,我们展示了通过量子化学数据训练的机器学习势能在原子模拟中的应用可以解决这一问题。我们发现,在具有配位不饱和金属位点的 MOFs 中模拟二氧化碳化学吸附时,加入灵活性尤为重要。具体来说,我们证明了二氧化碳在柔性 Mg-MOF-74 结构中的扩散速度比在刚性结构中快约一个数量级,这对之前模拟中的刚性-MOF 假设提出了挑战。
{"title":"Simulating CO2 diffusivity in rigid and flexible Mg-MOF-74 with machine-learning force fields","authors":"Bowen Zheng, Grace X. Gu, Carine Ribeiro dos Santos, Rodrigo Neumann Barros Ferreira, Mathias Steiner, Binquan Luan","doi":"10.1063/5.0190372","DOIUrl":"https://doi.org/10.1063/5.0190372","url":null,"abstract":"The flexibility of metal–organic frameworks (MOFs) affects their gas adsorption and diffusion properties. However, reliable force fields for simulating flexible MOFs are lacking. As a result, most atomistic simulations so far have been carried out assuming rigid MOFs, which inevitably overestimates the gas adsorption energy. Here, we show that this issue can be addressed by applying a machine-learning potential, trained on quantum chemistry data, to atomistic simulations. We find that inclusion of flexibility is particularly important for simulating CO2 chemisorption in MOFs with coordinatively unsaturated metal sites. Specifically, we demonstrate that the diffusion of CO2 in a flexible Mg-MOF-74 structure is about one order of magnitude faster than in a rigid one, challenging the rigid-MOF assumption in previous simulations.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"96 7","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141003530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the optimal design space of transparent perovskite solar cells for four-terminal tandem applications through Pareto front optimization 通过帕累托前沿优化探索四端串联应用透明过氧化物太阳能电池的最佳设计空间
Pub Date : 2024-04-24 DOI: 10.1063/5.0187208
Hu Quee Tan, Xinhai Zhao, Akhil Ambardekar, Erik Birgersson, Hansong Xue
Machine learning algorithms can enhance the design and experimental processing of solar cells, resulting in increased conversion efficiency. In this study, we introduce a novel machine learning-based methodology for optimizing the Pareto front of four-terminal (4T) perovskite-copper indium selenide (CIS) tandem solar cells (TSCs). By training a neural network using the Bayesian regularization-backpropagation algorithm via Hammersley sampling, we achieve high prediction accuracy when testing with unseen data through random sampling. This surrogate model not only reduces computational costs but also potentially enhances device performance, increasing from 29.4% to 30.4% while simultaneously reducing material costs for fabrication by 50%. Comparing experimentally fabricated cells with the predicted optimal cells, the latter show a thinner front contact electrode, charge-carrier transport layer, and back contact electrode. Highly efficient perovskite cells identified from the Pareto front have a perovskite layer thickness ranging from 420 to 580 nm. Further analysis reveals the front contact electrode needs to be thin, while the back contact electrode can have a thickness ranging from 100 to 145 nm and still achieve high efficiency. The charge-carrier transport layers play a crucial role in minimizing interface recombination and ensuring unidirectional current flow. The optimal design space suggests thinner electron and hole transport layer thicknesses of 7 nm, down from 23 to 10 nm, respectively. It indicates a balanced charge-carrier extraction is crucial for an optimized perovskite cell. Overall, the presented methodology and optimized design parameters have the potential to enhance the performance of 4T perovskite/CIS TSC while reducing material fabrication costs.
机器学习算法可以提高太阳能电池的设计和实验处理能力,从而提高转换效率。在本研究中,我们介绍了一种基于机器学习的新方法,用于优化四端子(4T)包晶铜铟硒(CIS)串联太阳能电池(TSCs)的帕累托前沿。通过哈默斯利采样,我们使用贝叶斯正则化-反向传播算法训练了一个神经网络,在通过随机抽样使用未见数据进行测试时,我们实现了很高的预测准确率。这种代理模型不仅降低了计算成本,还可能提高器件性能,使器件性能从 29.4% 提高到 30.4%,同时将制造材料成本降低 50%。实验制作的电池与预测的最佳电池相比,后者的前接触电极、电荷载流子传输层和背接触电极更薄。从帕累托前沿确定的高效包晶电池的包晶层厚度在 420 纳米到 580 纳米之间。进一步分析表明,前接触电极需要很薄,而背接触电极的厚度可在 100 至 145 纳米之间,但仍能实现高效率。电荷载流子传输层在最大限度地减少界面重组和确保单向电流流动方面起着至关重要的作用。最佳设计空间建议将电子和空穴传输层的厚度分别从 23 纳米减至 10 纳米,即减薄 7 纳米。这表明平衡的电荷载流子提取对于优化的过氧化物电池至关重要。总之,所介绍的方法和优化的设计参数有望提高 4T 包晶石/CIS TSC 的性能,同时降低材料制造成本。
{"title":"Exploring the optimal design space of transparent perovskite solar cells for four-terminal tandem applications through Pareto front optimization","authors":"Hu Quee Tan, Xinhai Zhao, Akhil Ambardekar, Erik Birgersson, Hansong Xue","doi":"10.1063/5.0187208","DOIUrl":"https://doi.org/10.1063/5.0187208","url":null,"abstract":"Machine learning algorithms can enhance the design and experimental processing of solar cells, resulting in increased conversion efficiency. In this study, we introduce a novel machine learning-based methodology for optimizing the Pareto front of four-terminal (4T) perovskite-copper indium selenide (CIS) tandem solar cells (TSCs). By training a neural network using the Bayesian regularization-backpropagation algorithm via Hammersley sampling, we achieve high prediction accuracy when testing with unseen data through random sampling. This surrogate model not only reduces computational costs but also potentially enhances device performance, increasing from 29.4% to 30.4% while simultaneously reducing material costs for fabrication by 50%. Comparing experimentally fabricated cells with the predicted optimal cells, the latter show a thinner front contact electrode, charge-carrier transport layer, and back contact electrode. Highly efficient perovskite cells identified from the Pareto front have a perovskite layer thickness ranging from 420 to 580 nm. Further analysis reveals the front contact electrode needs to be thin, while the back contact electrode can have a thickness ranging from 100 to 145 nm and still achieve high efficiency. The charge-carrier transport layers play a crucial role in minimizing interface recombination and ensuring unidirectional current flow. The optimal design space suggests thinner electron and hole transport layer thicknesses of 7 nm, down from 23 to 10 nm, respectively. It indicates a balanced charge-carrier extraction is crucial for an optimized perovskite cell. Overall, the presented methodology and optimized design parameters have the potential to enhance the performance of 4T perovskite/CIS TSC while reducing material fabrication costs.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"100 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140659225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AlGaN/GaN MOS-HEMT enabled optoelectronic artificial synaptic devices for neuromorphic computing 用于神经形态计算的 AlGaN/GaN MOS-HEMT 光电人工突触器件
Pub Date : 2024-04-24 DOI: 10.1063/5.0194083
Jiaxiang Chen, Haitao Du, Haolan Qu, Han Gao, Yitian Gu, Yitai Zhu, Wenbo Ye, Jun Zou, Hongzhi Wang, Xinbo Zou
Artificial optoelectronic synaptic transistors have attracted extensive research interest as an essential component for neuromorphic computing systems and brain emulation applications. However, performance challenges still remain for synaptic devices, including low energy consumption, high integration density, and flexible modulation. Employing trapping and detrapping relaxation, a novel optically stimulated synaptic transistor enabled by the AlGaN/GaN hetero-structure metal-oxide semiconductor high-electron-mobility transistor has been successfully demonstrated in this study. Synaptic functions, including excitatory postsynaptic current (EPSC), paired-pulse facilitation index, and transition from short-term memory to long-term memory, are well mimicked and explicitly investigated. In a single EPSC event, the AlGaN/GaN synaptic transistor shows the characteristics of low energy consumption and a high signal-to-noise ratio. The EPSC of the synaptic transistor can be synergistically modulated by both optical stimulation and gate/drain bias. Moreover, utilizing a convolution neural network, hand-written digit images were used to verify the data preprocessing capability for neuromorphic computing applications.
人工光电突触晶体管作为神经形态计算系统和大脑仿真应用的重要组成部分,已经引起了广泛的研究兴趣。然而,突触器件的性能仍面临挑战,包括低能耗、高集成度和灵活调制。本研究利用捕获和解捕获弛豫,成功演示了由氮化铝/氮化镓异质结构金属氧化物半导体高电子迁移率晶体管实现的新型光刺激突触晶体管。该研究很好地模拟并明确研究了突触功能,包括兴奋性突触后电流(EPSC)、成对脉冲促进指数以及从短期记忆到长期记忆的过渡。在单次 EPSC 事件中,AlGaN/GaN 突触晶体管表现出低能耗和高信噪比的特点。突触晶体管的 EPSC 可同时受到光刺激和栅极/漏极偏置的协同调制。此外,利用卷积神经网络,手写数字图像被用来验证神经形态计算应用的数据预处理能力。
{"title":"AlGaN/GaN MOS-HEMT enabled optoelectronic artificial synaptic devices for neuromorphic computing","authors":"Jiaxiang Chen, Haitao Du, Haolan Qu, Han Gao, Yitian Gu, Yitai Zhu, Wenbo Ye, Jun Zou, Hongzhi Wang, Xinbo Zou","doi":"10.1063/5.0194083","DOIUrl":"https://doi.org/10.1063/5.0194083","url":null,"abstract":"Artificial optoelectronic synaptic transistors have attracted extensive research interest as an essential component for neuromorphic computing systems and brain emulation applications. However, performance challenges still remain for synaptic devices, including low energy consumption, high integration density, and flexible modulation. Employing trapping and detrapping relaxation, a novel optically stimulated synaptic transistor enabled by the AlGaN/GaN hetero-structure metal-oxide semiconductor high-electron-mobility transistor has been successfully demonstrated in this study. Synaptic functions, including excitatory postsynaptic current (EPSC), paired-pulse facilitation index, and transition from short-term memory to long-term memory, are well mimicked and explicitly investigated. In a single EPSC event, the AlGaN/GaN synaptic transistor shows the characteristics of low energy consumption and a high signal-to-noise ratio. The EPSC of the synaptic transistor can be synergistically modulated by both optical stimulation and gate/drain bias. Moreover, utilizing a convolution neural network, hand-written digit images were used to verify the data preprocessing capability for neuromorphic computing applications.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"51 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140664450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-supervised learning of shedding droplet dynamics during steam condensation 蒸汽冷凝过程中脱落液滴动力学的自监督学习
Pub Date : 2024-04-10 DOI: 10.1063/5.0188620
Siavash Khodakarami, Pouya Kabirzadeh, Nenad Miljkovic
Knowledge of condensate shedding droplet dynamics provides important information for the characterization of two-phase heat and mass transfer phenomena. Detecting and segmenting the droplets during shedding requires considerable time and effort if performed manually. Here, we developed a self-supervised deep learning model for segmenting shedding droplets from a variety of dropwise and filmwise condensing surfaces. The model eliminates the need for image annotation by humans in the training step and, therefore, reduces labor significantly. The trained model achieved an average accuracy greater than 0.9 on a new unseen test dataset. After extracting the shedding droplet size and speed, we developed a data-driven model for shedding droplet dynamics based on condensation heat flux and surface properties such as wettability and tube diameter. Our results demonstrate that condensate droplet departure size is both heat flux and tube size dependent and follows different trends based on the condensation mode. The results of this work provide an annotation-free methodology for falling droplet segmentation as well as a statistical understanding of droplet dynamics during condensation.
冷凝液脱落液滴动力学知识为表征两相传热和传质现象提供了重要信息。在冷凝液脱落过程中检测和分割液滴需要大量的时间和精力。在此,我们开发了一种自监督深度学习模型,用于从各种液滴和薄膜冷凝表面分割脱落液滴。该模型在训练步骤中无需人工进行图像标注,因此大大减少了劳动力。经过训练的模型在新的未见测试数据集上的平均准确率超过了 0.9。在提取了脱落液滴的大小和速度后,我们根据冷凝热通量和表面特性(如润湿性和管径)开发了一个数据驱动的脱落液滴动态模型。我们的研究结果表明,冷凝液滴的离去尺寸与热通量和管道尺寸有关,并根据冷凝模式呈现出不同的趋势。这项工作的结果提供了一种无注释的下降液滴分割方法,以及对冷凝过程中液滴动态的统计理解。
{"title":"Self-supervised learning of shedding droplet dynamics during steam condensation","authors":"Siavash Khodakarami, Pouya Kabirzadeh, Nenad Miljkovic","doi":"10.1063/5.0188620","DOIUrl":"https://doi.org/10.1063/5.0188620","url":null,"abstract":"Knowledge of condensate shedding droplet dynamics provides important information for the characterization of two-phase heat and mass transfer phenomena. Detecting and segmenting the droplets during shedding requires considerable time and effort if performed manually. Here, we developed a self-supervised deep learning model for segmenting shedding droplets from a variety of dropwise and filmwise condensing surfaces. The model eliminates the need for image annotation by humans in the training step and, therefore, reduces labor significantly. The trained model achieved an average accuracy greater than 0.9 on a new unseen test dataset. After extracting the shedding droplet size and speed, we developed a data-driven model for shedding droplet dynamics based on condensation heat flux and surface properties such as wettability and tube diameter. Our results demonstrate that condensate droplet departure size is both heat flux and tube size dependent and follows different trends based on the condensation mode. The results of this work provide an annotation-free methodology for falling droplet segmentation as well as a statistical understanding of droplet dynamics during condensation.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"156 20 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140717731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate Gaussian process surrogates for predicting basic structural parameters of refractory non-dilute random alloys 用于预测难熔非稀释随机合金基本结构参数的多变量高斯过程替代物
Pub Date : 2024-04-10 DOI: 10.1063/5.0186045
Cesar Ruiz, Anshu Raj, Shuozhi Xu
Refractory non-dilute random alloys consist of two or more principal refractory metals with complex interactions that modify their basic structural properties such as lattice parameters and elastic constants. Atomistic simulations (ASs) are an effective method to compute such basic structural parameters. However, accurate predictions from ASs are computationally expensive due to the size and number of atomistic structures required. To reduce the computational burden, multivariate Gaussian process regression (MVGPR) is proposed as a surrogate model that only requires computing a small number of configurations for training. The elemental atom percentage in the hyper-spherical coordinates is demonstrated to be an effective feature for surrogate modeling. An additive approximation of the full MVGPR model is also proposed to further reduce computations. To improve surrogate accuracy, active learning is used to select a small number of alloys to simulate. Numerical studies based on AS data show the accuracy of the surrogate methodology and the additive approximation, as well as the effectiveness and robustness of the active learning for selecting new alloy designs to simulate.
难熔非稀释无规合金由两种或两种以上的主要难熔金属组成,其复杂的相互作用改变了它们的基本结构特性,如晶格参数和弹性常数。原子模拟(AS)是计算此类基本结构参数的有效方法。然而,由于需要原子结构的大小和数量,通过原子模拟进行精确预测的计算成本很高。为了减轻计算负担,我们提出了多变量高斯过程回归(MVGPR)作为一种替代模型,它只需要计算少量的构型进行训练。超球面坐标中的元素原子百分比被证明是代用模型的有效特征。此外,还提出了完整 MVGPR 模型的加法近似值,以进一步减少计算量。为了提高代用精度,采用了主动学习方法来选择少量合金进行模拟。基于 AS 数据的数值研究显示了代用方法和加法近似的准确性,以及主动学习在选择新合金设计进行模拟时的有效性和稳健性。
{"title":"Multivariate Gaussian process surrogates for predicting basic structural parameters of refractory non-dilute random alloys","authors":"Cesar Ruiz, Anshu Raj, Shuozhi Xu","doi":"10.1063/5.0186045","DOIUrl":"https://doi.org/10.1063/5.0186045","url":null,"abstract":"Refractory non-dilute random alloys consist of two or more principal refractory metals with complex interactions that modify their basic structural properties such as lattice parameters and elastic constants. Atomistic simulations (ASs) are an effective method to compute such basic structural parameters. However, accurate predictions from ASs are computationally expensive due to the size and number of atomistic structures required. To reduce the computational burden, multivariate Gaussian process regression (MVGPR) is proposed as a surrogate model that only requires computing a small number of configurations for training. The elemental atom percentage in the hyper-spherical coordinates is demonstrated to be an effective feature for surrogate modeling. An additive approximation of the full MVGPR model is also proposed to further reduce computations. To improve surrogate accuracy, active learning is used to select a small number of alloys to simulate. Numerical studies based on AS data show the accuracy of the surrogate methodology and the additive approximation, as well as the effectiveness and robustness of the active learning for selecting new alloy designs to simulate.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"104 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140720480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
APL Machine Learning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1