首页 > 最新文献

APL Machine Learning最新文献

英文 中文
Sparse subnetwork inference for neural network epistemic uncertainty estimation with improved Hessian approximation 用改进的 Hessian 近似法进行神经网络表观不确定性估计的稀疏子网络推理
Pub Date : 2024-04-05 DOI: 10.1063/5.0193951
Yinsong Chen, Samson Yu, J. Eshraghian, Chee Peng Lim
Despite significant advances in deep neural networks across diverse domains, challenges persist in safety-critical contexts, including domain shift sensitivity and unreliable uncertainty estimation. To address these issues, this study investigates Bayesian learning for uncertainty handling in modern neural networks. However, the high-dimensional, non-convex nature of the posterior distribution poses practical limitations for epistemic uncertainty estimation. The Laplace approximation, as a cost-efficient Bayesian method, offers a practical solution by approximating the posterior as a multivariate normal distribution but faces computational bottlenecks in precise covariance matrix computation and storage. This research employs subnetwork inference, utilizing only a subset of the parameter space for Bayesian inference. In addition, a Kronecker-factored and low-rank representation is explored to reduce space complexity and computational costs. Several corrections are introduced to converge the approximated curvature to the exact Hessian matrix. Numerical results demonstrate the effectiveness and competitiveness of this method, whereas qualitative experiments highlight the impact of Hessian approximation granularity and parameter space utilization in Bayesian inference on mitigating overconfidence in predictions and obtaining high-quality uncertainty estimates.
尽管深度神经网络在不同领域取得了重大进展,但在安全关键型环境中仍然存在挑战,包括领域偏移敏感性和不可靠的不确定性估计。为了解决这些问题,本研究对现代神经网络中的不确定性处理进行了贝叶斯学习研究。然而,后验分布的高维、非凸性质给认识论不确定性估计带来了实际限制。拉普拉斯近似作为一种具有成本效益的贝叶斯方法,通过将后验近似为多元正态分布提供了一种实用的解决方案,但在精确协方差矩阵计算和存储方面面临计算瓶颈。这项研究采用了子网络推断法,只利用参数空间的一个子集进行贝叶斯推断。此外,还探索了克朗克因子和低秩表示法,以降低空间复杂性和计算成本。为了使近似曲率收敛到精确的 Hessian 矩阵,还引入了几种修正方法。数值结果证明了这一方法的有效性和竞争力,而定性实验则强调了贝叶斯推理中的黑森近似粒度和参数空间利用对减轻预测过度自信和获得高质量不确定性估计的影响。
{"title":"Sparse subnetwork inference for neural network epistemic uncertainty estimation with improved Hessian approximation","authors":"Yinsong Chen, Samson Yu, J. Eshraghian, Chee Peng Lim","doi":"10.1063/5.0193951","DOIUrl":"https://doi.org/10.1063/5.0193951","url":null,"abstract":"Despite significant advances in deep neural networks across diverse domains, challenges persist in safety-critical contexts, including domain shift sensitivity and unreliable uncertainty estimation. To address these issues, this study investigates Bayesian learning for uncertainty handling in modern neural networks. However, the high-dimensional, non-convex nature of the posterior distribution poses practical limitations for epistemic uncertainty estimation. The Laplace approximation, as a cost-efficient Bayesian method, offers a practical solution by approximating the posterior as a multivariate normal distribution but faces computational bottlenecks in precise covariance matrix computation and storage. This research employs subnetwork inference, utilizing only a subset of the parameter space for Bayesian inference. In addition, a Kronecker-factored and low-rank representation is explored to reduce space complexity and computational costs. Several corrections are introduced to converge the approximated curvature to the exact Hessian matrix. Numerical results demonstrate the effectiveness and competitiveness of this method, whereas qualitative experiments highlight the impact of Hessian approximation granularity and parameter space utilization in Bayesian inference on mitigating overconfidence in predictions and obtaining high-quality uncertainty estimates.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"35 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140740073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Imaging in double-casing wells with convolutional neural network based on inception module 利用基于起始模块的卷积神经网络进行双套管井成像
Pub Date : 2024-04-05 DOI: 10.1063/5.0191452
Siqi Zhang, Zhoumo Zeng, Xiaocen Wang, Shili Chen, Yang Liu
The evaluation of well integrity in double-casing wells is critical for ensuring well stability, preventing oil and gas leaks, avoiding pollution, and ensuring safety throughout well development and production. However, the current predominant method of assessing cementing quality primarily focuses on single-casing wells, with limited work conducted on double-casing wells. This study introduces a novel approach for evaluating the cementing quality using the Inception module of convolutional neural networks. First, the finite-difference method is employed to generate borehole sonic data corresponding to a variety of model configurations, which are used to train a neural network that learns spatial features from the borehole sonic data to reconstruct the slowness model. By adjusting the network architecture and parameters, it is discovered that a neural network with two blocks and 4096 nodes in the fully connected layer demonstrated the best imaging results and exhibited strong anti-noise capabilities. The proposed method is validated using practical wellbore size models, demonstrating excellent results and offering a more effective means of evaluating wellbore integrity in double-casing wells. In addition, dipole acoustic logging data are used to conduct slowness model imaging of the compressional (P-) wave and shear (S-) wave in double-casing wells to verify the feasibility of cementing quality evaluation. The developed method contributes to more accurate evaluations of wellbore integrity for the oil and gas industry, leading to improved safety and environmental outcomes.
在整个油井开发和生产过程中,评估双套管井的油井完整性对于确保油井稳定性、防止油气泄漏、避免污染以及确保安全至关重要。然而,目前主流的固井质量评估方法主要针对单套管井,针对双套管井的工作十分有限。本研究介绍了一种使用卷积神经网络 Inception 模块评估固井质量的新方法。首先,采用有限差分法生成与各种模型配置相对应的井眼声波数据,并利用这些数据训练神经网络,该网络从井眼声波数据中学习空间特征,从而重建慢度模型。通过调整网络结构和参数,发现在全连接层中有两个区块和 4096 个节点的神经网络具有最佳成像效果和较强的抗噪能力。利用实际井筒尺寸模型对所提出的方法进行了验证,结果表明效果极佳,为评估双套管井的井筒完整性提供了更有效的方法。此外,利用偶极声波测井数据对双套管井中的压缩(P)波和剪切(S)波进行慢度模型成像,以验证固井质量评价的可行性。所开发的方法有助于油气行业更准确地评估井筒完整性,从而改善安全和环境状况。
{"title":"Imaging in double-casing wells with convolutional neural network based on inception module","authors":"Siqi Zhang, Zhoumo Zeng, Xiaocen Wang, Shili Chen, Yang Liu","doi":"10.1063/5.0191452","DOIUrl":"https://doi.org/10.1063/5.0191452","url":null,"abstract":"The evaluation of well integrity in double-casing wells is critical for ensuring well stability, preventing oil and gas leaks, avoiding pollution, and ensuring safety throughout well development and production. However, the current predominant method of assessing cementing quality primarily focuses on single-casing wells, with limited work conducted on double-casing wells. This study introduces a novel approach for evaluating the cementing quality using the Inception module of convolutional neural networks. First, the finite-difference method is employed to generate borehole sonic data corresponding to a variety of model configurations, which are used to train a neural network that learns spatial features from the borehole sonic data to reconstruct the slowness model. By adjusting the network architecture and parameters, it is discovered that a neural network with two blocks and 4096 nodes in the fully connected layer demonstrated the best imaging results and exhibited strong anti-noise capabilities. The proposed method is validated using practical wellbore size models, demonstrating excellent results and offering a more effective means of evaluating wellbore integrity in double-casing wells. In addition, dipole acoustic logging data are used to conduct slowness model imaging of the compressional (P-) wave and shear (S-) wave in double-casing wells to verify the feasibility of cementing quality evaluation. The developed method contributes to more accurate evaluations of wellbore integrity for the oil and gas industry, leading to improved safety and environmental outcomes.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"26 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140739024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Harnessing nonlinear conductive characteristic of TiO2/HfO2 memristor crossbar for implementing parallel vector–matrix multiplication 利用 TiO2/HfO2 晶闸管交叉棒的非线性传导特性实现并行矢量矩阵乘法
Pub Date : 2024-04-04 DOI: 10.1063/5.0195190
Wei Wei, Cong Wang, Chen Pan, Xing-Jian Yangdong, Zaizheng Yang, Yuekun Yang, Bin Cheng, Shi-Jun Liang, Feng Miao
Memristor crossbar arrays are expected to achieve highly energy-efficient neuromorphic computing via implementing parallel vector–matrix multiplication (VMM) in situ. The similarities between memristors and neural synapses offer opportunities for realizing hardware-based brain-inspired computing, such as spike neural networks. However, the nonlinear I–V characteristics of the memristors limit the implementation of parallel VMM on passive memristor crossbar arrays. In our work, we propose to utilize differential conductance as a synaptic weight to implement linear VMM operations on a passive memristor array in parallel. We fabricated a TiO2/HfO2 memristor crossbar array, in which differential-conductance-based synaptic weight exhibits plasticity, nonvolatility, multi-states, and tunable ON/OFF ratio. The noise-dependent accuracy performance of VMM operations based on the proposed approach was evaluated, offering an optimization guideline. Furthermore, we demonstrated a spike neural network circuit capable of processing small spiking signals through the differential-conductance-based synapses. The experimental results showcase effective space-coded and time-coded spike pattern recognition. Importantly, our work opens up new possibilities for the development of passive memristor arrays, leading to increased energy and area efficiency in brain-inspired chips.
通过就地实现并行向量矩阵乘法(VMM),忆阻器横杆阵列有望实现高能效的神经形态计算。忆阻器与神经突触之间的相似性为实现基于硬件的大脑启发计算(如尖峰神经网络)提供了机会。然而,忆阻器的非线性 I-V 特性限制了在无源忆阻器交叉棒阵列上实现并行 VMM。在我们的工作中,我们提议利用差分电导作为突触权重,在被动忆阻器阵列上并行实施线性 VMM 操作。我们制作了一个二氧化钛/二氧化氢忆阻器交叉棒阵列,其中基于差分电导的突触权重具有可塑性、非波动性、多态性和可调的导通/关断比。我们评估了基于所提方法的 VMM 操作随噪声变化的精度性能,为优化提供了指导。此外,我们还展示了一种尖峰神经网络电路,它能够通过基于差分电导的突触处理小尖峰信号。实验结果展示了有效的空间编码和时间编码尖峰模式识别。重要的是,我们的工作为无源忆阻器阵列的开发开辟了新的可能性,从而提高了脑启发芯片的能量和面积效率。
{"title":"Harnessing nonlinear conductive characteristic of TiO2/HfO2 memristor crossbar for implementing parallel vector–matrix multiplication","authors":"Wei Wei, Cong Wang, Chen Pan, Xing-Jian Yangdong, Zaizheng Yang, Yuekun Yang, Bin Cheng, Shi-Jun Liang, Feng Miao","doi":"10.1063/5.0195190","DOIUrl":"https://doi.org/10.1063/5.0195190","url":null,"abstract":"Memristor crossbar arrays are expected to achieve highly energy-efficient neuromorphic computing via implementing parallel vector–matrix multiplication (VMM) in situ. The similarities between memristors and neural synapses offer opportunities for realizing hardware-based brain-inspired computing, such as spike neural networks. However, the nonlinear I–V characteristics of the memristors limit the implementation of parallel VMM on passive memristor crossbar arrays. In our work, we propose to utilize differential conductance as a synaptic weight to implement linear VMM operations on a passive memristor array in parallel. We fabricated a TiO2/HfO2 memristor crossbar array, in which differential-conductance-based synaptic weight exhibits plasticity, nonvolatility, multi-states, and tunable ON/OFF ratio. The noise-dependent accuracy performance of VMM operations based on the proposed approach was evaluated, offering an optimization guideline. Furthermore, we demonstrated a spike neural network circuit capable of processing small spiking signals through the differential-conductance-based synapses. The experimental results showcase effective space-coded and time-coded spike pattern recognition. Importantly, our work opens up new possibilities for the development of passive memristor arrays, leading to increased energy and area efficiency in brain-inspired chips.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"33 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140744586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study of the adsorption sites of high entropy alloys for CO2 reduction using graph convolutional network 利用图卷积网络研究高熵合金的二氧化碳还原吸附点
Pub Date : 2024-04-03 DOI: 10.1063/5.0198043
H. Oliaei, N. Aluru
Carbon dioxide reduction is a major step toward building a cleaner and safer environment. There is a surge of interest in exploring high-entropy alloys (HEAs) as active catalysts for CO2 reduction; however, so far, it is mainly limited to quinary HEAs. Inspired by the successful synthesis of octonary and denary HEAs, herein, the CO2 reduction reaction (CO2RR) performance of an HEA composed of Ag, Au, Cu, Pd, Pt, Co, Ga, Ni, and Zn is studied by developing a high-fidelity graph neural network (GNN) framework. Within this framework, the adsorption site geometry and physics are employed through the featurization of elements. Particularly, featurization is performed using various intrinsic properties, such as electronegativity and atomic radius, to enable not only the supervised learning of CO2RR performance descriptors, namely, CO and H adsorption energies, but also the learning of adsorption physics and generalization to unseen metals and alloys. The developed model evaluates the adsorption strength of ∼3.5 and ∼0.4 billion possible sites for CO and H, respectively. Despite the enormous space of the AgAuCuPdPtCoGaNiZn alloy and the rather small size of the training data, the GNN framework demonstrated high accuracy and good robustness. This study paves the way for the rapid screening and intelligent synthesis of CO2RR-active and selective HEAs.
二氧化碳减排是建设更清洁、更安全环境的重要一步。人们对探索高熵合金(HEAs)作为二氧化碳还原活性催化剂的兴趣日益高涨;然而,迄今为止,这种研究主要局限于二元 HEAs。受成功合成八元和二元 HEA 的启发,本文通过开发高保真图神经网络 (GNN) 框架,研究了由 Ag、Au、Cu、Pd、Pt、Co、Ga、Ni 和 Zn 组成的 HEA 的二氧化碳还原反应 (CO2RR) 性能。在此框架内,通过元素的特征化,采用了吸附位点的几何形状和物理特性。特别是利用电负性和原子半径等各种固有属性进行特征化,不仅实现了 CO2RR 性能描述符(即 CO 和 H 吸附能)的监督学习,还实现了吸附物理学的学习以及对未知金属和合金的泛化。所开发的模型分别评估了 35 亿个和 4 亿个可能的 CO 和 H 吸附位点的吸附强度。尽管 AgAuCuPdPtCoGaNiZn 合金的空间巨大,而且训练数据的规模相当小,但 GNN 框架仍表现出很高的准确性和良好的鲁棒性。这项研究为快速筛选和智能合成具有 CO2RR 活性和选择性的 HEA 铺平了道路。
{"title":"Study of the adsorption sites of high entropy alloys for CO2 reduction using graph convolutional network","authors":"H. Oliaei, N. Aluru","doi":"10.1063/5.0198043","DOIUrl":"https://doi.org/10.1063/5.0198043","url":null,"abstract":"Carbon dioxide reduction is a major step toward building a cleaner and safer environment. There is a surge of interest in exploring high-entropy alloys (HEAs) as active catalysts for CO2 reduction; however, so far, it is mainly limited to quinary HEAs. Inspired by the successful synthesis of octonary and denary HEAs, herein, the CO2 reduction reaction (CO2RR) performance of an HEA composed of Ag, Au, Cu, Pd, Pt, Co, Ga, Ni, and Zn is studied by developing a high-fidelity graph neural network (GNN) framework. Within this framework, the adsorption site geometry and physics are employed through the featurization of elements. Particularly, featurization is performed using various intrinsic properties, such as electronegativity and atomic radius, to enable not only the supervised learning of CO2RR performance descriptors, namely, CO and H adsorption energies, but also the learning of adsorption physics and generalization to unseen metals and alloys. The developed model evaluates the adsorption strength of ∼3.5 and ∼0.4 billion possible sites for CO and H, respectively. Despite the enormous space of the AgAuCuPdPtCoGaNiZn alloy and the rather small size of the training data, the GNN framework demonstrated high accuracy and good robustness. This study paves the way for the rapid screening and intelligent synthesis of CO2RR-active and selective HEAs.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"749 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140749153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational synthesis of a new generation of 2D-based perovskite quantum materials 新一代基于二维的过氧化物量子材料的计算合成
Pub Date : 2024-04-02 DOI: 10.1063/5.0189497
C. Ekuma
Perovskite-based optoelectronic devices have emerged as a promising energy source due to their potential for scalable production. This study introduces “perovskene,” a novel class of 2D materials derived from the ABC3-like perovskites, synthesized via a data-driven, high-throughput computational strategy. We harness machine learning and multitarget deep neural networks to systematically investigate the structure–property relations, paving the way for targeted material design and optimization in fields such as renewable energy, electronics, and catalysis. The characterization of over 1500 synthesized structures shows that more than 500 structures are stable, revealing properties such as ultra-low work function and large magnetic moment, underscoring the potential for advanced technological applications.
由于具有可规模化生产的潜力,基于包光体的光电器件已成为一种前景广阔的能源。本研究介绍了 "perovskene",这是一类新型二维材料,源自 ABC3 类包晶石,是通过数据驱动的高通量计算策略合成的。我们利用机器学习和多目标深度神经网络系统地研究了结构-性能关系,为可再生能源、电子和催化等领域的目标材料设计和优化铺平了道路。对 1500 多种合成结构的表征表明,500 多种结构是稳定的,揭示了超低功函数和大磁矩等特性,彰显了先进技术应用的潜力。
{"title":"Computational synthesis of a new generation of 2D-based perovskite quantum materials","authors":"C. Ekuma","doi":"10.1063/5.0189497","DOIUrl":"https://doi.org/10.1063/5.0189497","url":null,"abstract":"Perovskite-based optoelectronic devices have emerged as a promising energy source due to their potential for scalable production. This study introduces “perovskene,” a novel class of 2D materials derived from the ABC3-like perovskites, synthesized via a data-driven, high-throughput computational strategy. We harness machine learning and multitarget deep neural networks to systematically investigate the structure–property relations, paving the way for targeted material design and optimization in fields such as renewable energy, electronics, and catalysis. The characterization of over 1500 synthesized structures shows that more than 500 structures are stable, revealing properties such as ultra-low work function and large magnetic moment, underscoring the potential for advanced technological applications.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"24 10","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140753357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VERI-D: A new dataset and method for multi-camera vehicle re-identification of damaged cars under varying lighting conditions VERI-D:用于在不同光照条件下对受损汽车进行多摄像头车辆再识别的新数据集和方法
Pub Date : 2024-03-01 DOI: 10.1063/5.0183408
Shao Liu, S. Agaian
Vehicle re-identification (V-ReID) is a critical task that aims to match the same vehicle across images from different camera viewpoints. The previous studies have leveraged attribute clues, such as color, model, and license plate, to enhance the V-ReID performance. However, these methods often lack effective interaction between the global–local features and the final V-ReID objective. Moreover, they do not address the challenging issues in real-world scenarios, such as high viewpoint variations, extreme illumination conditions, and car appearance changes (e.g., due to damage or wrong driving). We propose a novel framework to tackle these problems and advance the research in V-ReID, which can handle various types of car appearance changes and achieve robust V-ReID under varying lighting conditions. Our main contributions are as follows: (i) we propose a new Re-ID architecture named global–local self-attention network, which integrates local information into the feature learning process and enhances the feature representation for V-ReID and (ii) we introduce a novel damaged vehicle Re-ID dataset called VERI-D, which is the first publicly available dataset that focuses on this challenging yet practical scenario. The dataset contains both natural and synthetic images of damaged vehicles captured from multiple camera viewpoints and under different lighting conditions. (iii) We conduct extensive experiments on the VERI-D dataset and demonstrate the effectiveness of our approach in addressing the challenges associated with damaged vehicle re-identification. We also compare our method to several state-of-the-art V-ReID methods and show its superiority.
车辆再识别(V-ReID)是一项关键任务,其目的是在不同摄像机视角的图像中匹配同一辆车。以往的研究利用颜色、车型和车牌等属性线索来提高 V-ReID 性能。然而,这些方法往往缺乏全局-局部特征与最终 V-ReID 目标之间的有效互动。此外,这些方法无法解决现实世界中的挑战性问题,如视角变化大、极端光照条件和汽车外观变化(如损坏或错误驾驶)。我们提出了一个新颖的框架来解决这些问题,并推动了 V-ReID 的研究,该框架可以处理各种类型的汽车外观变化,并在不同光照条件下实现稳健的 V-ReID 技术。我们的主要贡献如下(i) 我们提出了一种名为全局-局部自注意力网络的新型 Re-ID 架构,该架构将局部信息整合到特征学习过程中,并增强了 V-ReID 的特征表示;(ii) 我们引入了一种名为 VERI-D 的新型受损车辆 Re-ID 数据集,这是首个公开可用的数据集,重点关注这一具有挑战性但实用的场景。该数据集包含从多个摄像机视角在不同光照条件下捕捉到的受损车辆的自然图像和合成图像。(iii) 我们在 VERI-D 数据集上进行了大量实验,证明了我们的方法在应对受损车辆再识别相关挑战方面的有效性。我们还将我们的方法与几种最先进的 V-ReID 方法进行了比较,并展示了其优越性。
{"title":"VERI-D: A new dataset and method for multi-camera vehicle re-identification of damaged cars under varying lighting conditions","authors":"Shao Liu, S. Agaian","doi":"10.1063/5.0183408","DOIUrl":"https://doi.org/10.1063/5.0183408","url":null,"abstract":"Vehicle re-identification (V-ReID) is a critical task that aims to match the same vehicle across images from different camera viewpoints. The previous studies have leveraged attribute clues, such as color, model, and license plate, to enhance the V-ReID performance. However, these methods often lack effective interaction between the global–local features and the final V-ReID objective. Moreover, they do not address the challenging issues in real-world scenarios, such as high viewpoint variations, extreme illumination conditions, and car appearance changes (e.g., due to damage or wrong driving). We propose a novel framework to tackle these problems and advance the research in V-ReID, which can handle various types of car appearance changes and achieve robust V-ReID under varying lighting conditions. Our main contributions are as follows: (i) we propose a new Re-ID architecture named global–local self-attention network, which integrates local information into the feature learning process and enhances the feature representation for V-ReID and (ii) we introduce a novel damaged vehicle Re-ID dataset called VERI-D, which is the first publicly available dataset that focuses on this challenging yet practical scenario. The dataset contains both natural and synthetic images of damaged vehicles captured from multiple camera viewpoints and under different lighting conditions. (iii) We conduct extensive experiments on the VERI-D dataset and demonstrate the effectiveness of our approach in addressing the challenges associated with damaged vehicle re-identification. We also compare our method to several state-of-the-art V-ReID methods and show its superiority.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140271683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-enabled probing of irradiation-induced defects in time-series micrographs 利用深度学习探测时间序列显微照片中辐照诱发的缺陷
Pub Date : 2024-03-01 DOI: 10.1063/5.0186046
K. Burns, Kayvon Tadj, Tarun Allaparti, Liliana Arias, Nan Li, A. Aitkaliyeva, Amit Misra, M. Scott, Khalid Hattar
Modeling time-series data with convolutional neural networks (CNNs) requires building a model to learn in batches as opposed to training sequentially. Coupling CNNs with in situ or operando techniques opens the possibility of accurately segmenting dynamic reactions and mass transport phenomena to understand how materials behave under the conditions in which they are used. In this article, in situ ion irradiation transmission electron microscopy (TEM) images are used as inputs into the CNN to assess the defect generation rate, defect cluster density, and saturation of defects. We then use the output segmentation maps to correlate with conventional TEM micrographs to assess the model’s ability to detail nanoscale interactions. Next, we discuss the implications of preprocessing and hyperparameters on model variability, accuracy when expanded to other datasets, and the role of regularization when controlling model variance. Ultimately, we eliminate human bias when extrapolating physical metrics, speed up analysis time, decouple reactions that happen at 100 ms intervals, and deploy models that are both accurate and transferable to similar experiments.
利用卷积神经网络(CNN)对时间序列数据建模,需要建立一个分批学习的模型,而不是按顺序进行训练。将卷积神经网络与原位或操作技术相结合,可以准确地分割动态反应和质量传输现象,从而了解材料在使用条件下的行为。在本文中,原位离子照射透射电子显微镜(TEM)图像被用作 CNN 的输入,以评估缺陷生成率、缺陷群密度和缺陷饱和度。然后,我们使用输出分割图与传统 TEM 显微照片进行关联,以评估该模型详细描述纳米级相互作用的能力。接下来,我们讨论了预处理和超参数对模型变异性的影响、扩展到其他数据集时的准确性以及正则化在控制模型变异性时的作用。最终,我们消除了推断物理指标时的人为偏差,加快了分析时间,解耦了以 100 毫秒间隔发生的反应,并部署了既准确又可移植到类似实验的模型。
{"title":"Deep learning-enabled probing of irradiation-induced defects in time-series micrographs","authors":"K. Burns, Kayvon Tadj, Tarun Allaparti, Liliana Arias, Nan Li, A. Aitkaliyeva, Amit Misra, M. Scott, Khalid Hattar","doi":"10.1063/5.0186046","DOIUrl":"https://doi.org/10.1063/5.0186046","url":null,"abstract":"Modeling time-series data with convolutional neural networks (CNNs) requires building a model to learn in batches as opposed to training sequentially. Coupling CNNs with in situ or operando techniques opens the possibility of accurately segmenting dynamic reactions and mass transport phenomena to understand how materials behave under the conditions in which they are used. In this article, in situ ion irradiation transmission electron microscopy (TEM) images are used as inputs into the CNN to assess the defect generation rate, defect cluster density, and saturation of defects. We then use the output segmentation maps to correlate with conventional TEM micrographs to assess the model’s ability to detail nanoscale interactions. Next, we discuss the implications of preprocessing and hyperparameters on model variability, accuracy when expanded to other datasets, and the role of regularization when controlling model variance. Ultimately, we eliminate human bias when extrapolating physical metrics, speed up analysis time, decouple reactions that happen at 100 ms intervals, and deploy models that are both accurate and transferable to similar experiments.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"543 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140280958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving the mechanical properties of Cantor-like alloys with Bayesian optimization 用贝叶斯优化法改善康托合金的机械性能
Pub Date : 2024-03-01 DOI: 10.1063/5.0179844
Valtteri Torsti, T. Mäkinen, Silvia Bonfanti, J. Koivisto, Mikko J. Alava
The search for better compositions in high entropy alloys is a formidable challenge in materials science. Here, we demonstrate a systematic Bayesian optimization method to enhance the mechanical properties of the paradigmatic five-element Cantor alloy in silico. This method utilizes an automated loop with an online database, a Bayesian optimization algorithm, thermodynamic modeling, and molecular dynamics simulations. Starting from the equiatomic Cantor composition, our approach optimizes the relative fractions of its constituent elements, searching for better compositions while maintaining the thermodynamic phase stability. With 24 steps, we find Fe21Cr20Mn5Co20Ni34 with a yield stress improvement of 58%, and with 72 steps, we find Fe6Cr22Mn5Co32Ni35 where the yield stress has improved by 74%. These optimized compositions correspond to Ni-rich medium entropy alloys with enhanced mechanical properties and superior face-centered-cubic phase stability compared to the traditional equiatomic Cantor alloy. The automatic approach devised here paves the way for designing high entropy alloys with tailored properties, opening avenues for numerous potential applications.
在高熵合金中寻找更好的成分是材料科学中的一项艰巨挑战。在这里,我们展示了一种系统化的贝叶斯优化方法,以提高五元素康托合金在硅学中的机械性能。该方法利用了在线数据库、贝叶斯优化算法、热力学建模和分子动力学模拟的自动循环。我们的方法从等原子康托尔成分开始,优化其组成元素的相对比例,在保持热力学相稳定性的同时寻找更好的成分。通过 24 步优化,我们发现 Fe21Cr20Mn5Co20Ni34 的屈服应力提高了 58%;通过 72 步优化,我们发现 Fe6Cr22Mn5Co32Ni35 的屈服应力提高了 74%。与传统的等熵康托合金相比,这些优化成分对应的富镍中熵合金具有更高的机械性能和优异的面心立方相稳定性。本文设计的自动方法为设计具有定制性能的高熵合金铺平了道路,为众多潜在应用开辟了途径。
{"title":"Improving the mechanical properties of Cantor-like alloys with Bayesian optimization","authors":"Valtteri Torsti, T. Mäkinen, Silvia Bonfanti, J. Koivisto, Mikko J. Alava","doi":"10.1063/5.0179844","DOIUrl":"https://doi.org/10.1063/5.0179844","url":null,"abstract":"The search for better compositions in high entropy alloys is a formidable challenge in materials science. Here, we demonstrate a systematic Bayesian optimization method to enhance the mechanical properties of the paradigmatic five-element Cantor alloy in silico. This method utilizes an automated loop with an online database, a Bayesian optimization algorithm, thermodynamic modeling, and molecular dynamics simulations. Starting from the equiatomic Cantor composition, our approach optimizes the relative fractions of its constituent elements, searching for better compositions while maintaining the thermodynamic phase stability. With 24 steps, we find Fe21Cr20Mn5Co20Ni34 with a yield stress improvement of 58%, and with 72 steps, we find Fe6Cr22Mn5Co32Ni35 where the yield stress has improved by 74%. These optimized compositions correspond to Ni-rich medium entropy alloys with enhanced mechanical properties and superior face-centered-cubic phase stability compared to the traditional equiatomic Cantor alloy. The automatic approach devised here paves the way for designing high entropy alloys with tailored properties, opening avenues for numerous potential applications.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"269 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140275165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physics-agnostic inverse design using transfer matrices 利用传递矩阵进行物理无关的逆设计
Pub Date : 2024-02-28 DOI: 10.1063/5.0179457
Nathaniel Morrison, Shuaiwei Pan, Eric Y. Ma
Inverse design is an application of machine learning to device design, giving the computer maximal latitude in generating novel structures, learning from their performance, and optimizing them to suit the designer’s needs. Gradient-based optimizers, augmented by the adjoint method to efficiently compute the gradient, are particularly attractive for this approach and have proven highly successful with finite-element and finite-difference physics simulators. Here, we extend adjoint optimization to the transfer matrix method, an accurate and efficient simulator for a wide variety of quasi-1D physical phenomena. We leverage this versatility to develop a physics-agnostic inverse design framework and apply it to three distinct problems, each presenting a substantial challenge for conventional design methods: optics, designing a multivariate optical element for compressive sensing; acoustics, designing a high-performance anti-sonar submarine coating; and quantum mechanics, designing a tunable double-bandpass electron energy filter.
逆向设计是机器学习在设备设计中的应用,它赋予计算机最大的自由度来生成新颖的结构,学习它们的性能,并对它们进行优化,以满足设计者的需求。以梯度为基础的优化器,辅以高效计算梯度的邻接法,对这种方法特别有吸引力,并已在有限元和有限差分物理模拟器中取得了巨大成功。在这里,我们将邻接优化扩展到传递矩阵法,这是一种精确高效的模拟器,可用于模拟各种准一维物理现象。我们利用这种多功能性开发了一个物理无关的逆向设计框架,并将其应用于三个不同的问题,每个问题都对传统设计方法提出了巨大挑战:光学,设计用于压缩传感的多变量光学元件;声学,设计高性能反声纳潜艇涂层;量子力学,设计可调谐双带通电子能量滤波器。
{"title":"Physics-agnostic inverse design using transfer matrices","authors":"Nathaniel Morrison, Shuaiwei Pan, Eric Y. Ma","doi":"10.1063/5.0179457","DOIUrl":"https://doi.org/10.1063/5.0179457","url":null,"abstract":"Inverse design is an application of machine learning to device design, giving the computer maximal latitude in generating novel structures, learning from their performance, and optimizing them to suit the designer’s needs. Gradient-based optimizers, augmented by the adjoint method to efficiently compute the gradient, are particularly attractive for this approach and have proven highly successful with finite-element and finite-difference physics simulators. Here, we extend adjoint optimization to the transfer matrix method, an accurate and efficient simulator for a wide variety of quasi-1D physical phenomena. We leverage this versatility to develop a physics-agnostic inverse design framework and apply it to three distinct problems, each presenting a substantial challenge for conventional design methods: optics, designing a multivariate optical element for compressive sensing; acoustics, designing a high-performance anti-sonar submarine coating; and quantum mechanics, designing a tunable double-bandpass electron energy filter.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"43 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140421641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Training self-learning circuits for power-efficient solutions 培训自学习电路,实现高能效解决方案
Pub Date : 2024-02-27 DOI: 10.1063/5.0181382
M. Stern, Sam Dillavou, Dinesh Jayaraman, D. Durian, Andrea J. Liu
As the size and ubiquity of artificial intelligence and computational machine learning models grow, the energy required to train and use them is rapidly becoming economically and environmentally unsustainable. Recent laboratory prototypes of self-learning electronic circuits, such as “physical learning machines,” open the door to analog hardware that directly employs physics to learn desired functions from examples at a low energy cost. In this work, we show that this hardware platform allows for an even further reduction in energy consumption by using good initial conditions and a new learning algorithm. Using analytical calculations, simulations, and experiments, we show that a trade-off emerges when learning dynamics attempt to minimize both the error and the power consumption of the solution—greater power reductions can be achieved at the cost of decreasing solution accuracy. Finally, we demonstrate a practical procedure to weigh the relative importance of error and power minimization, improving the power efficiency given a specific tolerance to error.
随着人工智能和计算机器学习模型规模的扩大和普及,训练和使用这些模型所需的能源在经济上和环境上正迅速变得不可持续。最近的实验室自学习电子电路原型,如 "物理学习机",为模拟硬件打开了一扇门,它可以直接利用物理学,以较低的能源成本从示例中学习所需的功能。在这项工作中,我们表明,通过使用良好的初始条件和新的学习算法,这种硬件平台可以进一步降低能耗。通过分析计算、模拟和实验,我们表明,当学习动态试图同时最小化解决方案的误差和能耗时,就会出现一种权衡--以降低解决方案的准确性为代价,可以实现更高的能耗降低。最后,我们展示了一种实用程序,用于权衡误差最小化和功耗最小化的相对重要性,在特定误差容忍度下提高功耗效率。
{"title":"Training self-learning circuits for power-efficient solutions","authors":"M. Stern, Sam Dillavou, Dinesh Jayaraman, D. Durian, Andrea J. Liu","doi":"10.1063/5.0181382","DOIUrl":"https://doi.org/10.1063/5.0181382","url":null,"abstract":"As the size and ubiquity of artificial intelligence and computational machine learning models grow, the energy required to train and use them is rapidly becoming economically and environmentally unsustainable. Recent laboratory prototypes of self-learning electronic circuits, such as “physical learning machines,” open the door to analog hardware that directly employs physics to learn desired functions from examples at a low energy cost. In this work, we show that this hardware platform allows for an even further reduction in energy consumption by using good initial conditions and a new learning algorithm. Using analytical calculations, simulations, and experiments, we show that a trade-off emerges when learning dynamics attempt to minimize both the error and the power consumption of the solution—greater power reductions can be achieved at the cost of decreasing solution accuracy. Finally, we demonstrate a practical procedure to weigh the relative importance of error and power minimization, improving the power efficiency given a specific tolerance to error.","PeriodicalId":502250,"journal":{"name":"APL Machine Learning","volume":"55 7","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140427750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
APL Machine Learning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1