首页 > 最新文献

Advanced Engineering Informatics最新文献

英文 中文
Hydro-steel structure digital twins: Application in structural health monitoring and maintenance of large-scale reservoir 水工钢结构数字双胞胎:在大型水库结构健康监测和维护中的应用
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102922
Helin Li , Shufeng Zheng , Yonghao Shen , Minghai Han , Rui Zhang , Huadong Zhao
In the context of frequent accidents during hydro-steel structures (HSS) operations due to harsh environments and extended service conditions, a novel approach is proposed to reduce the frequency of structural failure incidents and ensure safe and reliable operation. The approach begins with introducing a comprehensive DT modeling framework. Subsequently, detailed DT modeling and DT-based SHM methods are developed. Finally, a platform with perception, interaction, analysis, and decision-making for intelligent health monitoring and maintenance of HSS is constructed and validated in China’s large-scale reservoir project, Luhun Reservoir. The platform includes functions of condition monitoring, fault feature recognition, health status assessment, and maintenance strategies optimization. The integration of DT technology has led to significant improvements in health monitoring and maintenance quality, which includes data collection, model optimization, comprehensive evaluation, and decision-making. This approach has also demonstrated its effectiveness by reducing the operation and maintenance response time and enhancing the overall efficiency and reliability.
在水工钢结构(HSS)运行过程中,由于恶劣的环境和长时间的使用条件,事故频发,在此背景下,提出了一种新的方法,以减少结构失效事故的发生频率,确保安全、可靠的运行。该方法首先引入了一个全面的 DT 建模框架。随后,开发了详细的 DT 建模和基于 DT 的 SHM 方法。最后,构建了集感知、交互、分析和决策于一体的 HSS 智能健康监测和维护平台,并在中国大型水库项目--鲁浑水库中进行了验证。该平台包括状态监测、故障特征识别、健康状况评估和维护策略优化等功能。DT 技术的集成显著提高了健康监测和维护质量,包括数据收集、模型优化、综合评估和决策。这种方法还缩短了运行和维护响应时间,提高了整体效率和可靠性,从而证明了其有效性。
{"title":"Hydro-steel structure digital twins: Application in structural health monitoring and maintenance of large-scale reservoir","authors":"Helin Li ,&nbsp;Shufeng Zheng ,&nbsp;Yonghao Shen ,&nbsp;Minghai Han ,&nbsp;Rui Zhang ,&nbsp;Huadong Zhao","doi":"10.1016/j.aei.2024.102922","DOIUrl":"10.1016/j.aei.2024.102922","url":null,"abstract":"<div><div>In the context of frequent accidents during hydro-steel structures (HSS) operations due to harsh environments and extended service conditions, a novel approach is proposed to reduce the frequency of structural failure incidents and ensure safe and reliable operation. The approach begins with introducing a comprehensive DT modeling framework. Subsequently, detailed DT modeling and DT-based SHM methods are developed. Finally, a platform with perception, interaction, analysis, and decision-making for intelligent health monitoring and maintenance of HSS is constructed and validated in China’s large-scale reservoir project, Luhun Reservoir. The platform includes functions of condition monitoring, fault feature recognition, health status assessment, and maintenance strategies optimization. The integration of DT technology has led to significant improvements in health monitoring and maintenance quality, which includes data collection, model optimization, comprehensive evaluation, and decision-making. This approach has also demonstrated its effectiveness by reducing the operation and maintenance response time and enhancing the overall efficiency and reliability.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102922"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142586519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic identification of bottlenecks for ambulance passage on urban streets: A deep learning-based approach 自动识别城市道路上救护车通行的瓶颈:基于深度学习的方法
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102931
Shuo Pan , Zhuo Liu , Hai Yan , Ning Chen , Xiaoxiong Zhao , Sichun Li , Frank Witlox
Urban streets exhibit a diverse range of characteristics, with some presenting significant challenges to ambulance passage, directly impacting the safety of residents. Thus, ensuring unimpeded passage for ambulances on streets is a key focus of urban renewal and street governance initiatives. However, the identification of bottlenecks for emergency vehicle passage on urban streets currently relies on labor-intensive and inefficient on-site manual audits. This study proposes a deep learning-based approach to achieve automatic identification of ambulance passage on urban streets. The Vision Transformer network is utilized to construct the classification model of Impassable Narrow Roads, Passable Narrow Roads, and Wide Roads based on street view images. To train and test the constructed models, a specialized dataset is established, consisting of street view images labeled by experienced ambulance drivers. Comparative experiments are conducted to confirm the optimal structure of the model and the necessity of semantic segmentation preprocessing for street view images. To confirm the superiority of the proposed approach, four commonly used deep learning methods, MobileNet, ShuffleNet, SuperViT and DualViT serve as the baseline tests. Experimental results reveal that the model with four-head and one sequential encoder achieves the highest evaluation accuracy at 75.65% among the proposed models on the original dataset, significantly outperforming benchmark models. Meanwhile, the segmentation of street view images improves accuracy to 77.42%, but it reduces computational efficiency from 0.01 to 3 seconds per image. Finally, the optimal model is applied to the area within the Second Ring Road of Beijing as an example to discuss how the deep learning-based approach proposed in this paper supports urban planning practice and emergency medical response. The proposed approach facilitates the rapid and large-scale identification of bottlenecks in urban streets for ambulances with very limited costs, making a significant contribution to the accurate identification of key areas for urban renewal and street governance efforts. The proposed method can further assist emergency vehicle dispatchers and drivers in identifying accessible routes with greater precision during operations, thereby enabling more timely transportation of patients to medical facilities.
城市街道呈现出多种多样的特点,其中一些给救护车通行带来了巨大挑战,直接影响到居民的安全。因此,确保救护车在街道上畅通无阻是城市改造和街道治理行动的重点。然而,目前城市街道急救车通行瓶颈的识别主要依赖于劳动密集型和低效的现场人工审核。本研究提出了一种基于深度学习的方法,以实现自动识别城市街道上的救护车通行情况。利用视觉转换器网络,基于街景图像构建了不可通行的狭窄道路、可通行的狭窄道路和宽阔道路的分类模型。为了训练和测试所构建的模型,建立了一个专门的数据集,由经验丰富的救护车司机标注的街景图像组成。通过对比实验确认了模型的最佳结构以及对街景图像进行语义分割预处理的必要性。为了证实所提方法的优越性,四种常用的深度学习方法:MobileNet、ShuffleNet、SuperViT 和 DualViT 作为基线测试。实验结果表明,在原始数据集上,采用四头和一个顺序编码器的模型达到了 75.65% 的最高评估准确率,明显优于基准模型。同时,街景图像分割的准确率提高到 77.42%,但计算效率却从每幅图像 0.01 秒降低到 3 秒。最后,以北京二环路以内区域为例,讨论了本文提出的基于深度学习的方法如何支持城市规划实践和紧急医疗响应。所提出的方法有助于以非常有限的成本快速、大规模地识别城市街道中的救护车瓶颈,为准确识别城市更新和街道治理工作的重点区域做出了重要贡献。所提出的方法还能进一步帮助急救车调度员和驾驶员在行动中更精确地识别无障碍路线,从而更及时地将病人运送到医疗设施。
{"title":"Automatic identification of bottlenecks for ambulance passage on urban streets: A deep learning-based approach","authors":"Shuo Pan ,&nbsp;Zhuo Liu ,&nbsp;Hai Yan ,&nbsp;Ning Chen ,&nbsp;Xiaoxiong Zhao ,&nbsp;Sichun Li ,&nbsp;Frank Witlox","doi":"10.1016/j.aei.2024.102931","DOIUrl":"10.1016/j.aei.2024.102931","url":null,"abstract":"<div><div>Urban streets exhibit a diverse range of characteristics, with some presenting significant challenges to ambulance passage, directly impacting the safety of residents. Thus, ensuring unimpeded passage for ambulances on streets is a key focus of urban renewal and street governance initiatives. However, the identification of bottlenecks for emergency vehicle passage on urban streets currently relies on labor-intensive and inefficient on-site manual audits. This study proposes a deep learning-based approach to achieve automatic identification of ambulance passage on urban streets. The Vision Transformer network is utilized to construct the classification model of Impassable Narrow Roads, Passable Narrow Roads, and Wide Roads based on street view images. To train and test the constructed models, a specialized dataset is established, consisting of street view images labeled by experienced ambulance drivers. Comparative experiments are conducted to confirm the optimal structure of the model and the necessity of semantic segmentation preprocessing for street view images. To confirm the superiority of the proposed approach, four commonly used deep learning methods, MobileNet, ShuffleNet, SuperViT and DualViT serve as the baseline tests. Experimental results reveal that the model with four-head and one sequential encoder achieves the highest evaluation accuracy at 75.65% among the proposed models on the original dataset, significantly outperforming benchmark models. Meanwhile, the segmentation of street view images improves accuracy to 77.42%, but it reduces computational efficiency from 0.01 to 3 seconds per image. Finally, the optimal model is applied to the area within the Second Ring Road of Beijing as an example to discuss how the deep learning-based approach proposed in this paper supports urban planning practice and emergency medical response. The proposed approach facilitates the rapid and large-scale identification of bottlenecks in urban streets for ambulances with very limited costs, making a significant contribution to the accurate identification of key areas for urban renewal and street governance efforts. The proposed method can further assist emergency vehicle dispatchers and drivers in identifying accessible routes with greater precision during operations, thereby enabling more timely transportation of patients to medical facilities.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102931"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scene information guided aerial photogrammetric mission recomposition towards detailed level building reconstruction 场景信息指导下的航空摄影测量任务重构,实现详细级别的建筑物重建
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102913
Akram Akbar , Chun Liu , Hangbin Wu , Shoujun Jia , Zeran Xu
Real 3D building models have become indispensable data sources for building spatial information bases for smart cities by leveraging structural correlations and rich semantic expressions of real-world scene entities. The essential prerequisite for real 3D reconstruction is real-time and dynamic detailed-level observations. Low-altitude multicopter UAV platforms are optimal for automatic and periodic building scene observations. However, there are still several challenges in UAV-based path planning for real 3D data capture while maintaining the overall fidelity of architectural details due to observational scale variations, surrounding uncertainties, structural complexity, and topological delicacy. We propose a scene information guided aerial photogrammetric mission recomposition method in response to this challenge. Depending on the architectural complexity, the two proposed observation patterns, parallel inspection and surface enveloping, can be recomposed to achieve UAV obstacle avoidance and complete coverage of individual buildings in a restricted space, capturing global surface detail with millimeter resolution and low texture distortion. The virtual simulation environment, which is constructed based on the semantics and elevation values of the surroundings, provides a basis for selecting the observation pattern and optimal flight parameters based on the reconstruction requirements of the building. In order to achieve quality control of 3D reconstruction models, this paper introduces a reconstruction quality assessment scheme consisting of four quantitative evaluation metrics, namely coverage, resolution distribution, texture distortion score, and geometric accuracy, which effectively establishes a close relationship between mission planning and 3D reconstruction. The observation capability of the proposed method is better than other typical observation patterns, obtaining a model of globally homogeneous resolution distribution over the main body of the building, reaching an average level of 7.01 mm and the highest level of 2.12 mm (façade region), which can provide high-quality data for the semantic extraction and instantiation of multiple surface elements of buildings.
利用真实世界场景实体的结构关联和丰富的语义表达,真实三维建筑模型已成为建立智慧城市空间信息库不可或缺的数据源。真实三维重建的基本前提是实时和动态的细节级观测。低空多旋翼无人机平台是自动定期观测建筑场景的最佳选择。然而,由于观测尺度的变化、周围环境的不确定性、结构的复杂性和拓扑的微妙性,在基于无人机的路径规划以获取真实三维数据的同时保持建筑细节的整体保真度方面仍面临着一些挑战。针对这一挑战,我们提出了一种场景信息指导下的航空摄影测量任务重组方法。根据建筑的复杂程度,我们提出的两种观测模式--平行检测和表面包络--可以重新组合,以实现无人机避障和在有限空间内完整覆盖单个建筑,以毫米级分辨率和低纹理失真捕捉全局表面细节。根据周围环境的语义和高程值构建的虚拟仿真环境,为根据建筑物的重建要求选择观测模式和最佳飞行参数提供了依据。为了实现三维重建模型的质量控制,本文介绍了一种由覆盖率、分辨率分布、纹理失真评分和几何精度四个定量评价指标组成的重建质量评估方案,有效地建立了任务规划与三维重建之间的密切关系。该方法的观测能力优于其他典型观测模式,可获得建筑物主体全局均匀分辨率分布的模型,平均分辨率达到 7.01 mm,最高分辨率达到 2.12 mm(立面区域),可为建筑物多表面元素的语义提取和实例化提供高质量数据。
{"title":"Scene information guided aerial photogrammetric mission recomposition towards detailed level building reconstruction","authors":"Akram Akbar ,&nbsp;Chun Liu ,&nbsp;Hangbin Wu ,&nbsp;Shoujun Jia ,&nbsp;Zeran Xu","doi":"10.1016/j.aei.2024.102913","DOIUrl":"10.1016/j.aei.2024.102913","url":null,"abstract":"<div><div>Real 3D building models have become indispensable data sources for building spatial information bases for smart cities by leveraging structural correlations and rich semantic expressions of real-world scene entities. The essential prerequisite for real 3D reconstruction is real-time and dynamic detailed-level observations. Low-altitude multicopter UAV platforms are optimal for automatic and periodic building scene observations. However, there are still several challenges in UAV-based path planning for real 3D data capture while maintaining the overall fidelity of architectural details due to observational scale variations, surrounding uncertainties, structural complexity, and topological delicacy. We propose a scene information guided aerial photogrammetric mission recomposition method in response to this challenge. Depending on the architectural complexity, the two proposed observation patterns, parallel inspection and surface enveloping, can be recomposed to achieve UAV obstacle avoidance and complete coverage of individual buildings in a restricted space, capturing global surface detail with millimeter resolution and low texture distortion. The virtual simulation environment, which is constructed based on the semantics and elevation values of the surroundings, provides a basis for selecting the observation pattern and optimal flight parameters based on the reconstruction requirements of the building. In order to achieve quality control of 3D reconstruction models, this paper introduces a reconstruction quality assessment scheme consisting of four quantitative evaluation metrics, namely coverage, resolution distribution, texture distortion score, and geometric accuracy, which effectively establishes a close relationship between mission planning and 3D reconstruction. The observation capability of the proposed method is better than other typical observation patterns, obtaining a model of globally homogeneous resolution distribution over the main body of the building, reaching an average level of 7.01 mm and the highest level of 2.12 mm (façade region), which can provide high-quality data for the semantic extraction and instantiation of multiple surface elements of buildings.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102913"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
XGBoost-based global sensitivity analysis of ground settlement caused by shield tunneling in dense karst areas 基于 XGBoost 的密集岩溶地区盾构掘进引起的地面沉降全球敏感性分析
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102928
Shifan Qiao , Haoyu Li , S. Thomas Ng , Junkun Tan , Yingyu Tang , Baoquan Cheng
Predicting ground settlement and identifying key influential factors during shield tunneling in dense karst areas presents a significant engineering challenge due to irregular geological conditions and the complex nonlinear interactions among multiple factors. Traditional computational methods and existing machine learning models often lack either accuracy or interpretability, limiting their practical application in such environments. To address this gap, a novel global sensitivity analysis (GSA) framework has been developed, specifically tailored for dense karst areas. This framework integrates eXtreme Gradient Boosting (XGBoost) as an interpretable metamodel enhanced with SHAP analysis and combines it with the Sobol method for comprehensive sensitivity quantification. In addition, this framework incorporates integrated detection methods and karst structural parameters to ensure its applicability in dense karst construction environments. By applying this framework to actual data from the Shenzhen Metro Line 14 project, key tunneling parameters such as synchronous grouting pressure, actual excavation volume, karst cross-section total area, and karst-to-tunnel distance were accurately identified as having a significant impact on ground settlement. This approach fills a critical research gap by providing an interpretable and accurate tool for shield tunneling in dense karst areas, ultimately improving safety and efficiency in these challenging environments.
由于地质条件不规则以及多种因素之间复杂的非线性相互作用,在密集岩溶地区进行盾构隧道掘进时,预测地面沉降并确定关键影响因素是一项重大的工程挑战。传统的计算方法和现有的机器学习模型往往缺乏准确性或可解释性,限制了它们在此类环境中的实际应用。为了弥补这一不足,我们开发了一个新颖的全局灵敏度分析(GSA)框架,专门为密集岩溶地区量身定制。该框架将极梯度提升(XGBoost)作为一种可解释的元模型与 SHAP 分析进行了整合,并将其与 Sobol 方法相结合,以实现全面的灵敏度量化。此外,该框架还结合了综合检测方法和岩溶结构参数,以确保其在密集岩溶施工环境中的适用性。通过将该框架应用于深圳地铁 14 号线项目的实际数据,准确识别了同步注浆压力、实际开挖量、岩溶断面总面积和岩溶到隧道距离等关键隧道参数对地面沉降的重要影响。这种方法填补了一项重要的研究空白,为密集岩溶地区的盾构掘进提供了一种可解释的精确工具,最终提高了这些挑战性环境中的安全性和效率。
{"title":"XGBoost-based global sensitivity analysis of ground settlement caused by shield tunneling in dense karst areas","authors":"Shifan Qiao ,&nbsp;Haoyu Li ,&nbsp;S. Thomas Ng ,&nbsp;Junkun Tan ,&nbsp;Yingyu Tang ,&nbsp;Baoquan Cheng","doi":"10.1016/j.aei.2024.102928","DOIUrl":"10.1016/j.aei.2024.102928","url":null,"abstract":"<div><div>Predicting ground settlement and identifying key influential factors during shield tunneling in dense karst areas presents a significant engineering challenge due to irregular geological conditions and the complex nonlinear interactions among multiple factors. Traditional computational methods and existing machine learning models often lack either accuracy or interpretability, limiting their practical application in such environments. To address this gap, a novel global sensitivity analysis (GSA) framework has been developed, specifically tailored for dense karst areas. This framework integrates eXtreme Gradient Boosting (XGBoost) as an interpretable metamodel enhanced with SHAP analysis and combines it with the Sobol method for comprehensive sensitivity quantification. In addition, this framework incorporates integrated detection methods and karst structural parameters to ensure its applicability in dense karst construction environments. By applying this framework to actual data from the Shenzhen Metro Line 14 project, key tunneling parameters such as synchronous grouting pressure, actual excavation volume, karst cross-section total area, and karst-to-tunnel distance were accurately identified as having a significant impact on ground settlement. This approach fills a critical research gap by providing an interpretable and accurate tool for shield tunneling in dense karst areas, ultimately improving safety and efficiency in these challenging environments.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102928"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Unet-inspired spatial-attention transformer model for segmenting gear tooth surface defects 用于分割齿轮齿面缺陷的 Unet 启发式空间注意变换器模型
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102933
Xin Zhou , Yongchao Zhang , Zhaohui Ren , Tianchuan Mi , Zeyu Jiang , Tianzhuang Yu , Shihua Zhou
Automated vision defect detection is a crucial step in monitoring product quality in industrial production. Despite the widespread utilization of deep learning methods for surface defect identification, several challenges persist in the context of gear applications. Firstly, there is a lack of dedicated defect detection methods specifically tailored for gear tooth surfaces. As surface defects vary in size, the regular single-scale attention computation at each transformer layer tends to compromise spatial information. To address these challenges, we first propose a novel U-shaped spatial-attention transformer model for tooth surface detection. A shunted-window method is introduced to create a pyramid receptive field within a single self-attention layer. This method captures fine-grained features with a small window while preserving coarse-grained features with a larger window. Consequently, this technique enables effective multi-scale information fusion, accommodating objects of different sizes. We curate a dataset of defective samples collected under various working conditions using the CL-100 gear wear machine. Experimental results demonstrate that the proposed model outperforms the state-of-the-art (SOTA) U-shaped SwinUnet by +8.74% AP and +4.40% Sm, while surpassing the excellent defect detection method of ResT-UperNet by +0.63% AP and +4.69% Sm.
自动视觉缺陷检测是监控工业生产中产品质量的关键步骤。尽管深度学习方法已广泛应用于表面缺陷识别,但在齿轮应用方面仍存在一些挑战。首先,缺乏专门针对齿轮齿面的专用缺陷检测方法。由于表面缺陷大小不一,在每个变压器层进行常规的单尺度注意力计算往往会损害空间信息。为了应对这些挑战,我们首先提出了一种用于齿面检测的新型 U 形空间注意力变压器模型。我们引入了一种分流窗口方法,在单个自我注意层内创建一个金字塔形的感受野。这种方法用小窗口捕捉细粒度特征,同时用大窗口保留粗粒度特征。因此,这种技术能有效地进行多尺度信息融合,以适应不同大小的物体。我们利用 CL-100 磨齿机收集了不同工作条件下的缺陷样本数据集。实验结果表明,所提出的模型在 AP 和 Sm 方面分别比最先进(SOTA)的 U 型 SwinUnet 高出+8.74%和+4.40%,同时在 AP 和 Sm 方面分别比 ResT-UperNet 高出+0.63%和+4.69%。
{"title":"A Unet-inspired spatial-attention transformer model for segmenting gear tooth surface defects","authors":"Xin Zhou ,&nbsp;Yongchao Zhang ,&nbsp;Zhaohui Ren ,&nbsp;Tianchuan Mi ,&nbsp;Zeyu Jiang ,&nbsp;Tianzhuang Yu ,&nbsp;Shihua Zhou","doi":"10.1016/j.aei.2024.102933","DOIUrl":"10.1016/j.aei.2024.102933","url":null,"abstract":"<div><div>Automated vision defect detection is a crucial step in monitoring product quality in industrial production. Despite the widespread utilization of deep learning methods for surface defect identification, several challenges persist in the context of gear applications. Firstly, there is a lack of dedicated defect detection methods specifically tailored for gear tooth surfaces. As surface defects vary in size, the regular single-scale attention computation at each transformer layer tends to compromise spatial information. To address these challenges, we first propose a novel U-shaped spatial-attention transformer model for tooth surface detection. A shunted-window method is introduced to create a pyramid receptive field within a single self-attention layer. This method captures fine-grained features with a small window while preserving coarse-grained features with a larger window. Consequently, this technique enables effective multi-scale information fusion, accommodating objects of different sizes. We curate a dataset of defective samples collected under various working conditions using the CL-100 gear wear machine. Experimental results demonstrate that the proposed model outperforms the state-of-the-art (SOTA) U-shaped SwinUnet by +8.74% AP and +4.40% Sm, while surpassing the excellent defect detection method of ResT-UperNet by +0.63% AP and +4.69% Sm.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102933"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142659117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time identification of precursors in commercial aviation using multiple-instance learning 利用多实例学习实时识别商用航空中的前兆物
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102856
Zhiwei Xiang , Zhenxing Gao , Yansong Gao , Yangyang Zhang , Runhao Zhang
This research pioneers the application of precursor concepts to preemptively identify and prevent aviation safety incidents using Machine Learning (ML). Airlines and governing organizations, such as the Federal Aviation Administration (FAA) in the United States, have been trying to prevent safety incidents during routine operations. However, this task is challenging due to the lack of timestep-wise event annotation in flights and the complexity involved in the timely identification of incidents prior to their occurrence. To address these issues, we propose a real-time precursor identification methodology combining Multiple-Instance Learning (MIL) and feature-based Knowledge Distillation (KD) learning. Our two-stage approach, involving deep MIL for labeling and a KD-based model for real-time warnings, demonstrates state-of-the-art performance and a time delay of 2.99ms using a dataset of 23,549 real flights. Further experiments using t-distributed Stochastic Neighbor Embedding (t-SNE) and occlusion method confirm our model’s transparency, enabling the generation of reliable quantitative precursor scores and facilitating reasoning about the causes of safety incidents at the parameter level. Additionally, statistical analysis of precursors reveals varying evolution times for different safety events, which indicates that pilots have at least 8 s to react after receiving a warning. In conclusion, our research provides a theoretical foundation and technical support for the next generation of online risk warning systems, enhancing flight safety and offering a pathway towards more intelligent and secure flight operations.
本研究率先应用前兆概念,利用机器学习(ML)技术预先识别和预防航空安全事故。航空公司和管理机构,如美国联邦航空管理局(FAA),一直在努力预防日常运营中的安全事故。然而,由于缺乏按时间顺序排列的飞行事件注释,以及在事件发生前及时识别事件的复杂性,这项任务极具挑战性。为了解决这些问题,我们提出了一种结合多实例学习(MIL)和基于特征的知识蒸馏(KD)学习的实时前兆识别方法。我们的两阶段方法包括用于标记的深度 MIL 和用于实时警告的基于 KD 的模型,在使用 23,549 次真实航班的数据集时,表现出了一流的性能和 2.99 毫秒的时间延迟。使用 t 分布随机邻域嵌入(t-SNE)和闭塞方法进行的进一步实验证实了我们模型的透明度,能够生成可靠的定量前兆分数,并有助于在参数级别推理安全事故的原因。此外,对前兆的统计分析显示,不同安全事件的演变时间各不相同,这表明飞行员在收到警告后至少有 8 秒钟的反应时间。总之,我们的研究为下一代在线风险预警系统提供了理论基础和技术支持,提高了飞行安全,为实现更智能、更安全的飞行操作提供了途径。
{"title":"Real-time identification of precursors in commercial aviation using multiple-instance learning","authors":"Zhiwei Xiang ,&nbsp;Zhenxing Gao ,&nbsp;Yansong Gao ,&nbsp;Yangyang Zhang ,&nbsp;Runhao Zhang","doi":"10.1016/j.aei.2024.102856","DOIUrl":"10.1016/j.aei.2024.102856","url":null,"abstract":"<div><div>This research pioneers the application of precursor concepts to preemptively identify and prevent aviation safety incidents using Machine Learning (ML). Airlines and governing organizations, such as the Federal Aviation Administration (FAA) in the United States, have been trying to prevent safety incidents during routine operations. However, this task is challenging due to the lack of timestep-wise event annotation in flights and the complexity involved in the timely identification of incidents prior to their occurrence. To address these issues, we propose a real-time precursor identification methodology combining Multiple-Instance Learning (MIL) and feature-based Knowledge Distillation (KD) learning. Our two-stage approach, involving deep MIL for labeling and a KD-based model for real-time warnings, demonstrates state-of-the-art performance and a time delay of 2.99ms using a dataset of 23,549 real flights. Further experiments using t-distributed Stochastic Neighbor Embedding (t-SNE) and occlusion method confirm our model’s transparency, enabling the generation of reliable quantitative precursor scores and facilitating reasoning about the causes of safety incidents at the parameter level. Additionally, statistical analysis of precursors reveals varying evolution times for different safety events, which indicates that pilots have at least 8 s to react after receiving a warning. In conclusion, our research provides a theoretical foundation and technical support for the next generation of online risk warning systems, enhancing flight safety and offering a pathway towards more intelligent and secure flight operations.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102856"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142417049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mutual stacked autoencoder for unsupervised fault detection under complex multi-residual correlations 用于复杂多残差相关性下无监督故障检测的互叠自动编码器
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102837
Jianbo Yu , Zhaomin Lv , Hang Ruan , Shijie Hu , Qingchao Jiang , Xuefeng Yan , Yuping Liu , Xiaofeng Yang
Due to the increasing complexity of variable relationships, fault detection has garnered significant attention, as it is crucial for ensuring industrial safety and engineering reliability. Traditional detection methods can be classified as twofold: global-based and local-based strategies, which respectively focus on mining macro- and micro-level information. However, our theoretical derivation and experiment results reveal that some spurious assumptions, such as local groups and their provided information are mutually independent are implicitly adhered to but are hardly satisfied in unsupervised fault detection under real industrial scenarios. Hence, this study introduces a novel mutual stacked autoencoder (M-SAE) which can be divided into three sub-networks: L-Net, R-Net, and M-Net. L-Net enriches local information learning through multiple local backbones by incorporating the unsupervised clustering algorithm. R-Net, employing a multi-scale attention mechanism, leverages complete local information for residual strength calculation and utilizes local features to capture residual information within the latent feature space. M-Net fuses the multi-scale local feature information to perform a reconstruction for each local. A multitask entropy-aided loss function is introduced to enrich local details, the global structure, and the residual associations. Finally, results on eleven datasets validate the high-performance of the proposed M-SAE and the ablation experiments demonstrate the efficacy of each component in M-SAE, confirming that this research effectively and accurately addresses multivariable industrial fault detection tasks, thereby enabling timely interventions that are crucial for maintaining operational safety in real-world scenarios.
由于变量关系日益复杂,故障检测已引起人们的极大关注,因为它对确保工业安全和工程可靠性至关重要。传统的检测方法可分为两类:基于全局的策略和基于局部的策略,它们分别侧重于挖掘宏观和微观层面的信息。然而,我们的理论推导和实验结果表明,在实际工业场景下的无监督故障检测中,一些虚假的假设,如局部群组及其提供的信息相互独立的假设被隐含地遵守,但却难以满足。因此,本研究引入了一种新型互叠自动编码器(M-SAE),它可分为三个子网络:它可分为三个子网络:L-网络、R-网络和 M-网络。L-Net 结合了无监督聚类算法,通过多个本地骨干网丰富了本地信息的学习。R-Net 采用多尺度关注机制,利用完整的本地信息计算残差强度,并利用本地特征捕捉潜在特征空间中的残差信息。M-Net 融合多尺度局部特征信息,对每个局部进行重构。M-Net 引入了多任务熵辅助损失函数,以丰富局部细节、全局结构和残差关联。最后,11 个数据集的结果验证了所提出的 M-SAE 的高性能,而烧蚀实验则证明了 M-SAE 中每个组件的功效,证实这项研究能有效、准确地解决多变量工业故障检测任务,从而实现及时干预,这对维护现实世界场景中的运行安全至关重要。
{"title":"Mutual stacked autoencoder for unsupervised fault detection under complex multi-residual correlations","authors":"Jianbo Yu ,&nbsp;Zhaomin Lv ,&nbsp;Hang Ruan ,&nbsp;Shijie Hu ,&nbsp;Qingchao Jiang ,&nbsp;Xuefeng Yan ,&nbsp;Yuping Liu ,&nbsp;Xiaofeng Yang","doi":"10.1016/j.aei.2024.102837","DOIUrl":"10.1016/j.aei.2024.102837","url":null,"abstract":"<div><div>Due to the increasing complexity of variable relationships, fault detection has garnered significant attention, as it is crucial for ensuring industrial safety and engineering reliability. Traditional detection methods can be classified as twofold: global-based and local-based strategies, which respectively focus on mining macro- and micro-level information. However, our theoretical derivation and experiment results reveal that some spurious assumptions, such as local groups and their provided information are mutually independent are implicitly adhered to but are hardly satisfied in unsupervised fault detection under real industrial scenarios. Hence, this study introduces a novel mutual stacked autoencoder (M-SAE) which can be divided into three sub-networks: L-Net, R-Net, and M-Net. L-Net enriches local information learning through multiple local backbones by incorporating the unsupervised clustering algorithm. R-Net, employing a multi-scale attention mechanism, leverages complete local information for residual strength calculation and utilizes local features to capture residual information within the latent feature space. M-Net fuses the multi-scale local feature information to perform a reconstruction for each local. A multitask entropy-aided loss function is introduced to enrich local details, the global structure, and the residual associations. Finally, results on eleven datasets validate the high-performance of the proposed M-SAE and the ablation experiments demonstrate the efficacy of each component in M-SAE, confirming that this research effectively and accurately addresses multivariable industrial fault detection tasks, thereby enabling timely interventions that are crucial for maintaining operational safety in real-world scenarios.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102837"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142417050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relational descriptors for retrieving design features in a B-rep model using the similarity-based retrieval approach 使用基于相似性的检索方法检索 B-rep 模型中设计特征的关系描述符
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102877
Changmo Yeo, Sang-Uk Cheon, Seungeun Lim, Jun Hwan Park, Duhwan Mun
Design features refer to local shapes or regions within a part that perform specific functions such as fastening and force transmission. These design features must be identified from product design results to conduct design verification, manufacturing evaluation, and process planning. Design features are formed by combining various form features, which poses a challenge when using existing methods to retrieve individual features. Therefore, this study introduced a relational descriptor that describes the relational characteristics between topological elements to retrieve design features in boundary representation (B-rep) models. In addition, a method to retrieve design features by combining the relational descriptor with shape descriptors was proposed. Experiments were performed to identify specific design features to validate the proposed method. The experimental results successfully retrieved all the design features included in the B-rep model.
设计特征指的是零件内的局部形状或区域,它们具有特定的功能,如紧固和力传递。这些设计特征必须从产品设计结果中识别出来,以便进行设计验证、制造评估和流程规划。设计特征是由各种形状特征组合而成的,这给使用现有方法检索单个特征带来了挑战。因此,本研究引入了一种关系描述符,用于描述拓扑元素之间的关系特征,以检索边界表示(B-rep)模型中的设计特征。此外,还提出了一种通过将关系描述符与形状描述符相结合来检索设计特征的方法。为了验证所提出的方法,进行了识别特定设计特征的实验。实验结果成功检索了 B-rep 模型中包含的所有设计特征。
{"title":"Relational descriptors for retrieving design features in a B-rep model using the similarity-based retrieval approach","authors":"Changmo Yeo,&nbsp;Sang-Uk Cheon,&nbsp;Seungeun Lim,&nbsp;Jun Hwan Park,&nbsp;Duhwan Mun","doi":"10.1016/j.aei.2024.102877","DOIUrl":"10.1016/j.aei.2024.102877","url":null,"abstract":"<div><div>Design features refer to local shapes or regions within a part that perform specific functions such as fastening and force transmission. These design features must be identified from product design results to conduct design verification, manufacturing evaluation, and process planning. Design features are formed by combining various form features, which poses a challenge when using existing methods to retrieve individual features. Therefore, this study introduced a relational descriptor that describes the relational characteristics between topological elements to retrieve design features in boundary representation (B-rep) models. In addition, a method to retrieve design features by combining the relational descriptor with shape descriptors was proposed. Experiments were performed to identify specific design features to validate the proposed method. The experimental results successfully retrieved all the design features included in the B-rep model.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102877"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spectrum-guided GAN with density-directionality sampling: Diverse high-fidelity signal generation for fault diagnosis of rotating machinery 具有密度方向性采样的频谱引导 GAN:为旋转机械故障诊断生成多样化高保真信号
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102821
Taehun Kim , Jin Uk Ko , Jinwook Lee , Yong Chae Kim , Joon Ha Jung , Byeng D. Youn
In the field of fault diagnosis for rotating machinery, where available fault data are limited, numerous studies have employed a generative adversarial network (GAN) for data generation. However, the limited fault data for training GAN exacerbate GAN’s inherent training instability and mode collapse issues, which are induced by adversarial training. Moreover, the stochastic nature of random sampling for latent vectors sampling often results in low-fidelity and poor diversity generation, which negatively affects the fault diagnosis models. To address these issues, this paper presents two novel approaches: a spectrum-guided GAN (SGAN) and density-directionality sampling (DDS). SGAN mitigates training instability and mode collapse through combinatorial data utilization, adversarial spectral loss, and a tailored model structure. DDS ensures the high-fidelity and high-diversity of the generated data by selectively sampling the latent vectors through two steps: density-based filtering and directionality-based sampling in the feature space. Validation on both rotor and rolling element bearing datasets demonstrates that SGAN-DDS considerably improves classification results under the limited fault data. Furthermore, fidelity and diversity analyses are conducted to validate DDS, which increase the credibility of the proposed method; and offer advancement toward the application of deep-learning and GAN in industrial fields.
在旋转机械故障诊断领域,由于可用的故障数据有限,许多研究都采用了生成式对抗网络(GAN)来生成数据。然而,用于训练 GAN 的故障数据有限,这加剧了 GAN 固有的训练不稳定性和模式崩溃问题,而这些问题都是由对抗训练引起的。此外,用于潜在向量采样的随机采样的随机性往往导致生成的数据保真度低、多样性差,从而对故障诊断模型产生负面影响。为了解决这些问题,本文提出了两种新方法:频谱引导 GAN(SGAN)和密度定向采样(DDS)。SGAN 通过组合数据利用、对抗性频谱损失和定制的模型结构来缓解训练不稳定性和模式崩溃。DDS 通过两个步骤对潜在向量进行选择性采样,确保生成数据的高保真和高多样性:在特征空间中进行基于密度的过滤和基于方向性的采样。转子和滚动轴承数据集的验证结果表明,在故障数据有限的情况下,SGAN-DDS 能显著改善分类结果。此外,还进行了保真度和多样性分析来验证 DDS,从而提高了所提方法的可信度,并推动了深度学习和 GAN 在工业领域的应用。
{"title":"Spectrum-guided GAN with density-directionality sampling: Diverse high-fidelity signal generation for fault diagnosis of rotating machinery","authors":"Taehun Kim ,&nbsp;Jin Uk Ko ,&nbsp;Jinwook Lee ,&nbsp;Yong Chae Kim ,&nbsp;Joon Ha Jung ,&nbsp;Byeng D. Youn","doi":"10.1016/j.aei.2024.102821","DOIUrl":"10.1016/j.aei.2024.102821","url":null,"abstract":"<div><div>In the field of fault diagnosis for rotating machinery, where available fault data are limited, numerous studies have employed a generative adversarial network (GAN) for data generation. However, the limited fault data for training GAN exacerbate GAN’s inherent training instability and mode collapse issues, which are induced by adversarial training. Moreover, the stochastic nature of random sampling for latent vectors sampling often results in low-fidelity and poor diversity generation, which negatively affects the fault diagnosis models. To address these issues, this paper presents two novel approaches: a spectrum-guided GAN (SGAN) and density-directionality sampling (DDS). SGAN mitigates training instability and mode collapse through combinatorial data utilization, adversarial spectral loss, and a tailored model structure. DDS ensures the high-fidelity and high-diversity of the generated data by selectively sampling the latent vectors through two steps: density-based filtering and directionality-based sampling in the feature space. Validation on both rotor and rolling element bearing datasets demonstrates that SGAN-DDS considerably improves classification results under the limited fault data. Furthermore, fidelity and diversity analyses are conducted to validate DDS, which increase the credibility of the proposed method; and offer advancement toward the application of deep-learning and GAN in industrial fields.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102821"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142529668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TF-F-GAN: A GAN-based model to predict the assembly physical fields under multi-modal variables fusion on vision transformer TF-F-GAN:基于 GAN 的模型,用于预测视觉变压器多模态变量融合下的装配物理场
IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-01 DOI: 10.1016/j.aei.2024.102871
Yuming Liu , Wencai Yu , Qingyuan Lin , Wei Wang , Ende Ge , Aihua Su , Yong Zhao
Assembly is the final step in ensuring the precision and performance of mechanical products. Geometric variables, process variables, and other material or physical variables during the assembly process can all impact the assembly outcome. Therefore, the key for analyzing and predicting assembly results lies in establishing the mapping relationship between various assembly variables and the results. Traditional analysis methods typically consider the evolution of a single variable in relation to the assembly results and often focus on the value at a few nodes. Essentially, this approach constructs a value-to-value nonlinear mapping model, ignoring the coupling relationships between different variables. However, with the increase in assembly precision requirements and advancements in measurement equipment, assembly analysis has evolved from value-to-value prediction to field-to-field prediction. This shift necessitates the study of the assembly physical field results for specific regions rather than focusing on a few nodes. Therefore, this paper proposes an analysis framework, TF-F-GAN (Transformer-based- Field-Generative adversarial network), which is suitable for multi-source assembly variable inputs and physical field outputs. The framework draws inspiration from multimodal fusion and text-image generation models, leveraging the Vision Transformer (VIT) network to integrate multi-source heterogeneous data from the assembly process. The physical field data is color-mapped into a cloud image format, transforming the physical field prediction into a cloud image generation problem. The CFRP bolted joint structure assembly is used as a case study in this paper. Since assembly accuracy primarily focuses on geometric deformation, the deformation field of key regions in the CFRP bolted joint is taken as the output variable. In the case study, the geometric deviations of parts and mechanical behavior during the assembly process were considered. Data augmentation methods were used to construct the dataset. After training TF-F-GAN on this dataset, transfer learning was further conducted using experimental data. The final prediction error of TF-F-GAN relative to the experimental data was less than 15 %, with a computation time of less than 7 s. This prediction framework can serve as an effective tool for predicting the physical fields of general mechanical product assembly.
装配是确保机械产品精度和性能的最后一步。装配过程中的几何变量、工艺变量和其他材料或物理变量都会影响装配结果。因此,分析和预测装配结果的关键在于建立各种装配变量与结果之间的映射关系。传统的分析方法通常只考虑单个变量与装配结果之间的演变关系,而且往往只关注几个节点的值。从本质上讲,这种方法构建了一个从值到值的非线性映射模型,忽略了不同变量之间的耦合关系。然而,随着装配精度要求的提高和测量设备的进步,装配分析已从值到值预测发展到场到场预测。这种转变要求对特定区域的装配物理场结果进行研究,而不是只关注几个节点。因此,本文提出了一个分析框架 TF-F-GAN(基于变压器的现场生成对抗网络),它适用于多源装配变量输入和物理场输出。该框架从多模态融合和文本图像生成模型中汲取灵感,利用视觉变换器(VIT)网络整合装配过程中的多源异构数据。物理现场数据被彩色映射为云图像格式,从而将物理现场预测转化为云图像生成问题。本文以 CFRP 螺栓连接结构装配为例进行研究。由于装配精度主要关注几何变形,因此将 CFRP 螺栓连接关键区域的变形场作为输出变量。在案例研究中,考虑了部件的几何偏差和装配过程中的机械行为。使用数据增强方法构建数据集。在该数据集上训练 TF-F-GAN 后,使用实验数据进一步进行迁移学习。相对于实验数据,TF-F-GAN 的最终预测误差小于 15%,计算时间小于 7 秒。该预测框架可作为预测一般机械产品装配物理场的有效工具。
{"title":"TF-F-GAN: A GAN-based model to predict the assembly physical fields under multi-modal variables fusion on vision transformer","authors":"Yuming Liu ,&nbsp;Wencai Yu ,&nbsp;Qingyuan Lin ,&nbsp;Wei Wang ,&nbsp;Ende Ge ,&nbsp;Aihua Su ,&nbsp;Yong Zhao","doi":"10.1016/j.aei.2024.102871","DOIUrl":"10.1016/j.aei.2024.102871","url":null,"abstract":"<div><div>Assembly is the final step in ensuring the precision and performance of mechanical products. Geometric variables, process variables, and other material or physical variables during the assembly process can all impact the assembly outcome. Therefore, the key for analyzing and predicting assembly results lies in establishing the mapping relationship between various assembly variables and the results. Traditional analysis methods typically consider the evolution of a single variable in relation to the assembly results and often focus on the value at a few nodes. Essentially, this approach constructs a value-to-value nonlinear mapping model, ignoring the coupling relationships between different variables. However, with the increase in assembly precision requirements and advancements in measurement equipment, assembly analysis has evolved from value-to-value prediction to field-to-field prediction. This shift necessitates the study of the assembly physical field results for specific regions rather than focusing on a few nodes. Therefore, this paper proposes an analysis framework, TF-F-GAN (Transformer-based- Field-Generative adversarial network), which is suitable for multi-source assembly variable inputs and physical field outputs. The framework draws inspiration from multimodal fusion and text-image generation models, leveraging the Vision Transformer (VIT) network to integrate multi-source heterogeneous data from the assembly process. The physical field data is color-mapped into a cloud image format, transforming the physical field prediction into a cloud image generation problem. The CFRP bolted joint structure assembly is used as a case study in this paper. Since assembly accuracy primarily focuses on geometric deformation, the deformation field of key regions in the CFRP bolted joint is taken as the output variable. In the case study, the geometric deviations of parts and mechanical behavior during the assembly process were considered. Data augmentation methods were used to construct the dataset. After training TF-F-GAN on this dataset, transfer learning was further conducted using experimental data. The final prediction error of TF-F-GAN relative to the experimental data was less than 15 %, with a computation time of less than 7 s. This prediction framework can serve as an effective tool for predicting the physical fields of general mechanical product assembly.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102871"},"PeriodicalIF":8.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142532088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Advanced Engineering Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1