Advanced Engineering Informatics最新文献_第4页

A Multi-Source Attention Graph Neural Network for modeling long and short-term dependencies in chemical process forecasting 化工过程预测中长、短期依赖关系建模的多源注意图神经网络

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.aei.2026.104395

Jian Long , Bin Wang , Haifei Peng , Hengmin Zhang

Chemical process data exhibit both long-term physical dependencies and short-term dynamic variations due to complex interactions among variables. To simultaneously model these heterogeneous dependencies, this paper proposes a Multi-Source Attention Graph Neural Network (MSAGNN) for soft sensing in chemical processes. MSAGNN adopts a dual-path graph recurrent architecture, where a static graph encodes prior physical relationships, and an adaptive graph structure learning module dynamically captures time-varying correlations from data. A multi-source attention mechanism is further introduced to integrate node and neighborhood information and enhance the representation of spatial–temporal dependencies. The proposed MSAGNN is evaluated on three representative industrial processes, including the Debutanizer Column (DC), the Tennessee Eastman (TE) process, and the Fluid Catalytic Cracking (FCC) unit. Experimental results show that MSAGNN consistently achieves lower RMSE, MAE, and MAPE, and higher R

^{2}

than state-of-the-art deep learning and graph-based models, demonstrating its superior prediction accuracy and robustness. Visualization of the learned dynamic graphs and attention scores indicates that MSAGNN can reveal meaningful variable interactions, confirming the effectiveness and interpretability of the proposed approach for complex chemical processes.

化学过程数据既表现出长期的物理依赖性，又表现出由于变量之间复杂的相互作用而产生的短期动态变化。为了同时对这些异构依赖性进行建模，本文提出了一种用于化工过程软检测的多源注意图神经网络（MSAGNN）。MSAGNN采用双路径图循环架构，其中静态图编码先验物理关系，自适应图结构学习模块动态捕获数据中的时变相关性。进一步引入多源关注机制，整合节点和邻域信息，增强时空依赖关系的表征。在三个具有代表性的工业过程中，包括脱硝塔（DC）、田纳西伊士曼（TE）过程和流体催化裂化（FCC）装置，对所提出的MSAGNN进行了评估。实验结果表明，与最先进的深度学习和基于图的模型相比，MSAGNN的RMSE、MAE和MAPE均较低，R2较高，显示了其优越的预测精度和鲁棒性。学习到的动态图和注意力分数的可视化表明，MSAGNN可以揭示有意义的变量相互作用，证实了所提出的方法在复杂化学过程中的有效性和可解释性。

{"title":"A Multi-Source Attention Graph Neural Network for modeling long and short-term dependencies in chemical process forecasting","authors":"Jian Long , Bin Wang , Haifei Peng , Hengmin Zhang","doi":"10.1016/j.aei.2026.104395","DOIUrl":"10.1016/j.aei.2026.104395","url":null,"abstract":"<div><div>Chemical process data exhibit both long-term physical dependencies and short-term dynamic variations due to complex interactions among variables. To simultaneously model these heterogeneous dependencies, this paper proposes a Multi-Source Attention Graph Neural Network (MSAGNN) for soft sensing in chemical processes. MSAGNN adopts a dual-path graph recurrent architecture, where a static graph encodes prior physical relationships, and an adaptive graph structure learning module dynamically captures time-varying correlations from data. A multi-source attention mechanism is further introduced to integrate node and neighborhood information and enhance the representation of spatial–temporal dependencies. The proposed MSAGNN is evaluated on three representative industrial processes, including the Debutanizer Column (DC), the Tennessee Eastman (TE) process, and the Fluid Catalytic Cracking (FCC) unit. Experimental results show that MSAGNN consistently achieves lower RMSE, MAE, and MAPE, and higher R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> than state-of-the-art deep learning and graph-based models, demonstrating its superior prediction accuracy and robustness. Visualization of the learned dynamic graphs and attention scores indicates that MSAGNN can reveal meaningful variable interactions, confirming the effectiveness and interpretability of the proposed approach for complex chemical processes.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104395"},"PeriodicalIF":9.9,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A 3D geometry adaptive approach for fusing heterogeneous point clouds in construction sites 建筑工地异构点云的三维几何自适应融合方法

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.aei.2026.104398

Yujie Lu , Tao Zhong , Haoyu Deng , Shuo Wang , Chuan Yang , Xianzhong Zhao

High-fidelity 3D modeling of large-scale construction sites requires the fusion of multi-source point clouds to satisfy the diverse requirements of various engineering tasks. A primary challenge in this process is the non-uniform quality of the data, as the relative precision between different sources varies spatially across the site. This variability can cause holistic fusion approaches to erroneously discard high-quality local data. To address this, the presented framework reframes the global fusion challenge into a more tractable problem of local partitioning and optimization. A novel hierarchical partitioning method creates local units that are geometrically adaptive to building components, enhancing both geometric continuity and processing efficiency. The methodology consists of three sequential stages: (1) high-precision registration of all source data; (2) hierarchical partitioning at both architectural and sub-component levels to create basic fusion units; and (3) multi-strategy local filtering based on data quality within each unit. The framework was validated on a challenging case involving the fusion of multi-source, image-based point clouds from a super-tall building site. The results demonstrate a precision improvement of 6.8% for the entire site and up to 31.9% for detailed regions compared to the simply merged data. The methodology enhances the integrity of as-built models and improves the representation of surface textures and structural dimensions, providing a reliable data foundation for downstream construction management tasks such as spatial progress monitoring.

大型建筑工地的高保真三维建模需要多源点云的融合，以满足各种工程任务的多样化需求。这个过程中的一个主要挑战是数据的质量不一致，因为不同来源之间的相对精度在整个站点的空间上是不同的。这种可变性可能导致整体融合方法错误地丢弃高质量的本地数据。为了解决这一问题，本文提出的框架将全局融合问题重构为一个更易于处理的局部划分和优化问题。一种新颖的分层划分方法创建了几何上适应建筑构件的局部单元，提高了几何连续性和处理效率。该方法包括三个连续阶段：(1)所有源数据的高精度配准；(2)在体系结构和子组件层面进行分层划分，创建基本融合单元；(3)基于各单元内数据质量的多策略局部滤波。该框架在一个具有挑战性的案例中得到了验证，该案例涉及来自超高建筑工地的多源、基于图像的点云融合。结果表明，与简单合并的数据相比，整个站点的精度提高了6.8%，详细区域的精度提高了31.9%。该方法提高了竣工模型的完整性，改善了表面纹理和结构维度的表达，为下游施工管理任务（如空间进度监测）提供了可靠的数据基础。

{"title":"A 3D geometry adaptive approach for fusing heterogeneous point clouds in construction sites","authors":"Yujie Lu , Tao Zhong , Haoyu Deng , Shuo Wang , Chuan Yang , Xianzhong Zhao","doi":"10.1016/j.aei.2026.104398","DOIUrl":"10.1016/j.aei.2026.104398","url":null,"abstract":"<div><div>High-fidelity 3D modeling of large-scale construction sites requires the fusion of multi-source point clouds to satisfy the diverse requirements of various engineering tasks. A primary challenge in this process is the non-uniform quality of the data, as the relative precision between different sources varies spatially across the site. This variability can cause holistic fusion approaches to erroneously discard high-quality local data. To address this, the presented framework reframes the global fusion challenge into a more tractable problem of local partitioning and optimization. A novel hierarchical partitioning method creates local units that are geometrically adaptive to building components, enhancing both geometric continuity and processing efficiency. The methodology consists of three sequential stages: (1) high-precision registration of all source data; (2) hierarchical partitioning at both architectural and sub-component levels to create basic fusion units; and (3) multi-strategy local filtering based on data quality within each unit. The framework was validated on a challenging case involving the fusion of multi-source, image-based point clouds from a super-tall building site. The results demonstrate a precision improvement of 6.8% for the entire site and up to 31.9% for detailed regions compared to the simply merged data. The methodology enhances the integrity of as-built models and improves the representation of surface textures and structural dimensions, providing a reliable data foundation for downstream construction management tasks such as spatial progress monitoring.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104398"},"PeriodicalIF":9.9,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146187868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dual-branch interactive fusion network for dam displacement prediction based on parallel temporal representation and gated cross-attention 基于并行时间表征和门控交叉关注的大坝位移预测双分支交互融合网络

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.aei.2026.104393

Qiubing Ren , Ruizhe Liu , Mingchao Li , Zhiyong Qi , Xuhuang Du , Jin Yuan

Accurate dam displacement prediction is vital for optimizing maintenance and ensuring structural safety. Nevertheless, current models often struggle to effectively capture the complex relationships between structural responses and environmental variables, alongside the interactions between temporal dynamics and multivariate data, resulting in suboptimal predictive accuracy. Therefore, we propose a dual-branch interactive fusion network (DBIFN) for dam displacement prediction using parallel temporal representation and gated cross-attention. The dual-branch architecture, which parallelly integrates the enhanced Transformer (eTransformer) and long short-term memory (LSTM), is designed to optimize feature extraction and interaction modeling across multiple dimensions. Specifically, eTransformer is dedicated to extracting features from targeted displacement sequences, while LSTM effectively processes auxiliary environmental dynamics, enabling a comprehensive analysis of underlying patterns within monitoring data. To fully fuse the interpreted temporal features from dual-branch outputs, we introduce a new cross-attention module to utilize the multi-dimensional gated attention unit to efficiently encode them into semantic representations, followed by a Kolmogorov-Arnold network mapping for further representation enhancement. The effectiveness of the proposed model is validated using real-world monitoring datasets collected from a concrete dam project, with experiments conducted across multiple monitoring points. Results demonstrate that DBIFN achieves superior prediction accuracy compared to both single-branch and conventional baseline models. Across all monitoring points, the proposed model can effectively capture temporal variations, attaining an average coefficient of determination of over 0.95 on the test set and outperforming comparative models in most metrics. Furthermore, statistical significance testing confirms the reliability and reproducibility of the results, while computational efficiency is maintained within inference time constraints. These findings offer valuable insights into the practical application of DBIFN-based monitoring models and support informed decision-making.

准确的坝体位移预测对优化维修和保证结构安全至关重要。然而，目前的模型往往难以有效地捕捉结构响应和环境变量之间的复杂关系，以及时间动态和多变量数据之间的相互作用，导致预测精度不理想。因此，我们提出了一个双分支交互融合网络（DBIFN）用于大坝位移预测，该网络采用并行时间表征和门控交叉注意。双分支架构并行集成了增强型变压器（eTransformer）和长短期记忆（LSTM），旨在优化多维特征提取和交互建模。具体来说，eTransformer致力于从目标位移序列中提取特征，而LSTM则有效地处理辅助环境动态，从而能够全面分析监测数据中的潜在模式。为了充分融合来自双分支输出的解释时间特征，我们引入了一个新的交叉注意模块，利用多维门控注意单元将它们有效地编码为语义表示，然后使用Kolmogorov-Arnold网络映射来进一步增强表示。利用从混凝土大坝项目收集的真实监测数据集验证了所提出模型的有效性，并在多个监测点进行了实验。结果表明，与单分支模型和常规基线模型相比，DBIFN具有更高的预测精度。在所有监测点上，所提出的模型可以有效地捕获时间变化，在测试集中获得超过0.95的平均决定系数，并且在大多数度量中优于比较模型。此外，统计显著性检验证实了结果的可靠性和可重复性，同时在推理时间限制内保持了计算效率。这些发现为基于dbifn的监测模型的实际应用提供了有价值的见解，并支持明智的决策。

{"title":"Dual-branch interactive fusion network for dam displacement prediction based on parallel temporal representation and gated cross-attention","authors":"Qiubing Ren , Ruizhe Liu , Mingchao Li , Zhiyong Qi , Xuhuang Du , Jin Yuan","doi":"10.1016/j.aei.2026.104393","DOIUrl":"10.1016/j.aei.2026.104393","url":null,"abstract":"<div><div>Accurate dam displacement prediction is vital for optimizing maintenance and ensuring structural safety. Nevertheless, current models often struggle to effectively capture the complex relationships between structural responses and environmental variables, alongside the interactions between temporal dynamics and multivariate data, resulting in suboptimal predictive accuracy. Therefore, we propose a dual-branch interactive fusion network (DBIFN) for dam displacement prediction using parallel temporal representation and gated cross-attention. The dual-branch architecture, which parallelly integrates the enhanced Transformer (eTransformer) and long short-term memory (LSTM), is designed to optimize feature extraction and interaction modeling across multiple dimensions. Specifically, eTransformer is dedicated to extracting features from targeted displacement sequences, while LSTM effectively processes auxiliary environmental dynamics, enabling a comprehensive analysis of underlying patterns within monitoring data. To fully fuse the interpreted temporal features from dual-branch outputs, we introduce a new cross-attention module to utilize the multi-dimensional gated attention unit to efficiently encode them into semantic representations, followed by a Kolmogorov-Arnold network mapping for further representation enhancement. The effectiveness of the proposed model is validated using real-world monitoring datasets collected from a concrete dam project, with experiments conducted across multiple monitoring points. Results demonstrate that DBIFN achieves superior prediction accuracy compared to both single-branch and conventional baseline models. Across all monitoring points, the proposed model can effectively capture temporal variations, attaining an average coefficient of determination of over 0.95 on the test set and outperforming comparative models in most metrics. Furthermore, statistical significance testing confirms the reliability and reproducibility of the results, while computational efficiency is maintained within inference time constraints. These findings offer valuable insights into the practical application of DBIFN-based monitoring models and support informed decision-making.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104393"},"PeriodicalIF":9.9,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robust VSLAM algorithm based on multi-source information composite constraint model and heterogeneous kernel function optimization 基于多源信息复合约束模型和异构核函数优化的鲁棒VSLAM算法

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.aei.2026.104417

Jiangting Zhao , Xiaoyu Zhang , Zhanfeng Qi , Kunpeng He , Yandong Yang

Visual-inertial navigation, as a widely adopted integrated navigation approach, serves as a key enabling technology for unmanned systems to operate in unknown environments. To address the challenges of degraded adaptability and localization accuracy in practical scenarios, which often result from diverse platform motions and complex environmental conditions, this paper proposes a robust visual simultaneous localization and mapping (VSLAM) algorithm named MSHK-SLAM. The method is built upon a multi-source information composite constraint model and a heterogeneous kernel function optimization. First, an adaptive keyframe (KF) selection strategy is introduced to prevent KF omission caused by abrupt viewpoint changes during motion, while providing more reliable information for back-end processing. Second, an optimization method based on a heterogeneous kernel function using Gaussian–Cauchy mixture correntropy (GCM) is designed to effectively suppress interference from mixed environmental noise, further enhancing system robustness. Experimental results on public datasets demonstrate that MSHK-SLAM outperforms other state-of-the-art algorithms in both adaptability and accuracy under complex environmental conditions.

视觉惯性导航作为一种被广泛采用的组合导航方法，是无人系统在未知环境下运行的关键使能技术。针对实际场景中平台运动变化和复杂环境条件导致的自适应能力和定位精度下降的问题，提出了一种鲁棒视觉同步定位与映射（VSLAM）算法，命名为MSHK-SLAM。该方法建立在多源信息组合约束模型和异构核函数优化的基础上。首先，引入自适应关键帧（KF）选择策略，防止运动过程中视点突然变化导致的关键帧遗漏，同时为后端处理提供更可靠的信息；其次，设计了一种基于高斯-柯西混合熵（GCM）的异构核函数优化方法，有效抑制混合环境噪声的干扰，进一步增强系统的鲁棒性。在公共数据集上的实验结果表明，MSHK-SLAM在复杂环境条件下的自适应性和准确性方面都优于其他最先进的算法。

{"title":"Robust VSLAM algorithm based on multi-source information composite constraint model and heterogeneous kernel function optimization","authors":"Jiangting Zhao , Xiaoyu Zhang , Zhanfeng Qi , Kunpeng He , Yandong Yang","doi":"10.1016/j.aei.2026.104417","DOIUrl":"10.1016/j.aei.2026.104417","url":null,"abstract":"<div><div>Visual-inertial navigation, as a widely adopted integrated navigation approach, serves as a key enabling technology for unmanned systems to operate in unknown environments. To address the challenges of degraded adaptability and localization accuracy in practical scenarios, which often result from diverse platform motions and complex environmental conditions, this paper proposes a robust visual simultaneous localization and mapping (VSLAM) algorithm named MSHK-SLAM. The method is built upon a multi-source information composite constraint model and a heterogeneous kernel function optimization. First, an adaptive keyframe (KF) selection strategy is introduced to prevent KF omission caused by abrupt viewpoint changes during motion, while providing more reliable information for back-end processing. Second, an optimization method based on a heterogeneous kernel function using Gaussian–Cauchy mixture correntropy (GCM) is designed to effectively suppress interference from mixed environmental noise, further enhancing system robustness. Experimental results on public datasets demonstrate that MSHK-SLAM outperforms other state-of-the-art algorithms in both adaptability and accuracy under complex environmental conditions.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104417"},"PeriodicalIF":9.9,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146187867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Appointed-time prescribed performance path tracking control for autonomous vehicles considering initial constraint problem 考虑初始约束问题的自动驾驶车辆指定时间预定性能路径跟踪控制

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.aei.2026.104418

Yang Tian , Lei Pan , Yicai Liu , Yahui Zhang , Yushu Li , Yihong Fan , Xiangyu Wang , Liang Li , Bingxin Ma

Path tracking control (PTC) as a core function of autonomous vehicles, has been widely studied, and many prescribed performance control (PPC) methods have been applied. However, PTC faces challenges such as model nonlinearity, parameter variations, and external disturbances. The initial constraint problem (ICP) also limits PPC, as the tracking error often exceeds the performance constraint boundary at the initial stage and must be regulated back within the permissible set within a reasonable time. This article proposes a model-free PPC scheme that considers ICP, achieving the appointed time performance and adaptive adjustment of envelope curvature. First, the path tracking error is abstracted as a preview error to avoid the complexities of time-varying nonlinearities. An appointed time prescribed performance function is proposed, where the envelope becomes concave to accelerate error convergence for small initial errors and convex to mitigate ICP for large initial errors. Subsequently, this function is integrated with a nonlinear manifold control method to ensure the appointed time stability of the preview error and its derivative. Simulations and vehicle experiment results verify the effectiveness, robustness, and efficiency of the proposed controller.

路径跟踪控制（PTC）作为自动驾驶汽车的核心功能，得到了广泛的研究，并应用了许多规定性能控制（PPC）方法。然而，PTC面临着模型非线性、参数变化和外部干扰等挑战。初始约束问题（ICP）也限制了PPC，因为跟踪误差通常在初始阶段超过性能约束边界，必须在合理的时间内调节回允许的范围内。本文提出了一种考虑ICP的无模型PPC方案，实现了指定时间性能和包络曲率的自适应调节。首先，将路径跟踪误差抽象为预览误差，避免了时变非线性的复杂性；提出了一个指定时间规定的性能函数，其中包络线为凹形以加速小初始误差的收敛，为凸形以减轻大初始误差的ICP。然后，将该函数与非线性流形控制方法相结合，保证了预瞄误差及其导数的指定时间稳定性。仿真和车辆实验结果验证了该控制器的有效性、鲁棒性和高效性。

{"title":"Appointed-time prescribed performance path tracking control for autonomous vehicles considering initial constraint problem","authors":"Yang Tian , Lei Pan , Yicai Liu , Yahui Zhang , Yushu Li , Yihong Fan , Xiangyu Wang , Liang Li , Bingxin Ma","doi":"10.1016/j.aei.2026.104418","DOIUrl":"10.1016/j.aei.2026.104418","url":null,"abstract":"<div><div>Path tracking control (PTC) as a core function of autonomous vehicles, has been widely studied, and many prescribed performance control (PPC) methods have been applied. However, PTC faces challenges such as model nonlinearity, parameter variations, and external disturbances. The initial constraint problem (ICP) also limits PPC, as the tracking error often exceeds the performance constraint boundary at the initial stage and must be regulated back within the permissible set within a reasonable time. This article proposes a model-free PPC scheme that considers ICP, achieving the appointed time performance and adaptive adjustment of envelope curvature. First, the path tracking error is abstracted as a preview error to avoid the complexities of time-varying nonlinearities. An appointed time prescribed performance function is proposed, where the envelope becomes concave to accelerate error convergence for small initial errors and convex to mitigate ICP for large initial errors. Subsequently, this function is integrated with a nonlinear manifold control method to ensure the appointed time stability of the preview error and its derivative. Simulations and vehicle experiment results verify the effectiveness, robustness, and efficiency of the proposed controller.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104418"},"PeriodicalIF":9.9,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robust robotic assembly via hierarchical diffusion policy-guided reinforcement learning 基于分层扩散策略导向强化学习的鲁棒机器人装配

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-05 DOI: 10.1016/j.aei.2026.104399

Yibang Zhou , Xiangkai Li , Yue Yin , Liwei Chen , Haiming Xu , Jiajie Fu , Aoyu Zhou , Jianjun Yi

Robot assembly is a core task in industrial manufacturing systems, and its autonomy and stability have been a research focus. Traditional methods are mostly limited to specific assembly tasks, struggling to robustly accomplish the entire assembly process in unstructured scenarios. This paper proposes a reinforcement learning method based on a hierarchical diffusion policy for robot assembly tasks. The proposed method decomposes the assembly policy space into a high-level task planning controller and a low-level policy controller. The low-level policy controller employs a multimodal conditional diffusion policy, leveraging the environmental perception and task generalization capabilities of visual observations, as well as the contact distribution fitting capability of force observations, to efficiently and robustly accomplish assembly subtasks. The high-level task planning controller learns a task scheduling policy, supporting multi-timescale decision-making in complex dynamic assembly environments. Experimental results demonstrate that the proposed method can stably and efficiently complete assembly tasks on components of different shapes, even in the presence of significant initial pose errors, exhibiting higher learning efficiency and generalization performance compared to traditional methods.

机器人装配是工业制造系统的核心任务，其自主性和稳定性一直是研究的热点。传统方法大多局限于特定的装配任务，难以在非结构化场景中健壮地完成整个装配过程。提出了一种基于分层扩散策略的机器人装配任务强化学习方法。该方法将装配策略空间分解为高级任务规划控制器和低级策略控制器。低层策略控制器采用多模态条件扩散策略，利用视觉观测的环境感知和任务泛化能力以及力观测的接触分布拟合能力，高效鲁棒地完成装配子任务。高级任务规划控制器学习任务调度策略，支持复杂动态装配环境下的多时间尺度决策。实验结果表明，即使存在较大的初始位姿误差，该方法也能稳定高效地完成不同形状部件的装配任务，与传统方法相比，具有更高的学习效率和泛化性能。

{"title":"Robust robotic assembly via hierarchical diffusion policy-guided reinforcement learning","authors":"Yibang Zhou , Xiangkai Li , Yue Yin , Liwei Chen , Haiming Xu , Jiajie Fu , Aoyu Zhou , Jianjun Yi","doi":"10.1016/j.aei.2026.104399","DOIUrl":"10.1016/j.aei.2026.104399","url":null,"abstract":"<div><div>Robot assembly is a core task in industrial manufacturing systems, and its autonomy and stability have been a research focus. Traditional methods are mostly limited to specific assembly tasks, struggling to robustly accomplish the entire assembly process in unstructured scenarios. This paper proposes a reinforcement learning method based on a hierarchical diffusion policy for robot assembly tasks. The proposed method decomposes the assembly policy space into a high-level task planning controller and a low-level policy controller. The low-level policy controller employs a multimodal conditional diffusion policy, leveraging the environmental perception and task generalization capabilities of visual observations, as well as the contact distribution fitting capability of force observations, to efficiently and robustly accomplish assembly subtasks. The high-level task planning controller learns a task scheduling policy, supporting multi-timescale decision-making in complex dynamic assembly environments. Experimental results demonstrate that the proposed method can stably and efficiently complete assembly tasks on components of different shapes, even in the presence of significant initial pose errors, exhibiting higher learning efficiency and generalization performance compared to traditional methods.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104399"},"PeriodicalIF":9.9,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Motion system modeling and acceleration analysis of five-axis hybrid machine tool 五轴混合动力机床运动系统建模及加速度分析

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-04 DOI: 10.1016/j.aei.2026.104368

Guang Yu , Jia Liu , Lai Hu , Anbang Jiang , Liping Wang

Aiming to improve the machining efficiency and accuracy of five-axis hybrid machine tool (FaHMT), the kinematics system of FaHMT was modeled. According to the kinematics model, the velocity mapping model and error transfer model of the FaHMT were obtained by derivation and perturbation differentiation. The three-loop control model of permanent magnet synchronous motor used in machine tools was focused on modeling. Meanwhile, the mapping characteristics of the movement speed of the tool and the feed speed of each driving shaft under different postures of the parallel mechanism were analyzed. In the acceleration experiment, the position where the inflection point of error change appeared at a = 4.6 mm/s². When the acceleration was greater than this value, the X_W axis follow-up error increased significantly. For Z₁, Z₂ and Z₃ axes, their movement positions and errors were consistent basically. The modeling method and practical conclusions analyzed provide effective theoretical support and experimental verification for studying kinematics and acceleration analysis of FaHMT.

为了提高五轴混合动力机床的加工效率和加工精度，对五轴混合动力机床的运动系统进行了建模。在运动学模型的基础上，通过导数和微扰微分，建立了FaHMT的速度映射模型和误差传递模型。重点研究了机床用永磁同步电机三环控制模型的建模。同时，分析了并联机构不同姿态下刀具运动速度与各传动轴进给速度的映射特性。在加速度实验中，误差变化拐点出现的位置为a = 4.6 mm/s2。当加速度大于该值时，XW轴随动误差明显增大。对于Z1、Z2和Z3轴，它们的运动位置和误差基本一致。所分析的建模方法和实际结论为研究FaHMT的运动学和加速度分析提供了有效的理论支持和实验验证。

{"title":"Motion system modeling and acceleration analysis of five-axis hybrid machine tool","authors":"Guang Yu , Jia Liu , Lai Hu , Anbang Jiang , Liping Wang","doi":"10.1016/j.aei.2026.104368","DOIUrl":"10.1016/j.aei.2026.104368","url":null,"abstract":"<div><div>Aiming to improve the machining efficiency and accuracy of five-axis hybrid machine tool (FaHMT), the kinematics system of FaHMT was modeled. According to the kinematics model, the velocity mapping model and error transfer model of the FaHMT were obtained by derivation and perturbation differentiation. The three-loop control model of permanent magnet synchronous motor used in machine tools was focused on modeling. Meanwhile, the mapping characteristics of the movement speed of the tool and the feed speed of each driving shaft under different postures of the parallel mechanism were analyzed. In the acceleration experiment, the position where the inflection point of error change appeared at <em>a</em> = 4.6 mm/s<sup>2</sup>. When the acceleration was greater than this value, the <em>X<sub>W</sub></em> axis follow-up error increased significantly. For <em>Z</em><sub>1</sub>, <em>Z</em><sub>2</sub> and <em>Z</em><sub>3</sub> axes, their movement positions and errors were consistent basically. The modeling method and practical conclusions analyzed provide effective theoretical support and experimental verification for studying kinematics and acceleration analysis of FaHMT.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104368"},"PeriodicalIF":9.9,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

BearGen: LLM-guided signal generation framework for bearing fault diagnosis BearGen：基于llm的轴承故障诊断信号生成框架

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-04 DOI: 10.1016/j.aei.2026.104400

Jaeyoung Lee , Hyuna Jeon , Uiin Kim , Misuk Kim

Signal data are essential for condition monitoring, fault diagnosis, and decision-making across industrial domains, and research leveraging signal data has been actively pursued in areas such as healthcare and manufacturing. However, acquiring such data is costly and difficult due to factors such as the risk of equipment damage, the need for expert labeling, and the scarcity of fault data. Moreover, collected data often contain sensitive operational information, making sharing difficult, and enterprises are restricted from using high-performance models hosted on external servers due to security concerns. To address these challenges, we propose BearGen, a novel framework that combines the strong generative capabilities of Large Language Models (LLMs) with the precise data distribution learning of diffusion models to synthesize high-quality signal data in on-premise environments. BearGen first employs an LLM to generate descriptions of existing signals and then conditions a description-guided diffusion model on these descriptions to generate high-quality synthetic signals. We evaluated BearGen on eight publicly available bearing fault diagnosis datasets, and the results showed superior performance compared to existing approaches. In addition, we experimentally validated the reliability and usefulness of the generated signal descriptions. Further experiments under conditions simulating real industrial environments — such as limited data availability and severe data imbalance — verified the practical applicability of the framework. By operating in on-premise environments, BearGen resolves data security concerns while alleviating data scarcity and imbalance. Furthermore, by providing natural language descriptions, it enhances interpretability and offers significant potential for decision support in real-world industrial applications.

信号数据对于工业领域的状态监测、故障诊断和决策至关重要，利用信号数据的研究已在医疗保健和制造业等领域得到积极开展。然而，由于设备损坏的风险、需要专家标记以及故障数据的稀缺性等因素，获取此类数据既昂贵又困难。此外，收集的数据通常包含敏感的操作信息，这使得共享变得困难，而且出于安全考虑，企业被限制使用托管在外部服务器上的高性能模型。为了应对这些挑战，我们提出了BearGen，这是一个将大型语言模型（llm）的强大生成能力与扩散模型的精确数据分布学习相结合的新框架，可以在内部部署环境中合成高质量的信号数据。BearGen首先使用LLM生成现有信号的描述，然后在这些描述上条件描述引导扩散模型以生成高质量的合成信号。我们在8个公开可用的轴承故障诊断数据集上对BearGen进行了评估，结果显示，与现有方法相比，BearGen的性能更好。此外，我们通过实验验证了生成的信号描述的可靠性和实用性。在模拟真实工业环境的条件下进行的进一步实验-例如有限的数据可用性和严重的数据不平衡-验证了该框架的实际适用性。通过在内部部署环境中运行，BearGen解决了数据安全问题，同时缓解了数据稀缺和不平衡。此外，通过提供自然语言描述，它增强了可解释性，并为实际工业应用中的决策支持提供了重要的潜力。

{"title":"BearGen: LLM-guided signal generation framework for bearing fault diagnosis","authors":"Jaeyoung Lee , Hyuna Jeon , Uiin Kim , Misuk Kim","doi":"10.1016/j.aei.2026.104400","DOIUrl":"10.1016/j.aei.2026.104400","url":null,"abstract":"<div><div>Signal data are essential for condition monitoring, fault diagnosis, and decision-making across industrial domains, and research leveraging signal data has been actively pursued in areas such as healthcare and manufacturing. However, acquiring such data is costly and difficult due to factors such as the risk of equipment damage, the need for expert labeling, and the scarcity of fault data. Moreover, collected data often contain sensitive operational information, making sharing difficult, and enterprises are restricted from using high-performance models hosted on external servers due to security concerns. To address these challenges, we propose <span>BearGen</span>, a novel framework that combines the strong generative capabilities of Large Language Models (LLMs) with the precise data distribution learning of diffusion models to synthesize high-quality signal data in on-premise environments. <span>BearGen</span> first employs an LLM to generate descriptions of existing signals and then conditions a description-guided diffusion model on these descriptions to generate high-quality synthetic signals. We evaluated <span>BearGen</span> on eight publicly available bearing fault diagnosis datasets, and the results showed superior performance compared to existing approaches. In addition, we experimentally validated the reliability and usefulness of the generated signal descriptions. Further experiments under conditions simulating real industrial environments — such as limited data availability and severe data imbalance — verified the practical applicability of the framework. By operating in on-premise environments, <span>BearGen</span> resolves data security concerns while alleviating data scarcity and imbalance. Furthermore, by providing natural language descriptions, it enhances interpretability and offers significant potential for decision support in real-world industrial applications.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104400"},"PeriodicalIF":9.9,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

RDF-based knowledge graph integration with deep learning for fault diagnosis 基于rdf的知识图与深度学习的故障诊断集成

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-04 DOI: 10.1016/j.aei.2026.104422

Maximilian-Peter Radtke , Marco F. Huber , Jürgen Bock

Combining system knowledge with deep learning for fault diagnosis in industrial applications offers the potential to reduce the dependency of deep learning algorithms on extensive labeled datasets. However, existing methods often rely on highly specialized, problem-specific knowledge or demand detailed physical insights into the system, which limits their generalizability. Additionally, inconsistencies in knowledge representation hinder the ability to compare and build upon prior approaches. In this work, we address these challenges by leveraging commonly available knowledge about the phase structure of systems and the hierarchical organization of condition spaces. This information is systematically represented using knowledge graphs (KGs) based on the Resource Description Framework (RDF). To integrate this knowledge into deep learning, we transform the input data and the corresponding labels based on the KGs, and employ a graph neural network (GNN) trained with a semantic loss function informed by the knowledge about the condition space. The proposed approach is evaluated on three diverse datasets with varying characteristics under the two scenarios of domain generalization and novel fault detection.

将系统知识与深度学习相结合，用于工业应用中的故障诊断，可以减少深度学习算法对大量标记数据集的依赖。然而，现有的方法通常依赖于高度专业化的、特定于问题的知识，或者需要对系统进行详细的物理洞察，这限制了它们的通用性。此外，知识表示的不一致性阻碍了比较和建立先前方法的能力。在这项工作中，我们通过利用关于系统相位结构和条件空间分层组织的通用知识来解决这些挑战。这些信息使用基于资源描述框架（RDF）的知识图（KGs）系统地表示。为了将这些知识整合到深度学习中，我们基于KGs对输入数据和相应的标签进行转换，并使用由关于条件空间的知识告知的语义损失函数训练的图神经网络（GNN）。在领域泛化和新故障检测两种场景下，对三种不同特征的数据集进行了评估。

{"title":"RDF-based knowledge graph integration with deep learning for fault diagnosis","authors":"Maximilian-Peter Radtke , Marco F. Huber , Jürgen Bock","doi":"10.1016/j.aei.2026.104422","DOIUrl":"10.1016/j.aei.2026.104422","url":null,"abstract":"<div><div>Combining system knowledge with deep learning for fault diagnosis in industrial applications offers the potential to reduce the dependency of deep learning algorithms on extensive labeled datasets. However, existing methods often rely on highly specialized, problem-specific knowledge or demand detailed physical insights into the system, which limits their generalizability. Additionally, inconsistencies in knowledge representation hinder the ability to compare and build upon prior approaches. In this work, we address these challenges by leveraging commonly available knowledge about the phase structure of systems and the hierarchical organization of condition spaces. This information is systematically represented using knowledge graphs (KGs) based on the Resource Description Framework (RDF). To integrate this knowledge into deep learning, we transform the input data and the corresponding labels based on the KGs, and employ a graph neural network (GNN) trained with a semantic loss function informed by the knowledge about the condition space. The proposed approach is evaluated on three diverse datasets with varying characteristics under the two scenarios of domain generalization and novel fault detection.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104422"},"PeriodicalIF":9.9,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Physics-informed edge-enhanced temporal graph convolutional network for multi-risk evolution prediction in deep excavation 基于物理信息的边缘增强时间图卷积网络在深基坑多风险演化预测中的应用

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics

Pub Date : 2026-02-03 DOI: 10.1016/j.aei.2026.104391

Jian Wei, Yue Pan, Jin-Jian Chen

Accurate prediction of excavation-induced multi-risk evolution is essential for intelligent construction and on-site safety management. This study proposes a Physics-Informed Edge-Enhanced Temporal Graph Convolutional Network (PI-ETGCN) that seamlessly integrates mechanistic priors with data-driven learning for spatiotemporal multi-risk forecasting in deep excavations. During graph modeling, multi-source construction information is encoded as node attributes to construct a Multi-Source Risk Graph (MRG) that links heterogeneous variables across space and time. Mechanical laws governing wall–soil–pipeline interactions are parameterized with learnable edge coefficients, enabling adaptive modeling of spatial–physical couplings under varying construction conditions. During graph learning, PI-ETGCN combines edge-enhanced graph convolution with temporal modules to capture coupled data–mechanics relationships and spatiotemporal dependencies within the MRG. A physics-informed regularization term is incorporated into the training objective to promote mechanically consistent risk-evolution patterns and improve robustness and generalization. Validated on a real-world Shanghai Rail Transit project, PI-ETGCN delivers superior predictive performance, reducing errors by over 20% relative to the second-best baseline. Ablation studies further confirm that the proposed physics–data fusion strategy maintains high accuracy under limited samples, noise, and outliers. Overall, PI-ETGCN provides interpretable, decision-ready risk information for real-time monitoring and reliability-aware on-site safety management.

准确预测开挖引发的多风险演化，对智能施工和现场安全管理至关重要。该研究提出了一种基于物理信息的边缘增强时态图卷积网络（PI-ETGCN），该网络将机械先验与数据驱动学习无缝集成，用于深度挖掘的时空多风险预测。在图建模过程中，将多源构建信息编码为节点属性，构建跨空间和时间连接异构变量的多源风险图（MRG）。控制墙-土-管道相互作用的力学规律用可学习的边缘系数参数化，使空间-物理耦合在不同施工条件下的自适应建模成为可能。在图学习过程中，PI-ETGCN将边缘增强的图卷积与时间模块相结合，以捕获MRG内的耦合数据力学关系和时空依赖关系。一个物理信息正则化项被纳入训练目标，以促进机械一致的风险演化模式，提高鲁棒性和泛化。经过上海轨道交通实际项目的验证，PI-ETGCN提供了卓越的预测性能，相对于第二好的基线减少了20%以上的误差。消融研究进一步证实了所提出的物理数据融合策略在有限的样本、噪声和异常值下保持了很高的精度。总体而言，PI-ETGCN为实时监控和可靠性感知现场安全管理提供了可解释的、决策就绪的风险信息。

{"title":"Physics-informed edge-enhanced temporal graph convolutional network for multi-risk evolution prediction in deep excavation","authors":"Jian Wei, Yue Pan, Jin-Jian Chen","doi":"10.1016/j.aei.2026.104391","DOIUrl":"10.1016/j.aei.2026.104391","url":null,"abstract":"<div><div>Accurate prediction of excavation-induced multi-risk evolution is essential for intelligent construction and on-site safety management. This study proposes a Physics-Informed Edge-Enhanced Temporal Graph Convolutional Network (PI-ETGCN) that seamlessly integrates mechanistic priors with data-driven learning for spatiotemporal multi-risk forecasting in deep excavations. During graph modeling, multi-source construction information is encoded as node attributes to construct a Multi-Source Risk Graph (MRG) that links heterogeneous variables across space and time. Mechanical laws governing wall–soil–pipeline interactions are parameterized with learnable edge coefficients, enabling adaptive modeling of spatial–physical couplings under varying construction conditions. During graph learning, PI-ETGCN combines edge-enhanced graph convolution with temporal modules to capture coupled data–mechanics relationships and spatiotemporal dependencies within the MRG. A physics-informed regularization term is incorporated into the training objective to promote mechanically consistent risk-evolution patterns and improve robustness and generalization. Validated on a real-world Shanghai Rail Transit project, PI-ETGCN delivers superior predictive performance, reducing errors by over 20% relative to the second-best baseline. Ablation studies further confirm that the proposed physics–data fusion strategy maintains high accuracy under limited samples, noise, and outliers. Overall, PI-ETGCN provides interpretable, decision-ready risk information for real-time monitoring and reliability-aware on-site safety management.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104391"},"PeriodicalIF":9.9,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0