首页 > 最新文献

Advanced Engineering Informatics最新文献

英文 中文
A novel robust mixture-of-experts model with causal priors for interpretable water quality diagnosis 一个新的鲁棒混合专家模型与因果先验的可解释水质诊断
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-30 DOI: 10.1016/j.aei.2025.104267
Menghao Wang , Yafeng Yang , Fawen Li , Lin Liu , Wenmei Cao
The degradation of water quality, compounded by industrial activity and climatic stressors, complicates effective environmental monitoring and decision-making. Although conventional data-driven models offer predictive utility, they frequently function as opaque systems that prioritize statistical correlation over physical causality, leaving them vulnerable to distributional shifts. To transcend these limitations, this study proposes the Causal Mixture-of-Experts framework, which embeds causal structure discovery directly into a modular deep learning architecture. By utilizing the Nonlinear NOTEARS algorithm to derive transparent causal priors from observational data, the framework strictly constrains the expert system. Specifically, each expert is required to model a distinct data-generating subsystem to enforce mechanistic interpretability. Concurrently, systemic robustness is bolstered by a Causal Expert Modulation Module that integrates parent–child dependencies with a Drop-Expert regularization strategy to dynamically compensate for potential module failures. Rigorous evaluations across transnational, multi-source datasets from China, the United States, Canada, and the United Kingdom, conducted under strict comparisons with various advanced baselines, yield a classification accuracy exceeding 0.96. Notably, the model retains superior efficacy with scores remaining above 0.93 even under expert ablation, outperforming baselines such as TabKANet and LightGBM. Beyond predictive precision, the framework disentangles heterogeneous pollution drivers by identifying distinct mechanisms such as eutrophication dominance in China and organic pollution in Canada. This capability effectively bridges the divide between algorithmic modeling and accountable environmental stewardship.
水质的退化,再加上工业活动和气候压力因素,使有效的环境监测和决策复杂化。尽管传统的数据驱动模型提供了预测效用,但它们往往是不透明的系统,优先考虑统计相关性而不是物理因果关系,使它们容易受到分布变化的影响。为了超越这些限制,本研究提出了因果混合专家框架,该框架将因果结构发现直接嵌入到模块化深度学习架构中。该框架利用非线性NOTEARS算法从观测数据中推导出透明的因果先验,对专家系统进行严格约束。具体来说,每个专家都需要对一个不同的数据生成子系统进行建模,以加强机制上的可解释性。同时,系统的鲁棒性由因果专家调制模块增强,该模块集成了亲子依赖关系和Drop-Expert正则化策略,以动态补偿潜在的模块故障。对来自中国、美国、加拿大和英国的跨国、多源数据集进行严格评估,并与各种先进基线进行严格比较,得出的分类精度超过0.96。值得注意的是,即使在专家消融的情况下,该模型仍保持了优越的疗效,得分仍高于0.93,优于TabKANet和LightGBM等基线。除了预测精度之外,该框架还通过识别不同的机制(如中国的富营养化主导和加拿大的有机污染)来理清异质性污染驱动因素。这种能力有效地弥合了算法建模和负责任的环境管理之间的鸿沟。
{"title":"A novel robust mixture-of-experts model with causal priors for interpretable water quality diagnosis","authors":"Menghao Wang ,&nbsp;Yafeng Yang ,&nbsp;Fawen Li ,&nbsp;Lin Liu ,&nbsp;Wenmei Cao","doi":"10.1016/j.aei.2025.104267","DOIUrl":"10.1016/j.aei.2025.104267","url":null,"abstract":"<div><div>The degradation of water quality, compounded by industrial activity and climatic stressors, complicates effective environmental monitoring and decision-making. Although conventional data-driven models offer predictive utility, they frequently function as opaque systems that prioritize statistical correlation over physical causality, leaving them vulnerable to distributional shifts. To transcend these limitations, this study proposes the Causal Mixture-of-Experts framework, which embeds causal structure discovery directly into a modular deep learning architecture. By utilizing the Nonlinear NOTEARS algorithm to derive transparent causal priors from observational data, the framework strictly constrains the expert system. Specifically, each expert is required to model a distinct data-generating subsystem to enforce mechanistic interpretability. Concurrently, systemic robustness is bolstered by a Causal Expert Modulation Module that integrates parent–child dependencies with a Drop-Expert regularization strategy to dynamically compensate for potential module failures. Rigorous evaluations across transnational, multi-source datasets from China, the United States, Canada, and the United Kingdom, conducted under strict comparisons with various advanced baselines, yield a classification accuracy exceeding 0.96. Notably, the model retains superior efficacy with scores remaining above 0.93 even under expert ablation, outperforming baselines such as TabKANet and LightGBM. Beyond predictive precision, the framework disentangles heterogeneous pollution drivers by identifying distinct mechanisms such as eutrophication dominance in China and organic pollution in Canada. This capability effectively bridges the divide between algorithmic modeling and accountable environmental stewardship.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104267"},"PeriodicalIF":9.9,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ball tree structure-informed phase space warping: a robust algorithm for dynamic degradation tracking under variable speed conditions 基于球树结构的相空间翘曲:变速条件下动态退化跟踪的鲁棒算法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-30 DOI: 10.1016/j.aei.2025.104288
Rui Yuan , Hengyu Liu , Yong Lv , Yuejian Chen , Xingkai Yang , Hewenxuan Li , David Chelidze
With the advancement of predictive maintenance strategies, the accuracy of degradation tracking in mechanical systems has become a growing concern. This paper proposes a novel ball tree structure-informed phase space warping (BTPSW) algorithm, which couples high-dimensional nonlinear dynamics with efficient geometric search strategies to robustly track bearing degradation. To tackle the challenges of high-dimensional data and uneven distribution of points in the reconstructed phase space (PS), a physics-informed dynamic model is constructed to simulate outer race crack evolution under speed fluctuations. The resulting vibration signals are then reconstructed into high-dimensional PS, where trajectory curvature serves as a degradation indicator. The BTPSW algorithm reduces overlap in high-dimensional spaces, improving data search efficiency. Furthermore, considering the fluctuations in the optimal reconstruction parameters, the BTPSW algorithm demonstrates enhanced data adaptability, mitigating the accuracy loss caused by parameter fluctuations. By constructing a simulation model of rolling bearing degradation to simulate the crack propagation in the outer race, the paper validates the application of the BTPSW algorithm in tracking crack degradation. Both simulation and accelerated degradation experiments confirm that BTPSW achieves high tracking accuracy, strong parameter robustness, and superior adaptability under fluctuating operating conditions, making it a powerful tool for predictive maintenance and long-term reliability assessment.
随着预测性维护策略的发展,机械系统退化跟踪的准确性日益受到关注。提出了一种基于球树结构的相空间翘曲(BTPSW)算法,该算法将高维非线性动力学与高效几何搜索策略相结合,实现了对轴承退化的鲁棒跟踪。针对重构相空间(PS)中数据高维和点分布不均匀的问题,建立了基于物理信息的外圈裂纹动态模型,模拟了速度波动下的外圈裂纹演化过程。然后将得到的振动信号重构为高维PS,其中轨迹曲率作为退化指标。BTPSW算法减少了高维空间的重叠,提高了数据搜索效率。此外,考虑到最优重构参数的波动,BTPSW算法表现出更强的数据适应性,减轻了参数波动带来的精度损失。通过构建滚动轴承退化仿真模型,模拟外圈裂纹扩展过程,验证了BTPSW算法在裂纹退化跟踪中的应用。仿真和加速退化实验均证实,BTPSW具有较高的跟踪精度、较强的参数鲁棒性和对波动工况的优越适应性,是预测性维护和长期可靠性评估的有力工具。
{"title":"Ball tree structure-informed phase space warping: a robust algorithm for dynamic degradation tracking under variable speed conditions","authors":"Rui Yuan ,&nbsp;Hengyu Liu ,&nbsp;Yong Lv ,&nbsp;Yuejian Chen ,&nbsp;Xingkai Yang ,&nbsp;Hewenxuan Li ,&nbsp;David Chelidze","doi":"10.1016/j.aei.2025.104288","DOIUrl":"10.1016/j.aei.2025.104288","url":null,"abstract":"<div><div>With the advancement of predictive maintenance strategies, the accuracy of degradation tracking in mechanical systems has become a growing concern. This paper proposes a novel ball tree structure-informed phase space warping (BTPSW) algorithm, which couples high-dimensional nonlinear dynamics with efficient geometric search strategies to robustly track bearing degradation. To tackle the challenges of high-dimensional data and uneven distribution of points in the reconstructed phase space (PS), a physics-informed dynamic model is constructed to simulate outer race crack evolution under speed fluctuations. The resulting vibration signals are then reconstructed into high-dimensional PS, where trajectory curvature serves as a degradation indicator. The BTPSW algorithm reduces overlap in high-dimensional spaces, improving data search efficiency. Furthermore, considering the fluctuations in the optimal reconstruction parameters, the BTPSW algorithm demonstrates enhanced data adaptability, mitigating the accuracy loss caused by parameter fluctuations. By constructing a simulation model of rolling bearing degradation to simulate the crack propagation in the outer race, the paper validates the application of the BTPSW algorithm in tracking crack degradation. Both simulation and accelerated degradation experiments confirm that BTPSW achieves high tracking accuracy, strong parameter robustness, and superior adaptability under fluctuating operating conditions, making it a powerful tool for predictive maintenance and long-term reliability assessment.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104288"},"PeriodicalIF":9.9,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Behavioral modelling of roadway construction workers: Improving deep learning-based trajectory prediction with contextual information in traffic work zones 道路施工工人的行为建模:利用交通工作区的上下文信息改进基于深度学习的轨迹预测
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-30 DOI: 10.1016/j.aei.2025.104277
Daniel Bin Lu, Semiha Ergan
Construction workers face rising risks of fatal injuries from vehicle crashes in roadway work zones. While transportation safety research has focused on motorists’ behavior, the behavior of roadway workers remains underexplored. Existing trajectory prediction models, developed for pedestrians or generic construction workers, typically do not account for the unique roadway work zone activities and traffic interactions faced by roadway workers. This study leverages a virtual reality (VR) and traffic simulation-based platform to capture detailed context data, such as roadwork activities and nearby vehicles in the worker’s field of view. The study’s main objective is to evaluate whether including this context improves trajectory prediction accuracy of deep learning-based models, particularly gated recurrent units (GRU) and transformer architectures. Results indicate that transformers can improve their trajectory prediction accuracy (i.e., lower miss-rate) when accounting for both the worker’s behavioral and traffic context data compared to a transformer trained on trajectory position data alone. These improvements in accuracy are observed across different roadwork construction tasks (e.g., installing sensor cable, distributing grout) and different proximities to traffic vehicles. These findings contribute to the development of more precise roadway worker trajectory models for use in autonomous vehicles and safety systems.
建筑工人在道路工作区域因车辆碰撞而受到致命伤害的风险不断上升。虽然交通安全研究的重点是驾驶者的行为,但道路工人的行为仍未得到充分研究。现有的轨迹预测模型是为行人或普通建筑工人开发的,通常不能考虑道路工人所面临的独特的道路工作区活动和交通相互作用。这项研究利用虚拟现实(VR)和基于交通模拟的平台来捕获详细的环境数据,例如道路施工活动和工人视野中的附近车辆。该研究的主要目的是评估包含此上下文是否可以提高基于深度学习的模型的轨迹预测精度,特别是门控循环单元(GRU)和变压器架构。结果表明,与仅使用轨迹位置数据训练的变压器相比,在考虑工作人员的行为和交通环境数据时,变压器可以提高其轨迹预测精度(即更低的漏报率)。在不同的道路施工任务(例如,安装传感器电缆,分配灌浆)和不同距离的交通车辆中观察到这些精度的提高。这些发现有助于开发更精确的道路工人轨迹模型,用于自动驾驶汽车和安全系统。
{"title":"Behavioral modelling of roadway construction workers: Improving deep learning-based trajectory prediction with contextual information in traffic work zones","authors":"Daniel Bin Lu,&nbsp;Semiha Ergan","doi":"10.1016/j.aei.2025.104277","DOIUrl":"10.1016/j.aei.2025.104277","url":null,"abstract":"<div><div>Construction workers face rising risks of fatal injuries from vehicle crashes in roadway work zones. While transportation safety research has focused on motorists’ behavior, the behavior of roadway workers remains underexplored. Existing trajectory prediction models, developed for pedestrians or generic construction workers, typically do not account for the unique roadway work zone activities and traffic interactions faced by roadway workers. This study leverages a virtual reality (VR) and traffic simulation-based platform to capture detailed context data, such as roadwork activities and nearby vehicles in the worker’s field of view. The study’s main objective is to evaluate whether including this context improves trajectory prediction accuracy of deep learning-based models, particularly gated recurrent units (GRU) and transformer architectures. Results indicate that transformers can improve their trajectory prediction accuracy (i.e., lower miss-rate) when accounting for both the worker’s behavioral and traffic context data compared to a transformer trained on trajectory position data alone. These improvements in accuracy are observed across different roadwork construction tasks (e.g., installing sensor cable, distributing grout) and different proximities to traffic vehicles. These findings contribute to the development of more precise roadway worker trajectory models for use in autonomous vehicles and safety systems.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104277"},"PeriodicalIF":9.9,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Road surface classification with texture-feature-embedded ResNet for the active suspension systems in complex environments 基于纹理特征嵌入式ResNet的复杂环境下主动悬架路面分类
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-29 DOI: 10.1016/j.aei.2025.104280
Zihe Pang , Zhiyao Zhang , Ke Jin , Pengzhiyuan Chen , Enrico Zio , Peng Guo
Active suspension systems can improve the operational stability of mobile precision equipment in the field while reducing equipment wear and maintenance costs. However, existing methods still exhibit limitations in generalization ability and forward-looking perception performance under real-world complex environments. Research on road surface classification in complex environments can provide new solutions for enhancing the forward-looking perception capability of active suspension systems. Firstly, this paper constructs a real-world multi-class road surface dataset named MTRSD, which includes image data of structured and unstructured road surfaces under various illumination conditions. On this basis, we propose the TF-ResNet road surface classification model. Its core components include an ambient illuminance compensation strategy and a texture feature embedding module. The illuminance compensation strategy adaptively adjusts image brightness to enhance the visibility of road surface features, thereby improving classification accuracy. The texture feature embedding module guides the model to focus on road texture patterns while suppressing background interference, thus increasing model stability. Experimental results show that the proposed method achieves an accuracy of 87.73% with a standard deviation of ±1.43% in the road surface classification task, outperforming existing mainstream methods.
主动悬架系统可以提高移动精密设备在野外的运行稳定性,同时降低设备的磨损和维护成本。然而,在现实复杂环境下,现有的方法在泛化能力和前瞻性感知性能方面仍然存在局限性。复杂环境下路面分类的研究可以为提高主动悬架系统的前视感知能力提供新的解决方案。首先,本文构建了一个真实的多类别路面数据集MTRSD,该数据集包含了不同光照条件下的结构化和非结构化路面图像数据。在此基础上,提出TF-ResNet路面分类模型。其核心组件包括环境照度补偿策略和纹理特征嵌入模块。照度补偿策略自适应调整图像亮度,增强路面特征的可见性,从而提高分类精度。纹理特征嵌入模块引导模型专注于道路纹理模式,同时抑制背景干扰,提高模型稳定性。实验结果表明,该方法在路面分类任务中准确率达到87.73%,标准差为±1.43%,优于现有主流方法。
{"title":"Road surface classification with texture-feature-embedded ResNet for the active suspension systems in complex environments","authors":"Zihe Pang ,&nbsp;Zhiyao Zhang ,&nbsp;Ke Jin ,&nbsp;Pengzhiyuan Chen ,&nbsp;Enrico Zio ,&nbsp;Peng Guo","doi":"10.1016/j.aei.2025.104280","DOIUrl":"10.1016/j.aei.2025.104280","url":null,"abstract":"<div><div>Active suspension systems can improve the operational stability of mobile precision equipment in the field while reducing equipment wear and maintenance costs. However, existing methods still exhibit limitations in generalization ability and forward-looking perception performance under real-world complex environments. Research on road surface classification in complex environments can provide new solutions for enhancing the forward-looking perception capability of active suspension systems. Firstly, this paper constructs a real-world multi-class road surface dataset named MTRSD, which includes image data of structured and unstructured road surfaces under various illumination conditions. On this basis, we propose the TF-ResNet road surface classification model. Its core components include an ambient illuminance compensation strategy and a texture feature embedding module. The illuminance compensation strategy adaptively adjusts image brightness to enhance the visibility of road surface features, thereby improving classification accuracy. The texture feature embedding module guides the model to focus on road texture patterns while suppressing background interference, thus increasing model stability. Experimental results show that the proposed method achieves an accuracy of 87.73% with a standard deviation of <span><math><mrow><mo>±</mo><mn>1</mn><mo>.</mo><mn>43</mn><mtext>%</mtext></mrow></math></span> in the road surface classification task, outperforming existing mainstream methods.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104280"},"PeriodicalIF":9.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145884917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unknown intervention-aware neural Granger causal discovery via Kullback–Leibler divergence constraint 基于Kullback-Leibler散度约束的未知干预感知神经Granger因果发现
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-29 DOI: 10.1016/j.aei.2025.104285
Chenze Wang , Tianyi Yin , Han Wang , Xiaohan Zhang , Gaowei Xu , Jingwei Wang , Min Liu
The widely studied problem of inferring causal structures in time series data, particularly through Granger causality (GC), has gained prominence in various applications due to its compatibility with deep neural network-based predictive modeling. However, most existing approaches presuppose a single causal structure within multivariate time series and neglect the impact of unknown intervention targets, leading to limitations in complex real-world scenarios. Here, we focus on identifying causal structures from time series with unknown interventions and propose a neural network-based GC method. First, we construct global and interventional causal neural networks based on the causal probability matrix (CPM), enabling dual-scale GC discovery: coarse-grained across unknown intervention environments and fine-grained within each. Then, a novel interactive training framework using a Kullback–Leibler divergence constraint is proposed to provide the perception of unknown interventions and exchange of causal information. The proposed method demonstrates superior performance compared to various baselines on both synthetic and real-world interventional time series datasets.
在时间序列数据中推断因果结构的问题,特别是通过格兰杰因果关系(GC),由于其与基于深度神经网络的预测建模的兼容性,在各种应用中得到了突出的研究。然而,大多数现有方法在多元时间序列中预设单一因果结构,忽略了未知干预目标的影响,导致在复杂的现实场景中存在局限性。在这里,我们专注于从未知干预的时间序列中识别因果结构,并提出了一种基于神经网络的GC方法。首先,我们基于因果概率矩阵(CPM)构建了全局和介入因果神经网络,实现了双尺度GC发现:在未知干预环境中进行粗粒度发现,在每个未知干预环境中进行细粒度发现。然后,提出了一种基于Kullback-Leibler散度约束的交互式训练框架,以提供未知干预的感知和因果信息的交换。在合成和实际干预时间序列数据集上,与各种基线相比,所提出的方法表现出优越的性能。
{"title":"Unknown intervention-aware neural Granger causal discovery via Kullback–Leibler divergence constraint","authors":"Chenze Wang ,&nbsp;Tianyi Yin ,&nbsp;Han Wang ,&nbsp;Xiaohan Zhang ,&nbsp;Gaowei Xu ,&nbsp;Jingwei Wang ,&nbsp;Min Liu","doi":"10.1016/j.aei.2025.104285","DOIUrl":"10.1016/j.aei.2025.104285","url":null,"abstract":"<div><div>The widely studied problem of inferring causal structures in time series data, particularly through Granger causality (GC), has gained prominence in various applications due to its compatibility with deep neural network-based predictive modeling. However, most existing approaches presuppose a single causal structure within multivariate time series and neglect the impact of unknown intervention targets, leading to limitations in complex real-world scenarios. Here, we focus on identifying causal structures from time series with unknown interventions and propose a neural network-based GC method. First, we construct global and interventional causal neural networks based on the causal probability matrix (CPM), enabling dual-scale GC discovery: coarse-grained across unknown intervention environments and fine-grained within each. Then, a novel interactive training framework using a Kullback–Leibler divergence constraint is proposed to provide the perception of unknown interventions and exchange of causal information. The proposed method demonstrates superior performance compared to various baselines on both synthetic and real-world interventional time series datasets.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104285"},"PeriodicalIF":9.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145884918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent forecasting of tunnel deformation in underground coal mines using a dynamic swarm-tuned adaptive neuro-fuzzy inference system for knowledge-driven ground control 基于动态群调谐自适应神经模糊推理系统的煤矿井下巷道变形智能预测
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-29 DOI: 10.1016/j.aei.2025.104287
Satar Mahdevari
Timely and accurate forecasting of tunnel deformation is vital for ensuring safety and operational continuity in underground coal mining and other geotechnical infrastructures. However, conventional empirical design practices often lack the formalization, precision, and adaptability required for modern decision-support systems. This study presents an informatics-driven framework, the Dynamic Swarm-Tuned Adaptive Neuro-Fuzzy Inference System (DST-ANFIS), which formalizes empirical geomechanical knowledge by embedding expert-derived fuzzy rules into an adaptive hybrid computational intelligence model. The framework adopts a dual-phase metaheuristic optimization strategy—Improved Lion Optimization Algorithm (ILOA) for global exploration and Dynamic Particle Swarm Optimization (DPSO) for local refinement—to achieve a robust and interpretable representation of complex rock mass behavior. A case study using geomechanical data from the Tabas coal mine demonstrates that the proposed DST-ANFIS model consistently outperforms both conventional ANFIS-based variants and established machine learning benchmarks—including GA-ANFIS, PSO-ANFIS, XGBoost, SVR, and RF. On the test set, DST-ANFIS achieved the highest predictive accuracy, with an R2 of 0.952 and an RMSE of 12.530. It notably surpassed the best-performing benchmark, XGBoost, improving R2 by 2.8% (0.952 vs. 0.926) and reducing RMSE by 19.9% (12.530 vs. 15.640). The model also outperformed PSO-ANFIS, yielding a 2.7% increase in R2 and a 24.8% decrease in RMSE, further confirming its robustness and precision across comparative metrics. Beyond predictive performance, the framework generates an interpretable fuzzy rule base that clarifies causal relationships between geomechanical parameters and deformation responses, transforming raw monitoring data into formalized, actionable engineering knowledge for intelligent and proactive ground control. This research advances engineering informatics by providing a scalable methodology for embedding empirical expertise into adaptive computational intelligence, thereby contributing to knowledge formalization and intelligent decision-support in engineering. While demonstrated in mining, the methodology generalizes to adaptive infrastructure management and complex geotechnical systems, underscoring the broader potential of dual-phase hybrid neuro-fuzzy models for engineering informatics.
及时、准确地预测巷道变形对煤矿地下开采和其他岩土基础设施的安全和运行连续性至关重要。然而,传统的经验设计实践往往缺乏现代决策支持系统所需的形式化、精确性和适应性。本研究提出了一个信息学驱动的框架,即动态群体调谐自适应神经模糊推理系统(DST-ANFIS),该系统通过将专家衍生的模糊规则嵌入到自适应混合计算智能模型中来形式化经验地质力学知识。该框架采用双阶段元启发式优化策略——改进的狮子优化算法(ILOA)进行全局勘探,动态粒子群优化(DPSO)进行局部细化——以实现复杂岩体行为的鲁棒性和可解释性表示。使用Tabas煤矿地质力学数据的案例研究表明,所提出的DST-ANFIS模型始终优于传统的基于anfis的变量和已建立的机器学习基准,包括GA-ANFIS、PSO-ANFIS、XGBoost、SVR和RF。在测试集上,DST-ANFIS的预测准确率最高,R2为0.952,RMSE为12.530。它明显超过了表现最好的基准XGBoost, R2提高了2.8%(0.952对0.926),RMSE降低了19.9%(12.530对15.640)。该模型也优于PSO-ANFIS, R2增加2.7%,RMSE降低24.8%,进一步证实了其在比较指标中的稳健性和准确性。除了预测性能之外,该框架还生成了一个可解释的模糊规则库,澄清了地质力学参数和变形响应之间的因果关系,将原始监测数据转换为形式化的、可操作的工程知识,用于智能和主动的地面控制。本研究通过提供可扩展的方法将经验专业知识嵌入到自适应计算智能中,从而促进了工程中的知识形式化和智能决策支持,从而推动了工程信息学的发展。虽然在采矿中得到了证明,但该方法可以推广到自适应基础设施管理和复杂的岩土工程系统,强调了工程信息学的双相混合神经模糊模型的更广泛潜力。
{"title":"Intelligent forecasting of tunnel deformation in underground coal mines using a dynamic swarm-tuned adaptive neuro-fuzzy inference system for knowledge-driven ground control","authors":"Satar Mahdevari","doi":"10.1016/j.aei.2025.104287","DOIUrl":"10.1016/j.aei.2025.104287","url":null,"abstract":"<div><div>Timely and accurate forecasting of tunnel deformation is vital for ensuring safety and operational continuity in underground coal mining and other geotechnical infrastructures. However, conventional empirical design practices often lack the formalization, precision, and adaptability required for modern decision-support systems. This study presents an informatics-driven framework, the Dynamic Swarm-Tuned Adaptive Neuro-Fuzzy Inference System (DST-ANFIS), which formalizes empirical geomechanical knowledge by embedding expert-derived fuzzy rules into an adaptive hybrid computational intelligence model. The framework adopts a dual-phase metaheuristic optimization strategy—Improved Lion Optimization Algorithm (ILOA) for global exploration and Dynamic Particle Swarm Optimization (DPSO) for local refinement—to achieve a robust and interpretable representation of complex rock mass behavior. A case study using geomechanical data from the Tabas coal mine demonstrates that the proposed DST-ANFIS model consistently outperforms both conventional ANFIS-based variants and established machine learning benchmarks—including GA-ANFIS, PSO-ANFIS, XGBoost, SVR, and RF. On the test set, DST-ANFIS achieved the highest predictive accuracy, with an R<sup>2</sup> of 0.952 and an RMSE of 12.530. It notably surpassed the best-performing benchmark, XGBoost, improving R<sup>2</sup> by 2.8% (0.952 vs. 0.926) and reducing RMSE by 19.9% (12.530 vs. 15.640). The model also outperformed PSO-ANFIS, yielding a 2.7% increase in R<sup>2</sup> and a 24.8% decrease in RMSE, further confirming its robustness and precision across comparative metrics. Beyond predictive performance, the framework generates an interpretable fuzzy rule base that clarifies causal relationships between geomechanical parameters and deformation responses, transforming raw monitoring data into formalized, actionable engineering knowledge for intelligent and proactive ground control. This research advances engineering informatics by providing a scalable methodology for embedding empirical expertise into adaptive computational intelligence, thereby contributing to knowledge formalization and intelligent decision-support in engineering. While demonstrated in mining, the methodology generalizes to adaptive infrastructure management and complex geotechnical systems, underscoring the broader potential of dual-phase hybrid neuro-fuzzy models for engineering informatics.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104287"},"PeriodicalIF":9.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145884916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Iteratively modified variational mode extraction (IMVME): A noise-robust transient feature nonlinear extraction approach for aero-engine fault diagnosis 迭代改进变分模提取(IMVME):一种用于航空发动机故障诊断的噪声鲁棒瞬态特征非线性提取方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-29 DOI: 10.1016/j.aei.2025.104284
Duxi Shang , Rui Yuan , Yong Lv , Hongan Wu , Hengyu Liu , Zhuyun Chen
Due to the complex structure and harsh operating conditions of aero-engine, the transient feature caused by bearing damage can be easily masked by various interference noises, which brings great challenges to transient feature extraction and aero-engine fault diagnosis. Although optimal bandpass filtering methods are widely used for transient feature extraction, it cannot effectively suppress in-band noise, which diminishes the sensitivity of feature indicators and may cause extraction failure. To address above challenges, this paper proposes a noise-robust transient feature nonlinear extraction approach called iteratively modified variational mode extraction (IMVME). Initially, a noise-robust filter, the enhanced variational Wiener filter (EVWF) is proposed. EVWF performs narrowband demodulation while suppressing in-band noise through amplitude reconstruction, thereby enhancing local transient feature and facilitating the weak transient extraction. Subsequently, the modulation spectral density function (MSDF) is introduced as a feature indicator to distinguish fault transient from interference noise and to guide the EVWF in selecting the optimal magnitude order. Finally, IMVME adopts an adaptive filter parameter iterative optimization framework to solve the optimal EVWF by maximizing MSDF, thereby enabling more accurate fault frequency band localization, robust transient feature extraction under complex noise conditions, and greater adaptability and flexibility in filter design. Through validation on multiple scenarios, including simulation signal and aero-engine fault signal, the superiority of IMVME is demonstrated through its ability to adaptively and accurately extract transient feature while maintaining robustness to noise and interference.
由于航空发动机结构复杂、工作条件恶劣,轴承损伤引起的瞬态特征很容易被各种干扰噪声掩盖,这给航空发动机瞬态特征提取和故障诊断带来了很大的挑战。虽然最优带通滤波方法被广泛用于瞬态特征提取,但它不能有效抑制带内噪声,降低了特征指标的灵敏度,可能导致提取失败。针对上述问题,本文提出了一种抗噪声暂态特征非线性提取方法——迭代改进变分模提取(IMVME)。首先,提出了一种抗噪声的增强变分维纳滤波器(EVWF)。EVWF进行窄带解调,同时通过幅度重构抑制带内噪声,增强局部瞬态特征,便于弱瞬态提取。随后,引入调制谱密度函数(MSDF)作为特征指标,用于区分故障暂态和干扰噪声,并指导EVWF选择最优数量级。最后,IMVME采用自适应滤波器参数迭代优化框架,通过最大化MSDF来求解最优EVWF,从而实现更精确的故障频带定位和复杂噪声条件下的鲁棒瞬态特征提取,增强了滤波器设计的适应性和灵活性。通过对仿真信号和航空发动机故障信号等多种场景的验证,IMVME能够自适应准确提取瞬态特征,同时保持对噪声和干扰的鲁棒性,证明了该方法的优越性。
{"title":"Iteratively modified variational mode extraction (IMVME): A noise-robust transient feature nonlinear extraction approach for aero-engine fault diagnosis","authors":"Duxi Shang ,&nbsp;Rui Yuan ,&nbsp;Yong Lv ,&nbsp;Hongan Wu ,&nbsp;Hengyu Liu ,&nbsp;Zhuyun Chen","doi":"10.1016/j.aei.2025.104284","DOIUrl":"10.1016/j.aei.2025.104284","url":null,"abstract":"<div><div>Due to the complex structure and harsh operating conditions of aero-engine, the transient feature caused by bearing damage can be easily masked by various interference noises, which brings great challenges to transient feature extraction and aero-engine fault diagnosis. Although optimal bandpass filtering methods are widely used for transient feature extraction, it cannot effectively suppress in-band noise, which diminishes the sensitivity of feature indicators and may cause extraction failure. To address above challenges, this paper proposes a noise-robust transient feature nonlinear extraction approach called iteratively modified variational mode extraction (IMVME). Initially, a noise-robust filter, the enhanced variational Wiener filter (EVWF) is proposed. EVWF performs narrowband demodulation while suppressing in-band noise through amplitude reconstruction, thereby enhancing local transient feature and facilitating the weak transient extraction. Subsequently, the modulation spectral density function (MSDF) is introduced as a feature indicator to distinguish fault transient from interference noise and to guide the EVWF in selecting the optimal magnitude order. Finally, IMVME adopts an adaptive filter parameter iterative optimization framework to solve the optimal EVWF by maximizing MSDF, thereby enabling more accurate fault frequency band localization, robust transient feature extraction under complex noise conditions, and greater adaptability and flexibility in filter design. Through validation on multiple scenarios, including simulation signal and aero-engine fault signal, the superiority of IMVME is demonstrated through its ability to adaptively and accurately extract transient feature while maintaining robustness to noise and interference.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104284"},"PeriodicalIF":9.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145884959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Engineformer: A digital twin model for predicting aero-engine performance and degradation 用于预测航空发动机性能和退化的数字孪生模型
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-29 DOI: 10.1016/j.aei.2025.104273
Dasheng Xiao, Aiyang Yu, Shuo Song, Hong Xiao, Zhanxue Wang
Health management plays a critical role in ensuring the safety and reliability of engine operation. This study proposed Engineformer, a novel digital twin modelling architecture for aero-engines that integrates the transformer framework with prior physical knowledge. The compression components were modelled using the transformer encoder, whereas the expansion components were modelled by the decoder. A cross-attention mechanism was employed to extract the interactive features between compression and expansion components. To better adapt to engine data, the feedforward layers in both encoder and decoder used one-dimensional convolutional layers. The proposed Engineformer model was evaluated through two case studies: exhaust gas temperature (EGT) and remaining useful life (RUL) prediction. In EGT prediction, the model was tested on flight datasets from four civil high-bypass-ratio engines and achieved average mean absolute relative errors of 1.88%, 1.78%, 0.48%, and 0.49% across 10 training rounds. In RUL prediction, based on the DS02 subset of the N-CMAPSS dataset, Engineformer achieved a minimum root mean square error of 4.142 over 10 training rounds. This reduced further to 3.708 after optimization. The results demonstrated that Engineformer achieved state-of-the-art performance in both tasks. This verifies its reliability for predicting aero-engine performance and degradation.
健康管理是保证发动机安全可靠运行的关键。本研究提出了一种新的航空发动机数字孪生建模架构Engineformer,它将变压器框架与先前的物理知识集成在一起。压缩组件使用变压器编码器建模,而扩展组件由解码器建模。采用交叉注意机制提取压缩组件和膨胀组件之间的交互特征。为了更好地适应引擎数据,编码器和解码器的前馈层都采用了一维卷积层。通过废气温度(EGT)预测和剩余使用寿命(RUL)预测两个案例对所提出的Engineformer模型进行了评估。在EGT预测中,该模型在4台民用大涵道比发动机的飞行数据集上进行了测试,10轮训练的平均绝对相对误差分别为1.88%、1.78%、0.48%和0.49%。在RUL预测中,基于N-CMAPSS数据集的DS02子集,Engineformer在10轮训练中实现了4.142的最小均方根误差。优化后进一步降低到3.708。结果表明,Engineformer在这两项任务中都取得了最先进的性能。验证了该方法预测航空发动机性能和退化的可靠性。
{"title":"Engineformer: A digital twin model for predicting aero-engine performance and degradation","authors":"Dasheng Xiao,&nbsp;Aiyang Yu,&nbsp;Shuo Song,&nbsp;Hong Xiao,&nbsp;Zhanxue Wang","doi":"10.1016/j.aei.2025.104273","DOIUrl":"10.1016/j.aei.2025.104273","url":null,"abstract":"<div><div>Health management plays a critical role in ensuring the safety and reliability of engine operation. This study proposed Engineformer, a novel digital twin modelling architecture for aero-engines that integrates the transformer framework with prior physical knowledge. The compression components were modelled using the transformer encoder, whereas the expansion components were modelled by the decoder. A cross-attention mechanism was employed to extract the interactive features between compression and expansion components. To better adapt to engine data, the feedforward layers in both encoder and decoder used one-dimensional convolutional layers. The proposed Engineformer model was evaluated through two case studies: exhaust gas temperature (EGT) and remaining useful life (RUL) prediction. In EGT prediction, the model was tested on flight datasets from four civil high-bypass-ratio engines and achieved average mean absolute relative errors of 1.88%, 1.78%, 0.48%, and 0.49% across 10 training rounds. In RUL prediction, based on the DS02 subset of the N-CMAPSS dataset, Engineformer achieved a minimum root mean square error of 4.142 over 10 training rounds. This reduced further to 3.708 after optimization. The results demonstrated that Engineformer achieved state-of-the-art performance in both tasks. This verifies its reliability for predicting aero-engine performance and degradation.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104273"},"PeriodicalIF":9.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145884958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-agent reinforcement learning method for joint optimization of block assignment and yard crane redeployment at river-sea intermodal container terminal 基于多智能体强化学习的江海联运集装箱码头分段分配与堆场起重机调配联合优化
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-27 DOI: 10.1016/j.aei.2025.104271
Huakun Liu , Wenyuan Wang , Yun Peng , Shuzheng Yang , Hongbin Tian , Qiang Qi
The continuous growth of global maritime trade and dynamic operational characteristics of river-sea intermodal container terminals create an urgent need to enhance transshipment efficiency. Block assignment and yard crane (YC) redeployment (BA-YCR) are two tightly coupled scheduling processes that significantly impact overall yard operational efficiency. In response to minute-level workload fluctuations, effectively optimizing the BA-YCR problem in real time remains challenging. To this end, this study proposes a multi-agent reinforcement learning (MARL)-based approach for real-time optimization of the BA-YCR problem. The BA-YCR is formulated as a Markov Decision Process model, with the objective of minimizing YC redeployments, operational delays, and transport time. A hybrid reward mechanism is designed to balance exploration and coordination between agents. A two-stage multi-agent decision framework is developed, in which the coupling scheduling policies are trained using the Proximal Policy Optimization algorithm. Numerical experiments demonstrate that, the proposed MARL-based approach consistently outperforms benchmark methods. The well-trained scheduling policies achieve improvements of 0.2% to 81.3% in solution quality, while maintaining a computation time of less than 5 seconds, even in large-scale scenarios. Furthermore, sensitivity analyses based on a real-world container terminal further validate the practical applicability and generalization of the proposed approach. The results not only support terminal operators in developing reliable real-time BA-YCR strategies, but also offer practical insights for real-time scheduling optimization using MARL-based method in broader engineering applications.
全球海上贸易的持续增长和河海多式联运集装箱码头的动态运作特点,迫切需要提高转运效率。区块分配和堆场起重机(YC)重新部署(BA-YCR)是两个紧密耦合的调度过程,对整个堆场的运营效率产生重大影响。为了应对分钟级的工作负载波动,实时有效地优化BA-YCR问题仍然具有挑战性。为此,本研究提出了一种基于多智能体强化学习(MARL)的方法来实时优化BA-YCR问题。BA-YCR是一个马尔可夫决策过程模型,其目标是最大限度地减少YC的重新部署、操作延迟和传输时间。设计了一种混合奖励机制来平衡智能体之间的探索和协调。提出了一种两阶段多智能体决策框架,其中使用近端策略优化算法训练耦合调度策略。数值实验表明,本文提出的基于marl的方法始终优于基准方法。训练有素的调度策略可以将解决方案质量提高0.2%到81.3%,同时即使在大规模场景下,计算时间也保持在5秒以下。此外,基于实际集装箱码头的敏感性分析进一步验证了该方法的实用性和泛化性。研究结果不仅支持码头运营商制定可靠的实时BA-YCR策略,还为在更广泛的工程应用中使用基于marl的方法进行实时调度优化提供了实际见解。
{"title":"Multi-agent reinforcement learning method for joint optimization of block assignment and yard crane redeployment at river-sea intermodal container terminal","authors":"Huakun Liu ,&nbsp;Wenyuan Wang ,&nbsp;Yun Peng ,&nbsp;Shuzheng Yang ,&nbsp;Hongbin Tian ,&nbsp;Qiang Qi","doi":"10.1016/j.aei.2025.104271","DOIUrl":"10.1016/j.aei.2025.104271","url":null,"abstract":"<div><div>The continuous growth of global maritime trade and dynamic operational characteristics of river-sea intermodal container terminals create an urgent need to enhance transshipment efficiency. Block assignment and yard crane (YC) redeployment (BA-YCR) are two tightly coupled scheduling processes that significantly impact overall yard operational efficiency. In response to minute-level workload fluctuations, effectively optimizing the BA-YCR problem in real time remains challenging. To this end, this study proposes a multi-agent reinforcement learning (MARL)-based approach for real-time optimization of the BA-YCR problem. The BA-YCR is formulated as a Markov Decision Process model, with the objective of minimizing YC redeployments, operational delays, and transport time. A hybrid reward mechanism is designed to balance exploration and coordination between agents. A two-stage multi-agent decision framework is developed, in which the coupling scheduling policies are trained using the Proximal Policy Optimization algorithm. Numerical experiments demonstrate that, the proposed MARL-based approach consistently outperforms benchmark methods. The well-trained scheduling policies achieve improvements of 0.2% to 81.3% in solution quality, while maintaining a computation time of less than 5 seconds, even in large-scale scenarios. Furthermore, sensitivity analyses based on a real-world container terminal further validate the practical applicability and generalization of the proposed approach. The results not only support terminal operators in developing reliable real-time BA-YCR strategies, but also offer practical insights for real-time scheduling optimization using MARL-based method in broader engineering applications.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104271"},"PeriodicalIF":9.9,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards large language model for cognitive industrial mixed reality: A survey 面向认知工业混合现实的大语言模型研究
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-27 DOI: 10.1016/j.aei.2025.104276
Wei Fang , Yingfen Xu , Tienong Zhang , Yiwei Wang , Lianyu Zheng
Since Large Language Model (LLM) has attracted extensive attention in recent years, the integration of LLM and Mixed Reality (MR) has revolutionized the paradigm of human-centric intelligent manufacturing activities with natural interaction and intuitive instruction. However, there has been no systematic discussion from a holistic perspective on the integration of LLM and MR in the manufacturing sector under the Industry 5.0 framework. With the release of ChatGPT at the end of 2022, LLM has rapidly sparked a research boom and demonstrated significant application potential in the industrial sector. Thus, this paper aims to provide a systematic review of LLM and MR in industrial activities from 2023 to May 2025, mainly focusing on four typical application scenarios. We further analyze the underlying technologies in these scenarios, including multimodal interaction, context-aware augmentation, knowledge-enhanced technology, as well as ergonomics and usability, which are significant for the actual deployment of LLM and MR in human-centric intelligent manufacturing systems. Furthermore, we discuss the challenges and opportunities faced by LLM and MR in current manufacturing activities, and find that, although the feasibility of LLM and MR in the industrial field has been demonstrated, further explorations in areas such as resource limitations, reliability, and usability are still necessary to facilitate their widespread application. It is also worth mentioning that this paper presents the current state of the art in integrating LLM and MR technologies and provides insights on how these technologies can be leveraged to drive the intelligent transformation of human-centric Industry 5.0.
近年来,大语言模型(LLM)受到广泛关注,LLM与混合现实(MR)的融合彻底改变了以人为中心的智能制造活动模式,实现了自然交互和直观指令。然而,对于工业5.0框架下制造业LLM和MR的整合,目前还没有从整体的角度进行系统的讨论。随着ChatGPT在2022年底的发布,LLM迅速引发了研究热潮,并在工业领域显示出巨大的应用潜力。因此,本文旨在对2023年至2025年5月LLM和MR在工业活动中的应用进行系统回顾,主要关注四个典型的应用场景。我们进一步分析了这些场景中的基础技术,包括多模态交互、上下文感知增强、知识增强技术以及人体工程学和可用性,这对于在以人为中心的智能制造系统中实际部署LLM和MR具有重要意义。此外,我们讨论了LLM和MR在当前制造活动中面临的挑战和机遇,并发现,尽管LLM和MR在工业领域的可行性已经得到证明,但为了促进它们的广泛应用,还需要在资源限制、可靠性和可用性等领域进行进一步的探索。值得一提的是,本文介绍了整合LLM和MR技术的最新技术,并就如何利用这些技术推动以人为中心的工业5.0的智能转型提供了见解。
{"title":"Towards large language model for cognitive industrial mixed reality: A survey","authors":"Wei Fang ,&nbsp;Yingfen Xu ,&nbsp;Tienong Zhang ,&nbsp;Yiwei Wang ,&nbsp;Lianyu Zheng","doi":"10.1016/j.aei.2025.104276","DOIUrl":"10.1016/j.aei.2025.104276","url":null,"abstract":"<div><div>Since Large Language Model (LLM) has attracted extensive attention in recent years, the integration of LLM and Mixed Reality (MR) has revolutionized the paradigm of human-centric intelligent manufacturing activities with natural interaction and intuitive instruction. However, there has been no systematic discussion from a holistic perspective on the integration of LLM and MR in the manufacturing sector under the Industry 5.0 framework. With the release of ChatGPT at the end of 2022, LLM has rapidly sparked a research boom and demonstrated significant application potential in the industrial sector. Thus, this paper aims to provide a systematic review of LLM and MR in industrial activities from 2023 to May 2025, mainly focusing on four typical application scenarios. We further analyze the underlying technologies in these scenarios, including multimodal interaction, context-aware augmentation, knowledge-enhanced technology, as well as ergonomics and usability, which are significant for the actual deployment of LLM and MR in human-centric intelligent manufacturing systems. Furthermore, we discuss the challenges and opportunities faced by LLM and MR in current manufacturing activities, and find that, although the feasibility of LLM and MR in the industrial field has been demonstrated, further explorations in areas such as resource limitations, reliability, and usability are still necessary to facilitate their widespread application. It is also worth mentioning that this paper presents the current state of the art in integrating LLM and MR technologies and provides insights on how these technologies can be leveraged to drive the intelligent transformation of human-centric Industry 5.0.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104276"},"PeriodicalIF":9.9,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Advanced Engineering Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1