首页 > 最新文献

Advanced Engineering Informatics最新文献

英文 中文
A novel discriminative joint adversarial network for quantitatively detecting wheel polygonization of heavy-haul locomotives across variable running conditions 一种用于重载机车变工况下车轮多边形定量检测的新型判别联合对抗网络
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104377
Maoyong Dong, Shiqian Chen, Hongbing Wang, Wanming Zhai
Timely quantitative detection of wheel polygonal wear is of great significance for railway maintenance and improving the train running quality. However, existing deep learning-based detection methods struggle with speed variation-induced feature distribution shifts, exhibiting weak transferability and failing to achieve quantitative diagnosis of wheel defects. To address these issues, a novel discriminative joint adversarial network (NDJAN) for polygonal fault detection under varying running speeds is proposed in this paper. A multi-branch parallel ResNet is first developed to extract sensitive features from raw signals using shortcut connections, which can preserve critical wear amplitude-related information and alleviate gradient vanishing problems. Then, a two-level discriminative feature fusion (TLDFF) scheme is designed with a hybrid attention mechanism and lightweight depthwise separable convolutions. The former is employed to amplify discriminative features, while the latter achieves intelligent fusion of multi-branch features through learnable weighting coefficients, ensuring optimal integration of complementary information from different branches. Finally, an implicit-explicit joint distribution alignment (IEJDA) strategy is presented to address fundamental transfer distribution discrepancies under variable operating conditions. This module accomplishes global distribution matching and fine-grained adaptation of decision boundaries by acting on the feature layer and regression decision layer, respectively. Both dynamics simulations and field tests are carried out to demonstrate that the proposed NDJAN approach can effectively and accurately detect the polygonal wear amplitudes.
车轮多边形磨损的及时定量检测对铁路维修和提高列车运行质量具有重要意义。然而,现有的基于深度学习的检测方法难以应对速度变化引起的特征分布偏移,可转移性较弱,无法实现车轮缺陷的定量诊断。针对这些问题,本文提出了一种新的用于变转速下多边形故障检测的判别联合对抗网络(NDJAN)。首先开发了一个多分支并行ResNet,使用快捷连接从原始信号中提取敏感特征,可以保留关键磨损幅度相关信息并缓解梯度消失问题。然后,设计了一种混合注意机制和轻量级深度可分离卷积的两级判别特征融合方案。前者用于放大判别特征,后者通过可学习的加权系数实现多分支特征的智能融合,保证不同分支互补信息的最优融合。最后,提出了一种隐式显式联合分配对齐(IEJDA)策略来解决变工况下的基本转移分配差异。该模块分别作用于特征层和回归决策层,实现全局分布匹配和决策边界的细粒度自适应。动力学仿真和现场试验结果表明,所提出的NDJAN方法能够有效、准确地检测多边形磨损幅值。
{"title":"A novel discriminative joint adversarial network for quantitatively detecting wheel polygonization of heavy-haul locomotives across variable running conditions","authors":"Maoyong Dong,&nbsp;Shiqian Chen,&nbsp;Hongbing Wang,&nbsp;Wanming Zhai","doi":"10.1016/j.aei.2026.104377","DOIUrl":"10.1016/j.aei.2026.104377","url":null,"abstract":"<div><div>Timely quantitative detection of wheel polygonal wear is of great significance for railway maintenance and improving the train running quality. However, existing deep learning-based detection methods struggle with speed variation-induced feature distribution shifts, exhibiting weak transferability and failing to achieve quantitative diagnosis of wheel defects. To address these issues, a novel discriminative joint adversarial network (NDJAN) for polygonal fault detection under varying running speeds is proposed in this paper. A multi-branch parallel ResNet is first developed to extract sensitive features from raw signals using shortcut connections, which can preserve critical wear amplitude-related information and alleviate gradient vanishing problems. Then, a two-level discriminative feature fusion (TLDFF) scheme is designed with a hybrid attention mechanism and lightweight depthwise separable convolutions. The former is employed to amplify discriminative features, while the latter achieves intelligent fusion of multi-branch features through learnable weighting coefficients, ensuring optimal integration of complementary information from different branches. Finally, an implicit-explicit joint distribution alignment (IEJDA) strategy is presented to address fundamental transfer distribution discrepancies under variable operating conditions. This module accomplishes global distribution matching and fine-grained adaptation of decision boundaries by acting on the feature layer and regression decision layer, respectively. Both dynamics simulations and field tests are carried out to demonstrate that the proposed NDJAN approach can effectively and accurately detect the polygonal wear amplitudes.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104377"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146078378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement learning-based hyper-heuristic algorithm for multi-warehouse and multi-machine agricultural machinery operation scheduling problem considering soil conditions and carbon emissions 考虑土壤条件和碳排放的多仓多机农机作业调度问题基于强化学习的超启发式算法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104346
Tengfei Wu , Lanyue Zhang , YingLu He , Liangcheng Zhou , Yiming Chen , Qing Yuan , Xingyun Duan , Xiaorong Lv
Multi-machine coordinated scheduling systems are a key technology for efficient production in unmanned farms and play an important role in advancing agricultural informatization. However, existing agricultural machinery scheduling studies often neglect cross-regional operational requirements in China’s hilly and mountainous areas and overlook the influence of soil moisture on machinery performance. To address these challenges, this study formulates a multi-warehouse and multi-machine agricultural machinery scheduling model and develops a tri-objective optimization framework to simultaneously minimize travel distance, operation time, and carbon emissions. A reinforcement learning-based hyper-heuristic algorithm (RLHHA) is proposed, which adopts a five-layer encoding scheme tailored to the problem structure and integrates eight customized low-level heuristics with a Q-learning controller. Hypervolume and spacing metrics are employed as state features to guide the adaptive selection of heuristics, thereby improving the accuracy and stability of scheduling decisions. Extensive experiments are conducted on six benchmark instances of different scales. Comparative results with three classical algorithms and two advanced hybrid algorithms demonstrate that the proposed RLHHA achieves superior performance in terms of solution accuracy, convergence quality, and robustness. The results indicate that the proposed model and algorithm can effectively support accurate, reliable, and sustainable decision-making for cross-regional agricultural machinery scheduling in real-world scenarios.
多机协同调度系统是实现无人农场高效生产的关键技术,对推进农业信息化具有重要作用。然而,现有的农机调度研究往往忽视了中国丘陵山区的跨区域作业要求,忽略了土壤湿度对机械性能的影响。针对这些挑战,本研究构建了多仓多机农机调度模型,并构建了三目标优化框架,以同时实现出行距离、运行时间和碳排放最小化。提出了一种基于强化学习的超启发式算法(RLHHA),该算法采用针对问题结构定制的五层编码方案,并将8种定制的低级启发式算法与Q-learning控制器集成在一起。采用超体积和间隔度量作为状态特征,指导启发式算法的自适应选择,从而提高调度决策的准确性和稳定性。在六个不同尺度的基准实例上进行了大量的实验。与三种经典算法和两种先进混合算法的比较结果表明,该算法在求解精度、收敛质量和鲁棒性方面均取得了较好的效果。结果表明,该模型和算法能够有效支持现实场景下跨区域农机调度决策的准确、可靠和可持续。
{"title":"Reinforcement learning-based hyper-heuristic algorithm for multi-warehouse and multi-machine agricultural machinery operation scheduling problem considering soil conditions and carbon emissions","authors":"Tengfei Wu ,&nbsp;Lanyue Zhang ,&nbsp;YingLu He ,&nbsp;Liangcheng Zhou ,&nbsp;Yiming Chen ,&nbsp;Qing Yuan ,&nbsp;Xingyun Duan ,&nbsp;Xiaorong Lv","doi":"10.1016/j.aei.2026.104346","DOIUrl":"10.1016/j.aei.2026.104346","url":null,"abstract":"<div><div>Multi-machine coordinated scheduling systems are a key technology for efficient production in unmanned farms and play an important role in advancing agricultural informatization. However, existing agricultural machinery scheduling studies often neglect cross-regional operational requirements in China’s hilly and mountainous areas and overlook the influence of soil moisture on machinery performance. To address these challenges, this study formulates a multi-warehouse and multi-machine agricultural machinery scheduling model and develops a tri-objective optimization framework to simultaneously minimize travel distance, operation time, and carbon emissions. A reinforcement learning-based hyper-heuristic algorithm (RLHHA) is proposed, which adopts a five-layer encoding scheme tailored to the problem structure and integrates eight customized low-level heuristics with a Q-learning controller. Hypervolume and spacing metrics are employed as state features to guide the adaptive selection of heuristics, thereby improving the accuracy and stability of scheduling decisions. Extensive experiments are conducted on six benchmark instances of different scales. Comparative results with three classical algorithms and two advanced hybrid algorithms demonstrate that the proposed RLHHA achieves superior performance in terms of solution accuracy, convergence quality, and robustness. The results indicate that the proposed model and algorithm can effectively support accurate, reliable, and sustainable decision-making for cross-regional agricultural machinery scheduling in real-world scenarios.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104346"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI-driven data augmentation and object-guided vision-language reasoning for PPE compliance analysis in work-at-height 高空作业PPE符合性分析的生成人工智能驱动数据增强和对象引导视觉语言推理
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104364
Wenyu Xu , Wen Yi , Yi Tan
PPE compliance is a fundamental prerequisite for ensuring safety in work-at-height. Although computer vision has advanced PPE detection, challenges remain in dataset scarcity that limits generalization and in weak semantic reasoning that hinders reliable compliance verification. To address these limitations, this paper presents a generative AI-driven data augmentation and an object-guided vision-language model (VLM) to analyze PPE compliance in work-at-height. Safety standards on work-at-height and PPE (e.g., GB 80-2016, GB 2811-2019) are formalized via ChatGPT 4o into a variable pool and structured prompts, which are used as inputs to text-to-image (T2I) generation model for generating a synthetic dataset. Object detection model is employed to detect PPE elements, and the structured outputs of object detection model are integrated with VLM, enabling vision-language reasoning that combines object detection with natural language understanding. Experimental results demonstrate that DALL·E 3 produces a more realistic synthetic dataset than other image generation models, with the hybrid dataset significantly improving detection performance ([email protected]=88.5%, Small Object [email protected]=75.8%). Using YOLOv11 detections as structured inputs, Qwen2.5-VL-7B achieves reliable compliance reasoning (CRA=87.6%, SC=0.83, EQ=4.2), and these advances are consolidated in an integrated platform supporting automated reporting and interactive analysis. This framework enhances work-at-height safety by alleviating data scarcity through generative augmentation and strengthening PPE compliance reasoning.
遵守个人防护装备是确保高空工作安全的基本先决条件。尽管计算机视觉具有先进的PPE检测,但数据集稀缺性限制了泛化,弱语义推理阻碍了可靠的符合性验证,这些方面仍然存在挑战。为了解决这些限制,本文提出了一个生成式人工智能驱动的数据增强和一个对象引导的视觉语言模型(VLM)来分析高空工作中的PPE合规性。通过ChatGPT 40将高空作业和个人防护安全标准(如GB 80-2016、GB 2811-2019)形式化为变量池和结构化提示,并将其作为文本到图像(t2c)生成模型的输入,生成合成数据集。采用目标检测模型对PPE元素进行检测,并将目标检测模型的结构化输出与VLM相结合,实现了目标检测与自然语言理解相结合的视觉语言推理。实验结果表明,与其他图像生成模型相比,DALL·e3生成的合成数据集更真实,混合数据集显著提高了检测性能([email protected]=88.5%, Small Object [email protected]=75.8%)。Qwen2.5-VL-7B使用YOLOv11检测作为结构化输入,实现了可靠的符合性推理(CRA=87.6%, SC=0.83, EQ=4.2),并将这些进展整合到一个支持自动报告和交互式分析的集成平台中。该框架通过生成增强和加强PPE合规推理来缓解数据稀缺性,从而提高高空作业安全性。
{"title":"Generative AI-driven data augmentation and object-guided vision-language reasoning for PPE compliance analysis in work-at-height","authors":"Wenyu Xu ,&nbsp;Wen Yi ,&nbsp;Yi Tan","doi":"10.1016/j.aei.2026.104364","DOIUrl":"10.1016/j.aei.2026.104364","url":null,"abstract":"<div><div>PPE compliance is a fundamental prerequisite for ensuring safety in work-at-height. Although computer vision has advanced PPE detection, challenges remain in dataset scarcity that limits generalization and in weak semantic reasoning that hinders reliable compliance verification. To address these limitations, this paper presents a generative AI-driven data augmentation and an object-guided vision-language model (VLM) to analyze PPE compliance in work-at-height. Safety standards on work-at-height and PPE (e.g., GB 80-2016, GB 2811-2019) are formalized via ChatGPT 4o into a variable pool and structured prompts, which are used as inputs to text-to-image (T2I) generation model for generating a synthetic dataset. Object detection model is employed to detect PPE elements, and the structured outputs of object detection model are integrated with VLM, enabling vision-language reasoning that combines object detection with natural language understanding. Experimental results demonstrate that DALL·E 3 produces a more realistic synthetic dataset than other image generation models, with the hybrid dataset significantly improving detection performance ([email protected]=88.5%, Small Object [email protected]=75.8%). Using YOLOv11 detections as structured inputs, Qwen2.5-VL-7B achieves reliable compliance reasoning (CRA=87.6%, SC=0.83, EQ=4.2), and these advances are consolidated in an integrated platform supporting automated reporting and interactive analysis. This framework enhances work-at-height safety by alleviating data scarcity through generative augmentation and strengthening PPE compliance reasoning.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104364"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146078380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-temporal motion-aware intelligent robotic grasping with velocity estimation for moving objects 基于运动物体速度估计的时空运动感知智能机器人抓取
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104367
Qing Jiao , Weifei Hu , Tingjie Wang , Geyu Shao , Ning Tang , Jiayi Wang , Long Fang
Dynamic grasping capabilities, i.e., grasping moving objects in unstructured environments, could render robotic systems more competitive in both industrial and daily life applications. However, previous studies mostly relied on restrictive assumptions, such as static objects subject to slight perturbations or pre-learned object motion patterns, which severely limited adaptability to unknown trajectories. While recent learning-based methods relax these assumptions, they prioritize object or grasp tracking to ensure smooth robot motion over future grasp pose prediction. The scarcity of dynamic grasp datasets further hinders the advancement of learning-based methods. To address these challenges, this paper presents a moving-object grasp prediction method based on Conv-T (Convolutional Transformer), a hierarchical architecture that fuses spatiotemporal features for motion-aware dynamic grasping. By integrating velocity estimation, this method models the dynamics of the latent motion trajectories from time-series depth images to predict future grasp poses. The Conv-T is built based on a proposed SLiding Window Multi-head Self-Attention (SLW-MSA) mechanism, which balances computational efficiency with performance by integrating the properties of convolutional operations and self-attention mechanisms. Additionally, a dynamic grasp dataset generation pipeline combining data synthesis with data expansion techniques is developed to efficiently embed temporal motion cues into the training data. The proposed method is validated on the constructed dynamic grasp datasets as well as in simulated and real‐world robotic environments. Experimental results demonstrate that our Conv-T-based method not only outperforms state-of-the-art networks on datasets but also exhibits superior robustness compared to other baselines when grasping moving objects.
动态抓取能力,即在非结构化环境中抓取移动物体,可以使机器人系统在工业和日常生活应用中更具竞争力。然而,以往的研究大多依赖于限制性假设,如静态物体受到轻微扰动或预先学习的物体运动模式,这严重限制了对未知轨迹的适应性。虽然最近基于学习的方法放松了这些假设,但它们优先考虑物体或抓取跟踪,以确保机器人在未来抓取姿势预测上的平滑运动。动态抓取数据集的缺乏进一步阻碍了基于学习的方法的发展。为了解决这些挑战,本文提出了一种基于卷积变换(convt)的运动物体抓取预测方法,这是一种融合时空特征的分层结构,用于运动感知动态抓取。该方法通过积分速度估计,对时间序列深度图像的潜在运动轨迹进行动力学建模,以预测未来的抓取姿势。该算法基于滑动窗口多头自注意(SLW-MSA)机制,通过集成卷积运算和自注意机制的特性,平衡了计算效率和性能。此外,开发了一种结合数据合成和数据扩展技术的动态抓取数据生成管道,以有效地将时间运动线索嵌入到训练数据中。在构建的动态抓取数据集以及模拟和现实机器人环境中验证了所提出的方法。实验结果表明,我们的基于卷积的方法不仅在数据集上优于最先进的网络,而且在抓取运动物体时,与其他基线相比,具有优越的鲁棒性。
{"title":"Spatio-temporal motion-aware intelligent robotic grasping with velocity estimation for moving objects","authors":"Qing Jiao ,&nbsp;Weifei Hu ,&nbsp;Tingjie Wang ,&nbsp;Geyu Shao ,&nbsp;Ning Tang ,&nbsp;Jiayi Wang ,&nbsp;Long Fang","doi":"10.1016/j.aei.2026.104367","DOIUrl":"10.1016/j.aei.2026.104367","url":null,"abstract":"<div><div>Dynamic grasping capabilities, i.e., grasping moving objects in unstructured environments, could render robotic systems more competitive in both industrial and daily life applications. However, previous studies mostly relied on restrictive assumptions, such as static objects subject to slight perturbations or pre-learned object motion patterns, which severely limited adaptability to unknown trajectories. While recent learning-based methods relax these assumptions, they prioritize object or grasp tracking to ensure smooth robot motion over future grasp pose prediction. The scarcity of dynamic grasp datasets further hinders the advancement of learning-based methods. To address these challenges, this paper presents a moving-object grasp prediction method based on Conv-T (Convolutional Transformer), a hierarchical architecture that fuses spatiotemporal features for motion-aware dynamic grasping. By integrating velocity estimation, this method models the dynamics of the latent motion trajectories from time-series depth images to predict future grasp poses. The Conv-T is built based on a proposed <strong>SL</strong>iding <strong>W</strong>indow <strong>M</strong>ulti-head <strong>S</strong>elf-<strong>A</strong>ttention (SLW-MSA) mechanism, which balances computational efficiency with performance by integrating the properties of convolutional operations and self-attention mechanisms. Additionally, a dynamic grasp dataset generation pipeline combining data synthesis with data expansion techniques is developed to efficiently embed temporal motion cues into the training data. The proposed method is validated on the constructed dynamic grasp datasets as well as in simulated and real‐world robotic environments. Experimental results demonstrate that our Conv-T-based method not only outperforms state-of-the-art networks on datasets but also exhibits superior robustness compared to other baselines when grasping moving objects.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104367"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition for shield machine main bearing fault diagnosis 广义包络非线性基尼指数克导两阶段啁啾模态分解在屏蔽机主轴承故障诊断中的应用
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104354
Gang Shi, Chengjin Qin, Pengcheng Xia, Zhinan Zhang, Chengliang Liu
The main bearing is the core mechanical equipment of shield machine, known as the heart of the shield machine, mainly applied to support the rotation of cutterhead system. The health status of main bearing has significant impact on the stable and safe construction for shield machine. However, main bearing has very low rotational speed of only 1 rpm to 5 rpm, and the structure is very complex, with a diameter even exceeding 10 m. The fault features for main bearing vibration signals are relatively weak. Therefore, the main bearing fault diagnosis is a challenging task. To tackle the problem, we develop a novel generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition (GENGI-TSCMD) for fault diagnosis of shield machine main bearing in this article. The proposed method consists of optimized demodulated frequency band (ODFB) selection and fault frequency extraction two key parts. We firstly developed GENGI-gram for ODFB extraction, which adopting stepwise increasing segmentation and signal frequency spectrum trend to divide the signal’s frequency bands, and obtain two grams. A new GENGI fault pulse identifier was constructed to select ODFB in these two grams. GENGI can availably characterize fault pulse features and suppress noise interference by combining generalized envelope and nonlinear weights based on Gini index. For fault frequency extraction, we proposed TSCMD by establishing bandwidth guided adaptive chirp mode decomposition (BACMD) and fault frequency extraction mode decomposition (FEMD). BACMD is adopted to extract sub-signals in each demodulation frequency band of GENGI-gram. FEMD is applied to decompose each order fault frequency sub-signals of ODFB signal’s envelope. In this way, the main bearing’s fault type can be precisely diagnosed. BACMD derives the bandwidth calculation formula for sub-signal, which can extract sub-signals with specific frequency bandwidth. FEMD constructs a novel adaptive signal decomposition model, which can effectively decompose each order fault frequency sub-signal with narrow frequency band. Then we use actual main bearing fault vibration signals to verify the proposed method. The experimental results show that proposed GENGI-TSCMD can availably filter out various types of strong noise disturbance, and precisely extract each order fault frequency component. Moreover, proposed method has superior performance than current signal processing fault diagnosis methods.
主轴承是盾构机的核心机械设备,被称为盾构机的心脏,主要用于支撑刀盘系统的转动。主轴承的健康状况对盾构机的稳定、安全施工有着重要的影响。然而,主轴承的转速非常低,仅为1转/分至5转/分,结构非常复杂,直径甚至超过10米。主轴承振动信号的故障特征较弱。因此,主轴承故障诊断是一项具有挑战性的任务。为了解决这一问题,本文提出了一种用于屏蔽机主轴承故障诊断的广义包络非线性基尼指数图引导两阶段啁啾模式分解(GENGI-TSCMD)方法。该方法由优化解调频带选择和故障频率提取两个关键部分组成。我们首先开发了用于ODFB提取的GENGI-gram,采用逐级递增分割和信号频谱趋势对信号频段进行划分,得到2克。构造了一种新的GENGI故障脉冲识别器来选择这两种g中的ODFB。GENGI将广义包络与基于基尼指数的非线性权值相结合,能够有效地表征故障脉冲特征,抑制噪声干扰。在故障频率提取方面,通过建立带宽引导自适应啁啾模式分解(BACMD)和故障频率提取模式分解(FEMD),提出了TSCMD。采用BACMD提取gengi图各解调频带的子信号。利用FEMD对ODFB信号包络的各阶故障频率子信号进行分解。这样可以准确地诊断出主轴承的故障类型。BACMD导出了子信号的带宽计算公式,可以提取特定频率带宽的子信号。FEMD构建了一种新的自适应信号分解模型,该模型能有效地分解窄频带各阶故障频率子信号。然后用实际主轴承故障振动信号对所提方法进行了验证。实验结果表明,所提出的GENGI-TSCMD能够有效滤除各种类型的强噪声干扰,并精确提取各阶故障频率分量。与现有的信号处理故障诊断方法相比,该方法具有更好的性能。
{"title":"Generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition for shield machine main bearing fault diagnosis","authors":"Gang Shi,&nbsp;Chengjin Qin,&nbsp;Pengcheng Xia,&nbsp;Zhinan Zhang,&nbsp;Chengliang Liu","doi":"10.1016/j.aei.2026.104354","DOIUrl":"10.1016/j.aei.2026.104354","url":null,"abstract":"<div><div>The main bearing is the core mechanical equipment of shield machine, known as the heart of the shield machine, mainly applied to support the rotation of cutterhead system. The health status of main bearing has significant impact on the stable and safe construction for shield machine. However, main bearing has very low rotational speed of only 1 rpm to 5 rpm, and the structure is very complex, with a diameter even exceeding 10 m. The fault features for main bearing vibration signals are relatively weak. Therefore, the main bearing fault diagnosis is a challenging task. To tackle the problem, we develop a novel generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition (GENGI-TSCMD) for fault diagnosis of shield machine main bearing in this article. The proposed method consists of optimized demodulated frequency band (ODFB) selection and fault frequency extraction two key parts. We firstly developed GENGI-gram for ODFB extraction, which adopting stepwise increasing segmentation and signal frequency spectrum trend to divide the signal’s frequency bands, and obtain two grams. A new GENGI fault pulse identifier was constructed to select ODFB in these two grams. GENGI can availably characterize fault pulse features and suppress noise interference by combining generalized envelope and nonlinear weights based on Gini index. For fault frequency extraction, we proposed TSCMD by establishing bandwidth guided adaptive chirp mode decomposition (BACMD) and fault frequency extraction mode decomposition (FEMD). BACMD is adopted to extract sub-signals in each demodulation frequency band of GENGI-gram. FEMD is applied to decompose each order fault frequency sub-signals of ODFB signal’s envelope. In this way, the main bearing’s fault type can be precisely diagnosed. BACMD derives the bandwidth calculation formula for sub-signal, which can extract sub-signals with specific frequency bandwidth. FEMD constructs a novel adaptive signal decomposition model, which can effectively decompose each order fault frequency sub-signal with narrow frequency band. Then we use actual main bearing fault vibration signals to verify the proposed method. The experimental results show that proposed GENGI-TSCMD can availably filter out various types of strong noise disturbance, and precisely extract each order fault frequency component. Moreover, proposed method has superior performance than current signal processing fault diagnosis methods.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104354"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implicit coordination through environment modification: Multi-agent RL approach for adaptive door placement in evacuation scenarios 通过环境改变的隐式协调:疏散场景中自适应门放置的多智能体RL方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104359
Isabelle Fitkau, Timo Hartmann
In evacuation modeling, architectural configurations are typically treated as fixed constraints, limiting the exploration of adaptive design strategies that respond dynamically to occupants’ movement patterns. This research investigates environment-mediated coordination in a prototypical multi-agent reinforcement learning (MARL) setting. Using Proximal Policy Optimization (PPO), a navigating agent learns escape trajectories, while a design agent modifies door placement solely based on observed movement, without explicit communication. The system was evaluated in a multi-room environment to explore how complementary strategies evolve when navigation performance and immediate floor plan changes are coupled. The results demonstrate successful implicit coordination, with substantial reductions in episode length as the door controller agent progressively shifted door placements towards positions that facilitated faster escape paths. Navigation trajectories converged on paths that maximized the reward of the navigating agent, while door placement strategies minimized evacuation times. Therefore, although allowing architectural changes during episode runtime, the results still show emerging adaptive behaviors of both agents and demonstrate the feasibility of coordinating navigation and design actions through dynamic environment modification. This approach paves the way for adaptive architectural design systems that can dynamically respond to user behavior patterns, with potential applications in performance-based building design tools and in emergency planning optimization.
在疏散建模中,建筑配置通常被视为固定约束,限制了对动态响应居住者运动模式的适应性设计策略的探索。本研究在一个典型的多智能体强化学习(MARL)环境中研究环境介导的协调。使用近端策略优化(PPO),导航代理学习逃生轨迹,而设计代理仅根据观察到的运动修改门的位置,而不需要明确的通信。该系统在多房间环境中进行了评估,以探索当导航性能和即时平面图变化相结合时,互补策略如何演变。结果证明了成功的隐式协调,随着门控制器代理逐渐将门的位置移向有利于更快逃生路径的位置,事件长度大幅减少。导航轨迹收敛于使导航代理的回报最大化的路径上,而门的放置策略则使疏散时间最小化。因此,尽管在情节运行期间允许架构变化,结果仍然显示了两个智能体的新兴自适应行为,并证明了通过动态环境修改协调导航和设计动作的可行性。这种方法为能够动态响应用户行为模式的自适应建筑设计系统铺平了道路,在基于性能的建筑设计工具和应急规划优化中具有潜在的应用前景。
{"title":"Implicit coordination through environment modification: Multi-agent RL approach for adaptive door placement in evacuation scenarios","authors":"Isabelle Fitkau,&nbsp;Timo Hartmann","doi":"10.1016/j.aei.2026.104359","DOIUrl":"10.1016/j.aei.2026.104359","url":null,"abstract":"<div><div>In evacuation modeling, architectural configurations are typically treated as fixed constraints, limiting the exploration of adaptive design strategies that respond dynamically to occupants’ movement patterns. This research investigates environment-mediated coordination in a prototypical multi-agent reinforcement learning (MARL) setting. Using Proximal Policy Optimization (PPO), a navigating agent learns escape trajectories, while a design agent modifies door placement solely based on observed movement, without explicit communication. The system was evaluated in a multi-room environment to explore how complementary strategies evolve when navigation performance and immediate floor plan changes are coupled. The results demonstrate successful implicit coordination, with substantial reductions in episode length as the door controller agent progressively shifted door placements towards positions that facilitated faster escape paths. Navigation trajectories converged on paths that maximized the reward of the navigating agent, while door placement strategies minimized evacuation times. Therefore, although allowing architectural changes during episode runtime, the results still show emerging adaptive behaviors of both agents and demonstrate the feasibility of coordinating navigation and design actions through dynamic environment modification. This approach paves the way for adaptive architectural design systems that can dynamically respond to user behavior patterns, with potential applications in performance-based building design tools and in emergency planning optimization.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104359"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146078381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DuoTransFormer for nonlinear seismic response prediction: Cover large, focus local, and enforce law 用于非线性地震反应预测的DuoTransFormer:覆盖大,关注局部,并执行法律
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104375
Xinyi Hu , Congzhen Xiao , Zhiqiang Zhang , Jie Gao
Deep learning-based methods have recently been applied to structural seismic response prediction, yet existing models still face challenges in capturing strongly nonlinear behaviors and exhibit limitations in multi-scale temporal modeling and inter-variable dependencies. Based on these observations, we propose DuoTransformer, a novel dual-view Transformer backbone that integrates a Transposed Transformer Encoder treating each variable as a token and a Patch Transformer Encoder for local temporal patterns, thereby enabling multi-scale modeling of both coarse-grained and fine-grained temporal dependencies. Inspired by response characteristics such as period elongation and stiffness degradation, the Physical Module is designed following a phenomenon-driven design paradigm. Furthermore, DuoTransFormer decouples the prediction process into two stages: the Physical Module, which hard-codes structural dynamics and outputs three parallel data chains as physics-informed feature sequences, and the Transformer Module, which takes them as inputs. Two proof-of-concept experiments are first conducted to validate the effectiveness of the proposed method. Subsequently, extensive evaluations on two open-access structural models, including a 42-story case study building and a 632-m real-world building, demonstrate that DuoTransFormer outperforms state-of-the-art time-series forecasting models (e.g., PatchTST, iTransformer) while achieving substantially lower computational cost than finite element simulations. The model-agnostic Physical Module can be flexibly combined with all baselines to deliver substantial performance gains.
基于深度学习的方法最近被应用于结构地震反应预测,但现有模型在捕获强非线性行为方面仍然面临挑战,并且在多尺度时间建模和变量间依赖性方面存在局限性。基于这些观察,我们提出了DuoTransformer,这是一种新的双视图Transformer主干,它集成了一个将每个变量视为令牌的转置Transformer编码器和一个用于本地时间模式的Patch Transformer编码器,从而实现了粗粒度和细粒度时间依赖性的多尺度建模。受周期伸长和刚度退化等响应特性的启发,物理模块的设计遵循了现象驱动的设计范式。此外,DuoTransFormer将预测过程解耦为两个阶段:物理模块(Physical Module)硬编码结构动力学并输出三个并行数据链作为物理知情的特征序列,而变压器模块(Transformer Module)将它们作为输入。首先进行了两个概念验证实验来验证所提出方法的有效性。随后,对两个开放访问结构模型(包括42层的案例研究建筑和632米的真实建筑)进行了广泛的评估,结果表明,DuoTransFormer优于最先进的时间序列预测模型(例如PatchTST, iTransformer),同时实现了比有限元模拟低得多的计算成本。与模型无关的物理模块可以灵活地与所有基线组合在一起,以提供显著的性能提升。
{"title":"DuoTransFormer for nonlinear seismic response prediction: Cover large, focus local, and enforce law","authors":"Xinyi Hu ,&nbsp;Congzhen Xiao ,&nbsp;Zhiqiang Zhang ,&nbsp;Jie Gao","doi":"10.1016/j.aei.2026.104375","DOIUrl":"10.1016/j.aei.2026.104375","url":null,"abstract":"<div><div>Deep learning-based methods have recently been applied to structural seismic response prediction, yet existing models still face challenges in capturing strongly nonlinear behaviors and exhibit limitations in multi-scale temporal modeling and inter-variable dependencies. Based on these observations, we propose <span><math><mi>DuoTransformer</mi></math></span>, a novel dual-view Transformer backbone that integrates a <span><math><mi>Trans</mi></math></span>posed Trans<span><math><mi>former</mi></math></span> Encoder treating each variable as a token and a Patch <span><math><mi>Transformer</mi></math></span> Encoder for local temporal patterns, thereby enabling multi-scale modeling of both coarse-grained and fine-grained temporal dependencies. Inspired by response characteristics such as period elongation and stiffness degradation, the Physical Module is designed following a phenomenon-driven design paradigm. Furthermore, DuoTransFormer decouples the prediction process into two stages: the Physical Module, which hard-codes structural dynamics and outputs three parallel data chains as physics-informed feature sequences, and the Transformer Module, which takes them as inputs. Two proof-of-concept experiments are first conducted to validate the effectiveness of the proposed method. Subsequently, extensive evaluations on two open-access structural models, including a 42-story case study building and a 632-m real-world building, demonstrate that DuoTransFormer outperforms state-of-the-art time-series forecasting models (e.g., PatchTST, iTransformer) while achieving substantially lower computational cost than finite element simulations. The model-agnostic Physical Module can be flexibly combined with all baselines to deliver substantial performance gains.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104375"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven modelling of unloading hours using explainable gradient boosting models 使用可解释的梯度提升模型的卸载时间数据驱动建模
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104353
Celal Cakiroglu , Najat Almasarwah , Mehmet Hakan Özdemir , Batin Latif Aylak , Manjeet Singh , Muhammet Deveci
Unloading processes denote the extraction of finished goods and raw materials from transport units and their subsequent conveyance to designated locations. The efficiency of unloading processes is vital in supply chain and logistics management, regarded as an essential component. Delays in unloading operations result in numerous challenges, including heightened operational expenses, diminished labour efficiency, and supply chain bottlenecks. Consequently, it is essential to ascertain unloading times beforehand to mitigate these challenges, resulting in diminished idle time, enhanced overall efficiency, and optimized scheduling. Therefore, precise prediction of unloading times is critically significant. The novelty of this study lies in the application of machine learning techniques to improve operational efficiency by accurately predicting unloading time. To that end, this study employed LightGBM and XGBoost to predict the unloading time in a real case. The unloading time can be predicted with R2 score greater than 0.99 utilizing both models. Subsequently, the SHapley Additive exPlanations (SHAP) methodology was used to ascertain how each input feature contributed to the model’s output. The load of leg significantly influences the unloading time more than the gross weight of truck and the leg distance.
卸货过程是指从运输单位提取制成品和原材料,并将其运送到指定地点。卸载过程的效率在供应链和物流管理中至关重要,被视为必不可少的组成部分。卸载作业的延迟会带来许多挑战,包括运营费用增加、劳动效率降低和供应链瓶颈。因此,必须事先确定卸载时间,以减轻这些挑战,从而减少闲置时间,提高整体效率,优化调度。因此,准确预测卸载时间至关重要。本研究的新颖之处在于应用机器学习技术,通过准确预测卸载时间来提高操作效率。为此,本研究采用LightGBM和XGBoost来预测实际情况下的卸载时间。两种模型均可预测卸载时间,R2评分均大于0.99。随后,使用SHapley加性解释(SHAP)方法来确定每个输入特征对模型输出的贡献。腿腿的载荷对卸载时间的影响比卡车总重和腿腿距离的影响更显著。
{"title":"Data-driven modelling of unloading hours using explainable gradient boosting models","authors":"Celal Cakiroglu ,&nbsp;Najat Almasarwah ,&nbsp;Mehmet Hakan Özdemir ,&nbsp;Batin Latif Aylak ,&nbsp;Manjeet Singh ,&nbsp;Muhammet Deveci","doi":"10.1016/j.aei.2026.104353","DOIUrl":"10.1016/j.aei.2026.104353","url":null,"abstract":"<div><div>Unloading processes denote the extraction of finished goods and raw materials from transport units and their subsequent conveyance to designated locations. The efficiency of unloading processes is vital in supply chain and logistics management, regarded as an essential component. Delays in unloading operations result in numerous challenges, including heightened operational expenses, diminished labour efficiency, and supply chain bottlenecks. Consequently, it is essential to ascertain unloading times beforehand to mitigate these challenges, resulting in diminished idle time, enhanced overall efficiency, and optimized scheduling. Therefore, precise prediction of unloading times is critically significant. The novelty of this study lies in the application of machine learning techniques to improve operational efficiency by accurately predicting unloading time. To that end, this study employed LightGBM and XGBoost to predict the unloading time in a real case. The unloading time can be predicted with <span><math><msup><mrow><mi>R</mi></mrow><mo>2</mo></msup></math></span> score greater than 0.99 utilizing both models. Subsequently, the SHapley Additive exPlanations (SHAP) methodology was used to ascertain how each input feature contributed to the model’s output. The load of leg significantly influences the unloading time more than the gross weight of truck and the leg distance.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104353"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RLLM-SS: A knowledge-guided simplex search method integrating large language model and reinforcement learning for injection molding quality control RLLM-SS:一种集成大语言模型和强化学习的知识引导单纯形搜索方法,用于注塑质量控制
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104372
Haipeng Zou , Xinyu Li , Yongkuan Yang , Ke Yao , Xiangsong Kong , Zhijiang Shao , Furong Gao
In injection molding (IM), product quality and process stability are highly dependent on the setting of key process parameters, making efficient parameter tuning essential for achieving reliable and consistent production. However, the tuning process is traditionally guided by expert experience and trial-and-error methods, which often lead to low efficiency and prolonged adjustment cycles. To address this challenge, we propose a knowledge-guided simplex search method that integrates a large language model (LLM) with the Soft Actor–Critic (SAC) reinforcement learning algorithm in a collaborative optimization framework, called RLLM-SS. In RLLM-SS, a quasi-gradient mechanism leverages historical data to dynamically estimate the step size and gradient compensation direction of the simplex search method. These estimated variables, integrated with domain knowledge, are encoded into structured prompts that guide the injection molding quality LLM in dynamically adjusting simplex coefficients through natural language reasoning. This enables the simplex search to overcome fixed-coefficient limitation and avoid local optima to the maximum extent. To mitigate the drawbacks of the LLM, such as its tendency to generate hallucinated outputs and lack of memory of past tuning adjustments, a SAC-based evaluation module is introduced. It assigns rewards based on optimization performance, thereby reinforcing effective strategies and fostering continuous policy improvement when similar conditions recur. Experimental evaluations first verified LLM-SS on standard high-dimensional benchmark functions, confirming its effectiveness in complex search spaces, and were then conducted on an injection molding quality simulation platform built on a neural network trained with practical IM process data. Results show that RLLM-SS outperforms several advanced methods, reducing the average number of iterations by 27.6% and the final Euclidean distance to the target quality curve by 68.3%. It also maintains strong robustness under Gaussian noise perturbations.
在注射成型(IM)中,产品质量和工艺稳定性高度依赖于关键工艺参数的设置,因此有效的参数调整对于实现可靠和一致的生产至关重要。然而,传统的调整过程是由专家经验和试错方法指导的,这往往导致低效率和长时间的调整周期。为了解决这一挑战,我们提出了一种知识引导的单纯形搜索方法,该方法将大型语言模型(LLM)与软行为者-评论家(SAC)强化学习算法集成在一个称为RLLM-SS的协作优化框架中。在RLLM-SS中,拟梯度机制利用历史数据动态估计单纯形搜索方法的步长和梯度补偿方向。这些估计变量与领域知识相结合,被编码成结构化提示,指导注塑质量LLM通过自然语言推理动态调整单纯形系数。这使得单纯形搜索克服了固定系数的限制,最大程度地避免了局部最优。为了减轻LLM的缺点,例如它容易产生幻觉输出和缺乏对过去调谐调整的记忆,引入了一个基于sac的评估模块。它根据优化绩效分配奖励,从而加强有效的策略,并在类似情况再次发生时促进持续的政策改进。实验评估首先验证了LLM-SS在标准高维基准函数上的有效性,验证了其在复杂搜索空间中的有效性,然后在基于实际IM过程数据训练的神经网络的注塑质量仿真平台上进行了实验评估。结果表明,RLLM-SS优于几种先进的方法,平均迭代次数减少27.6%,最终到目标质量曲线的欧几里得距离减少68.3%。在高斯噪声扰动下也保持较强的鲁棒性。
{"title":"RLLM-SS: A knowledge-guided simplex search method integrating large language model and reinforcement learning for injection molding quality control","authors":"Haipeng Zou ,&nbsp;Xinyu Li ,&nbsp;Yongkuan Yang ,&nbsp;Ke Yao ,&nbsp;Xiangsong Kong ,&nbsp;Zhijiang Shao ,&nbsp;Furong Gao","doi":"10.1016/j.aei.2026.104372","DOIUrl":"10.1016/j.aei.2026.104372","url":null,"abstract":"<div><div>In injection molding (IM), product quality and process stability are highly dependent on the setting of key process parameters, making efficient parameter tuning essential for achieving reliable and consistent production. However, the tuning process is traditionally guided by expert experience and trial-and-error methods, which often lead to low efficiency and prolonged adjustment cycles. To address this challenge, we propose a knowledge-guided simplex search method that integrates a large language model (LLM) with the Soft Actor–Critic (SAC) reinforcement learning algorithm in a collaborative optimization framework, called RLLM-SS. In RLLM-SS, a quasi-gradient mechanism leverages historical data to dynamically estimate the step size and gradient compensation direction of the simplex search method. These estimated variables, integrated with domain knowledge, are encoded into structured prompts that guide the injection molding quality LLM in dynamically adjusting simplex coefficients through natural language reasoning. This enables the simplex search to overcome fixed-coefficient limitation and avoid local optima to the maximum extent. To mitigate the drawbacks of the LLM, such as its tendency to generate hallucinated outputs and lack of memory of past tuning adjustments, a SAC-based evaluation module is introduced. It assigns rewards based on optimization performance, thereby reinforcing effective strategies and fostering continuous policy improvement when similar conditions recur. Experimental evaluations first verified LLM-SS on standard high-dimensional benchmark functions, confirming its effectiveness in complex search spaces, and were then conducted on an injection molding quality simulation platform built on a neural network trained with practical IM process data. Results show that RLLM-SS outperforms several advanced methods, reducing the average number of iterations by 27.6% and the final Euclidean distance to the target quality curve by 68.3%. It also maintains strong robustness under Gaussian noise perturbations.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104372"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An adaptive spatial–temporal encoder with gated multi-convolutions for remaining useful life prediction 用于剩余使用寿命预测的门控多卷积自适应时空编码器
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104369
Wen Liu , Jyun-You Chiang , Yi Li , Haobo Zhang
Accurate estimation of Remaining Useful Life (RUL) is essential for safe and economical operation of turbofan engines. This paper introduces an Adaptive Spatial-Temporal Encoder with Gated Multi-Convolutions (GMC-ASTE), a novel approach that simultaneously models temporal dynamics and inter-sensor relationships to enhance RUL prediction accuracy. The methodology employs a multi-scale gated convolution module to extract refined features from raw multi-sensor data, effectively reducing noise while retaining transient signals. These features are subsequently processed through an adaptive spatial–temporal encoder, which utilizes graph attention mechanisms to dynamically adjust sensor connections and multi-head temporal attention to capture long-term dependencies parallelly. Extensive experiments on the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) benchmark demonstrate that GMC-ASTE achieves superior performance, with the lowest Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Scoring metrics across all four sub-datasets. The results confirm the effectiveness and interpretability of the proposed model, providing an advanced framework that improves engine prognostic theory and offering airlines a practical tool to reduce downtime and maintenance costs.
准确估计剩余使用寿命(RUL)是保证涡扇发动机安全经济运行的关键。本文介绍了一种具有门控多卷积的自适应时空编码器(gmmc - aste),这是一种同时模拟时间动态和传感器间关系以提高RUL预测精度的新方法。该方法采用多尺度门控卷积模块从原始多传感器数据中提取精细特征,在保留瞬态信号的同时有效地降低噪声。这些特征随后通过自适应时空编码器进行处理,该编码器利用图形注意机制动态调整传感器连接和多头时间注意来并行捕获长期依赖关系。在商用模块化航空推进系统仿真(C-MAPSS)基准测试上进行的大量实验表明,GMC-ASTE在所有四个子数据集上都具有最低的均方根误差(RMSE)、平均绝对误差(MAE)和评分指标,具有卓越的性能。研究结果证实了该模型的有效性和可解释性,为改进发动机预测理论提供了一个先进的框架,并为航空公司提供了一个减少停机时间和维护成本的实用工具。
{"title":"An adaptive spatial–temporal encoder with gated multi-convolutions for remaining useful life prediction","authors":"Wen Liu ,&nbsp;Jyun-You Chiang ,&nbsp;Yi Li ,&nbsp;Haobo Zhang","doi":"10.1016/j.aei.2026.104369","DOIUrl":"10.1016/j.aei.2026.104369","url":null,"abstract":"<div><div>Accurate estimation of Remaining Useful Life (RUL) is essential for safe and economical operation of turbofan engines. This paper introduces an Adaptive Spatial-Temporal Encoder with Gated Multi-Convolutions (GMC-ASTE), a novel approach that simultaneously models temporal dynamics and inter-sensor relationships to enhance RUL prediction accuracy. The methodology employs a multi-scale gated convolution module to extract refined features from raw multi-sensor data, effectively reducing noise while retaining transient signals. These features are subsequently processed through an adaptive spatial–temporal encoder, which utilizes graph attention mechanisms to dynamically adjust sensor connections and multi-head temporal attention to capture long-term dependencies parallelly. Extensive experiments on the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) benchmark demonstrate that GMC-ASTE achieves superior performance, with the lowest Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Scoring metrics across all four sub-datasets. The results confirm the effectiveness and interpretability of the proposed model, providing an advanced framework that improves engine prognostic theory and offering airlines a practical tool to reduce downtime and maintenance costs.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104369"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Advanced Engineering Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1