首页 > 最新文献

Advanced Engineering Informatics最新文献

英文 中文
Generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition for shield machine main bearing fault diagnosis 广义包络非线性基尼指数克导两阶段啁啾模态分解在屏蔽机主轴承故障诊断中的应用
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104354
Gang Shi, Chengjin Qin, Pengcheng Xia, Zhinan Zhang, Chengliang Liu
The main bearing is the core mechanical equipment of shield machine, known as the heart of the shield machine, mainly applied to support the rotation of cutterhead system. The health status of main bearing has significant impact on the stable and safe construction for shield machine. However, main bearing has very low rotational speed of only 1 rpm to 5 rpm, and the structure is very complex, with a diameter even exceeding 10 m. The fault features for main bearing vibration signals are relatively weak. Therefore, the main bearing fault diagnosis is a challenging task. To tackle the problem, we develop a novel generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition (GENGI-TSCMD) for fault diagnosis of shield machine main bearing in this article. The proposed method consists of optimized demodulated frequency band (ODFB) selection and fault frequency extraction two key parts. We firstly developed GENGI-gram for ODFB extraction, which adopting stepwise increasing segmentation and signal frequency spectrum trend to divide the signal’s frequency bands, and obtain two grams. A new GENGI fault pulse identifier was constructed to select ODFB in these two grams. GENGI can availably characterize fault pulse features and suppress noise interference by combining generalized envelope and nonlinear weights based on Gini index. For fault frequency extraction, we proposed TSCMD by establishing bandwidth guided adaptive chirp mode decomposition (BACMD) and fault frequency extraction mode decomposition (FEMD). BACMD is adopted to extract sub-signals in each demodulation frequency band of GENGI-gram. FEMD is applied to decompose each order fault frequency sub-signals of ODFB signal’s envelope. In this way, the main bearing’s fault type can be precisely diagnosed. BACMD derives the bandwidth calculation formula for sub-signal, which can extract sub-signals with specific frequency bandwidth. FEMD constructs a novel adaptive signal decomposition model, which can effectively decompose each order fault frequency sub-signal with narrow frequency band. Then we use actual main bearing fault vibration signals to verify the proposed method. The experimental results show that proposed GENGI-TSCMD can availably filter out various types of strong noise disturbance, and precisely extract each order fault frequency component. Moreover, proposed method has superior performance than current signal processing fault diagnosis methods.
主轴承是盾构机的核心机械设备,被称为盾构机的心脏,主要用于支撑刀盘系统的转动。主轴承的健康状况对盾构机的稳定、安全施工有着重要的影响。然而,主轴承的转速非常低,仅为1转/分至5转/分,结构非常复杂,直径甚至超过10米。主轴承振动信号的故障特征较弱。因此,主轴承故障诊断是一项具有挑战性的任务。为了解决这一问题,本文提出了一种用于屏蔽机主轴承故障诊断的广义包络非线性基尼指数图引导两阶段啁啾模式分解(GENGI-TSCMD)方法。该方法由优化解调频带选择和故障频率提取两个关键部分组成。我们首先开发了用于ODFB提取的GENGI-gram,采用逐级递增分割和信号频谱趋势对信号频段进行划分,得到2克。构造了一种新的GENGI故障脉冲识别器来选择这两种g中的ODFB。GENGI将广义包络与基于基尼指数的非线性权值相结合,能够有效地表征故障脉冲特征,抑制噪声干扰。在故障频率提取方面,通过建立带宽引导自适应啁啾模式分解(BACMD)和故障频率提取模式分解(FEMD),提出了TSCMD。采用BACMD提取gengi图各解调频带的子信号。利用FEMD对ODFB信号包络的各阶故障频率子信号进行分解。这样可以准确地诊断出主轴承的故障类型。BACMD导出了子信号的带宽计算公式,可以提取特定频率带宽的子信号。FEMD构建了一种新的自适应信号分解模型,该模型能有效地分解窄频带各阶故障频率子信号。然后用实际主轴承故障振动信号对所提方法进行了验证。实验结果表明,所提出的GENGI-TSCMD能够有效滤除各种类型的强噪声干扰,并精确提取各阶故障频率分量。与现有的信号处理故障诊断方法相比,该方法具有更好的性能。
{"title":"Generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition for shield machine main bearing fault diagnosis","authors":"Gang Shi,&nbsp;Chengjin Qin,&nbsp;Pengcheng Xia,&nbsp;Zhinan Zhang,&nbsp;Chengliang Liu","doi":"10.1016/j.aei.2026.104354","DOIUrl":"10.1016/j.aei.2026.104354","url":null,"abstract":"<div><div>The main bearing is the core mechanical equipment of shield machine, known as the heart of the shield machine, mainly applied to support the rotation of cutterhead system. The health status of main bearing has significant impact on the stable and safe construction for shield machine. However, main bearing has very low rotational speed of only 1 rpm to 5 rpm, and the structure is very complex, with a diameter even exceeding 10 m. The fault features for main bearing vibration signals are relatively weak. Therefore, the main bearing fault diagnosis is a challenging task. To tackle the problem, we develop a novel generalized envelope nonlinear Gini index-gram guided two-stage chirp mode decomposition (GENGI-TSCMD) for fault diagnosis of shield machine main bearing in this article. The proposed method consists of optimized demodulated frequency band (ODFB) selection and fault frequency extraction two key parts. We firstly developed GENGI-gram for ODFB extraction, which adopting stepwise increasing segmentation and signal frequency spectrum trend to divide the signal’s frequency bands, and obtain two grams. A new GENGI fault pulse identifier was constructed to select ODFB in these two grams. GENGI can availably characterize fault pulse features and suppress noise interference by combining generalized envelope and nonlinear weights based on Gini index. For fault frequency extraction, we proposed TSCMD by establishing bandwidth guided adaptive chirp mode decomposition (BACMD) and fault frequency extraction mode decomposition (FEMD). BACMD is adopted to extract sub-signals in each demodulation frequency band of GENGI-gram. FEMD is applied to decompose each order fault frequency sub-signals of ODFB signal’s envelope. In this way, the main bearing’s fault type can be precisely diagnosed. BACMD derives the bandwidth calculation formula for sub-signal, which can extract sub-signals with specific frequency bandwidth. FEMD constructs a novel adaptive signal decomposition model, which can effectively decompose each order fault frequency sub-signal with narrow frequency band. Then we use actual main bearing fault vibration signals to verify the proposed method. The experimental results show that proposed GENGI-TSCMD can availably filter out various types of strong noise disturbance, and precisely extract each order fault frequency component. Moreover, proposed method has superior performance than current signal processing fault diagnosis methods.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104354"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implicit coordination through environment modification: Multi-agent RL approach for adaptive door placement in evacuation scenarios 通过环境改变的隐式协调:疏散场景中自适应门放置的多智能体RL方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104359
Isabelle Fitkau, Timo Hartmann
In evacuation modeling, architectural configurations are typically treated as fixed constraints, limiting the exploration of adaptive design strategies that respond dynamically to occupants’ movement patterns. This research investigates environment-mediated coordination in a prototypical multi-agent reinforcement learning (MARL) setting. Using Proximal Policy Optimization (PPO), a navigating agent learns escape trajectories, while a design agent modifies door placement solely based on observed movement, without explicit communication. The system was evaluated in a multi-room environment to explore how complementary strategies evolve when navigation performance and immediate floor plan changes are coupled. The results demonstrate successful implicit coordination, with substantial reductions in episode length as the door controller agent progressively shifted door placements towards positions that facilitated faster escape paths. Navigation trajectories converged on paths that maximized the reward of the navigating agent, while door placement strategies minimized evacuation times. Therefore, although allowing architectural changes during episode runtime, the results still show emerging adaptive behaviors of both agents and demonstrate the feasibility of coordinating navigation and design actions through dynamic environment modification. This approach paves the way for adaptive architectural design systems that can dynamically respond to user behavior patterns, with potential applications in performance-based building design tools and in emergency planning optimization.
在疏散建模中,建筑配置通常被视为固定约束,限制了对动态响应居住者运动模式的适应性设计策略的探索。本研究在一个典型的多智能体强化学习(MARL)环境中研究环境介导的协调。使用近端策略优化(PPO),导航代理学习逃生轨迹,而设计代理仅根据观察到的运动修改门的位置,而不需要明确的通信。该系统在多房间环境中进行了评估,以探索当导航性能和即时平面图变化相结合时,互补策略如何演变。结果证明了成功的隐式协调,随着门控制器代理逐渐将门的位置移向有利于更快逃生路径的位置,事件长度大幅减少。导航轨迹收敛于使导航代理的回报最大化的路径上,而门的放置策略则使疏散时间最小化。因此,尽管在情节运行期间允许架构变化,结果仍然显示了两个智能体的新兴自适应行为,并证明了通过动态环境修改协调导航和设计动作的可行性。这种方法为能够动态响应用户行为模式的自适应建筑设计系统铺平了道路,在基于性能的建筑设计工具和应急规划优化中具有潜在的应用前景。
{"title":"Implicit coordination through environment modification: Multi-agent RL approach for adaptive door placement in evacuation scenarios","authors":"Isabelle Fitkau,&nbsp;Timo Hartmann","doi":"10.1016/j.aei.2026.104359","DOIUrl":"10.1016/j.aei.2026.104359","url":null,"abstract":"<div><div>In evacuation modeling, architectural configurations are typically treated as fixed constraints, limiting the exploration of adaptive design strategies that respond dynamically to occupants’ movement patterns. This research investigates environment-mediated coordination in a prototypical multi-agent reinforcement learning (MARL) setting. Using Proximal Policy Optimization (PPO), a navigating agent learns escape trajectories, while a design agent modifies door placement solely based on observed movement, without explicit communication. The system was evaluated in a multi-room environment to explore how complementary strategies evolve when navigation performance and immediate floor plan changes are coupled. The results demonstrate successful implicit coordination, with substantial reductions in episode length as the door controller agent progressively shifted door placements towards positions that facilitated faster escape paths. Navigation trajectories converged on paths that maximized the reward of the navigating agent, while door placement strategies minimized evacuation times. Therefore, although allowing architectural changes during episode runtime, the results still show emerging adaptive behaviors of both agents and demonstrate the feasibility of coordinating navigation and design actions through dynamic environment modification. This approach paves the way for adaptive architectural design systems that can dynamically respond to user behavior patterns, with potential applications in performance-based building design tools and in emergency planning optimization.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104359"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146078381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DuoTransFormer for nonlinear seismic response prediction: Cover large, focus local, and enforce law 用于非线性地震反应预测的DuoTransFormer:覆盖大,关注局部,并执行法律
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-24 DOI: 10.1016/j.aei.2026.104375
Xinyi Hu , Congzhen Xiao , Zhiqiang Zhang , Jie Gao
Deep learning-based methods have recently been applied to structural seismic response prediction, yet existing models still face challenges in capturing strongly nonlinear behaviors and exhibit limitations in multi-scale temporal modeling and inter-variable dependencies. Based on these observations, we propose DuoTransformer, a novel dual-view Transformer backbone that integrates a Transposed Transformer Encoder treating each variable as a token and a Patch Transformer Encoder for local temporal patterns, thereby enabling multi-scale modeling of both coarse-grained and fine-grained temporal dependencies. Inspired by response characteristics such as period elongation and stiffness degradation, the Physical Module is designed following a phenomenon-driven design paradigm. Furthermore, DuoTransFormer decouples the prediction process into two stages: the Physical Module, which hard-codes structural dynamics and outputs three parallel data chains as physics-informed feature sequences, and the Transformer Module, which takes them as inputs. Two proof-of-concept experiments are first conducted to validate the effectiveness of the proposed method. Subsequently, extensive evaluations on two open-access structural models, including a 42-story case study building and a 632-m real-world building, demonstrate that DuoTransFormer outperforms state-of-the-art time-series forecasting models (e.g., PatchTST, iTransformer) while achieving substantially lower computational cost than finite element simulations. The model-agnostic Physical Module can be flexibly combined with all baselines to deliver substantial performance gains.
基于深度学习的方法最近被应用于结构地震反应预测,但现有模型在捕获强非线性行为方面仍然面临挑战,并且在多尺度时间建模和变量间依赖性方面存在局限性。基于这些观察,我们提出了DuoTransformer,这是一种新的双视图Transformer主干,它集成了一个将每个变量视为令牌的转置Transformer编码器和一个用于本地时间模式的Patch Transformer编码器,从而实现了粗粒度和细粒度时间依赖性的多尺度建模。受周期伸长和刚度退化等响应特性的启发,物理模块的设计遵循了现象驱动的设计范式。此外,DuoTransFormer将预测过程解耦为两个阶段:物理模块(Physical Module)硬编码结构动力学并输出三个并行数据链作为物理知情的特征序列,而变压器模块(Transformer Module)将它们作为输入。首先进行了两个概念验证实验来验证所提出方法的有效性。随后,对两个开放访问结构模型(包括42层的案例研究建筑和632米的真实建筑)进行了广泛的评估,结果表明,DuoTransFormer优于最先进的时间序列预测模型(例如PatchTST, iTransformer),同时实现了比有限元模拟低得多的计算成本。与模型无关的物理模块可以灵活地与所有基线组合在一起,以提供显著的性能提升。
{"title":"DuoTransFormer for nonlinear seismic response prediction: Cover large, focus local, and enforce law","authors":"Xinyi Hu ,&nbsp;Congzhen Xiao ,&nbsp;Zhiqiang Zhang ,&nbsp;Jie Gao","doi":"10.1016/j.aei.2026.104375","DOIUrl":"10.1016/j.aei.2026.104375","url":null,"abstract":"<div><div>Deep learning-based methods have recently been applied to structural seismic response prediction, yet existing models still face challenges in capturing strongly nonlinear behaviors and exhibit limitations in multi-scale temporal modeling and inter-variable dependencies. Based on these observations, we propose <span><math><mi>DuoTransformer</mi></math></span>, a novel dual-view Transformer backbone that integrates a <span><math><mi>Trans</mi></math></span>posed Trans<span><math><mi>former</mi></math></span> Encoder treating each variable as a token and a Patch <span><math><mi>Transformer</mi></math></span> Encoder for local temporal patterns, thereby enabling multi-scale modeling of both coarse-grained and fine-grained temporal dependencies. Inspired by response characteristics such as period elongation and stiffness degradation, the Physical Module is designed following a phenomenon-driven design paradigm. Furthermore, DuoTransFormer decouples the prediction process into two stages: the Physical Module, which hard-codes structural dynamics and outputs three parallel data chains as physics-informed feature sequences, and the Transformer Module, which takes them as inputs. Two proof-of-concept experiments are first conducted to validate the effectiveness of the proposed method. Subsequently, extensive evaluations on two open-access structural models, including a 42-story case study building and a 632-m real-world building, demonstrate that DuoTransFormer outperforms state-of-the-art time-series forecasting models (e.g., PatchTST, iTransformer) while achieving substantially lower computational cost than finite element simulations. The model-agnostic Physical Module can be flexibly combined with all baselines to deliver substantial performance gains.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104375"},"PeriodicalIF":9.9,"publicationDate":"2026-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven modelling of unloading hours using explainable gradient boosting models 使用可解释的梯度提升模型的卸载时间数据驱动建模
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104353
Celal Cakiroglu , Najat Almasarwah , Mehmet Hakan Özdemir , Batin Latif Aylak , Manjeet Singh , Muhammet Deveci
Unloading processes denote the extraction of finished goods and raw materials from transport units and their subsequent conveyance to designated locations. The efficiency of unloading processes is vital in supply chain and logistics management, regarded as an essential component. Delays in unloading operations result in numerous challenges, including heightened operational expenses, diminished labour efficiency, and supply chain bottlenecks. Consequently, it is essential to ascertain unloading times beforehand to mitigate these challenges, resulting in diminished idle time, enhanced overall efficiency, and optimized scheduling. Therefore, precise prediction of unloading times is critically significant. The novelty of this study lies in the application of machine learning techniques to improve operational efficiency by accurately predicting unloading time. To that end, this study employed LightGBM and XGBoost to predict the unloading time in a real case. The unloading time can be predicted with R2 score greater than 0.99 utilizing both models. Subsequently, the SHapley Additive exPlanations (SHAP) methodology was used to ascertain how each input feature contributed to the model’s output. The load of leg significantly influences the unloading time more than the gross weight of truck and the leg distance.
卸货过程是指从运输单位提取制成品和原材料,并将其运送到指定地点。卸载过程的效率在供应链和物流管理中至关重要,被视为必不可少的组成部分。卸载作业的延迟会带来许多挑战,包括运营费用增加、劳动效率降低和供应链瓶颈。因此,必须事先确定卸载时间,以减轻这些挑战,从而减少闲置时间,提高整体效率,优化调度。因此,准确预测卸载时间至关重要。本研究的新颖之处在于应用机器学习技术,通过准确预测卸载时间来提高操作效率。为此,本研究采用LightGBM和XGBoost来预测实际情况下的卸载时间。两种模型均可预测卸载时间,R2评分均大于0.99。随后,使用SHapley加性解释(SHAP)方法来确定每个输入特征对模型输出的贡献。腿腿的载荷对卸载时间的影响比卡车总重和腿腿距离的影响更显著。
{"title":"Data-driven modelling of unloading hours using explainable gradient boosting models","authors":"Celal Cakiroglu ,&nbsp;Najat Almasarwah ,&nbsp;Mehmet Hakan Özdemir ,&nbsp;Batin Latif Aylak ,&nbsp;Manjeet Singh ,&nbsp;Muhammet Deveci","doi":"10.1016/j.aei.2026.104353","DOIUrl":"10.1016/j.aei.2026.104353","url":null,"abstract":"<div><div>Unloading processes denote the extraction of finished goods and raw materials from transport units and their subsequent conveyance to designated locations. The efficiency of unloading processes is vital in supply chain and logistics management, regarded as an essential component. Delays in unloading operations result in numerous challenges, including heightened operational expenses, diminished labour efficiency, and supply chain bottlenecks. Consequently, it is essential to ascertain unloading times beforehand to mitigate these challenges, resulting in diminished idle time, enhanced overall efficiency, and optimized scheduling. Therefore, precise prediction of unloading times is critically significant. The novelty of this study lies in the application of machine learning techniques to improve operational efficiency by accurately predicting unloading time. To that end, this study employed LightGBM and XGBoost to predict the unloading time in a real case. The unloading time can be predicted with <span><math><msup><mrow><mi>R</mi></mrow><mo>2</mo></msup></math></span> score greater than 0.99 utilizing both models. Subsequently, the SHapley Additive exPlanations (SHAP) methodology was used to ascertain how each input feature contributed to the model’s output. The load of leg significantly influences the unloading time more than the gross weight of truck and the leg distance.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104353"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RLLM-SS: A knowledge-guided simplex search method integrating large language model and reinforcement learning for injection molding quality control RLLM-SS:一种集成大语言模型和强化学习的知识引导单纯形搜索方法,用于注塑质量控制
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104372
Haipeng Zou , Xinyu Li , Yongkuan Yang , Ke Yao , Xiangsong Kong , Zhijiang Shao , Furong Gao
In injection molding (IM), product quality and process stability are highly dependent on the setting of key process parameters, making efficient parameter tuning essential for achieving reliable and consistent production. However, the tuning process is traditionally guided by expert experience and trial-and-error methods, which often lead to low efficiency and prolonged adjustment cycles. To address this challenge, we propose a knowledge-guided simplex search method that integrates a large language model (LLM) with the Soft Actor–Critic (SAC) reinforcement learning algorithm in a collaborative optimization framework, called RLLM-SS. In RLLM-SS, a quasi-gradient mechanism leverages historical data to dynamically estimate the step size and gradient compensation direction of the simplex search method. These estimated variables, integrated with domain knowledge, are encoded into structured prompts that guide the injection molding quality LLM in dynamically adjusting simplex coefficients through natural language reasoning. This enables the simplex search to overcome fixed-coefficient limitation and avoid local optima to the maximum extent. To mitigate the drawbacks of the LLM, such as its tendency to generate hallucinated outputs and lack of memory of past tuning adjustments, a SAC-based evaluation module is introduced. It assigns rewards based on optimization performance, thereby reinforcing effective strategies and fostering continuous policy improvement when similar conditions recur. Experimental evaluations first verified LLM-SS on standard high-dimensional benchmark functions, confirming its effectiveness in complex search spaces, and were then conducted on an injection molding quality simulation platform built on a neural network trained with practical IM process data. Results show that RLLM-SS outperforms several advanced methods, reducing the average number of iterations by 27.6% and the final Euclidean distance to the target quality curve by 68.3%. It also maintains strong robustness under Gaussian noise perturbations.
在注射成型(IM)中,产品质量和工艺稳定性高度依赖于关键工艺参数的设置,因此有效的参数调整对于实现可靠和一致的生产至关重要。然而,传统的调整过程是由专家经验和试错方法指导的,这往往导致低效率和长时间的调整周期。为了解决这一挑战,我们提出了一种知识引导的单纯形搜索方法,该方法将大型语言模型(LLM)与软行为者-评论家(SAC)强化学习算法集成在一个称为RLLM-SS的协作优化框架中。在RLLM-SS中,拟梯度机制利用历史数据动态估计单纯形搜索方法的步长和梯度补偿方向。这些估计变量与领域知识相结合,被编码成结构化提示,指导注塑质量LLM通过自然语言推理动态调整单纯形系数。这使得单纯形搜索克服了固定系数的限制,最大程度地避免了局部最优。为了减轻LLM的缺点,例如它容易产生幻觉输出和缺乏对过去调谐调整的记忆,引入了一个基于sac的评估模块。它根据优化绩效分配奖励,从而加强有效的策略,并在类似情况再次发生时促进持续的政策改进。实验评估首先验证了LLM-SS在标准高维基准函数上的有效性,验证了其在复杂搜索空间中的有效性,然后在基于实际IM过程数据训练的神经网络的注塑质量仿真平台上进行了实验评估。结果表明,RLLM-SS优于几种先进的方法,平均迭代次数减少27.6%,最终到目标质量曲线的欧几里得距离减少68.3%。在高斯噪声扰动下也保持较强的鲁棒性。
{"title":"RLLM-SS: A knowledge-guided simplex search method integrating large language model and reinforcement learning for injection molding quality control","authors":"Haipeng Zou ,&nbsp;Xinyu Li ,&nbsp;Yongkuan Yang ,&nbsp;Ke Yao ,&nbsp;Xiangsong Kong ,&nbsp;Zhijiang Shao ,&nbsp;Furong Gao","doi":"10.1016/j.aei.2026.104372","DOIUrl":"10.1016/j.aei.2026.104372","url":null,"abstract":"<div><div>In injection molding (IM), product quality and process stability are highly dependent on the setting of key process parameters, making efficient parameter tuning essential for achieving reliable and consistent production. However, the tuning process is traditionally guided by expert experience and trial-and-error methods, which often lead to low efficiency and prolonged adjustment cycles. To address this challenge, we propose a knowledge-guided simplex search method that integrates a large language model (LLM) with the Soft Actor–Critic (SAC) reinforcement learning algorithm in a collaborative optimization framework, called RLLM-SS. In RLLM-SS, a quasi-gradient mechanism leverages historical data to dynamically estimate the step size and gradient compensation direction of the simplex search method. These estimated variables, integrated with domain knowledge, are encoded into structured prompts that guide the injection molding quality LLM in dynamically adjusting simplex coefficients through natural language reasoning. This enables the simplex search to overcome fixed-coefficient limitation and avoid local optima to the maximum extent. To mitigate the drawbacks of the LLM, such as its tendency to generate hallucinated outputs and lack of memory of past tuning adjustments, a SAC-based evaluation module is introduced. It assigns rewards based on optimization performance, thereby reinforcing effective strategies and fostering continuous policy improvement when similar conditions recur. Experimental evaluations first verified LLM-SS on standard high-dimensional benchmark functions, confirming its effectiveness in complex search spaces, and were then conducted on an injection molding quality simulation platform built on a neural network trained with practical IM process data. Results show that RLLM-SS outperforms several advanced methods, reducing the average number of iterations by 27.6% and the final Euclidean distance to the target quality curve by 68.3%. It also maintains strong robustness under Gaussian noise perturbations.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104372"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An adaptive spatial–temporal encoder with gated multi-convolutions for remaining useful life prediction 用于剩余使用寿命预测的门控多卷积自适应时空编码器
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104369
Wen Liu , Jyun-You Chiang , Yi Li , Haobo Zhang
Accurate estimation of Remaining Useful Life (RUL) is essential for safe and economical operation of turbofan engines. This paper introduces an Adaptive Spatial-Temporal Encoder with Gated Multi-Convolutions (GMC-ASTE), a novel approach that simultaneously models temporal dynamics and inter-sensor relationships to enhance RUL prediction accuracy. The methodology employs a multi-scale gated convolution module to extract refined features from raw multi-sensor data, effectively reducing noise while retaining transient signals. These features are subsequently processed through an adaptive spatial–temporal encoder, which utilizes graph attention mechanisms to dynamically adjust sensor connections and multi-head temporal attention to capture long-term dependencies parallelly. Extensive experiments on the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) benchmark demonstrate that GMC-ASTE achieves superior performance, with the lowest Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Scoring metrics across all four sub-datasets. The results confirm the effectiveness and interpretability of the proposed model, providing an advanced framework that improves engine prognostic theory and offering airlines a practical tool to reduce downtime and maintenance costs.
准确估计剩余使用寿命(RUL)是保证涡扇发动机安全经济运行的关键。本文介绍了一种具有门控多卷积的自适应时空编码器(gmmc - aste),这是一种同时模拟时间动态和传感器间关系以提高RUL预测精度的新方法。该方法采用多尺度门控卷积模块从原始多传感器数据中提取精细特征,在保留瞬态信号的同时有效地降低噪声。这些特征随后通过自适应时空编码器进行处理,该编码器利用图形注意机制动态调整传感器连接和多头时间注意来并行捕获长期依赖关系。在商用模块化航空推进系统仿真(C-MAPSS)基准测试上进行的大量实验表明,GMC-ASTE在所有四个子数据集上都具有最低的均方根误差(RMSE)、平均绝对误差(MAE)和评分指标,具有卓越的性能。研究结果证实了该模型的有效性和可解释性,为改进发动机预测理论提供了一个先进的框架,并为航空公司提供了一个减少停机时间和维护成本的实用工具。
{"title":"An adaptive spatial–temporal encoder with gated multi-convolutions for remaining useful life prediction","authors":"Wen Liu ,&nbsp;Jyun-You Chiang ,&nbsp;Yi Li ,&nbsp;Haobo Zhang","doi":"10.1016/j.aei.2026.104369","DOIUrl":"10.1016/j.aei.2026.104369","url":null,"abstract":"<div><div>Accurate estimation of Remaining Useful Life (RUL) is essential for safe and economical operation of turbofan engines. This paper introduces an Adaptive Spatial-Temporal Encoder with Gated Multi-Convolutions (GMC-ASTE), a novel approach that simultaneously models temporal dynamics and inter-sensor relationships to enhance RUL prediction accuracy. The methodology employs a multi-scale gated convolution module to extract refined features from raw multi-sensor data, effectively reducing noise while retaining transient signals. These features are subsequently processed through an adaptive spatial–temporal encoder, which utilizes graph attention mechanisms to dynamically adjust sensor connections and multi-head temporal attention to capture long-term dependencies parallelly. Extensive experiments on the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) benchmark demonstrate that GMC-ASTE achieves superior performance, with the lowest Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Scoring metrics across all four sub-datasets. The results confirm the effectiveness and interpretability of the proposed model, providing an advanced framework that improves engine prognostic theory and offering airlines a practical tool to reduce downtime and maintenance costs.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104369"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building façade delamination quantification framework based on infrared instance segmentation and dual-modality vision calibration 建立基于红外实例分割和双模视觉标定的图像分层量化框架
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104341
Ziyu Wang , Zhenfen Jin , Xiaolan Zhuo , Jiangpeng Shu , Ziyue Zeng , Rongrong Wei , Lijun Ye
Building façade delamination quantification is critical for damage severity assessment and maintenance. While infrared thermography combined with deep learning reduces manual inspection costs, it fails to achieve precise detection and quantification results due to noise and insufficient spatial details in infrared images. To address these issues, this paper proposes a comprehensive framework. A multi-stage iterative infrared delamination network (IR-DNet) centered on dual-weighted features is designed for delamination segmentation. To tackle infrared data scarcity and real data collection costs, IR-DNet is trained on laboratory and simulated data while ensuring generalization to field scenarios. Additionally, a dual-modality vision calibration method is developed: after image registration, visible images serve as complementary information to correct perspective distortion and calibrate the scale factor, with the resulting parameters shared between modalities, thus creating an information fusion pipeline. The corrected infrared images are then fed into IR-DNet for segmentation and quantification. Field test yields an average quantification error of 6.02%, verifying the practical value for damage severity judgment and maintenance decisions.
建筑立面分层量化对损伤程度评估和维修至关重要。红外热成像与深度学习相结合,虽然降低了人工检测成本,但由于红外图像中的噪声和空间细节不足,无法实现精确的检测和量化结果。为了解决这些问题,本文提出了一个全面的框架。设计了一种以双加权特征为中心的多级迭代红外分层网络(IR-DNet)进行分层分割。为了解决红外数据稀缺和实际数据收集成本的问题,IR-DNet在实验室和模拟数据上进行了训练,同时确保了对现场场景的推广。此外,提出了一种双模态视觉校准方法,在图像配准后,以可见图像作为互补信息,校正透视失真,校准比例因子,并在模态之间共享得到的参数,形成信息融合管道。然后将校正后的红外图像送入IR-DNet进行分割和量化。现场试验平均量化误差为6.02%,验证了对损伤严重程度判断和维修决策的实用价值。
{"title":"Building façade delamination quantification framework based on infrared instance segmentation and dual-modality vision calibration","authors":"Ziyu Wang ,&nbsp;Zhenfen Jin ,&nbsp;Xiaolan Zhuo ,&nbsp;Jiangpeng Shu ,&nbsp;Ziyue Zeng ,&nbsp;Rongrong Wei ,&nbsp;Lijun Ye","doi":"10.1016/j.aei.2026.104341","DOIUrl":"10.1016/j.aei.2026.104341","url":null,"abstract":"<div><div>Building façade delamination quantification is critical for damage severity assessment and maintenance. While infrared thermography combined with deep learning reduces manual inspection costs, it fails to achieve precise detection and quantification results due to noise and insufficient spatial details in infrared images. To address these issues, this paper proposes a comprehensive framework. A multi-stage iterative infrared delamination network (IR-DNet) centered on dual-weighted features is designed for delamination segmentation. To tackle infrared data scarcity and real data collection costs, IR-DNet is trained on laboratory and simulated data while ensuring generalization to field scenarios. Additionally, a dual-modality vision calibration method is developed: after image registration, visible images serve as complementary information to correct perspective distortion and calibrate the scale factor, with the resulting parameters shared between modalities, thus creating an information fusion pipeline. The corrected infrared images are then fed into IR-DNet for segmentation and quantification. Field test yields an average quantification error of 6.02%, verifying the practical value for damage severity judgment and maintenance decisions.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104341"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Updating digital twin in supervisory control and data acquisition for sustainable laser beam micro machining 可持续激光束微加工监控与数据采集中数字孪生的更新
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-23 DOI: 10.1016/j.aei.2026.104355
K. Venkata Rao , K. Venkata Vaneesha
In the Industry 4.0 era, improving energy efficiency and product quality is a key challenge for the manufacturing sector, with the aim of reducing both energy consumption and surface roughness. The present study proposed a supervisory control and data acquisition system to address these issues in laser beam machining of AISI 316 L stainless steel. A digital twin is integrated with supervisory control and data acquisition interface, forming a closed-loop connection between the physical system and its digital counterpart. The digital twin is a combination of a mathematical model and an artificial neural network. The mathematical models estimate the surface roughness and specific machining energy using the experimental data collected from the physical system. When the estimated surface roughness and the specific machining energy exceeded their target values, the digital twin optimizes and adjusts the input parameters such as laser power, laser frequency, and cutting speed. The process parameters were also optimized using the Taguchi method and compared with supervisory control and data acquisition. The supervisory control and data acquisition system showed superior performance over the Taguchi method, achieving 47.44 %, 54.67 %, and 36.99 % reductions in surface roughness and 34.92 %, 22.61 %, and 83.66 % reductions in specific machining energy for 4, 6, and 8 mm thick sheets, respectively. The artificial neural network within the digital twin is continuously retrained using rich, real-time vibration signals, enabling instantaneous adaptation to the dynamic behaviour of the process, resulting in reduced surface roughness and specific machining energy, and enhanced overall sustainability.
在工业4.0时代,提高能源效率和产品质量是制造业面临的关键挑战,其目的是降低能耗和表面粗糙度。针对aisi316l不锈钢激光加工过程中存在的问题,提出了一种监控与数据采集系统。数字孪生体与监控和数据采集接口集成,在物理系统和数字孪生体之间形成闭环连接。数字双胞胎是数学模型和人工神经网络的结合。数学模型利用从物理系统收集的实验数据估计表面粗糙度和比加工能量。当估计的表面粗糙度和比加工能量超过目标值时,数字孪生对激光功率、激光频率和切割速度等输入参数进行优化和调整。采用田口法对工艺参数进行了优化,并与监控和数据采集进行了比较。监测控制和数据采集系统的性能优于田口法,对4、6和8 mm厚板材的表面粗糙度分别降低了47.44%、54.67%和36.99%,比加工能量分别降低了34.92%、22.61%和83.66%。数字孪生中的人工神经网络使用丰富的实时振动信号不断进行再训练,使其能够即时适应过程的动态行为,从而降低表面粗糙度和特定加工能量,并增强整体可持续性。
{"title":"Updating digital twin in supervisory control and data acquisition for sustainable laser beam micro machining","authors":"K. Venkata Rao ,&nbsp;K. Venkata Vaneesha","doi":"10.1016/j.aei.2026.104355","DOIUrl":"10.1016/j.aei.2026.104355","url":null,"abstract":"<div><div>In the Industry 4.0 era, improving energy efficiency and product quality is a key challenge for the manufacturing sector, with the aim of reducing both energy consumption and surface roughness. The present study proposed a supervisory control and data acquisition system to address these issues in laser beam machining of AISI 316 L stainless steel. A digital twin is integrated with supervisory control and data acquisition interface, forming a closed-loop connection between the physical system and its digital counterpart. The digital twin is a combination of a mathematical model and an artificial neural network.<!--> <!-->The mathematical models estimate the surface roughness and specific machining energy using the experimental data collected from the physical system. When the estimated surface roughness and the specific machining energy exceeded their target values, the digital twin optimizes and adjusts the input parameters such as laser power, laser frequency, and cutting speed. The process parameters were also optimized using the Taguchi method and compared with supervisory control and data acquisition. The supervisory control and data acquisition system showed superior performance over the Taguchi method, achieving 47.44 %, 54.67 %, and 36.99 % reductions in surface roughness and 34.92 %, 22.61 %, and 83.66 % reductions in specific machining energy for 4, 6, and 8 mm thick sheets, respectively. The artificial neural network within the digital twin is continuously retrained using rich, real-time vibration signals, enabling instantaneous adaptation to the dynamic behaviour of the process, resulting in reduced surface roughness and specific machining energy, and enhanced overall sustainability.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104355"},"PeriodicalIF":9.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general and efficient approach for uncertainty quantification in neural networks: Identifying risky decisions in AI systems 神经网络中不确定性量化的通用有效方法:识别人工智能系统中的风险决策
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-22 DOI: 10.1016/j.aei.2026.104343
Zhao Zhang, Senlin Luo, Xiaolong Wu, Xikai Gao, Jiawei Pi, Limin Pan
Uncertainty quantification in neural networks enables the assessment of predictive reliability in artificial intelligence systems, thereby reducing the risk of unsafe decisions. Existing approaches rely heavily on ensemble construction to sample the model parameter space and capture decision variability. However, under realistic resource constraints, small-scale sampling leads to insufficient evidence sources and inaccurate uncertainty estimates. In addition, the design of uncertainty metrics significantly influences estimation accuracy and may limit applicability across different types of machine learning (ML) tasks. In this paper, a Systematic Reusable Ensemble (SRE) framework is proposed for uncertainty quantification. The approach reuses and shares neural network components during retraining to efficiently generate multiple model instances within a single training process. Furthermore, a compounded ensemble pruning strategy is introduced to promote more uniform sampling in parameter space. A general fusion metric is then developed based on evidence theory with a redesigned trust allocation mechanism. Experimental results demonstrate that the proposed framework systematically reduces ensemble construction overhead while improving the reliability of uncertainty estimation. The generalization capability of the SRE is further validated through its effectiveness in identifying high-risk decisions across at least five categories of ML tasks.
神经网络中的不确定性量化能够评估人工智能系统的预测可靠性,从而降低不安全决策的风险。现有方法严重依赖于集成构造来采样模型参数空间并捕获决策可变性。然而,在现实的资源限制下,小规模抽样导致证据来源不足和不确定性估计不准确。此外,不确定性度量的设计显着影响估计的准确性,并可能限制在不同类型的机器学习(ML)任务中的适用性。提出了一种用于不确定性量化的系统可重用集成(SRE)框架。该方法在再训练过程中重用和共享神经网络组件,从而在单个训练过程中高效地生成多个模型实例。在此基础上,引入复合集合剪枝策略,使采样在参数空间上更加均匀。基于证据理论,提出了一种通用的融合度量,并重新设计了信任分配机制。实验结果表明,该框架系统地降低了集成构建开销,提高了不确定性估计的可靠性。通过在至少五类机器学习任务中识别高风险决策的有效性,进一步验证了SRE的泛化能力。
{"title":"A general and efficient approach for uncertainty quantification in neural networks: Identifying risky decisions in AI systems","authors":"Zhao Zhang,&nbsp;Senlin Luo,&nbsp;Xiaolong Wu,&nbsp;Xikai Gao,&nbsp;Jiawei Pi,&nbsp;Limin Pan","doi":"10.1016/j.aei.2026.104343","DOIUrl":"10.1016/j.aei.2026.104343","url":null,"abstract":"<div><div>Uncertainty quantification in neural networks enables the assessment of predictive reliability in artificial intelligence systems, thereby reducing the risk of unsafe decisions. Existing approaches rely heavily on ensemble construction to sample the model parameter space and capture decision variability. However, under realistic resource constraints, small-scale sampling leads to insufficient evidence sources and inaccurate uncertainty estimates. In addition, the design of uncertainty metrics significantly influences estimation accuracy and may limit applicability across different types of machine learning (ML) tasks. In this paper, a Systematic Reusable Ensemble (SRE) framework is proposed for uncertainty quantification. The approach reuses and shares neural network components during retraining to efficiently generate multiple model instances within a single training process. Furthermore, a compounded ensemble pruning strategy is introduced to promote more uniform sampling in parameter space. A general fusion metric is then developed based on evidence theory with a redesigned trust allocation mechanism. Experimental results demonstrate that the proposed framework systematically reduces ensemble construction overhead while improving the reliability of uncertainty estimation. The generalization capability of the SRE is further validated through its effectiveness in identifying high-risk decisions across at least five categories of ML tasks.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104343"},"PeriodicalIF":9.9,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two-phase cost sensitive-based domain adversarial neural network for anomaly detection in mass customized production 大规模定制生产中基于成本敏感的两相域对抗神经网络异常检测
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-22 DOI: 10.1016/j.aei.2026.104371
Yuan Gao , Thong Ngee Goh , Qingan Cui , Qing Zhang , Zhen He
In the context of globalization and intense market competition, detecting anomalies in products is vital for enterprises to improve quality, enhance efficiency, and reduce costs. Enterprises are also pursuing mass customization to meet personalized customer needs. However, customized products are typically produced in small batches. Furthermore, modern manufacturing processes typically exhibit low defective rates. Additionally, quality results are only available after the inspections. These characteristics lead to the challenges of small sample sizes, imbalanced data, and delayed label acquisition for production data, potentially undermining the effectiveness of existing anomaly detection models. To cope with these challenges, this study proposes a two-phase cost sensitive domain adversarial neural network framework that leverages transfer learning to apply knowledge from similar mass-standardized products to customized products. When process parameters are available but inspection results are not, an unsupervised domain adversarial neural network integrated with cost-sensitive learning is employed to address the issues of small sample sizes and unlabeled data. Within this network, the cost-sensitive learning component simultaneously tackles data imbalance by biasing the label predictor towards the minority class through higher loss assignment compared to the majority class. Once inspection results become available for some customized products, the label predictor loss for these labeled samples is incorporated to further enhance anomaly detection. The experimental results demonstrate that the proposed method outperforms other state-of-the-art anomaly detection methods in practical smartphone speaker and laptop solid-state drive manufacturing processes.
在全球化和激烈的市场竞争背景下,产品异常检测对企业提高质量、提高效率、降低成本至关重要。企业也在追求大规模定制,以满足客户的个性化需求。然而,定制产品通常是小批量生产的。此外,现代制造工艺通常表现出较低的不良率。此外,质量结果只有在检验后才能得到。这些特征导致了小样本量、数据不平衡以及生产数据的标签获取延迟的挑战,潜在地破坏了现有异常检测模型的有效性。为了应对这些挑战,本研究提出了一种两阶段成本敏感域对抗神经网络框架,该框架利用迁移学习将类似大规模标准化产品的知识应用于定制产品。当过程参数可用但检测结果不可用时,采用无监督域对抗神经网络与成本敏感学习相结合来解决小样本量和未标记数据的问题。在这个网络中,成本敏感的学习组件同时通过比多数类更高的损失分配将标签预测器偏向少数类来处理数据不平衡。一旦某些定制产品的检测结果可用,这些标记样品的标签预测器损失将被纳入,以进一步增强异常检测。实验结果表明,该方法在智能手机扬声器和笔记本电脑固态硬盘制造过程中优于其他最先进的异常检测方法。
{"title":"A two-phase cost sensitive-based domain adversarial neural network for anomaly detection in mass customized production","authors":"Yuan Gao ,&nbsp;Thong Ngee Goh ,&nbsp;Qingan Cui ,&nbsp;Qing Zhang ,&nbsp;Zhen He","doi":"10.1016/j.aei.2026.104371","DOIUrl":"10.1016/j.aei.2026.104371","url":null,"abstract":"<div><div>In the context of globalization and intense market competition, detecting anomalies in products is vital for enterprises to improve quality, enhance efficiency, and reduce costs. Enterprises are also pursuing mass customization to meet personalized customer needs. However, customized products are typically produced in small batches. Furthermore, modern manufacturing processes typically exhibit low defective rates. Additionally, quality results are only available after the inspections. These characteristics lead to the challenges of small sample sizes, imbalanced data, and delayed label acquisition for production data, potentially undermining the effectiveness of existing anomaly detection models. To cope with these challenges, this study proposes a two-phase cost sensitive domain adversarial neural network framework that leverages transfer learning to apply knowledge from similar mass-standardized products to customized products. When process parameters are available but inspection results are not, an unsupervised domain adversarial neural network integrated with cost-sensitive learning is employed to address the issues of small sample sizes and unlabeled data. Within this network, the cost-sensitive learning component simultaneously tackles data imbalance by biasing the label predictor towards the minority class through higher loss assignment compared to the majority class. Once inspection results become available for some customized products, the label predictor loss for these labeled samples is incorporated to further enhance anomaly detection. The experimental results demonstrate that the proposed method outperforms other state-of-the-art anomaly detection methods in practical smartphone speaker and laptop solid-state drive manufacturing processes.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104371"},"PeriodicalIF":9.9,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Advanced Engineering Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1