首页 > 最新文献

Advanced Engineering Informatics最新文献

英文 中文
Mitigating traffic oscillations in mixed traffic flow with scalable deep Koopman predictive control 基于可扩展深度Koopman预测控制的混合交通流交通振荡抑制
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-24 DOI: 10.1016/j.aei.2025.104258
Hao Lyu , Yanyong Guo , Pan Liu , Nan Zheng , Ting Wang , Quansheng Yue
Mitigating traffic oscillations in mixed flows of connected automated vehicles (CAVs) and human-driven vehicles (HDVs) is critical for enhancing traffic stability. A key challenge lies in modeling the nonlinear, heterogeneous behaviors of HDVs within computationally tractable predictive control frameworks. This study proposes an adaptive deep Koopman predictive control framework (AdapKoopPC) to address this issue. The framework features a novel deep Koopman network, AdapKoopnet, which represents complex HDV car-following dynamics as a linear system in a high-dimensional space by adaptively learning from naturalistic data. This learned linear representation is then embedded into a Model Predictive Control (MPC) scheme, enabling real-time, scalable, and optimal control of CAVs. We validate our framework using the HighD dataset and extensive numerical simulations. Results demonstrate that AdapKoopnet achieves superior trajectory prediction accuracy over baseline models. Furthermore, the complete AdapKoopPC controller significantly dampens traffic oscillations with lower computational cost, exhibiting strong performance even at low CAV penetration rates. The proposed framework offers a scalable and data-driven solution for enhancing stability in realistic mixed traffic environments. The code is made publicly available.1
缓解自动驾驶汽车(cav)和人类驾驶汽车(HDVs)混合流中的交通振荡对于提高交通稳定性至关重要。一个关键的挑战在于在计算可处理的预测控制框架内对hdv的非线性、异构行为进行建模。本研究提出一种自适应深度库普曼预测控制框架(AdapKoopPC)来解决这个问题。该框架采用了一种新颖的深度Koopman网络AdapKoopnet,该网络通过自适应学习自然数据,将复杂的HDV汽车跟随动力学表示为高维空间中的线性系统。然后将这种学习到的线性表示嵌入到模型预测控制(MPC)方案中,实现自动驾驶汽车的实时、可扩展和最优控制。我们使用HighD数据集和广泛的数值模拟来验证我们的框架。结果表明,与基线模型相比,AdapKoopnet的轨迹预测精度更高。此外,完整的AdapKoopPC控制器以较低的计算成本显著抑制流量振荡,即使在低CAV渗透率下也表现出强大的性能。该框架提供了一个可扩展和数据驱动的解决方案,以增强现实混合交通环境中的稳定性。代码是公开的
{"title":"Mitigating traffic oscillations in mixed traffic flow with scalable deep Koopman predictive control","authors":"Hao Lyu ,&nbsp;Yanyong Guo ,&nbsp;Pan Liu ,&nbsp;Nan Zheng ,&nbsp;Ting Wang ,&nbsp;Quansheng Yue","doi":"10.1016/j.aei.2025.104258","DOIUrl":"10.1016/j.aei.2025.104258","url":null,"abstract":"<div><div>Mitigating traffic oscillations in mixed flows of connected automated vehicles (CAVs) and human-driven vehicles (HDVs) is critical for enhancing traffic stability. A key challenge lies in modeling the nonlinear, heterogeneous behaviors of HDVs within computationally tractable predictive control frameworks. This study proposes an adaptive deep Koopman predictive control framework (AdapKoopPC) to address this issue. The framework features a novel deep Koopman network, AdapKoopnet, which represents complex HDV car-following dynamics as a linear system in a high-dimensional space by adaptively learning from naturalistic data. This learned linear representation is then embedded into a Model Predictive Control (MPC) scheme, enabling real-time, scalable, and optimal control of CAVs. We validate our framework using the HighD dataset and extensive numerical simulations. Results demonstrate that AdapKoopnet achieves superior trajectory prediction accuracy over baseline models. Furthermore, the complete AdapKoopPC controller significantly dampens traffic oscillations with lower computational cost, exhibiting strong performance even at low CAV penetration rates. The proposed framework offers a scalable and data-driven solution for enhancing stability in realistic mixed traffic environments. The code is made publicly available.<span><span><sup>1</sup></span></span></div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104258"},"PeriodicalIF":9.9,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145842078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TriTopo-LGDM: A reverse design method for trabecular bone scaffolds integrating topology optimization and latent graph diffusion models TriTopo-LGDM:一种结合拓扑优化和潜图扩散模型的骨小梁支架反设计方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-24 DOI: 10.1016/j.aei.2025.104269
Wei Zhang , Kaicheng Yu , Lijie Su , Yifeng Yao , Lihua Lu , Swee Leong Sing
The efficient design of cancellous bone tissue engineering scaffolds that closely replicate both the histological and mechanical characteristics of native cancellous bone remains a significant challenge in the field. The vast design space of porous scaffolds gives rise to a highly complex, nonlinear relationship between geometric morphology, mechanical properties, and osteogenic performance. Currently, the design and in vitro fabrication of scaffolds for cancellous bone defect repair largely depend on expert-driven forward design methods. These approaches are time-consuming, costly, and often lack reproducibility and controllability, limiting their suitability for clinical applications. To overcome these limitations, this study introduces an innovative inverse design framework – TriTopo-LGDM – which combines topological optimization priors with a latent graph diffusion generative model. Built on a triply aligned dataset encompassing structure, physics, and optimization, the framework establishes a scaffold generation pipeline specifically tailored for cancellous bone defect reconstruction. It enables the efficient generation and accurate modeling of multi-scale functional scaffold structures. Experimental evaluations confirm that TriTopo-LGDM establishes a robust bidirectional mapping between topological parameters and target mechanical properties, significantly reducing design time and cost while improving structural consistency and 3D printability. Mechanical testing and finite element simulations further validate the strong mechanical and morphological resemblance of the generated scaffolds to natural cancellous bone. This work presents a generalizable and efficient strategy for the rapid, patient-specific design of implants that promote cancellous bone regeneration.
如何高效地设计出能够复制天然松质骨的组织学和力学特性的松质骨组织工程支架,仍然是该领域的一个重大挑战。多孔支架的巨大设计空间导致其几何形态、力学性能和成骨性能之间存在高度复杂的非线性关系。目前,松质骨缺损修复支架的设计和体外制造在很大程度上依赖于专家驱动的正向设计方法。这些方法耗时、昂贵,而且往往缺乏可重复性和可控性,限制了它们在临床应用中的适用性。为了克服这些限制,本研究引入了一种创新的逆设计框架——TriTopo-LGDM,该框架将拓扑优化先验与潜在图扩散生成模型相结合。该框架建立在包含结构、物理和优化的三重对齐数据集上,建立了专门为松质骨缺损重建量身定制的支架生成管道。它可以实现多尺度功能支架结构的高效生成和精确建模。实验评估证实,TriTopo-LGDM在拓扑参数和目标机械性能之间建立了强大的双向映射,显著减少了设计时间和成本,同时提高了结构一致性和3D打印可打印性。力学测试和有限元模拟进一步验证了所生成的支架与天然松质骨具有很强的力学和形态学相似性。这项工作提出了一个通用的和有效的策略,快速,患者特异性设计的植入物,促进松质骨再生。
{"title":"TriTopo-LGDM: A reverse design method for trabecular bone scaffolds integrating topology optimization and latent graph diffusion models","authors":"Wei Zhang ,&nbsp;Kaicheng Yu ,&nbsp;Lijie Su ,&nbsp;Yifeng Yao ,&nbsp;Lihua Lu ,&nbsp;Swee Leong Sing","doi":"10.1016/j.aei.2025.104269","DOIUrl":"10.1016/j.aei.2025.104269","url":null,"abstract":"<div><div>The efficient design of cancellous bone tissue engineering scaffolds that closely replicate both the histological and mechanical characteristics of native cancellous bone remains a significant challenge in the field. The vast design space of porous scaffolds gives rise to a highly complex, nonlinear relationship between geometric morphology, mechanical properties, and osteogenic performance. Currently, the design and in vitro fabrication of scaffolds for cancellous bone defect repair largely depend on expert-driven forward design methods. These approaches are time-consuming, costly, and often lack reproducibility and controllability, limiting their suitability for clinical applications. To overcome these limitations, this study introduces an innovative inverse design framework – TriTopo-LGDM – which combines topological optimization priors with a latent graph diffusion generative model. Built on a triply aligned dataset encompassing structure, physics, and optimization, the framework establishes a scaffold generation pipeline specifically tailored for cancellous bone defect reconstruction. It enables the efficient generation and accurate modeling of multi-scale functional scaffold structures. Experimental evaluations confirm that TriTopo-LGDM establishes a robust bidirectional mapping between topological parameters and target mechanical properties, significantly reducing design time and cost while improving structural consistency and 3D printability. Mechanical testing and finite element simulations further validate the strong mechanical and morphological resemblance of the generated scaffolds to natural cancellous bone. This work presents a generalizable and efficient strategy for the rapid, patient-specific design of implants that promote cancellous bone regeneration.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104269"},"PeriodicalIF":9.9,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145842079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An effective driven geometry construction method for contour finishing of complex parts via integrating graph neural network and reinforcement learning 将图神经网络与强化学习相结合,提出了一种有效的复杂零件轮廓精加工驱动几何构造方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-24 DOI: 10.1016/j.aei.2025.104281
Rui Huang , Fan Zhang , Shilong Yu , Bo Huang
With the widespread use of computer-aided technologies represented by CAD/CAM/CAPP in product machining, a large amount of process data is continuously generated, and the embedded rich process knowledge provides an enabling means for the data-driven applications in intelligent process planning. At present, the driven geometry construction of the machining operations strongly relies on the experienced process designers to analyze the geometric features of the parts in detail, and this process is not only time-consuming and labor-intensive, but also unable to guarantee the construction of sufficiently optimal driven geometry. To address these limitations, this paper proposes an effective driven geometry construction method for contour finishing of complex parts via integrating graph neural network and reinforcement learning. First, based on the structured process data, a machining tool prediction model based on graph neural network (GNN) is proposed to learn the mapping relationships between machining regions under the contour finishing working step and machining tools. Then, the rules for dividing the machining region to generate the cutting areas that constitute the driven geometry and the constraints for the selection of the cutting area during the generation of the driven geometry are designed to construct an effective driven geometry. Finally, based on the rules and constraints for the generation of the driven geometry, the machining tool and the geometry topology of the machining region, a reinforcement learning method based on bidirectional long short-term memory neural network (Bi-LSTM) is proposed to generate the optimal driven geometry in terms of machining time. The experimental results show that the method can effectively construct the optimal driven geometry for part contour finishing and significantly reduce the machining time.
随着以CAD/CAM/CAPP为代表的计算机辅助技术在产品加工中的广泛应用,不断产生大量的工艺数据,其中所嵌入的丰富的工艺知识为数据驱动在智能工艺规划中的应用提供了使能手段。目前,机械加工操作的驱动几何构造强烈依赖于经验丰富的工艺设计人员对零件几何特征进行详细分析,这一过程不仅费时费力,而且无法保证构造足够优化的驱动几何。针对这些局限性,本文提出了一种结合图神经网络和强化学习的复杂零件轮廓精加工驱动几何构造方法。首先,基于结构化工艺数据,提出了一种基于图神经网络(GNN)的刀具预测模型,学习轮廓精加工工序下加工区域与刀具之间的映射关系;然后,设计了加工区域划分规则以生成构成驱动几何形状的切削区域,并设计了驱动几何形状生成过程中切削区域选择的约束条件,以构建有效的驱动几何形状。最后,基于驱动几何形状、加工刀具和加工区域几何拓扑的生成规则和约束,提出了一种基于双向长短期记忆神经网络(Bi-LSTM)的强化学习方法,在加工时间方面生成最优驱动几何形状。实验结果表明,该方法能有效地构造零件轮廓精加工的最优驱动几何形状,显著缩短加工时间。
{"title":"An effective driven geometry construction method for contour finishing of complex parts via integrating graph neural network and reinforcement learning","authors":"Rui Huang ,&nbsp;Fan Zhang ,&nbsp;Shilong Yu ,&nbsp;Bo Huang","doi":"10.1016/j.aei.2025.104281","DOIUrl":"10.1016/j.aei.2025.104281","url":null,"abstract":"<div><div>With the widespread use of computer-aided technologies represented by CAD/CAM/CAPP in product machining, a large amount of process data is continuously generated, and the embedded rich process knowledge provides an enabling means for the data-driven applications in intelligent process planning. At present, the driven geometry construction of the machining operations strongly relies on the experienced process designers to analyze the geometric features of the parts in detail, and this process is not only time-consuming and labor-intensive, but also unable to guarantee the construction of sufficiently optimal driven geometry. To address these limitations, this paper proposes an effective driven geometry construction method for contour finishing of complex parts via integrating graph neural network and reinforcement learning. First, based on the structured process data, a machining tool prediction model based on graph neural network (GNN) is proposed to learn the mapping relationships between machining regions under the contour finishing working step and machining tools. Then, the rules for dividing the machining region to generate the cutting areas that constitute the driven geometry and the constraints for the selection of the cutting area during the generation of the driven geometry are designed to construct an effective driven geometry. Finally, based on the rules and constraints for the generation of the driven geometry, the machining tool and the geometry topology of the machining region, a reinforcement learning method based on bidirectional long short-term memory neural network (Bi-LSTM) is proposed to generate the optimal driven geometry in terms of machining time. The experimental results show that the method can effectively construct the optimal driven geometry for part contour finishing and significantly reduce the machining time.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104281"},"PeriodicalIF":9.9,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145842080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DF-CoopNet: Cooperative perception via local feature enhancement and global sparse attention DF-CoopNet:基于局部特征增强和全局稀疏关注的协同感知
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-24 DOI: 10.1016/j.aei.2025.104282
Hui Wu , Yu Xiao , Yisheng Chen , Chongcheng Chen , Ruihai Dong , Ding Lin
Cooperative perception technology plays a crucial role in autonomous driving systems by improving safety and enabling real-time decision-making. However, existing LiDAR point cloud processing methods face significant challenges in both local geometric feature extraction and global feature fusion. To address these issues, this paper proposes DF-CoopNet, a cooperative perception framework comprising two core modules: Local Geometry Enhancement (LGE) and Sparse Key Feature Attention (SKFA). The LGE module enhances local geometric representations using a deformable k-nearest neighbor graph structure and adaptive fusion mechanism to effectively detect occluded targets. The SKFA module introduces a hierarchical sparse attention mechanism that balances performance and computational complexity through a Top-k strategy. Extensive experiments on the OPV2V and V2V4Real datasets demonstrate that DF-CoopNet significantly outperforms existing methods while maintaining robust detection performance even with substantially reduced point cloud data, validating its effectiveness for real-world cooperative perception applications.
协同感知技术通过提高安全性和实现实时决策,在自动驾驶系统中发挥着至关重要的作用。然而,现有的激光雷达点云处理方法在局部几何特征提取和全局特征融合方面都面临着重大挑战。为了解决这些问题,本文提出了DF-CoopNet,这是一个由两个核心模块组成的协作感知框架:局部几何增强(LGE)和稀疏关键特征注意(SKFA)。LGE模块利用可变形的k近邻图结构和自适应融合机制增强局部几何表示,有效检测被遮挡目标。SKFA模块引入了一种分层稀疏注意力机制,通过Top-k策略平衡性能和计算复杂性。在OPV2V和V2V4Real数据集上进行的大量实验表明,DF-CoopNet显著优于现有方法,即使在点云数据大幅减少的情况下也能保持稳健的检测性能,验证了其在现实世界协作感知应用中的有效性。
{"title":"DF-CoopNet: Cooperative perception via local feature enhancement and global sparse attention","authors":"Hui Wu ,&nbsp;Yu Xiao ,&nbsp;Yisheng Chen ,&nbsp;Chongcheng Chen ,&nbsp;Ruihai Dong ,&nbsp;Ding Lin","doi":"10.1016/j.aei.2025.104282","DOIUrl":"10.1016/j.aei.2025.104282","url":null,"abstract":"<div><div>Cooperative perception technology plays a crucial role in autonomous driving systems by improving safety and enabling real-time decision-making. However, existing LiDAR point cloud processing methods face significant challenges in both local geometric feature extraction and global feature fusion. To address these issues, this paper proposes DF-CoopNet, a cooperative perception framework comprising two core modules: Local Geometry Enhancement (LGE) and Sparse Key Feature Attention (SKFA). The LGE module enhances local geometric representations using a deformable k-nearest neighbor graph structure and adaptive fusion mechanism to effectively detect occluded targets. The SKFA module introduces a hierarchical sparse attention mechanism that balances performance and computational complexity through a Top-k strategy. Extensive experiments on the OPV2V and V2V4Real datasets demonstrate that DF-CoopNet significantly outperforms existing methods while maintaining robust detection performance even with substantially reduced point cloud data, validating its effectiveness for real-world cooperative perception applications.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104282"},"PeriodicalIF":9.9,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145842081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Co-MixPL: An optimized semi-supervised learning method for tunnel water leakage detection Co-MixPL:一种优化的隧道漏水检测半监督学习方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1016/j.aei.2025.104263
Xujie Long , Jing Teng , Zhiwei Zhu , Shaobo Zhao , Mengyang Pu , Ruifeng Shi , You Lv , Jonathan Li , Guoqing Jing
The complex geometries, environmental variability, and inconsistent imaging conditions in shield tunnel linings pose substantial challenges to water leakage detection. Existing models heavily rely on extensive annotated data from diverse environments to ensure reliable performance across varying scenarios, which incurs significant time and labor costs in data annotation. To alleviate the annotation burden, we propose Co-MixPL, a novel semi-supervised learning approach that integrates labeled data with pseudo-labels generated by the Mixed Pseudo Label (MixPL) strategy to iteratively update the teacher-student models. Specifically, Co-MixPL integrates an additional head into the MixPL framework to enhance the encoder’s discriminative capability and introduces a Soft Regression method to mitigate the inherent localization bias in pseudo-labeling, refining the regression loss of pseudo-labels through adaptive reliability scores. Remarkably, experiments on the public “water leakage” dataset, Mendeley Data V1, demonstrate that Co-MixPL approaches state-of-the-art (SOTA) performance using only one-seventh of the training data and outperforms the SOTA by 2.8 AP with merely one-third of the annotations. These findings highlight the effectiveness of Co-MixPL in delivering superior detection performance with significantly reduced annotations, thus better meeting the practical demands of engineering inspection and maintenance. Codes are available at https://github.com/LXJ010/Co-MixPL.
盾构隧道衬砌复杂的几何形状、环境的可变性和不一致的成像条件,给漏水检测带来了巨大的挑战。现有模型严重依赖于来自不同环境的大量带注释的数据,以确保跨不同场景的可靠性能,这在数据注释方面产生了大量的时间和人工成本。为了减轻标注负担,我们提出了一种新的半监督学习方法Co-MixPL,该方法将标记数据与混合伪标签(MixPL)策略生成的伪标签集成在一起,迭代更新师生模型。具体而言,Co-MixPL在MixPL框架中集成了一个额外的头部,以增强编码器的判别能力,并引入了软回归方法来减轻伪标签中固有的定位偏差,通过自适应可靠性评分来改善伪标签的回归损失。值得注意的是,在公共“漏水”数据集Mendeley数据V1上的实验表明,Co-MixPL仅使用七分之一的训练数据就接近最先进(SOTA)的性能,并且仅使用三分之一的注释就比SOTA高出2.8 AP。这些发现突出了Co-MixPL在显著减少注释的情况下提供卓越检测性能的有效性,从而更好地满足了工程检测和维护的实际需求。代码可在https://github.com/LXJ010/Co-MixPL上获得。
{"title":"Co-MixPL: An optimized semi-supervised learning method for tunnel water leakage detection","authors":"Xujie Long ,&nbsp;Jing Teng ,&nbsp;Zhiwei Zhu ,&nbsp;Shaobo Zhao ,&nbsp;Mengyang Pu ,&nbsp;Ruifeng Shi ,&nbsp;You Lv ,&nbsp;Jonathan Li ,&nbsp;Guoqing Jing","doi":"10.1016/j.aei.2025.104263","DOIUrl":"10.1016/j.aei.2025.104263","url":null,"abstract":"<div><div>The complex geometries, environmental variability, and inconsistent imaging conditions in shield tunnel linings pose substantial challenges to water leakage detection. Existing models heavily rely on extensive annotated data from diverse environments to ensure reliable performance across varying scenarios, which incurs significant time and labor costs in data annotation. To alleviate the annotation burden, we propose Co-MixPL, a novel semi-supervised learning approach that integrates labeled data with pseudo-labels generated by the Mixed Pseudo Label (MixPL) strategy to iteratively update the teacher-student models. Specifically, Co-MixPL integrates an additional head into the MixPL framework to enhance the encoder’s discriminative capability and introduces a Soft Regression method to mitigate the inherent localization bias in pseudo-labeling, refining the regression loss of pseudo-labels through adaptive reliability scores. Remarkably, experiments on the public “water leakage” dataset, Mendeley Data V1, demonstrate that Co-MixPL approaches state-of-the-art (SOTA) performance using only one-seventh of the training data and outperforms the SOTA by 2.8 AP with merely one-third of the annotations. These findings highlight the effectiveness of Co-MixPL in delivering superior detection performance with significantly reduced annotations, thus better meeting the practical demands of engineering inspection and maintenance. Codes are available at <span><span>https://github.com/LXJ010/Co-MixPL</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104263"},"PeriodicalIF":9.9,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GridGAN-TXT: An intelligent approach to partitioning architectural free-form surfaces with text prompts GridGAN-TXT:一种智能的方法,通过文本提示来划分建筑自由形式的表面
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1016/j.aei.2025.104265
Jiang-Jun Hou , Jinyu Lu , Jun Zou , Binglin Lai , Haichen Zhang , Na Li
Architectural free-form surfaces have been increasingly adopted in large-scale public buildings due to their unique and aesthetically appealing forms; the geometric complexity of these surfaces, nevertheless, presents significant challenges for grid partitioning, and a universally efficient method is still lacking. To this end, a novel grid partitioning approach for architectural free-form surfaces, termed GridGAN-TXT, is proposed herein, leveraging Generative Adversarial Networks (GAN), a form of generative artificial intelligence. A distinguishing feature of GridGAN-TXT is its ability to perform grid partitioning without explicit reliance on surface characteristics. Through training data mining and learning, the model autonomously extracts relevant features, enabling automated partitioning of free-form surfaces. Moreover, GridGAN-TXT can simultaneously generate grid structures composed of triangular or quadrilateral elements (via text prompts) — a capability not supported by previous methods, which typically require totally different strategies for each element type. The technical details of GridGAN-TXT are elaborated, and a parametric strategy is proposed for constructing large-scale training datasets. Additionally, a novel grid evaluation metric — similarity evaluation — is introduced to complement the existing geometric evaluation method. The effectiveness and generalizability of GridGAN-TXT are validated through ablation studies, extensive testing, and case analyses. Results demonstrate that GridGAN-TXT exhibits exceptional performance in partitioning grids on architectural free-form surfaces and can flexibly generate grid structures with varying basic elements in response to user text prompts. Such capabilities significantly enhance the efficiency of grid partitioning while simultaneously expanding the range of potential applications, highlighting the method as a strong candidate for practical implementation.
建筑自由曲面因其独特和美观的形式在大型公共建筑中被越来越多地采用;然而,这些表面的几何复杂性对网格划分提出了重大挑战,并且仍然缺乏一种普遍有效的方法。为此,本文提出了一种用于建筑自由曲面的新型网格划分方法,称为GridGAN-TXT,利用生成对抗网络(GAN),一种生成人工智能形式。GridGAN-TXT的一个显著特征是它能够在不明确依赖于表面特征的情况下执行网格划分。该模型通过训练数据挖掘和学习,自主提取相关特征,实现自由曲面的自动划分。此外,GridGAN-TXT可以同时生成由三角形或四边形元素组成的网格结构(通过文本提示)——这是以前的方法所不支持的功能,通常需要对每种元素类型使用完全不同的策略。阐述了GridGAN-TXT的技术细节,提出了一种构造大规模训练数据集的参数化策略。此外,引入了一种新的网格评价指标——相似度评价,以补充现有的几何评价方法。GridGAN-TXT的有效性和普遍性通过消融研究、广泛的测试和案例分析得到验证。结果表明,GridGAN-TXT在建筑自由曲面上划分网格方面表现出优异的性能,并能根据用户文本提示灵活生成具有不同基本元素的网格结构。这种能力大大提高了网格划分的效率,同时扩大了潜在应用的范围,突出了该方法作为实际实现的有力候选。
{"title":"GridGAN-TXT: An intelligent approach to partitioning architectural free-form surfaces with text prompts","authors":"Jiang-Jun Hou ,&nbsp;Jinyu Lu ,&nbsp;Jun Zou ,&nbsp;Binglin Lai ,&nbsp;Haichen Zhang ,&nbsp;Na Li","doi":"10.1016/j.aei.2025.104265","DOIUrl":"10.1016/j.aei.2025.104265","url":null,"abstract":"<div><div>Architectural free-form surfaces have been increasingly adopted in large-scale public buildings due to their unique and aesthetically appealing forms; the geometric complexity of these surfaces, nevertheless, presents significant challenges for grid partitioning, and a universally efficient method is still lacking. To this end, a novel grid partitioning approach for architectural free-form surfaces, termed GridGAN-TXT, is proposed herein, leveraging Generative Adversarial Networks (GAN), a form of generative artificial intelligence. A distinguishing feature of GridGAN-TXT is its ability to perform grid partitioning without explicit reliance on surface characteristics. Through training data mining and learning, the model autonomously extracts relevant features, enabling automated partitioning of free-form surfaces. Moreover, GridGAN-TXT can simultaneously generate grid structures composed of triangular or quadrilateral elements (via text prompts) — a capability not supported by previous methods, which typically require totally different strategies for each element type. The technical details of GridGAN-TXT are elaborated, and a parametric strategy is proposed for constructing large-scale training datasets. Additionally, a novel grid evaluation metric — similarity evaluation — is introduced to complement the existing geometric evaluation method. The effectiveness and generalizability of GridGAN-TXT are validated through ablation studies, extensive testing, and case analyses. Results demonstrate that GridGAN-TXT exhibits exceptional performance in partitioning grids on architectural free-form surfaces and can flexibly generate grid structures with varying basic elements in response to user text prompts. Such capabilities significantly enhance the efficiency of grid partitioning while simultaneously expanding the range of potential applications, highlighting the method as a strong candidate for practical implementation.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104265"},"PeriodicalIF":9.9,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal energy management of buildings using neural network-based thermal prediction and economic model predictive control 基于神经网络热预测和经济模型预测控制的建筑能源优化管理
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1016/j.aei.2025.104278
Zihang Dong , Cheng Hu , Xi Zhang , Yifan Shen , Xiaojun Shen , Jose I. Leon
Heating, ventilation, and air conditioning (HVAC) systems are significant contributors to energy consumption in buildings, directly affecting energy efficiency and occupant comfort. Traditional physics-based temperature modeling and centralized control frameworks often struggle to effectively balance the challenges of scaling across multiple buildings while minimizing operational costs. To address these challenges, this paper proposes a novel data-driven distributed control framework for the economical and resilient operation of the building community. Specifically, the framework employs artificial neural networks (ANNs) to capture multi-time-step nonlinear temperature dynamics, enhancing predictive accuracy for energy management across interconnected buildings. A distributed economic model predictive control (EMPC) strategy is developed, enabling local controllers to coordinate HVAC schedules in each building iteratively. This strategy minimizes energy shortages, optimizes overall community energy costs, and ensures thermal comfort. In addition, by facilitating energy interaction between HVAC systems, distributed energy resources (DERs), and storage units, the framework ensures electrical energy supply and demand balance during power outages. Simulation results demonstrate that the proposed strategy improves cost efficiency, resilience, and multi-step prediction accuracy, outperforming traditional physics-based EMPC approaches in coordination across multiple buildings.
供暖、通风和空调(HVAC)系统是建筑物能源消耗的重要贡献者,直接影响能源效率和居住者的舒适度。传统的基于物理的温度建模和集中控制框架通常难以有效地平衡跨多个建筑物扩展的挑战,同时最大限度地降低运营成本。为了解决这些挑战,本文提出了一种新的数据驱动的分布式控制框架,用于建筑社区的经济和弹性运行。具体来说,该框架采用人工神经网络(ann)来捕获多时间步非线性温度动态,提高了跨互联建筑能源管理的预测精度。提出了一种分布式经济模型预测控制(EMPC)策略,使本地控制器能够迭代地协调各建筑的暖通空调调度。这一策略最大限度地减少了能源短缺,优化了整体社区能源成本,并确保了热舒适性。此外,通过促进HVAC系统、分布式能源(DERs)和存储单元之间的能量交互,该框架确保了停电期间的电力供需平衡。仿真结果表明,该策略提高了成本效率、弹性和多步预测精度,在跨多个建筑物的协调中优于传统的基于物理的EMPC方法。
{"title":"Optimal energy management of buildings using neural network-based thermal prediction and economic model predictive control","authors":"Zihang Dong ,&nbsp;Cheng Hu ,&nbsp;Xi Zhang ,&nbsp;Yifan Shen ,&nbsp;Xiaojun Shen ,&nbsp;Jose I. Leon","doi":"10.1016/j.aei.2025.104278","DOIUrl":"10.1016/j.aei.2025.104278","url":null,"abstract":"<div><div>Heating, ventilation, and air conditioning (HVAC) systems are significant contributors to energy consumption in buildings, directly affecting energy efficiency and occupant comfort. Traditional physics-based temperature modeling and centralized control frameworks often struggle to effectively balance the challenges of scaling across multiple buildings while minimizing operational costs. To address these challenges, this paper proposes a novel data-driven distributed control framework for the economical and resilient operation of the building community. Specifically, the framework employs artificial neural networks (ANNs) to capture multi-time-step nonlinear temperature dynamics, enhancing predictive accuracy for energy management across interconnected buildings. A distributed economic model predictive control (EMPC) strategy is developed, enabling local controllers to coordinate HVAC schedules in each building iteratively. This strategy minimizes energy shortages, optimizes overall community energy costs, and ensures thermal comfort. In addition, by facilitating energy interaction between HVAC systems, distributed energy resources (DERs), and storage units, the framework ensures electrical energy supply and demand balance during power outages. Simulation results demonstrate that the proposed strategy improves cost efficiency, resilience, and multi-step prediction accuracy, outperforming traditional physics-based EMPC approaches in coordination across multiple buildings.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104278"},"PeriodicalIF":9.9,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent diffusion–driven inverse design of damping microstructures with multiaxial nonlinear mechanical targets 多轴非线性机械目标阻尼微结构的潜扩散驱动逆设计
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-22 DOI: 10.1016/j.aei.2025.104256
Tianyang Zhang, Weizhi Xu, Shuguang Wang, Dongsheng Du
This study presents an integrated generative framework for the inverse design of damping microstructures in energy-dissipating steel walls (EDSWs) for seismic applications, establishing a seamless pipeline from large-scale pixel-based datasets to latent-space representation, three-dimensional reconstruction, industrial fabrication, and finite element analysis (FEA) verification. Starting from over 140,000 boundary-identical microstructures, a variational autoencoder-based TopoFormer compresses geometric features into latent codes, enabling over 90% reduction in generation complexity while maintaining high reconstruction fidelity. Representative structures are selected via k-means clustering in the latent space and analyzed through nonlinear FEA under shear and compression to construct a performance-labeled dataset. A conditional latent diffusion transformer (DiT) is then trained to map complete nonlinear mechanical performance curves to manufacturable geometries, thus achieving a one-to-many correspondence between target responses and structural configurations. Comparative evaluations show that the proposed DiT framework surpasses multiple CondUNet baselines, achieving the lowest FID (11.367) and the highest SSIM (0.676) with balanced coverage and precision. Experimental validation using laser-cut low-yield-point steel specimens under low-cycle reciprocating loading demonstrates close agreement between generated and target hysteresis curves, confirming both geometric fidelity and mechanical reliability. The results establish a scalable, high-accuracy, and experimentally validated approach for automated, performance-driven microstructure design, providing a practical pathway for incorporating generative artificial intelligence into the engineering development of next-generation seismic energy-dissipation systems. The related codes are available at https://github.com/AshenOneme/DiT-Based-Microstructures-Design.
本研究提出了一个集成的生成框架,用于地震应用中的耗能钢墙(EDSWs)阻尼微结构的反设计,建立了从大规模基于像素的数据集到潜在空间表示、三维重建、工业制造和有限元分析(FEA)验证的无缝管道。从超过140,000个边界相同的微结构开始,基于变分自编码器的TopoFormer将几何特征压缩为潜在代码,使生成复杂性降低90%以上,同时保持高重建保真度。在潜在空间中通过k-means聚类选择具有代表性的结构,并在剪切和压缩条件下进行非线性有限元分析,构建性能标记数据集。然后训练条件潜在扩散变压器(DiT)将完整的非线性力学性能曲线映射到可制造的几何形状,从而实现目标响应和结构构型之间的一对多对应。对比评估表明,所提出的DiT框架超越了多个CondUNet基线,实现了最低FID(11.367)和最高SSIM(0.676)的平衡覆盖和精度。激光切割低屈服点钢试样在低循环往复加载下的实验验证表明,生成的迟滞曲线与目标曲线非常吻合,证实了几何保真度和机械可靠性。研究结果为自动化、性能驱动的微观结构设计建立了一种可扩展、高精度、经过实验验证的方法,为将生成式人工智能整合到下一代地震耗能系统的工程开发中提供了一条实用途径。相关代码可在https://github.com/AshenOneme/DiT-Based-Microstructures-Design上获得。
{"title":"Latent diffusion–driven inverse design of damping microstructures with multiaxial nonlinear mechanical targets","authors":"Tianyang Zhang,&nbsp;Weizhi Xu,&nbsp;Shuguang Wang,&nbsp;Dongsheng Du","doi":"10.1016/j.aei.2025.104256","DOIUrl":"10.1016/j.aei.2025.104256","url":null,"abstract":"<div><div>This study presents an integrated generative framework for the inverse design of damping microstructures in energy-dissipating steel walls (EDSWs) for seismic applications, establishing a seamless pipeline from large-scale pixel-based datasets to latent-space representation, three-dimensional reconstruction, industrial fabrication, and finite element analysis (FEA) verification. Starting from over 140,000 boundary-identical microstructures, a variational autoencoder-based TopoFormer compresses geometric features into latent codes, enabling over 90% reduction in generation complexity while maintaining high reconstruction fidelity. Representative structures are selected via k-means clustering in the latent space and analyzed through nonlinear FEA under shear and compression to construct a performance-labeled dataset. A conditional latent diffusion transformer (DiT) is then trained to map complete nonlinear mechanical performance curves to manufacturable geometries, thus achieving a one-to-many correspondence between target responses and structural configurations. Comparative evaluations show that the proposed DiT framework surpasses multiple CondUNet baselines, achieving the lowest FID (11.367) and the highest SSIM (0.676) with balanced coverage and precision. Experimental validation using laser-cut low-yield-point steel specimens under low-cycle reciprocating loading demonstrates close agreement between generated and target hysteresis curves, confirming both geometric fidelity and mechanical reliability. The results establish a scalable, high-accuracy, and experimentally validated approach for automated, performance-driven microstructure design, providing a practical pathway for incorporating generative artificial intelligence into the engineering development of next-generation seismic energy-dissipation systems. The related codes are available at <span><span><strong><em>https://github.com/AshenOneme/DiT-Based-Microstructures-Design</em></strong></span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104256"},"PeriodicalIF":9.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid adaptive fault-tolerant control of variable structure non-gaussian stochastic systems with feedback packet loss 具有反馈丢包的变结构非高斯随机系统的混合自适应容错控制
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-22 DOI: 10.1016/j.aei.2025.104232
Kaiyu Hu , Aili Yusup , Panfei Yan
This paper is devoted to designing a hybrid adaptive fault diagnosis (FD) and fault-tolerant control (FTC) scheme for nonlinear non-Gaussian stochastic systems with feedback packet loss and multi-mode variable structure. In order to solve the problem of inaccurate fault estimation caused by packet loss, the packet loss compensation method is proposed for adaptive learning FD. Hence the fault with incipient and large amplitudes are accurately estimated by prey adaptive strategy algorithm. Then, the second-order sliding mode FTC scheme is improved by adaptive harmonic learning functions with variable structure perturbations. Combining the nonlinearity and fault estimated by FD, this hybrid adaptive active FTC achieves the stable fault self-repair for the variable structure nonlinear non-Gaussian systems. Take the papermaking process system as an example, Lyapunov functions proves the stability, the effectiveness and superiority of the scheme are verified by numerical simulation.
针对具有反馈丢包和多模变结构的非线性非高斯随机系统,设计了一种混合自适应故障诊断和容错控制方案。为了解决丢包导致故障估计不准确的问题,提出了一种自适应学习FD的丢包补偿方法。因此,利用猎物自适应策略算法可以准确地估计出早期和振幅较大的故障。然后,利用变结构扰动的自适应谐波学习函数对二阶滑模FTC方案进行改进。结合FD估计的非线性和故障,该混合自适应有源FTC实现了变结构非线性非高斯系统的稳定故障自修复。以造纸工艺系统为例,通过Lyapunov函数验证了该方案的稳定性,并通过数值仿真验证了该方案的有效性和优越性。
{"title":"Hybrid adaptive fault-tolerant control of variable structure non-gaussian stochastic systems with feedback packet loss","authors":"Kaiyu Hu ,&nbsp;Aili Yusup ,&nbsp;Panfei Yan","doi":"10.1016/j.aei.2025.104232","DOIUrl":"10.1016/j.aei.2025.104232","url":null,"abstract":"<div><div>This paper is devoted to designing a hybrid adaptive fault diagnosis (FD) and fault-tolerant control (FTC) scheme for nonlinear non-Gaussian stochastic systems with feedback packet loss and multi-mode variable structure. In order to solve the problem of inaccurate fault estimation caused by packet loss, the packet loss compensation method is proposed for adaptive learning FD. Hence the fault with incipient and large amplitudes are accurately estimated by prey adaptive strategy algorithm. Then, the second-order sliding mode FTC scheme is improved by adaptive harmonic learning functions with variable structure perturbations. Combining the nonlinearity and fault estimated by FD, this hybrid adaptive active FTC achieves the stable fault self-repair for the variable structure nonlinear non-Gaussian systems. Take the papermaking process system as an example, Lyapunov functions proves the stability, the effectiveness and superiority of the scheme are verified by numerical simulation.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104232"},"PeriodicalIF":9.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TFSAF-Net: a hybrid network integrating time-frequency spectral feature enhancement and attention for fault diagnosis of rotating machinery tsaff - net:一种集时频特征增强和关注于一体的旋转机械故障诊断混合网络
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-22 DOI: 10.1016/j.aei.2025.104262
Chuang Liang , Xuelin Mu , Ende Wang , Xiaoguang Zhang , Chengcheng Wang , Yubo Shao
Rotating machinery is the core equipment of the manufacturing industry, and its stability directly determines the operation of industrial systems. As a key component of rotating machinery, the accuracy of bearing fault diagnosis is particularly critical. Recently, deep learning (DL) has achieved remarkable results in mechanical fault diagnosis. However, traditional convolutional neural networks (CNNs) still have obvious limitations, which are difficult to effectively capture high-order nonlinearity features in the time–frequency domain and also show insufficient ability to decouple fault features in strong noise environments, which seriously restricts the improvement of diagnostic accuracy and model interpretability. To address these problems, a hybrid time–frequency spectral feature enhancement and attention fusion network (TFSAF-Net) is proposed. First, a time–frequency spectral feature enhancement module (TFSFEM) is designed. The TFSFEM employs learnable weight parameters to perform quadratic convolution nonlinear transformation on the wavelet time–frequency map of the signal in order to enhance its ability to extract higher-order fault features. Simultaneously, by integrating physical-driven feature decoupling module to extract envelope-related features and to utilize adaptive norm ratio-based feature metrics to improve the distinction between fault features and noise. Moreover, a convolutional multi-scale attention fusion module (CMSAFM) is developed. The CMSAFM introduces efficient multi-scale attention, which achieves precise focusing on key features through grouped feature interactions and adaptive weight allocation. Further, it realizes the effective integration of local details and global time–frequency distribution information through parallel extraction and weighted fusion of multi-scale features. Finally, a self-built engineering application datasets and PU dataset are implemented to validate the effectiveness and superiority of the TFSAF-Net. The experimental results demonstrate that our proposed method effectively captures high-order nonlinear features in the time–frequency domain and achieves efficient separation of fault characteristics from noise in highly noisy environments, and thereby reaches a higher diagnostic accuracy rate. Meanwhile, in terms of TFSAF-Net interpretability and generalization, it provides valuable insights for exploring similar problems within the field.
旋转机械是制造业的核心设备,其稳定性直接决定着工业系统的运行。轴承作为旋转机械的关键部件,其故障诊断的准确性尤为关键。近年来,深度学习在机械故障诊断方面取得了显著的成果。然而,传统的卷积神经网络(cnn)仍然存在明显的局限性,难以有效捕获时频域的高阶非线性特征,并且在强噪声环境下对故障特征的解耦能力不足,严重制约了诊断精度和模型可解释性的提高。为了解决这些问题,提出了一种混合时频特征增强和注意力融合网络(TFSAF-Net)。首先,设计了时域频谱特征增强模块(TFSFEM)。该方法利用可学习的权参数对信号的小波时频图进行二次卷积非线性变换,以增强其提取高阶故障特征的能力。同时,通过集成物理驱动特征解耦模块提取包络相关特征,并利用基于自适应范数比的特征度量提高故障特征与噪声的区分能力。此外,还开发了一个卷积多尺度注意力融合模块(CMSAFM)。CMSAFM引入了高效的多尺度关注,通过分组特征交互和自适应权重分配实现对关键特征的精确聚焦。进一步,通过多尺度特征的并行提取和加权融合,实现局部细节信息与全局时频分布信息的有效融合。最后,以自建的工程应用数据集和PU数据集为例,验证了TFSAF-Net的有效性和优越性。实验结果表明,该方法有效地捕获了时频域的高阶非线性特征,在高噪声环境下实现了故障特征与噪声的有效分离,从而达到了较高的诊断准确率。同时,在TFSAF-Net的可解释性和泛化方面,它为探索该领域的类似问题提供了有价值的见解。
{"title":"TFSAF-Net: a hybrid network integrating time-frequency spectral feature enhancement and attention for fault diagnosis of rotating machinery","authors":"Chuang Liang ,&nbsp;Xuelin Mu ,&nbsp;Ende Wang ,&nbsp;Xiaoguang Zhang ,&nbsp;Chengcheng Wang ,&nbsp;Yubo Shao","doi":"10.1016/j.aei.2025.104262","DOIUrl":"10.1016/j.aei.2025.104262","url":null,"abstract":"<div><div>Rotating machinery is the core equipment of the manufacturing industry, and its stability directly determines the operation of industrial systems. As a key component of rotating machinery, the accuracy of bearing fault diagnosis is particularly critical. Recently, deep learning (DL) has achieved remarkable results in mechanical fault diagnosis. However, traditional convolutional neural networks (CNNs) still have obvious limitations, which are difficult to effectively capture high-order nonlinearity features in the time–frequency domain and also show insufficient ability to decouple fault features in strong noise environments, which seriously restricts the improvement of diagnostic accuracy and model interpretability. To address these problems, a hybrid time–frequency spectral feature enhancement and attention fusion network (TFSAF-Net) is proposed. First, a time–frequency spectral feature enhancement module (TFSFEM) is designed. The TFSFEM employs learnable weight parameters to perform quadratic convolution nonlinear transformation on the wavelet time–frequency map of the signal in order to enhance its ability to extract higher-order fault features. Simultaneously, by integrating physical-driven feature decoupling module to extract envelope-related features and to utilize adaptive norm ratio-based feature metrics to improve the distinction between fault features and noise. Moreover, a convolutional multi-scale attention fusion module (CMSAFM) is developed. The CMSAFM introduces efficient multi-scale attention, which achieves precise focusing on key features through grouped feature interactions and adaptive weight allocation. Further, it realizes the effective integration of local details and global time–frequency distribution information through parallel extraction and weighted fusion of multi-scale features. Finally, a self-built engineering application datasets and PU dataset are implemented to validate the effectiveness and superiority of the TFSAF-Net. The experimental results demonstrate that our proposed method effectively captures high-order nonlinear features in the time–frequency domain and achieves efficient separation of fault characteristics from noise in highly noisy environments, and thereby reaches a higher diagnostic accuracy rate. Meanwhile, in terms of TFSAF-Net interpretability and generalization, it provides valuable insights for exploring similar problems within the field.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104262"},"PeriodicalIF":9.9,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Advanced Engineering Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1