首页 > 最新文献

Advanced Engineering Informatics最新文献

英文 中文
Intelligent compaction assessment of coarse-grained subgrade using contact-type acoustic wave detection with few-shot learning in complex sound fields 复杂声场中基于接触型声波检测的少弹学习粗粒路基智能压实评估
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2025-12-15 DOI: 10.1016/j.aei.2025.104223
Fan Ye , Naifu Deng , Chuping Wu , Jianjiang Peng , Wang Guo , Shuangxi Cao , Qinglong Zhang
Conventional methods used for compaction quality control and acceptance depend on random sampling for density validation and risk generating unrepresentative results and systematic errors. Intelligent compaction (IC) enhances highway subgrade evaluation but has limitations in terms of accuracy and availability of labeled data. Although acoustic indexes are effective for characterizing coarse-grained soils, their application is constrained by complex sound fields with significant air attenuation and multi-source noise interference. To address these challenges, this study presents a novel intelligent compaction assessment that integrates contact–type acoustic compaction model, a newly defined contact-type sound compaction value (CSCV), and few-shot intelligent assessment with uncertainty quantification. The proposed approach not only overcomes the weakening and extraction difficulties of effective acoustic signals but also enables reliable model training under limited labeled samples. A case study conducted on the Chenglong project in China reveals that intelligent assessment results highly correlate with actual values for weathered slate and gravelly soil, with maximum absolute errors of 1.2516 mm and 2.11 % respectively. Integrating this method into the IC system enhances highway quality and the promotion of IC technology.
用于压实质量控制和验收的传统方法依赖于密度验证的随机抽样,并且存在产生不具代表性结果和系统误差的风险。智能压实(IC)增强了公路路基评价,但在标记数据的准确性和可用性方面存在局限性。虽然声学指标是表征粗粒土的有效方法,但其应用受到复杂声场、空气衰减明显和多源噪声干扰的限制。为了解决这些问题,本研究提出了一种新的智能压实评估方法,该方法将接触型声压实模型、新定义的接触型声压实值(CSCV)和带有不确定度量化的少射智能评估相结合。该方法不仅克服了有效声信号的弱化和提取困难,而且能够在有限的标记样本下进行可靠的模型训练。以成都工程为例,智能评价结果与风化板岩和砾质土的实际值高度相关,最大绝对误差分别为1.2516 mm和2.11%。将该方法与集成电路系统相结合,可以提高公路质量,促进集成电路技术的发展。
{"title":"Intelligent compaction assessment of coarse-grained subgrade using contact-type acoustic wave detection with few-shot learning in complex sound fields","authors":"Fan Ye ,&nbsp;Naifu Deng ,&nbsp;Chuping Wu ,&nbsp;Jianjiang Peng ,&nbsp;Wang Guo ,&nbsp;Shuangxi Cao ,&nbsp;Qinglong Zhang","doi":"10.1016/j.aei.2025.104223","DOIUrl":"10.1016/j.aei.2025.104223","url":null,"abstract":"<div><div>Conventional methods used for compaction quality control and acceptance depend on random sampling for density validation and risk generating unrepresentative results and systematic errors. Intelligent compaction (IC) enhances highway subgrade evaluation but has limitations in terms of accuracy and availability of labeled data. Although acoustic indexes are effective for characterizing coarse-grained soils, their application is constrained by complex sound fields with significant air attenuation and multi-source noise interference. To address these challenges, this study presents a novel intelligent compaction assessment that integrates contact–type acoustic compaction model, a newly defined contact-type sound compaction value (CSCV), and few-shot intelligent assessment with uncertainty quantification. The proposed approach not only overcomes the weakening and extraction difficulties of effective acoustic signals but also enables reliable model training under limited labeled samples. A case study conducted on the Chenglong project in China reveals that intelligent assessment results highly correlate with actual values for weathered slate and gravelly soil, with maximum absolute errors of 1.2516 mm and 2.11 % respectively. Integrating this method into the IC system enhances highway quality and the promotion of IC technology.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104223"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145792165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ball tree structure-informed phase space warping: a robust algorithm for dynamic degradation tracking under variable speed conditions 基于球树结构的相空间翘曲:变速条件下动态退化跟踪的鲁棒算法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2025-12-30 DOI: 10.1016/j.aei.2025.104288
Rui Yuan , Hengyu Liu , Yong Lv , Yuejian Chen , Xingkai Yang , Hewenxuan Li , David Chelidze
With the advancement of predictive maintenance strategies, the accuracy of degradation tracking in mechanical systems has become a growing concern. This paper proposes a novel ball tree structure-informed phase space warping (BTPSW) algorithm, which couples high-dimensional nonlinear dynamics with efficient geometric search strategies to robustly track bearing degradation. To tackle the challenges of high-dimensional data and uneven distribution of points in the reconstructed phase space (PS), a physics-informed dynamic model is constructed to simulate outer race crack evolution under speed fluctuations. The resulting vibration signals are then reconstructed into high-dimensional PS, where trajectory curvature serves as a degradation indicator. The BTPSW algorithm reduces overlap in high-dimensional spaces, improving data search efficiency. Furthermore, considering the fluctuations in the optimal reconstruction parameters, the BTPSW algorithm demonstrates enhanced data adaptability, mitigating the accuracy loss caused by parameter fluctuations. By constructing a simulation model of rolling bearing degradation to simulate the crack propagation in the outer race, the paper validates the application of the BTPSW algorithm in tracking crack degradation. Both simulation and accelerated degradation experiments confirm that BTPSW achieves high tracking accuracy, strong parameter robustness, and superior adaptability under fluctuating operating conditions, making it a powerful tool for predictive maintenance and long-term reliability assessment.
随着预测性维护策略的发展,机械系统退化跟踪的准确性日益受到关注。提出了一种基于球树结构的相空间翘曲(BTPSW)算法,该算法将高维非线性动力学与高效几何搜索策略相结合,实现了对轴承退化的鲁棒跟踪。针对重构相空间(PS)中数据高维和点分布不均匀的问题,建立了基于物理信息的外圈裂纹动态模型,模拟了速度波动下的外圈裂纹演化过程。然后将得到的振动信号重构为高维PS,其中轨迹曲率作为退化指标。BTPSW算法减少了高维空间的重叠,提高了数据搜索效率。此外,考虑到最优重构参数的波动,BTPSW算法表现出更强的数据适应性,减轻了参数波动带来的精度损失。通过构建滚动轴承退化仿真模型,模拟外圈裂纹扩展过程,验证了BTPSW算法在裂纹退化跟踪中的应用。仿真和加速退化实验均证实,BTPSW具有较高的跟踪精度、较强的参数鲁棒性和对波动工况的优越适应性,是预测性维护和长期可靠性评估的有力工具。
{"title":"Ball tree structure-informed phase space warping: a robust algorithm for dynamic degradation tracking under variable speed conditions","authors":"Rui Yuan ,&nbsp;Hengyu Liu ,&nbsp;Yong Lv ,&nbsp;Yuejian Chen ,&nbsp;Xingkai Yang ,&nbsp;Hewenxuan Li ,&nbsp;David Chelidze","doi":"10.1016/j.aei.2025.104288","DOIUrl":"10.1016/j.aei.2025.104288","url":null,"abstract":"<div><div>With the advancement of predictive maintenance strategies, the accuracy of degradation tracking in mechanical systems has become a growing concern. This paper proposes a novel ball tree structure-informed phase space warping (BTPSW) algorithm, which couples high-dimensional nonlinear dynamics with efficient geometric search strategies to robustly track bearing degradation. To tackle the challenges of high-dimensional data and uneven distribution of points in the reconstructed phase space (PS), a physics-informed dynamic model is constructed to simulate outer race crack evolution under speed fluctuations. The resulting vibration signals are then reconstructed into high-dimensional PS, where trajectory curvature serves as a degradation indicator. The BTPSW algorithm reduces overlap in high-dimensional spaces, improving data search efficiency. Furthermore, considering the fluctuations in the optimal reconstruction parameters, the BTPSW algorithm demonstrates enhanced data adaptability, mitigating the accuracy loss caused by parameter fluctuations. By constructing a simulation model of rolling bearing degradation to simulate the crack propagation in the outer race, the paper validates the application of the BTPSW algorithm in tracking crack degradation. Both simulation and accelerated degradation experiments confirm that BTPSW achieves high tracking accuracy, strong parameter robustness, and superior adaptability under fluctuating operating conditions, making it a powerful tool for predictive maintenance and long-term reliability assessment.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104288"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault-tolerant effective wind speed estimation in wind farms via EKF and deep learning fusion 基于EKF和深度学习融合的风电场容错有效风速估计
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-03 DOI: 10.1016/j.aei.2025.104306
Seyyede Marzieh Mousavi , Sayyed Majid Esmailifar , Horst Schulte
This study aims to identify the most suitable technique for computing the effective wind speed (EWS) in wind farms. An Extended Kalman Filter (EKF), combined with measurements of the blade pitch angle, generator torque, and rotor speed, is employed to estimate the EWS of individual turbines accurately. Furthermore, information extracted from the turbine wake within the wind farm can be utilized to enhance estimation accuracy and improve robustness against potential faults and failures. For this purpose, the parametric Jensen wake model is adopted due to its capability for real-time analysis across different wake conditions. To integrate these sources of information, a deep neural network model (CNN-LSTM) is developed to fuse the EKF-based estimates with those derived from the wake model. The proposed method is validated using WFSim simulations of a wind farm comprising two turbines. Results show that the CNN-LSTM model outperforms the individual approaches, improving accuracy by about 40% while maintaining robustness under faulty data. In summary, simulations indicate that while the EKF alone provides the most accurate EWS estimates under fault-free conditions, a fusion with the parametric wake model ensures reliable and precise estimation in the presence of faults.
本研究旨在确定计算风电场有效风速(EWS)的最合适技术。结合叶片俯仰角、发电机转矩和转子转速的测量,采用扩展卡尔曼滤波(EKF)对单个涡轮的EWS进行了精确估计。此外,从风电场内的涡轮尾迹中提取的信息可以用来提高估计精度,并提高对潜在故障和失效的鲁棒性。为此,我们采用了参数化Jensen尾流模型,该模型具有跨不同尾流条件实时分析的能力。为了整合这些信息来源,我们开发了一个深度神经网络模型(CNN-LSTM)来融合基于ekf的估计和来自尾流模型的估计。采用WFSim对一个由两台涡轮机组成的风电场进行了仿真验证。结果表明,CNN-LSTM模型优于单个方法,在保持故障数据鲁棒性的同时,准确率提高了约40%。综上所述,仿真结果表明,在无故障条件下,单独使用EKF可以提供最准确的EWS估计,而在存在故障的情况下,与参数尾迹模型的融合可以确保可靠和精确的估计。
{"title":"Fault-tolerant effective wind speed estimation in wind farms via EKF and deep learning fusion","authors":"Seyyede Marzieh Mousavi ,&nbsp;Sayyed Majid Esmailifar ,&nbsp;Horst Schulte","doi":"10.1016/j.aei.2025.104306","DOIUrl":"10.1016/j.aei.2025.104306","url":null,"abstract":"<div><div>This study aims to identify the most suitable technique for computing the effective wind speed (EWS) in wind farms. An Extended Kalman Filter (EKF), combined with measurements of the blade pitch angle, generator torque, and rotor speed, is employed to estimate the EWS of individual turbines accurately. Furthermore, information extracted from the turbine wake within the wind farm can be utilized to enhance estimation accuracy and improve robustness against potential faults and failures. For this purpose, the parametric Jensen wake model is adopted due to its capability for real-time analysis across different wake conditions. To integrate these sources of information, a deep neural network model (CNN-LSTM) is developed to fuse the EKF-based estimates with those derived from the wake model. The proposed method is validated using WFSim simulations of a wind farm comprising two turbines. Results show that the CNN-LSTM model outperforms the individual approaches, improving accuracy by about 40% while maintaining robustness under faulty data. In summary, simulations indicate that while the EKF alone provides the most accurate EWS estimates under fault-free conditions, a fusion with the parametric wake model ensures reliable and precise estimation in the presence of faults.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104306"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large language models enable semantic-guided hierarchical games for intelligent battery coordination 大型语言模型支持语义引导的分层游戏,用于智能电池协调
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-08 DOI: 10.1016/j.aei.2026.104312
Yuntao Zou , Zihui Lin , Qianqi Zhang , Zhichun Liu , Zeling Xu
The battery energy consumption system of lunar exploration rovers, as mission-critical equipment, confronts severe challenges under extreme environmental constraints. However, existing modeling methods face fundamental dilemmas: dynamic uncertainty leads to highly ambiguous constraint boundaries, making it difficult for traditional mathematical languages to describe complex coupling relationships; even when mathematical representations are constructed, high-dimensional nonlinear optimization problems become computationally intractable, with existing algorithms unable to address complexity barriers and lacking interpretability. In response to these challenges, this paper innovatively proposes a hierarchical Stackelberg game optimization framework based on semantic embedding. This framework transcends traditional optimization paradigms by deeply integrating the cognitive intelligence of large language models with the mathematical precision of game theory: large language models acknowledge that overall behavior cannot be predicted from simple combinations of parts, processing fuzzy constraints and cross-domain knowledge integration through semantic understanding; the hierarchical structure of Stackelberg games naturally adapts to the hierarchical decision-making requirements of battery allocation, with multi-agent game frameworks effectively handling coordination and competition relationships between batteries. Through semantic embedding technology, natural language constraints are automatically transformed into mathematical objects comprehensible to game participants, with cognitive intelligence handling the “incomputable” complexity components while game theory ensures “provable” mathematical convergence, synergistically achieving the important paradigm transition from “perfect rationality” to “bounded rationality,” thereby providing a theoretically rigorous and practically viable unified solution for intelligent decision-making in mission-critical systems.
月球探测车电池能耗系统作为关键任务设备,在极端环境约束下面临严峻挑战。然而,现有的建模方法面临着根本性的困境:动态不确定性导致约束边界高度模糊,使得传统数学语言难以描述复杂的耦合关系;即使构建了数学表示,高维非线性优化问题在计算上也变得难以处理,现有算法无法解决复杂性障碍且缺乏可解释性。针对这些挑战,本文创新性地提出了一种基于语义嵌入的分层Stackelberg博弈优化框架。该框架超越了传统的优化范式,将大型语言模型的认知智能与博弈论的数学精度深度融合在一起:大型语言模型承认,整体行为不能通过简单的部件组合、模糊约束的处理以及通过语义理解进行跨领域知识整合来预测;Stackelberg博弈的分层结构自然适应了电池分配的分层决策要求,多智能体博弈框架有效处理了电池之间的协调与竞争关系。通过语义嵌入技术,自然语言约束自动转化为博弈参与者可以理解的数学对象,认知智能处理“不可计算”的复杂性成分,博弈论确保“可证明”的数学收敛,协同实现从“完全理性”到“有限理性”的重要范式转换。从而为关键任务系统的智能决策提供了理论严谨、实践可行的统一解决方案。
{"title":"Large language models enable semantic-guided hierarchical games for intelligent battery coordination","authors":"Yuntao Zou ,&nbsp;Zihui Lin ,&nbsp;Qianqi Zhang ,&nbsp;Zhichun Liu ,&nbsp;Zeling Xu","doi":"10.1016/j.aei.2026.104312","DOIUrl":"10.1016/j.aei.2026.104312","url":null,"abstract":"<div><div>The battery energy consumption system of lunar exploration rovers, as mission-critical equipment, confronts severe challenges under extreme environmental constraints. However, existing modeling methods face fundamental dilemmas: dynamic uncertainty leads to highly ambiguous constraint boundaries, making it difficult for traditional mathematical languages to describe complex coupling relationships; even when mathematical representations are constructed, high-dimensional nonlinear optimization problems become computationally intractable, with existing algorithms unable to address complexity barriers and lacking interpretability. In response to these challenges, this paper innovatively proposes a hierarchical Stackelberg game optimization framework based on semantic embedding. This framework transcends traditional optimization paradigms by deeply integrating the cognitive intelligence of large language models with the mathematical precision of game theory: large language models acknowledge that overall behavior cannot be predicted from simple combinations of parts, processing fuzzy constraints and cross-domain knowledge integration through semantic understanding; the hierarchical structure of Stackelberg games naturally adapts to the hierarchical decision-making requirements of battery allocation, with multi-agent game frameworks effectively handling coordination and competition relationships between batteries. Through semantic embedding technology, natural language constraints are automatically transformed into mathematical objects comprehensible to game participants, with cognitive intelligence handling the “incomputable” complexity components while game theory ensures “provable” mathematical convergence, synergistically achieving the important paradigm transition from “perfect rationality” to “bounded rationality,” thereby providing a theoretically rigorous and practically viable unified solution for intelligent decision-making in mission-critical systems.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104312"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large language model-empowered dynamic scheduling for intelligent hybrid flow shop using multi-agent deep reinforcement learning 基于多智能体深度强化学习的大语言模型智能混合流水车间动态调度
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-06 DOI: 10.1016/j.aei.2025.104294
Wenbin Gu , Yushang Cao , Yuxin Li , Nuandong Li , Lei Wang , Na Tang , Minghai Yuan , Fengque Pei
With the emergence of personalized and small-batch production modes, multi-agent manufacturing systems (MAMS) have become a research hotspot for intelligent workshop owing to their self‑organizing capabilities. The hybrid flow shop scheduling problem with unrelated parallel machines (HFSP-UPM) presents significant decision-making challenges due to its heterogeneous resources and dynamic environment. Meanwhile, multi-agent deep reinforcement learning (MADRL) is a prevalent method for addressing complex decision‑making problems. Therefore, this paper proposes a pre-trained large language model (LLM) empowered MADRL method for HFSP-UPM considering stage-wise coordination to minimize the makespan. Specifically, a novel MAMS is developed first, where each processing stage is modeled as an agent to enable high autonomy and reduce decision dimensionality. Then, a multi-agent collaborative scheduling framework based on the centralized training with decentralized execution paradigm (CTDE) is proposed, and the communication mechanism among agents is proposed to promote coordination and collaboration. Through structured prompt engineering, an LLM empowered state space and action selection are designed to enhance semantic understanding and policy updates. Finally, the LLM empowered multi-agent proximal policy optimization (LLM-MAPPO) is employed to train the scheduling model. Experimental results on 330 instances show the superiority of the proposed method over scheduling rules, genetic programming (GP) rules, several advanced DRL-based methods, as well as the baseline MAPPO, achieving over 8% performance improvement in most instances. Furthermore, the generalization experiment demonstrates that the proposed method has self-adjustment capability in response to production scenario changes, and an example verification is provided to verify the proposed method and the experiment platform.
随着个性化和小批量生产模式的出现,多智能体制造系统(MAMS)由于具有自组织能力而成为智能车间的研究热点。不相关并行机混合流水车间调度问题由于其资源的异构性和环境的动态性,给决策提出了很大的挑战。与此同时,多智能体深度强化学习(MADRL)是解决复杂决策问题的一种流行方法。因此,本文提出了一种用于HFSP-UPM的预训练大语言模型(LLM)授权MADRL方法,考虑阶段协调以最小化完工时间。具体而言,首先开发了一种新的MAMS,其中每个处理阶段都建模为一个代理,以实现高度自治并降低决策维度。然后,提出了一种基于集中训练分散执行范式(CTDE)的多智能体协同调度框架,并提出了智能体之间的通信机制,以促进协调与协作。通过结构化的提示工程,设计了一个LLM授权的状态空间和动作选择,以增强语义理解和策略更新。最后,利用基于LLM的多智能体近端策略优化(LLM- mappo)对调度模型进行训练。330个实例的实验结果表明,该方法优于调度规则、遗传规划(GP)规则、几种先进的基于drl的方法以及基线MAPPO,在大多数实例中性能提高了8%以上。推广实验表明,该方法具有响应生产场景变化的自适应能力,并通过实例验证了所提方法和实验平台的有效性。
{"title":"Large language model-empowered dynamic scheduling for intelligent hybrid flow shop using multi-agent deep reinforcement learning","authors":"Wenbin Gu ,&nbsp;Yushang Cao ,&nbsp;Yuxin Li ,&nbsp;Nuandong Li ,&nbsp;Lei Wang ,&nbsp;Na Tang ,&nbsp;Minghai Yuan ,&nbsp;Fengque Pei","doi":"10.1016/j.aei.2025.104294","DOIUrl":"10.1016/j.aei.2025.104294","url":null,"abstract":"<div><div>With the emergence of personalized and small-batch production modes, multi-agent manufacturing systems (MAMS) have become a research hotspot for intelligent workshop owing to their self‑organizing capabilities. The hybrid flow shop scheduling problem with unrelated parallel machines (HFSP-UPM) presents significant decision-making challenges due to its heterogeneous resources and dynamic environment. Meanwhile, multi-agent deep reinforcement learning (MADRL) is a prevalent method for addressing complex decision‑making problems. Therefore, this paper proposes a pre-trained large language model (LLM) empowered MADRL method for HFSP-UPM considering stage-wise coordination to minimize the makespan. Specifically, a novel MAMS is developed first, where each processing stage is modeled as an agent to enable high autonomy and reduce decision dimensionality. Then, a multi-agent collaborative scheduling framework based on the centralized training with decentralized execution paradigm (CTDE) is proposed, and the communication mechanism among agents is proposed to promote coordination and collaboration. Through structured prompt engineering, an LLM empowered state space and action selection are designed to enhance semantic understanding and policy updates. Finally, the LLM empowered multi-agent proximal policy optimization (LLM-MAPPO) is employed to train the scheduling model. Experimental results on 330 instances show the superiority of the proposed method over scheduling rules, genetic programming (GP) rules, several advanced DRL-based methods, as well as the baseline MAPPO, achieving over 8% performance improvement in most instances. Furthermore, the generalization experiment demonstrates that the proposed method has self-adjustment capability in response to production scenario changes, and an example verification is provided to verify the proposed method and the experiment platform.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104294"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CrackDualMamba: A lightweight dual-stream Mamba with novel focal dice balanced loss for vehicle-based road crack segmentation CrackDualMamba:一种轻量级的双流Mamba,具有新颖的焦片平衡损失,用于基于车辆的道路裂缝分割
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-02-09 DOI: 10.1016/j.aei.2026.104427
Chenrui Bai , Wenzhong Shi , Min Zhang , Huilin Zhao
The escalating environmental pressures and traffic loads make the emergence of road defects inevitable. Road networks stretching over thousands of kilometers pose challenges to road maintenance work. Vehicle-based road fine-grained defect segmentation task becomes one of the effective supports for large-scale, periodic, and high-precision road maintenance work. However, the task faces challenges in practical applications, such as limited computational resources of mobile computing platforms, cluttered background interference and an extremely small region of interest. Therefore, to address the above challenges and meet the application requirements of large-scale and high-precision maintenance tasks for complex urban road scenes, this study proposes a lightweight dual-stream Mamba, called CrackDualMamba. It consists of (i) a dual-stream encoder, named DualMamba, which is designed to enhance detail awareness while maintaining computational efficiency by combining the complementary strengths of advanced DeepCrack and Mamba architectures; (ii) a skip-layer error ablation module (SEAM), which is introduced to improve cross-scale feature fusion between encoder and decoder outputs. In addition, a novel FDB loss is proposed to address the sample imbalance and small region of interest segmentation challenges inherent in the vehicle-based road fine-grained defect segmentation task. The evaluation on public road segmentation benchmark datasets (i.e., Edmcrack600 and CrackTree260) confirms the superior performance of the proposed network compared to ten established state-of-the-art models. In conclusion, our research not only provides a new solution in theory, but also exhibits potential wide applications. The lightweight design enables it to run efficiently on mobile platforms, providing new technical support for road maintenance and management.
不断增加的环境压力和交通负荷使得道路缺陷的出现在所难免。绵延数千公里的道路网络给道路养护工作带来了挑战。基于车辆的道路细粒度缺陷分割任务成为大规模、周期性、高精度道路养护工作的有效支撑之一。然而,该任务在实际应用中面临着挑战,例如移动计算平台的计算资源有限,背景干扰混乱以及兴趣区域极小。因此,为了应对上述挑战,满足复杂城市道路场景下大规模高精度维护任务的应用需求,本研究提出了一种轻型双流曼巴,称为CrackDualMamba。它包括(i)双流编码器,名为DualMamba,旨在通过结合先进的DeepCrack和Mamba架构的互补优势,增强细节意识,同时保持计算效率;(ii)引入跳跃层误差消融模块(SEAM),以改善编码器和解码器输出之间的跨尺度特征融合。此外,针对基于车辆的道路细粒度缺陷分割任务中存在的样本不平衡和小兴趣区域分割问题,提出了一种新的FDB损失算法。对公共道路分割基准数据集(即Edmcrack600和CrackTree260)的评估证实,与10个已建立的最先进模型相比,所提出的网络具有优越的性能。总之,我们的研究不仅在理论上提供了新的解决方案,而且具有广阔的应用前景。轻量化设计使其能够在移动平台上高效运行,为道路养护管理提供新的技术支持。
{"title":"CrackDualMamba: A lightweight dual-stream Mamba with novel focal dice balanced loss for vehicle-based road crack segmentation","authors":"Chenrui Bai ,&nbsp;Wenzhong Shi ,&nbsp;Min Zhang ,&nbsp;Huilin Zhao","doi":"10.1016/j.aei.2026.104427","DOIUrl":"10.1016/j.aei.2026.104427","url":null,"abstract":"<div><div>The escalating environmental pressures and traffic loads make the emergence of road defects inevitable. Road networks stretching over thousands of kilometers pose challenges to road maintenance work. Vehicle-based road fine-grained defect segmentation task becomes one of the effective supports for large-scale, periodic, and high-precision road maintenance work. However, the task faces challenges in practical applications, such as limited computational resources of mobile computing platforms, cluttered background interference and an extremely small region of interest. Therefore, to address the above challenges and meet the application requirements of large-scale and high-precision maintenance tasks for complex urban road scenes, this study proposes a lightweight dual-stream Mamba, called CrackDualMamba. It consists of (i) a dual-stream encoder, named DualMamba, which is designed to enhance detail awareness while maintaining computational efficiency by combining the complementary strengths of advanced DeepCrack and Mamba architectures; (ii) a skip-layer error ablation module (SEAM), which is introduced to improve cross-scale feature fusion between encoder and decoder outputs. In addition, a novel FDB loss is proposed to address the sample imbalance and small region of interest segmentation challenges inherent in the vehicle-based road fine-grained defect segmentation task. The evaluation on public road segmentation benchmark datasets (i.e., Edmcrack600 and CrackTree260) confirms the superior performance of the proposed network compared to ten established state-of-the-art models. In conclusion, our research not only provides a new solution in theory, but also exhibits potential wide applications. The lightweight design enables it to run efficiently on mobile platforms, providing new technical support for road maintenance and management.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104427"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An all-in-one performance prediction model for pavement management engineering based on Bayesian Neural Network 基于贝叶斯神经网络的路面管理工程综合性能预测模型
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-02-09 DOI: 10.1016/j.aei.2026.104413
Tianqing Hei, Zheng Tong, Zhiwei Xie, Tao Ma
The maintenance history data contains knowledge on the variation patterns of pavement performance indices, the uncertainty of these variation patterns, and the boundary of normal pavement performance indices in pavement management engineering. Such knowledge is commonly mined by pavement performance prediction models to obtain interpretable representations. At present, due to task-specific model development, models used for pavement performance index prediction and maintenance plan decision-making lack interoperability. This also means that existing pavement performance prediction models are almost unusable for the task of detecting anomalies in indices, which further increases the workload of scheme development and model transfer across road networks in pavement management engineering. In addition, current models for maintenance decision-making rely on a fixed representation of data uncertainty. However, real-world data uncertainty is not fixed. Therefore, these models still present certain limitations. To address these issues, this study leverages the modeling capability of Bayesian neural networks to develop an upstream all-in-one model, namely the Bayesian Neural Network for Pavement Performance Prediction (BNN4Pav). This model enables a single architecture to generate task-specific outputs for three distinct tasks, thereby reducing the model development effort from three categories of models to a single model category. Extensible downstream models are further constructed for each of these tasks, and the upstream–downstream framework is validated using 460 km of maintenance history data from Anhui, Zhejiang, and Jiangsu provinces in China. The analysis results demonstrate that in the multi-index prediction task with uncertainty quantification, a 66.7% reduction in time consumption is achieved. In the anomaly detection task where anomalous data are completely detected, the manual workload for normal data can be reduced by approximately 70%–90%. In maintenance decision-making tasks, the BNN4Pav-based method achieves a 6%–17% improvement in maintenance effectiveness over existing methods, without compromising decision-making requirements.
养护历史数据包含了路面性能指标的变化规律、变化规律的不确定性以及路面管理工程中正常路面性能指标的边界等知识。这些知识通常由路面性能预测模型挖掘,以获得可解释的表示。目前,用于路面性能指标预测和养护计划决策的模型由于任务化模型的开发,缺乏互操作性。这也意味着现有的路面性能预测模型几乎无法用于检测指标异常的任务,这进一步增加了路面管理工程中方案制定和模型跨路网转移的工作量。此外,当前的维护决策模型依赖于数据不确定性的固定表示。然而,实际数据的不确定性并不是固定的。因此,这些模型仍然存在一定的局限性。为了解决这些问题,本研究利用贝叶斯神经网络的建模能力开发了一个上游一体化模型,即贝叶斯神经网络路面性能预测(BNN4Pav)。该模型使单个体系结构能够为三个不同的任务生成特定于任务的输出,从而将模型开发工作从三个模型类别减少到单个模型类别。针对这些任务进一步构建了可扩展的下游模型,并使用来自中国安徽、浙江和江苏等省460公里的维护历史数据验证了上下游框架。分析结果表明,在不确定性量化的多指标预测任务中,时间消耗减少了66.7%。在完全检测到异常数据的异常检测任务中,正常数据的人工工作量可减少约70%-90%。在维护决策任务中,基于bnn4pav的方法在不影响决策要求的情况下,比现有方法的维护效率提高了6%-17%。
{"title":"An all-in-one performance prediction model for pavement management engineering based on Bayesian Neural Network","authors":"Tianqing Hei,&nbsp;Zheng Tong,&nbsp;Zhiwei Xie,&nbsp;Tao Ma","doi":"10.1016/j.aei.2026.104413","DOIUrl":"10.1016/j.aei.2026.104413","url":null,"abstract":"<div><div>The maintenance history data contains knowledge on the variation patterns of pavement performance indices, the uncertainty of these variation patterns, and the boundary of normal pavement performance indices in pavement management engineering. Such knowledge is commonly mined by pavement performance prediction models to obtain interpretable representations. At present, due to task-specific model development, models used for pavement performance index prediction and maintenance plan decision-making lack interoperability. This also means that existing pavement performance prediction models are almost unusable for the task of detecting anomalies in indices, which further increases the workload of scheme development and model transfer across road networks in pavement management engineering. In addition, current models for maintenance decision-making rely on a fixed representation of data uncertainty. However, real-world data uncertainty is not fixed. Therefore, these models still present certain limitations. To address these issues, this study leverages the modeling capability of Bayesian neural networks to develop an upstream all-in-one model, namely the Bayesian Neural Network for Pavement Performance Prediction (BNN4Pav). This model enables a single architecture to generate task-specific outputs for three distinct tasks, thereby reducing the model development effort from three categories of models to a single model category. Extensible downstream models are further constructed for each of these tasks, and the upstream–downstream framework is validated using 460 km of maintenance history data from Anhui, Zhejiang, and Jiangsu provinces in China. The analysis results demonstrate that in the multi-index prediction task with uncertainty quantification, a 66.7% reduction in time consumption is achieved. In the anomaly detection task where anomalous data are completely detected, the manual workload for normal data can be reduced by approximately 70%–90%. In maintenance decision-making tasks, the BNN4Pav-based method achieves a 6%–17% improvement in maintenance effectiveness over existing methods, without compromising decision-making requirements.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104413"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel hybrid neural network for high-accuracy vehicle-to-infrastructure network traffic prediction 基于混合神经网络的车辆与基础设施网络流量高精度预测
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-02-06 DOI: 10.1016/j.aei.2026.104423
Xiaosheng Ni , Jingpu Duan , Xiong Li , Xin Zhang
To address the challenges in Vehicle-to-Infrastructure (V2I) network traffic prediction, this study proposes an innovative solution. We first establish a novel paradigm that integrates physical models to systematically convert publicly available vehicle trajectory data into V2I traffic data. On this basis, a gCNN–BiLSTM–MHA deep learning model is constructed, whose core advantage lies in its use of a lightweight GhostNet-based convolutional network (gCNN) to improve computational efficiency, while leveraging the synergistic effect of a bidirectional long short-term memory network (BiLSTM) and a multi-head attention mechanism (MHA) to effectively balance prediction efficiency and accuracy. The model’s superiority is comprehensively validated: compared to baseline models like LSTM, it demonstrates significant advantages across a series of key evaluation metrics — including running time, MBD, MAE, MAPE, RMSE, and R2 — achieving an overall balanced performance. Furthermore, the model exhibits excellent performance on multiple benchmark datasets, confirming its strong robustness and high applicability for complex V2I network traffic prediction tasks.
为了应对车辆到基础设施(V2I)网络流量预测中的挑战,本研究提出了一种创新的解决方案。我们首先建立了一个新的范例,该范例集成了物理模型,系统地将公开可用的车辆轨迹数据转换为V2I交通数据。在此基础上,构建了gCNN - BiLSTM - MHA深度学习模型,其核心优势在于利用基于ghostnet的轻量级卷积网络(gCNN)提高计算效率,同时利用双向长短期记忆网络(BiLSTM)和多头注意机制(MHA)的协同效应,有效平衡预测效率和准确性。该模型的优势得到了全面验证:与LSTM等基线模型相比,它在一系列关键评估指标(包括运行时间、MBD、MAE、MAPE、RMSE和R2)上显示出显著优势,实现了整体平衡性能。此外,该模型在多个基准数据集上表现出优异的性能,证实了其较强的鲁棒性和对复杂V2I网络流量预测任务的高适用性。
{"title":"A novel hybrid neural network for high-accuracy vehicle-to-infrastructure network traffic prediction","authors":"Xiaosheng Ni ,&nbsp;Jingpu Duan ,&nbsp;Xiong Li ,&nbsp;Xin Zhang","doi":"10.1016/j.aei.2026.104423","DOIUrl":"10.1016/j.aei.2026.104423","url":null,"abstract":"<div><div>To address the challenges in Vehicle-to-Infrastructure (V2I) network traffic prediction, this study proposes an innovative solution. We first establish a novel paradigm that integrates physical models to systematically convert publicly available vehicle trajectory data into V2I traffic data. On this basis, a gCNN–BiLSTM–MHA deep learning model is constructed, whose core advantage lies in its use of a lightweight GhostNet-based convolutional network (gCNN) to improve computational efficiency, while leveraging the synergistic effect of a bidirectional long short-term memory network (BiLSTM) and a multi-head attention mechanism (MHA) to effectively balance prediction efficiency and accuracy. The model’s superiority is comprehensively validated: compared to baseline models like LSTM, it demonstrates significant advantages across a series of key evaluation metrics — including running time, MBD, MAE, MAPE, RMSE, and <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> — achieving an overall balanced performance. Furthermore, the model exhibits excellent performance on multiple benchmark datasets, confirming its strong robustness and high applicability for complex V2I network traffic prediction tasks.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104423"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Co-MixPL: An optimized semi-supervised learning method for tunnel water leakage detection Co-MixPL:一种优化的隧道漏水检测半监督学习方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2025-12-23 DOI: 10.1016/j.aei.2025.104263
Xujie Long , Jing Teng , Zhiwei Zhu , Shaobo Zhao , Mengyang Pu , Ruifeng Shi , You Lv , Jonathan Li , Guoqing Jing
The complex geometries, environmental variability, and inconsistent imaging conditions in shield tunnel linings pose substantial challenges to water leakage detection. Existing models heavily rely on extensive annotated data from diverse environments to ensure reliable performance across varying scenarios, which incurs significant time and labor costs in data annotation. To alleviate the annotation burden, we propose Co-MixPL, a novel semi-supervised learning approach that integrates labeled data with pseudo-labels generated by the Mixed Pseudo Label (MixPL) strategy to iteratively update the teacher-student models. Specifically, Co-MixPL integrates an additional head into the MixPL framework to enhance the encoder’s discriminative capability and introduces a Soft Regression method to mitigate the inherent localization bias in pseudo-labeling, refining the regression loss of pseudo-labels through adaptive reliability scores. Remarkably, experiments on the public “water leakage” dataset, Mendeley Data V1, demonstrate that Co-MixPL approaches state-of-the-art (SOTA) performance using only one-seventh of the training data and outperforms the SOTA by 2.8 AP with merely one-third of the annotations. These findings highlight the effectiveness of Co-MixPL in delivering superior detection performance with significantly reduced annotations, thus better meeting the practical demands of engineering inspection and maintenance. Codes are available at https://github.com/LXJ010/Co-MixPL.
盾构隧道衬砌复杂的几何形状、环境的可变性和不一致的成像条件,给漏水检测带来了巨大的挑战。现有模型严重依赖于来自不同环境的大量带注释的数据,以确保跨不同场景的可靠性能,这在数据注释方面产生了大量的时间和人工成本。为了减轻标注负担,我们提出了一种新的半监督学习方法Co-MixPL,该方法将标记数据与混合伪标签(MixPL)策略生成的伪标签集成在一起,迭代更新师生模型。具体而言,Co-MixPL在MixPL框架中集成了一个额外的头部,以增强编码器的判别能力,并引入了软回归方法来减轻伪标签中固有的定位偏差,通过自适应可靠性评分来改善伪标签的回归损失。值得注意的是,在公共“漏水”数据集Mendeley数据V1上的实验表明,Co-MixPL仅使用七分之一的训练数据就接近最先进(SOTA)的性能,并且仅使用三分之一的注释就比SOTA高出2.8 AP。这些发现突出了Co-MixPL在显著减少注释的情况下提供卓越检测性能的有效性,从而更好地满足了工程检测和维护的实际需求。代码可在https://github.com/LXJ010/Co-MixPL上获得。
{"title":"Co-MixPL: An optimized semi-supervised learning method for tunnel water leakage detection","authors":"Xujie Long ,&nbsp;Jing Teng ,&nbsp;Zhiwei Zhu ,&nbsp;Shaobo Zhao ,&nbsp;Mengyang Pu ,&nbsp;Ruifeng Shi ,&nbsp;You Lv ,&nbsp;Jonathan Li ,&nbsp;Guoqing Jing","doi":"10.1016/j.aei.2025.104263","DOIUrl":"10.1016/j.aei.2025.104263","url":null,"abstract":"<div><div>The complex geometries, environmental variability, and inconsistent imaging conditions in shield tunnel linings pose substantial challenges to water leakage detection. Existing models heavily rely on extensive annotated data from diverse environments to ensure reliable performance across varying scenarios, which incurs significant time and labor costs in data annotation. To alleviate the annotation burden, we propose Co-MixPL, a novel semi-supervised learning approach that integrates labeled data with pseudo-labels generated by the Mixed Pseudo Label (MixPL) strategy to iteratively update the teacher-student models. Specifically, Co-MixPL integrates an additional head into the MixPL framework to enhance the encoder’s discriminative capability and introduces a Soft Regression method to mitigate the inherent localization bias in pseudo-labeling, refining the regression loss of pseudo-labels through adaptive reliability scores. Remarkably, experiments on the public “water leakage” dataset, Mendeley Data V1, demonstrate that Co-MixPL approaches state-of-the-art (SOTA) performance using only one-seventh of the training data and outperforms the SOTA by 2.8 AP with merely one-third of the annotations. These findings highlight the effectiveness of Co-MixPL in delivering superior detection performance with significantly reduced annotations, thus better meeting the practical demands of engineering inspection and maintenance. Codes are available at <span><span>https://github.com/LXJ010/Co-MixPL</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104263"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enabling AI-driven modular building design: an auto-decoder approach for IFC 3D geometry representation 启用ai驱动的模块化建筑设计:IFC 3D几何表示的自动解码器方法
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-13 DOI: 10.1016/j.aei.2026.104326
Sang Du , Lei Hou , Guomin (Kevin) Zhang , Yang Zou , Haosen Chen
Modular building design requires numerous context-dependent component variants that traditional constraint-based methods cannot exhaustively enumerate. Industry Foundation Classes (IFC) models encode rich spatial and semantic context from completed modular projects. This context could enable Artificial Intelligence (AI) models to generate component variants and complement constraint-based methods. However, IFC 3D geometry that carries spatial context is not directly usable by AI models. This stems from IFC’s complex data structure. To address this limitation, this paper proposes a readily deployable auto-decoder method that produces AI-compatible vectors from IFC geometry. First, an IFC export strategy that retains component spatial context is employed. Second, a sampling method that pairs 3D points with their distances to the nearest surface is applied. Third, an auto-decoder neural network that jointly optimises per-component vectors and the model weights is presented, yielding context-aware representation vectors for modular components. Finally, an octree-based decoder for accurate geometry recovery from vectors is employed. Experiments on real-world modular project data demonstrate that the resulting vectors preserve geometric fidelity and support component variant generation. Geometric fidelity is confirmed by the mean and maximum surface reconstruction errors of 14.57 mm and 51.94 mm, sufficient for modular building design analysis. Support for component variant generation is evidenced by geometric interpolation linearity exceeding 0.98 out of 1, showing excellent variant generation suitability. This method makes IFC spatial context accessible to AI-driven modular design methods, transforming Design for Manufacture and Assembly (DfMA) data into actionable knowledge. Codes available on GitHub.
模块化建筑设计需要大量与上下文相关的组件变体,而传统的基于约束的方法无法详尽地列举这些变体。工业基础类(IFC)模型从已完成的模块化项目中编码丰富的空间和语义上下文。该上下文可以使人工智能(AI)模型生成组件变体并补充基于约束的方法。然而,带有空间背景的IFC 3D几何图形不能直接用于AI模型。这源于IFC复杂的数据结构。为了解决这一限制,本文提出了一种易于部署的自动解码器方法,该方法可以从IFC几何形状中产生与ai兼容的向量。首先,采用了保留组件空间上下文的IFC出口策略。其次,采用一种将三维点与其最近表面的距离配对的采样方法。第三,提出了一种自动解码器神经网络,该网络联合优化每个组件向量和模型权重,生成模块化组件的上下文感知表示向量。最后,采用基于八叉树的解码器对矢量进行精确的几何恢复。在实际模块化工程数据上的实验表明,所得到的向量保持了几何保真度,并支持组件变体的生成。平均表面重构误差为14.57 mm,最大表面重构误差为51.94 mm,证实了几何保真度,足以进行模块化建筑设计分析。几何插补线性度超过0.98 (out of 1),显示出良好的变量生成适宜性。这种方法使人工智能驱动的模块化设计方法可以访问IFC的空间背景,将制造和装配设计(DfMA)数据转化为可操作的知识。代码可在GitHub。
{"title":"Enabling AI-driven modular building design: an auto-decoder approach for IFC 3D geometry representation","authors":"Sang Du ,&nbsp;Lei Hou ,&nbsp;Guomin (Kevin) Zhang ,&nbsp;Yang Zou ,&nbsp;Haosen Chen","doi":"10.1016/j.aei.2026.104326","DOIUrl":"10.1016/j.aei.2026.104326","url":null,"abstract":"<div><div>Modular building design requires numerous context-dependent component variants that traditional constraint-based methods cannot exhaustively enumerate. Industry Foundation Classes (IFC) models encode rich spatial and semantic context from completed modular projects. This context could enable Artificial Intelligence (AI) models to generate component variants and complement constraint-based methods. However, IFC 3D geometry that carries spatial context is not directly usable by AI models. This stems from IFC’s complex data structure. To address this limitation, this paper proposes a readily deployable auto-decoder method that produces AI-compatible vectors from IFC geometry. First, an IFC export strategy that retains component spatial context is employed. Second, a sampling method that pairs 3D points with their distances to the nearest surface is applied. Third, an auto-decoder neural network that jointly optimises per-component vectors and the model weights is presented, yielding context-aware representation vectors for modular components. Finally, an octree-based decoder for accurate geometry recovery from vectors is employed. Experiments on real-world modular project data demonstrate that the resulting vectors preserve geometric fidelity and support component variant generation. Geometric fidelity is confirmed by the mean and maximum surface reconstruction errors of 14.57 mm and 51.94 mm, sufficient for modular building design analysis. Support for component variant generation is evidenced by geometric interpolation linearity exceeding 0.98 out of 1, showing excellent variant generation suitability. This method makes IFC spatial context accessible to AI-driven modular design methods, transforming Design for Manufacture and Assembly (DfMA) data into actionable knowledge. Codes available on GitHub.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104326"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Advanced Engineering Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1