首页 > 最新文献

Advanced Engineering Informatics最新文献

英文 中文
How does contextual fidelity impact how we think, talk, and act in AI-assisted engineering design? 情境保真度如何影响我们在人工智能辅助工程设计中的思考、交谈和行为?
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-01 Epub Date: 2026-02-12 DOI: 10.1016/j.aei.2026.104456
Shantanu Vyas, Suryapavan Cheruku, Vinayak R. Krishnamurthy
In this work, we investigate the role of generative AI agents as reflective partners in engineering design. While such models are increasingly used to generate design solutions, concerns remain about their potential to diminish designers’ critical thinking and reasoning skills. To address this, we develop a mixed-initiative conversational framework that positions large language and vision–language models as reflective thinking partners rather than solution providers. The framework is structured around five contextual information channels, namely task–role context, design representations, historical context, evaluation signals, and target references, that enable AI agents to ask reflective questions and provide explanations and suggestions. To study this framework within a concrete design context, we develop an interactive tool that embodies the notion of contextual fidelity of 2D structure design tasks. We implement varying levels of contextual fidelity, defined by the extent of contextual information available to the agent. We evaluate these levels through a between-subjects study with forty-six participants, comparing a high-fidelity and a low-fidelity agent against a control group without AI support. We examine the impact of the agents on how users think, talk and act, using a comprehensive set of metrics, including coarse-level design objectives (deformation and material usage), solution quality metrics (structural and geometric analysis), process-oriented measures (design space exploration patterns and trajectories, design strategy shifts), conversational dynamics (thematic and temporal analysis), and subjective surveys (NASA-TLX, Cognitive Load Theory, Trust in AI). Our analyses show that while conversational agents do not immediately help improve coarse-level design objectives, they significantly shape nuanced aspects of design processes and outcomes. Interaction with the agents critically influences how users explore the design space, where agent-supported groups exhibited more focused exploration patterns compared to the control group’s broader trial-and-error approaches. Furthermore, interactions with the high-fidelity agent led to solutions with higher symmetry and topological alignment with optimal designs, fostered deeper reflection, reduced mental demand, and supported more deliberate design decisions. Building on these findings, we discuss broader implications of AI agents for problem-solving processes and outline guidelines for designing adaptive and generalizable frameworks for different domains.
在这项工作中,我们研究了生成人工智能代理在工程设计中作为反思伙伴的作用。虽然这些模型越来越多地用于生成设计解决方案,但人们仍然担心它们可能会削弱设计师的批判性思维和推理能力。为了解决这个问题,我们开发了一个混合主动对话框架,将大型语言和视觉语言模型定位为反思思维伙伴,而不是解决方案提供者。该框架围绕五个上下文信息通道构建,即任务-角色上下文、设计表示、历史上下文、评估信号和目标参考,使人工智能代理能够提出反思性问题,并提供解释和建议。为了在具体的设计环境中研究这个框架,我们开发了一个交互式工具,它体现了2D结构设计任务的上下文保真度概念。我们实现了不同级别的上下文保真度,由代理可获得的上下文信息的程度来定义。我们通过一项有46名参与者的受试者间研究来评估这些水平,将高保真度和低保真度代理与没有人工智能支持的对照组进行比较。我们使用一套全面的指标,包括粗级设计目标(变形和材料使用)、解决方案质量指标(结构和几何分析)、面向过程的指标(设计空间探索模式和轨迹、设计策略转变)、会话动态(主题和时间分析)和主观调查(NASA-TLX、认知负荷理论、对人工智能的信任),研究代理对用户思考、交谈和行动方式的影响。我们的分析表明,虽然会话代理不能立即帮助改进粗略的设计目标,但它们显著地塑造了设计过程和结果的微妙方面。与代理的交互严重影响用户探索设计空间的方式,与控制组更广泛的试错方法相比,代理支持组表现出更集中的探索模式。此外,与高保真度智能体的交互导致解决方案具有更高的对称性和与最优设计的拓扑一致性,促进更深层次的反思,减少心理需求,并支持更深思熟虑的设计决策。在这些发现的基础上,我们讨论了人工智能代理对问题解决过程的更广泛影响,并概述了为不同领域设计自适应和可推广框架的指南。
{"title":"How does contextual fidelity impact how we think, talk, and act in AI-assisted engineering design?","authors":"Shantanu Vyas,&nbsp;Suryapavan Cheruku,&nbsp;Vinayak R. Krishnamurthy","doi":"10.1016/j.aei.2026.104456","DOIUrl":"10.1016/j.aei.2026.104456","url":null,"abstract":"<div><div>In this work, we investigate the role of generative AI agents as reflective partners in engineering design. While such models are increasingly used to generate design solutions, concerns remain about their potential to diminish designers’ critical thinking and reasoning skills. To address this, we develop a mixed-initiative conversational framework that positions large language and vision–language models as reflective thinking partners rather than solution providers. The framework is structured around five contextual information channels, namely task–role context, design representations, historical context, evaluation signals, and target references, that enable AI agents to ask reflective questions and provide explanations and suggestions. To study this framework within a concrete design context, we develop an interactive tool that embodies the notion of contextual fidelity of 2D structure design tasks. We implement varying levels of contextual fidelity, defined by the extent of contextual information available to the agent. We evaluate these levels through a between-subjects study with forty-six participants, comparing a high-fidelity and a low-fidelity agent against a control group without AI support. We examine the impact of the agents on how users think, talk and act, using a comprehensive set of metrics, including coarse-level design objectives (deformation and material usage), solution quality metrics (structural and geometric analysis), process-oriented measures (design space exploration patterns and trajectories, design strategy shifts), conversational dynamics (thematic and temporal analysis), and subjective surveys (NASA-TLX, Cognitive Load Theory, Trust in AI). Our analyses show that while conversational agents do not immediately help improve coarse-level design objectives, they significantly shape nuanced aspects of design processes and outcomes. Interaction with the agents critically influences how users explore the design space, where agent-supported groups exhibited more focused exploration patterns compared to the control group’s broader trial-and-error approaches. Furthermore, interactions with the high-fidelity agent led to solutions with higher symmetry and topological alignment with optimal designs, fostered deeper reflection, reduced mental demand, and supported more deliberate design decisions. Building on these findings, we discuss broader implications of AI agents for problem-solving processes and outline guidelines for designing adaptive and generalizable frameworks for different domains.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"72 ","pages":"Article 104456"},"PeriodicalIF":9.9,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IDS-Net: A novel framework for few-shot photovoltaic power prediction with interpretable dynamic selection and feature information fusion IDS-Net:基于可解释动态选择和特征信息融合的少射光伏功率预测新框架
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-01 Epub Date: 2026-02-12 DOI: 10.1016/j.aei.2026.104455
Hang Fan , Weican Liu , Zuhan Zhang , Ying Lu , Wencai Run , Dunnan Liu
With the growing demand for renewable energy, countries are accelerating the construction of photovoltaic (PV) power stations. However, accurately forecasting power data for newly constructed PV stations is extremely challenging due to limited data availability. To this end, we propose a novel interpretable dynamic selection network (IDS-Net) based on feature information fusion to achieve accurate few-shot prediction. This transfer learning framework primarily consists of two parts. In the first stage, we pre-train on the large dataset, utilizing Maximum Mean Discrepancy (MMD) to select the source domain dataset most similar to the target domain data distribution. Subsequently, the ReliefF algorithm is utilized for feature selection, reducing the influence of feature redundancy. Then, the Hampel Identifier (HI) is used for training dataset outlier correction. In the IDS-Net model, we first obtain the initial extracted features from a pool of predictive models. Following this, two separate weighting channels are utilized to determine the interpretable weights for each sub-model and the adaptive selection outcomes, respectively. Subsequently, the extracted feature results from each sub-model are multiplied by their corresponding weights and then summed to obtain the weighted extracted features. Then, we perform cross-embedding on the additional features and fuse them with the extracted weighted features. This fused information is then passed through the MLP (Multi-Layer Perceptron) layer to obtain predictions. In the second stage, we design an end-to-end adaptive transfer learning strategy to obtain the final prediction results on the target dataset. We validate the transfer learning process using two PV power datasets from Hebei province, China, to demonstrate the effectiveness and generalization of our framework and transfer learning strategy.
随着可再生能源需求的不断增长,各国都在加快光伏电站的建设。然而,由于数据可用性有限,准确预测新建光伏电站的功率数据极具挑战性。为此,我们提出了一种基于特征信息融合的可解释动态选择网络(IDS-Net),以实现精确的少镜头预测。该迁移学习框架主要由两部分组成。在第一阶段,我们在大数据集上进行预训练,利用最大平均差异(MMD)选择与目标域数据分布最相似的源域数据集。随后,利用ReliefF算法进行特征选择,减少了特征冗余的影响。然后,使用Hampel标识符(HI)对训练数据集进行离群值校正。在IDS-Net模型中,我们首先从一组预测模型中获得初始提取的特征。然后,利用两个独立的加权通道分别确定每个子模型的可解释权重和自适应选择结果。然后,将每个子模型提取的特征结果与它们对应的权重相乘,然后求和,得到加权提取的特征。然后,对附加特征进行交叉嵌入,并将其与提取的加权特征融合。然后将融合的信息通过MLP(多层感知器)层进行预测。在第二阶段,我们设计了一个端到端的自适应迁移学习策略,以获得目标数据集上的最终预测结果。我们使用来自中国河北省的两个光伏发电数据集验证迁移学习过程,以证明我们的框架和迁移学习策略的有效性和泛化性。
{"title":"IDS-Net: A novel framework for few-shot photovoltaic power prediction with interpretable dynamic selection and feature information fusion","authors":"Hang Fan ,&nbsp;Weican Liu ,&nbsp;Zuhan Zhang ,&nbsp;Ying Lu ,&nbsp;Wencai Run ,&nbsp;Dunnan Liu","doi":"10.1016/j.aei.2026.104455","DOIUrl":"10.1016/j.aei.2026.104455","url":null,"abstract":"<div><div>With the growing demand for renewable energy, countries are accelerating the construction of photovoltaic (PV) power stations. However, accurately forecasting power data for newly constructed PV stations is extremely challenging due to limited data availability. To this end, we propose a novel interpretable dynamic selection network (IDS-Net) based on feature information fusion to achieve accurate few-shot prediction. This transfer learning framework primarily consists of two parts. In the first stage, we pre-train on the large dataset, utilizing Maximum Mean Discrepancy (MMD) to select the source domain dataset most similar to the target domain data distribution. Subsequently, the ReliefF algorithm is utilized for feature selection, reducing the influence of feature redundancy. Then, the Hampel Identifier (HI) is used for training dataset outlier correction. In the IDS-Net model, we first obtain the initial extracted features from a pool of predictive models. Following this, two separate weighting channels are utilized to determine the interpretable weights for each sub-model and the adaptive selection outcomes, respectively. Subsequently, the extracted feature results from each sub-model are multiplied by their corresponding weights and then summed to obtain the weighted extracted features. Then, we perform cross-embedding on the additional features and fuse them with the extracted weighted features. This fused information is then passed through the MLP (Multi-Layer Perceptron) layer to obtain predictions. In the second stage, we design an end-to-end adaptive transfer learning strategy to obtain the final prediction results on the target dataset. We validate the transfer learning process using two PV power datasets from Hebei province, China, to demonstrate the effectiveness and generalization of our framework and transfer learning strategy.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"72 ","pages":"Article 104455"},"PeriodicalIF":9.9,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146154143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A unified LLM-KG framework for low‑annotation urban rail transit signal system operation: knowledge acquisition and dynamic update 低标注城市轨道交通信号系统运行的统一LLM-KG框架:知识获取和动态更新
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-24 DOI: 10.1016/j.aei.2026.104327
Wei Cai , Xiaomin Zhu , Zeyu Sun , Aihui Ye , Guanhua Fu , Runtong Zhang
Intelligent operation and maintenance (O&M) of urban rail transit signal systems (URTSS) is essential for ensuring train safety and operational efficiency. However, most O&M data exist as unstructured and sparsely labeled texts, posing major challenges for reliable knowledge extraction, semantic reasoning, and dynamic knowledge management. To address these issues, this paper proposes a unified large language model-knowledge graph framework (ULLM-KG) tailored for low-annotation, knowledge-intensive O&M environments. Firstly, a bidirectional knowledge graph construction mechanism (BKGC) is introduced to rapidly build a domain-specific initial knowledge graph. Secondly, a KG-enhanced distantly supervised entity and event extraction method (KG-DS3E) is designed to enhance critical knowledge extraction accuracy from unstructured texts. Thirdly, a prompt-driven knowledge-enhanced reasoning method (PD-KER) is proposed to improve semantic quality in fault diagnosis and maintenance recommendations. Lastly, a dynamic knowledge graph updating mechanism with temporal awareness and conflict resolution (DKG-UCF) is used to ensure efficient and accurate knowledge evolution. Based on real-world URTSS O&M data, experimental evaluations are conducted on state-of-the-art LLMs (GPT-4o, DeepSeek-V3, and Qwen3-32B). On datasets with varying annotation ratios and rare faults, ULLM-KG demonstrates significantly superior performance in knowledge extraction and reasoning tasks compared to other state-of-the-art methods. Its ability to dynamically update knowledge is also verified to be excellent. ULLM-KG provides a general solution for the intelligent O&M of URTSS under low-annotation conditions.
城市轨道交通信号系统(URTSS)的智能运维是保障列车安全和运行效率的关键。然而,大多数O&;M数据以非结构化和稀疏标记的文本形式存在,这对可靠的知识提取、语义推理和动态知识管理提出了重大挑战。为了解决这些问题,本文提出了一个统一的大型语言模型-知识图框架(ULLM-KG),该框架专为低注释、知识密集型的操作和管理环境量身定制。首先,引入双向知识图谱构建机制(BKGC),快速构建特定领域的初始知识图谱;其次,设计了一种kg增强的远程监督实体和事件提取方法(KG-DS3E),以提高从非结构化文本中提取关键知识的准确性。再次,提出了一种提示驱动的知识增强推理方法(PD-KER),以提高故障诊断和维修建议的语义质量。最后,采用一种具有时间感知和冲突解决的动态知识图更新机制(DKG-UCF)来保证知识进化的高效和准确。基于真实的URTSS o&m数据,在最先进的llm (gpt - 40、DeepSeek-V3和Qwen3-32B)上进行了实验评估。在具有不同标注比率和罕见错误的数据集上,ULLM-KG在知识提取和推理任务中表现出明显优于其他最先进方法的性能。其动态更新知识的能力也被证明是优秀的。ULLM-KG为低标注条件下URTSS的智能运维提供了一种通用的解决方案。
{"title":"A unified LLM-KG framework for low‑annotation urban rail transit signal system operation: knowledge acquisition and dynamic update","authors":"Wei Cai ,&nbsp;Xiaomin Zhu ,&nbsp;Zeyu Sun ,&nbsp;Aihui Ye ,&nbsp;Guanhua Fu ,&nbsp;Runtong Zhang","doi":"10.1016/j.aei.2026.104327","DOIUrl":"10.1016/j.aei.2026.104327","url":null,"abstract":"<div><div>Intelligent operation and maintenance (O&amp;M) of urban rail transit signal systems (URTSS) is essential for ensuring train safety and operational efficiency. However, most O&amp;M data exist as unstructured and sparsely labeled texts, posing major challenges for reliable knowledge extraction, semantic reasoning, and dynamic knowledge management. To address these issues, this paper proposes a unified large language model-knowledge graph framework (ULLM-KG) tailored for low-annotation, knowledge-intensive O&amp;M environments. Firstly, a bidirectional knowledge graph construction mechanism (BKGC) is introduced to rapidly build a domain-specific initial knowledge graph. Secondly, a KG-enhanced distantly supervised entity and event extraction method (KG-DS3E) is designed to enhance critical knowledge extraction accuracy from unstructured texts. Thirdly, a prompt-driven knowledge-enhanced reasoning method (PD-KER) is proposed to improve semantic quality in fault diagnosis and maintenance recommendations. Lastly, a dynamic knowledge graph updating mechanism with temporal awareness and conflict resolution (DKG-UCF) is used to ensure efficient and accurate knowledge evolution. Based on real-world URTSS O&amp;M data, experimental evaluations are conducted on state-of-the-art LLMs (GPT-4o, DeepSeek-V3, and Qwen3-32B). On datasets with varying annotation ratios and rare faults, ULLM-KG demonstrates significantly superior performance in knowledge extraction and reasoning tasks compared to other state-of-the-art methods. Its ability to dynamically update knowledge is also verified to be excellent. ULLM-KG provides a general solution for the intelligent O&amp;M of URTSS under low-annotation conditions.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104327"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146078379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tunable plasmonic absorption in metal–dielectric multilayers via FDTD simulations and an explainable machine learning approach 通过FDTD模拟和可解释的机器学习方法在金属介质多层中的可调谐等离子体吸收
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-09 DOI: 10.1016/j.aei.2026.104311
Emmanuel A. Bamidele
Plasmonic devices, fundamental to modern nanophotonics, exploit resonant interactions between light and free electrons in metals to achieve enhanced light trapping and electromagnetic field confinement. However, modeling their complex, nonlinear optical responses remain computationally intensive. In this work, we combine finite-difference time-domain (FDTD) simulations with machine learning (ML) to simulate and predict absorbed power behavior in multilayer plasmonic stacks composed of SiO2, gold (Au), silver (Ag), and indium tin oxide (ITO). By varying Au and Ag thicknesses (10–50  nm) across a spectral range of 300–1500  nm, we generate spatial absorption maps and integrated power metrics from full-wave solutions to Maxwell’s equations. A multilayer perceptron models global absorption behavior with a mean absolute error (MAE) of 0.0953, while a convolutional neural network predicts spatial absorption distributions with an MAE of 0.0101. SHapley Additive exPlanations identify plasmonic layer thickness and excitation wavelength as dominant contributors to absorption, which peaks between 450 and 850  nm. Gold demonstrates broader and more sustained absorption compared to silver, although both metals exhibit reduced efficiency outside the resonance window. The integrated FDTD–ML framework accelerates plasmonic design while maintaining physical interpretability and predictive accuracy, enabling efficient exploration of tunable optical responses in multilayer nanophotonic systems for applications in optical sensing, photovoltaics, and device engineering.
等离子体器件是现代纳米光子学的基础,利用金属中光和自由电子之间的共振相互作用来实现增强的光捕获和电磁场约束。然而,模拟它们复杂的非线性光学响应仍然需要大量的计算。在这项工作中,我们将时域有限差分(FDTD)模拟与机器学习(ML)相结合,以模拟和预测由SiO2、金(Au)、银(Ag)和氧化铟锡(ITO)组成的多层等离子体堆叠的吸收功率行为。通过在300-1500 nm的光谱范围内改变Au和Ag的厚度(10-50 nm),我们从麦克斯韦方程的全波解中生成空间吸收图和集成功率指标。多层感知器模型全局吸收行为的平均绝对误差(MAE)为0.0953,而卷积神经网络预测空间吸收分布的平均绝对误差为0.0101。SHapley加性解释确定等离子体层厚度和激发波长是吸收的主要贡献者,其峰值在450和850 nm之间。与银相比,金表现出更广泛和更持久的吸收,尽管这两种金属在共振窗口外的效率都有所降低。集成的FDTD-ML框架加速了等离子体设计,同时保持了物理可解释性和预测准确性,能够有效地探索用于光学传感、光伏和器件工程应用的多层纳米光子系统中的可调谐光学响应。
{"title":"Tunable plasmonic absorption in metal–dielectric multilayers via FDTD simulations and an explainable machine learning approach","authors":"Emmanuel A. Bamidele","doi":"10.1016/j.aei.2026.104311","DOIUrl":"10.1016/j.aei.2026.104311","url":null,"abstract":"<div><div>Plasmonic devices, fundamental to modern nanophotonics, exploit resonant interactions between light and free electrons in metals to achieve enhanced light trapping and electromagnetic field confinement. However, modeling their complex, nonlinear optical responses remain computationally intensive. In this work, we combine finite-difference time-domain (FDTD) simulations with machine learning (ML) to simulate and predict absorbed power behavior in multilayer plasmonic stacks composed of SiO<sub>2</sub>, gold (Au), silver (Ag), and indium tin oxide (ITO). By varying Au and Ag thicknesses (10–50  nm) across a spectral range of 300–1500  nm, we generate spatial absorption maps and integrated power metrics from full-wave solutions to Maxwell’s equations. A multilayer perceptron models global absorption behavior with a mean absolute error (MAE) of 0.0953, while a convolutional neural network predicts spatial absorption distributions with an MAE of 0.0101. SHapley Additive exPlanations identify plasmonic layer thickness and excitation wavelength as dominant contributors to absorption, which peaks between 450 and 850  nm. Gold demonstrates broader and more sustained absorption compared to silver, although both metals exhibit reduced efficiency outside the resonance window. The integrated FDTD–ML framework accelerates plasmonic design while maintaining physical interpretability and predictive accuracy, enabling efficient exploration of tunable optical responses in multilayer nanophotonic systems for applications in optical sensing, photovoltaics, and device engineering.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104311"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TSDR-SFE: A prediction model for dam crack width based on two-stage decomposition–reconstruction and spatiotemporal feature extraction 基于两阶段分解-重构和时空特征提取的大坝裂缝宽度预测模型
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-10 DOI: 10.1016/j.aei.2025.104290
Yixiang Fang , Chongshi Gu , Yangtao Li , Yiming Wang , Taiqi Lu , Lei Shen , Xiao Sun , Mingyuan Zhu , Fuqiang Zhou , Sitao Fu , Hao Gu
Crack formation and propagation in concrete dams present substantial risks to long-term structural integrity and operational stability. Accurate crack-width prediction models are thus essential for ensuring the continued safe operation of these structures. However, crack width monitoring data are highly nonlinear and non-stationary, limiting the effectiveness of traditional single-technique signal decomposition methods in capturing their complex time–frequency dynamics. To overcome these challenges, this study proposes a predictive model (TSDR-SFE) that integrates two-stage decomposition–reconstruction with spatiotemporal feature extraction. The model begins by applying improved empirical mode decomposition (IEMD) to decompose the crack width time series into intrinsic mode functions (IMFs). Then, spatial post-multiscale fusion entropy (SPMFE) is used to compute the entropy value of each IMF, which subsequently serves as input to the k-Graph clustering algorithm. By constructing a graph-based structure, IMFs are classified and reconstructed into stochastic, periodic, and trend components. Next, the stochastic component, due to its higher complexity, undergoes a second decomposition using successive variational mode decomposition (SVMD). Finally, the trend and periodic components, along with the subsequences of the stochastic component obtained, are separately predicted using the TSMixer, which effectively extracts spatiotemporal features. The final prediction is obtained by aggregating the prediction results of the three components. Using monitoring data collected from seven crack-width meters installed at multiple locations on a concrete dam, the accuracy and generalization capability of TSDR-SFE are evaluated by comparing its predictive performance with ten ablation models and twelve benchmark models. The experimental results show that TSDR-SFE consistently outperforms all comparative models in both fitting accuracy and predictive performance, achieving coefficient of determination (R2) values above 0.97 and exhibiting the most compact residual distributions. These findings confirm that the layered strategy of decomposition, clustering, secondary decomposition, and modeling effectively reduces the complexity of non-stationary time series. This reduction facilitates the extraction of deeper knowledge from simplified components. It provides a robust theoretical basis for diagnosing crack evolution patterns and guiding engineering safety decisions.
混凝土大坝裂缝的形成和扩展对结构的长期完整性和运行稳定性构成重大风险。因此,准确的裂缝宽度预测模型对于确保这些结构的持续安全运行至关重要。然而,裂缝宽度监测数据是高度非线性和非平稳的,这限制了传统的单技术信号分解方法在捕获其复杂时频动态方面的有效性。为了克服这些挑战,本研究提出了一种将两阶段分解重建与时空特征提取相结合的预测模型(TSDR-SFE)。该模型首先采用改进的经验模态分解(IEMD)将裂缝宽度时间序列分解为内禀模态函数(IMFs)。然后,利用空间后多尺度融合熵(SPMFE)计算每个IMF的熵值,作为k-Graph聚类算法的输入。通过构建基于图的结构,将imf分类并重构为随机、周期和趋势分量。接下来,随机分量由于其较高的复杂性,使用连续变分模态分解(SVMD)进行第二次分解。最后,利用TSMixer分别预测得到的趋势分量和周期分量以及随机分量的子序列,有效地提取了时空特征。将三个分量的预测结果进行汇总得到最终的预测结果。利用安装在混凝土大坝多个位置的7台缝宽仪的监测数据,通过与10种烧蚀模型和12种基准模型的预测性能比较,对TSDR-SFE的预测精度和泛化能力进行了评价。实验结果表明,TSDR-SFE在拟合精度和预测性能上均优于所有比较模型,决定系数(R2)值均在0.97以上,残差分布最紧凑。这些发现证实了分解、聚类、二次分解和建模的分层策略有效地降低了非平稳时间序列的复杂性。这种简化有助于从简化的组件中提取更深层次的知识。为裂缝演化模式诊断和指导工程安全决策提供了有力的理论依据。
{"title":"TSDR-SFE: A prediction model for dam crack width based on two-stage decomposition–reconstruction and spatiotemporal feature extraction","authors":"Yixiang Fang ,&nbsp;Chongshi Gu ,&nbsp;Yangtao Li ,&nbsp;Yiming Wang ,&nbsp;Taiqi Lu ,&nbsp;Lei Shen ,&nbsp;Xiao Sun ,&nbsp;Mingyuan Zhu ,&nbsp;Fuqiang Zhou ,&nbsp;Sitao Fu ,&nbsp;Hao Gu","doi":"10.1016/j.aei.2025.104290","DOIUrl":"10.1016/j.aei.2025.104290","url":null,"abstract":"<div><div>Crack formation and propagation in concrete dams present substantial risks to long-term structural integrity and operational stability. Accurate crack-width prediction models are thus essential for ensuring the continued safe operation of these structures. However, crack width monitoring data are highly nonlinear and non-stationary, limiting the effectiveness of traditional single-technique signal decomposition methods in capturing their complex time–frequency dynamics. To overcome these challenges, this study proposes a predictive model (TSDR-SFE) that integrates two-stage decomposition–reconstruction with spatiotemporal feature extraction. The model begins by applying improved empirical mode decomposition (IEMD) to decompose the crack width time series into intrinsic mode functions (IMFs). Then, spatial post-multiscale fusion entropy (SPMFE) is used to compute the entropy value of each IMF, which subsequently serves as input to the k-Graph clustering algorithm. By constructing a graph-based structure, IMFs are classified and reconstructed into stochastic, periodic, and trend components. Next, the stochastic component, due to its higher complexity, undergoes a second decomposition using successive variational mode decomposition (SVMD). Finally, the trend and periodic components, along with the subsequences of the stochastic component obtained, are separately predicted using the TSMixer, which effectively extracts spatiotemporal features. The final prediction is obtained by aggregating the prediction results of the three components. Using monitoring data collected from seven crack-width meters installed at multiple locations on a concrete dam, the accuracy and generalization capability of TSDR-SFE are evaluated by comparing its predictive performance with ten ablation models and twelve benchmark models. The experimental results show that TSDR-SFE consistently outperforms all comparative models in both fitting accuracy and predictive performance, achieving coefficient of determination (R<sup>2</sup>) values above 0.97 and exhibiting the most compact residual distributions. These findings confirm that the layered strategy of decomposition, clustering, secondary decomposition, and modeling effectively reduces the complexity of non-stationary time series. This reduction facilitates the extraction of deeper knowledge from simplified components. It provides a robust theoretical basis for diagnosing crack evolution patterns and guiding engineering safety decisions.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104290"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved tuna swarm optimization with dimension learning-based hunting for global optimization and real-world engineering applications 面向全局优化和实际工程应用的基于维度学习的改进金枪鱼群优化
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-08 DOI: 10.1016/j.aei.2025.104298
Eda Özkul
This study proposes an improved tuna swarm optimization algorithm (I-TSO) for solving global optimization and engineering design problems. However, despite its strong global search ability, tuna swarm optimization (TSO) suffers from trapping in local optima, having premature convergence, and the loss of diversity in the early stage. To eliminate these disadvantages and improve the original TSO, the proposed I-TSO algorithm uses a dimension learning-based hunting (DLH) strategy. DLH constructs a neighborhood for each tuna in the population and uses that information in the optimization process. Thus, it improves population diversity, provides a proper balance between exploration and exploitation, and prevents trapping into local optima. The performance of the proposed algorithm is evaluated on 23 classical benchmark functions, CEC-2017, CEC-2020, and CEC-2022 test suites, and compared it with eight other optimization algorithms. Comparative results demonstrate that I-TSO exhibits stable and effective optimization capabilities. Further, the Friedman test and Wilcoxon signed-rank test are conducted to statistically evaluate the performance of the proposed algorithm, and thus its superiority is statistically confirmed. Moreover, the applicability of the I-TSO in real-world problems is validated on eight engineering design problems. Consequently, the I-TSO algorithm is capable of solving both numerical and engineering design problems with its efficient and superior performance.
本文提出了一种改进的金枪鱼群优化算法(I-TSO),用于解决全局优化和工程设计问题。然而,金枪鱼群优化算法(TSO)虽然具有较强的全局搜索能力,但存在陷入局部最优、过早收敛、早期多样性丧失等问题。为了消除这些缺点并改进原TSO算法,本文提出的I-TSO算法采用了一种基于维学习的搜索(DLH)策略。DLH为种群中的每条金枪鱼构建一个邻域,并在优化过程中使用该信息。因此,它提高了种群多样性,在勘探和开采之间提供了适当的平衡,并防止陷入局部最优状态。在23个经典基准函数、CEC-2017、CEC-2020和CEC-2022测试套件上对该算法的性能进行了评估,并与其他8种优化算法进行了比较。对比结果表明,I-TSO具有稳定有效的优化能力。通过Friedman检验和Wilcoxon有符号秩检验对算法的性能进行统计评价,从而证实了算法的优越性。并通过8个工程设计问题验证了该方法在实际问题中的适用性。因此,I-TSO算法能够以其高效和优越的性能解决数值和工程设计问题。
{"title":"An improved tuna swarm optimization with dimension learning-based hunting for global optimization and real-world engineering applications","authors":"Eda Özkul","doi":"10.1016/j.aei.2025.104298","DOIUrl":"10.1016/j.aei.2025.104298","url":null,"abstract":"<div><div>This study proposes an improved tuna swarm optimization algorithm (I-TSO) for solving global optimization and engineering design problems. However, despite its strong global search ability, tuna swarm optimization (TSO) suffers from trapping in local optima, having premature convergence, and the loss of diversity in the early stage. To eliminate these disadvantages and improve the original TSO, the proposed I-TSO algorithm uses a dimension learning-based hunting (DLH) strategy. DLH constructs a neighborhood for each tuna in the population and uses that information in the optimization process. Thus, it improves population diversity, provides a proper balance between exploration and exploitation, and prevents trapping into local optima. The performance of the proposed algorithm is evaluated on 23 classical benchmark functions, CEC-2017, CEC-2020, and CEC-2022 test suites, and compared it with eight other optimization algorithms. Comparative results demonstrate that I-TSO exhibits stable and effective optimization capabilities. Further, the Friedman test and Wilcoxon signed-rank test are conducted to statistically evaluate the performance of the proposed algorithm, and thus its superiority is statistically confirmed. Moreover, the applicability of the I-TSO in real-world problems is validated on eight engineering design problems. Consequently, the I-TSO algorithm is capable of solving both numerical and engineering design problems with its efficient and superior performance.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104298"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolution of text mining in construction industry: an LLM-driven analysis of statistical machine learning dominance and internal-external delayed LLM adoption 建筑行业文本挖掘的演变:统计机器学习优势和内外延迟LLM采用的LLM驱动分析
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-10 DOI: 10.1016/j.aei.2026.104321
Jinjing Li , Xianbo Zhao , Haizhe Yu , Lili Gao , Xiaopeng Deng , Bon-Gang Hwang
Over 85 % of construction data remains unstructured, creating urgent needs for text mining (TM). While considerable attention has been directed toward the evolution of TM, a critical gap persists in the form of diachronic analysis, with limited exploration of its trajectory in the context of large language models (LLMs). Hence, this research aims to: (1) generate the LLM-based TM framework for construction; (2) explore different evolutions of TM methods in construction; and (3) identify the driving factors for the evolution. To achieve these objectives, two LLM-based TM methods were used to review the TM-related literature. The results reveal a dual delay pattern: internally, statistical machine learning maintains dominance over LLMs in the construction industry, while externally, LLM adoption lags 2–3 years behind sectors such as healthcare and biomedicine. The study extends existing taxonomies by introducing novel data sources (elicited discourse corpora and multimodal data) and establishing software-based analysis as a distinct methodological stage. Moreover, it addresses the research paradigm gap for LLM-based TM, offering enhanced strategic guidance for practitioners in selecting TM tools.
超过85%的建筑数据仍然是非结构化的,这就产生了对文本挖掘(TM)的迫切需求。虽然人们对TM的发展给予了相当大的关注,但在历时分析方面仍然存在一个关键的差距,在大型语言模型(llm)的背景下对其轨迹的探索有限。因此,本研究旨在:(1)生成基于llm的TM框架进行构建;(2)探索TM方法在施工中的不同演变;(3)识别演化的驱动因素。为了实现这些目标,我们使用了两种基于法学的TM方法来回顾TM相关文献。结果揭示了一种双重延迟模式:在内部,统计机器学习在建筑行业保持对法学硕士的主导地位,而在外部,法学硕士的采用落后于医疗保健和生物医药等行业2-3年。该研究通过引入新的数据源(引出话语语料库和多模态数据)和建立基于软件的分析作为一个独特的方法阶段来扩展现有的分类法。此外,它还解决了基于法学硕士的TM的研究范式差距,为从业者选择TM工具提供了增强的战略指导。
{"title":"Evolution of text mining in construction industry: an LLM-driven analysis of statistical machine learning dominance and internal-external delayed LLM adoption","authors":"Jinjing Li ,&nbsp;Xianbo Zhao ,&nbsp;Haizhe Yu ,&nbsp;Lili Gao ,&nbsp;Xiaopeng Deng ,&nbsp;Bon-Gang Hwang","doi":"10.1016/j.aei.2026.104321","DOIUrl":"10.1016/j.aei.2026.104321","url":null,"abstract":"<div><div>Over 85 % of construction data remains unstructured, creating urgent needs for text mining (TM). While considerable attention has been directed toward the evolution of TM, a critical gap persists in the form of diachronic analysis, with limited exploration of its trajectory in the context of large language models (LLMs). Hence, this research aims to: (1) generate the LLM-based TM framework for construction; (2) explore different evolutions of TM methods in construction; and (3) identify the driving factors for the evolution. To achieve these objectives, two LLM-based TM methods were used to review the TM-related literature. The results reveal a dual delay pattern: internally, statistical machine learning maintains dominance over LLMs in the construction industry, while externally, LLM adoption lags 2–3 years behind sectors such as healthcare and biomedicine. The study extends existing taxonomies by introducing novel data sources (elicited discourse corpora and multimodal data) and establishing software-based analysis as a distinct methodological stage. Moreover, it addresses the research paradigm gap for LLM-based TM, offering enhanced strategic guidance for practitioners in selecting TM tools.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104321"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vision–proprioception fusion with Mamba2 in end-to-end reinforcement learning for motion control 基于Mamba2的视觉本体感觉融合在端到端运动控制强化学习中的应用
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-29 DOI: 10.1016/j.aei.2026.104389
Xiaowen Tao , Yinuo Wang , Jinzhao Zhou
End-to-end reinforcement learning (RL) for motion control trains policies directly from sensor inputs to motor commands, enabling unified controllers for different robots and tasks. However, most existing methods are either blind (proprioception-only) or rely on fusion backbones with unfavorable compute–memory trade-offs. Recurrent controllers struggle with long-horizon credit assignment, and Transformer-based fusion incurs quadratic cost in token length, limiting temporal and spatial context. We present a vision-driven cross-modal RL framework built on SSD-Mamba2, a selective state–space backbone that applies state–space duality (SSD) to enable both recurrent and convolutional scanning with hardware-aware streaming and near-linear scaling. Proprioceptive states and exteroceptive observations (e.g., depth tokens) are encoded into compact tokens and fused by stacked SSD-Mamba2 layers. The selective state–space updates retain long-range dependencies with markedly lower latency and memory use than quadratic self-attention, enabling longer look-ahead, higher token resolution, and stable training under limited compute. Policies are trained end-to-end under curricula that randomize terrain and appearance and progressively increase scene complexity. A compact, state-centric reward balances task progress, energy efficiency, and safety. Across diverse motion-control scenarios, our approach consistently surpasses strong state-of-the-art baselines in return, safety (collisions and falls), and sample efficiency, while converging faster at the same compute budget. These results suggest that SSD-Mamba2 provides a practical fusion backbone for resource-constrained robotic and autonomous systems in engineering informatics applications.
用于运动控制的端到端强化学习(RL)直接从传感器输入到电机命令训练策略,为不同的机器人和任务实现统一的控制器。然而,大多数现有的方法要么是盲目的(只有本体感觉),要么依赖于融合主干,不利于计算内存的权衡。循环控制器与长期信用分配斗争,基于变压器的融合在令牌长度上产生二次成本,限制了时间和空间背景。我们提出了一个基于SSD- mamba2的视觉驱动的跨模态RL框架,这是一个选择性的状态空间主干,它应用状态空间对偶性(SSD)来实现循环扫描和卷积扫描,具有硬件感知流和近线性缩放。本体感觉状态和外感受观察(如深度标记)被编码成紧凑的标记,并通过堆叠的SSD-Mamba2层融合。选择性状态空间更新保留了长期依赖关系,比二次型自关注具有明显更低的延迟和内存使用,支持更长的前瞻性、更高的令牌分辨率和有限计算下的稳定训练。策略在随机化地形和外观并逐渐增加场景复杂性的课程中进行端到端训练。紧凑的、以国家为中心的奖励平衡了任务进度、能源效率和安全。在不同的运动控制场景中,我们的方法在回报、安全性(碰撞和坠落)和样本效率方面始终超过最先进的基线,同时在相同的计算预算下收敛得更快。这些结果表明,SSD-Mamba2为工程信息学应用中资源受限的机器人和自主系统提供了实用的融合骨干。
{"title":"Vision–proprioception fusion with Mamba2 in end-to-end reinforcement learning for motion control","authors":"Xiaowen Tao ,&nbsp;Yinuo Wang ,&nbsp;Jinzhao Zhou","doi":"10.1016/j.aei.2026.104389","DOIUrl":"10.1016/j.aei.2026.104389","url":null,"abstract":"<div><div>End-to-end reinforcement learning (RL) for motion control trains policies directly from sensor inputs to motor commands, enabling unified controllers for different robots and tasks. However, most existing methods are either <em>blind</em> (proprioception-only) or rely on fusion backbones with unfavorable compute–memory trade-offs. Recurrent controllers struggle with long-horizon credit assignment, and Transformer-based fusion incurs quadratic cost in token length, limiting temporal and spatial context. We present a vision-driven cross-modal RL framework built on <em>SSD-Mamba2</em>, a selective state–space backbone that applies <em>state–space duality (SSD)</em> to enable both recurrent and convolutional scanning with hardware-aware streaming and near-linear scaling. Proprioceptive states and exteroceptive observations (e.g., depth tokens) are encoded into compact tokens and fused by stacked SSD-Mamba2 layers. The selective state–space updates retain long-range dependencies with markedly lower latency and memory use than quadratic self-attention, enabling longer look-ahead, higher token resolution, and stable training under limited compute. Policies are trained end-to-end under curricula that randomize terrain and appearance and progressively increase scene complexity. A compact, state-centric reward balances task progress, energy efficiency, and safety. Across diverse motion-control scenarios, our approach consistently surpasses strong state-of-the-art baselines in return, safety (collisions and falls), and sample efficiency, while converging faster at the same compute budget. These results suggest that SSD-Mamba2 provides a practical fusion backbone for resource-constrained robotic and autonomous systems in engineering informatics applications.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104389"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146077879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FACTS: Training-free zero-shot diffusion framework for facade texture restoration in 3D urban models 事实:用于3D城市模型立面纹理恢复的无训练零射击扩散框架
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-24 DOI: 10.1016/j.aei.2026.104385
Juexiao Cheng , Xiangru Huang , Guanzhou Chen , Tong Wang , Jiaqi Wang , Xiaoliang Tan , Aiyi Jiang , Xiaodong Zhang
High-fidelity facade texture restoration is crucial for the realism and utility of 3D urban models in digital twin applications. Low-quality textures can compromise visualization, simulation accuracy, and decision-making. This challenge is particularly evident in Level of Detail 1 and 2 (LoD-1 and LoD-2) models, which represent buildings as basic massing models. In these models, textures baked from complex 3D mesh sources often suffer from geometric distortions, occlusions, and inconsistent illumination. To address these issues, we introduce FACTS (Facade Automated Correction and Texture Synthesis), a novel zero-shot, training-free framework for facade texture restoration. FACTS operates as an automated pipeline, taking 3D Mesh as input and producing geometrically and photometrically corrected models. Its key innovations are as follows: (1) a prompt-guided, occlusion-aware inpainting module that uses semantic guidance to repair missing texture regions; (2) a multi-scale edge-feature-guided diffusion process that enforces geometric consistency by leveraging structural priors extracted from the image; and (3) an efficient illumination harmonization method in the CIELAB color space to resolve lighting inconsistencies across texture patches. Recognizing that conventional metrics fail to assess architectural integrity, we propose three novel metrics: the Edge Straightness Score (ESS), Hough Transform Line Consistency (HTLC), and Linearity Index (LI). Our experiments on the SFDB and RUF-3D datasets show significant improvements over baselines. Specifically, FACTS improved ESS, HTLC, and LI scores on degraded textures by 40.69%, 11.16%, and 54.76%, respectively. The framework processes 2.5-megapixel texture in approximately 58.8 s on a single consumer-grade GPU. This work provides a scalable and interpretable solution for the automated restoration of defective facade textures, thereby enhancing the visual realism and structural accuracy of existing 3D urban models. Code and data available at https://github.com/CVEO/FACTS.
在数字孪生应用中,高保真立面纹理修复对于三维城市模型的真实感和实用性至关重要。低质量的纹理会影响可视化、模拟精度和决策。这一挑战在细节级别1和2 (LoD-1和LoD-2)模型中尤为明显,它们将建筑物表示为基本的体块模型。在这些模型中,从复杂的3D网格源烘烤的纹理经常遭受几何扭曲,遮挡和不一致的照明。为了解决这些问题,我们引入了FACTS(立面自动校正和纹理合成),这是一种新的零拍摄,无需训练的立面纹理恢复框架。FACTS作为自动化管道运行,以3D网格为输入,并产生几何和光度校正模型。其主要创新点如下:(1)基于语义引导修复缺失纹理区域的快速引导、闭塞感知的补图模块;(2)利用从图像中提取的结构先验来增强几何一致性的多尺度边缘特征引导扩散过程;(3)在CIELAB色彩空间中采用一种高效的光照协调方法来解决纹理斑块间的光照不一致问题。认识到传统的度量标准无法评估建筑的完整性,我们提出了三个新的度量标准:边缘直线度评分(ESS)、霍夫变换线一致性(HTLC)和线性度指数(LI)。我们在SFDB和RUF-3D数据集上的实验表明,与基线相比,我们有了显著的改进。具体来说,FACTS在退化纹理上分别提高了40.69%、11.16%和54.76%的ESS、HTLC和LI分数。该框架在单个消费级GPU上处理250万像素的纹理大约58.8秒。这项工作为有缺陷的立面纹理的自动修复提供了一个可扩展和可解释的解决方案,从而提高了现有3D城市模型的视觉真实感和结构准确性。代码和数据可在https://github.com/CVEO/FACTS上获得。
{"title":"FACTS: Training-free zero-shot diffusion framework for facade texture restoration in 3D urban models","authors":"Juexiao Cheng ,&nbsp;Xiangru Huang ,&nbsp;Guanzhou Chen ,&nbsp;Tong Wang ,&nbsp;Jiaqi Wang ,&nbsp;Xiaoliang Tan ,&nbsp;Aiyi Jiang ,&nbsp;Xiaodong Zhang","doi":"10.1016/j.aei.2026.104385","DOIUrl":"10.1016/j.aei.2026.104385","url":null,"abstract":"<div><div>High-fidelity facade texture restoration is crucial for the realism and utility of 3D urban models in digital twin applications. Low-quality textures can compromise visualization, simulation accuracy, and decision-making. This challenge is particularly evident in Level of Detail 1 and 2 (LoD-1 and LoD-2) models, which represent buildings as basic massing models. In these models, textures baked from complex 3D mesh sources often suffer from geometric distortions, occlusions, and inconsistent illumination. To address these issues, we introduce FACTS (Facade Automated Correction and Texture Synthesis), a novel zero-shot, training-free framework for facade texture restoration. FACTS operates as an automated pipeline, taking 3D Mesh as input and producing geometrically and photometrically corrected models. Its key innovations are as follows: (1) a prompt-guided, occlusion-aware inpainting module that uses semantic guidance to repair missing texture regions; (2) a multi-scale edge-feature-guided diffusion process that enforces geometric consistency by leveraging structural priors extracted from the image; and (3) an efficient illumination harmonization method in the CIELAB color space to resolve lighting inconsistencies across texture patches. Recognizing that conventional metrics fail to assess architectural integrity, we propose three novel metrics: the Edge Straightness Score (ESS), Hough Transform Line Consistency (HTLC), and Linearity Index (LI). Our experiments on the SFDB and RUF-3D datasets show significant improvements over baselines. Specifically, FACTS improved ESS, HTLC, and LI scores on degraded textures by 40.69%, 11.16%, and 54.76%, respectively. The framework processes 2.5-megapixel texture in approximately 58.8 s on a single consumer-grade GPU. This work provides a scalable and interpretable solution for the automated restoration of defective facade textures, thereby enhancing the visual realism and structural accuracy of existing 3D urban models. Code and data available at <span><span>https://github.com/CVEO/FACTS</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104385"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven modelling of unloading hours using explainable gradient boosting models 使用可解释的梯度提升模型的卸载时间数据驱动建模
IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-01 Epub Date: 2026-01-23 DOI: 10.1016/j.aei.2026.104353
Celal Cakiroglu , Najat Almasarwah , Mehmet Hakan Özdemir , Batin Latif Aylak , Manjeet Singh , Muhammet Deveci
Unloading processes denote the extraction of finished goods and raw materials from transport units and their subsequent conveyance to designated locations. The efficiency of unloading processes is vital in supply chain and logistics management, regarded as an essential component. Delays in unloading operations result in numerous challenges, including heightened operational expenses, diminished labour efficiency, and supply chain bottlenecks. Consequently, it is essential to ascertain unloading times beforehand to mitigate these challenges, resulting in diminished idle time, enhanced overall efficiency, and optimized scheduling. Therefore, precise prediction of unloading times is critically significant. The novelty of this study lies in the application of machine learning techniques to improve operational efficiency by accurately predicting unloading time. To that end, this study employed LightGBM and XGBoost to predict the unloading time in a real case. The unloading time can be predicted with R2 score greater than 0.99 utilizing both models. Subsequently, the SHapley Additive exPlanations (SHAP) methodology was used to ascertain how each input feature contributed to the model’s output. The load of leg significantly influences the unloading time more than the gross weight of truck and the leg distance.
卸货过程是指从运输单位提取制成品和原材料,并将其运送到指定地点。卸载过程的效率在供应链和物流管理中至关重要,被视为必不可少的组成部分。卸载作业的延迟会带来许多挑战,包括运营费用增加、劳动效率降低和供应链瓶颈。因此,必须事先确定卸载时间,以减轻这些挑战,从而减少闲置时间,提高整体效率,优化调度。因此,准确预测卸载时间至关重要。本研究的新颖之处在于应用机器学习技术,通过准确预测卸载时间来提高操作效率。为此,本研究采用LightGBM和XGBoost来预测实际情况下的卸载时间。两种模型均可预测卸载时间,R2评分均大于0.99。随后,使用SHapley加性解释(SHAP)方法来确定每个输入特征对模型输出的贡献。腿腿的载荷对卸载时间的影响比卡车总重和腿腿距离的影响更显著。
{"title":"Data-driven modelling of unloading hours using explainable gradient boosting models","authors":"Celal Cakiroglu ,&nbsp;Najat Almasarwah ,&nbsp;Mehmet Hakan Özdemir ,&nbsp;Batin Latif Aylak ,&nbsp;Manjeet Singh ,&nbsp;Muhammet Deveci","doi":"10.1016/j.aei.2026.104353","DOIUrl":"10.1016/j.aei.2026.104353","url":null,"abstract":"<div><div>Unloading processes denote the extraction of finished goods and raw materials from transport units and their subsequent conveyance to designated locations. The efficiency of unloading processes is vital in supply chain and logistics management, regarded as an essential component. Delays in unloading operations result in numerous challenges, including heightened operational expenses, diminished labour efficiency, and supply chain bottlenecks. Consequently, it is essential to ascertain unloading times beforehand to mitigate these challenges, resulting in diminished idle time, enhanced overall efficiency, and optimized scheduling. Therefore, precise prediction of unloading times is critically significant. The novelty of this study lies in the application of machine learning techniques to improve operational efficiency by accurately predicting unloading time. To that end, this study employed LightGBM and XGBoost to predict the unloading time in a real case. The unloading time can be predicted with <span><math><msup><mrow><mi>R</mi></mrow><mo>2</mo></msup></math></span> score greater than 0.99 utilizing both models. Subsequently, the SHapley Additive exPlanations (SHAP) methodology was used to ascertain how each input feature contributed to the model’s output. The load of leg significantly influences the unloading time more than the gross weight of truck and the leg distance.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"71 ","pages":"Article 104353"},"PeriodicalIF":9.9,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Advanced Engineering Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1