首页 > 最新文献

Automation in Construction最新文献

英文 中文
Weld seam extraction and path generation for robotic welding of steel structures based on 3D vision 基于三维视觉的钢结构机器人焊接焊缝提取与路径生成
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-27 DOI: 10.1016/j.autcon.2026.106792
Jinxin Yi , Xuan Kong , Hao Tang , Jie Zhang , Zhenming Chen , Lu Deng
Recent advances in computer vision have provided new solutions for intelligent welding. However, existing vision-based weld seam extraction techniques exhibit limited adaptability to various workpieces in unstructured environments. Therefore, this paper proposes a three-dimensional vision-based method tailored for weld seam extraction and path generation. The proposed method synergizes a deep learning-based point cloud segmentation technique with an improved multi-scale point cloud registration algorithm to reconstruct the complete point cloud model of all weld regions in the workpieces. Subsequently, the welding paths and torch poses are calculated using an optimized multi-plane fitting algorithm integrated with geometry model of weld seam. Experimental validation on four workpieces demonstrates that the proposed method achieves good accuracy and outperforms the existing techniques in terms of efficiency and applicability, offering a robust solution for automated welding of steel structures.
计算机视觉的最新进展为智能焊接提供了新的解决方案。然而,现有的基于视觉的焊缝提取技术对非结构化环境中各种工件的适应性有限。因此,本文提出了一种针对焊缝提取和路径生成的三维视觉方法。该方法将基于深度学习的点云分割技术与改进的多尺度点云配准算法相结合,重建工件中所有焊缝区域的完整点云模型。然后,结合焊缝几何模型,采用优化后的多平面拟合算法计算焊接路径和焊枪位姿。在4个工件上进行的实验验证表明,该方法具有良好的精度,在效率和适用性方面优于现有的方法,为钢结构自动化焊接提供了可靠的解决方案。
{"title":"Weld seam extraction and path generation for robotic welding of steel structures based on 3D vision","authors":"Jinxin Yi ,&nbsp;Xuan Kong ,&nbsp;Hao Tang ,&nbsp;Jie Zhang ,&nbsp;Zhenming Chen ,&nbsp;Lu Deng","doi":"10.1016/j.autcon.2026.106792","DOIUrl":"10.1016/j.autcon.2026.106792","url":null,"abstract":"<div><div>Recent advances in computer vision have provided new solutions for intelligent welding. However, existing vision-based weld seam extraction techniques exhibit limited adaptability to various workpieces in unstructured environments. Therefore, this paper proposes a three-dimensional vision-based method tailored for weld seam extraction and path generation. The proposed method synergizes a deep learning-based point cloud segmentation technique with an improved multi-scale point cloud registration algorithm to reconstruct the complete point cloud model of all weld regions in the workpieces. Subsequently, the welding paths and torch poses are calculated using an optimized multi-plane fitting algorithm integrated with geometry model of weld seam. Experimental validation on four workpieces demonstrates that the proposed method achieves good accuracy and outperforms the existing techniques in terms of efficiency and applicability, offering a robust solution for automated welding of steel structures.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106792"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physics-informed diffusion for visible-to-infrared domain translation of pavement crack images 路面裂缝图像可见到红外域转换的物理信息扩散
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-19 DOI: 10.1016/j.autcon.2026.106780
Zhikai Su , Mengnan Shi , Tianyu Gao , Jiaqi Hao , Hongtao Li , Qiang Yao
Infrared imaging is effective for pavement-crack detection under low-illumination conditions, but the scarcity of infrared datasets hinders its broader adoption. This paper proposes a Physics-Informed Diffusion Model to convert readily available visible-light crack images into physically consistent infrared images. The model integrates physical constraints within a Latent Diffusion Model and employs a Channel-Adaptive Dynamic Gamma Correction (CDGC) method to enhance thermally relevant feature representation. Experiments on a ground-truth infrared test set demonstrate that synthetic data generated by the proposed method substantially improves segmentation performance, achieving Pixel Accuracy (PA) of 0.9678 and Frequency-Weighted IoU (FW-IoU) of 0.9459. By obviating the costly, labor-intensive process of infrared dataset collection, the proposed approach facilitates the widespread adoption of infrared machine vision and visible–infrared fusion systems.
红外成像对于低照度条件下的路面裂缝检测是有效的,但红外数据集的稀缺性阻碍了其广泛应用。本文提出了一种物理信息扩散模型,将现成的可见光裂纹图像转换为物理一致的红外图像。该模型在潜在扩散模型中集成了物理约束,并采用通道自适应动态伽玛校正(CDGC)方法来增强热相关特征表示。在红外地面真值测试集上的实验表明,该方法生成的合成数据显著提高了分割性能,像素精度(PA)达到0.9678,频率加权IoU (FW-IoU)达到0.9459。通过避免昂贵、劳动密集型的红外数据集收集过程,该方法促进了红外机器视觉和可见-红外融合系统的广泛采用。
{"title":"Physics-informed diffusion for visible-to-infrared domain translation of pavement crack images","authors":"Zhikai Su ,&nbsp;Mengnan Shi ,&nbsp;Tianyu Gao ,&nbsp;Jiaqi Hao ,&nbsp;Hongtao Li ,&nbsp;Qiang Yao","doi":"10.1016/j.autcon.2026.106780","DOIUrl":"10.1016/j.autcon.2026.106780","url":null,"abstract":"<div><div>Infrared imaging is effective for pavement-crack detection under low-illumination conditions, but the scarcity of infrared datasets hinders its broader adoption. This paper proposes a Physics-Informed Diffusion Model to convert readily available visible-light crack images into physically consistent infrared images. The model integrates physical constraints within a Latent Diffusion Model and employs a Channel-Adaptive Dynamic Gamma Correction (CDGC) method to enhance thermally relevant feature representation. Experiments on a ground-truth infrared test set demonstrate that synthetic data generated by the proposed method substantially improves segmentation performance, achieving Pixel Accuracy (PA) of 0.9678 and Frequency-Weighted IoU (FW-IoU) of 0.9459. By obviating the costly, labor-intensive process of infrared dataset collection, the proposed approach facilitates the widespread adoption of infrared machine vision and visible–infrared fusion systems.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106780"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146000931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Margin-aware maximum classifier discrepancy for BIM-to-scan semantic segmentation of building point clouds 基于边缘感知的建筑点云bim -扫描语义分割最大分类器差异
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-24 DOI: 10.1016/j.autcon.2026.106799
Difeng Hu , You Dong , Mingkai Li , Hanmo Wang , Tao Wang
BIM-derived point clouds are valuable for semantic segmentation and BIM modeling, but distribution discrepancies between BIM and real-world scans significantly degrade segmentation performance. To mitigate this issue, this paper develops a margin-aware maximum classifier discrepancy (MMCD) method, which extends the conventional MCD framework by incorporating a margin-aware mechanism. Task-specific classifiers act as discriminators to encourage the feature generator to learn domain-invariant yet discriminative features for unlabeled real point clouds, improving BIM-to-scan distribution alignment and segmentation accuracy. A margin-aware discrepancy loss is formulated to enforce sufficient margin between features and classification boundaries, improving robustness to domain shift. In addition, a training strategy is proposed to support MMCD optimization. Finally, a refined RandLA-Net with an attention-based upsampling module is constructed as the backbone for validation. Experiments demonstrate that the proposed approach achieves superior performance, with an IoU of 72.79% and an overall accuracy of 87.99%, outperforming RandLA-Net variants with or without MCD.
BIM衍生的点云对于语义分割和BIM建模很有价值,但是BIM和真实扫描之间的分布差异会显著降低分割性能。为了解决这一问题,本文提出了一种边缘感知的最大分类器差异(MMCD)方法,该方法通过引入边缘感知机制扩展了传统的最大分类器差异框架。特定于任务的分类器充当鉴别器,以鼓励特征生成器学习未标记的真实点云的域不变但有区别的特征,从而提高bim到扫描的分布对齐和分割精度。制定了一个边界感知差异损失来强制特征和分类边界之间有足够的边界,提高了对域移位的鲁棒性。此外,提出了一种支持MMCD优化的训练策略。最后,构建了一个改进的基于注意力上采样模块的RandLA-Net作为验证的主干。实验表明,该方法取得了优异的性能,IoU为72.79%,总体准确率为87.99%,优于带或不带MCD的RandLA-Net变体。
{"title":"Margin-aware maximum classifier discrepancy for BIM-to-scan semantic segmentation of building point clouds","authors":"Difeng Hu ,&nbsp;You Dong ,&nbsp;Mingkai Li ,&nbsp;Hanmo Wang ,&nbsp;Tao Wang","doi":"10.1016/j.autcon.2026.106799","DOIUrl":"10.1016/j.autcon.2026.106799","url":null,"abstract":"<div><div>BIM-derived point clouds are valuable for semantic segmentation and BIM modeling, but distribution discrepancies between BIM and real-world scans significantly degrade segmentation performance. To mitigate this issue, this paper develops a margin-aware maximum classifier discrepancy (MMCD) method, which extends the conventional MCD framework by incorporating a margin-aware mechanism. Task-specific classifiers act as discriminators to encourage the feature generator to learn domain-invariant yet discriminative features for unlabeled real point clouds, improving BIM-to-scan distribution alignment and segmentation accuracy. A margin-aware discrepancy loss is formulated to enforce sufficient margin between features and classification boundaries, improving robustness to domain shift. In addition, a training strategy is proposed to support MMCD optimization. Finally, a refined RandLA-Net with an attention-based upsampling module is constructed as the backbone for validation. Experiments demonstrate that the proposed approach achieves superior performance, with an IoU of 72.79% and an overall accuracy of 87.99%, outperforming RandLA-Net variants with or without MCD.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106799"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146036034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-based computer vision methods for shield tunnel defect recognition 基于深度学习的盾构隧道缺陷识别方法
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-02-04 DOI: 10.1016/j.autcon.2026.106807
Ya-Dong Xue , Fei Jia , Wei Luo , Dong-Mei Zhang , Jie Liu , Yong-Fa Guo
With the rapid expansion of large-scale shield tunnel operations, deep learning has been extensively studied for automated defect recognition. This paper provides a comprehensive review of recent research in deep learning-based methods for tunnel defect recognition, organized into three key stages: dataset establishment, model development, and practical implementation. The review first details the acquisition and preprocessing of tunnel lining images obtained from various inspection equipment, followed by the establishment of defect datasets. It then provides a systematic overview of commonly used deep learning models for defect recognition, with a focus on three primary areas: defect detection, semantic, and instance segmentation, summarizing key innovations within each domain. Based on this analysis, current challenges are identified and future research directions are discussed for each stage. This review aims to promote the practical application of deep learning in tunnel engineering and to support the development of predictive and intelligent maintenance for shield tunnels.
随着大规模盾构隧道工程的迅速发展,深度学习在缺陷自动识别方面得到了广泛的研究。本文对基于深度学习的隧道缺陷识别方法的最新研究进行了全面综述,分为三个关键阶段:数据集建立、模型开发和实际实施。本文首先详细介绍了从各种检测设备获得的隧道衬砌图像的获取和预处理,然后建立了缺陷数据集。然后,它提供了一个用于缺陷识别的常用深度学习模型的系统概述,重点放在三个主要领域:缺陷检测、语义和实例分割,总结了每个领域内的关键创新。在此基础上,确定了当前面临的挑战,并对每个阶段的未来研究方向进行了讨论。本文综述旨在促进深度学习在隧道工程中的实际应用,支持盾构隧道预测和智能维修的发展。
{"title":"Deep learning-based computer vision methods for shield tunnel defect recognition","authors":"Ya-Dong Xue ,&nbsp;Fei Jia ,&nbsp;Wei Luo ,&nbsp;Dong-Mei Zhang ,&nbsp;Jie Liu ,&nbsp;Yong-Fa Guo","doi":"10.1016/j.autcon.2026.106807","DOIUrl":"10.1016/j.autcon.2026.106807","url":null,"abstract":"<div><div>With the rapid expansion of large-scale shield tunnel operations, deep learning has been extensively studied for automated defect recognition. This paper provides a comprehensive review of recent research in deep learning-based methods for tunnel defect recognition, organized into three key stages: dataset establishment, model development, and practical implementation. The review first details the acquisition and preprocessing of tunnel lining images obtained from various inspection equipment, followed by the establishment of defect datasets. It then provides a systematic overview of commonly used deep learning models for defect recognition, with a focus on three primary areas: defect detection, semantic, and instance segmentation, summarizing key innovations within each domain. Based on this analysis, current challenges are identified and future research directions are discussed for each stage. This review aims to promote the practical application of deep learning in tunnel engineering and to support the development of predictive and intelligent maintenance for shield tunnels.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106807"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Condition-aware AI framework for automated structural health monitoring 用于自动结构健康监测的状态感知AI框架
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-14 DOI: 10.1016/j.autcon.2025.106748
Hamed Hasani, Francesco Freddi
This study presents an AI-powered framework for automated structural health monitoring that integrates modal identification, anomaly detection, and damage localization under varying environmental and operational conditions. The approach combines stochastic subspace identification with frequency–spatial domain decomposition for automated modal extraction and a condition-aware anomaly detector based on a conditional variational autoencoder. A secondary SSA–OC-SVM module verifies and localizes damage. The methodology is validated on a laboratory-scale structure through 500 one-hour tests under temperature variations up to 35 °C and diverse loading conditions. The identified modes exhibit MAC = 0.99–1.00, confirming reliable automated identification. The CVAE reconstructs healthy-state modal frequencies with MAPE = 0.23%, RMSE = 0.027 Hz, and R2 = 0.836, effectively distinguishing environmental effects (0.27 pp) from genuine structural changes. The integrated framework further accurately localizes all induced damage scenarios across nine structural zones, demonstrating high accuracy, robustness, and scalability for next-generation SHM automation.
本研究提出了一个人工智能驱动的自动化结构健康监测框架,该框架集成了模态识别、异常检测和不同环境和操作条件下的损伤定位。该方法结合了随机子空间识别与频率-空间域分解的自动模态提取,以及基于条件变分自编码器的状态感知异常检测器。辅助SSA-OC-SVM模块对损坏进行验证和定位。该方法在实验室规模的结构上进行了500次一小时的测试,在温度变化高达35°C和各种负载条件下进行了验证。识别模式的MAC值为0.99-1.00,表明自动识别是可靠的。CVAE重构健康态模态频率,MAPE = 0.23%, RMSE = 0.027 Hz, R2 = 0.836,有效区分了环境影响(≤0.27 pp)和真实结构变化。集成框架进一步准确地定位了9个结构区域的所有诱发损伤场景,为下一代SHM自动化展示了高精度、鲁棒性和可扩展性。
{"title":"Condition-aware AI framework for automated structural health monitoring","authors":"Hamed Hasani,&nbsp;Francesco Freddi","doi":"10.1016/j.autcon.2025.106748","DOIUrl":"10.1016/j.autcon.2025.106748","url":null,"abstract":"<div><div>This study presents an AI-powered framework for automated structural health monitoring that integrates modal identification, anomaly detection, and damage localization under varying environmental and operational conditions. The approach combines stochastic subspace identification with frequency–spatial domain decomposition for automated modal extraction and a condition-aware anomaly detector based on a conditional variational autoencoder. A secondary SSA–OC-SVM module verifies and localizes damage. The methodology is validated on a laboratory-scale structure through 500 one-hour tests under temperature variations up to 35 °C and diverse loading conditions. The identified modes exhibit MAC = 0.99–1.00, confirming reliable automated identification. The CVAE reconstructs healthy-state modal frequencies with MAPE = 0.23%, RMSE = 0.027 Hz, and <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> = 0.836, effectively distinguishing environmental effects (<span><math><mrow><mo>≤</mo><mn>0</mn><mo>.</mo><mn>27</mn></mrow></math></span> pp) from genuine structural changes. The integrated framework further accurately localizes all induced damage scenarios across nine structural zones, demonstrating high accuracy, robustness, and scalability for next-generation SHM automation.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106748"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLM-enabled multi-agent framework for natural language interaction with graph-based digital twins 支持llm的多代理框架,用于与基于图的数字孪生进行自然语言交互
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-27 DOI: 10.1016/j.autcon.2026.106791
Yuandong Pan , Mudan Wang , Linjun Lu , Rabindra Lamsal , Erika Pärn , Sisi Zlatanova , Ioannis Brilakis
Digital twins are increasingly used in the Architecture, Engineering, and Construction (AEC) industry, but their adoption is often hindered by the need for specialised knowledge, such as database querying. This paper presents Graph-DT-GPT, a multi-agent framework that integrates Large Language Models (LLMs) with graph-based digital twins to enable natural language interaction. The framework is designed with modular agents, including decision, query generation, and answer extraction, and grounds all LLMs’ outputs in structured graph data to improve response reliability and reduce hallucinations. The framework is evaluated on two use cases: a city-level graph with over 40,000 building nodes and room-level apartment layout graphs. Graph-DT-GPT achieves 100% and 95.5% answer correctness using Claude Sonnet 4.5 and GPT-4o, respectively, in the city-scale case, and 100% correctness in the room-level case, significantly outperforming baseline methods including LangChain Neo4j pipelines by approximately 40% and 10%, respectively. These results demonstrate its scalability and potential to enhance accessible, accurate information retrieval in AEC digital twin applications.
数字双胞胎越来越多地用于架构、工程和建筑(AEC)行业,但它们的采用往往受到对专业知识(如数据库查询)的需求的阻碍。本文介绍了Graph-DT-GPT,这是一个多智能体框架,它将大型语言模型(llm)与基于图的数字双胞胎集成在一起,以实现自然语言交互。该框架采用模块化代理设计,包括决策、查询生成和答案提取,并将llm的所有输出基于结构化图数据,以提高响应可靠性并减少幻觉。该框架在两个用例上进行评估:包含超过40,000个建筑节点的城市级图和房间级公寓布局图。使用Claude Sonnet 4.5和gpt - 40, Graph-DT-GPT在城市规模的情况下分别达到100%和95.5%的答案正确性,在房间级别的情况下达到100%的正确性,显著优于包括LangChain Neo4j管道在内的基线方法,分别约为40%和10%。这些结果证明了它的可扩展性和潜力,以提高可访问的,准确的信息检索在AEC数字孪生应用。
{"title":"LLM-enabled multi-agent framework for natural language interaction with graph-based digital twins","authors":"Yuandong Pan ,&nbsp;Mudan Wang ,&nbsp;Linjun Lu ,&nbsp;Rabindra Lamsal ,&nbsp;Erika Pärn ,&nbsp;Sisi Zlatanova ,&nbsp;Ioannis Brilakis","doi":"10.1016/j.autcon.2026.106791","DOIUrl":"10.1016/j.autcon.2026.106791","url":null,"abstract":"<div><div>Digital twins are increasingly used in the Architecture, Engineering, and Construction (AEC) industry, but their adoption is often hindered by the need for specialised knowledge, such as database querying. This paper presents Graph-DT-GPT, a multi-agent framework that integrates Large Language Models (LLMs) with graph-based digital twins to enable natural language interaction. The framework is designed with modular agents, including decision, query generation, and answer extraction, and grounds all LLMs’ outputs in structured graph data to improve response reliability and reduce hallucinations. The framework is evaluated on two use cases: a city-level graph with over 40,000 building nodes and room-level apartment layout graphs. Graph-DT-GPT achieves 100% and 95.5% answer correctness using Claude Sonnet 4.5 and GPT-4o, respectively, in the city-scale case, and 100% correctness in the room-level case, significantly outperforming baseline methods including LangChain Neo4j pipelines by approximately 40% and 10%, respectively. These results demonstrate its scalability and potential to enhance accessible, accurate information retrieval in AEC digital twin applications.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106791"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight semantic segmentation for construction progress monitoring using 3D point clouds 基于三维点云的施工进度监测轻量级语义分割
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-17 DOI: 10.1016/j.autcon.2026.106765
Jinting Huang , Zhonghua Xiao , Ankang Ji , Limao Zhang
This paper proposes a lightweight semantic segmentation framework utilizing 3D point cloud data to enable automatic and rapid construction progress monitoring in high-rise building projects. This study centers on developing an efficient L-PointNet++ model that integrates self-attention mechanisms and MobileNetV3 modules, significantly reducing computational complexity and achieving a 95.63 % reduction in total training time compared to traditional PointNet++. A dual-stage training strategy is adopted to effectively address class imbalance, resulting in high segmentation accuracy with mean Intersection over Union (mIoU) values of 0.9308 for edge points and 0.9300 for corner points. Experimental results indicate that the developed framework can significantly enhance the speed and adaptability of as-built BIM model reconstruction and provide substantial improvements in decision-making efficiency and project management through the implementation of a visualization-based progress monitoring and early-warning system. Overall, the proposed approach demonstrates notable advantages in 3D reconstruction accuracy, speed, and project control, providing a robust solution for real-time construction progress monitoring applications.
本文提出了一种利用三维点云数据的轻量级语义分割框架,实现高层建筑项目施工进度的自动快速监测。本研究的重点是开发一个高效的l - pointnet++模型,该模型集成了自注意机制和MobileNetV3模块,显著降低了计算复杂度,与传统的pointnet++相比,总训练时间减少了95.63%。采用双阶段训练策略,有效解决了类不平衡问题,分割精度较高,边缘点的平均mIoU值为0.9308,角点的平均mIoU值为0.9300。实验结果表明,所开发的框架可以通过实施基于可视化的进度监测预警系统,显著提高BIM模型重建的速度和适应性,显著提高决策效率和项目管理水平。总体而言,该方法在三维重建精度、速度和项目控制方面具有显着优势,为实时施工进度监控应用提供了强大的解决方案。
{"title":"Lightweight semantic segmentation for construction progress monitoring using 3D point clouds","authors":"Jinting Huang ,&nbsp;Zhonghua Xiao ,&nbsp;Ankang Ji ,&nbsp;Limao Zhang","doi":"10.1016/j.autcon.2026.106765","DOIUrl":"10.1016/j.autcon.2026.106765","url":null,"abstract":"<div><div>This paper proposes a lightweight semantic segmentation framework utilizing 3D point cloud data to enable automatic and rapid construction progress monitoring in high-rise building projects. This study centers on developing an efficient L-PointNet++ model that integrates self-attention mechanisms and MobileNetV3 modules, significantly reducing computational complexity and achieving a 95.63 % reduction in total training time compared to traditional PointNet++. A dual-stage training strategy is adopted to effectively address class imbalance, resulting in high segmentation accuracy with mean Intersection over Union (mIoU) values of 0.9308 for edge points and 0.9300 for corner points. Experimental results indicate that the developed framework can significantly enhance the speed and adaptability of as-built BIM model reconstruction and provide substantial improvements in decision-making efficiency and project management through the implementation of a visualization-based progress monitoring and early-warning system. Overall, the proposed approach demonstrates notable advantages in 3D reconstruction accuracy, speed, and project control, providing a robust solution for real-time construction progress monitoring applications.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106765"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145995538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction productivity and digital technologies 建筑生产力和数字技术
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-19 DOI: 10.1016/j.autcon.2026.106768
Zijian Wang , Ronen Barak , Rafael Sacks , Sitsofe K. Yevu , Arnon Bentur , Georgios M. Hadjidemetriou
Although digital technologies are increasingly studied in construction, their specific impacts on productivity remain partially understood. This review aims to investigates the relationship between digital technologies and construction productivity. The methodology comprises a bibliometric analysis and a systematic literature review of studies published over the past decade. Scopus was selected as the primary database for data retrieval, with 346 publications across 16 journals being identified and analyzed. The bibliometric analysis reveals publication trends and technology interrelations, highlighting AI and optimization as central to a cohesive ecosystem involving BIM, digital twins, sensors, and robotics. The systematic literature review is structured around categorising the use of technology for productivity into four dimensions: measurement, estimation, optimisation, and enhancement. Despite this qualitative synthesis being influenced by the authors' judgement and subjectivity, it highlights the practical benefits such as improved prediction and automation, alongside challenges including data standardization, integration, and workforce adaptation.
尽管数字技术在建筑领域的研究越来越多,但它们对生产力的具体影响仍不完全清楚。本文旨在探讨数字技术与建筑生产率之间的关系。该方法包括文献计量分析和对过去十年发表的研究进行系统的文献综述。选择Scopus作为数据检索的主数据库,对16种期刊的346篇出版物进行了识别和分析。文献计量分析揭示了出版趋势和技术之间的相互关系,强调人工智能和优化是涉及BIM、数字双胞胎、传感器和机器人技术的凝聚力生态系统的核心。系统的文献综述围绕着将技术用于生产力的使用分为四个维度:测量、估计、优化和增强。尽管这种定性综合受到作者的判断和主观性的影响,但它强调了实际的好处,如改进的预测和自动化,以及包括数据标准化、集成和劳动力适应在内的挑战。
{"title":"Construction productivity and digital technologies","authors":"Zijian Wang ,&nbsp;Ronen Barak ,&nbsp;Rafael Sacks ,&nbsp;Sitsofe K. Yevu ,&nbsp;Arnon Bentur ,&nbsp;Georgios M. Hadjidemetriou","doi":"10.1016/j.autcon.2026.106768","DOIUrl":"10.1016/j.autcon.2026.106768","url":null,"abstract":"<div><div>Although digital technologies are increasingly studied in construction, their specific impacts on productivity remain partially understood. This review aims to investigates the relationship between digital technologies and construction productivity. The methodology comprises a bibliometric analysis and a systematic literature review of studies published over the past decade. Scopus was selected as the primary database for data retrieval, with 346 publications across 16 journals being identified and analyzed. The bibliometric analysis reveals publication trends and technology interrelations, highlighting AI and optimization as central to a cohesive ecosystem involving BIM, digital twins, sensors, and robotics. The systematic literature review is structured around categorising the use of technology for productivity into four dimensions: measurement, estimation, optimisation, and enhancement. Despite this qualitative synthesis being influenced by the authors' judgement and subjectivity, it highlights the practical benefits such as improved prediction and automation, alongside challenges including data standardization, integration, and workforce adaptation.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106768"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146000934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated construction progress monitoring and control through AI-based image recognition and BIM integration 通过基于ai的图像识别和BIM集成实现施工进度自动化监控
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-19 DOI: 10.1016/j.autcon.2026.106783
Chang-Cheng Hsieh, Hung-Ming Chen, Wan-Yu Chen, Ting-Yu Wu
This paper integrates AI image recognition and BIM technology to develop a prototype system that achieves automation and visualization of construction progress control. The system supports using a BIM model for planning deployment of multiple surveillance cameras to encompass the entire construction site. The real-time images captured by these cameras are processed using object detection technology to locate all actively constructed objects in the images and identify their respective construction phases. By integrating the perspectives of these cameras into the BIM model, the AI detection results from each camera image are automatically inputted into corresponding components of the BIM model. Subsequently, the real-time site progress information stored in the BIM model is compared with the planned schedule, and the comparative results are visually presented on the BIM model components in different colors. Through visualization, this approach enables management personnel to control progress in a specific and intuitive manner in real-time.
本文将AI图像识别与BIM技术相结合,开发了一个实现施工进度控制自动化、可视化的原型系统。系统支持使用BIM模型规划部署多个监控摄像头,覆盖整个施工现场。这些摄像机捕获的实时图像使用目标检测技术进行处理,定位图像中所有正在施工的物体,并识别其各自的施工阶段。通过将这些摄像头的视角整合到BIM模型中,每个摄像头图像的AI检测结果自动输入到BIM模型的相应组件中。随后,将BIM模型中存储的现场实时进度信息与计划进度进行对比,并将对比结果以不同颜色直观地呈现在BIM模型组件上。通过可视化,管理人员可以实时、具体、直观地控制进度。
{"title":"Automated construction progress monitoring and control through AI-based image recognition and BIM integration","authors":"Chang-Cheng Hsieh,&nbsp;Hung-Ming Chen,&nbsp;Wan-Yu Chen,&nbsp;Ting-Yu Wu","doi":"10.1016/j.autcon.2026.106783","DOIUrl":"10.1016/j.autcon.2026.106783","url":null,"abstract":"<div><div>This paper integrates AI image recognition and BIM technology to develop a prototype system that achieves automation and visualization of construction progress control. The system supports using a BIM model for planning deployment of multiple surveillance cameras to encompass the entire construction site. The real-time images captured by these cameras are processed using object detection technology to locate all actively constructed objects in the images and identify their respective construction phases. By integrating the perspectives of these cameras into the BIM model, the AI detection results from each camera image are automatically inputted into corresponding components of the BIM model. Subsequently, the real-time site progress information stored in the BIM model is compared with the planned schedule, and the comparative results are visually presented on the BIM model components in different colors. Through visualization, this approach enables management personnel to control progress in a specific and intuitive manner in real-time.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106783"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146000936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bridging dual knowledge graphs for multi-hop question answering in construction safety 面向建筑安全多跳问答的桥接双知识图
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-03-01 Epub Date: 2026-01-26 DOI: 10.1016/j.autcon.2026.106794
Yuxin Zhang, Xi Wang, Mo Hu, Zhenyu Zhang
Information retrieval and question answering from safety regulations are essential for automated construction compliance checking but are hindered by the linguistic and structural complexity of regulatory text. Many queries are multi-hop, requiring synthesis across interlinked clauses. To address the challenge, this paper introduces BifrostRAG, a dual-graph retrieval-augmented generation (RAG) system that models both linguistic relationships and document structure. The proposed architecture supports a hybrid retrieval mechanism that combines graph traversal with vector-based semantic search, enabling large language models to reason over both the content and the structure of the text. On a multi-hop question dataset, BifrostRAG achieves 92.8% precision, 85.5% recall, and an F1 score of 87.3%. These results significantly outperform vector-only and graph-only RAG baselines, establishing BifrostRAG as a robust knowledge engine for LLM-driven compliance checking. The dual-graph, hybrid retrieval mechanism presented in this paper offers a transferable blueprint for navigating complex technical documents across knowledge-intensive engineering domains.
安全法规的信息检索和问题回答对于自动化建筑符合性检查至关重要,但法规文本的语言和结构复杂性阻碍了这一点。许多查询是多跳的,需要在相互连接的子句之间进行合成。为了应对这一挑战,本文介绍了BifrostRAG,这是一个双图检索增强生成(RAG)系统,可以对语言关系和文档结构进行建模。提出的体系结构支持混合检索机制,该机制结合了图遍历和基于向量的语义搜索,使大型语言模型能够对文本的内容和结构进行推理。在多跳问题数据集上,BifrostRAG达到了92.8%的准确率,85.5%的召回率和87.3%的F1分数。这些结果明显优于纯矢量和纯图形的RAG基线,使BifrostRAG成为llm驱动的符合性检查的强大知识引擎。本文提出的双图混合检索机制为跨知识密集型工程领域的复杂技术文档导航提供了可转移的蓝图。
{"title":"Bridging dual knowledge graphs for multi-hop question answering in construction safety","authors":"Yuxin Zhang,&nbsp;Xi Wang,&nbsp;Mo Hu,&nbsp;Zhenyu Zhang","doi":"10.1016/j.autcon.2026.106794","DOIUrl":"10.1016/j.autcon.2026.106794","url":null,"abstract":"<div><div>Information retrieval and question answering from safety regulations are essential for automated construction compliance checking but are hindered by the linguistic and structural complexity of regulatory text. Many queries are multi-hop, requiring synthesis across interlinked clauses. To address the challenge, this paper introduces BifrostRAG, a dual-graph retrieval-augmented generation (RAG) system that models both linguistic relationships and document structure. The proposed architecture supports a hybrid retrieval mechanism that combines graph traversal with vector-based semantic search, enabling large language models to reason over both the content and the structure of the text. On a multi-hop question dataset, BifrostRAG achieves 92.8% precision, 85.5% recall, and an F1 score of 87.3%. These results significantly outperform vector-only and graph-only RAG baselines, establishing BifrostRAG as a robust knowledge engine for LLM-driven compliance checking. The dual-graph, hybrid retrieval mechanism presented in this paper offers a transferable blueprint for navigating complex technical documents across knowledge-intensive engineering domains.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106794"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146048561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Automation in Construction
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1