首页 > 最新文献

Automation in Construction最新文献

英文 中文
Accurate concrete spalling segmentation from bounding box supervision using Segment Anything 使用分段任何从边界盒监督混凝土剥落准确分割
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-01-03 DOI: 10.1016/j.autcon.2025.106752
Chen Zhang , Dhanada K. Mishra , Matthew M.F. Yuen , Yantao Yu , Jize Zhang
Accurate pixel-level segmentation of concrete spalling has been severely hampered by the prohibitive cost of manual annotation. This paper investigates how accurate pixel-level defect segmentation can be achieved using only low-cost weakly supervised bounding box annotations. A three-stage framework is proposed to generate and refine pseudo-masks from bounding boxes using the Segment Anything Model (SAM), dynamic self-correction, and inference-time fusion. The proposed method outperformed existing techniques by over 10% in F1 score on a large-scale spalling dataset. These findings establish the economic viability of deploying scalable automated inspection systems by drastically reducing data annotation costs, providing a practical and scalable pathway for spalling assessment.
人工标注的高昂成本严重阻碍了混凝土剥落的精确像素级分割。本文研究了如何使用低成本的弱监督边界框注释实现精确的像素级缺陷分割。提出了一种基于分段任意模型(SAM)、动态自校正和推理时间融合的三阶段框架,从边界框生成和细化伪掩码。该方法在大规模剥落数据集上的F1分数比现有技术高出10%以上。这些发现通过大幅降低数据注释成本,确立了部署可扩展自动检测系统的经济可行性,为剥落评估提供了实用且可扩展的途径。
{"title":"Accurate concrete spalling segmentation from bounding box supervision using Segment Anything","authors":"Chen Zhang ,&nbsp;Dhanada K. Mishra ,&nbsp;Matthew M.F. Yuen ,&nbsp;Yantao Yu ,&nbsp;Jize Zhang","doi":"10.1016/j.autcon.2025.106752","DOIUrl":"10.1016/j.autcon.2025.106752","url":null,"abstract":"<div><div>Accurate pixel-level segmentation of concrete spalling has been severely hampered by the prohibitive cost of manual annotation. This paper investigates how accurate pixel-level defect segmentation can be achieved using only low-cost weakly supervised bounding box annotations. A three-stage framework is proposed to generate and refine pseudo-masks from bounding boxes using the Segment Anything Model (SAM), dynamic self-correction, and inference-time fusion. The proposed method outperformed existing techniques by over 10% in F1 score on a large-scale spalling dataset. These findings establish the economic viability of deploying scalable automated inspection systems by drastically reducing data annotation costs, providing a practical and scalable pathway for spalling assessment.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106752"},"PeriodicalIF":11.5,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent prediction of TBM tunneling loads based on modal reconstruction and collaborative modeling 基于模态重构和协同建模的TBM隧道荷载智能预测
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-01-03 DOI: 10.1016/j.autcon.2025.106737
Kang Fu , Yiguo Xue , Daohong Qiu , Jingkai Qu , Huimin Gong
Accurate prediction of TBM tunneling loads is essential for enabling intelligent control. This paper proposes an intelligent prediction framework that integrates modal reconstruction with collaborative modeling. An improved Multivariate Variational Mode Decomposition (IMVMD) combined with Refined Composite Multiscale Diversity Entropy (RCMDE) is employed to extract the trend, seasonal, cyclic, and residual components of tunneling load signals. For each component, specialized predictive models, including Transformer, Bidirectional Gated Recurrent Unit (BiGRU), Convolutional Neural Network–Long Short-Term Memory (CNN-LSTM), and Extreme Gradient Boosting (XGBoost), are developed to construct a collaborative hybrid learning architecture. A CNN-LSTM-based error correction strategy is further introduced, resulting in a corrected hybrid learning (CHL) model that achieved an R2 of 0.9972, a MAPE of 0.66 %, and an MAE of 11.73, exceeding traditional models by more than 60 % on average. The proposed method provides reliable technical support for intelligent perception and automated control in TBM tunneling.
隧道掘进机掘进荷载的准确预测是实现智能控制的关键。提出了一种将模态重构与协同建模相结合的智能预测框架。采用改进的多元变分模态分解(IMVMD)和改进的复合多尺度多样性熵(RCMDE)方法提取隧道荷载信号的趋势分量、季节分量、循环分量和残差分量。对于每个组件,开发了专门的预测模型,包括变压器,双向门通循环单元(BiGRU),卷积神经网络长短期记忆(CNN-LSTM)和极端梯度增强(XGBoost),以构建协作混合学习架构。进一步引入了基于cnn - lstm的纠错策略,得到了修正后的混合学习(CHL)模型,其R2为0.9972,MAPE为0.66%,MAE为11.73,比传统模型平均提高了60%以上。该方法为隧道掘进机的智能感知和自动控制提供了可靠的技术支持。
{"title":"Intelligent prediction of TBM tunneling loads based on modal reconstruction and collaborative modeling","authors":"Kang Fu ,&nbsp;Yiguo Xue ,&nbsp;Daohong Qiu ,&nbsp;Jingkai Qu ,&nbsp;Huimin Gong","doi":"10.1016/j.autcon.2025.106737","DOIUrl":"10.1016/j.autcon.2025.106737","url":null,"abstract":"<div><div>Accurate prediction of TBM tunneling loads is essential for enabling intelligent control. This paper proposes an intelligent prediction framework that integrates modal reconstruction with collaborative modeling. An improved Multivariate Variational Mode Decomposition (IMVMD) combined with Refined Composite Multiscale Diversity Entropy (RCMDE) is employed to extract the trend, seasonal, cyclic, and residual components of tunneling load signals. For each component, specialized predictive models, including Transformer, Bidirectional Gated Recurrent Unit (BiGRU), Convolutional Neural Network–Long Short-Term Memory (CNN-LSTM), and Extreme Gradient Boosting (XGBoost), are developed to construct a collaborative hybrid learning architecture. A CNN-LSTM-based error correction strategy is further introduced, resulting in a corrected hybrid learning (CHL) model that achieved an <em>R</em><sup>2</sup> of 0.9972, a <em>MAPE</em> of 0.66 %, and an <em>MAE</em> of 11.73, exceeding traditional models by more than 60 % on average. The proposed method provides reliable technical support for intelligent perception and automated control in TBM tunneling.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106737"},"PeriodicalIF":11.5,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital twin–driven temperature field optimization in tunnel freezing restoration using particle swarm optimization 基于粒子群算法的隧道冻结修复温度场数字双驱动优化
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-01-02 DOI: 10.1016/j.autcon.2025.106747
Jie Zhou , Chao Ban , Chengjun Liu , Zeyao Li , Huade Zhou , Hsinming Shang
The distribution and evolution of temperature field are key concerns in freezing restoration projects, while traditional methods face limitations due to sparse sensor placement and simplified simulation inputs. More effective and accurate methods are needed to determine the temperature field. A PSO-based digital twin model was developed and validated with a tunnel freezing restoration project in Bangkok, Thailand. By integrating real-time field temperature data, the model enables dynamic optimization of parameters, enhancing the accuracy. Single-parameter optimization achieves fast convergence, ideal for early-stage calibration, while multi-parameter optimization improves performance under complex conditions. In these cases, PSO demonstrates better performance compared with GA and DE. When using multiple measurement points, the model may encounter local optima. The hybrid optimization strategy (GA-PSO) provides an effective pathway to mitigate the issue of local optima. This paper demonstrates the model feasibility and effectiveness, offering a practical approach for dynamic temperature management in complex freezing environments.
温度场的分布和演变是冻结恢复工程的关键问题,而传统的方法由于传感器布置的稀疏和模拟输入的简化而面临局限性。需要更有效和准确的方法来确定温度场。建立了基于pso的数字孪生模型,并通过泰国曼谷的隧道冻结修复项目进行了验证。该模型通过集成实时现场温度数据,实现了参数的动态优化,提高了精度。单参数优化实现了快速收敛,非常适合早期校准,而多参数优化提高了复杂条件下的性能。在这些情况下,粒子群算法比遗传算法和遗传算法表现出更好的性能。当使用多个测量点时,模型可能会遇到局部最优。混合优化策略(GA-PSO)为解决局部最优问题提供了有效途径。本文论证了该模型的可行性和有效性,为复杂冰冻环境下的动态温度管理提供了一种实用的方法。
{"title":"Digital twin–driven temperature field optimization in tunnel freezing restoration using particle swarm optimization","authors":"Jie Zhou ,&nbsp;Chao Ban ,&nbsp;Chengjun Liu ,&nbsp;Zeyao Li ,&nbsp;Huade Zhou ,&nbsp;Hsinming Shang","doi":"10.1016/j.autcon.2025.106747","DOIUrl":"10.1016/j.autcon.2025.106747","url":null,"abstract":"<div><div>The distribution and evolution of temperature field are key concerns in freezing restoration projects, while traditional methods face limitations due to sparse sensor placement and simplified simulation inputs. More effective and accurate methods are needed to determine the temperature field. A PSO-based digital twin model was developed and validated with a tunnel freezing restoration project in Bangkok, Thailand. By integrating real-time field temperature data, the model enables dynamic optimization of parameters, enhancing the accuracy. Single-parameter optimization achieves fast convergence, ideal for early-stage calibration, while multi-parameter optimization improves performance under complex conditions. In these cases, PSO demonstrates better performance compared with GA and DE. When using multiple measurement points, the model may encounter local optima. The hybrid optimization strategy (GA-PSO) provides an effective pathway to mitigate the issue of local optima. This paper demonstrates the model feasibility and effectiveness, offering a practical approach for dynamic temperature management in complex freezing environments.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106747"},"PeriodicalIF":11.5,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-supervised learning for multi-label sewer defect classification 多标签下水道缺陷分类的自监督学习
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-01-02 DOI: 10.1016/j.autcon.2025.106751
Tugba Yildizli , Tianlong Jia , Jeroen Langeveld , Riccardo Taormina
Automated sewer defect detection has advanced through deep learning, particularly supervised methods using CCTV images, but based on large annotated datasets. This paper proposes a semi-supervised learning (SSL) approach to reduce labeling demands. The method comprises self-supervised pre-training on unlabeled images using SwAV (Swapping Assignments between multiple Views) followed by fine-tuning for multi-label classification. Experiments on the Sewer-ML dataset demonstrate that the SSL approach, trained on only 35k labeled images, achieves an F1-score of 69.11%, and F2CIW of 54.22%, surpassing the fully supervised baseline trained from scratch on 1.04 million images. Increasing the unlabeled pre-training data further enhances performance, while ImageNet initialization consistently outperforms training from scratch. Self-supervised learning also helps mitigate the effects of mislabeled data, which is observed to be present even in the Sewer-ML ground truth. Overall, self-supervised learning provides an accurate, scalable, and cost-effective alternative to fully supervised approaches, particularly in data-scarce or imperfectly labeled scenarios.
自动化下水道缺陷检测通过深度学习取得了进展,特别是使用闭路电视图像的监督方法,但基于大型注释数据集。本文提出了一种半监督学习(SSL)方法来减少标注需求。该方法包括使用SwAV(在多个视图之间交换分配)对未标记图像进行自监督预训练,然后对多标签分类进行微调。在seur - ml数据集上的实验表明,SSL方法仅在35k标记图像上进行训练,f1得分为69.11%,F2CIW为54.22%,超过了在104万张图像上从头开始训练的完全监督基线。增加未标记的预训练数据进一步提高性能,而ImageNet初始化始终优于从头开始训练。自我监督学习还有助于减轻错误标记数据的影响,即使在下水道- ml的基础事实中也观察到这种情况。总的来说,自监督学习提供了一种准确的、可扩展的、具有成本效益的替代完全监督方法,特别是在数据稀缺或不完美标记的场景中。
{"title":"Self-supervised learning for multi-label sewer defect classification","authors":"Tugba Yildizli ,&nbsp;Tianlong Jia ,&nbsp;Jeroen Langeveld ,&nbsp;Riccardo Taormina","doi":"10.1016/j.autcon.2025.106751","DOIUrl":"10.1016/j.autcon.2025.106751","url":null,"abstract":"<div><div>Automated sewer defect detection has advanced through deep learning, particularly supervised methods using CCTV images, but based on large annotated datasets. This paper proposes a semi-supervised learning (SSL) approach to reduce labeling demands. The method comprises self-supervised pre-training on unlabeled images using SwAV (Swapping Assignments between multiple Views) followed by fine-tuning for multi-label classification. Experiments on the Sewer-ML dataset demonstrate that the SSL approach, trained on only 35k labeled images, achieves an F1-score of 69.11%, and F2<sub>CIW</sub> of 54.22%, surpassing the fully supervised baseline trained from scratch on 1.04 million images. Increasing the unlabeled pre-training data further enhances performance, while ImageNet initialization consistently outperforms training from scratch. Self-supervised learning also helps mitigate the effects of mislabeled data, which is observed to be present even in the Sewer-ML ground truth. Overall, self-supervised learning provides an accurate, scalable, and cost-effective alternative to fully supervised approaches, particularly in data-scarce or imperfectly labeled scenarios.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106751"},"PeriodicalIF":11.5,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collaborative learning architecture for autonomous excavator planning and execution 自主挖掘机规划与执行的协同学习体系结构
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2025-12-30 DOI: 10.1016/j.autcon.2025.106742
Junhyung Cho , Mingyu Shin , Joongheon Kim , Soyi Jung
Autonomous excavation systems face fundamental challenges balancing computational tractability with operational sophistication. This paper presents the collaborative learning for excavation framework (CLEF), resolving this trade-off through strategic decomposition: separating high-level planning from low-level execution while maintaining collaborative optimization. The framework’s key contributions include a bidirectional information flow between specialized modules consisting of reinforcement learning for strategic planning using polar coordinates, and attention-enhanced generative adversarial imitation learning (A-GAIL) with multi-head attention capturing phase-specific temporal dependencies. Unlike monolithic approaches suffering computational intractability, CLEF enables module specialization while coordinating through shared representations. Planning decisions condition trajectory generation while execution outcomes update environmental models, creating adaptive behavior without manual tuning. Validation demonstrates 90.8% success rate compared to 71.1% for monolithic approaches, with trajectory generation achieving 91.3% completion confirming superior performance essential for construction automation.
自主挖掘系统面临着平衡计算可追溯性和操作复杂性的根本挑战。本文提出了挖掘框架的协作学习(CLEF),通过战略分解解决了这种权衡:在保持协作优化的同时,将高级规划与低级执行分离开来。该框架的主要贡献包括专门模块之间的双向信息流,包括使用极坐标进行战略规划的强化学习,以及具有多头注意力捕获特定阶段时间依赖性的注意力增强生成对抗模仿学习(a - gail)。与遭受计算困难的整体方法不同,CLEF在通过共享表示进行协调时支持模块专门化。规划决策条件轨迹生成,而执行结果更新环境模型,无需手动调优即可创建自适应行为。验证的成功率为90.8%,而单片方法的成功率为71.1%,轨迹生成的完成率为91.3%,这证实了施工自动化所必需的卓越性能。
{"title":"Collaborative learning architecture for autonomous excavator planning and execution","authors":"Junhyung Cho ,&nbsp;Mingyu Shin ,&nbsp;Joongheon Kim ,&nbsp;Soyi Jung","doi":"10.1016/j.autcon.2025.106742","DOIUrl":"10.1016/j.autcon.2025.106742","url":null,"abstract":"<div><div>Autonomous excavation systems face fundamental challenges balancing computational tractability with operational sophistication. This paper presents the collaborative learning for excavation framework (CLEF), resolving this trade-off through strategic decomposition: separating high-level planning from low-level execution while maintaining collaborative optimization. The framework’s key contributions include a bidirectional information flow between specialized modules consisting of reinforcement learning for strategic planning using polar coordinates, and attention-enhanced generative adversarial imitation learning (A-GAIL) with multi-head attention capturing phase-specific temporal dependencies. Unlike monolithic approaches suffering computational intractability, CLEF enables module specialization while coordinating through shared representations. Planning decisions condition trajectory generation while execution outcomes update environmental models, creating adaptive behavior without manual tuning. Validation demonstrates 90.8% success rate compared to 71.1% for monolithic approaches, with trajectory generation achieving 91.3% completion confirming superior performance essential for construction automation.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106742"},"PeriodicalIF":11.5,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified data synthesis for automated 3D Visual Inspection and digital twinning of bridges 统一数据合成,实现桥梁自动三维视觉检测和数字孪生
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2025-12-29 DOI: 10.1016/j.autcon.2025.106741
Wang Wang , Mingjing Xu , Zhen Cao , Jingzi Guo , Chong Liu , Haowei Zhang , Xiaoling Zhang
The automation of 3D bridge inspection is critically limited by scarce annotated data and a fundamental lack of understanding regarding which intrinsic point cloud features drive Sim-to-Real (S2R) success. The paper proposes a unified procedural synthesis framework to overcome this data bottleneck. The core contributions are twofold: (1) Dual-output generation, which yields segmented ground truth and the first bridge component-level point cloud completion dataset via physical simulation. (2) Systematic feature ablation, establishing a definitive S2R importance hierarchy: Surface Normals Geometry > RGB. This finding offers critical guidance for efficient sensor deployment and data synthesis. A model trained exclusively on synthetic data achieved a satisfactory 84.2% mIoU on a real-world benchmark, validating direct S2R transfer and proving synthetic data can substitute manual annotation. The validated methodology provides the foundation to seamlessly integrate procedural damage models, extending automation from component identification to defect detection for analysis-ready digital twins.
3D桥梁检测的自动化受到缺乏注释数据和根本缺乏对内在点云特征驱动模拟到真实(S2R)成功的理解的严重限制。本文提出了一个统一的程序综合框架来克服这一数据瓶颈。核心贡献有两个方面:(1)双输出生成,通过物理模拟产生分段的地面真值和第一个桥梁组件级点云补全数据集。(2)系统的特征消融,建立了明确的S2R重要性层次:表面法线(Surface法线)>几何(Geometry) & RGB。这一发现为有效的传感器部署和数据合成提供了关键指导。在真实基准测试中,仅使用合成数据训练的模型获得了令人满意的84.2% mIoU,验证了直接S2R传输,并证明合成数据可以替代人工注释。经过验证的方法为无缝集成程序损坏模型提供了基础,将自动化从组件识别扩展到可用于分析的数字孪生的缺陷检测。
{"title":"Unified data synthesis for automated 3D Visual Inspection and digital twinning of bridges","authors":"Wang Wang ,&nbsp;Mingjing Xu ,&nbsp;Zhen Cao ,&nbsp;Jingzi Guo ,&nbsp;Chong Liu ,&nbsp;Haowei Zhang ,&nbsp;Xiaoling Zhang","doi":"10.1016/j.autcon.2025.106741","DOIUrl":"10.1016/j.autcon.2025.106741","url":null,"abstract":"<div><div>The automation of 3D bridge inspection is critically limited by scarce annotated data and a fundamental lack of understanding regarding which intrinsic point cloud features drive Sim-to-Real (S2R) success. The paper proposes a unified procedural synthesis framework to overcome this data bottleneck. The core contributions are twofold: (1) Dual-output generation, which yields segmented ground truth and the first bridge component-level point cloud completion dataset via physical simulation. (2) Systematic feature ablation, establishing a definitive S2R importance hierarchy: Surface Normals <span><math><mo>≫</mo></math></span> Geometry <span><math><mo>&gt;</mo></math></span> RGB. This finding offers critical guidance for efficient sensor deployment and data synthesis. A model trained exclusively on synthetic data achieved a satisfactory 84.2% mIoU on a real-world benchmark, validating direct S2R transfer and proving synthetic data can substitute manual annotation. The validated methodology provides the foundation to seamlessly integrate procedural damage models, extending automation from component identification to defect detection for analysis-ready digital twins.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106741"},"PeriodicalIF":11.5,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time stereo reconstruction and geometric quantification of pavement distress with a variable-baseline platform 基于变基线平台的路面损伤实时立体重建与几何量化
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2025-12-27 DOI: 10.1016/j.autcon.2025.106743
Guang-Zhu Zhang, Qingliang Xu, Hong-Feng Li, Chun-Peng Han, Qiushi Li
Accurate maintenance planning requires not only detecting pavement distress but also reconstructing its 3D geometry and reporting metrics. This paper develops Iterative Geometry Encoding Volume-Lite (IGEV-Lite), a compact derivative of IGEV-Stereo, and couples it with a variable-baseline stereo platform. IGEV-Lite adopts a GhostNetV2 backbone with feature transfer–re-encoding context network and a compact iterative updater, plus deployment accelerations; instance-level region of interest (ROI) cropping focuses computation, and plane-referenced, gridded integration yields maximum depth and integrated volume. Under a unified protocol, accuracy improves from an end-point error (EPE) of 0.608 to 0.584 px (pixels) and a disparity outlier rate (D1) of 3.24 % to 2.97 %, while latency drops from 135 ms to 97 ms. Quantification tests conducted at a perpendicular angle to the ground achieve 2.7 % depth and 0.9 % volume error at B = 240 mm. Combining a lightweight stereo backbone with plane-referenced integration provides deployment-ready, geometry-faithful quantification of distress.
准确的维护计划不仅需要检测路面破损情况,还需要重建其三维几何形状并报告指标。本文开发了IGEV-Lite (Iterative Geometry Encoding Volume-Lite, IGEV-Lite),这是IGEV-Stereo的紧凑导数,并将其与可变基线立体平台耦合在一起。IGEV-Lite采用GhostNetV2骨干网,具有特征传输-重新编码上下文网络和紧凑的迭代更新器,以及部署加速;实例级感兴趣区域(ROI)裁剪侧重于计算,平面参考的网格集成产生最大深度和集成体积。在统一协议下,准确率从0.608提高到0.584像素,视差异常率(D1)从3.24%提高到2.97%,延迟从135 ms下降到97 ms。在与地面垂直的角度下进行的量化试验,在B = 240 mm处深度误差为2.7%,体积误差为0.9%。将轻型立体主干与平面参考集成相结合,可提供部署就绪、几何可靠的遇险量化。
{"title":"Real-time stereo reconstruction and geometric quantification of pavement distress with a variable-baseline platform","authors":"Guang-Zhu Zhang,&nbsp;Qingliang Xu,&nbsp;Hong-Feng Li,&nbsp;Chun-Peng Han,&nbsp;Qiushi Li","doi":"10.1016/j.autcon.2025.106743","DOIUrl":"10.1016/j.autcon.2025.106743","url":null,"abstract":"<div><div>Accurate maintenance planning requires not only detecting pavement distress but also reconstructing its 3D geometry and reporting metrics. This paper develops Iterative Geometry Encoding Volume-Lite (IGEV-Lite), a compact derivative of IGEV-Stereo, and couples it with a variable-baseline stereo platform. IGEV-Lite adopts a GhostNetV2 backbone with feature transfer–re-encoding context network and a compact iterative updater, plus deployment accelerations; instance-level region of interest (ROI) cropping focuses computation, and plane-referenced, gridded integration yields maximum depth and integrated volume. Under a unified protocol, accuracy improves from an end-point error (EPE) of 0.608 to 0.584 px (pixels) and a disparity outlier rate (D1) of 3.24 % to 2.97 %, while latency drops from 135 ms to 97 ms. Quantification tests conducted at a perpendicular angle to the ground achieve 2.7 % depth and 0.9 % volume error at B = 240 mm. Combining a lightweight stereo backbone with plane-referenced integration provides deployment-ready, geometry-faithful quantification of distress.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106743"},"PeriodicalIF":11.5,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145844991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implicit neural representations for surrogate modeling in the built environment 建筑环境中代理建模的隐式神经表征
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2025-12-27 DOI: 10.1016/j.autcon.2025.106744
Sarah Mokhtar, Caitlin Mueller
Physical phenomena, including aerodynamics and heat transfer, exhibit complex shape-dependent relationships with building geometry, shaping microclimates that directly affect urban livability and comfort. In design and engineering practice, surrogate models reduce the computational burden of simulations, providing faster and more iterative performance feedback within design workflows. This paper introduces Per-FORM, a framework that leverages implicit neural representations (INRs) for predictive modeling in the built environment. The approach accommodates variations in geometric complexity, scale, and topology while representing continuous physical fields through decoupled modules encoding both geometry and building influence. Its ability to infer full-field and near-surface predictions is evaluated across multiple metrics, demonstrating state-of-the-art accuracy for complex geometries. Beyond predictive accuracy, Per-FORM brings simulation-in-the-loop feedback into digital workflows, supporting performance-informed exploration, ideation, and conceptualization, and enriching informed creative processes in design and engineering practice.
物理现象,包括空气动力学和热传递,与建筑几何形状表现出复杂的形状依赖关系,形成直接影响城市宜居性和舒适度的小气候。在设计和工程实践中,代理模型减少了模拟的计算负担,在设计工作流中提供更快、更迭代的性能反馈。本文介绍了Per-FORM,这是一个利用隐式神经表征(INRs)在建筑环境中进行预测建模的框架。该方法适应几何复杂性、规模和拓扑结构的变化,同时通过编码几何和建筑影响的解耦模块表示连续的物理场。通过多种指标评估其推断全场和近地表预测的能力,展示了复杂几何形状的最先进精度。除了预测准确性之外,Per-FORM还将循环仿真反馈带入数字工作流程,支持性能知情的探索、构思和概念化,并丰富设计和工程实践中的知情创意过程。
{"title":"Implicit neural representations for surrogate modeling in the built environment","authors":"Sarah Mokhtar,&nbsp;Caitlin Mueller","doi":"10.1016/j.autcon.2025.106744","DOIUrl":"10.1016/j.autcon.2025.106744","url":null,"abstract":"<div><div>Physical phenomena, including aerodynamics and heat transfer, exhibit complex shape-dependent relationships with building geometry, shaping microclimates that directly affect urban livability and comfort. In design and engineering practice, surrogate models reduce the computational burden of simulations, providing faster and more iterative performance feedback within design workflows. This paper introduces Per-FORM, a framework that leverages implicit neural representations (INRs) for predictive modeling in the built environment. The approach accommodates variations in geometric complexity, scale, and topology while representing continuous physical fields through decoupled modules encoding both geometry and building influence. Its ability to infer full-field and near-surface predictions is evaluated across multiple metrics, demonstrating state-of-the-art accuracy for complex geometries. Beyond predictive accuracy, Per-FORM brings simulation-in-the-loop feedback into digital workflows, supporting performance-informed exploration, ideation, and conceptualization, and enriching informed creative processes in design and engineering practice.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106744"},"PeriodicalIF":11.5,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145837256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent UAV-based deep learning system for multi-class concrete dam damage detection 基于无人机的多等级混凝土坝损伤检测深度学习系统
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2025-12-26 DOI: 10.1016/j.autcon.2025.106730
Ben Huang , Fei Kang , Xi Liu
Accurate damage detection is critical for ensuring the safety and long-term stability of dams. However, conventional inspection methods often suffer from low automation, high labor intensity, and high costs. To address these limitations, this paper proposes an intelligent detection system based on an enhanced YOLOX framework, designed for real-time identification of multiple damage types in concrete dams using unmanned aerial vehicles (UAVs). The improved model is lightweight, containing only 8.94 million parameters, yet achieves a mAP50 of 0.821 and an F1-score of 0.781. Based on this model, detection software was implemented with the PyQt5 framework, and an integrated UAV-based system was constructed to support high-precision, real-time analysis of both image and video data. This approach provides an automated and intelligent solution for the visual inspection of concrete dam damage, offering significant potential for practical engineering applications and future intelligent monitoring systems.
准确的损伤检测是保证大坝安全和长期稳定的关键。然而,传统的检测方法往往存在自动化程度低、劳动强度大、成本高等问题。为了解决这些限制,本文提出了一种基于增强型YOLOX框架的智能检测系统,用于使用无人机实时识别混凝土大坝中的多种损伤类型。改进后的模型是轻量级的,仅包含894万个参数,但mAP50为0.821,f1得分为0.781。基于该模型,采用PyQt5框架实现了检测软件,构建了基于无人机的集成系统,支持图像和视频数据的高精度、实时分析。该方法为混凝土大坝损伤目视检测提供了一种自动化、智能化的解决方案,为实际工程应用和未来的智能监测系统提供了巨大的潜力。
{"title":"Intelligent UAV-based deep learning system for multi-class concrete dam damage detection","authors":"Ben Huang ,&nbsp;Fei Kang ,&nbsp;Xi Liu","doi":"10.1016/j.autcon.2025.106730","DOIUrl":"10.1016/j.autcon.2025.106730","url":null,"abstract":"<div><div>Accurate damage detection is critical for ensuring the safety and long-term stability of dams. However, conventional inspection methods often suffer from low automation, high labor intensity, and high costs. To address these limitations, this paper proposes an intelligent detection system based on an enhanced YOLOX framework, designed for real-time identification of multiple damage types in concrete dams using unmanned aerial vehicles (UAVs). The improved model is lightweight, containing only 8.94 million parameters, yet achieves a mAP<sub>50</sub> of 0.821 and an <em>F</em><sub>1</sub>-score of 0.781. Based on this model, detection software was implemented with the PyQt5 framework, and an integrated UAV-based system was constructed to support high-precision, real-time analysis of both image and video data. This approach provides an automated and intelligent solution for the visual inspection of concrete dam damage, offering significant potential for practical engineering applications and future intelligent monitoring systems.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106730"},"PeriodicalIF":11.5,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145837257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing LLM-based building data query with chain-of-thought, retrieval-augmented generation, and fine-tuning 使用思维链、检索增强生成和微调增强基于llm的构建数据查询
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2025-12-26 DOI: 10.1016/j.autcon.2025.106738
Mingchen Li , Ziqi Hu , Parastoo Mohebi , Shuhao Li , Zhe Wang
To enhance energy efficiency and occupant satisfaction, modern buildings have collected rich streams of operational and sensor data. Semantic models for buildings, such as the Brick schema expressed in the Resource Description Framework (RDF) and Web Ontology Language (OWL), have standardized the representation of devices, points, and systems. However, non-expert users still faced barriers to accessing such data, because effective use required proficiency in the SPARQL Protocol and RDF Query Language (SPARQL) and navigation of thousands of interconnected nodes and relations. This paper presents BuildingGPT2, a framework that combined large language model fine-tuning, vector-graph retrieval-augmented generation, and chain-of-thought prompting to enable natural-language querying of Brick-based models. The framework was trained on semantic models from 40 real buildings and evaluated in a zero-shot setting on 5 held-out buildings. Using LLaMA 3.1–70B, SPARQL query generation accuracy improved from 49.25 % to 97.11 %, substantially lowering the barrier to interacting with building semantic models.
为了提高能源效率和居住者满意度,现代建筑收集了丰富的操作和传感器数据流。建筑物的语义模型,例如在资源描述框架(RDF)和Web本体语言(OWL)中表达的Brick模式,已经标准化了设备、点和系统的表示。然而,非专业用户在访问这些数据时仍然面临障碍,因为有效地使用这些数据需要熟练掌握SPARQL协议和RDF查询语言(SPARQL),以及在数千个相互连接的节点和关系中导航。本文介绍了BuildingGPT2,这是一个框架,它结合了大型语言模型微调、向量图检索增强生成和思维链提示,以实现基于砖块的模型的自然语言查询。该框架在40个真实建筑的语义模型上进行了训练,并在5个废弃建筑的零射击设置中进行了评估。使用LLaMA 3.1-70B, SPARQL查询生成准确率从49.25%提高到97.11%,大大降低了与构建语义模型交互的障碍。
{"title":"Enhancing LLM-based building data query with chain-of-thought, retrieval-augmented generation, and fine-tuning","authors":"Mingchen Li ,&nbsp;Ziqi Hu ,&nbsp;Parastoo Mohebi ,&nbsp;Shuhao Li ,&nbsp;Zhe Wang","doi":"10.1016/j.autcon.2025.106738","DOIUrl":"10.1016/j.autcon.2025.106738","url":null,"abstract":"<div><div>To enhance energy efficiency and occupant satisfaction, modern buildings have collected rich streams of operational and sensor data. Semantic models for buildings, such as the Brick schema expressed in the Resource Description Framework (RDF) and Web Ontology Language (OWL), have standardized the representation of devices, points, and systems. However, non-expert users still faced barriers to accessing such data, because effective use required proficiency in the SPARQL Protocol and RDF Query Language (SPARQL) and navigation of thousands of interconnected nodes and relations. This paper presents BuildingGPT2, a framework that combined large language model fine-tuning, vector-graph retrieval-augmented generation, and chain-of-thought prompting to enable natural-language querying of Brick-based models. The framework was trained on semantic models from 40 real buildings and evaluated in a zero-shot setting on 5 held-out buildings. Using LLaMA 3.1–70B, SPARQL query generation accuracy improved from 49.25 % to 97.11 %, substantially lowering the barrier to interacting with building semantic models.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106738"},"PeriodicalIF":11.5,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145837318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Automation in Construction
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1