首页 > 最新文献

Automation in Construction最新文献

英文 中文
AI-driven extraction of electrical circuits from floorplans for BIM ai驱动的从平面图中提取电路用于BIM
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-07 DOI: 10.1016/j.autcon.2025.106746
Martin Urbieta , Matias Urbieta , Guillermo Burriel
BIM solutions require a digital model as a foundation to optimize processes such as maintenance, infrastructure renovation, or demolition. However, a vast number of analog building plans are archived by public entities managing urban development, and manually converting these plans into digital models, which is prohibitively expensive. To address this gap, the paper introduces an approach for organizations who need to convert large datasets of legacy electrical floorplans into a BIM. The approach leverages a Machine Learning model for instance segmentation to detect electrical features, and the line-segment detection model DeepLSD for extracting cable traces. To support model training, a new dataset, referred as IPVBA-ELEC, is provided. The approach assembles circuits by establishing semantic relationships between circuit components and wires, and store them in an IFC file. Case studies were evaluated using quantitative and qualitative techniques yielding promising results and encouraging further research of additional MEP domains.
BIM解决方案需要一个数字模型作为基础,以优化维护、基础设施翻新或拆除等流程。然而,大量的模拟建筑计划是由管理城市发展的公共实体存档的,并且手动将这些计划转换为数字模型,这是非常昂贵的。为了解决这一差距,本文为需要将遗留电气平面图的大型数据集转换为BIM的组织介绍了一种方法。该方法利用机器学习模型进行实例分割以检测电气特征,并利用线段检测模型DeepLSD提取电缆轨迹。为了支持模型训练,提供了一个称为IPVBA-ELEC的新数据集。该方法通过建立电路元件和导线之间的语义关系来组装电路,并将其存储在IFC文件中。使用定量和定性技术对案例研究进行了评估,得出了有希望的结果,并鼓励对其他环境保护领域进行进一步研究。
{"title":"AI-driven extraction of electrical circuits from floorplans for BIM","authors":"Martin Urbieta ,&nbsp;Matias Urbieta ,&nbsp;Guillermo Burriel","doi":"10.1016/j.autcon.2025.106746","DOIUrl":"10.1016/j.autcon.2025.106746","url":null,"abstract":"<div><div>BIM solutions require a digital model as a foundation to optimize processes such as maintenance, infrastructure renovation, or demolition. However, a vast number of analog building plans are archived by public entities managing urban development, and manually converting these plans into digital models, which is prohibitively expensive. To address this gap, the paper introduces an approach for organizations who need to convert large datasets of legacy electrical floorplans into a BIM. The approach leverages a Machine Learning model for instance segmentation to detect electrical features, and the line-segment detection model DeepLSD for extracting cable traces. To support model training, a new dataset, referred as IPVBA-ELEC, is provided. The approach assembles circuits by establishing semantic relationships between circuit components and wires, and store them in an IFC file. Case studies were evaluated using quantitative and qualitative techniques yielding promising results and encouraging further research of additional MEP domains.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106746"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145921015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collaborative learning architecture for autonomous excavator planning and execution 自主挖掘机规划与执行的协同学习体系结构
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2025-12-30 DOI: 10.1016/j.autcon.2025.106742
Junhyung Cho , Mingyu Shin , Joongheon Kim , Soyi Jung
Autonomous excavation systems face fundamental challenges balancing computational tractability with operational sophistication. This paper presents the collaborative learning for excavation framework (CLEF), resolving this trade-off through strategic decomposition: separating high-level planning from low-level execution while maintaining collaborative optimization. The framework’s key contributions include a bidirectional information flow between specialized modules consisting of reinforcement learning for strategic planning using polar coordinates, and attention-enhanced generative adversarial imitation learning (A-GAIL) with multi-head attention capturing phase-specific temporal dependencies. Unlike monolithic approaches suffering computational intractability, CLEF enables module specialization while coordinating through shared representations. Planning decisions condition trajectory generation while execution outcomes update environmental models, creating adaptive behavior without manual tuning. Validation demonstrates 90.8% success rate compared to 71.1% for monolithic approaches, with trajectory generation achieving 91.3% completion confirming superior performance essential for construction automation.
自主挖掘系统面临着平衡计算可追溯性和操作复杂性的根本挑战。本文提出了挖掘框架的协作学习(CLEF),通过战略分解解决了这种权衡:在保持协作优化的同时,将高级规划与低级执行分离开来。该框架的主要贡献包括专门模块之间的双向信息流,包括使用极坐标进行战略规划的强化学习,以及具有多头注意力捕获特定阶段时间依赖性的注意力增强生成对抗模仿学习(a - gail)。与遭受计算困难的整体方法不同,CLEF在通过共享表示进行协调时支持模块专门化。规划决策条件轨迹生成,而执行结果更新环境模型,无需手动调优即可创建自适应行为。验证的成功率为90.8%,而单片方法的成功率为71.1%,轨迹生成的完成率为91.3%,这证实了施工自动化所必需的卓越性能。
{"title":"Collaborative learning architecture for autonomous excavator planning and execution","authors":"Junhyung Cho ,&nbsp;Mingyu Shin ,&nbsp;Joongheon Kim ,&nbsp;Soyi Jung","doi":"10.1016/j.autcon.2025.106742","DOIUrl":"10.1016/j.autcon.2025.106742","url":null,"abstract":"<div><div>Autonomous excavation systems face fundamental challenges balancing computational tractability with operational sophistication. This paper presents the collaborative learning for excavation framework (CLEF), resolving this trade-off through strategic decomposition: separating high-level planning from low-level execution while maintaining collaborative optimization. The framework’s key contributions include a bidirectional information flow between specialized modules consisting of reinforcement learning for strategic planning using polar coordinates, and attention-enhanced generative adversarial imitation learning (A-GAIL) with multi-head attention capturing phase-specific temporal dependencies. Unlike monolithic approaches suffering computational intractability, CLEF enables module specialization while coordinating through shared representations. Planning decisions condition trajectory generation while execution outcomes update environmental models, creating adaptive behavior without manual tuning. Validation demonstrates 90.8% success rate compared to 71.1% for monolithic approaches, with trajectory generation achieving 91.3% completion confirming superior performance essential for construction automation.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106742"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collaborative inspection for large-scale urban sewer pipe networks by coupling multiple robotic pipe capsules and spatial optimization 基于多机器人管道胶囊耦合和空间优化的大型城市污水管网协同检测
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-12 DOI: 10.1016/j.autcon.2026.106763
Wei Tu , Yu Gu , Ruizhe Chen , Xing Zhang , Jiasong Zhu , Chisheng Wang , Qingquan Li
Urban sewer pipelines are prone to diverse faults, such as cracks, erosion, and root intrusion. Effective and efficient inspection methods are essential for large-scale urban sewer pipe networks. This paper presented a collaborative inspection approach to inspect urban sewer pipes, which integrates robotic pipe capsules (RPCs) with lightweight deep learning and spatial optimization. A bi-level network is built to represent diverse movements of workers and the RPCs and their collaboration. A specialized lightweight deep neural network is designed to identify faults with images captured by PRC in real time. The worker and RPC routes are spatially optimized with hybrid meta-heuristics. An experiment in Shenzhen, China, demonstrated that it achieves a balanced accuracy of 83.43% with 7.64 frames per second, which outperforms baseline methods. The presented method provides an alternative approach for large-scale urban sewer pipe networks.
城市污水管道容易出现裂缝、侵蚀、根部侵入等多种故障。有效和高效的检测方法是大型城市污水管网的必要条件。本文提出了一种将机器人管道胶囊(rpc)与轻量级深度学习和空间优化相结合的城市下水道管道协同检测方法。建立了一个双层网络,以代表不同的工人运动和rpc及其合作。设计了一种专门的轻量级深度神经网络,利用PRC捕获的图像实时识别故障。使用混合元启发式方法对worker和RPC路由进行空间优化。在中国深圳进行的实验表明,该方法以每秒7.64帧的速度达到了83.43%的平衡精度,优于基准方法。该方法为大规模城市污水管网提供了另一种方法。
{"title":"Collaborative inspection for large-scale urban sewer pipe networks by coupling multiple robotic pipe capsules and spatial optimization","authors":"Wei Tu ,&nbsp;Yu Gu ,&nbsp;Ruizhe Chen ,&nbsp;Xing Zhang ,&nbsp;Jiasong Zhu ,&nbsp;Chisheng Wang ,&nbsp;Qingquan Li","doi":"10.1016/j.autcon.2026.106763","DOIUrl":"10.1016/j.autcon.2026.106763","url":null,"abstract":"<div><div>Urban sewer pipelines are prone to diverse faults, such as cracks, erosion, and root intrusion. Effective and efficient inspection methods are essential for large-scale urban sewer pipe networks. This paper presented a collaborative inspection approach to inspect urban sewer pipes, which integrates robotic pipe capsules (RPCs) with lightweight deep learning and spatial optimization. A bi-level network is built to represent diverse movements of workers and the RPCs and their collaboration. A specialized lightweight deep neural network is designed to identify faults with images captured by PRC in real time. The worker and RPC routes are spatially optimized with hybrid meta-heuristics. An experiment in Shenzhen, China, demonstrated that it achieves a balanced accuracy of 83.43% with 7.64 frames per second, which outperforms baseline methods. The presented method provides an alternative approach for large-scale urban sewer pipe networks.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106763"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145956563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multitask unified large vision-language model for post-earthquake structural damage assessment of buildings 地震后建筑物结构损伤评估的多任务统一大视觉语言模型
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2025-12-22 DOI: 10.1016/j.autcon.2025.106720
Yongqing Jiang , Jianze Wang , Xinyi Shen , Kaoshan Dai , Qingzi Ge
Rapid and accurate damage assessment of structures is critical for post-earthquake recovery and emergency response. Current evaluations are heavily reliant on on-site visual inspections conducted by engineering experts, which are time-consuming and resource-intensive. To this end, the large vision-language model (VLM) for multitask structural damage assessment chatbot (MT-SDAChat) is developed in this paper. It can perform both image-level and regional-level inference analysis, accurately locating and providing specific information about various structural components and damage locations. With the MT-SDAChat, a two-stage automated assessment framework that transitions from a global perspective to a component-specific perspective is proposed. A dataset containing 3348 image-text pairs of seismic structural damage with multiple attributes has been constructed. Experimental results show that MT-SDAChat performs well in multitask evaluation. It achieves a question-and-answer accuracy of 82.92 % and a localization accuracy of 78.6 %. These results highlight its strong zero-shot capability across various damage assessments in building construction.
快速准确的结构损伤评估对于震后恢复和应急响应至关重要。目前的评估严重依赖工程专家进行的现场目视检查,这既耗时又耗费资源。为此,本文开发了多任务结构损伤评估聊天机器人(MT-SDAChat)的大视觉语言模型(VLM)。它可以进行图像级和区域级的推理分析,准确定位并提供各种结构部件和损伤位置的具体信息。使用MT-SDAChat,提出了一个从全局视角到特定组件视角的两阶段自动评估框架。构建了包含3348对多属性地震结构损伤的图像-文本数据集。实验结果表明,MT-SDAChat在多任务评估中表现良好。该方法的问答准确率为82.92%,定位准确率为78.6%。这些结果突出了它在各种建筑施工损伤评估中的强大零射击能力。
{"title":"Multitask unified large vision-language model for post-earthquake structural damage assessment of buildings","authors":"Yongqing Jiang ,&nbsp;Jianze Wang ,&nbsp;Xinyi Shen ,&nbsp;Kaoshan Dai ,&nbsp;Qingzi Ge","doi":"10.1016/j.autcon.2025.106720","DOIUrl":"10.1016/j.autcon.2025.106720","url":null,"abstract":"<div><div>Rapid and accurate damage assessment of structures is critical for post-earthquake recovery and emergency response. Current evaluations are heavily reliant on on-site visual inspections conducted by engineering experts, which are time-consuming and resource-intensive. To this end, the large vision-language model (VLM) for multitask structural damage assessment chatbot (MT-SDAChat) is developed in this paper. It can perform both image-level and regional-level inference analysis, accurately locating and providing specific information about various structural components and damage locations. With the MT-SDAChat, a two-stage automated assessment framework that transitions from a global perspective to a component-specific perspective is proposed. A dataset containing 3348 image-text pairs of seismic structural damage with multiple attributes has been constructed. Experimental results show that MT-SDAChat performs well in multitask evaluation. It achieves a question-and-answer accuracy of 82.92 % and a localization accuracy of 78.6 %. These results highlight its strong zero-shot capability across various damage assessments in building construction.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106720"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145813790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping digital twin applications in infrastructure and the built environment across research types, methods, sectors, phases, and scales 绘制跨研究类型、方法、部门、阶段和规模的基础设施和建筑环境中的数字孪生应用
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-13 DOI: 10.1016/j.autcon.2026.106778
Soheila Kookalani , Stephen Green , Peihang Luo , Hamidreza Alavi , Erika Parn , Zhaojie Sun , Ioannis Brilakis
Digital Twin technologies are increasingly used in infrastructure and the built environment to create dynamic, data-driven models of physical assets and processes. This review analyses recent advancements across sectors such as tunnels, bridges, roads, buildings, construction management, and urban planning, covering all life-cycle phases from design to operation. Integrating Digital Twins with Building Information Modelling, Internet of Things sensors, and Artificial Intelligence enhances real-time monitoring, decision-making, and asset performance. Key methods include monitoring, modelling, and simulation, which improve resource use and proactive maintenance. However, adoption faces challenges such as poor data interoperability, high costs, and technical complexity in merging multiple technologies. Ethical and governance issues around data privacy and security also persist. The review identifies future research needs in improving interoperability, expanding predictive analytics, and assessing large-scale impacts. It highlights Digital Twins' potential to improve resilience, efficiency, and sustainability, stressing the need for policy support and stakeholder collaboration.
数字孪生技术越来越多地用于基础设施和建筑环境,以创建物理资产和流程的动态数据驱动模型。本综述分析了隧道、桥梁、道路、建筑、施工管理和城市规划等领域的最新进展,涵盖了从设计到运营的所有生命周期阶段。将数字孪生与建筑信息模型、物联网传感器和人工智能相结合,增强实时监控、决策和资产绩效。关键方法包括监控、建模和仿真,这些方法可以改善资源利用和主动维护。然而,采用面临着诸如数据互操作性差、成本高以及合并多种技术时的技术复杂性等挑战。围绕数据隐私和安全的道德和治理问题也依然存在。该综述确定了未来在改进互操作性、扩展预测分析和评估大规模影响方面的研究需求。报告强调了数字孪生在提高韧性、效率和可持续性方面的潜力,强调了政策支持和利益相关者合作的必要性。
{"title":"Mapping digital twin applications in infrastructure and the built environment across research types, methods, sectors, phases, and scales","authors":"Soheila Kookalani ,&nbsp;Stephen Green ,&nbsp;Peihang Luo ,&nbsp;Hamidreza Alavi ,&nbsp;Erika Parn ,&nbsp;Zhaojie Sun ,&nbsp;Ioannis Brilakis","doi":"10.1016/j.autcon.2026.106778","DOIUrl":"10.1016/j.autcon.2026.106778","url":null,"abstract":"<div><div>Digital Twin technologies are increasingly used in infrastructure and the built environment to create dynamic, data-driven models of physical assets and processes. This review analyses recent advancements across sectors such as tunnels, bridges, roads, buildings, construction management, and urban planning, covering all life-cycle phases from design to operation. Integrating Digital Twins with Building Information Modelling, Internet of Things sensors, and Artificial Intelligence enhances real-time monitoring, decision-making, and asset performance. Key methods include monitoring, modelling, and simulation, which improve resource use and proactive maintenance. However, adoption faces challenges such as poor data interoperability, high costs, and technical complexity in merging multiple technologies. Ethical and governance issues around data privacy and security also persist. The review identifies future research needs in improving interoperability, expanding predictive analytics, and assessing large-scale impacts. It highlights Digital Twins' potential to improve resilience, efficiency, and sustainability, stressing the need for policy support and stakeholder collaboration.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106778"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145962000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pixel-level image localization for updating 3D digital twins of dams using frequency convolutional networks 基于频率卷积网络的大坝三维数字孪生体的像素级图像定位
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2025-12-14 DOI: 10.1016/j.autcon.2025.106715
Hang Zhao , Vahidreza Gharehbaghi , Caroline Bennett , Rémy D. Lequesne , Jian Li
This paper presents PIL3D, an automated pixel-level image localization framework for maintaining up-to-date 3D digital twins of large-scale civil infrastructure, with a focus on dam structures. Unlike conventional 3D model updating approaches that require extensive manual data acquisition and labor-intensive processing, PIL3D automatically predicts the 3D coordinates of every pixel in an input image relative to an existing model, enabling fully automated dense pixel-to-point correspondences. Experimental validation on a real-world dam case demonstrates centimeter-level localization accuracy, significantly reducing manual intervention, data collection requirements, and computational demand. By integrating PIL3D into digital twin workflows, infrastructure inspection, monitoring, and maintenance can be streamlined into a continuous, automated process, advancing the state of automation in construction and asset management.
本文介绍了PIL3D,这是一个用于维护大型民用基础设施的最新3D数字孪生的自动化像素级图像定位框架,重点是大坝结构。传统的3D模型更新方法需要大量的人工数据采集和劳动密集型处理,与之不同的是,PIL3D可以自动预测输入图像中每个像素相对于现有模型的3D坐标,实现完全自动化的密集像素对点对应。在实际大坝案例上的实验验证证明了厘米级的定位精度,大大减少了人工干预、数据收集要求和计算需求。通过将PIL3D集成到数字孪生工作流程中,基础设施检查、监控和维护可以简化为一个连续的自动化过程,从而提高了建筑和资产管理的自动化水平。
{"title":"Pixel-level image localization for updating 3D digital twins of dams using frequency convolutional networks","authors":"Hang Zhao ,&nbsp;Vahidreza Gharehbaghi ,&nbsp;Caroline Bennett ,&nbsp;Rémy D. Lequesne ,&nbsp;Jian Li","doi":"10.1016/j.autcon.2025.106715","DOIUrl":"10.1016/j.autcon.2025.106715","url":null,"abstract":"<div><div>This paper presents PIL3D, an automated pixel-level image localization framework for maintaining up-to-date 3D digital twins of large-scale civil infrastructure, with a focus on dam structures. Unlike conventional 3D model updating approaches that require extensive manual data acquisition and labor-intensive processing, PIL3D automatically predicts the 3D coordinates of every pixel in an input image relative to an existing model, enabling fully automated dense pixel-to-point correspondences. Experimental validation on a real-world dam case demonstrates centimeter-level localization accuracy, significantly reducing manual intervention, data collection requirements, and computational demand. By integrating PIL3D into digital twin workflows, infrastructure inspection, monitoring, and maintenance can be streamlined into a continuous, automated process, advancing the state of automation in construction and asset management.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106715"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145759776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Excavator trajectory planning via global probabilistic learning from expert demonstrations 基于专家演示的全局概率学习的挖掘机轨迹规划
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2025-12-20 DOI: 10.1016/j.autcon.2025.106736
Chenlong Feng , Quan Zhang , Jixin Wang , Xinxing Liu , Yuying Shen , Qingzheng Jia , Jiazhi Zhao
Excavator trajectory planning remains challenging due to dependence on expert skill, changing tasks, and complex environments. This paper integrates global probabilistic modeling of expert demonstrations with sampling-based optimization to enable flexible, efficient, and safe autonomous operation. A Global Modulated Movement Primitive (GMMP) model captures global evolution of expert demonstration trajectories in SE(3) space, the 3D rigid-body pose space that combines orientation and translation. A Bayesian update supports efficient task generalization by adjusting new via points. The workspace density of excavator is introduced to enable the transfer of GMMP across different excavator without retraining. A Guided Model Predictive Path Integral (GMPPI) method with SE(3)-consistency cost optimizes GMMP generated trajectories via sampling, handling obstacle avoidance and execution constraints. The method was validated on a full-size excavator and a scaled platform. Results show improved trajectory similarity, execution efficiency, and task adaptability, indicating strong practicality.
由于对专家技能的依赖、不断变化的任务和复杂的环境,挖掘机轨迹规划仍然具有挑战性。本文将专家演示的全局概率建模与基于采样的优化相结合,实现灵活、高效、安全的自主运行。全局调制运动原语(GMMP)模型捕获了SE(3)空间中专家演示轨迹的全局演化,SE(3)空间是结合了方向和平移的3D刚体姿态空间。贝叶斯更新通过调整新的通过点来支持有效的任务泛化。引入挖掘机的工作空间密度,使GMMP在不同挖掘机之间的转移无需再培训。一种具有SE(3)-一致性代价的引导模型预测路径积分(GMPPI)方法通过采样、避障处理和执行约束对GMPPI生成的轨迹进行优化。该方法在一台全尺寸挖掘机和一个规模化平台上进行了验证。结果表明,该方法提高了轨迹相似度、执行效率和任务适应性,具有较强的实用性。
{"title":"Excavator trajectory planning via global probabilistic learning from expert demonstrations","authors":"Chenlong Feng ,&nbsp;Quan Zhang ,&nbsp;Jixin Wang ,&nbsp;Xinxing Liu ,&nbsp;Yuying Shen ,&nbsp;Qingzheng Jia ,&nbsp;Jiazhi Zhao","doi":"10.1016/j.autcon.2025.106736","DOIUrl":"10.1016/j.autcon.2025.106736","url":null,"abstract":"<div><div>Excavator trajectory planning remains challenging due to dependence on expert skill, changing tasks, and complex environments. This paper integrates global probabilistic modeling of expert demonstrations with sampling-based optimization to enable flexible, efficient, and safe autonomous operation. A Global Modulated Movement Primitive (GMMP) model captures global evolution of expert demonstration trajectories in SE(3) space, the 3D rigid-body pose space that combines orientation and translation. A Bayesian update supports efficient task generalization by adjusting new via points. The workspace density of excavator is introduced to enable the transfer of GMMP across different excavator without retraining. A Guided Model Predictive Path Integral (GMPPI) method with SE(3)-consistency cost optimizes GMMP generated trajectories via sampling, handling obstacle avoidance and execution constraints. The method was validated on a full-size excavator and a scaled platform. Results show improved trajectory similarity, execution efficiency, and task adaptability, indicating strong practicality.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106736"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145785094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frequency-aware crack segmentation network (FACS-net) and crack topology loss (CT-loss) for thin cracks 薄裂纹的频率感知裂纹分割网络(FACS-net)和裂纹拓扑损失(CT-loss)
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2025-12-15 DOI: 10.1016/j.autcon.2025.106719
Siheon Joo, Seokhwan Kim, Hongjo Kim
Structural crack analysis is vital for infrastructure safety, but existing segmentation models often miss fine cracks due to spectral bias in deep networks. This especially affects thin cracks, which are frequently underrepresented. This paper presents FACS-Net, a Frequency-Aware Crack Segmentation Network with Crack Topology Loss (CT-Loss), to mitigate spectral bias and enhance crack-specific representations. FACS-Net employs frequency-aware attention for decoding, while CT-Loss explicitly incorporates boundary accuracy and structural continuity into the learning objective. Given the high edge-to-area ratio of thin cracks, the proposed approach ensures accurate localization without sacrificing topological coherence. Evaluation on CrackVision12K shows that FACS-Net significantly improves detection of thin cracks (width 2 px), outperforming Hybrid-Segmentor by 0.306 IoU and 0.360 CTS. Overall, FACS-Net achieves state-of-the-art performance with 0.663 IoU and 0.651 CTS, demonstrating precise segmentation and robust structural preservation.
结构裂缝分析对基础设施安全至关重要,但现有的分割模型由于深度网络中的频谱偏差,往往会遗漏细微裂缝。这尤其影响到薄裂缝,而薄裂缝经常被低估。本文提出了FACS-Net,一种带有裂纹拓扑损失(CT-Loss)的频率感知裂纹分割网络,以减轻频谱偏差并增强裂纹特定表示。FACS-Net采用频率感知注意力进行解码,而CT-Loss明确地将边界精度和结构连续性纳入学习目标。考虑到薄裂纹的高边面积比,该方法在不牺牲拓扑相干性的情况下保证了精确的定位。在CrackVision12K上的评估表明,FACS-Net显著提高了薄裂纹(宽度≤2 px)的检测,比Hybrid-Segmentor高出0.306 IoU和0.360 CTS。总体而言,FACS-Net以0.663 IoU和0.651 CTS实现了最先进的性能,展示了精确的分割和强大的结构保存。
{"title":"Frequency-aware crack segmentation network (FACS-net) and crack topology loss (CT-loss) for thin cracks","authors":"Siheon Joo,&nbsp;Seokhwan Kim,&nbsp;Hongjo Kim","doi":"10.1016/j.autcon.2025.106719","DOIUrl":"10.1016/j.autcon.2025.106719","url":null,"abstract":"<div><div>Structural crack analysis is vital for infrastructure safety, but existing segmentation models often miss fine cracks due to spectral bias in deep networks. This especially affects thin cracks, which are frequently underrepresented. This paper presents FACS-Net, a Frequency-Aware Crack Segmentation Network with Crack Topology Loss (CT-Loss), to mitigate spectral bias and enhance crack-specific representations. FACS-Net employs frequency-aware attention for decoding, while CT-Loss explicitly incorporates boundary accuracy and structural continuity into the learning objective. Given the high edge-to-area ratio of thin cracks, the proposed approach ensures accurate localization without sacrificing topological coherence. Evaluation on CrackVision12K shows that FACS-Net significantly improves detection of thin cracks (width<span><math><mo>≤</mo></math></span> 2 px), outperforming Hybrid-Segmentor by 0.306 IoU and 0.360 CTS. Overall, FACS-Net achieves state-of-the-art performance with 0.663 IoU and 0.651 CTS, demonstrating precise segmentation and robust structural preservation.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106719"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145787865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Underwater defect measurement for bridge piers via non-planar refraction correction 桥墩水下缺陷的非平面折射校正测量
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-06 DOI: 10.1016/j.autcon.2025.106753
Tao Wu , Shitong Hou , Zhishen Wu , Xiaoyuan He , Gang Wu
Quantitative measurement of apparent defects using underwater vision-based techniques is essential for structural inspection of submerged bridge components. However, measurement accuracy is greatly limited by nonlinear imaging distortions caused by multi-medium refraction and viewport deformation under hydrostatic pressure. To overcome these challenges, this paper introduces a multi-refraction correction model that accounts for refractive interface deformation. A nonlinear underwater imaging framework is established by integrating a spatial coordinate transformation-based calibration method with deformation analysis of the viewport. The feasibility and accuracy of the proposed approach are validated through underwater checkerboard corner-detection experiments. Compared with traditional multi-plane refraction correction method, the proposed model enhances measurement precision by more than 40 %. Additional experiments on submerged bridge pier components show that the measurement errors for apparent defect dimensions consistently remain below 5 %, highlighting the strong potential of the method for practical implementation in underwater visual inspection of bridge infrastructure.
基于水下视觉技术的表观缺陷定量测量是水下桥梁构件结构检测的必要手段。然而,在静水压力下,多介质折射和视口变形引起的非线性成像畸变极大地限制了测量精度。为了克服这些挑战,本文引入了考虑折射界面变形的多折射校正模型。将基于空间坐标变换的定标方法与视口变形分析相结合,建立了非线性水下成像框架。通过水下棋盘格角检测实验,验证了该方法的可行性和准确性。与传统的多平面折射校正方法相比,该模型的测量精度提高了40%以上。对水下桥梁桥墩构件的试验结果表明,该方法对表观缺陷尺寸的测量误差始终保持在5%以下,突出了该方法在桥梁基础设施水下目视检测中的实际应用潜力。
{"title":"Underwater defect measurement for bridge piers via non-planar refraction correction","authors":"Tao Wu ,&nbsp;Shitong Hou ,&nbsp;Zhishen Wu ,&nbsp;Xiaoyuan He ,&nbsp;Gang Wu","doi":"10.1016/j.autcon.2025.106753","DOIUrl":"10.1016/j.autcon.2025.106753","url":null,"abstract":"<div><div>Quantitative measurement of apparent defects using underwater vision-based techniques is essential for structural inspection of submerged bridge components. However, measurement accuracy is greatly limited by nonlinear imaging distortions caused by multi-medium refraction and viewport deformation under hydrostatic pressure. To overcome these challenges, this paper introduces a multi-refraction correction model that accounts for refractive interface deformation. A nonlinear underwater imaging framework is established by integrating a spatial coordinate transformation-based calibration method with deformation analysis of the viewport. The feasibility and accuracy of the proposed approach are validated through underwater checkerboard corner-detection experiments. Compared with traditional multi-plane refraction correction method, the proposed model enhances measurement precision by more than 40 %. Additional experiments on submerged bridge pier components show that the measurement errors for apparent defect dimensions consistently remain below 5 %, highlighting the strong potential of the method for practical implementation in underwater visual inspection of bridge infrastructure.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106753"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145903162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Path planning for UAV-based construction safety inspection under spatiotemporal interference from tower cranes 塔吊时空干扰下基于无人机的建筑安全检测路径规划
IF 11.5 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Pub Date : 2026-02-01 Epub Date: 2026-01-05 DOI: 10.1016/j.autcon.2026.106762
Pinsheng Duan , Xuehai Fu , Jinxin Hu , Jianliang Zhou , Ping Guo
Construction sites are dynamic, complex, high-risk environments, where Unmanned Aerial Vehicles (UAVs) are vital for enhancing safety inspection efficiency. As large-scale dynamic obstacles, tower cranes can interfere with effective UAV inspection paths. This paper proposes a safety inspection path planning method under the spatiotemporal interference of multiple tower cranes. First, a 3D model of the construction site is reconstructed, and inspection viewpoints for UAV flights are generated by optimizing safety inspection strategies. Then, a hierarchical path planning framework is established: the lower-level planner strictly enforces real-time safety obstacle avoidance strategies, while the higher-level planner focuses on global planning to meet inspection requirements. Finally, both simulation and real project studies are conducted to verify the feasibility of the method. Results from the real project show that the effective coverage area is increased by 39.01 % compared with traditional methods. This paper provides theoretical and practical support for UAV-assisted safety inspections in construction.
建筑工地是动态、复杂、高风险的环境,无人机对于提高安全检查效率至关重要。塔吊作为大型动态障碍物,会干扰无人机有效的巡检路径。提出了一种多塔机时空干扰下的安全检测路径规划方法。首先,重建施工现场三维模型,通过优化安全检查策略生成无人机飞行检查视点;然后,建立分层路径规划框架:低层规划器严格执行实时安全避障策略,高层规划器注重全局规划以满足巡检需求。最后,通过仿真和实际工程研究验证了该方法的可行性。实际工程结果表明,与传统方法相比,有效覆盖面积增加了39.01%。为无人机辅助施工安全检测提供理论和实践支持。
{"title":"Path planning for UAV-based construction safety inspection under spatiotemporal interference from tower cranes","authors":"Pinsheng Duan ,&nbsp;Xuehai Fu ,&nbsp;Jinxin Hu ,&nbsp;Jianliang Zhou ,&nbsp;Ping Guo","doi":"10.1016/j.autcon.2026.106762","DOIUrl":"10.1016/j.autcon.2026.106762","url":null,"abstract":"<div><div>Construction sites are dynamic, complex, high-risk environments, where Unmanned Aerial Vehicles (UAVs) are vital for enhancing safety inspection efficiency. As large-scale dynamic obstacles, tower cranes can interfere with effective UAV inspection paths. This paper proposes a safety inspection path planning method under the spatiotemporal interference of multiple tower cranes. First, a 3D model of the construction site is reconstructed, and inspection viewpoints for UAV flights are generated by optimizing safety inspection strategies. Then, a hierarchical path planning framework is established: the lower-level planner strictly enforces real-time safety obstacle avoidance strategies, while the higher-level planner focuses on global planning to meet inspection requirements. Finally, both simulation and real project studies are conducted to verify the feasibility of the method. Results from the real project show that the effective coverage area is increased by 39.01 % compared with traditional methods. This paper provides theoretical and practical support for UAV-assisted safety inspections in construction.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"182 ","pages":"Article 106762"},"PeriodicalIF":11.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145903172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Automation in Construction
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1