Engineering report generation from construction-site Internet of Things (IoT) data using large language models (LLMs) remains challenging due to hallucinations. Ensuring traceability and reliability in information retrieval and multi-step reasoning is essential within retrieval-augmented generation (RAG) for LLM. This paper formalizes the RAG-LLM pipeline and proposes a dual-stream enhancement combining knowledge graph (KG) construction with reinforcement learning (RL)-based retriever tuning. The graph-guided module extracts structured engineering elements, while RL improves semantic alignment and tokenization of critical terms. Leveraging this dual-stream RAG, a traceable reporting agent is developed, providing end-to-end traceability of retrieval and reasoning, along with inter-step similarity measures. When collaborating with existing on-site IoT systems, the agent can extend automated monitoring to decision-making support. This paper presents a reliable approach for construction report generation and advances human-AI collaboration in construction management.
{"title":"Graph-driven embedding reinforcement and traceable LLM agent for reliable element alignment in construction report generation","authors":"Zhenzhao Xia , Botao Zhong , Shuai Zhang , Tonghui Zhao , Miroslaw J. Skibniewski","doi":"10.1016/j.autcon.2026.106816","DOIUrl":"10.1016/j.autcon.2026.106816","url":null,"abstract":"<div><div>Engineering report generation from construction-site Internet of Things (IoT) data using large language models (LLMs) remains challenging due to hallucinations. Ensuring traceability and reliability in information retrieval and multi-step reasoning is essential within retrieval-augmented generation (RAG) for LLM. This paper formalizes the RAG-LLM pipeline and proposes a dual-stream enhancement combining knowledge graph (KG) construction with reinforcement learning (RL)-based retriever tuning. The graph-guided module extracts structured engineering elements, while RL improves semantic alignment and tokenization of critical terms. Leveraging this dual-stream RAG, a traceable reporting agent is developed, providing end-to-end traceability of retrieval and reasoning, along with inter-step similarity measures. When collaborating with existing on-site IoT systems, the agent can extend automated monitoring to decision-making support. This paper presents a reliable approach for construction report generation and advances human-AI collaboration in construction management.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106816"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-02-02DOI: 10.1016/j.autcon.2026.106820
Nima Moghimi, Oldouz Arshang, Farook Hamzeh
Digital Twin (DT) technology is emerging as a critical enabler for Off-Site Construction (OSC). However, current research remains fragmented. This paper synthesizes 50 publications using a mixed-methods approach, combining scientometric mapping with systematic qualitative analysis. Scientometric results reveal a bifurcated landscape, distinctly separating volumetric “Modular Construction” (logistics-focused) from component-based “Prefabrication” (geometry-focused). While applications in scheduling and monitoring are growing, widespread adoption is hindered by “Black-Box” AI opacity, data sovereignty issues, and fragmented standards. Furthermore, sustainability remains an implicit rather than explicit goal. The study concludes with a Strategic Research Roadmap charting the path toward autonomous ecosystems. It emphasizes the need for Neuro-symbolic AI, Operator 5.0 frameworks, and Digital Product Passports to bridge the gap between static monitoring and true Cognitive Digital Twins in OSC.
{"title":"Digital twins in offsite construction: Current implementations, challenges, and future pathways","authors":"Nima Moghimi, Oldouz Arshang, Farook Hamzeh","doi":"10.1016/j.autcon.2026.106820","DOIUrl":"10.1016/j.autcon.2026.106820","url":null,"abstract":"<div><div>Digital Twin (DT) technology is emerging as a critical enabler for Off-Site Construction (OSC). However, current research remains fragmented. This paper synthesizes 50 publications using a mixed-methods approach, combining scientometric mapping with systematic qualitative analysis. Scientometric results reveal a bifurcated landscape, distinctly separating volumetric “Modular Construction” (logistics-focused) from component-based “Prefabrication” (geometry-focused). While applications in scheduling and monitoring are growing, widespread adoption is hindered by “Black-Box” AI opacity, data sovereignty issues, and fragmented standards. Furthermore, sustainability remains an implicit rather than explicit goal. The study concludes with a Strategic Research Roadmap charting the path toward autonomous ecosystems. It emphasizes the need for Neuro-symbolic AI, Operator 5.0 frameworks, and Digital Product Passports to bridge the gap between static monitoring and true Cognitive Digital Twins in OSC.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106820"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-02-04DOI: 10.1016/j.autcon.2026.106803
Qiming Sun , Dominik Reisach , Silke Langenberg , Benjamin Dillenburger
Computational Design Methods (CDMs) have increasingly supported the use of Found Objects (FOs) for circular construction. These methods automate the geometric assignment of FOs to a target design, yet a comprehensive overview is lacking. In this context, this paper systematically reviews 142 publications on CDMs for upcycling FOs in construction. It categorizes existing workflows and identifies six key CDMs based on assignment logic and four geometric FO types. The review serves as a roadmap for future research and practical applications, aiding architects and engineers in informed decision-making. It emphasizes the potential of utilizing FOs’ inherent geometry as design drivers for economical and aesthetic architectural solutions. This paper also identifies challenges in scaling CDMs from prototypes to practical applications, such as structural performance and integration with existing workflows. Future research directions include developing AI-based methods, automating construction processes using CDMs, and advocating for sensitivity analysis to assess adaptability across design scenarios.
{"title":"Computational Design Methods for geometry-driven upcycling of found objects in construction","authors":"Qiming Sun , Dominik Reisach , Silke Langenberg , Benjamin Dillenburger","doi":"10.1016/j.autcon.2026.106803","DOIUrl":"10.1016/j.autcon.2026.106803","url":null,"abstract":"<div><div>Computational Design Methods (CDMs) have increasingly supported the use of Found Objects (FOs) for circular construction. These methods automate the geometric assignment of FOs to a target design, yet a comprehensive overview is lacking. In this context, this paper systematically reviews 142 publications on CDMs for upcycling FOs in construction. It categorizes existing workflows and identifies six key CDMs based on assignment logic and four geometric FO types. The review serves as a roadmap for future research and practical applications, aiding architects and engineers in informed decision-making. It emphasizes the potential of utilizing FOs’ inherent geometry as design drivers for economical and aesthetic architectural solutions. This paper also identifies challenges in scaling CDMs from prototypes to practical applications, such as structural performance and integration with existing workflows. Future research directions include developing AI-based methods, automating construction processes using CDMs, and advocating for sensitivity analysis to assess adaptability across design scenarios.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106803"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-01-23DOI: 10.1016/j.autcon.2026.106788
Cong Chen, Shenghan Zhang
Unmanned Aerial Vehicles (UAVs) have emerged as essential tools for building façade inspection. However, due to the repeating patterns on façades, automatically registering images taken by UAV to Building Information Modeling (BIM) models, though important for building maintenance, remains challenging. Existing methods often rely on GPS data, which lack sufficient accuracy in urban environments. This paper proposes a GPS-free automated framework to register UAV-captured image sequences to BIM models by leveraging information from overlapping images. The framework comprises three key components: (1) extracting semantic key points from images using the Grounded SAM 2; (2) implementing a virtual UAV camera model to enable bidirectional projection of key points between BIM coordinates and image coordinates; and (3) developing a particle filter motion model to achieve image-to-BIM registration using image sequences. The proposed method registers various data types to BIM models, including overlapping visual image sequences, infrared (IR)-visual pairs, and façade defects.
无人驾驶飞行器(uav)已成为建筑物外观检查的重要工具。然而,由于立面上的重复模式,自动注册无人机拍摄的图像到建筑信息建模(BIM)模型,虽然对建筑维护很重要,仍然具有挑战性。现有的方法通常依赖于GPS数据,在城市环境中缺乏足够的精度。本文提出了一个无gps的自动化框架,通过利用重叠图像的信息将无人机捕获的图像序列注册到BIM模型。该框架包括三个关键部分:(1)使用ground SAM 2从图像中提取语义关键点;(2)实现虚拟无人机摄像机模型,实现BIM坐标与图像坐标之间关键点的双向投影;(3)开发粒子滤波运动模型,使用图像序列实现图像到bim的注册。该方法将各种数据类型注册到BIM模型中,包括重叠的视觉图像序列,红外(IR)-视觉对和farade缺陷。
{"title":"GPS-free automated registration of UAV-captured façade image sequences to BIM using semantic key points","authors":"Cong Chen, Shenghan Zhang","doi":"10.1016/j.autcon.2026.106788","DOIUrl":"10.1016/j.autcon.2026.106788","url":null,"abstract":"<div><div>Unmanned Aerial Vehicles (UAVs) have emerged as essential tools for building façade inspection. However, due to the repeating patterns on façades, automatically registering images taken by UAV to Building Information Modeling (BIM) models, though important for building maintenance, remains challenging. Existing methods often rely on GPS data, which lack sufficient accuracy in urban environments. This paper proposes a GPS-free automated framework to register UAV-captured image sequences to BIM models by leveraging information from overlapping images. The framework comprises three key components: (1) extracting semantic key points from images using the Grounded SAM 2; (2) implementing a virtual UAV camera model to enable bidirectional projection of key points between BIM coordinates and image coordinates; and (3) developing a particle filter motion model to achieve image-to-BIM registration using image sequences. The proposed method registers various data types to BIM models, including overlapping visual image sequences, infrared (IR)-visual pairs, and façade defects.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106788"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146033314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-02-03DOI: 10.1016/j.autcon.2026.106817
Shengnan Ke , Shibin Li , Jun Gong , Lingxiang Liu , Jianjun Luo , Bing Wang , Shengjun Tang
Accurate semantic understanding of indoor 3D point clouds is essential for constructing semantically rich architectural models and enabling component-level monitoring in smart building environments. This paper proposes a dependency-aware indoor 3D scene graph prediction framework that addresses two major limitations in existing methods. To address this, a Dependency-Aware Graph Reasoning Network (DAGRN) is introduced, integrating attention and message-passing mechanisms to learn context-dependent representations of objects and their relationships. Accordingly, a multimodal feature-enhanced learning module is proposed to align point cloud and image features and incorporate textual semantics from image–text models into a unified training scheme with triplet-level constraints ensuring semantic consistency. Extensive experiments on the 3RScan dataset demonstrate that the proposed method significantly outperforms existing approaches, achieving a 3.95% improvement in overall prediction metrics, laying a foundation for advanced semantic modeling in building automation.
{"title":"Dependency-aware indoor 3D scene graph prediction via multimodal feature learning","authors":"Shengnan Ke , Shibin Li , Jun Gong , Lingxiang Liu , Jianjun Luo , Bing Wang , Shengjun Tang","doi":"10.1016/j.autcon.2026.106817","DOIUrl":"10.1016/j.autcon.2026.106817","url":null,"abstract":"<div><div>Accurate semantic understanding of indoor 3D point clouds is essential for constructing semantically rich architectural models and enabling component-level monitoring in smart building environments. This paper proposes a dependency-aware indoor 3D scene graph prediction framework that addresses two major limitations in existing methods. To address this, a Dependency-Aware Graph Reasoning Network (DAGRN) is introduced, integrating attention and message-passing mechanisms to learn context-dependent representations of objects and their relationships. Accordingly, a multimodal feature-enhanced learning module is proposed to align point cloud and image features and incorporate textual semantics from image–text models into a unified training scheme with triplet-level constraints ensuring semantic consistency. Extensive experiments on the 3RScan dataset demonstrate that the proposed method significantly outperforms existing approaches, achieving a 3.95% improvement in overall prediction metrics, laying a foundation for advanced semantic modeling in building automation.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106817"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Using images captured by UAVs for high-fidelity 3D building reconstruction in architectural engineering is popular and effective nowadays; however, planning a flight trajectory that maximizes reconstruction quality with minimal flight time remains a critical challenge. This paper proposes a universal co-optimization framework that bridges reconstruction objectives with flight dynamics through an integrated planning paradigm. The proposed approach performs initial flight planning by solving a Traveling Salesman Problem over candidate viewpoints and updating them according to the unit-length contribution criterion. The adaptive radius is determined, and subsequently, the sphere-based corridor is constructed to enforce the trajectory passing all updated viewpoints within the corresponding spatial tolerances. Next, an optimal control problem is formulated and solved using a nonlinear solver to obtain the final flight trajectory satisfying both dynamic and safety constraints. Experimental comparisons with state-of-the-art methods on three public scenes and two real scenes captured by ourselves demonstrate that the proposed approach significantly improves flight efficiency, reducing travel distance and flight duration by approximately 10% to 40% with comparable or superior reconstruction quality.
{"title":"Efficient UAV trajectory optimization for fine-detailed 3D building reconstruction","authors":"Tianrui Shen, Lai Kang, Yingmei Wei, Shanshan Wan, Haixuan Wang, Chao Zuo","doi":"10.1016/j.autcon.2026.106775","DOIUrl":"10.1016/j.autcon.2026.106775","url":null,"abstract":"<div><div>Using images captured by UAVs for high-fidelity 3D building reconstruction in architectural engineering is popular and effective nowadays; however, planning a flight trajectory that maximizes reconstruction quality with minimal flight time remains a critical challenge. This paper proposes a universal co-optimization framework that bridges reconstruction objectives with flight dynamics through an integrated planning paradigm. The proposed approach performs initial flight planning by solving a Traveling Salesman Problem over candidate viewpoints and updating them according to the unit-length contribution criterion. The adaptive radius is determined, and subsequently, the sphere-based corridor is constructed to enforce the trajectory passing all updated viewpoints within the corresponding spatial tolerances. Next, an optimal control problem is formulated and solved using a nonlinear solver to obtain the final flight trajectory satisfying both dynamic and safety constraints. Experimental comparisons with state-of-the-art methods on three public scenes and two real scenes captured by ourselves demonstrate that the proposed approach significantly improves flight efficiency, reducing travel distance and flight duration by approximately 10% to 40% with comparable or superior reconstruction quality.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106775"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145976062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-01-19DOI: 10.1016/j.autcon.2026.106790
Yongshuang Li, Feng Xu, Zhipeng Zhang, Xinyu Mei, He Huang
Falls are the primary safety hazard in construction, with traditional manual inspections being inefficient and error-prone, and existing computer vision methods lacking generalization in complex scenarios. This paper presents the Construction Safety Vision-Language Model (CS-VLM), a framework for construction site fall hazard identification and automated captioning, which integrates ModelScope Swift (MS-Swift) adapters and Low-Rank Adaptation (LoRA) technology for efficient fine-tuning of the Qwen2.5-7B-Instruct model. To support model training, a standardized image-text dataset for fall hazards is constructed using a Bidirectional Encoder Representations from Transformers (BERT) -based natural language conversion method. Experimental results demonstrate that CS-VLM achieves a Consensus-based Image Description Evaluation (CIDEr) score of 1.324, Semantic Propositional Image Caption Evaluation (SPICE) score of 0.391, and hazard identification F1-score of 90.2%, outperforming state-of-the-art methods in complex scenario adaptability while reducing computational costs. This research enables precise, standardized hazard description generation, facilitating proactive safety management and accident prevention in construction environments.
坠落是建筑施工中的主要安全隐患,传统的人工检查效率低下且容易出错,现有的计算机视觉方法在复杂场景下缺乏通用性。本文提出了建筑安全视觉语言模型(CS-VLM),这是一个用于建筑现场坠落危险识别和自动字幕的框架,它集成了ModelScope Swift (MS-Swift)适配器和低秩自适应(LoRA)技术,用于对qwen2.5 - 7b - directive模型进行有效微调。为了支持模型训练,使用基于变形金刚双向编码器表示(BERT)的自然语言转换方法构建了跌倒危险的标准化图像-文本数据集。实验结果表明,CS-VLM在基于共识的图像描述评价(CIDEr)得分为1.324,语义命题图像标题评价(SPICE)得分为0.391,危害识别f1得分为90.2%,在降低计算成本的同时,在复杂场景适应性方面优于现有方法。这项研究使精确、标准化的危险描述生成,促进建筑环境中的主动安全管理和事故预防。
{"title":"Construction site fall hazard identification and automated captioning using adapted vision-language models","authors":"Yongshuang Li, Feng Xu, Zhipeng Zhang, Xinyu Mei, He Huang","doi":"10.1016/j.autcon.2026.106790","DOIUrl":"10.1016/j.autcon.2026.106790","url":null,"abstract":"<div><div>Falls are the primary safety hazard in construction, with traditional manual inspections being inefficient and error-prone, and existing computer vision methods lacking generalization in complex scenarios. This paper presents the Construction Safety Vision-Language Model (CS-VLM), a framework for construction site fall hazard identification and automated captioning, which integrates ModelScope Swift (MS-Swift) adapters and Low-Rank Adaptation (LoRA) technology for efficient fine-tuning of the Qwen2.5-7B-Instruct model. To support model training, a standardized image-text dataset for fall hazards is constructed using a Bidirectional Encoder Representations from Transformers (BERT) -based natural language conversion method. Experimental results demonstrate that CS-VLM achieves a Consensus-based Image Description Evaluation (CIDEr) score of 1.324, Semantic Propositional Image Caption Evaluation (SPICE) score of 0.391, and hazard identification F1-score of 90.2%, outperforming state-of-the-art methods in complex scenario adaptability while reducing computational costs. This research enables precise, standardized hazard description generation, facilitating proactive safety management and accident prevention in construction environments.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106790"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146000932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-01-19DOI: 10.1016/j.autcon.2026.106764
Xuefei Wang , Jiaxue Yuan , Jiale Li , Jianmin Zhang , Guowei Ma
As a critical indicator for evaluating road compaction quality, the Intelligent Compaction Measurement Value (ICMV) still suffers from significant scene dependency and the absence of a unified material-structure coupled evaluation framework, particularly in cross-layer compaction assessment. This paper develops a multi-domain analytical framework that integrates vibration signal time, frequency, and time-frequency features based on field data collected from typical road structures, including soil subgrade, cement-stabilized base layer, and asphalt layers. Rolling pass tracking, compactness prediction modeling, and Shapley additive explanations (SHAP) are employed to identify the generic ICMV applicable to pavement structural layers. Furthermore, comparative analyses are conducted to examine the numerical characteristics and vibration response behaviors of the generic ICMV across various structural layers. Finally, a statistically driven stepwise method is applied to determine the engineering ranges of the generic ICMV, thereby establishing a theoretical paradigm for multi-layer intelligent compaction standards and contributing to the digital transformation of pavement engineering.
{"title":"Generic optimization of cross-layer pavement compaction quality using multi-domain intelligent compaction measurement values","authors":"Xuefei Wang , Jiaxue Yuan , Jiale Li , Jianmin Zhang , Guowei Ma","doi":"10.1016/j.autcon.2026.106764","DOIUrl":"10.1016/j.autcon.2026.106764","url":null,"abstract":"<div><div>As a critical indicator for evaluating road compaction quality, the Intelligent Compaction Measurement Value (ICMV) still suffers from significant scene dependency and the absence of a unified material-structure coupled evaluation framework, particularly in cross-layer compaction assessment. This paper develops a multi-domain analytical framework that integrates vibration signal time, frequency, and time-frequency features based on field data collected from typical road structures, including soil subgrade, cement-stabilized base layer, and asphalt layers. Rolling pass tracking, compactness prediction modeling, and Shapley additive explanations (SHAP) are employed to identify the generic ICMV applicable to pavement structural layers. Furthermore, comparative analyses are conducted to examine the numerical characteristics and vibration response behaviors of the generic ICMV across various structural layers. Finally, a statistically driven stepwise method is applied to determine the engineering ranges of the generic ICMV, thereby establishing a theoretical paradigm for multi-layer intelligent compaction standards and contributing to the digital transformation of pavement engineering.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106764"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146000933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-02-06DOI: 10.1016/j.autcon.2026.106821
Yao Wang, Yi Bao
Monitoring cracks is critical for the safety and efficiency of the construction and operation of civil infrastructure. Distributed fiber optic sensors offer advantages for crack monitoring, but their applications are largely limited to near-field cracks. This paper presents an approach for in situ, real-time monitoring of far-field cracks using distributed acoustic sensing. The approach is developed through multi-physics modeling of a representative concrete highway bridge. The influence of key configuration parameters, including gauge length, channel spacing, and sampling rate, is evaluated for crack detection and localization. Results show that cracks located up to 6 m from a fiber optic cable are detected and localized with an average error of 0.94 m across 60 tests with varying crack scenarios and configurations. A cost-benefit analysis compares the proposed approach with state-of-the-art methods based on acoustic emission and distributed fiber optic sensing, demonstrating its benefits for far-field crack monitoring.
{"title":"Distributed acoustic sensing-based real-time monitoring of far-field cracks in reinforced concrete bridge decks","authors":"Yao Wang, Yi Bao","doi":"10.1016/j.autcon.2026.106821","DOIUrl":"10.1016/j.autcon.2026.106821","url":null,"abstract":"<div><div>Monitoring cracks is critical for the safety and efficiency of the construction and operation of civil infrastructure. Distributed fiber optic sensors offer advantages for crack monitoring, but their applications are largely limited to near-field cracks. This paper presents an approach for in situ, real-time monitoring of far-field cracks using distributed acoustic sensing. The approach is developed through multi-physics modeling of a representative concrete highway bridge. The influence of key configuration parameters, including gauge length, channel spacing, and sampling rate, is evaluated for crack detection and localization. Results show that cracks located up to 6 m from a fiber optic cable are detected and localized with an average error of 0.94 m across 60 tests with varying crack scenarios and configurations. A cost-benefit analysis compares the proposed approach with state-of-the-art methods based on acoustic emission and distributed fiber optic sensing, demonstrating its benefits for far-field crack monitoring.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106821"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146134678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fluctuations in construction material prices significantly affect project budgets and bidding strategies via the Construction Cost Index (CCI). This paper develops an AI-driven decision-support system for construction cost forecasting and consultation, integrating deep learning and Large Language Models (LLMs) to enable intelligent CCI prediction. A multi-source data framework combines historical CCI records, macroeconomic indicators, and sentiment extracted from Traditional Chinese construction news. Time-series forecasting employs an Extended Long Short-Term Memory (xLSTM) network, while sentiment models are fine-tuned using Quantized Low-Rank Adaptation (QLoRA). Model hyperparameters for both the QLoRA-fine-tuned LLMs and the xLSTM forecasting models are optimized via the Pilgrimage Walk Optimization (PWO) algorithm, yielding two horizon-specific configurations for short- and medium-term forecasting. Experimental results demonstrate that integrating sentiment features and PWO-based tuning consistently improves forecasting accuracy relative to baseline models. The deployed platform integrates CCI forecasting, sentiment analytics, and retrieval-augmented consultation to provide interpretable forecasts that enhance cost control and decision-making in construction management.
{"title":"AI-driven decision support system for construction cost forecasting and consultation using optimized deep learning and language models","authors":"Jui-Sheng Chou, Mei-Yuan Lin, Nguyen-Ngan-Hanh Pham","doi":"10.1016/j.autcon.2026.106797","DOIUrl":"10.1016/j.autcon.2026.106797","url":null,"abstract":"<div><div>Fluctuations in construction material prices significantly affect project budgets and bidding strategies via the Construction Cost Index (CCI). This paper develops an AI-driven decision-support system for construction cost forecasting and consultation, integrating deep learning and Large Language Models (LLMs) to enable intelligent CCI prediction. A multi-source data framework combines historical CCI records, macroeconomic indicators, and sentiment extracted from Traditional Chinese construction news. Time-series forecasting employs an Extended Long Short-Term Memory (xLSTM) network, while sentiment models are fine-tuned using Quantized Low-Rank Adaptation (QLoRA). Model hyperparameters for both the QLoRA-fine-tuned LLMs and the xLSTM forecasting models are optimized via the Pilgrimage Walk Optimization (PWO) algorithm, yielding two horizon-specific configurations for short- and medium-term forecasting. Experimental results demonstrate that integrating sentiment features and PWO-based tuning consistently improves forecasting accuracy relative to baseline models. The deployed platform integrates CCI forecasting, sentiment analytics, and retrieval-augmented consultation to provide interpretable forecasts that enhance cost control and decision-making in construction management.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106797"},"PeriodicalIF":11.5,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}