Pub Date : 2026-01-29DOI: 10.1016/j.autcon.2026.106800
Chukwuka Christian Ohueri
Globally, labor productivity declined by 8% from 2022 to 2024, primarily due to human-centric factors. In transition to Construction 5.0 (C5.0), Human Digital Twin (HDT) integrates humans and systems to enhance productivity. However, existing review studies have not identified human-centric productivity drivers or HDT components, nor examined their interactions in enhancing labor productivity. This paper develops a framework that operationalizes the interactions between human-centric productivity drivers and HDT components to optimize labor productivity. A systematic review was conducted by searching for keywords in Scopus, using predefined criteria to select 185 articles published over the last decade, and analyzing the articles using thematic synthesis. Consequently, human-centric productivity drivers and HDT components were identified, and their interactions operationalized via a structured framework to optimize labor productivity in C5.0. This paper advances automation in construction by establishing a pioneering approach that integrates human attributes and cyber-physical systems for optimal human-system interaction.
{"title":"Human digital twin for optimizing labor productivity in construction 5.0","authors":"Chukwuka Christian Ohueri","doi":"10.1016/j.autcon.2026.106800","DOIUrl":"10.1016/j.autcon.2026.106800","url":null,"abstract":"<div><div>Globally, labor productivity declined by 8% from 2022 to 2024, primarily due to human-centric factors. In transition to Construction 5.0 (C5.0), Human Digital Twin (HDT) integrates humans and systems to enhance productivity. However, existing review studies have not identified human-centric productivity drivers or HDT components, nor examined their interactions in enhancing labor productivity. This paper develops a framework that operationalizes the interactions between human-centric productivity drivers and HDT components to optimize labor productivity. A systematic review was conducted by searching for keywords in Scopus, using predefined criteria to select 185 articles published over the last decade, and analyzing the articles using thematic synthesis. Consequently, human-centric productivity drivers and HDT components were identified, and their interactions operationalized via a structured framework to optimize labor productivity in C5.0. This paper advances automation in construction by establishing a pioneering approach that integrates human attributes and cyber-physical systems for optimal human-system interaction.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106800"},"PeriodicalIF":11.5,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Rebar inspection is a critical but labor-intensive task in concrete construction quality control. This paper develops an improved Boundary-Aware Mask R-CNN (BA-Mask R-CNN) that incorporates a path-enhanced feature extraction network and a boundary-squeeze module to enhance segmentation performance. Trained on a self-constructed dataset of 3450 images, the proposed model achieves a mean Average Precision (mAP) of 91.84%, a mean Intersection over Union (mIoU) of 93.78%, an F1-score of 96.79%, and a Precision of 96.08%, outperforming the baseline Mask R-CNN by 6.69%, 7.35%, 4.36%, and 5.75%, respectively. The probability distributions of rebar diameters (8–22 mm) were obtained from multiple rotational viewpoints, and the corresponding mean values were subsequently computed. The proposed method accurately measures the mean diameters and spacing of rebars in double-layer meshes, with all measurement errors falling within standard engineering tolerances.
{"title":"Improved Boundary-Aware Mask R-CNN using stereo vision for automated rebar inspection","authors":"Weijian Zhao , Ruoshui Xing , Cuiting Wei , Bochao Sun , Tianren Jiang , Qiliang Zhao","doi":"10.1016/j.autcon.2026.106801","DOIUrl":"10.1016/j.autcon.2026.106801","url":null,"abstract":"<div><div>Rebar inspection is a critical but labor-intensive task in concrete construction quality control. This paper develops an improved Boundary-Aware Mask R-CNN (BA-Mask R-CNN) that incorporates a path-enhanced feature extraction network and a boundary-squeeze module to enhance segmentation performance. Trained on a self-constructed dataset of 3450 images, the proposed model achieves a mean Average Precision (mAP) of 91.84%, a mean Intersection over Union (mIoU) of 93.78%, an F1-score of 96.79%, and a Precision of 96.08%, outperforming the baseline Mask R-CNN by 6.69%, 7.35%, 4.36%, and 5.75%, respectively. The probability distributions of rebar diameters (8–22 mm) were obtained from multiple rotational viewpoints, and the corresponding mean values were subsequently computed. The proposed method accurately measures the mean diameters and spacing of rebars in double-layer meshes, with all measurement errors falling within standard engineering tolerances.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106801"},"PeriodicalIF":11.5,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Accurate and resilient monitoring of construction projects remains challenging due to fragmented reporting, data uncertainty and delayed system integration. This paper evaluates RealCONs, a QR-enabled real-time monitoring framework that integrates BIM, mobile scanning, cloud-based SQL storage, and Power BI analytics to support live project control. A 90-day comparative case analysis of two concurrent Electrical and Instrumentation projects benchmarked RealCONs against a conventional tracking system. Performance was assessed using Earned Value and Earned Schedule metrics, supported by Chi-square and two-proportion tests, confidence intervals, normality testing, regression forecasting, and non-parametric Wilcoxon and Mann–Whitney analyses. Data continuity strongly favoured RealCONs, with five missing earned-value days compared with 35 in the comparator project (χ2 = 28.93, p < .001). Across 51 paired days, RealCONs achieved superior CPI (1.02 vs 0.90) and SPI (1.01 vs 0.89). During a delay event (Days 33–37), RealCONs maintained measurable progress and statistically significant SPI predictability, while the comparator recorded zero earned value. Overall, RealCONs enabled earlier delay detection, improved forecast reliability and scalable, real-time decision support aligned with Industry 4.0 objectives.
由于报告的碎片化、数据的不确定性和系统集成的延迟,对建筑项目进行准确和有弹性的监测仍然具有挑战性。本文评估了RealCONs,这是一个支持qr的实时监控框架,它集成了BIM、移动扫描、基于云的SQL存储和Power BI分析,以支持实时项目控制。对两个同时进行的电气和仪器项目进行了为期90天的比较案例分析,将realcon与传统跟踪系统进行了对比。使用挣值和挣进度指标评估绩效,并采用卡方检验和双比例检验、置信区间、正态性检验、回归预测以及非参数Wilcoxon和Mann-Whitney分析。数据连续性非常有利于RealCONs,有5天缺少挣值日,而比较项目为35天(χ2 = 28.93, p < .001)。在51个配对的日子里,RealCONs取得了卓越的CPI (1.02 vs 0.90)和SPI (1.01 vs 0.89)。在延迟事件期间(第33-37天),RealCONs保持了可测量的进度和统计上显著的SPI可预测性,而比较器记录的挣值为零。总体而言,RealCONs实现了更早的延迟检测,提高了预测可靠性和可扩展的实时决策支持,符合工业4.0的目标。
{"title":"A digital monitoring, delay detection and visualisation framework for construction projects: RealCONs","authors":"Kambiz Radman, Mostafa Babaeian Jelodar, Ruggiero Lovreglio","doi":"10.1016/j.autcon.2026.106781","DOIUrl":"10.1016/j.autcon.2026.106781","url":null,"abstract":"<div><div>Accurate and resilient monitoring of construction projects remains challenging due to fragmented reporting, data uncertainty and delayed system integration. This paper evaluates RealCONs, a QR-enabled real-time monitoring framework that integrates BIM, mobile scanning, cloud-based SQL storage, and Power BI analytics to support live project control. A 90-day comparative case analysis of two concurrent Electrical and Instrumentation projects benchmarked RealCONs against a conventional tracking system. Performance was assessed using Earned Value and Earned Schedule metrics, supported by Chi-square and two-proportion tests, confidence intervals, normality testing, regression forecasting, and non-parametric Wilcoxon and Mann–Whitney analyses. Data continuity strongly favoured RealCONs, with five missing earned-value days compared with 35 in the comparator project (χ<sup>2</sup> = 28.93, <em>p</em> < .001). Across 51 paired days, RealCONs achieved superior CPI (1.02 vs 0.90) and SPI (1.01 vs 0.89). During a delay event (Days 33–37), RealCONs maintained measurable progress and statistically significant SPI predictability, while the comparator recorded zero earned value. Overall, RealCONs enabled earlier delay detection, improved forecast reliability and scalable, real-time decision support aligned with Industry 4.0 objectives.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106781"},"PeriodicalIF":11.5,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fluctuations in construction material prices significantly affect project budgets and bidding strategies via the Construction Cost Index (CCI). This paper develops an AI-driven decision-support system for construction cost forecasting and consultation, integrating deep learning and Large Language Models (LLMs) to enable intelligent CCI prediction. A multi-source data framework combines historical CCI records, macroeconomic indicators, and sentiment extracted from Traditional Chinese construction news. Time-series forecasting employs an Extended Long Short-Term Memory (xLSTM) network, while sentiment models are fine-tuned using Quantized Low-Rank Adaptation (QLoRA). Model hyperparameters for both the QLoRA-fine-tuned LLMs and the xLSTM forecasting models are optimized via the Pilgrimage Walk Optimization (PWO) algorithm, yielding two horizon-specific configurations for short- and medium-term forecasting. Experimental results demonstrate that integrating sentiment features and PWO-based tuning consistently improves forecasting accuracy relative to baseline models. The deployed platform integrates CCI forecasting, sentiment analytics, and retrieval-augmented consultation to provide interpretable forecasts that enhance cost control and decision-making in construction management.
{"title":"AI-driven decision support system for construction cost forecasting and consultation using optimized deep learning and language models","authors":"Jui-Sheng Chou, Mei-Yuan Lin, Nguyen-Ngan-Hanh Pham","doi":"10.1016/j.autcon.2026.106797","DOIUrl":"10.1016/j.autcon.2026.106797","url":null,"abstract":"<div><div>Fluctuations in construction material prices significantly affect project budgets and bidding strategies via the Construction Cost Index (CCI). This paper develops an AI-driven decision-support system for construction cost forecasting and consultation, integrating deep learning and Large Language Models (LLMs) to enable intelligent CCI prediction. A multi-source data framework combines historical CCI records, macroeconomic indicators, and sentiment extracted from Traditional Chinese construction news. Time-series forecasting employs an Extended Long Short-Term Memory (xLSTM) network, while sentiment models are fine-tuned using Quantized Low-Rank Adaptation (QLoRA). Model hyperparameters for both the QLoRA-fine-tuned LLMs and the xLSTM forecasting models are optimized via the Pilgrimage Walk Optimization (PWO) algorithm, yielding two horizon-specific configurations for short- and medium-term forecasting. Experimental results demonstrate that integrating sentiment features and PWO-based tuning consistently improves forecasting accuracy relative to baseline models. The deployed platform integrates CCI forecasting, sentiment analytics, and retrieval-augmented consultation to provide interpretable forecasts that enhance cost control and decision-making in construction management.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106797"},"PeriodicalIF":11.5,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-27DOI: 10.1016/j.autcon.2026.106802
Guoqiang Huang , Chengjin Qin , Jie Lu , Pengcheng Xia , Haodi Wang , Chengliang Liu
Accurately predicting muck particle size distribution (PSD) of Tunnel Boring Machine (TBM) is constrained by the cumbersome process of manual annotation and environmental noise. This paper investigates robust prediction of muck PSD curve under noisy TBM operation conditions, while reducing reliance on manual annotations. A noise-robust self-supervised learning method with frequency-bias decomposition is proposed, which integrates contrastive pre-training based on noise augmentation, frequency-domain bias decomposition, and hybrid edge-aware loss function. The experiments show that with only 10% annotation, it achieves performance comparable to existing models trained on 90% annotation, with a maximum particle size MAPE of 6.7% and Rosin-Rammler parameter errors between 10 and 20%. These results demonstrate a low-cost, accurate, and noise-robust approach for muck monitoring, substantially reducing the need for manual annotation and improving prediction reliability. Future work will combine muck PSD with multi-modal TBM excavation data to support intelligent tunneling decision-making.
{"title":"Noise-robust self-supervised learning with frequency-bias decomposition for TBM muck particle size distribution prediction","authors":"Guoqiang Huang , Chengjin Qin , Jie Lu , Pengcheng Xia , Haodi Wang , Chengliang Liu","doi":"10.1016/j.autcon.2026.106802","DOIUrl":"10.1016/j.autcon.2026.106802","url":null,"abstract":"<div><div>Accurately predicting muck particle size distribution (PSD) of Tunnel Boring Machine (TBM) is constrained by the cumbersome process of manual annotation and environmental noise. This paper investigates robust prediction of muck PSD curve under noisy TBM operation conditions, while reducing reliance on manual annotations. A noise-robust self-supervised learning method with frequency-bias decomposition is proposed, which integrates contrastive pre-training based on noise augmentation, frequency-domain bias decomposition, and hybrid edge-aware loss function. The experiments show that with only 10% annotation, it achieves performance comparable to existing models trained on 90% annotation, with a maximum particle size MAPE of 6.7% and Rosin-Rammler parameter errors between 10 and 20%. These results demonstrate a low-cost, accurate, and noise-robust approach for muck monitoring, substantially reducing the need for manual annotation and improving prediction reliability. Future work will combine muck PSD with multi-modal TBM excavation data to support intelligent tunneling decision-making.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106802"},"PeriodicalIF":11.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-27DOI: 10.1016/j.autcon.2026.106770
Dinh-Nhat Truong , Jui-Sheng Chou
This paper presents a simulation-driven framework integrating the Multi-Objective Scientific Approach to Problem Solving-inspired Optimization (MOSAPSO) algorithm with the finite element method (FEM) for automated structural design in construction. The proposed MOSAPSO integrates chaotic initialization, Lévy flight dynamics, elite population control, and sparsity-biased Pareto archiving to enhance convergence and diversity, while embedding the scientific research process, including review and problem definition, hypothesis formulation, data collection, and analysis and interpretation, into a unified optimization strategy. A temporal control strategy balances exploration and exploitation during the optimization process. Benchmarking on 24 CEC-2020 test functions reveals that MOSAPSO outperforms 11 established multi-objective algorithms across hypervolume (HV), generational distance (GD), and spacing (SP) metrics. Integrated with FEM, MOSAPSO–FEM automatically generates Pareto-optimal designs for five large-scale structural systems, balancing weight, displacement, and stability constraints. The framework provides a robust foundation for intelligent, simulation-driven decision-making in construction design, offering significant opportunities for integration with BIM, digital twins, and automated design tools.
{"title":"Multi-objective scientific approach to problem solving-inspired optimization integrated with the finite element method for automated structural design","authors":"Dinh-Nhat Truong , Jui-Sheng Chou","doi":"10.1016/j.autcon.2026.106770","DOIUrl":"10.1016/j.autcon.2026.106770","url":null,"abstract":"<div><div>This paper presents a simulation-driven framework integrating the Multi-Objective Scientific Approach to Problem Solving-inspired Optimization (MOSAPSO) algorithm with the finite element method (FEM) for automated structural design in construction. The proposed MOSAPSO integrates chaotic initialization, Lévy flight dynamics, elite population control, and sparsity-biased Pareto archiving to enhance convergence and diversity, while embedding the scientific research process, including review and problem definition, hypothesis formulation, data collection, and analysis and interpretation, into a unified optimization strategy. A temporal control strategy balances exploration and exploitation during the optimization process. Benchmarking on 24 CEC-2020 test functions reveals that MOSAPSO outperforms 11 established multi-objective algorithms across hypervolume (HV), generational distance (GD), and spacing (SP) metrics. Integrated with FEM, MOSAPSO–FEM automatically generates Pareto-optimal designs for five large-scale structural systems, balancing weight, displacement, and stability constraints. The framework provides a robust foundation for intelligent, simulation-driven decision-making in construction design, offering significant opportunities for integration with BIM, digital twins, and automated design tools.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106770"},"PeriodicalIF":11.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146072633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bridge Expansion Joints (BEJs) are crucial for bridge safety, yet their acoustic signals are complex and easily disturbed by traffic noise, limiting traditional identification accuracy. To address this, an intelligent monitoring system based on voiceprint features and deep learning is developed. Its key contributions include: (1) a cloud-edge collaborative voiceprint monitoring device that integrates audio sampling, embedded processing, cloud server and wireless transmission, enabling long-term data collection and remote diagnosis under noisy environments; (2) the use of first- and second-order differential Mel Frequency Cepstral Coefficients (MFCC) for feature extraction, improving discriminability; and (3) the Hybrid Attention Fusion Network (HAFNet), built on a pre-trained convolutional backbone with multi-scale attention, achieving high-precision recognition of typical BEJ faults, with testing accuracies of 97.99% and 99.00% for two vehicle types. Field experiments demonstrate the system's stability, reliability, and feasibility for real-time BEJ monitoring.
{"title":"Automated diagnosis of bridge expansion joint defects using voiceprint features and deep learning","authors":"Yixuan Chen , Hongzhe Zhao , Yichao Xu , Yufeng Zhang , Jian Zhang","doi":"10.1016/j.autcon.2025.106739","DOIUrl":"10.1016/j.autcon.2025.106739","url":null,"abstract":"<div><div>Bridge Expansion Joints (BEJs) are crucial for bridge safety, yet their acoustic signals are complex and easily disturbed by traffic noise, limiting traditional identification accuracy. To address this, an intelligent monitoring system based on voiceprint features and deep learning is developed. Its key contributions include: (1) a cloud-edge collaborative voiceprint monitoring device that integrates audio sampling, embedded processing, cloud server and wireless transmission, enabling long-term data collection and remote diagnosis under noisy environments; (2) the use of first- and second-order differential Mel Frequency Cepstral Coefficients (MFCC) for feature extraction, improving discriminability; and (3) the Hybrid Attention Fusion Network (HAFNet), built on a pre-trained convolutional backbone with multi-scale attention, achieving high-precision recognition of typical BEJ faults, with testing accuracies of 97.99% and 99.00% for two vehicle types. Field experiments demonstrate the system's stability, reliability, and feasibility for real-time BEJ monitoring.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106739"},"PeriodicalIF":11.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-27DOI: 10.1016/j.autcon.2026.106792
Jinxin Yi , Xuan Kong , Hao Tang , Jie Zhang , Zhenming Chen , Lu Deng
Recent advances in computer vision have provided new solutions for intelligent welding. However, existing vision-based weld seam extraction techniques exhibit limited adaptability to various workpieces in unstructured environments. Therefore, this paper proposes a three-dimensional vision-based method tailored for weld seam extraction and path generation. The proposed method synergizes a deep learning-based point cloud segmentation technique with an improved multi-scale point cloud registration algorithm to reconstruct the complete point cloud model of all weld regions in the workpieces. Subsequently, the welding paths and torch poses are calculated using an optimized multi-plane fitting algorithm integrated with geometry model of weld seam. Experimental validation on four workpieces demonstrates that the proposed method achieves good accuracy and outperforms the existing techniques in terms of efficiency and applicability, offering a robust solution for automated welding of steel structures.
{"title":"Weld seam extraction and path generation for robotic welding of steel structures based on 3D vision","authors":"Jinxin Yi , Xuan Kong , Hao Tang , Jie Zhang , Zhenming Chen , Lu Deng","doi":"10.1016/j.autcon.2026.106792","DOIUrl":"10.1016/j.autcon.2026.106792","url":null,"abstract":"<div><div>Recent advances in computer vision have provided new solutions for intelligent welding. However, existing vision-based weld seam extraction techniques exhibit limited adaptability to various workpieces in unstructured environments. Therefore, this paper proposes a three-dimensional vision-based method tailored for weld seam extraction and path generation. The proposed method synergizes a deep learning-based point cloud segmentation technique with an improved multi-scale point cloud registration algorithm to reconstruct the complete point cloud model of all weld regions in the workpieces. Subsequently, the welding paths and torch poses are calculated using an optimized multi-plane fitting algorithm integrated with geometry model of weld seam. Experimental validation on four workpieces demonstrates that the proposed method achieves good accuracy and outperforms the existing techniques in terms of efficiency and applicability, offering a robust solution for automated welding of steel structures.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106792"},"PeriodicalIF":11.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-27DOI: 10.1016/j.autcon.2026.106791
Yuandong Pan , Mudan Wang , Linjun Lu , Rabindra Lamsal , Erika Pärn , Sisi Zlatanova , Ioannis Brilakis
Digital twins are increasingly used in the Architecture, Engineering, and Construction (AEC) industry, but their adoption is often hindered by the need for specialised knowledge, such as database querying. This paper presents Graph-DT-GPT, a multi-agent framework that integrates Large Language Models (LLMs) with graph-based digital twins to enable natural language interaction. The framework is designed with modular agents, including decision, query generation, and answer extraction, and grounds all LLMs’ outputs in structured graph data to improve response reliability and reduce hallucinations. The framework is evaluated on two use cases: a city-level graph with over 40,000 building nodes and room-level apartment layout graphs. Graph-DT-GPT achieves 100% and 95.5% answer correctness using Claude Sonnet 4.5 and GPT-4o, respectively, in the city-scale case, and 100% correctness in the room-level case, significantly outperforming baseline methods including LangChain Neo4j pipelines by approximately 40% and 10%, respectively. These results demonstrate its scalability and potential to enhance accessible, accurate information retrieval in AEC digital twin applications.
{"title":"LLM-enabled multi-agent framework for natural language interaction with graph-based digital twins","authors":"Yuandong Pan , Mudan Wang , Linjun Lu , Rabindra Lamsal , Erika Pärn , Sisi Zlatanova , Ioannis Brilakis","doi":"10.1016/j.autcon.2026.106791","DOIUrl":"10.1016/j.autcon.2026.106791","url":null,"abstract":"<div><div>Digital twins are increasingly used in the Architecture, Engineering, and Construction (AEC) industry, but their adoption is often hindered by the need for specialised knowledge, such as database querying. This paper presents Graph-DT-GPT, a multi-agent framework that integrates Large Language Models (LLMs) with graph-based digital twins to enable natural language interaction. The framework is designed with modular agents, including decision, query generation, and answer extraction, and grounds all LLMs’ outputs in structured graph data to improve response reliability and reduce hallucinations. The framework is evaluated on two use cases: a city-level graph with over 40,000 building nodes and room-level apartment layout graphs. Graph-DT-GPT achieves 100% and 95.5% answer correctness using Claude Sonnet 4.5 and GPT-4o, respectively, in the city-scale case, and 100% correctness in the room-level case, significantly outperforming baseline methods including LangChain Neo4j pipelines by approximately 40% and 10%, respectively. These results demonstrate its scalability and potential to enhance accessible, accurate information retrieval in AEC digital twin applications.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106791"},"PeriodicalIF":11.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-27DOI: 10.1016/j.autcon.2025.106754
Yiming Liu, Christiane M. Herr
As Artificial Intelligence transforms design through decentralised and self-organising generative systems, Cellular Automata (CA) exemplify a foundational yet underexplored paradigm capable of bridging rule-based emergence and computational creativity in architecture and urbanism. Driven by simple local rules, CA produce spatially responsive and systemic patterns well-suited to capturing the dynamics of complex interrelated systems, making them valuable for generative design exploration. This review systematically investigates control strategies for guiding CA-based generative processes. It identifies temporal logic methods for adjusting CA behaviour through bibliometric analysis. The review further demonstrates control factors, computational control, and human-mediated control, analysing their impact on the adaptability of CA design processes at each stage through the content-based synthesis. The results reveal the advantages of different control strategies in guiding goal-directed CA generation. This study advances the understanding of CA-based design mechanisms and highlights opportunities to develop intelligent control, process-oriented design tools integrating data-driven and AI technologies.
{"title":"Control strategies for Cellular Automata-based generative design in architecture and urbanism","authors":"Yiming Liu, Christiane M. Herr","doi":"10.1016/j.autcon.2025.106754","DOIUrl":"10.1016/j.autcon.2025.106754","url":null,"abstract":"<div><div>As Artificial Intelligence transforms design through decentralised and self-organising generative systems, Cellular Automata (CA) exemplify a foundational yet underexplored paradigm capable of bridging rule-based emergence and computational creativity in architecture and urbanism. Driven by simple local rules, CA produce spatially responsive and systemic patterns well-suited to capturing the dynamics of complex interrelated systems, making them valuable for generative design exploration. This review systematically investigates control strategies for guiding CA-based generative processes. It identifies temporal logic methods for adjusting CA behaviour through bibliometric analysis. The review further demonstrates control factors, computational control, and human-mediated control, analysing their impact on the adaptability of CA design processes at each stage through the content-based synthesis. The results reveal the advantages of different control strategies in guiding goal-directed CA generation. This study advances the understanding of CA-based design mechanisms and highlights opportunities to develop intelligent control, process-oriented design tools integrating data-driven and AI technologies.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106754"},"PeriodicalIF":11.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146071952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}