Pub Date : 2026-02-06DOI: 10.1016/j.autcon.2026.106821
Yao Wang, Yi Bao
Monitoring cracks is critical for the safety and efficiency of the construction and operation of civil infrastructure. Distributed fiber optic sensors offer advantages for crack monitoring, but their applications are largely limited to near-field cracks. This paper presents an approach for in situ, real-time monitoring of far-field cracks using distributed acoustic sensing. The approach is developed through multi-physics modeling of a representative concrete highway bridge. The influence of key configuration parameters, including gauge length, channel spacing, and sampling rate, is evaluated for crack detection and localization. Results show that cracks located up to 6 m from a fiber optic cable are detected and localized with an average error of 0.94 m across 60 tests with varying crack scenarios and configurations. A cost-benefit analysis compares the proposed approach with state-of-the-art methods based on acoustic emission and distributed fiber optic sensing, demonstrating its benefits for far-field crack monitoring.
{"title":"Distributed acoustic sensing-based real-time monitoring of far-field cracks in reinforced concrete bridge decks","authors":"Yao Wang, Yi Bao","doi":"10.1016/j.autcon.2026.106821","DOIUrl":"10.1016/j.autcon.2026.106821","url":null,"abstract":"<div><div>Monitoring cracks is critical for the safety and efficiency of the construction and operation of civil infrastructure. Distributed fiber optic sensors offer advantages for crack monitoring, but their applications are largely limited to near-field cracks. This paper presents an approach for in situ, real-time monitoring of far-field cracks using distributed acoustic sensing. The approach is developed through multi-physics modeling of a representative concrete highway bridge. The influence of key configuration parameters, including gauge length, channel spacing, and sampling rate, is evaluated for crack detection and localization. Results show that cracks located up to 6 m from a fiber optic cable are detected and localized with an average error of 0.94 m across 60 tests with varying crack scenarios and configurations. A cost-benefit analysis compares the proposed approach with state-of-the-art methods based on acoustic emission and distributed fiber optic sensing, demonstrating its benefits for far-field crack monitoring.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106821"},"PeriodicalIF":11.5,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146134678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-05DOI: 10.1016/j.autcon.2026.106823
Lingxiao Wang , Jingfeng Yuan , Shu Su , Hongxing Ding , Yu Bai , Miroslaw J. Skibniewski
Tower crane operations in construction are inherently hazardous due to complex and dynamic site environments. Enhancing operators' perceptual and cognitive capabilities is essential for ensuring safety and improving situational awareness. This paper presents an integrated framework that combines an improved YOLOv8 model with a Knowledge Graph (KG)-enhanced large language model to achieve proactive and intelligent safety management. The improved YOLOv8 incorporates attention-based optimization to improve detection accuracy for small targets in tower crane perspectives. A domain-specific safety KG is constructed to represent critical entities, relationships, and operational contexts, and is aligned with a fine-tuned GPT model, enabling semantic reasoning and context-aware hazard interpretation. The integrated system links visual perception with structured knowledge reasoning to provide real-time and interpretable safety feedback. This approach enhances the perception, understanding, and decision-making capabilities of tower crane operators, transforming safety management from reactive monitoring to proactive and intelligent control in complex construction environments.
{"title":"Fusing enhanced YOLO and knowledge graph-based large language models for automatic risk perception in tower crane operations","authors":"Lingxiao Wang , Jingfeng Yuan , Shu Su , Hongxing Ding , Yu Bai , Miroslaw J. Skibniewski","doi":"10.1016/j.autcon.2026.106823","DOIUrl":"10.1016/j.autcon.2026.106823","url":null,"abstract":"<div><div>Tower crane operations in construction are inherently hazardous due to complex and dynamic site environments. Enhancing operators' perceptual and cognitive capabilities is essential for ensuring safety and improving situational awareness. This paper presents an integrated framework that combines an improved YOLOv8 model with a Knowledge Graph (KG)-enhanced large language model to achieve proactive and intelligent safety management. The improved YOLOv8 incorporates attention-based optimization to improve detection accuracy for small targets in tower crane perspectives. A domain-specific safety KG is constructed to represent critical entities, relationships, and operational contexts, and is aligned with a fine-tuned GPT model, enabling semantic reasoning and context-aware hazard interpretation. The integrated system links visual perception with structured knowledge reasoning to provide real-time and interpretable safety feedback. This approach enhances the perception, understanding, and decision-making capabilities of tower crane operators, transforming safety management from reactive monitoring to proactive and intelligent control in complex construction environments.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106823"},"PeriodicalIF":11.5,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146134693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1016/j.autcon.2026.106798
Yongsheng Li , Limao Zhang , Qixiang Yan , Jianjun Qin , Zhanpeng Luo
The paper addresses the general problem of unreliable excavation in synchronous tunnel boring machines (S-TBMs) caused by inertia effects and time-delay phenomenon. The specific research question is how to achieve robust inertia matching those accounts for both equipment dynamics and human operator response behavior. A Gaussian mixture model (GMM) and an encoder-decoder framework (EDF) are proposed to estimate the driver and S-TBM inertial response time. A dynamic expression of the S-TBM excavation system is formulated, taking into account both human response time and equipment inertia effects. The results demonstrate that the proposed method accurately fits driver response time, achieves high-precision estimation of system inertia, and significantly reduces attitude errors by over 86% compared to non-matched control. An important contribution of this study is the integration of human behavioral inertia into the field of engineering equipment control, providing theoretical support for human-machine collaboration and real-time sharing control.
{"title":"Inertia effects matching for optimal attitude control in synchronous TBM considering human behavior","authors":"Yongsheng Li , Limao Zhang , Qixiang Yan , Jianjun Qin , Zhanpeng Luo","doi":"10.1016/j.autcon.2026.106798","DOIUrl":"10.1016/j.autcon.2026.106798","url":null,"abstract":"<div><div>The paper addresses the general problem of unreliable excavation in synchronous tunnel boring machines (S-TBMs) caused by inertia effects and time-delay phenomenon. The specific research question is how to achieve robust inertia matching those accounts for both equipment dynamics and human operator response behavior. A Gaussian mixture model (GMM) and an encoder-decoder framework (EDF) are proposed to estimate the driver and S-TBM inertial response time. A dynamic expression of the S-TBM excavation system is formulated, taking into account both human response time and equipment inertia effects. The results demonstrate that the proposed method accurately fits driver response time, achieves high-precision estimation of system inertia, and significantly reduces attitude errors by over 86% compared to non-matched control. An important contribution of this study is the integration of human behavioral inertia into the field of engineering equipment control, providing theoretical support for human-machine collaboration and real-time sharing control.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106798"},"PeriodicalIF":11.5,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1016/j.autcon.2026.106803
Qiming Sun , Dominik Reisach , Silke Langenberg , Benjamin Dillenburger
Computational Design Methods (CDMs) have increasingly supported the use of Found Objects (FOs) for circular construction. These methods automate the geometric assignment of FOs to a target design, yet a comprehensive overview is lacking. In this context, this paper systematically reviews 142 publications on CDMs for upcycling FOs in construction. It categorizes existing workflows and identifies six key CDMs based on assignment logic and four geometric FO types. The review serves as a roadmap for future research and practical applications, aiding architects and engineers in informed decision-making. It emphasizes the potential of utilizing FOs’ inherent geometry as design drivers for economical and aesthetic architectural solutions. This paper also identifies challenges in scaling CDMs from prototypes to practical applications, such as structural performance and integration with existing workflows. Future research directions include developing AI-based methods, automating construction processes using CDMs, and advocating for sensitivity analysis to assess adaptability across design scenarios.
{"title":"Computational Design Methods for geometry-driven upcycling of found objects in construction","authors":"Qiming Sun , Dominik Reisach , Silke Langenberg , Benjamin Dillenburger","doi":"10.1016/j.autcon.2026.106803","DOIUrl":"10.1016/j.autcon.2026.106803","url":null,"abstract":"<div><div>Computational Design Methods (CDMs) have increasingly supported the use of Found Objects (FOs) for circular construction. These methods automate the geometric assignment of FOs to a target design, yet a comprehensive overview is lacking. In this context, this paper systematically reviews 142 publications on CDMs for upcycling FOs in construction. It categorizes existing workflows and identifies six key CDMs based on assignment logic and four geometric FO types. The review serves as a roadmap for future research and practical applications, aiding architects and engineers in informed decision-making. It emphasizes the potential of utilizing FOs’ inherent geometry as design drivers for economical and aesthetic architectural solutions. This paper also identifies challenges in scaling CDMs from prototypes to practical applications, such as structural performance and integration with existing workflows. Future research directions include developing AI-based methods, automating construction processes using CDMs, and advocating for sensitivity analysis to assess adaptability across design scenarios.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106803"},"PeriodicalIF":11.5,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1016/j.autcon.2026.106807
Ya-Dong Xue , Fei Jia , Wei Luo , Dong-Mei Zhang , Jie Liu , Yong-Fa Guo
With the rapid expansion of large-scale shield tunnel operations, deep learning has been extensively studied for automated defect recognition. This paper provides a comprehensive review of recent research in deep learning-based methods for tunnel defect recognition, organized into three key stages: dataset establishment, model development, and practical implementation. The review first details the acquisition and preprocessing of tunnel lining images obtained from various inspection equipment, followed by the establishment of defect datasets. It then provides a systematic overview of commonly used deep learning models for defect recognition, with a focus on three primary areas: defect detection, semantic, and instance segmentation, summarizing key innovations within each domain. Based on this analysis, current challenges are identified and future research directions are discussed for each stage. This review aims to promote the practical application of deep learning in tunnel engineering and to support the development of predictive and intelligent maintenance for shield tunnels.
{"title":"Deep learning-based computer vision methods for shield tunnel defect recognition","authors":"Ya-Dong Xue , Fei Jia , Wei Luo , Dong-Mei Zhang , Jie Liu , Yong-Fa Guo","doi":"10.1016/j.autcon.2026.106807","DOIUrl":"10.1016/j.autcon.2026.106807","url":null,"abstract":"<div><div>With the rapid expansion of large-scale shield tunnel operations, deep learning has been extensively studied for automated defect recognition. This paper provides a comprehensive review of recent research in deep learning-based methods for tunnel defect recognition, organized into three key stages: dataset establishment, model development, and practical implementation. The review first details the acquisition and preprocessing of tunnel lining images obtained from various inspection equipment, followed by the establishment of defect datasets. It then provides a systematic overview of commonly used deep learning models for defect recognition, with a focus on three primary areas: defect detection, semantic, and instance segmentation, summarizing key innovations within each domain. Based on this analysis, current challenges are identified and future research directions are discussed for each stage. This review aims to promote the practical application of deep learning in tunnel engineering and to support the development of predictive and intelligent maintenance for shield tunnels.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106807"},"PeriodicalIF":11.5,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1016/j.autcon.2026.106815
Yihui Li , Jun Xiao , Hao Zhou , Borong Lin
Early-stage design decisions strongly influence a building’s lifetime energy performance, yet the transformation from architectural geometry to analysis-ready models remains fragmented. This paper presents CUGER, an automated framework that converts complex 3D building geometries into physically consistent heterogeneous graphs for building energy analysis. CUGER integrates a convex optimization-based segmentation algorithm with structured graph encoding to directly transform CAD geometries into simulation- and learning-ready representations. Evaluated on 266 architecturally diverse building models, the framework achieved an average geometric-to-graph conversion success rate above 80% with strong topological consistency (). Building on the generated graphs, a heterogeneous ZoneGNN model enables zone-level energy prediction of single building, achieving accurate and stable performance () and outperforming MLP baselines, particularly under limited training data. Overall, CUGER establishes a topology-preserving and fully automatable bridge between architectural modeling, performance simulation, and data-driven prediction for early-stage building design.
{"title":"From geometry to graph: Automation of building performance modeling via convex graph encoding","authors":"Yihui Li , Jun Xiao , Hao Zhou , Borong Lin","doi":"10.1016/j.autcon.2026.106815","DOIUrl":"10.1016/j.autcon.2026.106815","url":null,"abstract":"<div><div>Early-stage design decisions strongly influence a building’s lifetime energy performance, yet the transformation from architectural geometry to analysis-ready models remains fragmented. This paper presents CUGER, an automated framework that converts complex 3D building geometries into physically consistent heterogeneous graphs for building energy analysis. CUGER integrates a convex optimization-based segmentation algorithm with structured graph encoding to directly transform CAD geometries into simulation- and learning-ready representations. Evaluated on 266 architecturally diverse building models, the framework achieved an average geometric-to-graph conversion success rate above 80% with strong topological consistency (<span><math><mrow><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>=</mo><mn>0</mn><mo>.</mo><mn>82</mn></mrow></math></span>). Building on the generated graphs, a heterogeneous ZoneGNN model enables zone-level energy prediction of single building, achieving accurate and stable performance (<span><math><mrow><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>=</mo><mn>0</mn><mo>.</mo><mn>88</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>01</mn></mrow></math></span>) and outperforming MLP baselines, particularly under limited training data. Overall, CUGER establishes a topology-preserving and fully automatable bridge between architectural modeling, performance simulation, and data-driven prediction for early-stage building design.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106815"},"PeriodicalIF":11.5,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1016/j.autcon.2026.106806
Xi Hu , Rayan H. Assaad
This paper proposes a robotic teleoperation pipeline to automate the segmentation, quantification, localization, and visualization of pavement potholes in real-time. The pipeline includes a new attention-based deep learning (DL) model and integrates a 4WD robot, teleoperation workstation, multimodal RGBD sensing fusion and point cloud processing on the edge, and interactive web application through cloud services. The DL model was developed by incorporating an efficient multi-scale attention (EMA) mechanism and transfer learning, which was trained and tested on a pavement dataset with 9472 images. The pipeline was validated through real-world field tests. The new EMA-based DL model yielded a 0.611 mAP50–95(B) and a 0.613 mAP50–95(M), outperforming the YOLOv9 baseline by 8.33% and 6.98%, respectively. The findings also showed that the proposed pipeline successfully automates pothole inspection and generates an interactive map, enabling remote access to the robot's trajectory and detailed pothole information, including pothole area, volume, average and maximum depth.
{"title":"Real-time robotic teleoperation for pavement pothole segmentation, quantification, and localization using multimodal sensing and efficient multi-scale attention-enhanced edge deep learning","authors":"Xi Hu , Rayan H. Assaad","doi":"10.1016/j.autcon.2026.106806","DOIUrl":"10.1016/j.autcon.2026.106806","url":null,"abstract":"<div><div>This paper proposes a robotic teleoperation pipeline to automate the segmentation, quantification, localization, and visualization of pavement potholes in real-time. The pipeline includes a new attention-based deep learning (DL) model and integrates a 4WD robot, teleoperation workstation, multimodal RGBD sensing fusion and point cloud processing on the edge, and interactive web application through cloud services. The DL model was developed by incorporating an efficient multi-scale attention (EMA) mechanism and transfer learning, which was trained and tested on a pavement dataset with 9472 images. The pipeline was validated through real-world field tests. The new EMA-based DL model yielded a 0.611 mAP50–95(B) and a 0.613 mAP50–95(M), outperforming the YOLOv9 baseline by 8.33% and 6.98%, respectively. The findings also showed that the proposed pipeline successfully automates pothole inspection and generates an interactive map, enabling remote access to the robot's trajectory and detailed pothole information, including pothole area, volume, average and maximum depth.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106806"},"PeriodicalIF":11.5,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146185057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Engineering report generation from construction-site Internet of Things (IoT) data using large language models (LLMs) remains challenging due to hallucinations. Ensuring traceability and reliability in information retrieval and multi-step reasoning is essential within retrieval-augmented generation (RAG) for LLM. This paper formalizes the RAG-LLM pipeline and proposes a dual-stream enhancement combining knowledge graph (KG) construction with reinforcement learning (RL)-based retriever tuning. The graph-guided module extracts structured engineering elements, while RL improves semantic alignment and tokenization of critical terms. Leveraging this dual-stream RAG, a traceable reporting agent is developed, providing end-to-end traceability of retrieval and reasoning, along with inter-step similarity measures. When collaborating with existing on-site IoT systems, the agent can extend automated monitoring to decision-making support. This paper presents a reliable approach for construction report generation and advances human-AI collaboration in construction management.
{"title":"Graph-driven embedding reinforcement and traceable LLM agent for reliable element alignment in construction report generation","authors":"Zhenzhao Xia , Botao Zhong , Shuai Zhang , Tonghui Zhao , Miroslaw J. Skibniewski","doi":"10.1016/j.autcon.2026.106816","DOIUrl":"10.1016/j.autcon.2026.106816","url":null,"abstract":"<div><div>Engineering report generation from construction-site Internet of Things (IoT) data using large language models (LLMs) remains challenging due to hallucinations. Ensuring traceability and reliability in information retrieval and multi-step reasoning is essential within retrieval-augmented generation (RAG) for LLM. This paper formalizes the RAG-LLM pipeline and proposes a dual-stream enhancement combining knowledge graph (KG) construction with reinforcement learning (RL)-based retriever tuning. The graph-guided module extracts structured engineering elements, while RL improves semantic alignment and tokenization of critical terms. Leveraging this dual-stream RAG, a traceable reporting agent is developed, providing end-to-end traceability of retrieval and reasoning, along with inter-step similarity measures. When collaborating with existing on-site IoT systems, the agent can extend automated monitoring to decision-making support. This paper presents a reliable approach for construction report generation and advances human-AI collaboration in construction management.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106816"},"PeriodicalIF":11.5,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-03DOI: 10.1016/j.autcon.2026.106817
Shengnan Ke , Shibin Li , Jun Gong , Lingxiang Liu , Jianjun Luo , Bing Wang , Shengjun Tang
Accurate semantic understanding of indoor 3D point clouds is essential for constructing semantically rich architectural models and enabling component-level monitoring in smart building environments. This paper proposes a dependency-aware indoor 3D scene graph prediction framework that addresses two major limitations in existing methods. To address this, a Dependency-Aware Graph Reasoning Network (DAGRN) is introduced, integrating attention and message-passing mechanisms to learn context-dependent representations of objects and their relationships. Accordingly, a multimodal feature-enhanced learning module is proposed to align point cloud and image features and incorporate textual semantics from image–text models into a unified training scheme with triplet-level constraints ensuring semantic consistency. Extensive experiments on the 3RScan dataset demonstrate that the proposed method significantly outperforms existing approaches, achieving a 3.95% improvement in overall prediction metrics, laying a foundation for advanced semantic modeling in building automation.
{"title":"Dependency-aware indoor 3D scene graph prediction via multimodal feature learning","authors":"Shengnan Ke , Shibin Li , Jun Gong , Lingxiang Liu , Jianjun Luo , Bing Wang , Shengjun Tang","doi":"10.1016/j.autcon.2026.106817","DOIUrl":"10.1016/j.autcon.2026.106817","url":null,"abstract":"<div><div>Accurate semantic understanding of indoor 3D point clouds is essential for constructing semantically rich architectural models and enabling component-level monitoring in smart building environments. This paper proposes a dependency-aware indoor 3D scene graph prediction framework that addresses two major limitations in existing methods. To address this, a Dependency-Aware Graph Reasoning Network (DAGRN) is introduced, integrating attention and message-passing mechanisms to learn context-dependent representations of objects and their relationships. Accordingly, a multimodal feature-enhanced learning module is proposed to align point cloud and image features and incorporate textual semantics from image–text models into a unified training scheme with triplet-level constraints ensuring semantic consistency. Extensive experiments on the 3RScan dataset demonstrate that the proposed method significantly outperforms existing approaches, achieving a 3.95% improvement in overall prediction metrics, laying a foundation for advanced semantic modeling in building automation.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106817"},"PeriodicalIF":11.5,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-03DOI: 10.1016/j.autcon.2026.106804
Eunbeen Jeong , Seoyoung Jung , Insu Jung , Kyujin Ko , Junyoung Jang
The manual conversion of 2D CAD drawings to 3D BIM objects for Precast Concrete (PC) components is a time-consuming, error-prone bottleneck that hinders project efficiency. Existing automation studies have struggled with the complex, non-standard geometries typical of PC components. This paper presents a framework that automates BIM generation using a Semantic Naming Convention (SNC) to encode design information into 2D CAD attributes. Developed in the Dynamo visual programming environment, the system automatically creates fabrication-level BIM objects for PC columns, beams, and slabs. The framework's versatility was validated by successfully processing 100% of production drawings from three different manufacturers. A Charrette test showed the automated approach reduced modeling time by an average of 78.9%, improved model accuracy by 12.3, and cut human errors by over 90% compared to manual methods. This framework provides a practical solution to enhance productivity and model quality in the PC construction sector.
{"title":"Semantic naming convention-based automated BIM generation of precast concrete components from 2D CAD drawings","authors":"Eunbeen Jeong , Seoyoung Jung , Insu Jung , Kyujin Ko , Junyoung Jang","doi":"10.1016/j.autcon.2026.106804","DOIUrl":"10.1016/j.autcon.2026.106804","url":null,"abstract":"<div><div>The manual conversion of 2D CAD drawings to 3D BIM objects for Precast Concrete (PC) components is a time-consuming, error-prone bottleneck that hinders project efficiency. Existing automation studies have struggled with the complex, non-standard geometries typical of PC components. This paper presents a framework that automates BIM generation using a Semantic Naming Convention (SNC) to encode design information into 2D CAD attributes. Developed in the Dynamo visual programming environment, the system automatically creates fabrication-level BIM objects for PC columns, beams, and slabs. The framework's versatility was validated by successfully processing 100% of production drawings from three different manufacturers. A Charrette test showed the automated approach reduced modeling time by an average of 78.9%, improved model accuracy by 12.3, and cut human errors by over 90% compared to manual methods. This framework provides a practical solution to enhance productivity and model quality in the PC construction sector.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"183 ","pages":"Article 106804"},"PeriodicalIF":11.5,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}