{"title":"Text-to-structure interpretation of user requests in BIM interaction","authors":"Yinyi Wei , Xiao Li , Frank Petzold","doi":"10.1016/j.autcon.2025.106119","DOIUrl":null,"url":null,"abstract":"<div><div>Numerous efforts have been devoted to utilizing a natural language-based interface for BIM interaction. These interfaces require extracting user's intent (i.e., the operation type) and slots (i.e., the targeted elements and properties). However, there is a lack of a fine-grained approach for extracting intent and slot information simultaneously. This paper introduces a text-to-structure approach based on language models to interpret user requests for BIM interaction (T2S4BIM). It proposed a synthetic data generation method and a curated dataset as data support. Employing Transformer-based models, T2S4BIM converts unstructured user requests into a structured format with intent and slot information. Experiments demonstrated that T2S4BIM outperformed existing approaches, with encoder-decoder models like T5 and FLAN-T5 achieving performance comparable to larger, decoder-only models such as Llama3.1-8B and Qwen2.5-7B, while improving efficiency. The practical applicability of T2S4BIM was illustrated through a Revit plug-in that interprets user requests and executes corresponding actions (e.g., manipulating object properties).</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"174 ","pages":"Article 106119"},"PeriodicalIF":9.6000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926580525001591","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Numerous efforts have been devoted to utilizing a natural language-based interface for BIM interaction. These interfaces require extracting user's intent (i.e., the operation type) and slots (i.e., the targeted elements and properties). However, there is a lack of a fine-grained approach for extracting intent and slot information simultaneously. This paper introduces a text-to-structure approach based on language models to interpret user requests for BIM interaction (T2S4BIM). It proposed a synthetic data generation method and a curated dataset as data support. Employing Transformer-based models, T2S4BIM converts unstructured user requests into a structured format with intent and slot information. Experiments demonstrated that T2S4BIM outperformed existing approaches, with encoder-decoder models like T5 and FLAN-T5 achieving performance comparable to larger, decoder-only models such as Llama3.1-8B and Qwen2.5-7B, while improving efficiency. The practical applicability of T2S4BIM was illustrated through a Revit plug-in that interprets user requests and executes corresponding actions (e.g., manipulating object properties).
期刊介绍:
Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities.
The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.