{"title":"Development of an action classification method for construction sites combining pose assessment and object proximity evaluation","authors":"Toshiya Kikuta, Pang-jo Chun","doi":"10.1007/s12652-024-04753-7","DOIUrl":null,"url":null,"abstract":"<p>Addressing the inherent hazards of on-site construction work and stagnant labor productivity is crucial in the construction industry. To tackle these challenges, automated monitoring of construction sites and analysis of workers' actions play a pivotal role. In this study, we developed a method for classifying actions at a construction site from video, using deep learning. Specifically, we used two image processing techniques, pose assessment and object detection, and found that the accuracy of action classification was improved by extracting information on the proximity of workers to equipment installed at the construction site, and also by considering the pose information. For classification, LSTM (Long Short-Term Memory), CNN (Convolutional Neural Network), and XGBoost models were used, and the presence of proximity information improved average recall by 7.0% to 8.5% for all models used. The final model was developed as an ensemble of these methods, offering accuracy and average recall that are higher than with conventional methods. The methodology developed in this research enables quantification and visualization of work content at construction sites, contributing to the overall enhancement of safety and productivity within the construction industry.</p>","PeriodicalId":14959,"journal":{"name":"Journal of Ambient Intelligence and Humanized Computing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Ambient Intelligence and Humanized Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12652-024-04753-7","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
Addressing the inherent hazards of on-site construction work and stagnant labor productivity is crucial in the construction industry. To tackle these challenges, automated monitoring of construction sites and analysis of workers' actions play a pivotal role. In this study, we developed a method for classifying actions at a construction site from video, using deep learning. Specifically, we used two image processing techniques, pose assessment and object detection, and found that the accuracy of action classification was improved by extracting information on the proximity of workers to equipment installed at the construction site, and also by considering the pose information. For classification, LSTM (Long Short-Term Memory), CNN (Convolutional Neural Network), and XGBoost models were used, and the presence of proximity information improved average recall by 7.0% to 8.5% for all models used. The final model was developed as an ensemble of these methods, offering accuracy and average recall that are higher than with conventional methods. The methodology developed in this research enables quantification and visualization of work content at construction sites, contributing to the overall enhancement of safety and productivity within the construction industry.
期刊介绍:
The purpose of JAIHC is to provide a high profile, leading edge forum for academics, industrial professionals, educators and policy makers involved in the field to contribute, to disseminate the most innovative researches and developments of all aspects of ambient intelligence and humanized computing, such as intelligent/smart objects, environments/spaces, and systems. The journal discusses various technical, safety, personal, social, physical, political, artistic and economic issues. The research topics covered by the journal are (but not limited to):
Pervasive/Ubiquitous Computing and Applications
Cognitive wireless sensor network
Embedded Systems and Software
Mobile Computing and Wireless Communications
Next Generation Multimedia Systems
Security, Privacy and Trust
Service and Semantic Computing
Advanced Networking Architectures
Dependable, Reliable and Autonomic Computing
Embedded Smart Agents
Context awareness, social sensing and inference
Multi modal interaction design
Ergonomics and product prototyping
Intelligent and self-organizing transportation networks & services
Healthcare Systems
Virtual Humans & Virtual Worlds
Wearables sensors and actuators