Sungkook Hong , Byungjoo Choi , Youngjib Ham , JungHo Jeon , Hyunsoo Kim
{"title":"通过稳定扩散合成大规模建筑数据集,用于机器学习训练","authors":"Sungkook Hong , Byungjoo Choi , Youngjib Ham , JungHo Jeon , Hyunsoo Kim","doi":"10.1016/j.aei.2024.102866","DOIUrl":null,"url":null,"abstract":"<div><div>Advancements of artificial intelligence (AI)-driven image generation provide opportunities to address a problem in machine learning applications that have suffered from a lack of domain-specific training data. This study explores the feasibility of employing synthesized images (SIs) generated through Stable Diffusion as training data for construction. This study aims to examine the potential of Stable Diffusion in construction, and the performance of convolutional neural network (CNN) models trained exclusively on SIs. A total of 82.01% of images synthesized are suitable for representing construction tasks. The CNN model trained on preprocessed SIs (with context-based labeling results) exhibited a classification accuracy of 89.09%. The CNN model trained solely on raw SIs (synthesized images without context-based labeling results) achieved a successful classification rate of 86.51% for the images. This study presents the viability of SIs as a training dataset and introduces context-based labeling through object detection techniques, enhancing the performance of estimation models.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102866"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Massive-Scale construction dataset synthesis through Stable Diffusion for Machine learning training\",\"authors\":\"Sungkook Hong , Byungjoo Choi , Youngjib Ham , JungHo Jeon , Hyunsoo Kim\",\"doi\":\"10.1016/j.aei.2024.102866\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Advancements of artificial intelligence (AI)-driven image generation provide opportunities to address a problem in machine learning applications that have suffered from a lack of domain-specific training data. This study explores the feasibility of employing synthesized images (SIs) generated through Stable Diffusion as training data for construction. This study aims to examine the potential of Stable Diffusion in construction, and the performance of convolutional neural network (CNN) models trained exclusively on SIs. A total of 82.01% of images synthesized are suitable for representing construction tasks. The CNN model trained on preprocessed SIs (with context-based labeling results) exhibited a classification accuracy of 89.09%. The CNN model trained solely on raw SIs (synthesized images without context-based labeling results) achieved a successful classification rate of 86.51% for the images. This study presents the viability of SIs as a training dataset and introduces context-based labeling through object detection techniques, enhancing the performance of estimation models.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"62 \",\"pages\":\"Article 102866\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034624005147\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624005147","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Massive-Scale construction dataset synthesis through Stable Diffusion for Machine learning training
Advancements of artificial intelligence (AI)-driven image generation provide opportunities to address a problem in machine learning applications that have suffered from a lack of domain-specific training data. This study explores the feasibility of employing synthesized images (SIs) generated through Stable Diffusion as training data for construction. This study aims to examine the potential of Stable Diffusion in construction, and the performance of convolutional neural network (CNN) models trained exclusively on SIs. A total of 82.01% of images synthesized are suitable for representing construction tasks. The CNN model trained on preprocessed SIs (with context-based labeling results) exhibited a classification accuracy of 89.09%. The CNN model trained solely on raw SIs (synthesized images without context-based labeling results) achieved a successful classification rate of 86.51% for the images. This study presents the viability of SIs as a training dataset and introduces context-based labeling through object detection techniques, enhancing the performance of estimation models.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.