{"title":"利用视觉曼巴和卷积网络增强像素级裂缝分割能力","authors":"Chengjia Han , Handuo Yang , Yaowen Yang","doi":"10.1016/j.autcon.2024.105770","DOIUrl":null,"url":null,"abstract":"<div><p>Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.</p></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":null,"pages":null},"PeriodicalIF":9.6000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing pixel-level crack segmentation with visual mamba and convolutional networks\",\"authors\":\"Chengjia Han , Handuo Yang , Yaowen Yang\",\"doi\":\"10.1016/j.autcon.2024.105770\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.</p></div>\",\"PeriodicalId\":8660,\"journal\":{\"name\":\"Automation in Construction\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":9.6000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Automation in Construction\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0926580524005065\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926580524005065","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Enhancing pixel-level crack segmentation with visual mamba and convolutional networks
Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.
期刊介绍:
Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities.
The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.