Pengyuan Sun , Lina Yang , Haoyan Yang , Banfu Yan , Thomas Wu , Jincheng Li
{"title":"DSWMamba: A deep feature fusion mamba network for detection of asphalt pavement distress","authors":"Pengyuan Sun , Lina Yang , Haoyan Yang , Banfu Yan , Thomas Wu , Jincheng Li","doi":"10.1016/j.conbuildmat.2025.140393","DOIUrl":null,"url":null,"abstract":"<div><div>The maintenance of asphalt pavements is vital for national infrastructure. Timely detection of pavement distress ensures optimal road performance and extends pavement lifespan. However, the various distress types and complex detection environments pose significant challenges. To overcome these, we propose a novel framework named DSWMamba, which is the first application of Mamba in asphalt pavement distress detection. This model incorporates targeted optimizations via the Selective Scanning Modules (SSM) to meet the unique demands of distress detection. To overcome the limitations of the SSM in receptive fields and poor image localization, we develop the Depth Fusion Selective Scan Block (DFSS), which integrates a Fusion Perception Block (FP-Block). This design expands the receptive field and strengthens local feature extraction. Additionally, the Vision Shuffle Down module(VSD) preserves both multiscale feature information, further enhancing feature representation. Unlike Transformer-based models, DSWMamba leverages a Separate Multidimensional Attention (SMD-Attention) mechanism for multi-path fusion, enabling global modeling. To improve the detection of large-scale features, we develop a Dynamic Separable align Head (DSA-Head), which decouples classification and localization tasks, significantly boosting feature recognition accuracy. Extensive experiments demonstrate that DSWMamba surpasses 18 target detection models, including CNN, Transformer frameworks and the latest YOLO and RTDETR series, on three datasets. DSWMamba improves transverse crack detection accuracy on the Asphalt Pavement Distress Dataset by 13.1% compared to Mamba YOLO while reducing parameters by 34.6% and computational cost by 44.6%. We release the Asphalt Pavement Distress Dataset, comprising 13,129 images of six distress types, offering a valuable resource for future research.</div></div>","PeriodicalId":288,"journal":{"name":"Construction and Building Materials","volume":"469 ","pages":"Article 140393"},"PeriodicalIF":7.4000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Construction and Building Materials","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950061825005410","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The maintenance of asphalt pavements is vital for national infrastructure. Timely detection of pavement distress ensures optimal road performance and extends pavement lifespan. However, the various distress types and complex detection environments pose significant challenges. To overcome these, we propose a novel framework named DSWMamba, which is the first application of Mamba in asphalt pavement distress detection. This model incorporates targeted optimizations via the Selective Scanning Modules (SSM) to meet the unique demands of distress detection. To overcome the limitations of the SSM in receptive fields and poor image localization, we develop the Depth Fusion Selective Scan Block (DFSS), which integrates a Fusion Perception Block (FP-Block). This design expands the receptive field and strengthens local feature extraction. Additionally, the Vision Shuffle Down module(VSD) preserves both multiscale feature information, further enhancing feature representation. Unlike Transformer-based models, DSWMamba leverages a Separate Multidimensional Attention (SMD-Attention) mechanism for multi-path fusion, enabling global modeling. To improve the detection of large-scale features, we develop a Dynamic Separable align Head (DSA-Head), which decouples classification and localization tasks, significantly boosting feature recognition accuracy. Extensive experiments demonstrate that DSWMamba surpasses 18 target detection models, including CNN, Transformer frameworks and the latest YOLO and RTDETR series, on three datasets. DSWMamba improves transverse crack detection accuracy on the Asphalt Pavement Distress Dataset by 13.1% compared to Mamba YOLO while reducing parameters by 34.6% and computational cost by 44.6%. We release the Asphalt Pavement Distress Dataset, comprising 13,129 images of six distress types, offering a valuable resource for future research.
期刊介绍:
Construction and Building Materials offers an international platform for sharing innovative and original research and development in the realm of construction and building materials, along with their practical applications in new projects and repair practices. The journal publishes a diverse array of pioneering research and application papers, detailing laboratory investigations and, to a limited extent, numerical analyses or reports on full-scale projects. Multi-part papers are discouraged.
Additionally, Construction and Building Materials features comprehensive case studies and insightful review articles that contribute to new insights in the field. Our focus is on papers related to construction materials, excluding those on structural engineering, geotechnics, and unbound highway layers. Covered materials and technologies encompass cement, concrete reinforcement, bricks and mortars, additives, corrosion technology, ceramics, timber, steel, polymers, glass fibers, recycled materials, bamboo, rammed earth, non-conventional building materials, bituminous materials, and applications in railway materials.