{"title":"集成 GLMANet 和 EFPN 的道路裂缝检测模型","authors":"Xinran Li;Xiangyang Xu;Hao Yang","doi":"10.1109/TITS.2024.3432995","DOIUrl":null,"url":null,"abstract":"Detecting road cracks is crucial for ensuring road traffic safety and stability. However, currently existing detection methods do not usually pay close attention to the global, local and multi-scale feature information of road crack images, resulting in poor detection effects. To overcome this limitation, we propose a road crack detection model that combines the global and local multi-scale attention network (GLMANet) and enhanced feature pyramid network (EFPN). The GLMANet can effectively extract useful global, local and multi-scale feature information of crack images through a novel global and local multi-scale attention mechanism. Meanwhile, EFPN utilizes global adaptive attention module and multi-receptive field feature enhancement module to mitigate information loss during feature map generation, enhancing the representation of advanced semantic features in the feature pyramid. Leveraging faster region-based convolutional neural networks (Faster R-CNN) as the object detection architecture, we conduct experimental evaluations on both publicly available crack datasets and a dataset we collected. The proposed model achieved optimal detection performance across all three datasets. In comparison experiments with the current state-of-the-art crack detection methods, the proposed model outperformed other models, with AP and AP50 increasing by up to 16% and 11%, respectively, validating its effectiveness and superiority. Additionally, the model achieved a good balance between complexity and detection performance with 79.2 GFLOPs and 72.60 million parameters.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"25 11","pages":"18211-18223"},"PeriodicalIF":7.9000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Road Crack Detection Model Integrating GLMANet and EFPN\",\"authors\":\"Xinran Li;Xiangyang Xu;Hao Yang\",\"doi\":\"10.1109/TITS.2024.3432995\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Detecting road cracks is crucial for ensuring road traffic safety and stability. However, currently existing detection methods do not usually pay close attention to the global, local and multi-scale feature information of road crack images, resulting in poor detection effects. To overcome this limitation, we propose a road crack detection model that combines the global and local multi-scale attention network (GLMANet) and enhanced feature pyramid network (EFPN). The GLMANet can effectively extract useful global, local and multi-scale feature information of crack images through a novel global and local multi-scale attention mechanism. Meanwhile, EFPN utilizes global adaptive attention module and multi-receptive field feature enhancement module to mitigate information loss during feature map generation, enhancing the representation of advanced semantic features in the feature pyramid. Leveraging faster region-based convolutional neural networks (Faster R-CNN) as the object detection architecture, we conduct experimental evaluations on both publicly available crack datasets and a dataset we collected. The proposed model achieved optimal detection performance across all three datasets. In comparison experiments with the current state-of-the-art crack detection methods, the proposed model outperformed other models, with AP and AP50 increasing by up to 16% and 11%, respectively, validating its effectiveness and superiority. Additionally, the model achieved a good balance between complexity and detection performance with 79.2 GFLOPs and 72.60 million parameters.\",\"PeriodicalId\":13416,\"journal\":{\"name\":\"IEEE Transactions on Intelligent Transportation Systems\",\"volume\":\"25 11\",\"pages\":\"18211-18223\"},\"PeriodicalIF\":7.9000,\"publicationDate\":\"2024-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Intelligent Transportation Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10666989/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10666989/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
A Road Crack Detection Model Integrating GLMANet and EFPN
Detecting road cracks is crucial for ensuring road traffic safety and stability. However, currently existing detection methods do not usually pay close attention to the global, local and multi-scale feature information of road crack images, resulting in poor detection effects. To overcome this limitation, we propose a road crack detection model that combines the global and local multi-scale attention network (GLMANet) and enhanced feature pyramid network (EFPN). The GLMANet can effectively extract useful global, local and multi-scale feature information of crack images through a novel global and local multi-scale attention mechanism. Meanwhile, EFPN utilizes global adaptive attention module and multi-receptive field feature enhancement module to mitigate information loss during feature map generation, enhancing the representation of advanced semantic features in the feature pyramid. Leveraging faster region-based convolutional neural networks (Faster R-CNN) as the object detection architecture, we conduct experimental evaluations on both publicly available crack datasets and a dataset we collected. The proposed model achieved optimal detection performance across all three datasets. In comparison experiments with the current state-of-the-art crack detection methods, the proposed model outperformed other models, with AP and AP50 increasing by up to 16% and 11%, respectively, validating its effectiveness and superiority. Additionally, the model achieved a good balance between complexity and detection performance with 79.2 GFLOPs and 72.60 million parameters.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.