{"title":"利用特征分辨率分析改进工业裂纹单针探测器","authors":"Shengxiang Qi, Yaming Dong, Qing Mao","doi":"10.1145/3430199.3430204","DOIUrl":null,"url":null,"abstract":"Although the single shot detector (SSD) is effective for object detection in natural images, it is not suitable for special tasks such as the industrial crack detection. The difficulty lies in the wide diversity of crack sizes and shapes that is usually unpredictable. To solve this problem, we improve the SSD model by feature resolution analysis. The classical SSD network extracts several convolutional feature layers with degressive scales, and then classifies and locates targets by a series of prior boxes with default sizes and aspect ratios regarding to each scale. Therefore, the key is whether the design of these prior boxes is consistent with the real target characteristics. In this paper, we improve the architecture of SSD network via statistically analyzing the distribution of sizes and shapes from our collected crack samples. According to the resolution analysis of the targets at each feature scale, only a fewer number of valid feature layers are carefully extracted, and some more accurate prior boxes are designed relative to each scale. Finally, experimental results demonstrate that the proposed method could not only achieve significantly better prediction accuracy, but also acquire higher computational efficiency, which outperform the state-of-the-art methods.","PeriodicalId":371055,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Improving Single Shot Detector for Industrial Cracks by Feature Resolution Analysis\",\"authors\":\"Shengxiang Qi, Yaming Dong, Qing Mao\",\"doi\":\"10.1145/3430199.3430204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although the single shot detector (SSD) is effective for object detection in natural images, it is not suitable for special tasks such as the industrial crack detection. The difficulty lies in the wide diversity of crack sizes and shapes that is usually unpredictable. To solve this problem, we improve the SSD model by feature resolution analysis. The classical SSD network extracts several convolutional feature layers with degressive scales, and then classifies and locates targets by a series of prior boxes with default sizes and aspect ratios regarding to each scale. Therefore, the key is whether the design of these prior boxes is consistent with the real target characteristics. In this paper, we improve the architecture of SSD network via statistically analyzing the distribution of sizes and shapes from our collected crack samples. According to the resolution analysis of the targets at each feature scale, only a fewer number of valid feature layers are carefully extracted, and some more accurate prior boxes are designed relative to each scale. Finally, experimental results demonstrate that the proposed method could not only achieve significantly better prediction accuracy, but also acquire higher computational efficiency, which outperform the state-of-the-art methods.\",\"PeriodicalId\":371055,\"journal\":{\"name\":\"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3430199.3430204\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3430199.3430204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Single Shot Detector for Industrial Cracks by Feature Resolution Analysis
Although the single shot detector (SSD) is effective for object detection in natural images, it is not suitable for special tasks such as the industrial crack detection. The difficulty lies in the wide diversity of crack sizes and shapes that is usually unpredictable. To solve this problem, we improve the SSD model by feature resolution analysis. The classical SSD network extracts several convolutional feature layers with degressive scales, and then classifies and locates targets by a series of prior boxes with default sizes and aspect ratios regarding to each scale. Therefore, the key is whether the design of these prior boxes is consistent with the real target characteristics. In this paper, we improve the architecture of SSD network via statistically analyzing the distribution of sizes and shapes from our collected crack samples. According to the resolution analysis of the targets at each feature scale, only a fewer number of valid feature layers are carefully extracted, and some more accurate prior boxes are designed relative to each scale. Finally, experimental results demonstrate that the proposed method could not only achieve significantly better prediction accuracy, but also acquire higher computational efficiency, which outperform the state-of-the-art methods.