{"title":"增强功能的 YOLO 及其在智能装配中的应用","authors":"Fenglei Zheng , Aijun Yin , Chuande Zhou","doi":"10.1016/j.robot.2024.104844","DOIUrl":null,"url":null,"abstract":"<div><div>Object detection is the most important part in intelligent assembly tasks, accurate and fast detection for different targets can complete positioning and assembly tasks more automatically and efficiently. In this paper, a feature enhancement object detection model based on YOLO is proposed. Firstly, the expression ability of feature layer is enhanced through RFP (Recursive Feature Pyramid) structure. The ARSPP (Atrous Residual Spatial Pyramid Pooling) is proposed to have a further enhancement for the feature layers output by the backbone network, it improves the recognition performance for multi-scale targets of model by using different size of dilated convolution and residual connection. Finally, the contiguous pyramid features are fused and enhanced through the attention mechanism, the results are used for the input of next recursive or predictive output. The model proposed in this paper effectively improves the detection accuracy of YOLO, it has 3% MAP improvement in PASCAL VOC dataset. The validity and accuracy of the model are verified in the robot intelligent assembly recognition task.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"183 ","pages":"Article 104844"},"PeriodicalIF":4.3000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"YOLO with feature enhancement and its application in intelligent assembly\",\"authors\":\"Fenglei Zheng , Aijun Yin , Chuande Zhou\",\"doi\":\"10.1016/j.robot.2024.104844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Object detection is the most important part in intelligent assembly tasks, accurate and fast detection for different targets can complete positioning and assembly tasks more automatically and efficiently. In this paper, a feature enhancement object detection model based on YOLO is proposed. Firstly, the expression ability of feature layer is enhanced through RFP (Recursive Feature Pyramid) structure. The ARSPP (Atrous Residual Spatial Pyramid Pooling) is proposed to have a further enhancement for the feature layers output by the backbone network, it improves the recognition performance for multi-scale targets of model by using different size of dilated convolution and residual connection. Finally, the contiguous pyramid features are fused and enhanced through the attention mechanism, the results are used for the input of next recursive or predictive output. The model proposed in this paper effectively improves the detection accuracy of YOLO, it has 3% MAP improvement in PASCAL VOC dataset. The validity and accuracy of the model are verified in the robot intelligent assembly recognition task.</div></div>\",\"PeriodicalId\":49592,\"journal\":{\"name\":\"Robotics and Autonomous Systems\",\"volume\":\"183 \",\"pages\":\"Article 104844\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Autonomous Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0921889024002288\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889024002288","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
YOLO with feature enhancement and its application in intelligent assembly
Object detection is the most important part in intelligent assembly tasks, accurate and fast detection for different targets can complete positioning and assembly tasks more automatically and efficiently. In this paper, a feature enhancement object detection model based on YOLO is proposed. Firstly, the expression ability of feature layer is enhanced through RFP (Recursive Feature Pyramid) structure. The ARSPP (Atrous Residual Spatial Pyramid Pooling) is proposed to have a further enhancement for the feature layers output by the backbone network, it improves the recognition performance for multi-scale targets of model by using different size of dilated convolution and residual connection. Finally, the contiguous pyramid features are fused and enhanced through the attention mechanism, the results are used for the input of next recursive or predictive output. The model proposed in this paper effectively improves the detection accuracy of YOLO, it has 3% MAP improvement in PASCAL VOC dataset. The validity and accuracy of the model are verified in the robot intelligent assembly recognition task.
期刊介绍:
Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems.
Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.