Dong Gyu Park , Tae Nam Jung , Jin Gahk Kim , Sang Hun Lee , Eun Su Oh , Dong Hwan Kim
{"title":"基于 DBSCAN 和 Yolov5 的 3D 物体检测及其与移动平台的适配","authors":"Dong Gyu Park , Tae Nam Jung , Jin Gahk Kim , Sang Hun Lee , Eun Su Oh , Dong Hwan Kim","doi":"10.1016/j.mechatronics.2024.103238","DOIUrl":null,"url":null,"abstract":"<div><p>This study presents a 3D object detection technology for mobile platforms and its application. Rather than an innovative high-performance model, we proposed a “useable” model for the robot industry at the current technology stage by combining various techniques. To reduce computation time, a 2D region proposal was obtained using a RGB image-based CNN model. By applying the DBSCAN clustering technique to the point cloud corresponding to the 2D region proposal, a method of obtaining a 3D region proposal was proposed. This allowed for 3D object detection using an RGB image dataset, which has been widely researched, while reducing the computation load to a level suitable for use in mobile robots. Furthermore, the 3D object detection was integrated into a ROS 2-based mobile platform, which was used to perform pedestrian-safe avoidance tasks and elevator button operation tasks. The performance was confirmed through experiments.</p></div>","PeriodicalId":49842,"journal":{"name":"Mechatronics","volume":"103 ","pages":"Article 103238"},"PeriodicalIF":3.1000,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DBSCAN and Yolov5 based 3D object detection and its adaptation to a mobile platform\",\"authors\":\"Dong Gyu Park , Tae Nam Jung , Jin Gahk Kim , Sang Hun Lee , Eun Su Oh , Dong Hwan Kim\",\"doi\":\"10.1016/j.mechatronics.2024.103238\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This study presents a 3D object detection technology for mobile platforms and its application. Rather than an innovative high-performance model, we proposed a “useable” model for the robot industry at the current technology stage by combining various techniques. To reduce computation time, a 2D region proposal was obtained using a RGB image-based CNN model. By applying the DBSCAN clustering technique to the point cloud corresponding to the 2D region proposal, a method of obtaining a 3D region proposal was proposed. This allowed for 3D object detection using an RGB image dataset, which has been widely researched, while reducing the computation load to a level suitable for use in mobile robots. Furthermore, the 3D object detection was integrated into a ROS 2-based mobile platform, which was used to perform pedestrian-safe avoidance tasks and elevator button operation tasks. The performance was confirmed through experiments.</p></div>\",\"PeriodicalId\":49842,\"journal\":{\"name\":\"Mechatronics\",\"volume\":\"103 \",\"pages\":\"Article 103238\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mechatronics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S095741582400103X\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mechatronics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095741582400103X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
DBSCAN and Yolov5 based 3D object detection and its adaptation to a mobile platform
This study presents a 3D object detection technology for mobile platforms and its application. Rather than an innovative high-performance model, we proposed a “useable” model for the robot industry at the current technology stage by combining various techniques. To reduce computation time, a 2D region proposal was obtained using a RGB image-based CNN model. By applying the DBSCAN clustering technique to the point cloud corresponding to the 2D region proposal, a method of obtaining a 3D region proposal was proposed. This allowed for 3D object detection using an RGB image dataset, which has been widely researched, while reducing the computation load to a level suitable for use in mobile robots. Furthermore, the 3D object detection was integrated into a ROS 2-based mobile platform, which was used to perform pedestrian-safe avoidance tasks and elevator button operation tasks. The performance was confirmed through experiments.
期刊介绍:
Mechatronics is the synergistic combination of precision mechanical engineering, electronic control and systems thinking in the design of products and manufacturing processes. It relates to the design of systems, devices and products aimed at achieving an optimal balance between basic mechanical structure and its overall control. The purpose of this journal is to provide rapid publication of topical papers featuring practical developments in mechatronics. It will cover a wide range of application areas including consumer product design, instrumentation, manufacturing methods, computer integration and process and device control, and will attract a readership from across the industrial and academic research spectrum. Particular importance will be attached to aspects of innovation in mechatronics design philosophy which illustrate the benefits obtainable by an a priori integration of functionality with embedded microprocessor control. A major item will be the design of machines, devices and systems possessing a degree of computer based intelligence. The journal seeks to publish research progress in this field with an emphasis on the applied rather than the theoretical. It will also serve the dual role of bringing greater recognition to this important area of engineering.