{"title":"基于特征拼接的集成卷积神经主干在不同交通条件下的车辆检测","authors":"","doi":"10.1080/19427867.2023.2250622","DOIUrl":null,"url":null,"abstract":"<div><div>Nowadays, deploying an intelligent vehicle detection system (IVDS) in diverse traffic is a work priority. It provides real-time traffic information with vehicle counts and types of vehicles. IVDS deployment in diverse traffic is challenging because different vehicle classes occlude each other on the road. In recent years, convolutional neural network (CNN) based deep learning (DL) methods have attained incredible progress in implementing IVDS. However, most CNN-based DL methods do not include diverse traffic conditions in Asian countries. Also, due to existing feature extraction backbones, they cannot accurately detect multi-scale vehicles. This work proposes an advanced visual computing deep learning (AVCDL) method with a vast labeled vehicle dataset to detect vehicles in diverse traffic. It includes an ensemble backbone and an improved multi-stage vehicle detection head (MSVDH). An ensemble CNN backbone extracts the vehicle features and combines them on a single channel via a feature concatenation. The final detection is carried out by an improved MSVDH that classifies the target vehicles. The proposed method is examined, tested, and evaluated using traffic statistics. It is contrasted with current cutting-edge vehicle detection techniques. It achieves 86.32% mean average precision (mAP) on self-collected diverse traffic labeled dataset (DTLD) and 86.17% mAP on KITTI. Moreover, the real-time performance is validated with NVIDIA Jetson Tx2 and Nano boards. It achieves 15 frames per second (FPS) on Jetson Tx2 and 7 FPS on Jetson Nano.</div></div>","PeriodicalId":48974,"journal":{"name":"Transportation Letters-The International Journal of Transportation Research","volume":"16 8","pages":"Pages 838-856"},"PeriodicalIF":3.3000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Vehicle detection in diverse traffic using an ensemble convolutional neural backbone via feature concatenation\",\"authors\":\"\",\"doi\":\"10.1080/19427867.2023.2250622\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Nowadays, deploying an intelligent vehicle detection system (IVDS) in diverse traffic is a work priority. It provides real-time traffic information with vehicle counts and types of vehicles. IVDS deployment in diverse traffic is challenging because different vehicle classes occlude each other on the road. In recent years, convolutional neural network (CNN) based deep learning (DL) methods have attained incredible progress in implementing IVDS. However, most CNN-based DL methods do not include diverse traffic conditions in Asian countries. Also, due to existing feature extraction backbones, they cannot accurately detect multi-scale vehicles. This work proposes an advanced visual computing deep learning (AVCDL) method with a vast labeled vehicle dataset to detect vehicles in diverse traffic. It includes an ensemble backbone and an improved multi-stage vehicle detection head (MSVDH). An ensemble CNN backbone extracts the vehicle features and combines them on a single channel via a feature concatenation. The final detection is carried out by an improved MSVDH that classifies the target vehicles. The proposed method is examined, tested, and evaluated using traffic statistics. It is contrasted with current cutting-edge vehicle detection techniques. It achieves 86.32% mean average precision (mAP) on self-collected diverse traffic labeled dataset (DTLD) and 86.17% mAP on KITTI. Moreover, the real-time performance is validated with NVIDIA Jetson Tx2 and Nano boards. It achieves 15 frames per second (FPS) on Jetson Tx2 and 7 FPS on Jetson Nano.</div></div>\",\"PeriodicalId\":48974,\"journal\":{\"name\":\"Transportation Letters-The International Journal of Transportation Research\",\"volume\":\"16 8\",\"pages\":\"Pages 838-856\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Letters-The International Journal of Transportation Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/org/science/article/pii/S1942786723002308\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Letters-The International Journal of Transportation Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/org/science/article/pii/S1942786723002308","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION","Score":null,"Total":0}
Vehicle detection in diverse traffic using an ensemble convolutional neural backbone via feature concatenation
Nowadays, deploying an intelligent vehicle detection system (IVDS) in diverse traffic is a work priority. It provides real-time traffic information with vehicle counts and types of vehicles. IVDS deployment in diverse traffic is challenging because different vehicle classes occlude each other on the road. In recent years, convolutional neural network (CNN) based deep learning (DL) methods have attained incredible progress in implementing IVDS. However, most CNN-based DL methods do not include diverse traffic conditions in Asian countries. Also, due to existing feature extraction backbones, they cannot accurately detect multi-scale vehicles. This work proposes an advanced visual computing deep learning (AVCDL) method with a vast labeled vehicle dataset to detect vehicles in diverse traffic. It includes an ensemble backbone and an improved multi-stage vehicle detection head (MSVDH). An ensemble CNN backbone extracts the vehicle features and combines them on a single channel via a feature concatenation. The final detection is carried out by an improved MSVDH that classifies the target vehicles. The proposed method is examined, tested, and evaluated using traffic statistics. It is contrasted with current cutting-edge vehicle detection techniques. It achieves 86.32% mean average precision (mAP) on self-collected diverse traffic labeled dataset (DTLD) and 86.17% mAP on KITTI. Moreover, the real-time performance is validated with NVIDIA Jetson Tx2 and Nano boards. It achieves 15 frames per second (FPS) on Jetson Tx2 and 7 FPS on Jetson Nano.
期刊介绍:
Transportation Letters: The International Journal of Transportation Research is a quarterly journal that publishes high-quality peer-reviewed and mini-review papers as well as technical notes and book reviews on the state-of-the-art in transportation research.
The focus of Transportation Letters is on analytical and empirical findings, methodological papers, and theoretical and conceptual insights across all areas of research. Review resource papers that merge descriptions of the state-of-the-art with innovative and new methodological, theoretical, and conceptual insights spanning all areas of transportation research are invited and of particular interest.