Title: Detection of pedestrians and vehicles in autonomous driving with selective kernel networks
Authors: Zhenlin Zhang, Gao Hanwen, Xingang Wu
Journal: Cognitive Computation and Systems (impact factor 1.2; JCR Q4, Computer Science, Artificial Intelligence)
Publication type: Journal Article
Publication date: 2023-03-03
DOI: 10.1049/ccs2.12078
Article page: https://onlinelibrary.wiley.com/doi/10.1049/ccs2.12078
PDF: https://onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12078
Citation count: 0
Abstract
Accurate detection of pedestrians and vehicles on the road is an essential part of autonomous driving technology. This article proposes a method that optimises an object detection network using a channel attention mechanism. In object detection tasks, small objects and difficult samples are generally handled by building feature pyramids. Unlike feature-pyramid approaches, the authors make no extensive changes to the network. Instead, they use the channel attention mechanism to dynamically adjust the output of a layer during feature extraction. This allows each neuron to adapt its receptive field size to multiple scales of the input information, so that the network focuses on extracting important features, especially those of small objects and difficult samples. To evaluate the proposed method, experiments were conducted on standard benchmark data sets. The results show that the proposed method outperforms the original object detection network in detection accuracy for pedestrians and vehicles, especially for small objects.
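The abstract does not give the layer's exact architecture, so the following is only a minimal sketch of the general selective-kernel idea it describes: several branches with different receptive fields are computed, and a per-channel softmax attention decides how much each branch contributes. Everything specific here is an assumption made for illustration: the 1D feature maps, the moving-average filters standing in for learned convolutions, and the names `box_filter`, `selective_kernel`, and `branch_weights` are hypothetical and are not taken from the paper.

```python
import math


def box_filter(signal, k):
    """Same-length moving average of width k with zero padding.

    Stand-in (assumption) for a learned convolution with a k-wide receptive field.
    """
    half = k // 2
    n = len(signal)
    return [
        sum(signal[j] for j in range(i - half, i + half + 1) if 0 <= j < n) / k
        for i in range(n)
    ]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def selective_kernel(features, kernel_sizes, branch_weights):
    """Blend branches with different receptive fields using channel attention.

    features: list of channels, each a list of floats (a 1D feature map).
    branch_weights[b][c]: hypothetical learned scalar for branch b, channel c.
    Returns (output feature map, per-channel attention over branches).
    """
    # Split: compute one branch per kernel size.
    branches = [[box_filter(ch, k) for ch in features] for k in kernel_sizes]
    output, attention = [], []
    for c in range(len(features)):
        length = len(features[c])
        # Fuse: sum the branches, then global-average-pool to one scalar per channel.
        fused = [sum(branch[c][i] for branch in branches) for i in range(length)]
        descriptor = sum(fused) / length
        # Select: softmax across branches gives this channel's attention weights.
        logits = [branch_weights[b][c] * descriptor for b in range(len(branches))]
        weights = softmax(logits)
        attention.append(weights)
        output.append([
            sum(w * branch[c][i] for w, branch in zip(weights, branches))
            for i in range(length)
        ])
    return output, attention


# Two channels: a narrow spike (a "small object") and a flat region.
features = [[0.0, 0.0, 1.0, 0.0, 0.0], [1.0, 1.0, 1.0, 1.0, 1.0]]
out, att = selective_kernel(
    features,
    kernel_sizes=(3, 5),
    branch_weights=[[1.0, -1.0], [-1.0, 1.0]],  # illustrative values only
)
```

With these illustrative weights, the spike channel puts more attention on the 3-wide branch and the flat channel on the 5-wide branch, which is the sense in which each channel "chooses" its receptive field. In a trained network the attention weights would come from learned layers rather than fixed scalars.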