{"title":"利用多池化-PCA 过程的 CNN 模块识别比例变化的车辆目标","authors":"Yuxiang Guo;Itsuo Kumazawa;Chuyo Kaku","doi":"10.26599/JICV.2023.9210017","DOIUrl":null,"url":null,"abstract":"The moving vehicles present different scales in the image due to the perspective effect of different viewpoint distances. The premise of advanced driver assistance system (ADAS) system for safety surveillance and safe driving is early identification of vehicle targets in front of the ego vehicle. The recognition of the same vehicle at different scales requires feature learning with scale invariance. Unlike existing feature vector methods, the normalized PCA eigenvalues calculated from feature maps are used to extract scale-invariant features. This study proposed a convolutional neural network (CNN) structure embedded with the module of multi-pooling-PCA for scale variant object recognition. The validation of the proposed network structure is verified by scale variant vehicle image dataset. Compared with scale invariant network algorithms of Scale-invariant feature transform (SIFT) and FSAF as well as miscellaneous networks, the proposed network can achieve the best recognition accuracy tested by the vehicle scale variant dataset. To testify the practicality of this modified network, the testing of public dataset ImageNet is done and the comparable results proved its effectiveness in general purpose of applications.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"6 4","pages":"227-236"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10409228","citationCount":"0","resultStr":"{\"title\":\"Scale Variant Vehicle Object Recognition by CNN Module of Multi-Pooling-PCA Process\",\"authors\":\"Yuxiang Guo;Itsuo Kumazawa;Chuyo Kaku\",\"doi\":\"10.26599/JICV.2023.9210017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The moving vehicles present different scales in the image due to the perspective effect of different viewpoint distances. The premise of advanced driver assistance system (ADAS) system for safety surveillance and safe driving is early identification of vehicle targets in front of the ego vehicle. The recognition of the same vehicle at different scales requires feature learning with scale invariance. Unlike existing feature vector methods, the normalized PCA eigenvalues calculated from feature maps are used to extract scale-invariant features. This study proposed a convolutional neural network (CNN) structure embedded with the module of multi-pooling-PCA for scale variant object recognition. The validation of the proposed network structure is verified by scale variant vehicle image dataset. Compared with scale invariant network algorithms of Scale-invariant feature transform (SIFT) and FSAF as well as miscellaneous networks, the proposed network can achieve the best recognition accuracy tested by the vehicle scale variant dataset. To testify the practicality of this modified network, the testing of public dataset ImageNet is done and the comparable results proved its effectiveness in general purpose of applications.\",\"PeriodicalId\":100793,\"journal\":{\"name\":\"Journal of Intelligent and Connected Vehicles\",\"volume\":\"6 4\",\"pages\":\"227-236\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10409228\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Intelligent and Connected Vehicles\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10409228/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent and Connected Vehicles","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10409228/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Scale Variant Vehicle Object Recognition by CNN Module of Multi-Pooling-PCA Process
The moving vehicles present different scales in the image due to the perspective effect of different viewpoint distances. The premise of advanced driver assistance system (ADAS) system for safety surveillance and safe driving is early identification of vehicle targets in front of the ego vehicle. The recognition of the same vehicle at different scales requires feature learning with scale invariance. Unlike existing feature vector methods, the normalized PCA eigenvalues calculated from feature maps are used to extract scale-invariant features. This study proposed a convolutional neural network (CNN) structure embedded with the module of multi-pooling-PCA for scale variant object recognition. The validation of the proposed network structure is verified by scale variant vehicle image dataset. Compared with scale invariant network algorithms of Scale-invariant feature transform (SIFT) and FSAF as well as miscellaneous networks, the proposed network can achieve the best recognition accuracy tested by the vehicle scale variant dataset. To testify the practicality of this modified network, the testing of public dataset ImageNet is done and the comparable results proved its effectiveness in general purpose of applications.