{"title":"基于卷积神经网络架构的车辆类型数据集图像分类","authors":"Yian Seo, K. Shin","doi":"10.1145/3299819.3299822","DOIUrl":null,"url":null,"abstract":"Fast development in Deep Learning and its hybrid methodologies has led diverse applications in different domains. For image classification tasks in vehicle related fields, Convolutional Neural Network (CNN) is mostly chosen for recent usages. To train the CNN classifier, various vehicle image datasets are used, however, most of previous studies have learned features from datasets with a single form of images taken in the controlled condition such as surveillance camera vehicle image dataset from the same road, which results the classifier cannot guarantee the generalization of the model onto different forms of vehicle images. In addition, most of researches using CNN have used LeNet, GoogLeNet, or VGGNet for their main architecture. In this study, we perform vehicle type (convertible, coupe, crossover, sedan, SUV, truck, and van) classification and we use our own collected dataset with vehicle images taken in different angles and backgrounds to ensure the generalization and adaptability of proposed classifier. Moreover, we use the state-of-the-art CNN architecture, NASNet, which is a hybrid CNN architecture having Recurrent Neural Network structure trained by Reinforcement Learning to find optimal architecture. After 10 folded experiments, the average final test accuracy points 83%, and on the additional evaluation with random query images, the proposed model achieves accurate classification.","PeriodicalId":119217,"journal":{"name":"Artificial Intelligence and Cloud Computing Conference","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Image Classification for Vehicle Type Dataset Using State-of-the-art Convolutional Neural Network Architecture\",\"authors\":\"Yian Seo, K. Shin\",\"doi\":\"10.1145/3299819.3299822\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fast development in Deep Learning and its hybrid methodologies has led diverse applications in different domains. For image classification tasks in vehicle related fields, Convolutional Neural Network (CNN) is mostly chosen for recent usages. To train the CNN classifier, various vehicle image datasets are used, however, most of previous studies have learned features from datasets with a single form of images taken in the controlled condition such as surveillance camera vehicle image dataset from the same road, which results the classifier cannot guarantee the generalization of the model onto different forms of vehicle images. In addition, most of researches using CNN have used LeNet, GoogLeNet, or VGGNet for their main architecture. In this study, we perform vehicle type (convertible, coupe, crossover, sedan, SUV, truck, and van) classification and we use our own collected dataset with vehicle images taken in different angles and backgrounds to ensure the generalization and adaptability of proposed classifier. Moreover, we use the state-of-the-art CNN architecture, NASNet, which is a hybrid CNN architecture having Recurrent Neural Network structure trained by Reinforcement Learning to find optimal architecture. After 10 folded experiments, the average final test accuracy points 83%, and on the additional evaluation with random query images, the proposed model achieves accurate classification.\",\"PeriodicalId\":119217,\"journal\":{\"name\":\"Artificial Intelligence and Cloud Computing Conference\",\"volume\":\"128 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence and Cloud Computing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3299819.3299822\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence and Cloud Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3299819.3299822","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Image Classification for Vehicle Type Dataset Using State-of-the-art Convolutional Neural Network Architecture
Fast development in Deep Learning and its hybrid methodologies has led diverse applications in different domains. For image classification tasks in vehicle related fields, Convolutional Neural Network (CNN) is mostly chosen for recent usages. To train the CNN classifier, various vehicle image datasets are used, however, most of previous studies have learned features from datasets with a single form of images taken in the controlled condition such as surveillance camera vehicle image dataset from the same road, which results the classifier cannot guarantee the generalization of the model onto different forms of vehicle images. In addition, most of researches using CNN have used LeNet, GoogLeNet, or VGGNet for their main architecture. In this study, we perform vehicle type (convertible, coupe, crossover, sedan, SUV, truck, and van) classification and we use our own collected dataset with vehicle images taken in different angles and backgrounds to ensure the generalization and adaptability of proposed classifier. Moreover, we use the state-of-the-art CNN architecture, NASNet, which is a hybrid CNN architecture having Recurrent Neural Network structure trained by Reinforcement Learning to find optimal architecture. After 10 folded experiments, the average final test accuracy points 83%, and on the additional evaluation with random query images, the proposed model achieves accurate classification.