T. Satonaka, T. Baba, T. Otsuki, T. Chikamura, T. Meng
{"title":"具有亮度、旋转和位置不变性的目标识别","authors":"T. Satonaka, T. Baba, T. Otsuki, T. Chikamura, T. Meng","doi":"10.1109/ICIP.1997.632109","DOIUrl":null,"url":null,"abstract":"We propose a neural network based on image synthesis, histogram adaptive quantization and the discrete cosine transformation (DCT) for object recognition with luminance, rotation and location invariance. An efficient representation of the invariant features is constructed using a three-dimensional memory structure. The performance of luminance and rotation invariance is illustrated by reduced error rates in face recognition. The error rate of using a two-dimensional DCT is improved from 13.6% to 2.4% with the aid of the proposed image synthesis procedure. The 2.4% error rate is better than all previously reported results using Karhunen-Loeve (1990) transform convolution networks and eigenface models. In using the DCT, our approach also enjoys the additional advantage of greatly reduced computational complexity.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"125 1","pages":"336-339 vol.3"},"PeriodicalIF":0.0000,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Object recognition with luminance, rotation and location invariance\",\"authors\":\"T. Satonaka, T. Baba, T. Otsuki, T. Chikamura, T. Meng\",\"doi\":\"10.1109/ICIP.1997.632109\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a neural network based on image synthesis, histogram adaptive quantization and the discrete cosine transformation (DCT) for object recognition with luminance, rotation and location invariance. An efficient representation of the invariant features is constructed using a three-dimensional memory structure. The performance of luminance and rotation invariance is illustrated by reduced error rates in face recognition. The error rate of using a two-dimensional DCT is improved from 13.6% to 2.4% with the aid of the proposed image synthesis procedure. The 2.4% error rate is better than all previously reported results using Karhunen-Loeve (1990) transform convolution networks and eigenface models. In using the DCT, our approach also enjoys the additional advantage of greatly reduced computational complexity.\",\"PeriodicalId\":92344,\"journal\":{\"name\":\"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing\",\"volume\":\"125 1\",\"pages\":\"336-339 vol.3\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1997-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIP.1997.632109\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIP.1997.632109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object recognition with luminance, rotation and location invariance
We propose a neural network based on image synthesis, histogram adaptive quantization and the discrete cosine transformation (DCT) for object recognition with luminance, rotation and location invariance. An efficient representation of the invariant features is constructed using a three-dimensional memory structure. The performance of luminance and rotation invariance is illustrated by reduced error rates in face recognition. The error rate of using a two-dimensional DCT is improved from 13.6% to 2.4% with the aid of the proposed image synthesis procedure. The 2.4% error rate is better than all previously reported results using Karhunen-Loeve (1990) transform convolution networks and eigenface models. In using the DCT, our approach also enjoys the additional advantage of greatly reduced computational complexity.