{"title":"利用时间关联进行基于视图的三维物体识别","authors":"A. Massad, B. Mertsching, S. Schmalz","doi":"10.1109/IECON.1998.724038","DOIUrl":null,"url":null,"abstract":"The authors propose an architecture for the recognition of three-dimensional objects on the basis of viewer-centered representations and temporal associations. Motivated by biological findings and by successful computational implementations they have chosen a viewer-centered representation scheme. In contrast to other implementations, special attention is paid to the temporal order of the views, which proves useful for learning and recognition purposes. Their recognition system combines different kinds of artificial neural networks into a four stage architecture: preprocessing by a Gaborjet transform is followed by an extended dynamic link matching algorithm which implements recognition and learning of the view classes. A STORE network records the temporal order of the views by transforming a sequence of view classes into an item-and-order coding. Subsequently, a Gaussian-ARTMAP architecture is used for the classification of the sequences and for their mapping onto object classes by means of supervised learning. The presented results demonstrate that the system is capable to autonomously learn and to discriminate similar objects. Additionally, the examples show how the utilization of the temporal context improves object recognition by making ambiguous views manageable and facilitating an increased insensitiveness against misclassifications.","PeriodicalId":377136,"journal":{"name":"IECON '98. Proceedings of the 24th Annual Conference of the IEEE Industrial Electronics Society (Cat. No.98CH36200)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Utilizing temporal associations for view-based 3-D object recognition\",\"authors\":\"A. Massad, B. Mertsching, S. Schmalz\",\"doi\":\"10.1109/IECON.1998.724038\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The authors propose an architecture for the recognition of three-dimensional objects on the basis of viewer-centered representations and temporal associations. Motivated by biological findings and by successful computational implementations they have chosen a viewer-centered representation scheme. In contrast to other implementations, special attention is paid to the temporal order of the views, which proves useful for learning and recognition purposes. Their recognition system combines different kinds of artificial neural networks into a four stage architecture: preprocessing by a Gaborjet transform is followed by an extended dynamic link matching algorithm which implements recognition and learning of the view classes. A STORE network records the temporal order of the views by transforming a sequence of view classes into an item-and-order coding. Subsequently, a Gaussian-ARTMAP architecture is used for the classification of the sequences and for their mapping onto object classes by means of supervised learning. The presented results demonstrate that the system is capable to autonomously learn and to discriminate similar objects. Additionally, the examples show how the utilization of the temporal context improves object recognition by making ambiguous views manageable and facilitating an increased insensitiveness against misclassifications.\",\"PeriodicalId\":377136,\"journal\":{\"name\":\"IECON '98. Proceedings of the 24th Annual Conference of the IEEE Industrial Electronics Society (Cat. No.98CH36200)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IECON '98. Proceedings of the 24th Annual Conference of the IEEE Industrial Electronics Society (Cat. No.98CH36200)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IECON.1998.724038\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IECON '98. Proceedings of the 24th Annual Conference of the IEEE Industrial Electronics Society (Cat. No.98CH36200)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IECON.1998.724038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Utilizing temporal associations for view-based 3-D object recognition
The authors propose an architecture for the recognition of three-dimensional objects on the basis of viewer-centered representations and temporal associations. Motivated by biological findings and by successful computational implementations they have chosen a viewer-centered representation scheme. In contrast to other implementations, special attention is paid to the temporal order of the views, which proves useful for learning and recognition purposes. Their recognition system combines different kinds of artificial neural networks into a four stage architecture: preprocessing by a Gaborjet transform is followed by an extended dynamic link matching algorithm which implements recognition and learning of the view classes. A STORE network records the temporal order of the views by transforming a sequence of view classes into an item-and-order coding. Subsequently, a Gaussian-ARTMAP architecture is used for the classification of the sequences and for their mapping onto object classes by means of supervised learning. The presented results demonstrate that the system is capable to autonomously learn and to discriminate similar objects. Additionally, the examples show how the utilization of the temporal context improves object recognition by making ambiguous views manageable and facilitating an increased insensitiveness against misclassifications.