{"title":"MIMD implementation of neural networks through pipelined, parallel communication trees","authors":"P. Wohl, T. Christopher","doi":"10.1109/TAI.1991.167079","DOIUrl":null,"url":null,"abstract":"Hardware implementations and single-instruction-stream/multiple-data-stream (SIMD) simulations are relatively inflexible, while multiple-instruction-stream (MIMD) simulations often trade their flexibility for increased efficiency. An alternative technique is presented which is based on pipelining fewer but larger messages through parallel, broadcast/accumulate trees. This method exploits both the structural parallelism of neural networks and the data parallelism of neural algorithms. The mapping is flexible to changes in the network architecture and learning algorithm and is suited to a variety of computer configurations. Experimental results show a higher efficiency than similar implementation.<<ETX>>","PeriodicalId":371778,"journal":{"name":"[Proceedings] Third International Conference on Tools for Artificial Intelligence - TAI 91","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[Proceedings] Third International Conference on Tools for Artificial Intelligence - TAI 91","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAI.1991.167079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Hardware implementations and single-instruction-stream/multiple-data-stream (SIMD) simulations are relatively inflexible, while multiple-instruction-stream (MIMD) simulations often trade their flexibility for increased efficiency. An alternative technique is presented which is based on pipelining fewer but larger messages through parallel, broadcast/accumulate trees. This method exploits both the structural parallelism of neural networks and the data parallelism of neural algorithms. The mapping is flexible to changes in the network architecture and learning algorithm and is suited to a variety of computer configurations. Experimental results show a higher efficiency than similar implementation.<>