{"title":"The CM-2 data transposition problem","authors":"R. Vetter, D. Du, A. Klietz","doi":"10.1109/IPPS.1993.262792","DOIUrl":null,"url":null,"abstract":"The CM-2's natural data layout is not conducive to exchanging data with other machines. Before CM-2 data is sent to a remote machine, a bitwise transpose must be performed on the data. Each bit in an n bit value must be transmitted to a different processor, requiring n send operations through the CM-2's global router network. The time required to transpose the data limits the effective throughput of the I/O channel to a small fraction of its peak theoretical bandwidth. For example, when sending data to a remote supercomputer using a 100 MB/s HIPPI channel, an effective throughput of only 4.9 MB/s can be achieved. The authors describe the CM-2 transpose problem and study ways to improve the performance of transposed data transmissions.<<ETX>>","PeriodicalId":248927,"journal":{"name":"[1993] Proceedings Seventh International Parallel Processing Symposium","volume":"126 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1993-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1993] Proceedings Seventh International Parallel Processing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPPS.1993.262792","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The CM-2's natural data layout is not conducive to exchanging data with other machines. Before CM-2 data is sent to a remote machine, a bitwise transpose must be performed on the data. Each bit in an n bit value must be transmitted to a different processor, requiring n send operations through the CM-2's global router network. The time required to transpose the data limits the effective throughput of the I/O channel to a small fraction of its peak theoretical bandwidth. For example, when sending data to a remote supercomputer using a 100 MB/s HIPPI channel, an effective throughput of only 4.9 MB/s can be achieved. The authors describe the CM-2 transpose problem and study ways to improve the performance of transposed data transmissions.<>