{"title":"Design of VLSI sorting accelerator architecture","authors":"Yun-Nan Chang, Chien Jung Fu","doi":"10.1109/MWSCAS.2008.4616940","DOIUrl":null,"url":null,"abstract":"In this paper, the design of VLSI sorter architecture for the acceleration of data sorting operation is addressed. In order to support the sorting of the variable length sequences, the sorter architecture discussed in this paper is based on a central memory module equipped with some fundamental compare-and-swap (C&S) functional units. Three memory-based sorter designs have been addressed. In addition to the basic single-serial C&S architecture, two parallel approaches have been presented. The first approach based on the direct use of parallel C&S units can lead to the speedup of the sorting process nearly proportional to the number of parallel units being used. However, the second approach based on the multi-step cascaded C&S units can further reduce the number of memory data accesses significantly. The dissipation power due to the memory operation can then be reduced such that this approach will be suitable for low-power applications.","PeriodicalId":118637,"journal":{"name":"2008 51st Midwest Symposium on Circuits and Systems","volume":"362 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 51st Midwest Symposium on Circuits and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MWSCAS.2008.4616940","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, the design of VLSI sorter architecture for the acceleration of data sorting operation is addressed. In order to support the sorting of the variable length sequences, the sorter architecture discussed in this paper is based on a central memory module equipped with some fundamental compare-and-swap (C&S) functional units. Three memory-based sorter designs have been addressed. In addition to the basic single-serial C&S architecture, two parallel approaches have been presented. The first approach based on the direct use of parallel C&S units can lead to the speedup of the sorting process nearly proportional to the number of parallel units being used. However, the second approach based on the multi-step cascaded C&S units can further reduce the number of memory data accesses significantly. The dissipation power due to the memory operation can then be reduced such that this approach will be suitable for low-power applications.