Xiayuan Wen, Hanwei Wang, Shuyang Jin, Jun Lin, Zhongfeng Wang
{"title":"一种加速Burrows-Wheeler变换的新型硬件架构","authors":"Xiayuan Wen, Hanwei Wang, Shuyang Jin, Jun Lin, Zhongfeng Wang","doi":"10.1109/CICTA.2018.8706064","DOIUrl":null,"url":null,"abstract":"Burrows-Wheeler Transform (BWT) is an important algorithm in many fields including string matching for genome sequences. However, the implementations of BWT-based algorithms are limited due to the complexity of its sorting process. This paper presents a novel hardware architecture which can significantly reduce the number of sorting iterations. Experimental results show a significant reduction in both cycles and time to compute the BWT. Moreover, with the increase of Longest Common Prefix (LCP), our proposed architecture outperforms the traditional implementations further. In the worst case, it achieves 75.1× and 33.2× speedup compared with the Wavesorter architecture and the traditional parallel sorting network architecture respectively.","PeriodicalId":186840,"journal":{"name":"2018 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Novel Hardware Architecture to Accelerate Burrows-Wheeler Transform\",\"authors\":\"Xiayuan Wen, Hanwei Wang, Shuyang Jin, Jun Lin, Zhongfeng Wang\",\"doi\":\"10.1109/CICTA.2018.8706064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Burrows-Wheeler Transform (BWT) is an important algorithm in many fields including string matching for genome sequences. However, the implementations of BWT-based algorithms are limited due to the complexity of its sorting process. This paper presents a novel hardware architecture which can significantly reduce the number of sorting iterations. Experimental results show a significant reduction in both cycles and time to compute the BWT. Moreover, with the increase of Longest Common Prefix (LCP), our proposed architecture outperforms the traditional implementations further. In the worst case, it achieves 75.1× and 33.2× speedup compared with the Wavesorter architecture and the traditional parallel sorting network architecture respectively.\",\"PeriodicalId\":186840,\"journal\":{\"name\":\"2018 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CICTA.2018.8706064\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICTA.2018.8706064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Novel Hardware Architecture to Accelerate Burrows-Wheeler Transform
Burrows-Wheeler Transform (BWT) is an important algorithm in many fields including string matching for genome sequences. However, the implementations of BWT-based algorithms are limited due to the complexity of its sorting process. This paper presents a novel hardware architecture which can significantly reduce the number of sorting iterations. Experimental results show a significant reduction in both cycles and time to compute the BWT. Moreover, with the increase of Longest Common Prefix (LCP), our proposed architecture outperforms the traditional implementations further. In the worst case, it achieves 75.1× and 33.2× speedup compared with the Wavesorter architecture and the traditional parallel sorting network architecture respectively.