{"title":"基于内存一致性的云计算CPU-Cache-FPGA加速架构","authors":"Hao Yang, Xiaolang Yan","doi":"10.1109/ICISCE.2015.74","DOIUrl":null,"url":null,"abstract":"The power efficiency target is becomes the first goal of current hardware system. The workload under cloud computing environment needs to be accelerated by hardware as more as possible, in order to improve the overall power/performance efficiency. The traditional CPU-FPGA architecture can not handle fine granularity routine of algorithm by FPGA hardware in a effective way, as well as challenge on programming model. This paper proposed a memory/cache coherency based CPU-Cache-FPGA architecture to perform an effective transparent communication between software threads on CPU and hardware threads on FPGA. In a sorting application example, the proposed architecture can gain 2.6 times acceleration ratio than traditional CPU-FPGA architecture, as well as a simplified programming model.","PeriodicalId":356250,"journal":{"name":"2015 2nd International Conference on Information Science and Control Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Memory Coherency Based CPU-Cache-FPGA Acceleration Architecture for Cloud Computing\",\"authors\":\"Hao Yang, Xiaolang Yan\",\"doi\":\"10.1109/ICISCE.2015.74\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The power efficiency target is becomes the first goal of current hardware system. The workload under cloud computing environment needs to be accelerated by hardware as more as possible, in order to improve the overall power/performance efficiency. The traditional CPU-FPGA architecture can not handle fine granularity routine of algorithm by FPGA hardware in a effective way, as well as challenge on programming model. This paper proposed a memory/cache coherency based CPU-Cache-FPGA architecture to perform an effective transparent communication between software threads on CPU and hardware threads on FPGA. In a sorting application example, the proposed architecture can gain 2.6 times acceleration ratio than traditional CPU-FPGA architecture, as well as a simplified programming model.\",\"PeriodicalId\":356250,\"journal\":{\"name\":\"2015 2nd International Conference on Information Science and Control Engineering\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 2nd International Conference on Information Science and Control Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISCE.2015.74\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 2nd International Conference on Information Science and Control Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISCE.2015.74","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Memory Coherency Based CPU-Cache-FPGA Acceleration Architecture for Cloud Computing
The power efficiency target is becomes the first goal of current hardware system. The workload under cloud computing environment needs to be accelerated by hardware as more as possible, in order to improve the overall power/performance efficiency. The traditional CPU-FPGA architecture can not handle fine granularity routine of algorithm by FPGA hardware in a effective way, as well as challenge on programming model. This paper proposed a memory/cache coherency based CPU-Cache-FPGA architecture to perform an effective transparent communication between software threads on CPU and hardware threads on FPGA. In a sorting application example, the proposed architecture can gain 2.6 times acceleration ratio than traditional CPU-FPGA architecture, as well as a simplified programming model.