A. Gordon-Ross, S. Abdel-Hafeez, Mohamad Hammam Alsafrjalani
{"title":"A One-Cycle FIFO Buffer for Memory Management Units in Manycore Systems","authors":"A. Gordon-Ross, S. Abdel-Hafeez, Mohamad Hammam Alsafrjalani","doi":"10.1109/ISVLSI.2019.00056","DOIUrl":null,"url":null,"abstract":"We present an efficient synchronous first-in first-out (FIFO) buffer for enhanced memory management units and inter-core data communication in manycore systems. Our design significantly reduces hardware overhead and eliminates latency delays by using both the rising and falling clock edges during read and write, which makes our design suitable for increased processing element (PE) utilization by increasing the memory bandwidth in complex network and system on-chip solutions. Compared to prior work, our design can operate 5X faster at the same supply voltage, or up to 44X faster with a 2.5X increase in supply voltage. Our design's total power consumption is 7.8 mW with a total transistor count of 34,470.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"19 1","pages":"265-270"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISVLSI.2019.00056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
We present an efficient synchronous first-in first-out (FIFO) buffer for enhanced memory management units and inter-core data communication in manycore systems. Our design significantly reduces hardware overhead and eliminates latency delays by using both the rising and falling clock edges during read and write, which makes our design suitable for increased processing element (PE) utilization by increasing the memory bandwidth in complex network and system on-chip solutions. Compared to prior work, our design can operate 5X faster at the same supply voltage, or up to 44X faster with a 2.5X increase in supply voltage. Our design's total power consumption is 7.8 mW with a total transistor count of 34,470.