首页 > 最新文献

Proceedings of the 25th edition on Great Lakes Symposium on VLSI最新文献

英文 中文
Session details: Reliability, Resiliency, Robustness II 会话细节:可靠性,弹性,稳健性II
Pub Date : 2015-05-20 DOI: 10.1145/3254010
S. Cotofana
{"title":"Session details: Reliability, Resiliency, Robustness II","authors":"S. Cotofana","doi":"10.1145/3254010","DOIUrl":"https://doi.org/10.1145/3254010","url":null,"abstract":"","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123119424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Characterization of Analog-to-Digital Converters using Graphene P-N Junctions 石墨烯P-N结模数转换器的设计与表征
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742099
R. G. Rizzo, S. Miryala, A. Calimera, E. Macii, M. Poncino
Electrostatically controlled graphene p-n junctions are devices built on single-layer graphene sheets whose in-to-out resistance can be dynamically tuned through external voltage potentials. While several recent works mainly focused on the possibility of using those devices as a new logic primitive for digital circuits, in this paper we address a complementary problem, that is, how to efficiently implement Analog-to-Digital Converters (ADCs) that can be integrated in future all-graphene flexible ICs. The contribution of this work is threefold: (i) introduce a new ADC architecture that perfectly matches with the main characteristics of graphene p-n junctions; (ii) give a first, yet detailed parametric characterization of the proposed ADC architecture as to validate its functionality and quantify its figures of merit; (iii) provide a fully automated design flow that, given as input the design specs, i.e., input voltage range, voltage resolution and sampling rate, returns an optimally sized ADC circuitry. Few case studies also demonstrate p-n junction based graphene ADCs have characteristics in line with those offered by todays' CMOS ones.
静电控制石墨烯p-n结是建立在单层石墨烯片上的器件,其输入输出电阻可以通过外部电压电位动态调节。虽然最近的一些工作主要集中在使用这些器件作为数字电路的新逻辑原语的可能性上,但在本文中,我们解决了一个补充问题,即如何有效地实现可以集成在未来全石墨烯柔性ic中的模数转换器(adc)。这项工作的贡献有三个方面:(i)引入了一种新的ADC架构,它与石墨烯p-n结的主要特性完美匹配;(ii)对拟议的ADC架构进行第一次详细的参数化表征,以验证其功能并量化其优点;(iii)提供一个完全自动化的设计流程,将设计规格(即输入电压范围、电压分辨率和采样率)作为输入,返回最佳大小的ADC电路。少数案例研究还表明,基于p-n结的石墨烯adc具有与当今CMOS adc相同的特性。
{"title":"Design and Characterization of Analog-to-Digital Converters using Graphene P-N Junctions","authors":"R. G. Rizzo, S. Miryala, A. Calimera, E. Macii, M. Poncino","doi":"10.1145/2742060.2742099","DOIUrl":"https://doi.org/10.1145/2742060.2742099","url":null,"abstract":"Electrostatically controlled graphene p-n junctions are devices built on single-layer graphene sheets whose in-to-out resistance can be dynamically tuned through external voltage potentials. While several recent works mainly focused on the possibility of using those devices as a new logic primitive for digital circuits, in this paper we address a complementary problem, that is, how to efficiently implement Analog-to-Digital Converters (ADCs) that can be integrated in future all-graphene flexible ICs. The contribution of this work is threefold: (i) introduce a new ADC architecture that perfectly matches with the main characteristics of graphene p-n junctions; (ii) give a first, yet detailed parametric characterization of the proposed ADC architecture as to validate its functionality and quantify its figures of merit; (iii) provide a fully automated design flow that, given as input the design specs, i.e., input voltage range, voltage resolution and sampling rate, returns an optimally sized ADC circuitry. Few case studies also demonstrate p-n junction based graphene ADCs have characteristics in line with those offered by todays' CMOS ones.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125849328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Yield-aware Performance-Cost Characterization for Multi-Core SIMT 多核SIMT的产率感知性能成本表征
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742112
S. H. Mozafari, B. Meyer, K. Skadron
Redundancy is now routinely allocated in circuits, microarchitectural structures, or at the system level, to mitigate mounting manufacturing yield losses. In this paper, we propose spare lane sharing, which reduces the cost of multi-core SIMT systems by allowing one of two neighboring cores to make use of a redundant lane if necessary. We have evaluated the performance-cost trade-offs of core-, lane-, and shared-lane-sparing under a variety of benchmarks, and found that for nearly all applications shared-lane-sparing outperforms lane-sparing, reducing cost by up to 20%.
现在,冗余通常分配在电路、微架构结构或系统级别,以减轻安装制造产量损失。在本文中,我们提出了备用通道共享,它允许两个相邻核心中的一个在必要时使用冗余通道,从而降低了多核SIMT系统的成本。我们在各种基准测试下评估了核心、车道和共享车道节约的性能成本权衡,发现几乎所有应用中,共享车道节约都优于车道节约,最多可降低20%的成本。
{"title":"Yield-aware Performance-Cost Characterization for Multi-Core SIMT","authors":"S. H. Mozafari, B. Meyer, K. Skadron","doi":"10.1145/2742060.2742112","DOIUrl":"https://doi.org/10.1145/2742060.2742112","url":null,"abstract":"Redundancy is now routinely allocated in circuits, microarchitectural structures, or at the system level, to mitigate mounting manufacturing yield losses. In this paper, we propose spare lane sharing, which reduces the cost of multi-core SIMT systems by allowing one of two neighboring cores to make use of a redundant lane if necessary. We have evaluated the performance-cost trade-offs of core-, lane-, and shared-lane-sparing under a variety of benchmarks, and found that for nearly all applications shared-lane-sparing outperforms lane-sparing, reducing cost by up to 20%.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125881428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Session details: Emerging Technologies 会议详情:新兴技术
Pub Date : 2015-05-20 DOI: 10.1145/3254014
Yiran Chen
{"title":"Session details: Emerging Technologies","authors":"Yiran Chen","doi":"10.1145/3254014","DOIUrl":"https://doi.org/10.1145/3254014","url":null,"abstract":"","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116611231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graphene Neural Sensors for Next Generation In Vivo Imaging and Optogenetics 下一代体内成像和光遗传学石墨烯神经传感器
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2745702
Z. Ma
Graphene has been studied extensively for their properties in the electrical, mechanical, and optical domains. Graphene"s flexible, transparent, and bio-compatible characteristics expand its boundaries from electrical applications to biological applications. Here, we present graphene neural sensors that allow for next generation in vivo imaging and optogenetics for its transparency over a broad wavelength spectrum and ultra-mechanical flexibility. The neural sensors implanted on the brain surface in rodents verify their unique abilities, including see-through in vivo imaging via fluorescence microscopy and 3D optical coherence tomography, and performance in advanced optogenetic experiments. The study is expected to deliver key information regarding the use of graphene in biological environments, specifically the brain. Subsequently, the study will have a strong impact on a wide spectrum of research areas spanning electrical engineering, neural science, and neural engineering.
石墨烯在电学、力学和光学领域的性能得到了广泛的研究。石墨烯的柔韧、透明和生物相容性将其从电气应用扩展到生物应用。在这里,我们提出了石墨烯神经传感器,它允许下一代体内成像和光遗传学,因为它在宽波长光谱上的透明性和超机械灵活性。植入啮齿类动物脑表面的神经传感器验证了它们的独特能力,包括通过荧光显微镜和3D光学相干断层扫描进行体内透视成像,以及在先进的光遗传学实验中的表现。这项研究有望提供有关石墨烯在生物环境,特别是大脑中使用的关键信息。随后,该研究将对电气工程、神经科学和神经工程等广泛的研究领域产生重大影响。
{"title":"Graphene Neural Sensors for Next Generation In Vivo Imaging and Optogenetics","authors":"Z. Ma","doi":"10.1145/2742060.2745702","DOIUrl":"https://doi.org/10.1145/2742060.2745702","url":null,"abstract":"Graphene has been studied extensively for their properties in the electrical, mechanical, and optical domains. Graphene\"s flexible, transparent, and bio-compatible characteristics expand its boundaries from electrical applications to biological applications. Here, we present graphene neural sensors that allow for next generation in vivo imaging and optogenetics for its transparency over a broad wavelength spectrum and ultra-mechanical flexibility. The neural sensors implanted on the brain surface in rodents verify their unique abilities, including see-through in vivo imaging via fluorescence microscopy and 3D optical coherence tomography, and performance in advanced optogenetic experiments. The study is expected to deliver key information regarding the use of graphene in biological environments, specifically the brain. Subsequently, the study will have a strong impact on a wide spectrum of research areas spanning electrical engineering, neural science, and neural engineering.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123983034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximate Multiplier Architectures Through Partial Product Perforation: Power-Area Tradeoffs Analysis 通过部分乘积穿孔的近似乘法器结构:功率面积权衡分析
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742109
Georgios Zervakis, Kostas Tsoumanis, S. Xydis, N. Axelos, K. Pekmestzi
Approximate computing has received significant attention as a promising strategy to decrease power consumption of inherently error-tolerant applications. Hardware approximation mainly targets arithmetic units, e.g. adders and multipliers. In this paper, we design new approximate hardware multipliers and propose the Partial Product Perforation technique, which omits a number of consecutive partial products by perforating their generation. Through extensive experimental evaluation, we apply the partial product perforation method on different multiplier architectures and expose the optimal configurations for different error values. We show that the partial product perforation delivers reductions of up to 50% in power consumption, 45% in area and 35% in critical delay. Also, the product perforation method is compared with state-of-the-art works on approximate computing that consider the Voltage Over-Scaling (VOS) and logic approximation (i.e. design of approximate compressors) techniques, outperforming them in terms of power dissipation by up to 17% and 20% on average respectively. Finally, with respect to the aforementioned gains, the error value delivered by the proposed product perforation method is smaller by 70% and 99% than the VOS and logic approximation methods respectively.
近似计算作为一种降低固有容错应用的功耗的有前途的策略受到了广泛的关注。硬件逼近主要针对算术单元,如加法器和乘法器。在本文中,我们设计了新的近似硬件乘法器,并提出了部分乘积射孔技术,该技术通过射孔来省略一些连续的部分乘积。通过广泛的实验评估,我们将部分积穿孔方法应用于不同乘法器架构,并揭示了不同误差值下的最佳配置。我们表明,部分产品射孔可减少高达50%的功耗,45%的面积和35%的临界延迟。此外,将产品穿孔方法与考虑电压过标度(VOS)和逻辑近似(即近似压缩机的设计)技术的最先进的近似计算方法进行比较,在功耗方面分别平均优于它们17%和20%。最后,对于上述增益,本文提出的乘积穿孔法所传递的误差值比VOS法和逻辑近似法分别小70%和99%。
{"title":"Approximate Multiplier Architectures Through Partial Product Perforation: Power-Area Tradeoffs Analysis","authors":"Georgios Zervakis, Kostas Tsoumanis, S. Xydis, N. Axelos, K. Pekmestzi","doi":"10.1145/2742060.2742109","DOIUrl":"https://doi.org/10.1145/2742060.2742109","url":null,"abstract":"Approximate computing has received significant attention as a promising strategy to decrease power consumption of inherently error-tolerant applications. Hardware approximation mainly targets arithmetic units, e.g. adders and multipliers. In this paper, we design new approximate hardware multipliers and propose the Partial Product Perforation technique, which omits a number of consecutive partial products by perforating their generation. Through extensive experimental evaluation, we apply the partial product perforation method on different multiplier architectures and expose the optimal configurations for different error values. We show that the partial product perforation delivers reductions of up to 50% in power consumption, 45% in area and 35% in critical delay. Also, the product perforation method is compared with state-of-the-art works on approximate computing that consider the Voltage Over-Scaling (VOS) and logic approximation (i.e. design of approximate compressors) techniques, outperforming them in terms of power dissipation by up to 17% and 20% on average respectively. Finally, with respect to the aforementioned gains, the error value delivered by the proposed product perforation method is smaller by 70% and 99% than the VOS and logic approximation methods respectively.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123667250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
An Effective TSV Self-Repair Scheme for 3D-Stacked ICs 一种有效的3d堆叠集成电路TSV自修复方案
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742071
Songwei Pei, Jingdong Zhang, Yu Jin, Song Jin, Jun Liu, Weizhi Xu
Various types of defects are prone to be occurred inside the TSV during the manufacturing and bonding steps, thereby severely impacting the yield of 3D-stacked ICs. Moreover, several types of TSV defects are latent and may easily escape detection during the manufacturing test. However, these latent TSVs are prone to degrade during the field operation and may eventually become faulty and then destroy the entire 3D-stacked IC. To tackle the above problems, in this paper, we present an effective TSV self-repair scheme for 3D-stacked ICs. By designing redundant TSVs and a TSV self-repair architecture, the proposed scheme can effectively repair faulty TSVs detected by manufacturing test for improving the yield of 3D-stacked ICs. Moreover, the latent TSVS failed and then detected during the in-field operation can also be self-repaired, thereby elevating the 3D ICs' quality and reliability. Experimental results are presented to validate the proposed method.
在制造和键合过程中,TSV内部容易产生各种类型的缺陷,从而严重影响3d堆叠ic的良率。此外,一些类型的TSV缺陷是潜在的,很容易在制造测试中逃脱检测。然而,这些潜在的TSV在现场工作过程中容易退化,最终可能出现故障并破坏整个3d堆叠IC。为了解决上述问题,本文提出了一种有效的3d堆叠IC TSV自修复方案。该方案通过设计冗余的TSV和TSV自修复架构,能够有效地修复制造测试中检测到的故障TSV,从而提高3d堆叠ic的成品率。此外,在现场操作过程中发现的潜在TSVS故障也可以自我修复,从而提高了3D集成电路的质量和可靠性。实验结果验证了该方法的有效性。
{"title":"An Effective TSV Self-Repair Scheme for 3D-Stacked ICs","authors":"Songwei Pei, Jingdong Zhang, Yu Jin, Song Jin, Jun Liu, Weizhi Xu","doi":"10.1145/2742060.2742071","DOIUrl":"https://doi.org/10.1145/2742060.2742071","url":null,"abstract":"Various types of defects are prone to be occurred inside the TSV during the manufacturing and bonding steps, thereby severely impacting the yield of 3D-stacked ICs. Moreover, several types of TSV defects are latent and may easily escape detection during the manufacturing test. However, these latent TSVs are prone to degrade during the field operation and may eventually become faulty and then destroy the entire 3D-stacked IC. To tackle the above problems, in this paper, we present an effective TSV self-repair scheme for 3D-stacked ICs. By designing redundant TSVs and a TSV self-repair architecture, the proposed scheme can effectively repair faulty TSVs detected by manufacturing test for improving the yield of 3D-stacked ICs. Moreover, the latent TSVS failed and then detected during the in-field operation can also be self-repaired, thereby elevating the 3D ICs' quality and reliability. Experimental results are presented to validate the proposed method.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121182485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revisiting Dynamic Thermal Management Exploiting Inverse Thermal Dependence 利用逆热依赖重温动态热管理
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742086
Katayoun Neshatpour, H. Homayoun, A. Djahromi, W. Burleson
As CMOS technology scales down towards nanometer regime and the supply voltage approaches the threshold voltage, increase in operating temperature results in increased circuit current, which in turn reduces circuit propagation delay. This paper exploits this new phenomenon, known as inverse thermal dependence (ITD) for power, performance, and temperature optimization in processor architecture. ITD changes the maximum achievable operating frequency of the processor at high temperatures. Dynamic thermal management techniques such as activity migration, dynamic voltage frequency scaling, and throttling are revisited in this paper, with a focus on the effect of ITD. Results are obtained using the predictive technology models of 7nm, 10nm 14nm and 20nm technology nodes and with extensive architectural and circuit simulations. The results show that based on the design goals, various design corners should be re-investigated for power, performance and energy-efficiency optimization. Architectural simulations for a multi-core processor and across standard benchmarks show that utilizing ITD-aware schemes for thermal management improves the performance of the processor in terms of speed and energy-delay-product by 8.55% and 4.4%, respectively.
随着CMOS技术向纳米级方向缩小,电源电压接近阈值电压,工作温度的增加导致电路电流的增加,从而减少了电路的传播延迟。本文利用这种被称为逆热依赖(ITD)的新现象来优化处理器架构中的功率、性能和温度。ITD改变了处理器在高温下可达到的最大工作频率。动态热管理技术,如活度迁移,动态电压频率缩放和节流在本文中被重新审视,重点是过渡段的影响。研究结果采用7nm、10nm、14nm和20nm技术节点的预测技术模型,并进行了大量的架构和电路仿真。结果表明,在设计目标的基础上,应重新考察各个设计角落,进行功率、性能和能效优化。对多核处理器的架构模拟和跨标准基准测试表明,利用可感知过渡段的热管理方案在速度和能量延迟产品方面分别提高了8.55%和4.4%的处理器性能。
{"title":"Revisiting Dynamic Thermal Management Exploiting Inverse Thermal Dependence","authors":"Katayoun Neshatpour, H. Homayoun, A. Djahromi, W. Burleson","doi":"10.1145/2742060.2742086","DOIUrl":"https://doi.org/10.1145/2742060.2742086","url":null,"abstract":"As CMOS technology scales down towards nanometer regime and the supply voltage approaches the threshold voltage, increase in operating temperature results in increased circuit current, which in turn reduces circuit propagation delay. This paper exploits this new phenomenon, known as inverse thermal dependence (ITD) for power, performance, and temperature optimization in processor architecture. ITD changes the maximum achievable operating frequency of the processor at high temperatures. Dynamic thermal management techniques such as activity migration, dynamic voltage frequency scaling, and throttling are revisited in this paper, with a focus on the effect of ITD. Results are obtained using the predictive technology models of 7nm, 10nm 14nm and 20nm technology nodes and with extensive architectural and circuit simulations. The results show that based on the design goals, various design corners should be re-investigated for power, performance and energy-efficiency optimization. Architectural simulations for a multi-core processor and across standard benchmarks show that utilizing ITD-aware schemes for thermal management improves the performance of the processor in terms of speed and energy-delay-product by 8.55% and 4.4%, respectively.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132388251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Session details: Keynote IV 会议详情:主题演讲四
Pub Date : 2015-05-20 DOI: 10.1145/3254021
M. Margala
{"title":"Session details: Keynote IV","authors":"M. Margala","doi":"10.1145/3254021","DOIUrl":"https://doi.org/10.1145/3254021","url":null,"abstract":"","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"133 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131658193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Bitstream Length Scaling Energy Effective Stochastic LDPC Decoding 动态比特流长度缩放能量有效随机LDPC解码
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742117
T. Marconi, S. Cotofana
Stochastic Computing (SC) is an attractive solution for implementing Low Density Parity Codes (LDPC) decoders due to its fault tolerance capability and low hardware requirements. However, in practical implementations, SC efficiency is limited by the Stochastic Bitstream (SB) length and by the computation inaccuracies due to non-unique SB representations. In this paper, rather than statically fixing the SB length at run-time, we propose a Dynamic Bitstream Length Scaling (DBLS) technique, which adjusts on-the-fly the SB length such that Quality of Service requirements for energy efficient LDPC decoding are fulfilled. In this way, depending on the communication channel condition, different SB lengths are adaptively utilized such that the best decoding performance vs energy consumption tradeoff is achieved. To evaluate the DBLS practical implications we selected an (1296,648) LDPC with dv=3 and dc=6 and implemented our approach and the best state-of-the-art stochastic LDPC decoder with 64-bit edge memory on a Virtex-7 FPGA. Experimental results indicate that our proposal requires 9% more FFs and 3% more LUTs while diminishing the energy consumption by 31-80% and providing 1.5-5.1x higher throughput.
随机计算(SC)由于其容错能力和低硬件要求而成为实现低密度奇偶码(LDPC)解码器的一种有吸引力的解决方案。然而,在实际实现中,SC的效率受到随机比特流(SB)长度和由于非唯一的SB表示而导致的计算不准确性的限制。在本文中,我们提出了一种动态比特流长度缩放(DBLS)技术,而不是在运行时静态固定SB长度,该技术可以动态调整SB长度,从而满足高效LDPC解码的服务质量要求。通过这种方式,根据通信信道条件,自适应地利用不同的SB长度,从而实现最佳的解码性能与能耗权衡。为了评估DBLS的实际意义,我们选择了一个dv=3和dc=6的(1296,648)LDPC,并在Virtex-7 FPGA上实现了我们的方法和最先进的具有64位边缘内存的随机LDPC解码器。实验结果表明,我们的方案需要增加9%的ff和3%的lut,同时降低31-80%的能耗,并提供1.5-5.1倍的高吞吐量。
{"title":"Dynamic Bitstream Length Scaling Energy Effective Stochastic LDPC Decoding","authors":"T. Marconi, S. Cotofana","doi":"10.1145/2742060.2742117","DOIUrl":"https://doi.org/10.1145/2742060.2742117","url":null,"abstract":"Stochastic Computing (SC) is an attractive solution for implementing Low Density Parity Codes (LDPC) decoders due to its fault tolerance capability and low hardware requirements. However, in practical implementations, SC efficiency is limited by the Stochastic Bitstream (SB) length and by the computation inaccuracies due to non-unique SB representations. In this paper, rather than statically fixing the SB length at run-time, we propose a Dynamic Bitstream Length Scaling (DBLS) technique, which adjusts on-the-fly the SB length such that Quality of Service requirements for energy efficient LDPC decoding are fulfilled. In this way, depending on the communication channel condition, different SB lengths are adaptively utilized such that the best decoding performance vs energy consumption tradeoff is achieved. To evaluate the DBLS practical implications we selected an (1296,648) LDPC with dv=3 and dc=6 and implemented our approach and the best state-of-the-art stochastic LDPC decoder with 64-bit edge memory on a Virtex-7 FPGA. Experimental results indicate that our proposal requires 9% more FFs and 3% more LUTs while diminishing the energy consumption by 31-80% and providing 1.5-5.1x higher throughput.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130766258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 25th edition on Great Lakes Symposium on VLSI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1