首页 > 最新文献

Proceedings of the 25th edition on Great Lakes Symposium on VLSI最新文献

英文 中文
Skew Bounded Buffer Tree Resynthesis For Clock Power Optimization 时钟功率优化的倾斜有界缓冲树重合成
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742119
Subhendu Roy, D. Pan, Pavlos M. Mattheakis, P. S. Colyer, L. Masse-Navette, Pierre-Olivier Ribet
With aggressive technology scaling in nanometer regime, a significant fraction of dynamic power is consumed in the clock network due to its high switching activity. Clock networks are typically synthesized and routed to optimize for zero clock skew. However, clock skew optimization is often accompanied with routing overhead which increases the clock net capacitance thereby consuming more power. In this paper, we propose a skew bounded buffer tree resynthesis algorithm to optimize clock net capacitance after the clock network has been synthesized and routed. Our algorithm restricts the skew of the designs within a specified margin from its original skew, and does not introduce any additional Design Rule Check (DRC) violation. Experimental results on industrial designs, with clock networks synthesized and routed by an industrial tool, have demonstrated that our approach can achieve an average reduction of 5.6% and 3.5% in clock net capacitance and clock dynamic power respectively with a marginal overhead in the clock skew.
随着技术在纳米范围内的积极扩展,时钟网络中由于其高开关活性而消耗了很大一部分动态功率。时钟网络通常被合成和路由以优化零时钟偏差。然而,时钟倾斜优化通常伴随着路由开销,这会增加时钟净电容,从而消耗更多的功率。在本文中,我们提出了一种倾斜有界缓冲树重合成算法来优化时钟网络合成和路由后的时钟网络电容。我们的算法将设计的斜度限制在其原始斜度的指定余量内,并且不引入任何额外的设计规则检查(DRC)违规。用工业工具合成和路由时钟网络的工业设计实验结果表明,我们的方法可以实现时钟净电容和时钟动态功率分别平均降低5.6%和3.5%,时钟倾斜的边际开销很小。
{"title":"Skew Bounded Buffer Tree Resynthesis For Clock Power Optimization","authors":"Subhendu Roy, D. Pan, Pavlos M. Mattheakis, P. S. Colyer, L. Masse-Navette, Pierre-Olivier Ribet","doi":"10.1145/2742060.2742119","DOIUrl":"https://doi.org/10.1145/2742060.2742119","url":null,"abstract":"With aggressive technology scaling in nanometer regime, a significant fraction of dynamic power is consumed in the clock network due to its high switching activity. Clock networks are typically synthesized and routed to optimize for zero clock skew. However, clock skew optimization is often accompanied with routing overhead which increases the clock net capacitance thereby consuming more power. In this paper, we propose a skew bounded buffer tree resynthesis algorithm to optimize clock net capacitance after the clock network has been synthesized and routed. Our algorithm restricts the skew of the designs within a specified margin from its original skew, and does not introduce any additional Design Rule Check (DRC) violation. Experimental results on industrial designs, with clock networks synthesized and routed by an industrial tool, have demonstrated that our approach can achieve an average reduction of 5.6% and 3.5% in clock net capacitance and clock dynamic power respectively with a marginal overhead in the clock skew.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115484157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Session details: VLSI Design 会议详情:VLSI设计
Pub Date : 2015-05-20 DOI: 10.1145/3254019
E. Salman
{"title":"Session details: VLSI Design","authors":"E. Salman","doi":"10.1145/3254019","DOIUrl":"https://doi.org/10.1145/3254019","url":null,"abstract":"","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114307790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Bit-Nibble-Byte MicroEngine (BnB) for Efficient Computing on Short Data 用于短数据高效计算的Bit-Nibble-Byte微引擎(BnB
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742106
Dilip P. Vasudevan, A. Chien
Energy is a critical challenge in computing performance. Due to "word size creep" from modern CPUs are inefficient for short-data element processing. We propose and evaluate a new microarchitecture called "Bit-Nibble-Byte"(BnB). We describe our design which includes both long fixed point vectors and as well as novel variable length instructions. Together, these features provide energy and performance benefits on a wide range of applications. We evaluate BnB with a detailed design of 5 vector sizes (128,256,512,1024,2048) mapped into 32nm and 7nm transistor technologies, and in combination with a variety of memory systems (DDR3 and HMC). The evaluation is based on both handwritten and compiled code with a custom compiler built for BnB. Our results include significant performance (19x-252x) and energy benefits (5.6x-140.7x) for short bit-field operations typically assumed to require hardwired accelerators and large-scale applications with compiled code.
能源是计算性能的一个关键挑战。由于“字长蠕变”,现代cpu对于短数据元素的处理效率很低。我们提出并评估了一种新的微架构,称为“Bit-Nibble-Byte”(BnB)。我们描述了我们的设计,其中包括长定点向量和新颖的可变长度指令。总之,这些特性为广泛的应用程序提供了能源和性能优势。我们通过详细设计5种矢量尺寸(128,256,512,1024,2048)映射到32nm和7nm晶体管技术,并结合各种存储系统(DDR3和HMC)来评估BnB。计算基于手写和编译的代码,并使用为BnB构建的自定义编译器。我们的结果包括显著的性能(19x-252x)和能源效益(5.6x-140.7x)对于通常假定需要硬连接加速器和具有编译代码的大规模应用程序的短位域操作。
{"title":"The Bit-Nibble-Byte MicroEngine (BnB) for Efficient Computing on Short Data","authors":"Dilip P. Vasudevan, A. Chien","doi":"10.1145/2742060.2742106","DOIUrl":"https://doi.org/10.1145/2742060.2742106","url":null,"abstract":"Energy is a critical challenge in computing performance. Due to \"word size creep\" from modern CPUs are inefficient for short-data element processing. We propose and evaluate a new microarchitecture called \"Bit-Nibble-Byte\"(BnB). We describe our design which includes both long fixed point vectors and as well as novel variable length instructions. Together, these features provide energy and performance benefits on a wide range of applications. We evaluate BnB with a detailed design of 5 vector sizes (128,256,512,1024,2048) mapped into 32nm and 7nm transistor technologies, and in combination with a variety of memory systems (DDR3 and HMC). The evaluation is based on both handwritten and compiled code with a custom compiler built for BnB. Our results include significant performance (19x-252x) and energy benefits (5.6x-140.7x) for short bit-field operations typically assumed to require hardwired accelerators and large-scale applications with compiled code.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115063470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Improving Lifetime of Multicore Soft Real-Time Systems through Global Utilization Control 利用全局利用率控制提高多核软实时系统的寿命
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742113
Yuexi Ma, Thidapat Chantem, X. Hu, R. Dick
System lifetime reliability is an important design consideration for many real-time embedded systems. Increasing integrated circuit power density and the subsequent rise in chip temperature negatively impact the lifetime reliability of such systems. Although existing thermal-aware methods are effective in reducing temperature, they cannot increase, and may even hamper, the system lifetime reliability. The complicated relationship between temperature and system lifetime requires that reliability be considered explicitly during system design. This paper presents a reliability-aware utilization control framework for homogeneous multicore soft real-time systems. The framework employs a model predictive controller to increase the system lifetime by manipulating the utilization of real-time tasks. An online heuristic algorithm is introduced to adjust the controller's sampling window in order to reduce the effects of thermal cycling on reliability. Simulation results show that the proposed approach can improve the system mean time to failure by at least 43% and as much as 369% compared to existing techniques.
系统寿命可靠性是许多实时嵌入式系统设计的重要考虑因素。集成电路功率密度的增加和随后芯片温度的升高会对此类系统的寿命可靠性产生负面影响。虽然现有的热感知方法在降低温度方面是有效的,但它们不能提高甚至可能妨碍系统寿命的可靠性。温度与系统寿命之间的复杂关系要求在系统设计中明确考虑可靠性。提出了一种同构多核软实时系统的可靠性感知利用控制框架。该框架采用模型预测控制器,通过控制实时任务的利用率来延长系统寿命。为了降低热循环对可靠性的影响,引入了一种在线启发式算法来调整控制器的采样窗口。仿真结果表明,与现有技术相比,该方法可将系统平均无故障时间提高至少43%,最高可提高369%。
{"title":"Improving Lifetime of Multicore Soft Real-Time Systems through Global Utilization Control","authors":"Yuexi Ma, Thidapat Chantem, X. Hu, R. Dick","doi":"10.1145/2742060.2742113","DOIUrl":"https://doi.org/10.1145/2742060.2742113","url":null,"abstract":"System lifetime reliability is an important design consideration for many real-time embedded systems. Increasing integrated circuit power density and the subsequent rise in chip temperature negatively impact the lifetime reliability of such systems. Although existing thermal-aware methods are effective in reducing temperature, they cannot increase, and may even hamper, the system lifetime reliability. The complicated relationship between temperature and system lifetime requires that reliability be considered explicitly during system design. This paper presents a reliability-aware utilization control framework for homogeneous multicore soft real-time systems. The framework employs a model predictive controller to increase the system lifetime by manipulating the utilization of real-time tasks. An online heuristic algorithm is introduced to adjust the controller's sampling window in order to reduce the effects of thermal cycling on reliability. Simulation results show that the proposed approach can improve the system mean time to failure by at least 43% and as much as 369% compared to existing techniques.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115067897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Session details: Keynote II 会议详情:主题演讲二
Pub Date : 2015-05-20 DOI: 10.1145/3254018
Alex K. Jones
{"title":"Session details: Keynote II","authors":"Alex K. Jones","doi":"10.1145/3254018","DOIUrl":"https://doi.org/10.1145/3254018","url":null,"abstract":"","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116962747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Voltage-Boosted Synchronizers Voltage-Boosted某个
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742075
Yaoqiang Li, P. Chuang, A. Kennings, M. Sachdev
With a specified Mean Timing Between Failure (MTBF), the metastability resolution time of synchronizers possibly constrains the system performance. To enhance metastability resolution time under single low-voltage supply environments, Voltage-Boosted Synchronizers (VBSs) consisting of a basic minimum-sized Jamb latch and a switched-capacitor-based charge pump are proposed. The capacitor of the charge pump is sized 13 times the area of the Jamb latch. Two powering strategies of the charge pump, namely Metastability-driven VBS (MVBS) and Clock-driven VBS (CVBS), are proposed. For a 1-year MTBF specification, MVBS and CVBS show 2.0-2.7 and 5.1-9.8 times the performance improvement over the basic Jamb latch, respectively, without incurring large power consumption.
在一定的平均故障间隔时间(MTBF)下,同步器的亚稳态分辨时间可能会制约系统的性能。为了提高在单一低压供电环境下的亚稳态分辨率时间,提出了电压提升同步器(VBSs),该同步器由一个基本的最小尺寸门闩锁紧器和一个基于开关电容的电荷泵组成。电荷泵的电容尺寸是门框闩锁面积的13倍。提出了两种电荷泵的供电策略:亚稳态驱动VBS (Metastability-driven VBS)和时钟驱动VBS (Clock-driven VBS)。在1年的MTBF规格下,MVBS和CVBS的性能分别是基本门闩的2.0-2.7倍和5.1-9.8倍,且功耗不高。
{"title":"Voltage-Boosted Synchronizers","authors":"Yaoqiang Li, P. Chuang, A. Kennings, M. Sachdev","doi":"10.1145/2742060.2742075","DOIUrl":"https://doi.org/10.1145/2742060.2742075","url":null,"abstract":"With a specified Mean Timing Between Failure (MTBF), the metastability resolution time of synchronizers possibly constrains the system performance. To enhance metastability resolution time under single low-voltage supply environments, Voltage-Boosted Synchronizers (VBSs) consisting of a basic minimum-sized Jamb latch and a switched-capacitor-based charge pump are proposed. The capacitor of the charge pump is sized 13 times the area of the Jamb latch. Two powering strategies of the charge pump, namely Metastability-driven VBS (MVBS) and Clock-driven VBS (CVBS), are proposed. For a 1-year MTBF specification, MVBS and CVBS show 2.0-2.7 and 5.1-9.8 times the performance improvement over the basic Jamb latch, respectively, without incurring large power consumption.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123434943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mammalian Synthetic Gene Networks 哺乳动物合成基因网络
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2743764
J. Lohmueller
Synthetic gene networks enable the programming of living cells to perform novel behaviors. Mammalian synthetic gene networks have largely been used as research tools to probe cellular function and more recently to engineer therapeutic capabilities. To create these networks researchers have developed a vast array of DNA -encoded parts that can serve as sensors, computational regulators, and actuators. Many of these gene circuit components have varying temporal characteristics and sensitivities making them well - suited for engineering systems that can act on different time scales and at different molecular concentrations. These components have been combined to create increasingly complex gene circuits. Major challenges for engineering mammalian synthetic gene networks include further improving scalability and predictability. Recent technological advancements in site - directed genome engineering and programmable DNA binding domains will likely aid in addressing these issues. Other important future directions will include incorporating new regulators that act at the levels of chromatin remodeling and DNA methylation and the division of computational loads among different cell types with population - based computing.
合成基因网络使活细胞的编程能够执行新的行为。哺乳动物合成基因网络在很大程度上被用作探测细胞功能的研究工具,最近还被用于设计治疗能力。为了创建这些网络,研究人员开发了大量的DNA编码部件,这些部件可以作为传感器、计算调节器和执行器。许多这些基因电路组件具有不同的时间特征和灵敏度,使它们非常适合于可以在不同的时间尺度和不同的分子浓度下工作的工程系统。这些成分组合在一起,形成了越来越复杂的基因回路。哺乳动物合成基因网络工程面临的主要挑战包括进一步提高可扩展性和可预测性。最近在定点基因组工程和可编程DNA结合域方面的技术进步可能有助于解决这些问题。其他重要的未来方向将包括在染色质重塑和DNA甲基化水平上结合新的调节因子,以及在不同细胞类型之间使用基于群体的计算划分计算负荷。
{"title":"Mammalian Synthetic Gene Networks","authors":"J. Lohmueller","doi":"10.1145/2742060.2743764","DOIUrl":"https://doi.org/10.1145/2742060.2743764","url":null,"abstract":"Synthetic gene networks enable the programming of living cells to perform novel behaviors. Mammalian synthetic gene networks have largely been used as research tools to probe cellular function and more recently to engineer therapeutic capabilities. To create these networks researchers have developed a vast array of DNA -encoded parts that can serve as sensors, computational regulators, and actuators. Many of these gene circuit components have varying temporal characteristics and sensitivities making them well - suited for engineering systems that can act on different time scales and at different molecular concentrations. These components have been combined to create increasingly complex gene circuits. Major challenges for engineering mammalian synthetic gene networks include further improving scalability and predictability. Recent technological advancements in site - directed genome engineering and programmable DNA binding domains will likely aid in addressing these issues. Other important future directions will include incorporating new regulators that act at the levels of chromatin remodeling and DNA methylation and the division of computational loads among different cell types with population - based computing.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123681294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental Validation of a Faithful Binary Circuit Model 忠实二进制电路模型的实验验证
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742081
Robert Najvirt, U. Schmid, M. Hofbauer, Matthias Függer, Thomas Nowak, K. Schweiger
Fast digital timing simulations based on continuous-time, digital-value circuit models are an attractive and heavily used alternative to analog simulations. Models based on analytic delay formulas are particularly interesting here, as they also facilitate formal verification and delay bound synthesis of complex circuits. Recently, Függer et al. (arXiv:1406.2544 [cs.OH]) proposed a circuit model based on so-called involution channels. It is the first binary circuit model that realistically captures solvability of short-pulse filtration, a non-trivial glitch propagation problem related to building one-shot inertial delays. In this work, we address the question of whether involution channels also accurately model the delay of real circuits. Using both Spice simulations and physical measurements, we confirm that modeling an inverter chain by involution channels accurately describes reality. We also demonstrate that transitions in vanishing pulse trains are accurately predicted by the involution model. For our Spice simulations, we used both UMC-90 and UMC-65 technology, with varying supply voltages from nominal down to near sub-threshold range. The measurements were performed on a special-purpose UMC-90 ASIC that combines an inverter chain with low-intrusive high-speed on-chip analog amplifiers.
基于连续时间,数字值电路模型的快速数字时序仿真是模拟仿真的一个有吸引力和广泛使用的替代方案。基于解析延迟公式的模型在这里特别有趣,因为它们也有助于复杂电路的形式化验证和延迟界合成。最近,f gger等人(arXiv:1406.2544 [c . oh])提出了一种基于所谓对合通道的电路模型。这是第一个实际捕获短脉冲滤波可解性的二进制电路模型,短脉冲滤波是一个与建立单次惯性延迟有关的重要故障传播问题。在这项工作中,我们解决了对合通道是否也能准确地模拟真实电路的延迟的问题。通过Spice仿真和物理测量,我们证实了通过对合通道建模逆变器链准确地描述了现实情况。我们还证明了用对合模型可以准确地预测消失脉冲串中的跃迁。对于Spice模拟,我们使用了UMC-90和UMC-65技术,其电源电压从标称降至接近亚阈值范围。测量是在专用的UMC-90 ASIC上进行的,该ASIC结合了逆变器链和低干扰高速片上模拟放大器。
{"title":"Experimental Validation of a Faithful Binary Circuit Model","authors":"Robert Najvirt, U. Schmid, M. Hofbauer, Matthias Függer, Thomas Nowak, K. Schweiger","doi":"10.1145/2742060.2742081","DOIUrl":"https://doi.org/10.1145/2742060.2742081","url":null,"abstract":"Fast digital timing simulations based on continuous-time, digital-value circuit models are an attractive and heavily used alternative to analog simulations. Models based on analytic delay formulas are particularly interesting here, as they also facilitate formal verification and delay bound synthesis of complex circuits. Recently, Függer et al. (arXiv:1406.2544 [cs.OH]) proposed a circuit model based on so-called involution channels. It is the first binary circuit model that realistically captures solvability of short-pulse filtration, a non-trivial glitch propagation problem related to building one-shot inertial delays. In this work, we address the question of whether involution channels also accurately model the delay of real circuits. Using both Spice simulations and physical measurements, we confirm that modeling an inverter chain by involution channels accurately describes reality. We also demonstrate that transitions in vanishing pulse trains are accurately predicted by the involution model. For our Spice simulations, we used both UMC-90 and UMC-65 technology, with varying supply voltages from nominal down to near sub-threshold range. The measurements were performed on a special-purpose UMC-90 ASIC that combines an inverter chain with low-intrusive high-speed on-chip analog amplifiers.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"191 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127421400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Flip-Mirror-Rotate: An Architecture for Bit-write Reduction and Wear Leveling in Non-volatile Memories 翻转镜像旋转:非易失性存储器中减少比特写入和损耗均衡的体系结构
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742110
Poovaiah M. Palangappa, K. Mohanram
This paper proposes Flip-Mirror-Rotate (FMR), an architecture for bit-write reduction and endurance enhancement in emerging non-volatile memories (NVMs). FMR comprises three components: adaptive Flip-N-Write (aFNW), Mirror-N-Write (MNW), and Rotate-N-Write (RNW). aFNW and MNW focus on word-level bit-write reduction, which reduces NVM dynamic energy while also improving endurance. RNW is an intra-word wear leveling scheme that operates at cache line granularity. The proposed FMR architecture is integrated with frequent pattern compression (FPC) to simultaneously reduce bit-writes and wear in NVMs. Trace-based simulations of the SPEC CPU2006 benchmarks show that for the same memory overhead and < 1% loss in memory bandwidth, FMR reduces bit-writes (dynamic energy) by 48% (29%) in comparison to classical read-modify-write (DCW), 39% (13%) in comparison to Flip-N-Write (FNW), and 21% (14%) in comparison to FPC. Simultaneously, FMR also reduces the peak bit-writes per cell by 47% in comparison to DCW, 34% in comparison to FNW, and 47% in comparison to FPC, improving NVM endurance.
本文提出了一种用于减少新兴非易失性存储器(nvm)的比特写入和增强持久性的结构——翻转镜像旋转(FMR)。FMR由三个部分组成:自适应翻转- n -写(aFNW)、镜像- n -写(MNW)和旋转- n -写(RNW)。aFNW和MNW侧重于字级比特写减少,这降低了NVM的动态能量,同时也提高了耐用性。RNW是一种按缓存线粒度操作的字内损耗均衡方案。提出的FMR架构与频繁模式压缩(FPC)相结合,可以同时减少nvm中的比特写入和损耗。基于跟踪的SPEC CPU2006基准测试模拟表明,对于相同的内存开销和< 1%的内存带宽损失,FMR与传统的读-修改-写(DCW)相比减少了48%(29%)的写位(动态能量),与翻转- n -写(FNW)相比减少了39%(13%),与FPC相比减少了21%(14%)。同时,与DCW相比,FMR还将每个单元的峰值比特写入减少了47%,与FNW相比减少了34%,与FPC相比减少了47%,从而提高了NVM的耐用性。
{"title":"Flip-Mirror-Rotate: An Architecture for Bit-write Reduction and Wear Leveling in Non-volatile Memories","authors":"Poovaiah M. Palangappa, K. Mohanram","doi":"10.1145/2742060.2742110","DOIUrl":"https://doi.org/10.1145/2742060.2742110","url":null,"abstract":"This paper proposes Flip-Mirror-Rotate (FMR), an architecture for bit-write reduction and endurance enhancement in emerging non-volatile memories (NVMs). FMR comprises three components: adaptive Flip-N-Write (aFNW), Mirror-N-Write (MNW), and Rotate-N-Write (RNW). aFNW and MNW focus on word-level bit-write reduction, which reduces NVM dynamic energy while also improving endurance. RNW is an intra-word wear leveling scheme that operates at cache line granularity. The proposed FMR architecture is integrated with frequent pattern compression (FPC) to simultaneously reduce bit-writes and wear in NVMs. Trace-based simulations of the SPEC CPU2006 benchmarks show that for the same memory overhead and < 1% loss in memory bandwidth, FMR reduces bit-writes (dynamic energy) by 48% (29%) in comparison to classical read-modify-write (DCW), 39% (13%) in comparison to Flip-N-Write (FNW), and 21% (14%) in comparison to FPC. Simultaneously, FMR also reduces the peak bit-writes per cell by 47% in comparison to DCW, 34% in comparison to FNW, and 47% in comparison to FPC, improving NVM endurance.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123362385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
A SystemC Platform for Signal Transduction Modelling and Simulation in Systems Biology 系统生物学中信号转导建模与仿真的SystemC平台
Pub Date : 2015-05-20 DOI: 10.1145/2742060.2742115
Rosario Distefano, F. Fummi, C. Laudanna, N. Bombieri, R. Giugno
Signal transduction is a class of cell's biological processes, which are commonly represented as highly concurrent reactive systems. In the Systems Biology community, modelling and simulation of signal transduction require overcoming issues like discrete event-based execution of complex systems, description from building blocks through composition and encapsulation, description at different levels of granularity, methods for abstraction and refinement. This paper presents a signal transduction modelling and simulation platform based on SystemC, and shows how the platform allows handling the system complexity by modelling it at different abstraction levels. The paper reports the results obtained by applying the platform to model the intracellular signalling network controlling integrin activation mediating leukocyte recruitment from the blood into the tissues.
信号转导是一类细胞的生物过程,通常表现为高度并发的反应系统。在系统生物学社区中,信号转导的建模和模拟需要克服诸如复杂系统的离散事件执行,从构建块到组合和封装的描述,不同粒度级别的描述,抽象和细化方法等问题。本文提出了一个基于SystemC的信号转导建模与仿真平台,并展示了该平台如何通过在不同抽象层次上对系统进行建模来处理系统复杂性。本文报道了应用该平台模拟控制整合素激活介导白细胞从血液向组织募集的细胞内信号网络所获得的结果。
{"title":"A SystemC Platform for Signal Transduction Modelling and Simulation in Systems Biology","authors":"Rosario Distefano, F. Fummi, C. Laudanna, N. Bombieri, R. Giugno","doi":"10.1145/2742060.2742115","DOIUrl":"https://doi.org/10.1145/2742060.2742115","url":null,"abstract":"Signal transduction is a class of cell's biological processes, which are commonly represented as highly concurrent reactive systems. In the Systems Biology community, modelling and simulation of signal transduction require overcoming issues like discrete event-based execution of complex systems, description from building blocks through composition and encapsulation, description at different levels of granularity, methods for abstraction and refinement. This paper presents a signal transduction modelling and simulation platform based on SystemC, and shows how the platform allows handling the system complexity by modelling it at different abstraction levels. The paper reports the results obtained by applying the platform to model the intracellular signalling network controlling integrin activation mediating leukocyte recruitment from the blood into the tissues.","PeriodicalId":255133,"journal":{"name":"Proceedings of the 25th edition on Great Lakes Symposium on VLSI","volume":"208 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123379850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings of the 25th edition on Great Lakes Symposium on VLSI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1