首页 > 最新文献

2012 IEEE 30th International Conference on Computer Design (ICCD)最新文献

英文 中文
Aurora: A thermally resilient photonic network-on-chip architecture Aurora:一种热弹性光子网络芯片架构
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378667
Amer Qouneh, Zhongqi Li, Madhura Joshi, Wangyuan Zhang, Xin Fu, Tao Li
With silicon optical technology moving towards maturity, the use of photonic network-on-chip (NoCs) for global chip communication is emerging as a promising solution to communication requirements of future many core processors. It is expected that photonic NoCs will play an important role in alleviating current power, latency, and bandwidth constraints. However, photonic NoCs are sensitive to ambient temperature variations because their basic constituents, ring resonators, are themselves sensitive to those variations. Since ring resonators are basic building blocks for photonic modulators, switches, multiplexers, and demultiplexers, variations of on-chip temperature pose serious challenges to the proper operation of photonic NoCs. Proposed methods that mitigate the effects of temperature at device level are either difficult to use in CMOS processes or not suitable for large scale implementation. In this paper, we propose Aurora, a thermally resilient photonic NoC architecture design that supports reliable and low bit error rate (BER) on-chip communications in the presence of large temperature variations. Our proposed architecture leverages solutions at both device and architecture layers that synergistically provide significant improvements. To compensate for small temperature variations, our design varies the bias current through ring resonators. For larger temperature variations, we propose architecture-level techniques to re-route messages away from hot regions, and through cooler regions, to their destinations, thereby lowering BER. Our simulation results show that Aurora provides a robust architectural solution to handle temperature variation effects on future photonic NoCs. For instance, average BER and message error rate (MER) are reduced by 78% and 30% respectively when the combined device and architectural technique (SPF) is applied. From the perspective of power efficiency, Aurora is also superior to conventional photonic NoC architectures by as much as 33%.
随着硅光技术的日趋成熟,利用光子片上网络(NoCs)进行全球芯片通信是满足未来多核心处理器通信需求的一种有前景的解决方案。预计光子noc将在缓解当前功率、延迟和带宽限制方面发挥重要作用。然而,光子noc对环境温度变化很敏感,因为它们的基本成分环谐振器本身对这些变化很敏感。由于环形谐振器是光子调制器、开关、多路复用器和解路复用器的基本组成部分,片上温度的变化对光子noc的正常运行构成了严重的挑战。所提出的减轻器件级温度影响的方法要么难以在CMOS工艺中使用,要么不适合大规模实施。在本文中,我们提出了Aurora,这是一种热弹性光子NoC架构设计,支持在存在大温度变化的情况下可靠和低误码率(BER)的片上通信。我们提出的架构利用了设备层和架构层的解决方案,这些解决方案协同提供了显著的改进。为了补偿小的温度变化,我们的设计通过环形谐振器改变偏置电流。对于较大的温度变化,我们提出了架构级技术,将消息从炎热地区重新路由到目的地,并通过较冷的地区,从而降低误码率。我们的模拟结果表明,Aurora提供了一个强大的架构解决方案来处理温度变化对未来光子noc的影响。例如,采用组合设备和结构技术(SPF)后,平均误码率和消息错误率分别降低了78%和30%。从功率效率的角度来看,Aurora也比传统的光子NoC架构高出33%。
{"title":"Aurora: A thermally resilient photonic network-on-chip architecture","authors":"Amer Qouneh, Zhongqi Li, Madhura Joshi, Wangyuan Zhang, Xin Fu, Tao Li","doi":"10.1109/ICCD.2012.6378667","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378667","url":null,"abstract":"With silicon optical technology moving towards maturity, the use of photonic network-on-chip (NoCs) for global chip communication is emerging as a promising solution to communication requirements of future many core processors. It is expected that photonic NoCs will play an important role in alleviating current power, latency, and bandwidth constraints. However, photonic NoCs are sensitive to ambient temperature variations because their basic constituents, ring resonators, are themselves sensitive to those variations. Since ring resonators are basic building blocks for photonic modulators, switches, multiplexers, and demultiplexers, variations of on-chip temperature pose serious challenges to the proper operation of photonic NoCs. Proposed methods that mitigate the effects of temperature at device level are either difficult to use in CMOS processes or not suitable for large scale implementation. In this paper, we propose Aurora, a thermally resilient photonic NoC architecture design that supports reliable and low bit error rate (BER) on-chip communications in the presence of large temperature variations. Our proposed architecture leverages solutions at both device and architecture layers that synergistically provide significant improvements. To compensate for small temperature variations, our design varies the bias current through ring resonators. For larger temperature variations, we propose architecture-level techniques to re-route messages away from hot regions, and through cooler regions, to their destinations, thereby lowering BER. Our simulation results show that Aurora provides a robust architectural solution to handle temperature variation effects on future photonic NoCs. For instance, average BER and message error rate (MER) are reduced by 78% and 30% respectively when the combined device and architectural technique (SPF) is applied. From the perspective of power efficiency, Aurora is also superior to conventional photonic NoC architectures by as much as 33%.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129171697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Fast error aware model for arithmetic and logic circuits 用于算术和逻辑电路的快速错误感知模型
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378659
Samy Zaynoun, Muhammed S. Khairy, A. Eltawil, F. Kurdahi, A. Djahromi
As a result of supply voltage reduction and process variations effects, the error free margin for dynamic voltage scaling has been drastically reduced. This paper presents an error aware model for arithmetic and logic circuits that accurately and rapidly estimates the propagation delays of the output bits in a digital block operating under voltage scaling to identify circuit-level failures (timing violations) within the block. Consequently, these failure models are then used to examine how circuit-level failures affect system-level reliability. A case study consisting of a CORDIC DSP unit employing the proposed model provides tradeoffs between power, performance and reliability.
由于电源电压降低和工艺变化的影响,动态电压缩放的无误差范围大大减少。本文提出了一种用于算术和逻辑电路的错误感知模型,该模型可以准确快速地估计在电压标度下工作的数字块中输出位的传播延迟,以识别块内的电路级故障(时序违规)。因此,这些故障模型随后被用于检查电路级故障如何影响系统级可靠性。一个由CORDIC DSP单元组成的案例研究采用了所提出的模型,提供了功耗、性能和可靠性之间的权衡。
{"title":"Fast error aware model for arithmetic and logic circuits","authors":"Samy Zaynoun, Muhammed S. Khairy, A. Eltawil, F. Kurdahi, A. Djahromi","doi":"10.1109/ICCD.2012.6378659","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378659","url":null,"abstract":"As a result of supply voltage reduction and process variations effects, the error free margin for dynamic voltage scaling has been drastically reduced. This paper presents an error aware model for arithmetic and logic circuits that accurately and rapidly estimates the propagation delays of the output bits in a digital block operating under voltage scaling to identify circuit-level failures (timing violations) within the block. Consequently, these failure models are then used to examine how circuit-level failures affect system-level reliability. A case study consisting of a CORDIC DSP unit employing the proposed model provides tradeoffs between power, performance and reliability.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122395326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Engineering crossbar based emerging memory technologies 基于工程交叉杆的新兴存储技术
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378682
Sachhidh Kannan, Jeyavijayan Rajendran, R. Karri, O. Sinanoglu
Emerging Resistive Random Access Memories (RRAM) devices are an attractive option for future memory architectures due to their low-power and high density. However, their capacity is limited by sneak paths and the sensitivity of the sense amplifiers (SA). We develop a framework to maximize the capacity of RRAM memories by modeling the interactions between memory capacity, sneak paths, device parameters, and the sense amplifier. The framework explores the design space of the memory by considering different read/write mechanisms, sneak path elimination techniques, and multi-level storage.
新兴的电阻随机存取存储器(RRAM)器件由于其低功耗和高密度而成为未来存储器架构的一个有吸引力的选择。然而,它们的能力受到潜行路径和感测放大器(SA)灵敏度的限制。我们开发了一个框架,通过模拟存储器容量,潜行路径,器件参数和感测放大器之间的相互作用来最大化RRAM存储器的容量。该框架通过考虑不同的读/写机制、潜行路径消除技术和多级存储来探索内存的设计空间。
{"title":"Engineering crossbar based emerging memory technologies","authors":"Sachhidh Kannan, Jeyavijayan Rajendran, R. Karri, O. Sinanoglu","doi":"10.1109/ICCD.2012.6378682","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378682","url":null,"abstract":"Emerging Resistive Random Access Memories (RRAM) devices are an attractive option for future memory architectures due to their low-power and high density. However, their capacity is limited by sneak paths and the sensitivity of the sense amplifiers (SA). We develop a framework to maximize the capacity of RRAM memories by modeling the interactions between memory capacity, sneak paths, device parameters, and the sense amplifier. The framework explores the design space of the memory by considering different read/write mechanisms, sneak path elimination techniques, and multi-level storage.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132798719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
3D-NoC: Reconfigurable 3D photonic on-chip interconnect for multicores 3D- noc:用于多核的可重构3D光子片上互连
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378672
R. Morris, Avinash Karanth Kodi, A. Louri
The power dissipation of metallic interconnects in future multicore architectures is projected to be a major bottleneck as we scale to sub-nanometer regime. This has motivated researchers to develop alternate power-efficient technology solutions to the performance limitations of future multicores. Nanophotonic interconnects (NIs) is a disruptive technology solution that is capable of delivering the communication bandwidth at low power dissipation when the number of cores is scaled to large numbers. Similarly, 3D stacking is another interconnect technology solution that can lead to low energy/bit for communication. In this paper, we propose to combine NIs with with 3D stacking to develop a scalable, reconfigurable, power-efficient and high-performance interconnect for future many-core systems called 3D-NoC. We propose to develop a multi-layer NIs that can dynamically reconfigure without system intervention and allocate channel bandwidth from less utilized links to more utilized communication links. Our simulation results indicate that the performance can be further improved by 10%-25% for Splash-2, PARSEC and SPEC CPU2006 benchmarks.
在未来的多核架构中,金属互连的功耗预计将成为亚纳米体系结构的主要瓶颈。这促使研究人员开发替代的节能技术解决方案,以解决未来多核的性能限制。纳米光子互连(NIs)是一种颠覆性的技术解决方案,能够在核数扩展到较大数量时以低功耗提供通信带宽。同样,3D堆叠是另一种互连技术解决方案,可以实现低能量/比特的通信。在本文中,我们建议将NIs与3D堆叠相结合,为未来的多核系统开发可扩展,可重构,节能和高性能的互连,称为3D- noc。我们建议开发一种多层NIs,它可以在没有系统干预的情况下动态重新配置,并将信道带宽从利用率较低的链路分配给利用率较高的通信链路。我们的仿真结果表明,在Splash-2、PARSEC和SPEC CPU2006基准测试中,性能可以进一步提高10%-25%。
{"title":"3D-NoC: Reconfigurable 3D photonic on-chip interconnect for multicores","authors":"R. Morris, Avinash Karanth Kodi, A. Louri","doi":"10.1109/ICCD.2012.6378672","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378672","url":null,"abstract":"The power dissipation of metallic interconnects in future multicore architectures is projected to be a major bottleneck as we scale to sub-nanometer regime. This has motivated researchers to develop alternate power-efficient technology solutions to the performance limitations of future multicores. Nanophotonic interconnects (NIs) is a disruptive technology solution that is capable of delivering the communication bandwidth at low power dissipation when the number of cores is scaled to large numbers. Similarly, 3D stacking is another interconnect technology solution that can lead to low energy/bit for communication. In this paper, we propose to combine NIs with with 3D stacking to develop a scalable, reconfigurable, power-efficient and high-performance interconnect for future many-core systems called 3D-NoC. We propose to develop a multi-layer NIs that can dynamically reconfigure without system intervention and allocate channel bandwidth from less utilized links to more utilized communication links. Our simulation results indicate that the performance can be further improved by 10%-25% for Splash-2, PARSEC and SPEC CPU2006 benchmarks.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114988712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
A physical unclonable function based on setup time violation 基于设置时间冲突的物理不可克隆函数
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378630
D. Hély, Maurin Augagneur, Y. Clauzel, Jeremy Dubeuf
In this work, we propose a physical unclonable function (i.e. PUF) based on setup time violations. The paper presents the way the PUF has been developed detailing the successive design iterations. This work has been developed by Grenoble INP students (J. Dubeuf, M. Augagneur and Y. Clauzel) during the secure IC design course at Grenoble INP Esisar for the Embedded System Challenge during the Cyber Security Awareness Week (CSAW) 2011 organized by Polytechnic Institute of New York University.
在这项工作中,我们提出了一个基于设置时间违规的物理不可克隆函数(即PUF)。本文介绍了PUF的开发方法,详细介绍了连续的设计迭代。这项工作是由格勒诺布尔INP学生(J. Dubeuf, M. Augagneur和Y. Clauzel)在格勒诺布尔INP Esisar的安全IC设计课程中开发的,该课程是为了参加由纽约大学理工学院组织的2011年网络安全意识周(CSAW)期间的嵌入式系统挑战。
{"title":"A physical unclonable function based on setup time violation","authors":"D. Hély, Maurin Augagneur, Y. Clauzel, Jeremy Dubeuf","doi":"10.1109/ICCD.2012.6378630","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378630","url":null,"abstract":"In this work, we propose a physical unclonable function (i.e. PUF) based on setup time violations. The paper presents the way the PUF has been developed detailing the successive design iterations. This work has been developed by Grenoble INP students (J. Dubeuf, M. Augagneur and Y. Clauzel) during the secure IC design course at Grenoble INP Esisar for the Embedded System Challenge during the Cyber Security Awareness Week (CSAW) 2011 organized by Polytechnic Institute of New York University.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123082792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Automatic assertion extraction in gate-level simulation using GPGPUs 基于gpgpu的门级仿真中的自动断言提取
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378704
Shohei Ono, Takeshi Matsumoto, M. Fujita
In modern VLSI designs, assertions play an important role to understand design intention and ensure correctness of designs. In this paper, we consider to generate assertions from simulation results. This assertion extraction is performed by examining whether a logical relation is satisfied among a set of signals. We propose to accelerate it by utilizing a highly parallelized computation performed by GPGPUs. Through the experiments with designs from industry, our implementation on GPGPU runs 30 times faster than a software implementation.
在现代VLSI设计中,断言对于理解设计意图和确保设计的正确性起着重要的作用。在本文中,我们考虑从仿真结果中生成断言。这种断言提取是通过检查一组信号之间的逻辑关系是否满足来执行的。我们建议利用gpgpu执行的高度并行计算来加速它。通过与工业设计的实验,我们在GPGPU上的实现比软件实现快30倍。
{"title":"Automatic assertion extraction in gate-level simulation using GPGPUs","authors":"Shohei Ono, Takeshi Matsumoto, M. Fujita","doi":"10.1109/ICCD.2012.6378704","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378704","url":null,"abstract":"In modern VLSI designs, assertions play an important role to understand design intention and ensure correctness of designs. In this paper, we consider to generate assertions from simulation results. This assertion extraction is performed by examining whether a logical relation is satisfied among a set of signals. We propose to accelerate it by utilizing a highly parallelized computation performed by GPGPUs. Through the experiments with designs from industry, our implementation on GPGPU runs 30 times faster than a software implementation.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124970266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameterized free space redistribution for engineering change in placement of integrated circuits 工程中集成电路布局变化的参数化自由空间再分配
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378670
Taraneh Taghavi, Shyam Ramji, F. Musante, Suhasini Rege
In this paper we present a method for parameterized free space redistribution of a fragmented placement. The fragmentation problem arises in different contexts within the physical design automation, including post physical synthesis for filler cell insertion, incremental placement, timing optimization, and late mode ECO fix-ups. To address this problem, we apply a post-placement parameterized method of defragmentation. This method involves capturing a view of a given placement and modeling a dynamic programming problem to optimally maximize the amount of so-called useful free space as defined by a given set of parameters. The parameters act as constraints to preserve the row placement and order of the cells while minimizing the perturbation of the whole design for a successful timing and design closure. Experimental results demonstrate that by applying the proposed technique, on average, 9.7% increase in the number of inserted filler cells and 5.7% improvement in the success rate of incremental placement requests can be achieved with minimal or no impact on timing and wirelength. Moreover, when deployed in early mode buffering for timing optimization, this technique can result in 3% reduction in the number of paths with negative slacks.
本文提出了一种碎片化布局的参数化自由空间再分配方法。碎片问题出现在物理设计自动化的不同环境中,包括填充单元插入的后物理合成、增量放置、时间优化和后期模式ECO修复。为了解决这个问题,我们采用了一种放置后参数化的碎片整理方法。该方法包括捕获给定位置的视图,并对动态规划问题进行建模,以最优地最大化由给定参数集定义的所谓有用自由空间的数量。参数充当约束,以保持单元的行位置和顺序,同时最大限度地减少整个设计的扰动,以实现成功的定时和设计闭合。实验结果表明,应用该技术,在对时间和长度影响很小或没有影响的情况下,平均增加了9.7%的插入填充细胞数量和5.7%的增量放置请求成功率。此外,当在早期模式缓冲中部署以进行时间优化时,该技术可以使具有负松弛的路径数量减少3%。
{"title":"Parameterized free space redistribution for engineering change in placement of integrated circuits","authors":"Taraneh Taghavi, Shyam Ramji, F. Musante, Suhasini Rege","doi":"10.1109/ICCD.2012.6378670","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378670","url":null,"abstract":"In this paper we present a method for parameterized free space redistribution of a fragmented placement. The fragmentation problem arises in different contexts within the physical design automation, including post physical synthesis for filler cell insertion, incremental placement, timing optimization, and late mode ECO fix-ups. To address this problem, we apply a post-placement parameterized method of defragmentation. This method involves capturing a view of a given placement and modeling a dynamic programming problem to optimally maximize the amount of so-called useful free space as defined by a given set of parameters. The parameters act as constraints to preserve the row placement and order of the cells while minimizing the perturbation of the whole design for a successful timing and design closure. Experimental results demonstrate that by applying the proposed technique, on average, 9.7% increase in the number of inserted filler cells and 5.7% improvement in the success rate of incremental placement requests can be achieved with minimal or no impact on timing and wirelength. Moreover, when deployed in early mode buffering for timing optimization, this technique can result in 3% reduction in the number of paths with negative slacks.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122118962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical modeling of Phase Change memory for reliable design 面向可靠性设计的相变存储器分层建模
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378626
Zihan Xu, K. Sutaria, Chengen Yang, C. Chakrabarti, Yu Cao
As CMOS based memory devices near their end, memory technologies, such as Phase Change Random Access Memory (PRAM), have emerged as viable alternatives. This work develops a hierarchical modeling framework that connects the unique device physics of PRAM with its circuit and state transition properties. Such an approach enables design exploration at various levels in order to optimize the performance and yield. By providing a complete set of compact models, it supports SPICE simulation of PRAM in the presence of process variations and temporal degradation. Furthermore, this work proposes a new metric, State Transition Curve (STC) that supports the assessment of other performance metrics (e.g., power, speed, yield, etc.), helping gain valuable insights on PRAM reliability.
随着基于CMOS的存储设备接近尾声,存储技术,如相变随机存取存储器(PRAM),已经成为可行的替代方案。这项工作开发了一个分层建模框架,将PRAM的独特器件物理与其电路和状态转换属性联系起来。这种方法可以在不同的层次上进行设计探索,以优化性能和产量。通过提供一套完整的紧凑模型,它支持在工艺变化和时间退化存在的PRAM的SPICE模拟。此外,这项工作提出了一个新的指标,状态转换曲线(STC),支持评估其他性能指标(例如,功率,速度,产量等),有助于获得有关PRAM可靠性的宝贵见解。
{"title":"Hierarchical modeling of Phase Change memory for reliable design","authors":"Zihan Xu, K. Sutaria, Chengen Yang, C. Chakrabarti, Yu Cao","doi":"10.1109/ICCD.2012.6378626","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378626","url":null,"abstract":"As CMOS based memory devices near their end, memory technologies, such as Phase Change Random Access Memory (PRAM), have emerged as viable alternatives. This work develops a hierarchical modeling framework that connects the unique device physics of PRAM with its circuit and state transition properties. Such an approach enables design exploration at various levels in order to optimize the performance and yield. By providing a complete set of compact models, it supports SPICE simulation of PRAM in the presence of process variations and temporal degradation. Furthermore, this work proposes a new metric, State Transition Curve (STC) that supports the assessment of other performance metrics (e.g., power, speed, yield, etc.), helping gain valuable insights on PRAM reliability.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129760681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
HPRA: A pro-active Hotspot-Preventive high-performance routing algorithm for Networks-on-Chips HPRA:一种面向片上网络的主动热点预防高性能路由算法
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378648
E. Kakoulli, V. Soteriou, T. Theocharides
The inherent spatio-temporal unevenness of traffic flows in Networks-on-Chips (NoCs) can cause unforeseen, and in cases, severe forms of congestion, known as hotspots. Hotspots reduce the NoC's effective throughput, where in the worst case scenario, the entire network can be brought to an unrecoverable halt as a hotspot(s) spreads across the topology. To alleviate this problematic phenomenon several adaptive routing algorithms employ online load-balancing functions, aiming to reduce the possibility of hotspots arising. Most, however, work passively, merely distributing traffic as evenly as possible among alternative network paths, and they cannot guarantee the absence of network congestion as their reactive capability in reducing hotspot formation(s) is limited. In this paper we present a new pro-active Hotspot-Preventive Routing Algorithm (HPRA) which uses the advance knowledge gained from network-embedded Artificial Neural Network-based (ANN) hotspot predictors to guide packet routing across the network in an effort to mitigate any unforeseen near-future occurrences of hotspots. These ANNs are trained offline and during multicore operation they gather online buffer utilization data to predict about-to-be-formed hotspots, promptly informing the HPRA routing algorithm to take appropriate action in preventing hotspot formation(s). Evaluation results across two synthetic traffic patterns, and traffic benchmarks gathered from a chip multiprocessor architecture, show that HPRA can reduce network latency and improve network throughput up to 81% when compared against several existing state-of-the-art congestion-aware routing functions. Hardware synthesis results demonstrate the efficacy of the HPRA mechanism.
片上网络(noc)中固有的时空不均匀的交通流可能导致不可预见的,在某些情况下,严重形式的拥塞,称为热点。热点降低了NoC的有效吞吐量,在最坏的情况下,当热点在拓扑中传播时,整个网络可能会陷入不可恢复的停顿。为了缓解这一问题,一些自适应路由算法采用在线负载均衡功能,旨在减少热点产生的可能性。然而,它们大多是被动工作的,仅仅是将流量尽可能均匀地分配到可选的网络路径上,并且由于它们减少热点形成的被动能力有限,无法保证不发生网络拥塞。在本文中,我们提出了一种新的主动热点预防路由算法(HPRA),该算法利用从网络嵌入式人工神经网络(ANN)热点预测器中获得的先进知识来指导网络中的分组路由,以减轻任何不可预见的热点事件。这些人工神经网络是离线训练的,在多核运行期间,它们收集在线缓冲区利用率数据来预测即将形成的热点,并及时通知HPRA路由算法采取适当的措施来防止热点的形成。跨两种综合流量模式的评估结果,以及从芯片多处理器架构收集的流量基准测试表明,与几种现有的最先进的拥塞感知路由功能相比,HPRA可以减少网络延迟,提高网络吞吐量高达81%。硬件合成结果证明了HPRA机制的有效性。
{"title":"HPRA: A pro-active Hotspot-Preventive high-performance routing algorithm for Networks-on-Chips","authors":"E. Kakoulli, V. Soteriou, T. Theocharides","doi":"10.1109/ICCD.2012.6378648","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378648","url":null,"abstract":"The inherent spatio-temporal unevenness of traffic flows in Networks-on-Chips (NoCs) can cause unforeseen, and in cases, severe forms of congestion, known as hotspots. Hotspots reduce the NoC's effective throughput, where in the worst case scenario, the entire network can be brought to an unrecoverable halt as a hotspot(s) spreads across the topology. To alleviate this problematic phenomenon several adaptive routing algorithms employ online load-balancing functions, aiming to reduce the possibility of hotspots arising. Most, however, work passively, merely distributing traffic as evenly as possible among alternative network paths, and they cannot guarantee the absence of network congestion as their reactive capability in reducing hotspot formation(s) is limited. In this paper we present a new pro-active Hotspot-Preventive Routing Algorithm (HPRA) which uses the advance knowledge gained from network-embedded Artificial Neural Network-based (ANN) hotspot predictors to guide packet routing across the network in an effort to mitigate any unforeseen near-future occurrences of hotspots. These ANNs are trained offline and during multicore operation they gather online buffer utilization data to predict about-to-be-formed hotspots, promptly informing the HPRA routing algorithm to take appropriate action in preventing hotspot formation(s). Evaluation results across two synthetic traffic patterns, and traffic benchmarks gathered from a chip multiprocessor architecture, show that HPRA can reduce network latency and improve network throughput up to 81% when compared against several existing state-of-the-art congestion-aware routing functions. Hardware synthesis results demonstrate the efficacy of the HPRA mechanism.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127224435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Energy modelling of embedded multimedia streaming applications with GStreamer on heterogeneous MPSoC 基于GStreamer的异构MPSoC嵌入式多媒体流应用的能量建模
Pub Date : 2012-09-30 DOI: 10.1109/ICCD.2012.6378686
Mickael Lanoe, E. Senn
Embedded systems have to support more and more demanding multimedia applications. Heterogeneous multi-core architectures are now commonplace in mobile electronic devices. The impact on the power and energy consumption is tremendous; it has to be evaluated as soon as possible in the design process. Multimedia development frameworks are used to abstract the complexity of the hardware to the designer. In this paper, we propose a methodology to develop high-level performance and consumption models for multimedia streaming applications based on the GStreamer framework. Our approach is based on measurements of the power consumptions and execution times during the processing of combined video and audio streams. Performance and consumption models are build for various plugins instantiated in the corresponding GStreamer pipelines. The combination of estimations for all those plugins leads to a precise evaluation of the complete plugin performances. The precision of the models is evaluated against measurements for different real-life streaming applications.
嵌入式系统必须支持越来越多的多媒体应用。异构多核架构现在在移动电子设备中很常见。这对电力和能源消耗的影响是巨大的;在设计过程中必须尽快对其进行评估。多媒体开发框架用于将硬件的复杂性抽象给设计者。在本文中,我们提出了一种基于GStreamer框架为多媒体流应用程序开发高级性能和消费模型的方法。我们的方法是基于对合并视频和音频流处理过程中的功耗和执行时间的测量。性能和消费模型是为在相应的GStreamer管道中实例化的各种插件构建的。对所有这些插件的综合评估导致了对插件完整性能的精确评估。模型的精度是根据不同的现实生活流媒体应用的测量来评估的。
{"title":"Energy modelling of embedded multimedia streaming applications with GStreamer on heterogeneous MPSoC","authors":"Mickael Lanoe, E. Senn","doi":"10.1109/ICCD.2012.6378686","DOIUrl":"https://doi.org/10.1109/ICCD.2012.6378686","url":null,"abstract":"Embedded systems have to support more and more demanding multimedia applications. Heterogeneous multi-core architectures are now commonplace in mobile electronic devices. The impact on the power and energy consumption is tremendous; it has to be evaluated as soon as possible in the design process. Multimedia development frameworks are used to abstract the complexity of the hardware to the designer. In this paper, we propose a methodology to develop high-level performance and consumption models for multimedia streaming applications based on the GStreamer framework. Our approach is based on measurements of the power consumptions and execution times during the processing of combined video and audio streams. Performance and consumption models are build for various plugins instantiated in the corresponding GStreamer pipelines. The combination of estimations for all those plugins leads to a precise evaluation of the complete plugin performances. The precision of the models is evaluated against measurements for different real-life streaming applications.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130408369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2012 IEEE 30th International Conference on Computer Design (ICCD)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1