首页 > 最新文献

2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)最新文献

英文 中文
Sensor-wise methodology to face NBTI stress of NoC buffers 面对NoC缓冲的NBTI压力的传感器智能方法
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.216
Davide Zoni, W. Fornaciari
Networks-on-Chip (NoCs) are a key component for the new many-core architectures, from the performance and reliability stand-points. Unfortunately, continuous scaling of CMOS technology poses severe concerns regarding failure mechanisms such as NBTI and stress-migration. Process variation makes harder the scenario, decreasing device lifetime and performance predictability during chip fabrication. This paper presents a novel cooperative sensor-wise methodology to reduce the NBTI degradation in the network on-chip (NoC) virtual channel (VC) buffers, considering process variation effects as well. The changes introduced to the reference NoC model exhibit an area overhead below 4%. Experimental validation is obtained using a cycle accurate simulator considering both real and synthetic traffic patterns. We compare our methodology to the best sensor-less round-robin approach used as reference model. The proposed sensor-wise strategy achieves up to 26.6% and 18.9% activity factor improvement over the reference policy on synthetic and real traffic patterns respectively. Moreover a net NBTI Vth saving up to 54.2% is shown against the baseline NoC that does not account for NBTI.
从性能和可靠性的角度来看,片上网络(noc)是新的多核架构的关键组件。不幸的是,CMOS技术的持续扩展引起了诸如NBTI和应力迁移等失效机制的严重担忧。工艺变化使场景变得更加困难,降低了芯片制造过程中的器件寿命和性能可预测性。本文提出了一种新的基于传感器的协作方法,在考虑过程变化效应的情况下,降低了片上网络(NoC)虚拟信道(VC)缓冲区中NBTI的退化。引入参考NoC模型的更改显示面积开销低于4%。在考虑真实交通模式和合成交通模式的循环精度模拟器上进行了实验验证。我们将我们的方法与作为参考模型的最佳无传感器循环方法进行了比较。与参考策略相比,所提出的传感器智能策略在综合交通模式和真实交通模式下的活动因子分别提高了26.6%和18.9%。此外,与不考虑NBTI的基准NoC相比,净NBTI Vth节省高达54.2%。
{"title":"Sensor-wise methodology to face NBTI stress of NoC buffers","authors":"Davide Zoni, W. Fornaciari","doi":"10.7873/DATE.2013.216","DOIUrl":"https://doi.org/10.7873/DATE.2013.216","url":null,"abstract":"Networks-on-Chip (NoCs) are a key component for the new many-core architectures, from the performance and reliability stand-points. Unfortunately, continuous scaling of CMOS technology poses severe concerns regarding failure mechanisms such as NBTI and stress-migration. Process variation makes harder the scenario, decreasing device lifetime and performance predictability during chip fabrication. This paper presents a novel cooperative sensor-wise methodology to reduce the NBTI degradation in the network on-chip (NoC) virtual channel (VC) buffers, considering process variation effects as well. The changes introduced to the reference NoC model exhibit an area overhead below 4%. Experimental validation is obtained using a cycle accurate simulator considering both real and synthetic traffic patterns. We compare our methodology to the best sensor-less round-robin approach used as reference model. The proposed sensor-wise strategy achieves up to 26.6% and 18.9% activity factor improvement over the reference policy on synthetic and real traffic patterns respectively. Moreover a net NBTI Vth saving up to 54.2% is shown against the baseline NoC that does not account for NBTI.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"2 1","pages":"1038-1043"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76765368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
D-MRAM cache: Enhancing energy efficiency with 3T-1MTJ DRAM / MRAM hybrid memory D-MRAM高速缓存:利用3T-1MTJ DRAM / MRAM混合存储器提高能源效率
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.363
H. Noguchi, K. Nomura, K. Abe, S. Fujita, Eishi Arima, Kyundong Kim, Takashi Nakada, Shinobu Miwa, Hiroshi Nakamura
This paper describes a proposal of non-volatile cache architecture utilizing novel DRAM / MRAM cell-level hybrid structured memory (D-MRAM) that enables effective power reduction for high performance mobile SoCs without area overhead. Here, the key point to reduce active power is intermittent refresh process for the DRAM-mode. D-MRAM has advantage to reduce static power consumptions compared to the conventional SRAM, because there are no static leakage paths in the D-MRAM cell and it is not needed to supply voltage to its cells when used as the MRAM-mode. Besides, with advanced perpendicular magnetic tunnel junctions (p-MTJ), which decreases the write energy and latency without shortening its retention time, D-MRAM is capable of power reduction by replacing the traditional SRAM caches. Considering the 65-nm CMOS technology, the access latencies of 1MB memory macro are 2.2 ns / 1.5 ns for read / write in DRAM mode, and 2.2 ns / 4.5 ns in MRAM mode, while those of SRAM are 1.17 ns. The SPEC CPU2006 benchmarks have revealed that the energy per instruction (EPI) of the total cache memory can be dramatically reduced by 71 % on average, and the instruction per cycle (IPC) performance of the D-MRAM cache architecture degraded only by approximately 4 % on average in spite of its latency overhead.
本文描述了一种利用新型DRAM / MRAM单元级混合结构化存储器(D-MRAM)的非易失性缓存架构的建议,该架构可以有效降低高性能移动soc的功耗,而无需面积开销。在这里,降低有功功率的关键是dram模式的间歇刷新过程。与传统的SRAM相比,D-MRAM具有降低静态功耗的优势,因为D-MRAM单元中没有静态泄漏路径,并且在作为mram模式使用时不需要为其单元提供电压。此外,D-MRAM采用先进的垂直磁隧道结(p-MTJ),在不缩短其保留时间的情况下降低了写入能量和延迟,可以取代传统的SRAM缓存,从而降低功耗。考虑65纳米CMOS技术,1MB内存宏的存取延迟在DRAM模式下为2.2 ns / 1.5 ns,在MRAM模式下为2.2 ns / 4.5 ns,而SRAM的存取延迟为1.17 ns。SPEC CPU2006基准测试显示,总缓存内存的每条指令能量(EPI)平均可以显着降低71%,并且D-MRAM缓存架构的每周期指令(IPC)性能平均仅下降约4%,尽管它的延迟开销。
{"title":"D-MRAM cache: Enhancing energy efficiency with 3T-1MTJ DRAM / MRAM hybrid memory","authors":"H. Noguchi, K. Nomura, K. Abe, S. Fujita, Eishi Arima, Kyundong Kim, Takashi Nakada, Shinobu Miwa, Hiroshi Nakamura","doi":"10.7873/DATE.2013.363","DOIUrl":"https://doi.org/10.7873/DATE.2013.363","url":null,"abstract":"This paper describes a proposal of non-volatile cache architecture utilizing novel DRAM / MRAM cell-level hybrid structured memory (D-MRAM) that enables effective power reduction for high performance mobile SoCs without area overhead. Here, the key point to reduce active power is intermittent refresh process for the DRAM-mode. D-MRAM has advantage to reduce static power consumptions compared to the conventional SRAM, because there are no static leakage paths in the D-MRAM cell and it is not needed to supply voltage to its cells when used as the MRAM-mode. Besides, with advanced perpendicular magnetic tunnel junctions (p-MTJ), which decreases the write energy and latency without shortening its retention time, D-MRAM is capable of power reduction by replacing the traditional SRAM caches. Considering the 65-nm CMOS technology, the access latencies of 1MB memory macro are 2.2 ns / 1.5 ns for read / write in DRAM mode, and 2.2 ns / 4.5 ns in MRAM mode, while those of SRAM are 1.17 ns. The SPEC CPU2006 benchmarks have revealed that the energy per instruction (EPI) of the total cache memory can be dramatically reduced by 71 % on average, and the instruction per cycle (IPC) performance of the D-MRAM cache architecture degraded only by approximately 4 % on average in spite of its latency overhead.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"15 1","pages":"1813-1818"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75555366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Reliability analysis reloaded: How will we survive? 可靠性分析重装上阵:我们将如何生存?
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.084
R. Aitken, G. Fey, Z. Kalbarczyk, F. Reichenbach, M. Reorda
In safety related applications and in products with long lifetimes reliability is a must. Moreover, facing future technology nodes of integrated circuit device level reliability may decrease, i.e., counter-measures have to be taken to ensure product level reliability. But assessing the reliability of a large system is not a trivial task. This paper revisits the state-of-the-art in reliability evaluation starting from the physical device level, to the software system level, all the way up to the product level. Relevant standards and future trends are discussed.
在与安全相关的应用和长寿命的产品中,可靠性是必须的。此外,面对未来集成电路的技术节点,器件级可靠性可能会下降,也就是说,必须采取对策来保证产品级可靠性。但是评估一个大型系统的可靠性并不是一件小事。本文回顾了可靠性评估的最新进展,从物理设备级开始,到软件系统级,一直到产品级。讨论了相关标准和未来发展趋势。
{"title":"Reliability analysis reloaded: How will we survive?","authors":"R. Aitken, G. Fey, Z. Kalbarczyk, F. Reichenbach, M. Reorda","doi":"10.7873/DATE.2013.084","DOIUrl":"https://doi.org/10.7873/DATE.2013.084","url":null,"abstract":"In safety related applications and in products with long lifetimes reliability is a must. Moreover, facing future technology nodes of integrated circuit device level reliability may decrease, i.e., counter-measures have to be taken to ensure product level reliability. But assessing the reliability of a large system is not a trivial task. This paper revisits the state-of-the-art in reliability evaluation starting from the physical device level, to the software system level, all the way up to the product level. Relevant standards and future trends are discussed.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"18 1","pages":"358-367"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82428852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Optimizing BDDs for Time-Series dataset manipulation 优化bdd用于时间序列数据集操作
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.212
S. Stergiou, J. Jain
In this work we advocate the adoption of Binary Decision Diagrams (BDDs) for storing and manipulating Time-Series datasets. We first propose a generic BDD transformation which identifies and removes 50% of all BDD edges without any loss of information. Following, we optimize the core operation for adding samples to a dataset and characterize its complexity. We identify time-range queries as one of the core operations executed on time-series datasets, and describe explicit Boolean function constructions that aid in efficiently executing them directly on BDDs. We exhibit significant space and performance gains when applying our algorithms on synthetic and real-life biosensor time-series datasets collected from field trials.
在这项工作中,我们提倡采用二进制决策图(bdd)来存储和操作时间序列数据集。我们首先提出了一种通用的BDD转换,它可以在不丢失任何信息的情况下识别和删除所有BDD边缘的50%。接下来,我们优化了向数据集添加样本的核心操作,并描述了其复杂性。我们将时间范围查询确定为在时间序列数据集上执行的核心操作之一,并描述了明确的布尔函数结构,以帮助在bdd上有效地直接执行它们。当将我们的算法应用于从现场试验中收集的合成和真实生物传感器时间序列数据集时,我们展示了显着的空间和性能增益。
{"title":"Optimizing BDDs for Time-Series dataset manipulation","authors":"S. Stergiou, J. Jain","doi":"10.7873/DATE.2013.212","DOIUrl":"https://doi.org/10.7873/DATE.2013.212","url":null,"abstract":"In this work we advocate the adoption of Binary Decision Diagrams (BDDs) for storing and manipulating Time-Series datasets. We first propose a generic BDD transformation which identifies and removes 50% of all BDD edges without any loss of information. Following, we optimize the core operation for adding samples to a dataset and characterize its complexity. We identify time-range queries as one of the core operations executed on time-series datasets, and describe explicit Boolean function constructions that aid in efficiently executing them directly on BDDs. We exhibit significant space and performance gains when applying our algorithms on synthetic and real-life biosensor time-series datasets collected from field trials.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"17 1","pages":"1018-1021"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80229194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Vertically-stacked double-gate nanowire FETs with controllable polarity: From devices to regular ASICs 具有可控极性的垂直堆叠双栅纳米线场效应管:从器件到常规asic
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.137
P. Gaillardon, L. Amarù, Shashikanth Bobba, M. D. Marchi, D. Sacchetto, Y. Leblebici, G. Micheli
Vertically stacked nanowire FETs (NWFETs) with gate-all-around structure are the natural and most advanced extension of FinFETs. At advanced technology nodes, many devices exhibit ambipolar behavior, i.e., the device shows n- and p-type characteristics simultaneously. In this paper, we show that, by engineering of the contacts and by constructing independent double-gate structures, the device polarity can be electrostatically programmed to be either n- or p-type. Such a device enables a compact realization of XOR-based logic functions at the cost of a denser interconnect. To mitigate the added area/routing overhead caused by the additional gate, an approach for designing an efficient regular layout, called Sea-of-Tiles is presented. Then, specific logic synthesis techniques, supporting the higher expressive power provided by this technology, are introduced and used to showcase the performance of the controllable-polarity NWFETs circuits in comparison with traditional CMOS circuits.
垂直堆叠纳米线场效应管(nwfet)具有栅极全方位结构,是finfet的自然和最先进的扩展。在先进的技术节点上,许多器件表现出双极性行为,即器件同时显示n型和p型特性。在本文中,我们证明,通过工程的触点和构建独立的双栅结构,器件的极性可以静电编程为n型或p型。这种器件以更密集的互连为代价,实现了基于xor的逻辑功能的紧凑实现。为了减轻由额外的栅极引起的额外面积/路由开销,提出了一种设计高效规则布局的方法,称为“瓦片海洋”。然后,介绍了支持该技术提供的更高表达能力的特定逻辑合成技术,并使用该技术与传统CMOS电路相比,展示了可控极性nwfet电路的性能。
{"title":"Vertically-stacked double-gate nanowire FETs with controllable polarity: From devices to regular ASICs","authors":"P. Gaillardon, L. Amarù, Shashikanth Bobba, M. D. Marchi, D. Sacchetto, Y. Leblebici, G. Micheli","doi":"10.7873/DATE.2013.137","DOIUrl":"https://doi.org/10.7873/DATE.2013.137","url":null,"abstract":"Vertically stacked nanowire FETs (NWFETs) with gate-all-around structure are the natural and most advanced extension of FinFETs. At advanced technology nodes, many devices exhibit ambipolar behavior, i.e., the device shows n- and p-type characteristics simultaneously. In this paper, we show that, by engineering of the contacts and by constructing independent double-gate structures, the device polarity can be electrostatically programmed to be either n- or p-type. Such a device enables a compact realization of XOR-based logic functions at the cost of a denser interconnect. To mitigate the added area/routing overhead caused by the additional gate, an approach for designing an efficient regular layout, called Sea-of-Tiles is presented. Then, specific logic synthesis techniques, supporting the higher expressive power provided by this technology, are introduced and used to showcase the performance of the controllable-polarity NWFETs circuits in comparison with traditional CMOS circuits.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"52 1","pages":"625-630"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81167321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
An integrated approach for managing the lifetime of flash-based SSDs 用于管理基于闪存的ssd的生命周期的集成方法
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.309
Sungjin Lee, Taejin Kim, Jisung Park, Jihong Kim
As the semiconductor process is scaled down, the endurance of NAND flash memory greatly deteriorates. To overcome such a poor endurance characteristic and to provide a reasonable storage lifetime, system-level endurance enhancement techniques are rapidly adopted in recent NAND flash-based storage devices like solid-state drives (SSDs). In this paper, we propose an integrated lifetime management approach for SSDs. The proposed lifetime management technique combines several lifetime-enhancement schemes, including lossless compression, deduplication, and performance throttling, in an integrated fashion so that the lifetime of SSDs can be maximally extended. By selectively disabling less effective lifetime-enhancement schemes, the proposed technique achieves both high performance and high energy efficiency while meeting the required lifetime. Our evaluation results show that the proposed technique, over the SSDs with no lifetime management schemes, improves write performance by up to 55% and reduces energy consumption by up to 43% while satisfying a 5-year lifetime warranty.
随着半导体工艺的缩小,NAND闪存的耐用性大大降低。为了克服这种较差的耐久性特性并提供合理的存储寿命,系统级耐久性增强技术在最近基于NAND闪存的存储设备(如固态驱动器(ssd))中迅速采用。在本文中,我们提出了一种集成的ssd寿命管理方法。建议的生命周期管理技术以集成的方式结合了几种生命周期增强方案,包括无损压缩、重复数据删除和性能调节,以便最大限度地延长ssd的生命周期。通过选择性地禁用不太有效的寿命增强方案,该技术在满足所需寿命的同时实现了高性能和高能效。我们的评估结果表明,在没有生命周期管理方案的ssd上,所提出的技术将写入性能提高了55%,将能耗降低了43%,同时满足5年的生命周期保证。
{"title":"An integrated approach for managing the lifetime of flash-based SSDs","authors":"Sungjin Lee, Taejin Kim, Jisung Park, Jihong Kim","doi":"10.7873/DATE.2013.309","DOIUrl":"https://doi.org/10.7873/DATE.2013.309","url":null,"abstract":"As the semiconductor process is scaled down, the endurance of NAND flash memory greatly deteriorates. To overcome such a poor endurance characteristic and to provide a reasonable storage lifetime, system-level endurance enhancement techniques are rapidly adopted in recent NAND flash-based storage devices like solid-state drives (SSDs). In this paper, we propose an integrated lifetime management approach for SSDs. The proposed lifetime management technique combines several lifetime-enhancement schemes, including lossless compression, deduplication, and performance throttling, in an integrated fashion so that the lifetime of SSDs can be maximally extended. By selectively disabling less effective lifetime-enhancement schemes, the proposed technique achieves both high performance and high energy efficiency while meeting the required lifetime. Our evaluation results show that the proposed technique, over the SSDs with no lifetime management schemes, improves write performance by up to 55% and reduces energy consumption by up to 43% while satisfying a 5-year lifetime warranty.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"33 1","pages":"1522-1525"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82730040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A parallel fast transform-based preconditioning approach for electrical-thermal co-simulation of power delivery networks 输电网电-热联合仿真中一种基于并行快速变换的预处理方法
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.341
Konstantis Daloukas, Alexia Marnari, N. Evmorfopoulos, P. Tsompanopoulou, G. Stamoulis
Efficient analysis of massive on-chip power delivery networks is among the most challenging problems facing the EDA industry today. Due to Joule heating effect and the temperature dependence of resistivity, temperature is one of the most important factors that affect IR drop and must be taken into account in power grid analysis. However, the sheer size of modern power delivery networks (comprising several thousands or millions of nodes) usually forces designers to neglect thermal effects during IR drop analysis in order to simplify and accelerate simulation. As a result, the absence of accurate estimates of Joule heating effect on IR drop analysis introduces significant uncertainty in the evaluation of circuit functionality. This work presents a new approach for fast electrical-thermal co-simulation of large-scale power grids found in contemporary nanometer-scale ICs. A state-of-the-art iterative method is combined with an efficient and extremely parallel preconditioning mechanism, which enables harnessing the computational resources of massively parallel architectures, such as graphics processing units (GPUs). Experimental results demonstrate that the proposed method achieves a speedup of 66.1X for a 3.1M-node design over a state-of-the-art direct method and a speedup of 22.2X for a 20.9M-node design over a state-of-the-art iterative method when GPUs are utilized.
大规模片上供电网络的有效分析是当今EDA行业面临的最具挑战性的问题之一。由于焦耳热效应和电阻率的温度依赖性,温度是影响红外降的重要因素之一,是电网分析中必须考虑的因素。然而,现代输电网络的庞大规模(包括数千或数百万个节点)通常迫使设计人员在红外下降分析期间忽略热效应,以简化和加速模拟。因此,在红外跌落分析中缺乏焦耳热效应的准确估计,在电路功能的评估中引入了重大的不确定性。这项工作为当代纳米级集成电路中大规模电网的快速电-热联合模拟提供了一种新的方法。最先进的迭代方法与高效且极其并行的预处理机制相结合,从而能够利用大规模并行架构(如图形处理单元(gpu))的计算资源。实验结果表明,当使用gpu时,与最先进的迭代方法相比,该方法在3.1 m节点设计上实现了66.1X的加速,在20.9 m节点设计上实现了22.2X的加速。
{"title":"A parallel fast transform-based preconditioning approach for electrical-thermal co-simulation of power delivery networks","authors":"Konstantis Daloukas, Alexia Marnari, N. Evmorfopoulos, P. Tsompanopoulou, G. Stamoulis","doi":"10.7873/DATE.2013.341","DOIUrl":"https://doi.org/10.7873/DATE.2013.341","url":null,"abstract":"Efficient analysis of massive on-chip power delivery networks is among the most challenging problems facing the EDA industry today. Due to Joule heating effect and the temperature dependence of resistivity, temperature is one of the most important factors that affect IR drop and must be taken into account in power grid analysis. However, the sheer size of modern power delivery networks (comprising several thousands or millions of nodes) usually forces designers to neglect thermal effects during IR drop analysis in order to simplify and accelerate simulation. As a result, the absence of accurate estimates of Joule heating effect on IR drop analysis introduces significant uncertainty in the evaluation of circuit functionality. This work presents a new approach for fast electrical-thermal co-simulation of large-scale power grids found in contemporary nanometer-scale ICs. A state-of-the-art iterative method is combined with an efficient and extremely parallel preconditioning mechanism, which enables harnessing the computational resources of massively parallel architectures, such as graphics processing units (GPUs). Experimental results demonstrate that the proposed method achieves a speedup of 66.1X for a 3.1M-node design over a state-of-the-art direct method and a speedup of 22.2X for a 20.9M-node design over a state-of-the-art iterative method when GPUs are utilized.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"47 1","pages":"1689-1694"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87193039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Minimization of P-circuits using boolean relations 用布尔关系最小化p电路
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.208
A. Bernasconi, V. Ciriani, G. Trucco, T. Villa
In this paper, we investigate how to use the complete flexibility of P-circuits, which realize a Boolean function by projecting it onto overlapping subsets given by a generalized Shannon decomposition. It is known how to compute the complete flexibility of P-circuits, but the algorithms proposed so far for its exploitation do not guarantee to find the best implementation, because they cast the problem as the minimization of an incompletely specified function. Instead, here we show that to explore all solutions we must set up the problem as the minimization of a Boolean relation, because there are don't care conditions that cannot be expressed by single cubes. In the experiments we report major improvements with respect to the previously published results.
本文研究了如何利用p电路的完全灵活性,将布尔函数投影到由广义香农分解给出的重叠子集上,从而实现布尔函数。如何计算p电路的完全灵活性是已知的,但迄今为止提出的算法并不能保证找到最佳实现,因为它们将问题视为不完全指定函数的最小化。相反,这里我们表明,为了探索所有的解决方案,我们必须将问题设置为布尔关系的最小化,因为存在不能由单个立方体表示的不关心条件。在实验中,我们报告了与先前发表的结果相比的重大改进。
{"title":"Minimization of P-circuits using boolean relations","authors":"A. Bernasconi, V. Ciriani, G. Trucco, T. Villa","doi":"10.7873/DATE.2013.208","DOIUrl":"https://doi.org/10.7873/DATE.2013.208","url":null,"abstract":"In this paper, we investigate how to use the complete flexibility of P-circuits, which realize a Boolean function by projecting it onto overlapping subsets given by a generalized Shannon decomposition. It is known how to compute the complete flexibility of P-circuits, but the algorithms proposed so far for its exploitation do not guarantee to find the best implementation, because they cast the problem as the minimization of an incompletely specified function. Instead, here we show that to explore all solutions we must set up the problem as the minimization of a Boolean relation, because there are don't care conditions that cannot be expressed by single cubes. In the experiments we report major improvements with respect to the previously published results.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"8 1","pages":"996-1001"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87563617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Share with care: A quantitative evaluation of sharing approaches in high-level synthesis 谨慎分享:高层次综合中分享方法的定量评价
Pub Date : 2013-03-18 DOI: 10.7873/DATE.2013.315
A. Kondratyev, L. Lavagno, M. Meyer, Yosinori Watanabe
This paper focuses on the resource sharing problem when performing high-level synthesis. It argues that the conventionally accepted synthesis flow when resource sharing is done after scheduling is sub-optimal because it cannot account for timing penalties from resource merging. The paper describes a competitive approach when resource sharing and scheduling are performed simultaneously. It provides a quantitative evaluation of both approaches and shows that performing sharing during scheduling wins over the conventional approach in terms of quality of results.
本文主要研究在进行高级综合时的资源共享问题。它认为,当资源共享在调度之后完成时,传统上接受的合成流是次优的,因为它不能考虑资源合并带来的时间损失。本文描述了一种资源共享和调度同时进行的竞争方法。它提供了两种方法的定量评估,并表明在调度期间执行共享在结果质量方面优于传统方法。
{"title":"Share with care: A quantitative evaluation of sharing approaches in high-level synthesis","authors":"A. Kondratyev, L. Lavagno, M. Meyer, Yosinori Watanabe","doi":"10.7873/DATE.2013.315","DOIUrl":"https://doi.org/10.7873/DATE.2013.315","url":null,"abstract":"This paper focuses on the resource sharing problem when performing high-level synthesis. It argues that the conventionally accepted synthesis flow when resource sharing is done after scheduling is sub-optimal because it cannot account for timing penalties from resource merging. The paper describes a competitive approach when resource sharing and scheduling are performed simultaneously. It provides a quantitative evaluation of both approaches and shows that performing sharing during scheduling wins over the conventional approach in terms of quality of results.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"33 1","pages":"1547-1552"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87726158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Breaking the energy Barrier in fault-tolerant caches for multicore systems 打破多核系统容错缓存中的能量屏障
Pub Date : 2013-03-18 DOI: 10.5555/2485288.2485466
P. Ampadu, Meilin Zhang, V. Stojanović
Balancing cache energy efficiency and reliability is a major challenge for future multicore system design. Supply voltage reduction is an effective tool to minimize cache energy consumption, usually at the expense of increased number of errors. To achieve substantial energy reduction without degrading reliability, we propose an adaptive fault-tolerant cache architecture, which provides appropriate error control for each cache line based on the number of faulty cells detected at reduced supply voltages. Our experiments show that the proposed approach can improve energy efficiency by more than 25% and energy-execution time product by over 10%, while improving reliability up to 4X using Mean-Error-To-Failure (METF) metric, compared to the next-best solution at the cost of 0.08% storage overhead.
平衡缓存的能量效率和可靠性是未来多核系统设计的主要挑战。降低电源电压是最小化缓存能耗的有效工具,通常以增加错误数量为代价。为了在不降低可靠性的情况下大幅降低能耗,我们提出了一种自适应容错缓存架构,该架构基于在降低电源电压下检测到的故障单元数量,为每条缓存线路提供适当的错误控制。我们的实验表明,所提出的方法可以将能源效率提高25%以上,将能源执行时间产品提高10%以上,同时使用平均错误到故障(METF)指标将可靠性提高4倍,与次优解决方案相比,成本为0.08%的存储开销。
{"title":"Breaking the energy Barrier in fault-tolerant caches for multicore systems","authors":"P. Ampadu, Meilin Zhang, V. Stojanović","doi":"10.5555/2485288.2485466","DOIUrl":"https://doi.org/10.5555/2485288.2485466","url":null,"abstract":"Balancing cache energy efficiency and reliability is a major challenge for future multicore system design. Supply voltage reduction is an effective tool to minimize cache energy consumption, usually at the expense of increased number of errors. To achieve substantial energy reduction without degrading reliability, we propose an adaptive fault-tolerant cache architecture, which provides appropriate error control for each cache line based on the number of faulty cells detected at reduced supply voltages. Our experiments show that the proposed approach can improve energy efficiency by more than 25% and energy-execution time product by over 10%, while improving reliability up to 4X using Mean-Error-To-Failure (METF) metric, compared to the next-best solution at the cost of 0.08% storage overhead.","PeriodicalId":6310,"journal":{"name":"2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"15 1","pages":"731-736"},"PeriodicalIF":0.0,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87042791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
2013 Design, Automation & Test in Europe Conference & Exhibition (DATE)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1