首页 > 最新文献

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)最新文献

英文 中文
Run-time adaptive performance compensation using on-chip sensors 使用片上传感器的运行时自适应性能补偿
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722199
M. Hashimoto
This paper discusses run-time adaptive performance control with on-chip sensors that predict timing errors. The sensors embedded into functional circuits capture delay variations due to not only die-to-die process variation but also random process variation, environmental fluctuation and aging. By compensating circuit performance according to the sensor outputs, we can overcome PVT worst-case design and reduce power dissipation while satisfying circuit performance. We applied the adaptive speed control to subthreshold circuits that are very sensitive to random variation and environmental fluctuation. Measurement results of a 65nm test chip show that the adaptive speed control can compensate PVT variations and improve energy efficiency by up to 46% compared to the worst-case design and operation with guardbanding.
本文讨论了用片上传感器预测时序误差的运行时自适应性能控制。嵌入功能电路中的传感器不仅可以捕获由于模对模工艺变化引起的延迟变化,还可以捕获随机工艺变化、环境波动和老化引起的延迟变化。根据传感器输出补偿电路性能,可以克服PVT最坏情况设计,在满足电路性能的同时降低功耗。我们将自适应速度控制应用于对随机变化和环境波动非常敏感的亚阈值电路。65nm测试芯片的测量结果表明,自适应速度控制可以补偿PVT变化,与最坏情况设计和保护带操作相比,能源效率提高了46%。
{"title":"Run-time adaptive performance compensation using on-chip sensors","authors":"M. Hashimoto","doi":"10.1109/ASPDAC.2011.5722199","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722199","url":null,"abstract":"This paper discusses run-time adaptive performance control with on-chip sensors that predict timing errors. The sensors embedded into functional circuits capture delay variations due to not only die-to-die process variation but also random process variation, environmental fluctuation and aging. By compensating circuit performance according to the sensor outputs, we can overcome PVT worst-case design and reduce power dissipation while satisfying circuit performance. We applied the adaptive speed control to subthreshold circuits that are very sensitive to random variation and environmental fluctuation. Measurement results of a 65nm test chip show that the adaptive speed control can compensate PVT variations and improve energy efficiency by up to 46% compared to the worst-case design and operation with guardbanding.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125913387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An efficient hybrid engine to perform range analysis and allocate integer bit-widths for arithmetic circuits 一种有效的混合引擎,用于运算电路的范围分析和整数位宽分配
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722233
Yu Pang, K. Radecka, Z. Zilic
Range analysis is an important task in obtaining the correct, yet fast and inexpensive arithmetic circuits. The traditional methods, either simulation-based or static, have the disadvantage of low efficiency and coarse bounds, which may lead to unnecessary bits. In this paper, we propose a new method that combines several techniques to perform fixed-point range analysis in a datapath towards obtaining the much tighter ranges efficiently. We show that the range and the bit-width allocation can be obtained with better results relative to the past methods, and in significantly shorter time.
极差分析是获得正确、快速、廉价的算术电路的重要任务。传统的方法,无论是基于仿真的还是静态的,都存在效率低、边界粗、可能导致不必要的比特的缺点。本文提出了一种结合多种技术在数据路径中进行定点距离分析的新方法,以有效地获得更紧密的距离。我们表明,相对于过去的方法,可以在更短的时间内获得更好的范围和位宽分配结果。
{"title":"An efficient hybrid engine to perform range analysis and allocate integer bit-widths for arithmetic circuits","authors":"Yu Pang, K. Radecka, Z. Zilic","doi":"10.1109/ASPDAC.2011.5722233","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722233","url":null,"abstract":"Range analysis is an important task in obtaining the correct, yet fast and inexpensive arithmetic circuits. The traditional methods, either simulation-based or static, have the disadvantage of low efficiency and coarse bounds, which may lead to unnecessary bits. In this paper, we propose a new method that combines several techniques to perform fixed-point range analysis in a datapath towards obtaining the much tighter ranges efficiently. We show that the range and the bit-width allocation can be obtained with better results relative to the past methods, and in significantly shorter time.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121597358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Variation-aware logic mapping for crossbar nano-architectures 交叉杆纳米结构的变化感知逻辑映射
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722207
M. Zamani, M. Tahoori
Programmable nano-architectures fabricated based on bottom-up self-assembly process are alternative for CMOS technology to overcome physical barriers as well as increased lithography-based fabrication costs in downscaling. Extreme process variation and high failure rate due to nondeter-ministic self assembly fabrication process pose serious challenges for logic implementation in this technology. In this paper, we analyze the effect of variations on mapped designs and propose an efficient mapping method to reduce variation effects on crossbar nano-architectures. This method takes advantage of reconfigurability and abundance of resources for tolerating variation and improving reliability. The main idea is based on duplicating crossbar input lines as well as swapping rows (columns) of a crossbar to reduce the output dependency and be able to reduce delay variation. Experimental results on a set of benchmarks show that the proposed method can reduce critical path delay up to 74% (57% in average).
基于自下而上自组装工艺制造的可编程纳米架构是CMOS技术的替代方案,可以克服物理障碍,并在缩小规模时增加基于光刻的制造成本。非确定性自组装制造过程的极端工艺变化和高故障率对该技术的逻辑实现提出了严峻的挑战。在本文中,我们分析了变化对映射设计的影响,并提出了一种有效的映射方法来减少变化对交叉杆纳米结构的影响。该方法利用了系统的可重构性和系统资源的丰富性,使系统能够承受变化,提高系统的可靠性。其主要思想是基于复制交叉栏输入行以及交换交叉栏的行(列)来减少输出依赖并能够减少延迟变化。在一组基准测试上的实验结果表明,该方法可将关键路径延迟降低74%(平均57%)。
{"title":"Variation-aware logic mapping for crossbar nano-architectures","authors":"M. Zamani, M. Tahoori","doi":"10.1109/ASPDAC.2011.5722207","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722207","url":null,"abstract":"Programmable nano-architectures fabricated based on bottom-up self-assembly process are alternative for CMOS technology to overcome physical barriers as well as increased lithography-based fabrication costs in downscaling. Extreme process variation and high failure rate due to nondeter-ministic self assembly fabrication process pose serious challenges for logic implementation in this technology. In this paper, we analyze the effect of variations on mapped designs and propose an efficient mapping method to reduce variation effects on crossbar nano-architectures. This method takes advantage of reconfigurability and abundance of resources for tolerating variation and improving reliability. The main idea is based on duplicating crossbar input lines as well as swapping rows (columns) of a crossbar to reduce the output dependency and be able to reduce delay variation. Experimental results on a set of benchmarks show that the proposed method can reduce critical path delay up to 74% (57% in average).","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"360 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121719401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Robust power gating reactivation by dynamic wakeup sequence throttling 动态唤醒序列节流的鲁棒功率门控再激活
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722263
Tung-Yeh Wu, Shih-Hsin Hu, J. Abraham
The wakeup sequence for power gating techniques has become an important issue as the rush current typically causes a high voltage drop. This paper proposes a new wakeup scheme utilizing an on-chip detector which continuously monitors the power supply noise in real time. Therefore, this scheme is able to dynamically throttle the wakeup sequence according to ambient voltage level. As a result, even the adjacent active circuit blocks induce an unexpectedly high voltage drop, the possibility of the occurrence of excessive voltage drop is reduced significantly.
功率门控技术的唤醒顺序已经成为一个重要的问题,因为涌流通常会导致高电压降。本文提出了一种利用片上检测器对电源噪声进行实时监测的唤醒方案。因此,该方案能够根据环境电压水平动态调节唤醒序列。因此,即使相邻有源电路块也会感应到异常高的压降,大大降低了电压降过高的可能性。
{"title":"Robust power gating reactivation by dynamic wakeup sequence throttling","authors":"Tung-Yeh Wu, Shih-Hsin Hu, J. Abraham","doi":"10.1109/ASPDAC.2011.5722263","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722263","url":null,"abstract":"The wakeup sequence for power gating techniques has become an important issue as the rush current typically causes a high voltage drop. This paper proposes a new wakeup scheme utilizing an on-chip detector which continuously monitors the power supply noise in real time. Therefore, this scheme is able to dynamically throttle the wakeup sequence according to ambient voltage level. As a result, even the adjacent active circuit blocks induce an unexpectedly high voltage drop, the possibility of the occurrence of excessive voltage drop is reduced significantly.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"48 7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125729474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Area-efficient FPGA logic elements: Architecture and synthesis 面积高效的FPGA逻辑元件:架构与合成
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722215
J. Anderson, Qiang Wang
We consider architecture and synthesis techniques for FPGA logic elements (function generators) and show that the LUT-based logic elements in modern commercial FPGAs are over-engineered. Circuits mapped into traditional LUT-based logic elements have speeds that can be achieved by alternative logic elements that consume considerably less silicon area. We introduce the concept of a trimming input to a logic function, which is an input to a K-variable function about which Shannon decomposition produces a cofactor having fewer than K −1 variables. We show that trimming inputs occur frequently in circuits and we propose low-cost asymmetric FPGA logic element architectures that leverage the trimming input concept, as well as some other properties of a circuit's AND-inverter graph (AIG) functional representation. We describe synthesis techniques for the proposed architectures that combine a standard cut-based FPGA technology mapping algorithm with two straightforward procedures: 1) Shannon decomposition, and 2) finding non-inverting paths in the circuit's AIG. The proposed architectures exhibit improved logic density versus traditional LUT-based architectures with minimal impact on circuit speed.
我们考虑了FPGA逻辑元件(函数生成器)的体系结构和合成技术,并表明现代商用FPGA中基于lut的逻辑元件是过度设计的。映射到传统基于lut的逻辑元件的电路的速度可以通过消耗相当少的硅面积的替代逻辑元件来实现。我们引入了一个逻辑函数的修剪输入的概念,这是一个K变量函数的输入,关于香农分解产生一个少于K−1个变量的协因子。我们表明,微调输入在电路中经常发生,我们提出了低成本的非对称FPGA逻辑元件架构,利用微调输入概念,以及电路的与逆变器图(AIG)函数表示的一些其他属性。我们描述了将基于标准切割的FPGA技术映射算法与两个简单程序相结合的拟议架构的综合技术:1)香农分解,以及2)在电路的AIG中找到非反相路径。与传统的基于lut的架构相比,所提出的架构表现出更高的逻辑密度,对电路速度的影响最小。
{"title":"Area-efficient FPGA logic elements: Architecture and synthesis","authors":"J. Anderson, Qiang Wang","doi":"10.1109/ASPDAC.2011.5722215","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722215","url":null,"abstract":"We consider architecture and synthesis techniques for FPGA logic elements (function generators) and show that the LUT-based logic elements in modern commercial FPGAs are over-engineered. Circuits mapped into traditional LUT-based logic elements have speeds that can be achieved by alternative logic elements that consume considerably less silicon area. We introduce the concept of a trimming input to a logic function, which is an input to a K-variable function about which Shannon decomposition produces a cofactor having fewer than K −1 variables. We show that trimming inputs occur frequently in circuits and we propose low-cost asymmetric FPGA logic element architectures that leverage the trimming input concept, as well as some other properties of a circuit's AND-inverter graph (AIG) functional representation. We describe synthesis techniques for the proposed architectures that combine a standard cut-based FPGA technology mapping algorithm with two straightforward procedures: 1) Shannon decomposition, and 2) finding non-inverting paths in the circuit's AIG. The proposed architectures exhibit improved logic density versus traditional LUT-based architectures with minimal impact on circuit speed.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114482485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
Exploration of VLSI CAD researches for early design rule evaluation VLSI CAD研究在早期设计规则评估中的探索
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722223
Chul-Hong Park, D. Pan, K. Lucas
Design rule has been a primary metric to link design and technology, and is likely to be considered as IC manufacturer's role for the generation due to the empirical and unsystematic in nature. Disruptive and radical changes in terms of layout style, lithography and device in the next decade require the design rule evaluation in early development stage. In this paper, we explore VLSI CAD researches for early and systematic evaluation of design rule, which will be a key technique for enhancing the competitiveness in IC market.
设计规则一直是连接设计和技术的主要指标,由于其经验性和非系统性,很可能被认为是一代集成电路制造商的角色。未来十年,排版风格、光刻、设备等方面的颠覆性、根本性的变化,都需要在发展初期进行设计规则评估。本文探讨了对设计规则进行早期系统评估的VLSI CAD研究,这将是提高集成电路市场竞争力的关键技术。
{"title":"Exploration of VLSI CAD researches for early design rule evaluation","authors":"Chul-Hong Park, D. Pan, K. Lucas","doi":"10.1109/ASPDAC.2011.5722223","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722223","url":null,"abstract":"Design rule has been a primary metric to link design and technology, and is likely to be considered as IC manufacturer's role for the generation due to the empirical and unsystematic in nature. Disruptive and radical changes in terms of layout style, lithography and device in the next decade require the design rule evaluation in early development stage. In this paper, we explore VLSI CAD researches for early and systematic evaluation of design rule, which will be a key technique for enhancing the competitiveness in IC market.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126869760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
All-out fight against yield losses by design-manufacturing collaboration in nano-lithography era 纳米光刻时代设计制造协同对抗良率损失
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722220
S. Inoue, S. Kobayashi
The concept of design-manufacturing collaboration for nano-lithography era has been clarified. The novel design-manufacturing system that the manufacturing tolerance reflecting design intention properly can be allocated to the layout has been proposed. According to the system, one can assign the “weak portion” explicitly on the layout, and can control the process for reducing the burden of manufacturing and further getting higher yield. More specifically, the extraction of electrically critical portion and conversion to the manufacturing tolerance has been demonstrated. The tolerance has applied to reduce computational burden of mask data preparation. Besides, the yield model-based layout scoring system has been also suggested to be significant remarkably. One can check the layout and modify not to loose the yield. Creation of yield function, layout scoring, and layout modification based upon the yield model have been demonstrated.
阐明了纳米光刻时代设计-制造协同的概念。提出了一种能合理分配反映设计意图的制造公差到版图的新型设计-制造系统。根据该系统,可以在布置图上明确分配“薄弱环节”,并对生产过程进行控制,以减轻制造负担,进一步提高成品率。更具体地说,电临界部分的提取和转换到制造公差已被证明。该公差用于减少掩模数据准备的计算负担。此外,基于产量模型的布局评分系统也具有显著的意义。可以检查布局和修改,以免失去产量。演示了产量函数的创建、布局评分和基于产量模型的布局修改。
{"title":"All-out fight against yield losses by design-manufacturing collaboration in nano-lithography era","authors":"S. Inoue, S. Kobayashi","doi":"10.1109/ASPDAC.2011.5722220","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722220","url":null,"abstract":"The concept of design-manufacturing collaboration for nano-lithography era has been clarified. The novel design-manufacturing system that the manufacturing tolerance reflecting design intention properly can be allocated to the layout has been proposed. According to the system, one can assign the “weak portion” explicitly on the layout, and can control the process for reducing the burden of manufacturing and further getting higher yield. More specifically, the extraction of electrically critical portion and conversion to the manufacturing tolerance has been demonstrated. The tolerance has applied to reduce computational burden of mask data preparation. Besides, the yield model-based layout scoring system has been also suggested to be significant remarkably. One can check the layout and modify not to loose the yield. Creation of yield function, layout scoring, and layout modification based upon the yield model have been demonstrated.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128923647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An ultra-low-voltage LC-VCO with a frequency extension circuit for future 0.5-V clock generation 超低电压LC-VCO,具有频率扩展电路,可用于未来的0.5 v时钟生成
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722159
W. Deng, K. Okada, A. Matsuzawa
This paper proposes a 0.5-V LC-VCO with a frequency extension circuit to replace ring oscillators for ultra-low-voltage sub-1ps-jitter clock generation. Significant performances, in terms of 0.6-ps jitter, 50MHz-to-6.4GHz frequency tuning range with 2 bands and sub-1mW PDC, indicates the successful replacement of ring VCO for the future 0.5-V LSIs and power aware LSIs.
本文提出了一种带频率扩展电路的0.5 v LC-VCO,以取代环形振荡器,用于超低电压亚1ps抖动时钟的产生。在0.6 ps抖动、50mhz -6.4 ghz频率调谐范围和2个频段以及低于1mw的PDC方面的显著性能表明,环形压控振荡器成功替代了未来的0.5 v lsi和功率感知lsi。
{"title":"An ultra-low-voltage LC-VCO with a frequency extension circuit for future 0.5-V clock generation","authors":"W. Deng, K. Okada, A. Matsuzawa","doi":"10.1109/ASPDAC.2011.5722159","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722159","url":null,"abstract":"This paper proposes a 0.5-V LC-VCO with a frequency extension circuit to replace ring oscillators for ultra-low-voltage sub-1ps-jitter clock generation. Significant performances, in terms of 0.6-ps jitter, 50MHz-to-6.4GHz frequency tuning range with 2 bands and sub-1mW PDC, indicates the successful replacement of ring VCO for the future 0.5-V LSIs and power aware LSIs.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133284701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Low power discrete voltage assignment under clock skew scheduling 时钟倾斜调度下的低功耗离散电压分配
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722244
Li Li, Jian Sun, Yinghai Lu, H. Zhou, Xuan Zeng
Multiple Supply Voltage (MSV) assignment has emerged as an appealing technique in low power IC design, due to its flexibility in balancing power and performance. However, clock skew scheduling, which has great impact on criticality of combinational paths in sequential circuit, has not been explored in the merit of MSV assignment. In this paper, we propose a discrete voltage assignment algorithm for sequential circuit under clock scheduling. The sequential MSV assignment problem is first formulated as a convex cost dual network flow problem, which can be optimally solved in polynomial time assuming delay of each gate can be chosen in continuous domain. Then a mincut-based heuristic is designed to convert the unfeasible continuous solution into feasible discrete solution while largely preserving the global optimality. Besides, we revisit the hardness of the general discrete voltage assignment problem and point out some misunderstandings on the approximability of this problem in previous related work. Benchmark test for our algorithm shows 9.2% reduction in power consumption on average, in compared with combinational MSV assignment. Referring to the continuous solution obtained from network flow as the lower bound, the gap between our solution and the lower bound is only 1.77%.
多电源电压(MSV)分配由于其在平衡功率和性能方面的灵活性,已成为低功耗集成电路设计中一种有吸引力的技术。然而,对于时序电路中对组合路径的临界性影响很大的时钟偏差调度问题,在MSV分配方面还没有得到深入的研究。本文提出了一种时序电路在时钟调度下的离散电压分配算法。首先将序列MSV分配问题表述为一个凸代价双网络流问题,假设在连续域内可以选择各门的延迟,该问题可以在多项式时间内得到最优解。然后设计了一种基于最小分割的启发式算法,将不可行的连续解转化为可行的离散解,同时在很大程度上保持了全局最优性。此外,我们重新审视了一般离散电压分配问题的难度,并指出了以往有关工作中对该问题近似性的一些误解。基准测试表明,与组合MSV分配相比,我们的算法平均降低了9.2%的功耗。以从网络流中得到的连续解作为下界,我们的解与下界的差距仅为1.77%。
{"title":"Low power discrete voltage assignment under clock skew scheduling","authors":"Li Li, Jian Sun, Yinghai Lu, H. Zhou, Xuan Zeng","doi":"10.1109/ASPDAC.2011.5722244","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722244","url":null,"abstract":"Multiple Supply Voltage (MSV) assignment has emerged as an appealing technique in low power IC design, due to its flexibility in balancing power and performance. However, clock skew scheduling, which has great impact on criticality of combinational paths in sequential circuit, has not been explored in the merit of MSV assignment. In this paper, we propose a discrete voltage assignment algorithm for sequential circuit under clock scheduling. The sequential MSV assignment problem is first formulated as a convex cost dual network flow problem, which can be optimally solved in polynomial time assuming delay of each gate can be chosen in continuous domain. Then a mincut-based heuristic is designed to convert the unfeasible continuous solution into feasible discrete solution while largely preserving the global optimality. Besides, we revisit the hardness of the general discrete voltage assignment problem and point out some misunderstandings on the approximability of this problem in previous related work. Benchmark test for our algorithm shows 9.2% reduction in power consumption on average, in compared with combinational MSV assignment. Referring to the continuous solution obtained from network flow as the lower bound, the gap between our solution and the lower bound is only 1.77%.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132221995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
NS-FTR: A fault tolerant routing scheme for networks on chip with permanent and runtime intermittent faults NS-FTR:一种容错路由方案,适用于具有永久和运行时间歇故障的片上网络
Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722231
S. Pasricha, Yong Zou
In sub-65nm CMOS technologies, interconnection networks-on-chip (NoC) will increasingly be susceptible to design time permanent faults and runtime intermittent faults, which can cause system failure. To overcome these faults, NoC routing schemes can be enhanced by adding fault tolerance capabilities, so that they can adapt communication flows to follow fault-free paths. A majority of existing fault tolerant routing algorithms are based on the turn model approach due to its simplicity and inherent freedom from deadlock. However, these turn model based algorithms are either too restrictive in the choice of paths that flits can traverse, or are tailored to work efficiently only on very specific fault distribution patterns. In this paper, we propose a novel fault tolerant routing scheme (NS-FTR) for NoC architectures that combines the North-last and South-last turn models to create a robust hybrid NoC routing scheme. The proposed scheme is shown to have a low implementation overhead and adapt to design time and runtime faults better than existing turn model, stochastic random walk, and dual virtual channel based routing schemes.
在65nm以下的CMOS技术中,互连片上网络(NoC)将越来越容易受到设计时永久故障和运行时间歇性故障的影响,这些故障可能导致系统故障。为了克服这些故障,可以通过添加容错功能来增强NoC路由方案,以便它们可以调整通信流以遵循无故障路径。现有的容错路由算法大多基于回合模型方法,因为它简单且不受死锁的影响。然而,这些基于转弯模型的算法要么在飞行路径的选择上过于严格,要么只能在非常特定的故障分布模式下有效地工作。在本文中,我们提出了一种针对NoC架构的新型容错路由方案(NS-FTR),该方案结合了北向最后和南北向最后模型来创建一个鲁棒的混合NoC路由方案。与现有的转弯模型、随机漫步和基于双虚拟通道的路由方案相比,该方案具有实现开销低、对设计时间和运行时故障适应性强的优点。
{"title":"NS-FTR: A fault tolerant routing scheme for networks on chip with permanent and runtime intermittent faults","authors":"S. Pasricha, Yong Zou","doi":"10.1109/ASPDAC.2011.5722231","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722231","url":null,"abstract":"In sub-65nm CMOS technologies, interconnection networks-on-chip (NoC) will increasingly be susceptible to design time permanent faults and runtime intermittent faults, which can cause system failure. To overcome these faults, NoC routing schemes can be enhanced by adding fault tolerance capabilities, so that they can adapt communication flows to follow fault-free paths. A majority of existing fault tolerant routing algorithms are based on the turn model approach due to its simplicity and inherent freedom from deadlock. However, these turn model based algorithms are either too restrictive in the choice of paths that flits can traverse, or are tailored to work efficiently only on very specific fault distribution patterns. In this paper, we propose a novel fault tolerant routing scheme (NS-FTR) for NoC architectures that combines the North-last and South-last turn models to create a robust hybrid NoC routing scheme. The proposed scheme is shown to have a low implementation overhead and adapt to design time and runtime faults better than existing turn model, stochastic random walk, and dual virtual channel based routing schemes.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132530117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
期刊
16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1