首页 > 最新文献

2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)最新文献

英文 中文
Obstacle-aware longest path using rectangular pattern detouring in routing grids 在路由网格中使用矩形模式绕行的障碍物感知最长路径
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419879
Jin-Tai Yan, Ming-Ching Jhong, Zhi-Wei Chen
As the clock frequency increases, signal propagation delays on PCBs are requested to meet the timing specifications with very high accuracy. Generally speaking, the length controllability of a net decides the routing delay of the net. If a routing result has the higher length controllability, the routing delay will be obtained with higher accuracy. In this paper, given a start terminal, S, and a target terminal, T, in mxn routing grids with obstacles, based on the rectangular partition in routing grids and the analysis of unreachable grids in rectangular pattern detouring, an efficient O(mnlog(mn)) algorithm is proposed to generate the longest path in routing grids from S to T. Compared with the US routing[5], our proposed routing approach can achieve longer paths for tested examples in less CPU time.
随着时钟频率的增加,要求pcb上的信号传播延迟以满足高精度的定时规范。一般来说,网络的长度可控性决定了网络的路由延迟。如果路由结果具有较高的长度可控性,则获得的路由延迟将具有较高的精度。本文在mxn个有障碍物的路由网格中,给定起始终端S和目标终端T,基于路由网格中的矩形划分和矩形模式绕路中不可达网格的分析,提出了一种高效的O(mnlog(mn))算法来生成路由网格中从S到T的最长路径。与美国路由[5]相比,我们提出的路由方法可以在更少的CPU时间内获得更长的测试样例路径。
{"title":"Obstacle-aware longest path using rectangular pattern detouring in routing grids","authors":"Jin-Tai Yan, Ming-Ching Jhong, Zhi-Wei Chen","doi":"10.1109/ASPDAC.2010.5419879","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419879","url":null,"abstract":"As the clock frequency increases, signal propagation delays on PCBs are requested to meet the timing specifications with very high accuracy. Generally speaking, the length controllability of a net decides the routing delay of the net. If a routing result has the higher length controllability, the routing delay will be obtained with higher accuracy. In this paper, given a start terminal, S, and a target terminal, T, in mxn routing grids with obstacles, based on the rectangular partition in routing grids and the analysis of unreachable grids in rectangular pattern detouring, an efficient O(mnlog(mn)) algorithm is proposed to generate the longest path in routing grids from S to T. Compared with the US routing[5], our proposed routing approach can achieve longer paths for tested examples in less CPU time.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133253673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
A robust pixel-based RET optimization algorithm independent of initial conditions 一种独立于初始条件的稳健的基于像素的RET优化算法
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419808
Jinyu Zhang, Wei Xiong, Yan Wang, Zhiping Yu, M. Tsai
A robust pixel-based optimization algorithm is proposed for mask synthesis of inverse lithography technology (ILT) to improve the resolution and pattern fidelity in optical lithography. Result shows that the final image fidelity is almost independent of the initial condition. To demonstrate the robustness of the algorithm, six typical desired mask patterns and two mask technologies are applied in mask synthesis optimization using 100 randomly generated initial conditions. The critical dimension (CD) is 60nm and the partial-coherence image system is applied. It is found that the final edge placement error (EPE) and iteration number are quite weakly dependent on the initial conditions. Good final image fidelity can be acquired using arbitrary initial conditions. This algorithm is about several orders of magnitude faster and more effective than other gradient-based algorithm and simulated annealing algorithm.
为了提高反光刻技术的分辨率和图案保真度,提出了一种基于像素的鲁棒优化算法用于掩模合成。结果表明,最终图像保真度几乎与初始条件无关。为了证明该算法的鲁棒性,在100个随机生成的初始条件下,采用6种典型的期望掩码模式和两种掩码技术进行掩码合成优化。临界尺寸(CD)为60nm,采用部分相干成像系统。结果表明,最终边缘放置误差(EPE)和迭代次数对初始条件的依赖性较弱。使用任意初始条件都可以获得良好的最终图像保真度。与其他基于梯度的算法和模拟退火算法相比,该算法的速度和效率提高了几个数量级。
{"title":"A robust pixel-based RET optimization algorithm independent of initial conditions","authors":"Jinyu Zhang, Wei Xiong, Yan Wang, Zhiping Yu, M. Tsai","doi":"10.1109/ASPDAC.2010.5419808","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419808","url":null,"abstract":"A robust pixel-based optimization algorithm is proposed for mask synthesis of inverse lithography technology (ILT) to improve the resolution and pattern fidelity in optical lithography. Result shows that the final image fidelity is almost independent of the initial condition. To demonstrate the robustness of the algorithm, six typical desired mask patterns and two mask technologies are applied in mask synthesis optimization using 100 randomly generated initial conditions. The critical dimension (CD) is 60nm and the partial-coherence image system is applied. It is found that the final edge placement error (EPE) and iteration number are quite weakly dependent on the initial conditions. Good final image fidelity can be acquired using arbitrary initial conditions. This algorithm is about several orders of magnitude faster and more effective than other gradient-based algorithm and simulated annealing algorithm.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133313585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Incremental high-level synthesis 增量高级综合
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419798
L. Lavagno, A. Kondratyev, Yosinori Watanabe, Qiang Zhu, Mototsugu Fujii, M. Tatesawa, Noriyasu Nakayama
The widespread acceptance of High-level synthesis as a mainstream tool mostly depends on its tight integration with the following RTL-to-GDSII design flow. A key aspect is the handling of so-called Engineering Change Orders (ECOs), i.e. minor changes required to fix small functional bugs or meet performance requirements late in the design cycle. Traditional high-level synthesis has attempted to optimize at best the output logic. However, in the ECO scenario the goal is to implement the required change with as few modifications as possible to the RTL, logic netlist, placed netlist and layout. In this paper we show how, by judiciously changing the internal databases used by the tool to match as much as possible the original design, one can achieve minimal impact and implement ECOs in truly incremental mode, while full-blow re-synthesis would lead to massive unnecessary downstream changes. The tool essentially matches source constructs between the original and the ECO design, and copies as many synthesis decisions as possible from the original design to the ECO design.
高级综合作为主流工具的广泛接受主要取决于它与以下RTL-to-GDSII设计流程的紧密集成。一个关键方面是处理所谓的工程变更令(eco),即在设计周期后期修复小的功能缺陷或满足性能要求所需的微小变更。传统的高级合成试图在最好的情况下优化输出逻辑。然而,在ECO场景中,目标是在对RTL、逻辑网络列表、放置网络列表和布局进行尽可能少的修改的情况下实现所需的更改。在本文中,我们展示了如何通过明智地更改工具使用的内部数据库以尽可能多地匹配原始设计,可以实现最小的影响并以真正的增量模式实现eco,而全面的重新合成将导致大量不必要的下游更改。该工具基本上匹配原始设计和ECO设计之间的源结构,并将尽可能多的综合决策从原始设计复制到ECO设计。
{"title":"Incremental high-level synthesis","authors":"L. Lavagno, A. Kondratyev, Yosinori Watanabe, Qiang Zhu, Mototsugu Fujii, M. Tatesawa, Noriyasu Nakayama","doi":"10.1109/ASPDAC.2010.5419798","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419798","url":null,"abstract":"The widespread acceptance of High-level synthesis as a mainstream tool mostly depends on its tight integration with the following RTL-to-GDSII design flow. A key aspect is the handling of so-called Engineering Change Orders (ECOs), i.e. minor changes required to fix small functional bugs or meet performance requirements late in the design cycle. Traditional high-level synthesis has attempted to optimize at best the output logic. However, in the ECO scenario the goal is to implement the required change with as few modifications as possible to the RTL, logic netlist, placed netlist and layout. In this paper we show how, by judiciously changing the internal databases used by the tool to match as much as possible the original design, one can achieve minimal impact and implement ECOs in truly incremental mode, while full-blow re-synthesis would lead to massive unnecessary downstream changes. The tool essentially matches source constructs between the original and the ECO design, and copies as many synthesis decisions as possible from the original design to the ECO design.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133490390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Possibility of ESL: A software centric system design for multicore SoC in the upstream phase ESL的可能性:上游阶段多核SoC以软件为中心的系统设计
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419779
K. Yamashita
The embedded systems for which both hardware and software are rapidly advancing and expanding, there is a growing need to be able to comprehensively and quantitatively estimate system performance at an early stage in the design process, especially multi-core based SoC. But it can be difficult to estimate system performance of actual target by employing only simple estimation methods.
嵌入式系统的硬件和软件都在快速发展和扩展,在设计过程的早期阶段,特别是基于多核的SoC,越来越需要能够全面和定量地评估系统性能。但仅采用简单的估计方法很难对实际目标的系统性能进行估计。
{"title":"Possibility of ESL: A software centric system design for multicore SoC in the upstream phase","authors":"K. Yamashita","doi":"10.1109/ASPDAC.2010.5419779","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419779","url":null,"abstract":"The embedded systems for which both hardware and software are rapidly advancing and expanding, there is a growing need to be able to comprehensively and quantitatively estimate system performance at an early stage in the design process, especially multi-core based SoC. But it can be difficult to estimate system performance of actual target by employing only simple estimation methods.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"611 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133633407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
CAFE router: A fast connectivity aware multiple nets routing algorithm for routing grid with obstacles CAFE路由器:一种快速连接感知多网路由算法,用于有障碍物的路由网格
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419882
Y. Kohira, A. Takahashi
In this paper, we propose CAFE router which obtains routes of multiple nets with target wire lengths for single layer routing grid with obstacles. CAFE router extends the route of each net from a pin to the other pin greedily so that the wire length of the net approaches its target wire length. Experiments show that CAFE router obtains the routes of nets with small length error in short time.
本文提出了一种CAFE路由器,它可以在有障碍物的单层路由网格中获取具有目标线长的多个网的路由。CAFE路由器贪婪地将每个网的路由从一个引脚延伸到另一个引脚,使网的线长接近其目标线长。实验表明,该算法能在短时间内得到长度误差小的网络路由。
{"title":"CAFE router: A fast connectivity aware multiple nets routing algorithm for routing grid with obstacles","authors":"Y. Kohira, A. Takahashi","doi":"10.1109/ASPDAC.2010.5419882","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419882","url":null,"abstract":"In this paper, we propose CAFE router which obtains routes of multiple nets with target wire lengths for single layer routing grid with obstacles. CAFE router extends the route of each net from a pin to the other pin greedily so that the wire length of the net approaches its target wire length. Experiments show that CAFE router obtains the routes of nets with small length error in short time.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134464778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
A 60GHz direct-conversion transmitter in 65nm CMOS technology 采用65nm CMOS技术的60GHz直接转换发射机
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419862
Naoki Takayama, Kota Matsushita, Shogo Ito, Ning Li, K. Okada, A. Matsuzawa
This paper presents a 60 GHz direct-conversion transmitter in 65 nm CMOS technology. The power amplifier consists of 4-stage transistors. The circuit model of de-coupling capacitor is built as a transmission line to consider the physical length. In the measurement results, the conversion gain is above 9.6dB at 58–65GHz band, and the 1 dB compression point is 1.6 dBm with 60 GHz LO frequency and 1 dB LO power.
本文提出了一种采用65纳米CMOS技术的60 GHz直接转换发射机。功率放大器由4级晶体管组成。考虑物理长度,将解耦电容作为传输线建立电路模型。测量结果显示,在58 ~ 65ghz频段,转换增益大于9.6dB,在60 GHz LO频率和1 dB LO功率下,1db压缩点为1.6 dBm。
{"title":"A 60GHz direct-conversion transmitter in 65nm CMOS technology","authors":"Naoki Takayama, Kota Matsushita, Shogo Ito, Ning Li, K. Okada, A. Matsuzawa","doi":"10.1109/ASPDAC.2010.5419862","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419862","url":null,"abstract":"This paper presents a 60 GHz direct-conversion transmitter in 65 nm CMOS technology. The power amplifier consists of 4-stage transistors. The circuit model of de-coupling capacitor is built as a transmission line to consider the physical length. In the measurement results, the conversion gain is above 9.6dB at 58–65GHz band, and the 1 dB compression point is 1.6 dBm with 60 GHz LO frequency and 1 dB LO power.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"368 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134421390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A low latency wormhole router for asynchronous on-chip networks 用于异步片上网络的低延迟虫洞路由器
Pub Date : 2010-01-18 DOI: 10.5555/1899721.1899827
Wei Song, D. Edwards
Asynchronous on-chip networks are power efficient and tolerant to process variation but they are slower than synchronous on-chip networks. A low latency asynchronous wormhole router is proposed using sliced sub-channels and the lookahead pipeline. Channel slicing removes the C-element tree in the completion detection circuit and converts a channel into multiple independent sub-channels reducing the cycle period. The lookahead pipeline uses the early evaluation protocol to reduce cycle period. Using the lookahead pipeline on the pipeline stages with the maximal cycle period improves the overall throughput. The router is a pure standard cell design implemented by a 0.13 µm technology. The cycle period of the router at the typical corner is 1.7 ns, providing 2.35GByte/sec throughput per port.
异步片上网络具有功耗效率和对进程变化的容忍度,但它们比同步片上网络慢。提出了一种低延迟异步虫洞路由器,该路由器采用切片子通道和前瞻管道。通道切片去除了补全检测电路中的c元素树,将一个通道转换成多个独立的子通道,减少了周期。前瞻性管道使用早期评估协议来缩短周期。在具有最大周期的管道阶段上使用前瞻性管道可以提高总体吞吐量。该路由器是一个纯标准单元设计,采用0.13µm技术实现。典型拐角处的路由器周期为1.7 ns,每个端口的吞吐量为2.35GByte/sec。
{"title":"A low latency wormhole router for asynchronous on-chip networks","authors":"Wei Song, D. Edwards","doi":"10.5555/1899721.1899827","DOIUrl":"https://doi.org/10.5555/1899721.1899827","url":null,"abstract":"Asynchronous on-chip networks are power efficient and tolerant to process variation but they are slower than synchronous on-chip networks. A low latency asynchronous wormhole router is proposed using sliced sub-channels and the lookahead pipeline. Channel slicing removes the C-element tree in the completion detection circuit and converts a channel into multiple independent sub-channels reducing the cycle period. The lookahead pipeline uses the early evaluation protocol to reduce cycle period. Using the lookahead pipeline on the pipeline stages with the maximal cycle period improves the overall throughput. The router is a pure standard cell design implemented by a 0.13 µm technology. The cycle period of the router at the typical corner is 1.7 ns, providing 2.35GByte/sec throughput per port.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132150815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Novel dual-Vth independent-gate FinFET circuits 新型双v独立栅极FinFET电路
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419680
M. Rostami, K. Mohanram
This paper describes gate work function and oxide thickness tuning to realize novel circuits using dual-Vth independent-gate FinFETs. Dual-Vth FinFETs with independent gates enable series and parallel merge transformations in logic gates, realizing compact low power alternatives. Furthermore, they also enable the design of a new class of compact logic gates with higher expressive power and flexibility than conventional forms, e.g., implementing 12 unique Boolean functions using only four transistors. The gates are designed and calibrated using the University of Florida double-gate model into a technology library. Synthesis results for 14 benchmark circuits from the ISCAS and OpenSPARC suites indicate that on average, the enhanced library reduces delay, power, and area by 9%, 21%, and 27%, respectively, over a conventional library designed using FinFETs in 32nm technology.
本文介绍了利用双v独立栅极finfet实现新型电路的门功函数和氧化物厚度调谐。具有独立门的双vth finfet可在逻辑门中实现串联和并联合并转换,实现紧凑的低功耗替代方案。此外,它们还能够设计出比传统形式具有更高表达能力和灵活性的新型紧凑型逻辑门,例如,仅使用四个晶体管即可实现12个独特的布尔函数。门的设计和校准使用佛罗里达大学的双门模型进入一个技术库。来自ISCAS和OpenSPARC套件的14个基准电路的合成结果表明,平均而言,与使用32纳米finfet设计的传统库相比,增强库分别降低了9%,21%和27%的延迟,功耗和面积。
{"title":"Novel dual-Vth independent-gate FinFET circuits","authors":"M. Rostami, K. Mohanram","doi":"10.1109/ASPDAC.2010.5419680","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419680","url":null,"abstract":"This paper describes gate work function and oxide thickness tuning to realize novel circuits using dual-Vth independent-gate FinFETs. Dual-Vth FinFETs with independent gates enable series and parallel merge transformations in logic gates, realizing compact low power alternatives. Furthermore, they also enable the design of a new class of compact logic gates with higher expressive power and flexibility than conventional forms, e.g., implementing 12 unique Boolean functions using only four transistors. The gates are designed and calibrated using the University of Florida double-gate model into a technology library. Synthesis results for 14 benchmark circuits from the ISCAS and OpenSPARC suites indicate that on average, the enhanced library reduces delay, power, and area by 9%, 21%, and 27%, respectively, over a conventional library designed using FinFETs in 32nm technology.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133655576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Current source modeling in the presence of body bias 存在体偏的电流源建模
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419896
Saket Gupta, S. Sapatnekar
With the increasing use of adaptive body biases in high-performance designs, it has become necessary to build timing models that can include these effects. State-of-the-art timing tools use current source models (CSMs), which have proven to be fast and accurate. However, a straightforward extension of CSMs to incorporate multiple body biases results in unreasonably large characterization tables for each cell. We propose a new approach to compactly capture body bias effects within a mainstream CSM framework. Our approach features a table reduction method for compact storage, and a fast and novel waveform sensitivity method for timing evaluation. On a 45nm technology, we demonstrate high accuracy, with worst-case errors of under 5% in both slew and delay as compared to HSPICE. We show a speedup of over five orders of magnitude over HSPICE and almost 70x over conventional CSMs.
随着在高性能设计中越来越多地使用自适应身体偏差,有必要建立可以包括这些影响的时间模型。最先进的定时工具使用电流源模型(csm),这已被证明是快速和准确的。然而,直接扩展csm以纳入多个体偏差会导致每个细胞的不合理的大表征表。我们提出了一种在主流CSM框架内紧凑捕获身体偏差效应的新方法。我们的方法具有用于紧凑存储的表约简方法和用于时序评估的快速新颖波形灵敏度方法。在45nm技术上,我们证明了高精度,与HSPICE相比,最坏情况下的转换和延迟误差均低于5%。我们的速度比HSPICE快5个数量级,比传统csm快近70倍。
{"title":"Current source modeling in the presence of body bias","authors":"Saket Gupta, S. Sapatnekar","doi":"10.1109/ASPDAC.2010.5419896","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419896","url":null,"abstract":"With the increasing use of adaptive body biases in high-performance designs, it has become necessary to build timing models that can include these effects. State-of-the-art timing tools use current source models (CSMs), which have proven to be fast and accurate. However, a straightforward extension of CSMs to incorporate multiple body biases results in unreasonably large characterization tables for each cell. We propose a new approach to compactly capture body bias effects within a mainstream CSM framework. Our approach features a table reduction method for compact storage, and a fast and novel waveform sensitivity method for timing evaluation. On a 45nm technology, we demonstrate high accuracy, with worst-case errors of under 5% in both slew and delay as compared to HSPICE. We show a speedup of over five orders of magnitude over HSPICE and almost 70x over conventional CSMs.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132630139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Improved clock-gating control scheme for transparent pipeline 改进的透明管道时钟门控控制方案
Pub Date : 2010-01-18 DOI: 10.1109/ASPDAC.2010.5419847
J. Choi, Byung Guk Kim, A. Dasgupta, K. Roy
This paper presents a stage-level clock-gating scheme for clock power improvement. The proposed technique efficiently implements the concept of transparent pipeline which improves clocking power by dynamically making pipeline registers transparent. We developed new control scheme for transparent pipeline which can be applied to any number of pipeline stages. A low-overhead flip-flop with transparent mode is also proposed to reduce implementation overhead. The proposed clock-gating control logic is extended to pipeline collapsing which allows energy/performance trade-off through dynamic frequency scaling. Simulation results on IBM 90nm technology show that the proposed approach has less overhead (∼25%) than the previous transparent pipeline scheme and improves up to 40% of clocking power in 64-bit 7-stage pipeline over traditional stage-level clock-gating technique.
提出了一种提高时钟功率的级配方案。该技术有效地实现了透明管道的概念,通过动态地使管道寄存器透明来提高时钟功率。我们开发了一种新的透明管道控制方案,可以应用于任意数量的管道阶段。为了减少实现开销,还提出了一种具有透明模式的低开销触发器。提出的时钟门控逻辑扩展到管道崩溃,允许能量/性能权衡通过动态频率缩放。在IBM 90nm技术上的仿真结果表明,该方法比以前的透明管道方案开销更小(约25%),并且在64位7级管道中比传统的级级时钟门控技术提高了高达40%的时钟功率。
{"title":"Improved clock-gating control scheme for transparent pipeline","authors":"J. Choi, Byung Guk Kim, A. Dasgupta, K. Roy","doi":"10.1109/ASPDAC.2010.5419847","DOIUrl":"https://doi.org/10.1109/ASPDAC.2010.5419847","url":null,"abstract":"This paper presents a stage-level clock-gating scheme for clock power improvement. The proposed technique efficiently implements the concept of transparent pipeline which improves clocking power by dynamically making pipeline registers transparent. We developed new control scheme for transparent pipeline which can be applied to any number of pipeline stages. A low-overhead flip-flop with transparent mode is also proposed to reduce implementation overhead. The proposed clock-gating control logic is extended to pipeline collapsing which allows energy/performance trade-off through dynamic frequency scaling. Simulation results on IBM 90nm technology show that the proposed approach has less overhead (∼25%) than the previous transparent pipeline scheme and improves up to 40% of clocking power in 64-bit 7-stage pipeline over traditional stage-level clock-gating technique.","PeriodicalId":152569,"journal":{"name":"2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117008147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1