首页 > 最新文献

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)最新文献

英文 中文
Fidelity metrics for estimation models 估计模型的保真度度量
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653959
Haris Javaid, A. Ignjatović, S. Parameswaran
Estimation models play a vital role in many aspects of day to day life. Extremely complex estimation models are employed in the design space exploration of SoCs, and the efficacy of these estimation models is usually measured by the absolute error of the models compared to known actual results. Such absolute error based metrics can often result in over-designed estimation models, with a number of researchers suggesting that fidelity of an estimation model (correlation between the ordering of the estimated points and the ordering of the actual points) should be examined instead of, or in addition to, the absolute error. In this paper, for the first time, we propose four metrics to measure the fidelity of an estimation model, in particular for use in design space exploration. The first two are based on two well known rank correlation coefficients. The other two are weighted versions of the first two metrics, to give importance to points nearer the Pareto front. The proposed fidelity metrics range from −1 to 1, where a value of 1 reflects a perfect positive correlation while a value of −1 reflects a perfect negative correlation. The proposed fidelity metrics were calculated for a single processor estimation model and a multiprocessor estimation model to observe their behavior, and were compared against the models' absolute error. For the multiprocessor estimation model, even though the worst average and maximum absolute error of 6.40% and 16.61% respectively can be considered reasonable in design automation, the worst fidelity of 0.753 suggests that the multiprocessor estimation model may not be as good a model (compared to an estimation model with same or higher absolute errors but a fidelity of 0.95) as depicted by its absolute accuracy, leading to an over-designed estimation model.
评估模型在日常生活的许多方面起着至关重要的作用。在soc的设计空间探索中使用了极其复杂的估计模型,这些估计模型的有效性通常通过模型与已知实际结果的绝对误差来衡量。这种基于绝对误差的度量通常会导致过度设计的估计模型,许多研究人员建议应该检查估计模型的保真度(估计点的顺序与实际点的顺序之间的相关性),而不是绝对误差。在本文中,我们第一次提出了四个度量来测量估计模型的保真度,特别是在设计空间探索中使用。前两个是基于两个众所周知的等级相关系数。另外两个指标是前两个指标的加权版本,以给予更接近帕累托前沿的点的重要性。建议的保真度度量范围从−1到1,其中1的值反映了完全的正相关,而−1的值反映了完全的负相关。计算了单处理器估计模型和多处理器估计模型的保真度指标,观察了它们的行为,并与模型的绝对误差进行了比较。对于多处理器估计模型,即使在设计自动化中,6.40%和16.61%的最差平均和最大绝对误差可以被认为是合理的,但0.753的最差保真度表明,多处理器估计模型可能不如其绝对精度所描述的那样好(与具有相同或更高的绝对误差但保真度为0.95的估计模型相比),导致过度设计的估计模型。
{"title":"Fidelity metrics for estimation models","authors":"Haris Javaid, A. Ignjatović, S. Parameswaran","doi":"10.1109/ICCAD.2010.5653959","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5653959","url":null,"abstract":"Estimation models play a vital role in many aspects of day to day life. Extremely complex estimation models are employed in the design space exploration of SoCs, and the efficacy of these estimation models is usually measured by the absolute error of the models compared to known actual results. Such absolute error based metrics can often result in over-designed estimation models, with a number of researchers suggesting that fidelity of an estimation model (correlation between the ordering of the estimated points and the ordering of the actual points) should be examined instead of, or in addition to, the absolute error. In this paper, for the first time, we propose four metrics to measure the fidelity of an estimation model, in particular for use in design space exploration. The first two are based on two well known rank correlation coefficients. The other two are weighted versions of the first two metrics, to give importance to points nearer the Pareto front. The proposed fidelity metrics range from −1 to 1, where a value of 1 reflects a perfect positive correlation while a value of −1 reflects a perfect negative correlation. The proposed fidelity metrics were calculated for a single processor estimation model and a multiprocessor estimation model to observe their behavior, and were compared against the models' absolute error. For the multiprocessor estimation model, even though the worst average and maximum absolute error of 6.40% and 16.61% respectively can be considered reasonable in design automation, the worst fidelity of 0.753 suggests that the multiprocessor estimation model may not be as good a model (compared to an estimation model with same or higher absolute errors but a fidelity of 0.95) as depicted by its absolute accuracy, leading to an over-designed estimation model.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126983679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Analysis of circuit dynamic behavior with timed ternary decision diagram 用定时三元决策图分析电路的动态特性
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653852
Lu Wan, Deming Chen
Modern logic optimization tools tend to optimize circuits in a balanced way so that all primary outputs (POs) have similar delay close to the cycle time. However, certain POs will be exercised more frequently than the rest. Among these critical primary outputs, some may be stabilized very quickly by input vectors, even if their topological delays from primary inputs are very long. Knowing the dynamic behavior of a circuit can help optimize the most commonly activated paths and help engineers understand how resilient a PO is against dynamic environmental variations such as voltage fluctuations. In this paper, we describe a tool to analyze the dynamic behavior of a circuit utilizing probabilistic information. The techniques exploit the use of timed ternary decision diagrams (tTDD) to encode stabilization conditions for POs. To compute probabilities based on a tTDD, we propose false assignment pruning and random variable compaction to preserve probability calculation accuracy. To deal with the scalability issue, this paper proposes a new circuit partitioning heuristic to reduce the inaccuracy introduced by partitioning. Compared to the timed simulation results, our tool has a mean absolute error of 2.5% and a root mean square error of 5.3% on average for ISCAS-85 benchmarks. Compared to a state-of-the-art dynamic behavior analysis tool, our tool is on average 40x faster and can handle circuits that the previous tool cannot.
现代逻辑优化工具倾向于以平衡的方式优化电路,以便所有主输出(POs)在接近周期时间时具有相似的延迟。但是,某些权利将比其他权利更频繁地行使。在这些关键的初级输出中,有些可以通过输入向量非常迅速地稳定下来,即使它们的主要输入的拓扑延迟非常长。了解电路的动态行为可以帮助优化最常见的激活路径,并帮助工程师了解PO对动态环境变化(如电压波动)的弹性。在本文中,我们描述了一种利用概率信息分析电路动态行为的工具。该技术利用定时三元决策图(tTDD)来编码POs的稳定条件。为了计算基于tTDD的概率,我们提出了假分配修剪和随机变量压缩以保持概率计算的准确性。为了解决可扩展性问题,本文提出了一种新的电路划分启发式方法,以减少划分带来的不准确性。与定时模拟结果相比,我们的工具在ISCAS-85基准测试中的平均绝对误差为2.5%,均方根误差为5.3%。与最先进的动态行为分析工具相比,我们的工具平均速度快40倍,并且可以处理以前的工具无法处理的电路。
{"title":"Analysis of circuit dynamic behavior with timed ternary decision diagram","authors":"Lu Wan, Deming Chen","doi":"10.1109/ICCAD.2010.5653852","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5653852","url":null,"abstract":"Modern logic optimization tools tend to optimize circuits in a balanced way so that all primary outputs (POs) have similar delay close to the cycle time. However, certain POs will be exercised more frequently than the rest. Among these critical primary outputs, some may be stabilized very quickly by input vectors, even if their topological delays from primary inputs are very long. Knowing the dynamic behavior of a circuit can help optimize the most commonly activated paths and help engineers understand how resilient a PO is against dynamic environmental variations such as voltage fluctuations. In this paper, we describe a tool to analyze the dynamic behavior of a circuit utilizing probabilistic information. The techniques exploit the use of timed ternary decision diagrams (tTDD) to encode stabilization conditions for POs. To compute probabilities based on a tTDD, we propose false assignment pruning and random variable compaction to preserve probability calculation accuracy. To deal with the scalability issue, this paper proposes a new circuit partitioning heuristic to reduce the inaccuracy introduced by partitioning. Compared to the timed simulation results, our tool has a mean absolute error of 2.5% and a root mean square error of 5.3% on average for ISCAS-85 benchmarks. Compared to a state-of-the-art dynamic behavior analysis tool, our tool is on average 40x faster and can handle circuits that the previous tool cannot.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126529941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Design-hierarchy aware mixed-size placement for routability optimization 基于设计层次的可达性优化混合尺寸布局
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654234
Yi-Lin Chuang, Gi-Joon Nam, C. Alpert, Yao-Wen Chang, Jarrod A. Roy, Natarajan Viswanathan
Routability is a mandatory metric for modern large-scale mixed-size circuit placement which typically needs to handle hundreds of large macros and millions of small standard cells. However, most existing academic mixed-size placers either focus on wirelength minimization alone, or do not consider the impact of movable macros on routing. To remedy these insufficiencies, this paper formulates design-hierarchy information as a novel fence force in an analytical placement framework. Unlike a state-of-the-art routability-driven placer that simply removes net bounding boxes during placement, this paper utilizes two different optimization forces, the global fence force and the local spreading force, to determine the positions of both standard cells and macros. We utilize design-hierarchy information to determine block distributions globally, and locally we add additional spreading forces to preserve sufficient free space among blocks by a net-topology estimation. With the interactions between these two forces, our placer can well balance routability and wirelength. Experimental results show that our placer can achieve the best routability and routing time among all published works.
可达性是现代大规模混合尺寸电路布局的强制性指标,通常需要处理数百个大型宏和数百万个小型标准单元。然而,大多数现有的学术混合大小放置器要么只关注最小的无线长度,要么不考虑可移动宏对路由的影响。为了弥补这些不足,本文将设计层次信息作为分析放置框架中的一种新的围栏力。与最先进的可达性驱动的放矿器在放置过程中简单地移除网边界框不同,本文利用两种不同的优化力,即全局围栏力和局部扩展力,来确定标准单元和宏的位置。我们利用设计层次信息来确定全局块分布,并在局部通过网络拓扑估计增加额外的扩展力以保持块之间足够的自由空间。通过这两种力之间的相互作用,我们的放置器可以很好地平衡可达性和无线性。实验结果表明,该算法在所有已发表的算法中具有最佳的路由可达性和路由时间。
{"title":"Design-hierarchy aware mixed-size placement for routability optimization","authors":"Yi-Lin Chuang, Gi-Joon Nam, C. Alpert, Yao-Wen Chang, Jarrod A. Roy, Natarajan Viswanathan","doi":"10.1109/ICCAD.2010.5654234","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654234","url":null,"abstract":"Routability is a mandatory metric for modern large-scale mixed-size circuit placement which typically needs to handle hundreds of large macros and millions of small standard cells. However, most existing academic mixed-size placers either focus on wirelength minimization alone, or do not consider the impact of movable macros on routing. To remedy these insufficiencies, this paper formulates design-hierarchy information as a novel fence force in an analytical placement framework. Unlike a state-of-the-art routability-driven placer that simply removes net bounding boxes during placement, this paper utilizes two different optimization forces, the global fence force and the local spreading force, to determine the positions of both standard cells and macros. We utilize design-hierarchy information to determine block distributions globally, and locally we add additional spreading forces to preserve sufficient free space among blocks by a net-topology estimation. With the interactions between these two forces, our placer can well balance routability and wirelength. Experimental results show that our placer can achieve the best routability and routing time among all published works.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121632441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
A network-flow based pin-count aware routing algorithm for broadcast electrode-addressing EWOD chips 广播电极寻址EWOD芯片中基于网络流的针数感知路由算法
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653715
Tsung-Wei Huang, S. Yeh, Tsung-Yi Ho
Electrowetting-on-dielectric (EWOD) chips have emerged as the most widely used actuators for digital microfluidic (DMF) systems. These devices enable the electrical manipulation of microfluidics with various advantages such as low power consumption, flexibility, accuracy, and efficiency. In addressing the need for low-cost and practical fabrication, pin-count reduction has become a key problem to the large-scale integration of EWOD-chip designs. One of the major approaches, broadcast addressing, reduces the pin count by assigning a single control pin to multiple electrodes with mutually compatible control signals. Most previous studies utilize this addressing scheme by scheduling fluidic-level synthesis on pin-constrained chip arrays. However, the associated interconnect routing problem is still not provided in currently available DMF automations, and thus the broadcast-addressing scheme cannot be actually realized. In this paper, we present the first network-flow based pin-count aware routing algorithm for EWOD-chip designs with a broadcast electrode-addressing scheme. Our algorithm simultaneously takes pin-count reduction and wirelength minimization into consideration for higher integration and better design performance. Experimental results show the effectiveness and scalability of our algorithm on a set of real-life chip applications.
电介质电润湿(EWOD)芯片已成为数字微流控(DMF)系统中应用最广泛的驱动器。这些装置使微流体的电气操作具有各种优点,如低功耗,灵活性,准确性和效率。为了满足低成本和实用制造的需求,减少引脚数已成为ewod芯片大规模集成设计的关键问题。其中一种主要方法是广播寻址,它通过将单个控制引脚分配给具有相互兼容控制信号的多个电极来减少引脚数。以往的研究大多利用这种寻址方案,在引脚受限的芯片阵列上调度流体级合成。然而,目前可用的DMF自动化仍然没有提供相关的互连路由问题,因此广播寻址方案无法实际实现。在本文中,我们提出了第一种基于网络流的针数感知路由算法,用于具有广播电极寻址方案的ewod芯片设计。我们的算法同时考虑了减少引脚数和最小化无线长度,以获得更高的集成度和更好的设计性能。实验结果表明了该算法在实际芯片应用中的有效性和可扩展性。
{"title":"A network-flow based pin-count aware routing algorithm for broadcast electrode-addressing EWOD chips","authors":"Tsung-Wei Huang, S. Yeh, Tsung-Yi Ho","doi":"10.1109/ICCAD.2010.5653715","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5653715","url":null,"abstract":"Electrowetting-on-dielectric (EWOD) chips have emerged as the most widely used actuators for digital microfluidic (DMF) systems. These devices enable the electrical manipulation of microfluidics with various advantages such as low power consumption, flexibility, accuracy, and efficiency. In addressing the need for low-cost and practical fabrication, pin-count reduction has become a key problem to the large-scale integration of EWOD-chip designs. One of the major approaches, broadcast addressing, reduces the pin count by assigning a single control pin to multiple electrodes with mutually compatible control signals. Most previous studies utilize this addressing scheme by scheduling fluidic-level synthesis on pin-constrained chip arrays. However, the associated interconnect routing problem is still not provided in currently available DMF automations, and thus the broadcast-addressing scheme cannot be actually realized. In this paper, we present the first network-flow based pin-count aware routing algorithm for EWOD-chip designs with a broadcast electrode-addressing scheme. Our algorithm simultaneously takes pin-count reduction and wirelength minimization into consideration for higher integration and better design performance. Experimental results show the effectiveness and scalability of our algorithm on a set of real-life chip applications.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121685968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Variation-aware layout-driven scheduling for performance yield optimization 变化感知布局驱动调度的性能产出优化
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654344
Gregory Lucas, Deming Chen
With the move to deep submicron processes, the design-productivity gap has continued to widen for RTL-based design methodologies. High-level synthesis has been touted as a solution to the design-productivity gap by allowing designers to move up to a higher level of abstraction where they focus on the functionality of the circuit instead of the low level details. However, at the same time, the move to deep submicron processes has led to increased levels of process variation, which must be considered during synthesis so that the performance yield of the circuit meets design specifications. In this paper, we tackle the problem of performance yield optimization during the scheduling task of high-level synthesis. We formulate the problem of performance yield optimization for scheduling as an integer linear programming problem (ILP) and offer the following contributions: 1) a totally unimodular ILP formulation for performance yield maximization and 2) a variation-aware and layout-driven iterative algorithm for performance yield improvement. Experimental results show that we can obtain significant gain in performance yield compared to a state-of-the-art variation-aware high-level synthesis tool Fast Yield.
随着向深亚微米工艺的转变,基于rtl的设计方法的设计生产力差距继续扩大。高级合成被吹捧为一种解决设计效率差距的方法,它允许设计人员提升到更高的抽象层次,在那里他们专注于电路的功能而不是低级细节。然而,与此同时,向深亚微米工艺的转变导致了工艺变化水平的增加,在合成过程中必须考虑到这一点,以便电路的性能良率符合设计规范。本文研究了高阶综合调度任务中性能良率优化问题。我们将调度的性能成品率优化问题表述为整数线性规划问题(ILP),并提供了以下贡献:1)用于性能成品率最大化的完全单模线性规划公式;2)用于性能成品率改进的变化感知和布局驱动迭代算法。实验结果表明,与最先进的变化感知高级合成工具Fast yield相比,我们可以获得显着的性能收率增益。
{"title":"Variation-aware layout-driven scheduling for performance yield optimization","authors":"Gregory Lucas, Deming Chen","doi":"10.1109/ICCAD.2010.5654344","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654344","url":null,"abstract":"With the move to deep submicron processes, the design-productivity gap has continued to widen for RTL-based design methodologies. High-level synthesis has been touted as a solution to the design-productivity gap by allowing designers to move up to a higher level of abstraction where they focus on the functionality of the circuit instead of the low level details. However, at the same time, the move to deep submicron processes has led to increased levels of process variation, which must be considered during synthesis so that the performance yield of the circuit meets design specifications. In this paper, we tackle the problem of performance yield optimization during the scheduling task of high-level synthesis. We formulate the problem of performance yield optimization for scheduling as an integer linear programming problem (ILP) and offer the following contributions: 1) a totally unimodular ILP formulation for performance yield maximization and 2) a variation-aware and layout-driven iterative algorithm for performance yield improvement. Experimental results show that we can obtain significant gain in performance yield compared to a state-of-the-art variation-aware high-level synthesis tool Fast Yield.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125469351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
On the escape routing of differential pairs 微分对的逃逸路径
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654214
Tan Yan, Pei-Ci Wu, Q. Ma, Martin D. F. Wong
As an important step in PCB design, the escape routing problem has been extensively studied in literature. However, few studies have been done on the escape routing of differential pairs. In this paper, we study the differential pair escape routing problem and propose two algorithms. The first one computes the optimal routing for a single differential pair while the second one is able to simultaneously route multiple differential pairs considering both routability and wire length. We then propose a two-stage routing scheme based on the two algorithms. Experimental results show that our routing scheme efficiently and effectively solves the differential pair escape routing test cases we obtained from industry.
逃逸布线问题作为PCB设计中的一个重要环节,已经得到了广泛的研究。然而,关于微分对的逃逸路径的研究很少。本文研究了差分对逃逸路由问题,并提出了两种算法。前者计算单个差分对的最优路由,后者考虑可达性和导线长度,能够同时路由多个差分对。然后,我们提出了基于这两种算法的两阶段路由方案。实验结果表明,该方案有效地解决了已有的工业差分对逃逸路由测试用例。
{"title":"On the escape routing of differential pairs","authors":"Tan Yan, Pei-Ci Wu, Q. Ma, Martin D. F. Wong","doi":"10.1109/ICCAD.2010.5654214","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654214","url":null,"abstract":"As an important step in PCB design, the escape routing problem has been extensively studied in literature. However, few studies have been done on the escape routing of differential pairs. In this paper, we study the differential pair escape routing problem and propose two algorithms. The first one computes the optimal routing for a single differential pair while the second one is able to simultaneously route multiple differential pairs considering both routability and wire length. We then propose a two-stage routing scheme based on the two algorithms. Experimental results show that our routing scheme efficiently and effectively solves the differential pair escape routing test cases we obtained from industry.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115035207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Fast performance evaluation of fixed-point systems with un-smooth operators 非光滑算子不动点系统的快速性能评价
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654064
Karthick Parashar, D. Ménard, R. Rocher, O. Sentieys, D. Novo, F. Catthoor
Fixed-point refinement of signal processing systems is an essential step performed before implementation of any signal processing system. Existing analytical techniques to evaluate performance of fixed-point systems are not applicable to the errors due to quantization in the presence of un-smooth operators. Thus, it is inevitable to use simulation to evaluate performance of fixed-point systems in the presence un-smooth operators. This paper proposes a hybrid technique which can be used in place of pure simulation to accelerate the performance evaluation. The principle idea in the proposed hybrid approach is to selectively simulate parts of the system only when un-smooth errors occur but use analytical results otherwise. The acceleration thus obtained reduces the performance evaluation time which can be used to explore a wider word-length design space or speedup the optimization process. This method has been tried on a complex MIMO sphere decoding algorithm and the results obtained show several orders of magnitude improvement in terms of evaluation time.
信号处理系统的定点细化是任何信号处理系统实施前必不可少的一步。现有的评价定点系统性能的分析技术不适用于在非光滑算子存在下量化产生的误差。因此,在非光滑算子存在的情况下,利用仿真来评价定点系统的性能是不可避免的。本文提出了一种混合技术,可以代替单纯的仿真来加速性能评估。所提出的混合方法的主要思想是,只有当出现非光滑误差时,才选择性地模拟系统的某些部分,否则使用分析结果。由此获得的加速减少了性能评估时间,可用于探索更宽的字长设计空间或加速优化过程。在一种复杂的MIMO球面译码算法上进行了实验,结果表明该方法在评估时间上有几个数量级的提高。
{"title":"Fast performance evaluation of fixed-point systems with un-smooth operators","authors":"Karthick Parashar, D. Ménard, R. Rocher, O. Sentieys, D. Novo, F. Catthoor","doi":"10.1109/ICCAD.2010.5654064","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654064","url":null,"abstract":"Fixed-point refinement of signal processing systems is an essential step performed before implementation of any signal processing system. Existing analytical techniques to evaluate performance of fixed-point systems are not applicable to the errors due to quantization in the presence of un-smooth operators. Thus, it is inevitable to use simulation to evaluate performance of fixed-point systems in the presence un-smooth operators. This paper proposes a hybrid technique which can be used in place of pure simulation to accelerate the performance evaluation. The principle idea in the proposed hybrid approach is to selectively simulate parts of the system only when un-smooth errors occur but use analytical results otherwise. The acceleration thus obtained reduces the performance evaluation time which can be used to explore a wider word-length design space or speedup the optimization process. This method has been tried on a complex MIMO sphere decoding algorithm and the results obtained show several orders of magnitude improvement in terms of evaluation time.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114961228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
High variation-tolerant obstacle-avoiding clock mesh synthesis with symmetrical driving trees 对称驱动树的高容变避障时钟网格合成
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653754
Xin-Wei Shih, Hsu-Chieh Lee, Kuan-Hsien Ho, Yao-Wen Chang
For high-performance chip designs, a clock network with high tolerance towards process-variation is essential for chip synchronization. Clock mesh structure are widely used in these designs because of its resistant to variations. However, traditional mesh structures suffer from several drawbacks such as difficulty in timing estimation, inability to handle obstacles, and high power consumption. This paper proposes a new obstacle-avoiding clock mesh synthesis method which applies a two-stage approach of mesh construction followed by driving-tree synthesis. The method achieves very low skew through structural optimization, thus eliminating the need of direct timing estimation and/or SPICE simulation during clock network synthesis. In addition, our approach handles obstacles with the structural consideration, and reduces power consumption by removing non-critical mesh components and optimizing the driving-tree structure. Based on the benchmarks of the ISPD'10 Clock Network Synthesis Contest, the top contest performers result in 1.32X skew over our approach by using mesh structure, and more than 2.0X skew over our approach by using tree structure. Our approach runs 8326X/11421X faster than teams that used simulation, and 67X/90X times faster than teams that did not use simulation.
在高性能芯片设计中,对工艺变化具有高容忍度的时钟网络是实现芯片同步的必要条件。钟网结构因其抗变异性而被广泛应用于这些设计中。然而,传统的网格结构存在着时间估计困难、无法处理障碍物、功耗高等缺点。本文提出了一种新的避障时钟网格综合方法,该方法采用两阶段网格构建和驱动树综合的方法。该方法通过结构优化实现了极低的偏度,从而消除了时钟网络合成过程中直接时序估计和/或SPICE仿真的需要。此外,我们的方法还考虑了结构上的障碍,并通过去除非关键网格部件和优化驱动树结构来降低功耗。根据ISPD'10时钟网络合成竞赛的基准,比赛中表现最好的选手使用网格结构的方法比我们的方法倾斜1.32倍,使用树结构的方法比我们的方法倾斜2.0倍以上。我们的方法比使用模拟的团队快8326X/11421X,比不使用模拟的团队快67X/90X。
{"title":"High variation-tolerant obstacle-avoiding clock mesh synthesis with symmetrical driving trees","authors":"Xin-Wei Shih, Hsu-Chieh Lee, Kuan-Hsien Ho, Yao-Wen Chang","doi":"10.1109/ICCAD.2010.5653754","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5653754","url":null,"abstract":"For high-performance chip designs, a clock network with high tolerance towards process-variation is essential for chip synchronization. Clock mesh structure are widely used in these designs because of its resistant to variations. However, traditional mesh structures suffer from several drawbacks such as difficulty in timing estimation, inability to handle obstacles, and high power consumption. This paper proposes a new obstacle-avoiding clock mesh synthesis method which applies a two-stage approach of mesh construction followed by driving-tree synthesis. The method achieves very low skew through structural optimization, thus eliminating the need of direct timing estimation and/or SPICE simulation during clock network synthesis. In addition, our approach handles obstacles with the structural consideration, and reduces power consumption by removing non-critical mesh components and optimizing the driving-tree structure. Based on the benchmarks of the ISPD'10 Clock Network Synthesis Contest, the top contest performers result in 1.32X skew over our approach by using mesh structure, and more than 2.0X skew over our approach by using tree structure. Our approach runs 8326X/11421X faster than teams that used simulation, and 67X/90X times faster than teams that did not use simulation.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129898205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Multi-Wafer Virtual Probe: Minimum-cost variation characterization by exploring wafer-to-wafer correlation 多晶圆虚拟探针:通过探索晶圆之间的相关性来表征最小成本变化
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654349
Wangyang Zhang, Xin Li, E. Acar, Frank Liu, Rob A. Rutenbar
In this paper, we propose a new technique, referred to as Multi-Wafer Virtual Probe (MVP) to efficiently model wafer-level spatial variations for nanoscale integrated circuits. Towards this goal, a novel Bayesian inference is derived to extract a shared model template to explore the wafer-to-wafer correlation information within the same lot. In addition, a robust regression algorithm is proposed to automatically detect and remove outliers (i.e., abnormal measurement data with large error) so that they do not bias the modeling results. The proposed MVP method is extensively tested for silicon measurement data collected from 200 wafers at an advanced technology node. Our experimental results demonstrate that MVP offers superior accuracy over other traditional approaches such as VP [7] and EM [8], if a limited number of measurement data are available.
在本文中,我们提出了一种新的技术,称为多晶圆虚拟探针(MVP),以有效地模拟纳米级集成电路的晶圆级空间变化。为此,提出了一种新的贝叶斯推理方法来提取共享模型模板,以探索同一批次内晶圆间的相关信息。此外,提出了一种鲁棒回归算法,自动检测和去除异常值(即误差较大的异常测量数据),使其不影响建模结果。提出的MVP方法在一个先进的技术节点上对从200片晶圆收集的硅测量数据进行了广泛的测试。我们的实验结果表明,如果可用的测量数据数量有限,MVP比其他传统方法(如VP[7]和EM[8])具有更高的准确性。
{"title":"Multi-Wafer Virtual Probe: Minimum-cost variation characterization by exploring wafer-to-wafer correlation","authors":"Wangyang Zhang, Xin Li, E. Acar, Frank Liu, Rob A. Rutenbar","doi":"10.1109/ICCAD.2010.5654349","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654349","url":null,"abstract":"In this paper, we propose a new technique, referred to as Multi-Wafer Virtual Probe (MVP) to efficiently model wafer-level spatial variations for nanoscale integrated circuits. Towards this goal, a novel Bayesian inference is derived to extract a shared model template to explore the wafer-to-wafer correlation information within the same lot. In addition, a robust regression algorithm is proposed to automatically detect and remove outliers (i.e., abnormal measurement data with large error) so that they do not bias the modeling results. The proposed MVP method is extensively tested for silicon measurement data collected from 200 wafers at an advanced technology node. Our experimental results demonstrate that MVP offers superior accuracy over other traditional approaches such as VP [7] and EM [8], if a limited number of measurement data are available.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133225106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Redundant-wires-aware ECO timing and mask cost optimization 冗余线感知ECO时序和掩模成本优化
Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653648
Shao-Yun Fang, Tzuo-Fan Chien, Yao-Wen Chang
Spare cells are often used in engineering change order (ECO) timing optimization. By applying spare-cell rewiring techniques, timing-violated paths in a design can be fixed. In addition, mask re-spin cost economization has become a critical challenge for modern IC design, and it can be achieved by reducing the number of layers used to rewire spare cells. This paper presents the first work for the problem of ECO timing optimization considering redundant wires (unused wires or dummy metals) to minimize the number of rewiring layers. We first propose a multi-commodity flow model for the spare-cell selection problem and apply integer linear programming (ILP) to simultaneously optimize all timing-violated paths. The ILP formulation minimizes the number of used spare cells and considers the routability of the selected spare cells. Then, we develop a tile-based ECO router which minimizes the number of rewiring layers by reusing redundant wires. Experimental results based on five industry benchmarks show that our algorithm not only effectively resolves timing violations but also reduces the number of rewiring layers under reasonable runtime.
备用电池常用于工程变更顺序(ECO)时序优化。通过应用备用单元重布线技术,可以固定设计中违反时间的路径。此外,掩膜重旋成本节约已成为现代IC设计的一个关键挑战,这可以通过减少用于重新布线备用电池的层数来实现。本文提出了考虑冗余导线(未使用导线或假金属)的ECO时序优化问题的第一个工作,以尽量减少重布线层的数量。本文首先提出了备用单元选择问题的多商品流模型,并应用整数线性规划(ILP)同时优化所有违反时间的路径。ILP公式最大限度地减少备用电池的使用数量,并考虑所选备用电池的可达性。然后,我们开发了一种基于瓷砖的ECO路由器,通过重复使用冗余的电线,最大限度地减少了重新布线层的数量。基于五个行业基准的实验结果表明,该算法在合理的运行时间下,不仅有效地解决了时序违规问题,而且减少了重布线层数。
{"title":"Redundant-wires-aware ECO timing and mask cost optimization","authors":"Shao-Yun Fang, Tzuo-Fan Chien, Yao-Wen Chang","doi":"10.1109/ICCAD.2010.5653648","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5653648","url":null,"abstract":"Spare cells are often used in engineering change order (ECO) timing optimization. By applying spare-cell rewiring techniques, timing-violated paths in a design can be fixed. In addition, mask re-spin cost economization has become a critical challenge for modern IC design, and it can be achieved by reducing the number of layers used to rewire spare cells. This paper presents the first work for the problem of ECO timing optimization considering redundant wires (unused wires or dummy metals) to minimize the number of rewiring layers. We first propose a multi-commodity flow model for the spare-cell selection problem and apply integer linear programming (ILP) to simultaneously optimize all timing-violated paths. The ILP formulation minimizes the number of used spare cells and considers the routability of the selected spare cells. Then, we develop a tile-based ECO router which minimizes the number of rewiring layers by reusing redundant wires. Experimental results based on five industry benchmarks show that our algorithm not only effectively resolves timing violations but also reduces the number of rewiring layers under reasonable runtime.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134621123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1