2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)最新文献

英文中文

SMATO: Simultaneous mask and target optimization for improving lithographic process window SMATO:用于改善光刻工艺窗口的同时掩模和目标优化

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654341

Shayak Banerjee, K. Agarwal, M. Orshansky

Low-k1 lithography results in features that suffer from poor lithographic yield in the presence of process variation. The problem is especially pronounced for lower level metals used for local routing, where bi-directionality gives rise to lithography unfriendly layout patterns. However, one can modify such wires without significantly affecting design behavior. In this paper, we propose to simultaneously modify mask and target during OPC to improve lithographic yield. The method uses image slope information, available during image simulation at no extra cost, as a measure of process window. We derive a cost function that maximizes both contour fidelity and robustness to drive our simultaneous mask and target optimization (SMATO) method. We then develop analytical equations to predict the cost for a given mask and target modification and use a fast algorithm to minimize this cost function to obtain an optimal mask and target solution. Our experiments on sample metal1 (M1) layouts show that the use of SMATO reduces the Process Manufacturability Index (PMI) [18] by 15.4% compared to OPC, which further leads to 69% reduction in the number of layout hotspots. Additionally, such improvement is obtained at low average runtime overhead (5.5%). Compared to PWOPC, we observe 4.6% improvement in PMI at large (2.6X) improvement in runtime.

低k1光刻导致在存在工艺变化的情况下遭受光刻成品率差的特征。对于用于本地路由的低级别金属，这个问题尤其明显，双向性会导致光刻不友好的布局模式。然而，我们可以在不显著影响设计行为的情况下修改这些线路。本文提出在OPC过程中同时修改掩模和靶，以提高光刻成品率。该方法使用图像斜率信息作为过程窗口的度量，该信息在图像仿真过程中无需额外成本即可获得。我们推导了一个成本函数，最大限度地提高了轮廓保真度和鲁棒性，以驱动我们的同步掩模和目标优化(SMATO)方法。然后，我们开发了分析方程来预测给定掩模和目标修改的成本，并使用快速算法最小化该成本函数以获得最优掩模和目标解。我们对样品金属1 (M1)布局的实验表明，与OPC相比，SMATO的使用使过程可制造性指数(PMI)[18]降低了15.4%，这进一步导致布局热点数量减少了69%。此外，这种改进是在较低的平均运行时开销(5.5%)下获得的。与PWOPC相比，我们观察到运行时PMI总体提高了4.6%(2.6倍)。

{"title":"SMATO: Simultaneous mask and target optimization for improving lithographic process window","authors":"Shayak Banerjee, K. Agarwal, M. Orshansky","doi":"10.1109/ICCAD.2010.5654341","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654341","url":null,"abstract":"Low-k1 lithography results in features that suffer from poor lithographic yield in the presence of process variation. The problem is especially pronounced for lower level metals used for local routing, where bi-directionality gives rise to lithography unfriendly layout patterns. However, one can modify such wires without significantly affecting design behavior. In this paper, we propose to simultaneously modify mask and target during OPC to improve lithographic yield. The method uses image slope information, available during image simulation at no extra cost, as a measure of process window. We derive a cost function that maximizes both contour fidelity and robustness to drive our simultaneous mask and target optimization (SMATO) method. We then develop analytical equations to predict the cost for a given mask and target modification and use a fast algorithm to minimize this cost function to obtain an optimal mask and target solution. Our experiments on sample metal1 (M1) layouts show that the use of SMATO reduces the Process Manufacturability Index (PMI) [18] by 15.4% compared to OPC, which further leads to 69% reduction in the number of layout hotspots. Additionally, such improvement is obtained at low average runtime overhead (5.5%). Compared to PWOPC, we observe 4.6% improvement in PMI at large (2.6X) improvement in runtime.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114643261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

Recent research development in PCB layout PCB布局的最新研究进展

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654190

Tan Yan, Martin D. F. Wong

The increasing complexity of electronic systems has made PCB layout a difficult problem. A large amount of research efforts are dedicated to the study of this problem. In this paper, we provide an overview of recent research results on the PCB layout problem. We focus on the escape routing problem and the length-matching routing problem, which are the two most important problems in PCB layout. Other relevant works are also briefly introduced.

电子系统的复杂性日益增加，使得PCB布局成为一个难题。大量的研究工作致力于研究这个问题。在本文中，我们概述了近年来在PCB布局问题上的研究成果。重点研究了逃逸布线问题和长度匹配布线问题，这是PCB布线中最重要的两个问题。并简要介绍了其他相关工作。

引用次数: 29

Engineering a scalable Boolean matching based on EDA SaaS 2.0 设计一个基于EDA SaaS 2.0的可伸缩布尔匹配

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654275

Chun Zhang, Yu Hu, Lingli Wang, Lei He, J. Tong

Software as a Service (SaaS) 1.0 signifcantly lowers the infrastructure and maintenance cost and increases the accessibility of the software by hosting software via the web. Compared with SaaS 1.0, SaaS 2.0 is more flexible since it leverages software tools from both server and client sides with closer interaction between them. The SaaS 2.0 paradigm provides new opportunities and challenges for EDA. In this paper, we take Boolean matching, one of the core sub algorithms in logic synthesis for field programmable gate arrays (FPGAs), as a case study. We investigate the advantages and challenges of implementing a scalable EDA algorithm under SaaS 2.0 paradigm from a technical perspective. We propose SaaS-BM, a new Boolean matching algorithm customized to take full advantage of the cloud while addressing concerns such as security and the internet bandwidth limit. Extensive experiments are performed under a networked environment with concurrent accesses. Integrated into a post-mapping re-synthesis algorithm minimizing area, the proposed SaaS-BM is 863X times faster than state-of-the-art SAT-based Boolean matching with 0.5% area overhead. Compared with a recent Bloom Filter-based Boolean matching algorithm, our proposed SaaS-BM is 53X times faster on large circuits with no area overhead.

软件即服务(SaaS) 1.0显著降低了基础设施和维护成本，并通过web托管软件，增加了软件的可访问性。与SaaS 1.0相比，SaaS 2.0更加灵活，因为它利用了服务器端和客户端的软件工具，两者之间的交互更加紧密。SaaS 2.0范式为EDA提供了新的机遇和挑战。本文以现场可编程门阵列(fpga)逻辑综合的核心子算法之一布尔匹配为例进行了研究。我们从技术角度研究了在SaaS 2.0范式下实现可扩展EDA算法的优势和挑战。我们提出SaaS-BM，这是一种新的布尔匹配算法，旨在充分利用云，同时解决安全性和互联网带宽限制等问题。在具有并发访问的网络环境下进行了大量的实验。集成到映射后再合成算法中，最小化面积，所提出的SaaS-BM比最先进的基于sat的布尔匹配快863X，面积开销为0.5%。与最近基于布隆滤波器的布尔匹配算法相比，我们提出的SaaS-BM在没有面积开销的大型电路上速度快53倍。

{"title":"Engineering a scalable Boolean matching based on EDA SaaS 2.0","authors":"Chun Zhang, Yu Hu, Lingli Wang, Lei He, J. Tong","doi":"10.1109/ICCAD.2010.5654275","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5654275","url":null,"abstract":"Software as a Service (SaaS) 1.0 signifcantly lowers the infrastructure and maintenance cost and increases the accessibility of the software by hosting software via the web. Compared with SaaS 1.0, SaaS 2.0 is more flexible since it leverages software tools from both server and client sides with closer interaction between them. The SaaS 2.0 paradigm provides new opportunities and challenges for EDA. In this paper, we take Boolean matching, one of the core sub algorithms in logic synthesis for field programmable gate arrays (FPGAs), as a case study. We investigate the advantages and challenges of implementing a scalable EDA algorithm under SaaS 2.0 paradigm from a technical perspective. We propose SaaS-BM, a new Boolean matching algorithm customized to take full advantage of the cloud while addressing concerns such as security and the internet bandwidth limit. Extensive experiments are performed under a networked environment with concurrent accesses. Integrated into a post-mapping re-synthesis algorithm minimizing area, the proposed SaaS-BM is 863X times faster than state-of-the-art SAT-based Boolean matching with 0.5% area overhead. Compared with a recent Bloom Filter-based Boolean matching algorithm, our proposed SaaS-BM is 53X times faster on large circuits with no area overhead.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125130944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

On behavioral model equivalence checking for large analog/mixed signal systems 大型模拟/混合信号系统行为模型等价性检验

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5651402

Amandeep Singh, Peng Li

This paper presents a systematic, hierarchical, optimization based semi-formal equivalence checking methodology for large analog/mixed signal systems such as PLLs, ADCs and I/O's. We verify the equivalence between a behavioral model and its electrical implementation over a limited, but highly likely, input space defined as the Constrained Behavioral Input Space. Further, we clearly distinguish between the behavioral and electrical domains and define mappings between the two domains to allow for calculation of deviation between the behavioral and electrical implementation. The verification problem is then formulated as an optimization problem which is solved by interfacing a SQP based optimizer with commercial circuit simulation tools. The proposed methodology is then applied for equivalence checking of a PLL as a test case.

本文提出了一种系统的、分层的、基于优化的半形式等效检验方法，用于大型模拟/混合信号系统，如锁相环、adc和I/O。我们在一个有限的，但极有可能的，被定义为约束行为输入空间的输入空间上验证了行为模型和它的电子实现之间的等价性。此外，我们清楚地区分了行为和电领域，并定义了两个领域之间的映射，以便计算行为和电实现之间的偏差。然后将验证问题表述为优化问题，通过将基于SQP的优化器与商用电路仿真工具相结合来解决优化问题。然后将提出的方法应用于锁相环的等效性检查作为测试用例。

引用次数: 26

Symbolic system level reliability analysis 符号系统级可靠性分析

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5654134

M. Glaß, M. Lukasiewycz, Felix Reimann, C. Haubelt, J. Teich

More and more embedded systems provide a multitude of services, implemented by a large number of networked hardware components. In early design phases, dimensioning such complex systems in terms of monetary costs, power consumption, reliability etc. demands for new analysis approaches at the electronic system level. In this paper, two symbolic system level reliability analysis approaches are introduced. First, a formal approach based on Binary Decision Diagrams is presented that allows to calculate exact reliability measures for small to moderate-sized systems. Second, a simulative approach is presented that hybridizes a Monte Carlo simulation with a SAT solver and delivers adequate approximations of the reliability measures for large and complex systems.

越来越多的嵌入式系统提供大量的服务，由大量的网络硬件组件实现。在早期设计阶段，从货币成本、功耗、可靠性等方面对如此复杂的系统进行量纲化，需要在电子系统层面采用新的分析方法。本文介绍了两种符号系统级可靠性分析方法。首先，提出了一种基于二元决策图的形式化方法，允许计算小型到中等规模系统的精确可靠性度量。其次，提出了一种模拟方法，该方法将蒙特卡罗模拟与SAT求解器相结合，并为大型复杂系统提供了足够的可靠性度量近似值。

引用次数: 11

A synthesis flow for digital signal processing with biomolecular reactions 用生物分子反应处理数字信号的合成流程

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.5555/2133429.2133518

Hua Jiang, Aleksandra P. Kharam, Marc D. Riedel, K. Parhi

We present a methodology for implementing digital signal processing (DSP) operations such as filtering with biomolecular reactions. From a DSP specification, we demonstrate how to synthesize biomolecular reactions that produce time-varying output quantities of molecules as a function of time-varying input quantities. Unlike all previous schemes for biomolecular computation, ours produces designs that are dependent only on coarse rate categories for the reactions (“fast” and “slow”). Given such categories, the computation is exact and independent of the specific reaction rates. We implement DSP operations through a self-timed “handshaking” protocol that transfers quantities between molecular types based on the absence of other types. We illustrate our methodology with the design of a simple moving-average filter as well as a more complex biquad filter. We validate our designs through transient stochastic simulations of the chemical kinetics. Although conceptual for the time being, the proposed methodology has potential applications in domains of synthetic biology such as biochemical sensing and drug delivery. We are exploring DNA-based computation via strand displacement as a possible experimental chassis.

我们提出了一种实现数字信号处理(DSP)操作的方法，如生物分子反应滤波。从DSP规范，我们演示了如何合成生物分子反应，产生时变输出量的分子作为时变输入量的函数。与之前所有的生物分子计算方案不同，我们的设计只依赖于反应的粗略速率类别(“快”和“慢”)。给定这些类别，计算是精确的，与特定的反应速率无关。我们通过自定时“握手”协议实现DSP操作，该协议在缺乏其他类型的分子类型之间传输数量。我们通过设计一个简单的移动平均滤波器以及一个更复杂的二元滤波器来说明我们的方法。我们通过化学动力学的瞬态随机模拟来验证我们的设计。虽然目前还处于概念阶段，但该方法在生物化学传感和药物传递等合成生物学领域具有潜在的应用前景。我们正在探索基于dna的计算通过链位移作为可能的实验底盘。

{"title":"A synthesis flow for digital signal processing with biomolecular reactions","authors":"Hua Jiang, Aleksandra P. Kharam, Marc D. Riedel, K. Parhi","doi":"10.5555/2133429.2133518","DOIUrl":"https://doi.org/10.5555/2133429.2133518","url":null,"abstract":"We present a methodology for implementing digital signal processing (DSP) operations such as filtering with biomolecular reactions. From a DSP specification, we demonstrate how to synthesize biomolecular reactions that produce time-varying output quantities of molecules as a function of time-varying input quantities. Unlike all previous schemes for biomolecular computation, ours produces designs that are dependent only on coarse rate categories for the reactions (“fast” and “slow”). Given such categories, the computation is exact and independent of the specific reaction rates. We implement DSP operations through a self-timed “handshaking” protocol that transfers quantities between molecular types based on the absence of other types. We illustrate our methodology with the design of a simple moving-average filter as well as a more complex biquad filter. We validate our designs through transient stochastic simulations of the chemical kinetics. Although conceptual for the time being, the proposed methodology has potential applications in domains of synthetic biology such as biochemical sensing and drug delivery. We are exploring DNA-based computation via strand displacement as a possible experimental chassis.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"301 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128622487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 21

PEDS: Passivity enforcement for descriptor systems via Hamiltonian-symplectic matrix pencil perturbation 用哈密顿-辛矩阵铅笔摄动实现广义系统的无源性

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653885

Yuanzhe Wang, Zheng Zhang, Cheng-Kok Koh, G. Pang, N. Wong

Passivity is a crucial property of macromodels to guarantee stable global (interconnected) simulation. However, weakly nonpassive models may be generated for passive circuits and systems in various contexts, such as data fitting, model order reduction (MOR) and electromagnetic (EM) macromodeling. Therefore, a post-processing passivity enforcement algorithm is desired. Most existing algorithms are designed to handle pole-residue models. The few algorithms for state space models only handle regular systems (RSs) with a nonsingular D+DT term. To the authors' best knowledge, no algorithm has been proposed to enforce passivity for more general descriptor systems (DSs) and state space models with singular D+DT terms. In this paper, a new post-processing passivity enforcement algorithm based on perturbation of Hamiltonian-symplectic matrix pencil, PEDS, is proposed. PEDS, for the first time, can enforce passivity for DSs. It can also handle all kinds of state space models (both RSs and DSs) with singular D+DT terms. Moreover, a criterion to control the error of perturbation is devised, with which the optimal passive models with the best accuracy can be obtained. Numerical examples then verify that PEDS is efficient, robust and relatively cheap for passivity enforcement of DSs with mild passivity violations.

无源性是保证宏观模型稳定全局(互联)仿真的关键特性。然而，在各种情况下，弱非被动模型可用于无源电路和系统，如数据拟合，模型降阶(MOR)和电磁(EM)宏建模。因此，需要一种后处理被动执行算法。大多数现有算法都是为了处理极点残数模型而设计的。少数用于状态空间模型的算法仅处理具有非奇异D+DT项的正则系统。据作者所知，目前还没有提出任何算法来为更一般的描述符系统(ds)和具有奇异D+DT项的状态空间模型强制执行无源性。提出了一种基于哈密顿-辛矩阵铅笔摄动的后处理无源增强算法PEDS。PEDS第一次可以强制执行ds的被动性。它还可以处理具有奇异D+DT项的各种状态空间模型(RSs和ds)。此外，还设计了一种控制摄动误差的准则，利用该准则可以得到精度最高的最优被动模型。数值算例验证了PEDS对于具有轻微无源违例的DSs的无源强制执行是有效的、鲁棒的和相对便宜的。

{"title":"PEDS: Passivity enforcement for descriptor systems via Hamiltonian-symplectic matrix pencil perturbation","authors":"Yuanzhe Wang, Zheng Zhang, Cheng-Kok Koh, G. Pang, N. Wong","doi":"10.1109/ICCAD.2010.5653885","DOIUrl":"https://doi.org/10.1109/ICCAD.2010.5653885","url":null,"abstract":"Passivity is a crucial property of macromodels to guarantee stable global (interconnected) simulation. However, weakly nonpassive models may be generated for passive circuits and systems in various contexts, such as data fitting, model order reduction (MOR) and electromagnetic (EM) macromodeling. Therefore, a post-processing passivity enforcement algorithm is desired. Most existing algorithms are designed to handle pole-residue models. The few algorithms for state space models only handle regular systems (RSs) with a nonsingular D+DT term. To the authors' best knowledge, no algorithm has been proposed to enforce passivity for more general descriptor systems (DSs) and state space models with singular D+DT terms. In this paper, a new post-processing passivity enforcement algorithm based on perturbation of Hamiltonian-symplectic matrix pencil, PEDS, is proposed. PEDS, for the first time, can enforce passivity for DSs. It can also handle all kinds of state space models (both RSs and DSs) with singular D+DT terms. Moreover, a criterion to control the error of perturbation is devised, with which the optimal passive models with the best accuracy can be obtained. Numerical examples then verify that PEDS is efficient, robust and relatively cheap for passivity enforcement of DSs with mild passivity violations.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"351 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123324852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 24

Clustering-based simultaneous task and voltage scheduling for NoC systems 基于聚类的NoC系统同步任务与电压调度

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.5555/2133429.2133488

Yifang Liu, Yu Yang, Jiang Hu

Networks-on-chip (NoC) is emerging as a promising communication structure, which is scalable with respect to chip complexity. Meanwhile, latest chip designs are increasingly leveraging multiple voltage-frequency domains for energy-efficiency improvement. In this work, we propose a simultaneous task and voltage scheduling algorithm for energy minimization in NoC based designs. The energy-latency tradeoff is handled by Lagrangian relaxation. The core algorithm is a clustering based approach which not only assigns voltage levels and starting time to each task (or Processing Element) but also naturally finds voltage-frequency clusters. Compared to a recent previous work, which performs task scheduling and voltage assignment sequentially, our method leads to an average of 20% energy reduction.

片上网络(NoC)作为一种有前途的通信结构正在兴起，它在芯片复杂性方面具有可扩展性。同时，最新的芯片设计越来越多地利用多个电压频率域来提高能效。在这项工作中，我们提出了一种同步任务和电压调度算法，用于基于NoC的设计中的能量最小化。能量延迟的权衡是由拉格朗日弛豫来处理的。核心算法是一种基于聚类的方法，它不仅为每个任务(或处理单元)分配电压水平和开始时间，而且自然地找到电压频率聚类。与之前的一项工作相比，该方法可以依次执行任务调度和电压分配，平均减少20%的能量。

引用次数: 11

Low-power clock trees for CPUs cpu的低功耗时钟树

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1109/ICCAD.2010.5653738

Dongjin Lee, Myung-Chul Kim, I. Markov

Clock networks contribute a significant fraction of dynamic power and can be a limiting factor in high-performance CPUs and SoCs. The need for multi-objective optimization over a large parameter space and the increasing impact of process variation make clock network synthesis particularly challenging. In this work, we develop new modeling techniques and algorithms, as well as a methodology, for clock power optimization subject to tight skew constraints in the presence of process variations. Key contributions include a new time-budgeting step for clock-tree tuning, accurate optimizations that satisfy budgets, modeling and optimization of variational skew. Our implementation, Contango 2.0, outperforms the winners of the ISPD 2010 clock-network synthesis contest on 45nm benchmarks from Intel and IBM.

时钟网络占动态功耗的很大一部分，并且可能成为高性能cpu和soc的限制因素。在大参数空间上进行多目标优化的需要以及过程变化的影响日益增加，使得时钟网络的合成特别具有挑战性。在这项工作中，我们开发了新的建模技术和算法，以及一种方法，用于在存在工艺变化的情况下受严格偏态约束的时钟功率优化。主要贡献包括用于时钟树调整的新的时间预算步骤，满足预算的精确优化，变分偏差建模和优化。我们的实现，Contango 2.0，在英特尔和IBM的45纳米基准上优于ISPD 2010时钟网络合成竞赛的获胜者。

引用次数: 34

Scheduling of synchronous data flow models on scratchpad memory based embedded processors 基于刮本存储器的嵌入式处理器上同步数据流模型的调度

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Pub Date : 2010-11-07 DOI: 10.1145/2536747.2536752

W. Che, Karam S. Chatha

Many embedded processors incorporate scratchpad memories (SPM) due to their lower power consumption characteristics. SPMs are utilized to host both code and data, often on the same physical unit. Synchronous dataflow (SDF) is a popular format for specifying many embedded system applications particularly in multimedia and network processing domains. Execution of SDF specifications on SPM based processors involves division of memory between actor code and buffers, and scheduling of actor executions and code overlays such that latency is minimized subject to the memory constraints. In our problem instance a traditional minimum buffer SDF schedule could require a larger code overlay overhead and therefore may not be optimal. The paper presents a three stage integer linear programming (ILP) formulation to solve the problem. Further, the paper also introduces modifications to the three stage ILP for incorporating code prefetching optimization to further reduce the code overlay overhead. The effectiveness of the proposed approaches is evaluated by comparisons with minimum buffer SDF schedules for several benchmark applications.

由于功耗较低，许多嵌入式处理器都采用了刮板存储器(SPM)。spm用于同时托管代码和数据，通常在同一个物理单元上。同步数据流(SDF)是一种流行的格式，用于指定许多嵌入式系统应用程序，特别是在多媒体和网络处理领域。在基于SPM的处理器上执行SDF规范涉及到参与者代码和缓冲区之间的内存划分，以及参与者执行和代码覆盖的调度，以便在内存约束下最小化延迟。在我们的问题实例中，传统的最小缓冲区SDF调度可能需要更大的代码覆盖开销，因此可能不是最优的。本文提出了一个三期整数线性规划(ILP)公式来解决这一问题。此外，本文还介绍了对三阶段ILP的修改，以纳入代码预取优化，以进一步减少代码覆盖开销。通过与几个基准应用程序的最小缓冲区SDF调度进行比较，评估了所提出方法的有效性。

{"title":"Scheduling of synchronous data flow models on scratchpad memory based embedded processors","authors":"W. Che, Karam S. Chatha","doi":"10.1145/2536747.2536752","DOIUrl":"https://doi.org/10.1145/2536747.2536752","url":null,"abstract":"Many embedded processors incorporate scratchpad memories (SPM) due to their lower power consumption characteristics. SPMs are utilized to host both code and data, often on the same physical unit. Synchronous dataflow (SDF) is a popular format for specifying many embedded system applications particularly in multimedia and network processing domains. Execution of SDF specifications on SPM based processors involves division of memory between actor code and buffers, and scheduling of actor executions and code overlays such that latency is minimized subject to the memory constraints. In our problem instance a traditional minimum buffer SDF schedule could require a larger code overlay overhead and therefore may not be optimal. The paper presents a three stage integer linear programming (ILP) formulation to solve the problem. Further, the paper also introduces modifications to the three stage ILP for incorporating code prefetching optimization to further reduce the code overlay overhead. The effectiveness of the proposed approaches is evaluated by comparisons with minimum buffer SDF schedules for several benchmark applications.","PeriodicalId":344703,"journal":{"name":"2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132514283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀