2008 45th ACM/IEEE Design Automation Conference最新文献

英文中文

Merging nodes under sequential observability 在顺序可观察性条件下合并节点

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391605

Michael L. Case, Victor N. Kravets, A. Mishchenko, R. Brayton

This paper presents a new type of sequential technology independent synthesis. Building on the previous notions of combinational observability and sequential equivalence, sequential observability is introduced and discussed. By considering both the sequential nature of the design and observability simultaneously, better results can be obtained than with either algorithm alone. The experimental results show that this method can reduce the technology-independent gate count up to 10% more than the previously best known synthesis techniques.

提出了一种新型的序列技术独立合成。在先前的组合可观测性和顺序等价概念的基础上，引入并讨论了顺序可观测性。通过同时考虑设计的顺序性和可观测性，可以获得比单独使用任何一种算法更好的结果。实验结果表明，与现有的合成技术相比，该方法可将与技术无关的栅极数减少10%以上。

引用次数: 20

Application mapping for chip multiprocessors 芯片多处理器的应用映射

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391628

Guangyu Chen, Feihui Li, S. Son, M. Kandemir

The problem attacked in this paper is one of automatically mapping an application onto a network-on-chip (NoC) based chip multiprocessor (CMP) architecture in a locality-aware fashion. The proposed compiler approach has four major steps: task scheduling, processor mapping, data mapping, and packet routing. In the first step, the application code is parallelized and the resulting parallel threads are assigned to virtual processors. The second step implements a virtual processor-to-physical processor mapping. The goal of this mapping is to ensure that the threads that are expected to communicate frequently with each other are assigned to neighboring processors as much as possible. In the third step, data elements are mapped to memories attached to CMP nodes. The main objective of this mapping is to place a given data item into a node which is close to the nodes that access it. The last step of our approach determines the paths (between memories and processors) for data to travel in an energy efficient manner. In this paper, we describe the compiler algorithms we implemented in detail and present an experimental evaluation of the framework. In our evaluation, we test our entire framework as well as the impact of omitting some of its steps. This experimental analysis clearly shows that the proposed framework reduces energy consumption of our applications significantly (27.41% on average over a pure performance oriented application mapping strategy) as a result of improved locality of data accesses.

本文研究的问题是以位置感知的方式将应用程序自动映射到基于片上网络(NoC)的芯片多处理器(CMP)体系结构上。提出的编译器方法有四个主要步骤:任务调度、处理器映射、数据映射和数据包路由。在第一步中，将应用程序代码并行化，并将生成的并行线程分配给虚拟处理器。第二步实现虚拟处理器到物理处理器的映射。这种映射的目标是确保将预期频繁相互通信的线程尽可能多地分配给相邻的处理器。在第三步中，将数据元素映射到附加到CMP节点的内存。此映射的主要目标是将给定的数据项放置到靠近访问它的节点的节点中。我们方法的最后一步决定了数据以一种节能的方式传输的路径(存储器和处理器之间)。在本文中，我们详细描述了我们实现的编译算法，并给出了该框架的实验评估。在我们的评估中，我们测试了整个框架以及省略其中一些步骤的影响。这个实验分析清楚地表明，由于改进了数据访问的局域性，所提出的框架显着降低了应用程序的能耗(比纯面向性能的应用程序映射策略平均降低了27.41%)。

{"title":"Application mapping for chip multiprocessors","authors":"Guangyu Chen, Feihui Li, S. Son, M. Kandemir","doi":"10.1145/1391469.1391628","DOIUrl":"https://doi.org/10.1145/1391469.1391628","url":null,"abstract":"The problem attacked in this paper is one of automatically mapping an application onto a network-on-chip (NoC) based chip multiprocessor (CMP) architecture in a locality-aware fashion. The proposed compiler approach has four major steps: task scheduling, processor mapping, data mapping, and packet routing. In the first step, the application code is parallelized and the resulting parallel threads are assigned to virtual processors. The second step implements a virtual processor-to-physical processor mapping. The goal of this mapping is to ensure that the threads that are expected to communicate frequently with each other are assigned to neighboring processors as much as possible. In the third step, data elements are mapped to memories attached to CMP nodes. The main objective of this mapping is to place a given data item into a node which is close to the nodes that access it. The last step of our approach determines the paths (between memories and processors) for data to travel in an energy efficient manner. In this paper, we describe the compiler algorithms we implemented in detail and present an experimental evaluation of the framework. In our evaluation, we test our entire framework as well as the impact of omitting some of its steps. This experimental analysis clearly shows that the proposed framework reduces energy consumption of our applications significantly (27.41% on average over a pure performance oriented application mapping strategy) as a result of improved locality of data accesses.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129107571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 95

Partial order reduction for scalable testing of SystemC TLM designs SystemC TLM设计可扩展测试的偏阶约简

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391706

Sudipta Kundu, Malay K. Ganai, Rajesh K. Gupta

A SystemC simulation kernel consists of a deterministic implementation of the scheduler, whose specification is non- deterministic. To leverage testing of a SystemC TLM design, we focus on automatically exploring all possible behaviors of the design for a given data input. We combine static and dynamic partial order reduction techniques with SystemC semantics to intelligently explore a subset of the possible traces, while still being provably sufficient for detecting deadlocks and safety property violations. We have implemented our exploration algorithm in a framework called Satya and have applied it to a variety of examples including the TAC benchmark. Using Satya, we automatically found an assertion violation in a benchmark distributed as a part of the OSCI repository.

SystemC仿真内核由调度程序的确定性实现组成，调度程序的规范是非确定性的。为了利用SystemC TLM设计的测试，我们专注于自动探索给定数据输入的设计的所有可能行为。我们将静态和动态偏序约简技术与SystemC语义结合起来，智能地探索可能的踪迹子集，同时仍然足以检测死锁和安全属性违规。我们已经在一个名为Satya的框架中实现了我们的探索算法，并将其应用于包括TAC基准在内的各种示例。使用Satya，我们在作为OSCI存储库一部分分发的基准测试中自动发现了一个断言违规。

引用次数: 52

Analog parallelism in ring-based VCOs 基于环的压控振荡器中的模拟并行性

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391557

Daeik D. Kim, Choongyeun Cho, Jonghae Kim

The performance advantages in parallel ring-based VCOs are explored. When the number of VCOs is doubled, the parallel VCOs enhance phase noise by 3 dB, and the within-chip process-induced variation is reduced by 3 dB, which improves chip-limited yield. The parallel VCOs trade off circuit area and power in exchange. The parallelism advantages in analog and digital systems are compared.

探讨了并联环型压控振荡器的性能优势。当vco数量增加一倍时，并行vco的相位噪声提高了3 dB，芯片内工艺引起的变化降低了3 dB，从而提高了芯片受限良率。并联vco交换电路面积和功率。比较了模拟系统和数字系统并行性的优点。

引用次数: 1

Pro-VIZOR: Process tunable virtually zero margin low power adaptive RF for wireless systems Pro-VIZOR:用于无线系统的过程可调谐几乎为零边际的低功率自适应射频

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391595

Shreyas Sen, Vishwanath Natarajan, R. Senguttuvan, A. Chatterjee

In this paper, a process tunable, continuously adaptive wireless front end architecture and related adaptation algorithms are presented that allow an RF transceiver to function at minimum power irrespective of channel conditions and process variability induced performance loss in the RF front end and baseband interface. Current wireless transceiver front ends are designed for worst case channel conditions and a limited degree of post manufacture tuning is performed to compensate for process variations. It is shown how the proposed architecture can result in significant power savings over current practice without compromising system-level bit error rate. The adaptation methodology is applied to a WLAN transceiver design and hardware measurement data for an adaptive receiver is presented.

在本文中，提出了一种过程可调、连续自适应的无线前端架构和相关的自适应算法，该算法允许射频收发器在最小功率下工作，而不考虑信道条件和过程可变性在射频前端和基带接口中引起的性能损失。目前的无线收发器前端是为最坏的信道条件而设计的，并且进行了有限程度的制造后调谐以补偿工艺变化。它展示了所提出的架构如何在不影响系统级误码率的情况下，在当前实践中显著节省功耗。将该自适应方法应用于无线局域网收发器设计，并给出了自适应接收机的硬件测量数据。

引用次数: 43

Process variation tolerant SRAM array for ultra low voltage applications 超低电压应用的过程变化容忍SRAM阵列

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391498

J. Kulkarni, Keejong Kim, S. P. Park, K. Roy

In this work, we propose a Schmitt Trigger (ST) based differential sensing SRAM bitcell that can operate at ultra-low supply voltage. The proposed Schmitt Trigger SRAM cell addresses the fundamental conflicting design requirement of read versus write operation of a conventional 6T cell. Schmitt Trigger operation gives better read-stability and as well as better write- ability compared to the standard 6T cell. The proposed ST bitcell incorporates a built-in feedback mechanism, achieving process variation tolerance -- a must for future nano-scaled technology nodes. Measurements on 10 test-chips fabricated in 130 nm technology show that the proposed Schmitt Trigger bitcell gives 58% higher read Static Noise Margin (SNM), 2X higher write- trip-point and 120 mV lower read Vmin compared to the conventional 6T cell. The ST SRAM array is operational at 150mV of supply voltage.

在这项工作中，我们提出了一种基于施密特触发器(ST)的差分传感SRAM位单元，可以在超低电源电压下工作。提出的施密特触发器SRAM单元解决了传统6T单元的读与写操作的基本冲突设计要求。与标准6T电池相比，施密特触发操作提供了更好的读取稳定性和更好的写入能力。所提出的ST bitcell集成了内置的反馈机制，实现了对工艺变化的容忍度——这是未来纳米级技术节点所必须的。对10个采用130纳米技术制造的测试芯片的测量表明，与传统的6T电池相比，所提出的Schmitt Trigger位电池的读取静态噪声边界(SNM)提高了58%，写入触发点提高了2倍，读取Vmin降低了120 mV。ST SRAM阵列在150mV电源电压下工作。

引用次数: 74

Model checking based analysis of end-to-end latency in embedded, real-time systems with clock drifts 基于模型检查的嵌入式实时系统端到端时延分析

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391544

S. Mohalik, A. Rajeev, Manoj G. Dixit, S. Ramesh, P. Suman, P. Pandya, Shengbing Jiang

End-to-end latency of messages is an important design parameter that needs to be within specified bounds for the correct functioning of distributed real-time control systems. In this paper we give a formal definition of end-to-end latency, and use this as the basis for checking whether a stipulated deadline is violated within a bounded time. For unbounded verification, we model the system as a set of communicating timed automata, and perform reachability analysis. The proposed method takes into account the drift of clocks which is shown to affect the latency appreciably. The method has been tested on a medium sized automotive example.

消息的端到端延迟是一个重要的设计参数，需要在指定的范围内才能使分布式实时控制系统正常运行。本文给出了端到端延迟的形式化定义，并以此作为检查在限定时间内是否违反规定截止日期的依据。对于无界验证，我们将系统建模为一组通信时间自动机，并进行可达性分析。所提出的方法考虑了时钟漂移对延迟的显著影响。该方法已在一个中型汽车实例上进行了验证。

引用次数: 28

An 8×8 run-time reconfigurable FPGA embedded in a SoC 嵌入在SoC中的8×8运行时可重构FPGA

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391500

S. Chaudhuri, S. Guilley, Florent Flament, P. Hoogvorst, J. Danger

This paper presents a RTR FPGA embedded in a system on chip fabricated in 130 nm CMOS process. Various aspects of the design flow, from automation to floor-planning are discussed. We explain the measures taken in the FPGA design to guarantee RTR functionality free of electrical conflicts, and we present a flow based on Altera synthesis tools to implement IPs(hardware blocks) in this FPGA. We demonstrate the full functionality with experiments on the FPGA, and as conclusion we highlight the limitations and future research directions.

本文提出了一种RTR FPGA嵌入130纳米CMOS制程的片上系统。讨论了设计流程的各个方面，从自动化到楼层规划。我们解释了在FPGA设计中采取的措施，以保证RTR功能没有电气冲突，我们提出了一个基于Altera合成工具的流程，在该FPGA中实现ip(硬件块)。我们在FPGA上进行了实验，展示了该方法的全部功能，并指出了局限性和未来的研究方向。

引用次数: 11

Power gating scheduling for power/ground noise reduction 电源门控调度电源/地噪声降低

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391716

Hailin Jiang, M. Marek-Sadowska

Power gating is a technique for efficiently reducing leakage power by disconnecting idle blocks from the power grid. When gated blocks are woken up, large amounts of switching currents are drawn in a short period of time that may introduce severe noise on the power delivery mesh. In this paper, we propose a GA-based approach to schedule power gating considering power/ground noise. We introduce a simulation-based method to accurately and efficiently estimate the worst case noise, taking all the current sources, inductance and decaps' effects into consideration. We also present an incremental scheduling procedure considering the dynamic changes of decap configuration. Experimental results show that by optimally scheduling the wake-up order under time constraints, our technique can reduce noise up to 50% compared to waking gated blocks simultaneously. The quality of results depends upon the total wake-up time constraint, locations of gated blocks, current densities of gated blocks, and decap distribution.

电源门控是一种通过将空闲模块与电网断开来有效降低泄漏功率的技术。当门控模块被唤醒时，在短时间内会产生大量的开关电流，这可能会对输电网产生严重的噪声。在本文中，我们提出了一种基于遗传算法的考虑功率/地噪声的功率门控调度方法。我们介绍了一种基于仿真的方法来准确有效地估计最坏情况下的噪声，同时考虑了所有电流源、电感和电容的影响。我们还提出了一种考虑decap结构动态变化的增量调度方法。实验结果表明，通过在时间约束下优化调度唤醒顺序，与同时唤醒门控块相比，我们的技术可以将噪声降低50%。结果的质量取决于总的唤醒时间约束、门控块的位置、门控块的电流密度和decap分布。

引用次数: 44

Multicore design is the challenge! What is the solution? 多核设计是个挑战!解决方案是什么?

2008 45th ACM/IEEE Design Automation Conference

Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391504

Eshel Haritan, T. Hattori, Hiroyuki Yagi, P. Paulin, W. Wolf, A. Nohl, D. Wingard, M. Muller

Multi Processor SoC (MPSoC) are being designed today. MPSoC design can help achieve aggressive performance and low power targets but it creates new design challenges: How to design the interconnect fabric and memory sub-system to allow the massive data movement required in a multi processor SoC environment? How to develop, debug and verify HW and SW functionality in a MPSoC design? Is MPSoC design an inflection point that will require new design methods including ESL methodologies?

今天正在设计多处理器SoC (MPSoC)。MPSoC设计可以帮助实现更高的性能和低功耗目标，但它也带来了新的设计挑战:如何设计互连结构和内存子系统，以允许在多处理器SoC环境中所需的大量数据移动?如何在MPSoC设计中开发、调试和验证硬件和软件功能?MPSoC设计是否成为一个拐点，需要包括ESL方法在内的新设计方法?

引用次数: 13

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2008 45th ACM/IEEE Design Automation Conference

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀