Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools最新文献

英文中文

Evolutionary algorithm for state assignment of finite state machines 有限状态机状态分配的进化算法

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115392

Mariusz Chyzy, W. Kosinski

The paper proposes an evolutionary algorithm (EA) for the state assignment problem (SAP). Two original crossover operators are presented. They are experimentally compared with other known crossovers for SAP using a set of benchmark finite state machines. Solutions generated by EA (using different crossover operators) are compared with the random ones and with the state assignments generated by the MAX+PLUS II system. Experimental results show that the solutions found by EA are significantly better (up to 55%) than those from MAX+PLUS II. Moreover, EA equipped with proposed crossover operators found better results than those obtained with the use of other compared crossovers.

提出了一种状态分配问题的进化算法(EA)。提出了两种原始的交叉算子。使用一组基准有限状态机，将它们与其他已知的SAP交叉进行实验比较。将EA生成的解(使用不同的交叉算子)与随机解以及MAX+PLUS II系统生成的状态分配进行了比较。实验结果表明，EA找到的解明显优于MAX+PLUS II的解(高达55%)。此外，配备所提出的交叉算子的EA比使用其他比较交叉算子获得更好的结果。

引用次数: 16

Improving the operation autonomy of SIMD processing elements by using guarded instructions and pseudo branches 采用保护指令和伪分支，提高了SIMD处理单元的操作自主性

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115363

M. L. Anido, A. Paar, N. Bagherzadeh

This paper presents a novel method for improving the operation autonomy of the processing elements (PE) of SIMD-like machines. By combining guarded instructions and pseudo branches it is possible to achieve higher operation autonomy and higher instruction level parallelism than in previous SIMD/ASIMD architectures. The paper shows that it is feasible to avoid most branches and it is also possible to emulate conditional execution on the processing elements, either by using guarded instructions or by using pseudo branches, thus avoiding unnecessary intervention by the array control unit in data-dependant computations. Pseudo branches are used when it is not possible to use guarded instructions. Additionally, they also support the implementation of complex nested if-then-else constructs, improving the execution of irregular dataparallel applications. The paper also shows that the simplicity of the method allows it to be implemented both in fine-grain and coarse-grain SIMD/ASIMD architectures because it does not require significant additional silicon area. Finally, it is shown that pseudo branches can be used to control the power saving of those processing elements that have instructions nullified.

提出了一种提高类simd机械加工单元操作自主性的新方法。通过结合保护指令和伪分支，可以实现比以前的SIMD/ASIMD体系结构更高的操作自主性和更高的指令级并行性。本文表明，避免大多数分支是可行的，也可以通过使用保护指令或使用伪分支在处理单元上模拟条件执行，从而避免阵列控制单元对数据依赖计算的不必要干预。当不可能使用受保护指令时，使用伪分支。此外，它们还支持复杂的嵌套if-then-else结构的实现，从而改进不规则数据并行应用程序的执行。本文还表明，该方法的简单性使得它可以在细颗粒和粗颗粒SIMD/ASIMD体系结构中实现，因为它不需要显着的额外硅面积。最后，证明了伪分支可以用来控制那些指令无效的处理元素的省电。

{"title":"Improving the operation autonomy of SIMD processing elements by using guarded instructions and pseudo branches","authors":"M. L. Anido, A. Paar, N. Bagherzadeh","doi":"10.1109/DSD.2002.1115363","DOIUrl":"https://doi.org/10.1109/DSD.2002.1115363","url":null,"abstract":"This paper presents a novel method for improving the operation autonomy of the processing elements (PE) of SIMD-like machines. By combining guarded instructions and pseudo branches it is possible to achieve higher operation autonomy and higher instruction level parallelism than in previous SIMD/ASIMD architectures. The paper shows that it is feasible to avoid most branches and it is also possible to emulate conditional execution on the processing elements, either by using guarded instructions or by using pseudo branches, thus avoiding unnecessary intervention by the array control unit in data-dependant computations. Pseudo branches are used when it is not possible to use guarded instructions. Additionally, they also support the implementation of complex nested if-then-else constructs, improving the execution of irregular dataparallel applications. The paper also shows that the simplicity of the method allows it to be implemented both in fine-grain and coarse-grain SIMD/ASIMD architectures because it does not require significant additional silicon area. Finally, it is shown that pseudo branches can be used to control the power saving of those processing elements that have instructions nullified.","PeriodicalId":330609,"journal":{"name":"Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools","volume":"103 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122603077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 26

A design for a low-power digital matched filter applicable to W-CDMA 一种适用于W-CDMA的低功耗数字匹配滤波器设计

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115371

S. Goto, Takashi Yamada, Norihisa Takayarna, Yoshifumi Matsushita, Y. Harada, H. Yasuura

This paper presents a design for a low-power digital matched filter (DMF) applicable to Wideband-Code Division Multiple Access (W-CDMA), which is a Direct-Sequence Spread-Spectrum (DS-SS) communication system. The proposed architectural approach to reducing the power consumption focuses on the reception registers and the correlation-calculating unit (CCU), which dissipate the majority of the power in a DMF The main features are asynchronous latch clock generation for the reception registers, parallelism of the correlation calculation operations and bit manipulation for chip-correlation operations. A DMF is designed in compliance with the W-CDMA specifications incorporating the proposed techniques, and its properties are evaluated by computer simulations at the gate level using 0.18-/spl mu/m CMOS standard cell array technology. The results of the simulations show a power consumption of 9.3 mW (@15.6MHz, 1.6V), which is only about 30% of the power consumption of conventional DMFs.

本文设计了一种适用于宽带码分多址(W-CDMA)通信系统的低功耗数字匹配滤波器(DMF)。所提出的降低功耗的架构方法集中在接收寄存器和相关计算单元(CCU)上，它们消耗了DMF的大部分功率，其主要特征是接收寄存器的异步锁存时钟生成，相关计算操作的并行性以及芯片相关操作的位操作。采用所提出的技术，设计了符合W-CDMA规范的DMF，并使用0.18-/spl μ m CMOS标准单元阵列技术在栅极级进行了计算机模拟，评估了DMF的性能。仿真结果显示，功耗为9.3 mW (@15.6MHz, 1.6V)，仅为传统dmf功耗的30%左右。

引用次数: 9

Parallel multimedia processor using customised Infineon TriCores 采用定制英飞凌TriCores的并行多媒体处理器

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115362

Ari Wahyudi, A. Omondi

This paper reports on our experiments on using the Infineon TriCore as a building block for a multimedia processor. The experiments aim to obtain a high performance processor using two strategies: integrating multimedia units into the TriCore CPU and constructing the TriCore in multiprocessor configuration. The design and implementation of the multimedia units for video, audio, and text compressions are discussed. Two hardware architectures for IMA ADPCM audio compression multimedia unit were designed: direct architecture and sequential architecture. The multimedia unit for text compression is based on a modification from another design; our design uses a more efficient timing operation and has a better hardware utilization than the original design. Two algorithms for parallel motion-estimation were implemented on the multiple TriCore system. The results show that the TriCore is a good building block for a multiprocessor system.

本文报告了我们使用英飞凌TriCore作为多媒体处理器构建块的实验。实验的目的是采用两种策略:将多媒体单元集成到TriCore CPU中，并在多处理器配置中构建TriCore处理器。讨论了用于视频、音频和文本压缩的多媒体单元的设计和实现。设计了IMA ADPCM音频压缩多媒体单元的两种硬件结构:直接结构和顺序结构。用于文本压缩的多媒体单元是基于另一种设计的修改;与原设计相比，本设计采用了更高效的定时运算，硬件利用率更高。在多TriCore系统上实现了两种并行运动估计算法。结果表明，TriCore是一个很好的多处理器系统构建模块。

引用次数: 1

Use of the autocorrelation function in the classification of switching functions 自相关函数在开关函数分类中的应用

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115375

J. Rice, J. Muzio

Four operations on switching functions are used to define a classification technique based on the autocorrelation function. The relationship between these classes and the well-known spectral classes is investigated, and a canonical representative for each class is proposed. It is thought that these classes will be of use in logic synthesis employing decision diagram representations for intermediate steps.

通过对切换函数的四种操作，定义了一种基于自相关函数的分类技术。研究了这些类与已知谱类之间的关系，并给出了每个类的典型代表。据认为，这些类将在逻辑综合中使用决策图表示中间步骤。

引用次数: 7

A flexible architecture for H.263 video coding H.263视频编码的灵活架构

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115353

M. Garrido, C. Sanz, Marcos Jiménez, J. Meneses

In this paper a very flexible and efficient architecture that implements the core of a video coder according to Rec. H.263 is presented. It consists of a RISC processor that controls the scheduling of a set of specialized processors for the transforms (DCT and IDCT), quantizers (DQ and IQ), motion estimation and motion compensation (ME/MC). The architecture also includes preprocessing modules for the input video signal from the camera and interfaces for the external video memory and the H.263 bit-stream generation. The architecture has been written in synthesizable Verilog and tested using standard video sequences. It has also been prototyped into a development system based on an FPGA and a RISC.

本文根据Rec. H.263标准，提出了一种灵活高效的视频编码器核心架构。它由一个RISC处理器组成，该处理器控制一组专用处理器的调度，用于变换(DCT和IDCT)，量化(DQ和IQ)，运动估计和运动补偿(ME/MC)。该体系结构还包括用于从摄像机输入视频信号的预处理模块和用于外部视频存储器和H.263比特流生成的接口。该架构已在可合成的Verilog中编写，并使用标准视频序列进行了测试。它还被原型化为基于FPGA和RISC的开发系统。

引用次数: 6

Integrated design and test generation under internet based environment MOSCITO 基于网络环境MOSCITO的集成设计与测试生成

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115368

A. Schneider, K. Diener, E. Ivask, R. Ubar, E. Gramatová, T. Hollstein, W. Kuzmicz, Zebo Peng

This paper describes an environment for internet-based collaboration in the field of design and test of digital systems. Automatic Test Pattern Generation (ATPG) and fault simulation tools at behavioral, logical and hierarchical levels available at geographically different places running under the virtual environment using the MOSCITO system are presented The interfaces between the integrated tools and also commercial design tools were developed. The tools can be used separately, or in multiple applications in different design and test flows. The functionality of the integrated design and test system was verified in several collaborative experiments over internet by partners locating in different geographical sites.

本文描述了数字系统设计和测试领域中基于internet的协作环境。提出了利用MOSCITO系统在虚拟环境下运行的不同地理位置的自动测试模式生成(ATPG)和故障仿真工具，并开发了集成工具与商业设计工具之间的接口。这些工具可以单独使用，也可以在不同设计和测试流程中的多个应用程序中使用。集成设计和测试系统的功能已由位于不同地理位置的合作伙伴在互联网上进行了几次协作实验。

引用次数: 1

Systems are made from transistors: UDSM technology creates new challenges for library and IC development 系统由晶体管构成:UDSM技术为库和集成电路的开发带来了新的挑战

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115344

Ulf Schlichtmann

The progress of silicon process technology relentlessly marches on. Moore's law still holds, the number of transistors that can be integrated on an IC doubles approximately every 18 months. The inability of system designs to keep up with this ever increasing number of available transistors has been debated for a long time, many solutions have been proposed. Now, as 130 nm processes enter volume production, 90 nm yields first engineering samples, and 65 nm processes are being developed, the design productivity crisis is exacerbated by the fact that very difficult design challenges are inherent in Ultra-Deep Submicron (UDSM) technologies. They threaten the approach of abstracting technological features away at higher levels, thus endangering design productivity even more. This paper outlines current challenges, presents approaches to address them and proposes further areas for research.

硅制程技术的进步日新月异。摩尔定律仍然成立，集成在集成电路上的晶体管数量大约每18个月翻一番。系统设计无法跟上可用晶体管数量不断增加的问题已经争论了很长时间，提出了许多解决方案。现在，随着130纳米工艺进入量产阶段，90纳米工艺产生第一批工程样品，65纳米工艺正在开发中，超深亚微米(UDSM)技术固有的非常困难的设计挑战加剧了设计生产力危机。它们威胁到在更高层次上抽象技术特征的方法，从而进一步危及设计生产力。本文概述了当前的挑战，提出了解决这些挑战的方法，并提出了进一步的研究领域。

引用次数: 2

Decision diagram optimization using copy properties 使用拷贝属性优化决策图

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115374

D. Jankovic, R. Stankovic, R. Drechsler

In this paper, we propose an approach to the reduction of sizes of multi-terminal binary decision diagrams (MTBDDs) by using the copy properties of discrete functions. The underlying principles come from copy theory of discrete signals considered previously. We propose two modifications of MTBDDs, called copy DDs (CDDs) and half copy DDs (HCDDs), using the corresponding copy operations from copy theory. Functions having different types of copy properties can be efficiently represented by the proposed Copy DDs. Examples are Walsh and Reed-Muller functions as well as different binary codes.

本文提出了一种利用离散函数的复制特性减小多端二元决策图(mtbdd)尺寸的方法。其基本原理来自先前考虑的离散信号的复制理论。我们利用拷贝理论中相应的拷贝操作，对mtbdd进行了两种修改，分别称为拷贝dd (copy dd)和半拷贝dd (half copy dd)。具有不同类型复制属性的函数可以被建议的复制光盘有效地表示。例如Walsh和Reed-Muller函数以及不同的二进制代码。

引用次数: 4

Recursive bi-partitioning of netlists for large number of partitions 针对大量分区的网络列表的递归双分区

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

Pub Date : 2002-09-04 DOI: 10.1109/DSD.2002.1115349

R. Drechsler, Wolfgang Günther, T. Eschbach, Lothar Linhard, Gerhard Angst

In many application in VLSI CAD, a given netlist has to be partitioned into smaller sub-designs which can be handled much better. In this paper we present a new recursive bi-partitioning algorithm that is especially applicable, if a large number of final partitions, e.g. more than 1000, has to be computed. The algorithm consists of two steps. Based on recursive splits the problem is divided into several sub-problems, but with increasing recursion depth more run time is invested. By this an initial solution is determined very fast. The core of the method is a second step, where a very powerful greedy algorithm is applied to refine the partitions. Experimental results are given that compare the new approach to state-of-the-art tools. The experiments show that the new approach outperforms the standard techniques with respect to run time and quality. Furthermore, the memory usage is very low and is reduced in comparison to other methods by more than a factor of four.

在VLSI CAD的许多应用中，给定的网表必须划分为更小的子设计，这样可以更好地处理。在本文中，我们提出了一种新的递归双分区算法，它特别适用于需要计算大量最终分区的情况，例如超过1000个。该算法分为两步。在递归分解的基础上，将问题分解成若干个子问题，但随着递归深度的增加，所投入的运行时间也会增加。通过这种方法，可以很快地确定初始解。该方法的核心是第二步，其中应用了一个非常强大的贪婪算法来细化分区。实验结果给出了比较新方法的最先进的工具。实验表明，新方法在运行时间和质量方面优于标准技术。此外，内存使用量非常低，与其他方法相比减少了四倍以上。

引用次数: 17

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings Euromicro Symposium on Digital System Design. Architectures, Methods and Tools

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀