Performance model of the Argonne Voyager multimedia server
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606837
T. Disz, R. Olson, R. Stevens
The Argonne Voyager Multimedia Server is being developed in the Futures Lab of the Mathematics and Computer Science Division at Argonne National Laboratory. As a network-based service for recording and playing multimedia streams, the Voyager system must sustain certain minimal levels of performance in order to be viable. In this article, we examine the performance characteristics of the server: as we walk through the system architecture, we determine where bottlenecks lie, compare actual with potential performance, and recommend areas for improvement through custom architectures and system tuning.
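The abstract does not reproduce the performance model itself. As a rough, hypothetical illustration of the kind of bottleneck reasoning it refers to, the sketch below estimates how many concurrent streams a server can sustain as the minimum over its resource stages; the stage names, bandwidths, and the 1.5 Mbps stream rate are made-up numbers, not figures from the paper.

```python
# Hypothetical back-of-the-envelope bottleneck model (not the paper's): the number of
# concurrent streams a server can sustain is limited by its slowest resource stage.
def max_streams(stream_mbps, stage_capacity_mbps):
    """stream_mbps: bandwidth of one stream; stage_capacity_mbps: stage name -> capacity."""
    per_stage = {name: cap / stream_mbps for name, cap in stage_capacity_mbps.items()}
    bottleneck = min(per_stage, key=per_stage.get)
    return per_stage[bottleneck], bottleneck

# Made-up numbers: 1.5 Mbps streams through disk, I/O bus, and network stages.
streams, stage = max_streams(1.5, {"disk": 40.0, "io_bus": 132.0, "network": 100.0})
print(f"~{int(streams)} concurrent streams, limited by the {stage} stage")
```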
{"title":"Performance model of the Argonne Voyager multimedia server","authors":"T. Disz, R. Olson, R. Stevens","doi":"10.1109/ASAP.1997.606837","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606837","url":null,"abstract":"The Argonne Voyager Multimedia Server is being developed in the Futures Lab of the Mathematics and Computer Science Division at Argonne National Laboratory. As a network based service for recording and playing multimedia streams, it is important that the Voyager system be capable of sustaining certain minimal levels of performance in order for it to be a viable system. In this article, we examine the performance characteristics of the server. As we examine the architecture of the system, we try to determine where bottlenecks lie, show actual vs potential performance, and recommend areas for improvement through custom architectures and system tuning.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132185376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scheduling in co-partitioned array architectures
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606828
U. Eckhardt, R. Merker
We consider a balanced combination of the known LPGS and LSGP partitioning techniques, which we call co-partitioning. This approach allows a structural adjustment of the array design as well as a balancing of the local memory size and the I/O demand between the processing elements of the co-partitioned array. We determine the size of the LSGP partitions such that there exists a sequential scheduling within the LSGP partitions that is free of wait states. We prove the existence of such a scheduling and give explicit formulas for the lower and upper bounds of the loops of a for-loop program representing one of the possible sequential schedulings.
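The paper's loop-bound formulas are not reproduced in the abstract. The sketch below only illustrates, for an assumed one-dimensional iteration space and arbitrary tile sizes, the loop nesting that combining LPGS and LSGP implies: an outer sequential walk over LPGS blocks, a (conceptually parallel) loop over the processing elements, and an inner sequential walk over each PE's LSGP tile. The tile shapes and sizes are hypothetical, not the partition sizes derived in the paper.

```python
# Illustrative traversal of a co-partitioned 1-D iteration space (hypothetical sizes).
N = 24             # iteration-space size
LSGP = 3           # points processed sequentially by one PE (bounds its local memory)
PES = 2            # processing elements working in parallel (array size)
LPGS = LSGP * PES  # points covered by one pass of the whole array

for block in range(N // LPGS):       # LPGS level: blocks visited sequentially in time
    for pe in range(PES):            # array level: PEs run concurrently in hardware
        for k in range(LSGP):        # LSGP level: each PE scans its own tile sequentially
            i = block * LPGS + pe * LSGP + k
            print(f"time-block {block}, PE {pe}, local step {k} -> iteration {i}")
```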
{"title":"Scheduling in co-partitioned array architectures","authors":"U. Eckhardt, R. Merker","doi":"10.1109/ASAP.1997.606828","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606828","url":null,"abstract":"We consider a balanced combined application of the known LPGS- and LSGP-partitioning which we call co-partitioning. This approach allows a structural adjustment of the array design as well as a balancing of the size of the local memory and the IO-demand between the processing elements of the co-partitioned array. We determine the size of the LSGP-partitions such that there exists a sequential scheduling within the LSGP-partitions which is free of wait states. We give the proof for the existence of such a scheduling, and we give explicit formulas for the lower and upper bounds of the loops of a for-loop program which represents one of the possible sequential schedulings.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115104869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Optimized software synthesis for synchronous dataflow
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606831
S. Bhattacharyya, P. Murthy, Edward A. Lee
This paper reviews a set of techniques for compiling dataflow-based, graphical programs for digital signal processing (DSP) applications into efficient implementations on programmable digital signal processors. This is a critical problem because programmable digital signal processors have very limited amounts of on-chip memory, and the speed, power, and financial cost penalties for using off-chip memory are often prohibitively high for the types of applications, typically embedded systems, in which these processors are used. The compilation techniques described in this paper are developed for the synchronous dataflow model of computation, a model that has found widespread use for specifying and prototyping DSP systems.
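As background (this is standard synchronous dataflow theory, not the specific synthesis techniques surveyed in the paper): each SDF actor produces and consumes fixed token counts per firing, so a periodic schedule must satisfy the balance equation prod(A) * q(A) = cons(B) * q(B) on every edge A -> B, where q is the repetitions vector. The two-actor graph below is a made-up example.

```python
from math import gcd

# Toy SDF graph: actor A produces 3 tokens per firing on its output edge,
# actor B consumes 2 tokens per firing from that edge.
prod_A, cons_B = 3, 2

# Balance equation: prod_A * q_A == cons_B * q_B. The smallest positive integer
# solution is the repetitions vector of the graph.
g = gcd(prod_A, cons_B)
q_A, q_B = cons_B // g, prod_A // g
assert prod_A * q_A == cons_B * q_B
print(f"repetitions vector: q(A) = {q_A}, q(B) = {q_B}")   # q(A) = 2, q(B) = 3

# One valid single-appearance looped schedule for this graph is (2 A)(3 B):
# fire A twice (producing 6 tokens), then fire B three times (consuming 6), per period.
```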
{"title":"Optimized software synthesis for synchronous dataflow","authors":"S. Bhattacharyya, P. Murthy, Edward A. Lee","doi":"10.1109/ASAP.1997.606831","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606831","url":null,"abstract":"This paper reviews a set of techniques for compiling dataflow-based, graphical programs for digital signal processing (DSP) applications into efficient implementations on programmable digital signal processors. This is a critical problem because programmable digital signal processors have very limited amounts of on-chip memory and the speed power, and financial cost penalties for using off-chip memory are often prohibitively high for the types of applications, typically embedded systems, in which these processors are used. The compilation techniques described in this paper are developed for the synchronous dataflow model of computation, a model that has found widespread use for specifying and prototyping DSP systems.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132799774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A massively parallel implementation of the watershed based on cellular automata
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606811
Dominique Soguet
The watershed transform is a very powerful segmentation tool that comes directly from the idea of a watershed line in geohydrology. It has proved its efficiency in many computer vision application fields. This paper presents a new implementation of the watershed that is optimal with respect to computation time. The flooding algorithm is first recalled. Then, a massively parallel cellular automaton is proposed to propagate data using this approach. We discuss the pros and cons of a hardware implementation and give an example of application. A comparison between the results obtained and theoretical limit cases is also presented.
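The paper's automaton is not reproduced in the abstract. As a much-simplified illustration of watershed-style flooding by synchronous local updates, the toy sketch below lets every unlabelled pixel copy the label of its lowest already-labelled neighbour and iterates until nothing changes; a real watershed implementation must also handle plateaus and ties, which this version ignores.

```python
# Toy cellular-automaton flooding on a 1-D "image": labels spread outward from the
# regional minima, each cell copying the label of its lowest labelled neighbour.
height = [3, 2, 1, 2, 4, 2, 1, 2, 3]        # hypothetical grey levels
label = [0] * len(height)                    # 0 = unlabelled
label[2], label[6] = 1, 2                    # seed the two regional minima

changed = True
while changed:                               # synchronous CA iterations
    changed, new = False, label[:]
    for i, l in enumerate(label):
        if l:                                # already labelled
            continue
        nbrs = [j for j in (i - 1, i + 1) if 0 <= j < len(height) and label[j]]
        if nbrs:
            new[i] = label[min(nbrs, key=lambda j: height[j])]
            changed = True
    label = new

print(label)   # [1, 1, 1, 1, 1, 2, 2, 2, 2]: the crest cell falls to basin 1 by tie-breaking
```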
{"title":"A massively parallel implementation of the watershed based on cellular automata","authors":"Dominique Soguet","doi":"10.1109/ASAP.1997.606811","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606811","url":null,"abstract":"The watershed transform is a very powerful segmentation tool which comes directly from the idea of watershed line in geohydrology. It has proved its efficiency in many computer vision application fields. This paper presents a new implementation of the watershed which is optimal according to computation time. The flooding algorithm is reminded. Then, a massively parallel cellular automaton is proposed to propagate data using this approach. We discuss the pros and cons of a hardware implementation and give an example of application. A comparison between the results obtained and theoretical limit cases is also presented.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133528018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A FPGA-based implementation of an intravenous infusion controller system
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606845
C. Araujo, M. V. Santos, E. Barros
In this paper we present the development and implementation of an intravenous infusion controller system based on FPGAs. The system receives information from an infusion drop sensor and controls the drop flow by setting the direction and number of steps of a stepper motor, which compresses the drip-feed hose. The system is a mixed implementation of software and hardware: the software was implemented in C++ and the hardware was implemented using FPGAs.
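The abstract does not give the control law. The sketch below is only a hypothetical illustration of the kind of loop such a controller runs: compare the measured drop rate against the prescribed one and command the stepper motor a number of steps in the opening or closing direction. The function name, the proportional law, and all gains are assumptions, not the paper's design.

```python
# Hypothetical proportional control step for an infusion controller:
# drop-sensor reading in, stepper-motor direction and step count out.
def control_step(measured_drops_per_min, target_drops_per_min,
                 steps_per_drop_per_min=2, max_steps=50):
    error = target_drops_per_min - measured_drops_per_min
    steps = min(abs(round(error * steps_per_drop_per_min)), max_steps)
    direction = "open" if error > 0 else "close"   # "open" = relax the hose clamp
    return direction, steps

print(control_step(measured_drops_per_min=18, target_drops_per_min=30))
# ('open', 24): flow is too slow, so relax the clamp by 24 motor steps
```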
{"title":"A FPGA-based implementation of an intravenous infusion controller system","authors":"C. Araujo, M. V. Santos, E. Barros","doi":"10.1109/ASAP.1997.606845","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606845","url":null,"abstract":"In this paper we present the development and implementation of an intravenous infusion controller system based on FPGA's. The system receives information of an infusion drop sensor and controls the drop flow by giving the direction and number of steps of a stepper motor, which compress the drip-feed hose. The system consists of a mixed implementation of software and hardware. The software was implemented in C++ and the hardware was implemented by using FPGA's.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123505272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On core and more: a design perspective for systems-on-a-chip
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606850
S. Pees, M. Vaupel, V. Zivojnovic, H. Meyr
This survey identifies key drivers in design methodology that enable successful design of systems-on-a-chip for the highly competitive telecommunications market. It describes the main components of a design environment that fulfill the requirements of today's system design: efficient verification by means of fast simulation, integration of intellectual property, support of HW/SW co-design by means of a generic machine description language, generation of dedicated hardware blocks for high-speed applications, and the link from system-level performance evaluation to implementations in hardware and software.
{"title":"On core and more: a design perspective for systems-on-a-chip","authors":"S. Pees, M. Vaupel, V. Zivojnovic, H. Meyr","doi":"10.1109/ASAP.1997.606850","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606850","url":null,"abstract":"In this survey, key drivers in design methodology are provided that enable successful design of systems-on-a-chip for the highly competitive telecommunications market. Main components of a design environment are described that fulfill the requirements of today's system design: efficient verification by means of fast simulation, integration of intellectual property, support of HW/SW co-design by means of a generic machine description language, generation of dedicated hardware blocks for high speed applications, and the link from system level performance evaluation to implementations in hardware and software.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127959542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An efficient video decoder design for MPEG-2 MP@ML
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606856
Jui-Hua Li, N. Ling
In this paper, we present an efficient MPEG-2 video decoder architecture design that meets the MP@ML real-time decoding requirement. We discuss the overall architecture as well as the design of the major function-specific processing blocks, such as the variable-length decoder, the inverse 2-D discrete cosine transform unit, and the motion compensation unit. A hierarchical and distributed controller approach is used, a bus-monitoring model for different bus arbitration schemes to control external DRAM accesses is developed, and the system is simulated. Practical issues and buffer sizes are addressed. With a 27 MHz clock, our architecture uses far fewer than the 667 cycles (the upper bound for the MP@ML decoding requirement) to decode each macroblock with a single external bus and DRAM.
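The 667-cycle figure follows from the MP@ML worst-case macroblock rate: MP@ML allows at most 40500 macroblocks per second (e.g. 720x576 at 25 fps, or equivalently 720x480 at 30 fps). The short calculation below reconstructs the per-macroblock cycle budget at a 27 MHz clock.

```python
# Cycle budget per macroblock for real-time MP@ML decoding at a 27 MHz clock.
clock_hz = 27_000_000
mb_per_frame = (720 // 16) * (576 // 16)   # 45 x 36 = 1620 macroblocks per frame
mb_per_second = mb_per_frame * 25          # 40500 MB/s (same as 720x480 @ 30 fps)
print(clock_hz / mb_per_second)            # ~666.7 -> the 667-cycle upper bound
```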
{"title":"An efficient video decoder design for MPEG-2 MP@ML","authors":"Jui-Hua Li, N. Ling","doi":"10.1109/ASAP.1997.606856","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606856","url":null,"abstract":"In this paper, we present an efficient MPEG-2 video decoder architecture design to meet MP@ML real-time decoding requirement. The overall architecture, as well as the design of the major function-specific processing blocks, such as the variable-length decoder, the inverse 2-D discrete cosine transform unit, and the motion compensation unit, are discussed. A hierarchical and distributed controller approach is used and a bus-monitoring model for different bus arbitration schemes to control external DRAM accesses is developed and the system is simulated. Practical issues and buffer sizes are addressed. With a 27 MHz clock, our architecture uses much fewer than the 667 cycles, upper bond for the MP@ML decoding requirement, to decode each macroblock with a single external bus and DRAM.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115762062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
PART: a partitioning tool for efficient use of distributed systems
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606838
Jing Chen, V. Taylor
The interconnection of geographically distributed supercomputers via high-speed networks allows users to access the needed compute power for large-scale, complex applications. For efficient use of such systems, the variance in processor performance and network (i.e., interconnection network versus wide area network) performance must be considered. In this paper, we present a decomposition tool, called PART, for distributed systems. PART takes into consideration the variance in performance of the networks and processors as well as the computational complexity of the application. This is achieved via the parameters used in the objective function of simulated annealing. The initial version of PART focuses on finite element based problems. The results of using PART demonstrate a 30% reduction in execution time as compared to using conventional schemes that partition the problem domain into equal-sized subdomains.
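PART's actual objective function is not given in the abstract. As an illustration of the general idea of weighting each partition's load by its processor's speed and each cut edge by the link it crosses, here is a minimal simulated-annealing sketch; the cost terms, weights, cooling schedule, and toy inputs are assumptions, not PART's.

```python
import math
import random

# Minimal heterogeneity-aware mesh partitioning by simulated annealing (illustrative only).
# nodes: compute weight per element; edges: (u, v, comm_weight); speed: relative speed of the
# processor owning each partition; link[a][b]: bandwidth factor between partitions a < b.
def anneal(nodes, edges, speed, link, steps=20000, t0=1.0, cooling=0.9995, seed=0):
    rng = random.Random(seed)
    part = [rng.randrange(len(speed)) for _ in nodes]

    def cost(p):
        load = [0.0] * len(speed)                    # per-processor compute time estimate
        for n, w in enumerate(nodes):
            load[p[n]] += w / speed[p[n]]
        comm = sum(c / link[min(p[u], p[v])][max(p[u], p[v])]   # cut edges, scaled by link speed
                   for u, v, c in edges if p[u] != p[v])
        return max(load) + comm

    temp, cur = t0, cost(part)
    for _ in range(steps):
        n = rng.randrange(len(nodes))
        old = part[n]
        part[n] = rng.randrange(len(speed))          # propose moving one element
        new = cost(part)
        if new <= cur or rng.random() < math.exp(-(new - cur) / temp):
            cur = new                                # accept the move
        else:
            part[n] = old                            # reject and restore
        temp *= cooling
    return part, cur

# Toy run: a 6-element chain, two fast local processors and one slower remote processor
# reachable only over a slow wide-area link.
nodes = [1.0] * 6
edges = [(i, i + 1, 1.0) for i in range(5)]
speed = [1.0, 1.0, 0.5]
link = {0: {1: 10.0, 2: 1.0}, 1: {2: 1.0}}
print(anneal(nodes, edges, speed, link))
```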
{"title":"PART: a partitioning tool for efficient use of distributed systems","authors":"Jing Chen, V. Taylor","doi":"10.1109/ASAP.1997.606838","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606838","url":null,"abstract":"The interconnection of geographically distributed supercomputers via high-speed networks allows users to access the needed compute power for large-scale, complex applications. For efficient use of such systems, the variance in processor performance and network (i.e., interconnection network versus wide area network) performance must be considered. In this paper, we present a decomposition tool, called PART, for distributed systems. PART takes into consideration the variance in performance of the networks and processors as well as the computational complexity of the application. This is achieved via the parameters used in the objective function of simulated annealing. The initial version of PART focuses on finite element based problems. The results of using PART demonstrate a 30% reduction in execution time as compared to using conventional schemes that partition the problem domain into equal-sized subdomains.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114469698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Implementation of orthogonal wavelet transforms and their applications
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606854
P. Rieder, J. Nossek
In this paper, the efficient implementation of different types of orthogonal wavelet transforms with respect to practical applications is discussed. Orthogonal single-wavelet transforms, based on one scaling function and one wavelet function, are used for denoising signals. Orthogonal multiwavelets are based on several scaling functions and several wavelets. Since they allow properties such as regularity, orthogonality, and symmetry that cannot be combined in the single-wavelet case, multiwavelets are well-suited bases for image compression applications. For an efficient implementation of these orthogonal wavelet transforms, approximating the exact rotation angles of the corresponding orthogonal wavelet lattice filters by very few CORDIC-based elementary rotations significantly reduces the number of shift-and-add operations. The performance of the resulting, computationally cheap, approximated wavelet transforms in practical applications is discussed.
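The specific angle approximations used in the paper are not given in the abstract. The sketch below only illustrates the standard CORDIC idea: a target rotation angle is greedily decomposed into a few elementary angles arctan(2^-i), each of which can then be applied with shifts and adds only. The choice of three micro-rotations and the target angle are arbitrary.

```python
import math

# Greedy approximation of a rotation angle by a few CORDIC elementary angles arctan(2^-i).
def cordic_angles(target, num_rotations=3, max_shift=8):
    chosen, residual = [], target
    for _ in range(num_rotations):
        # pick the signed elementary angle that best reduces the remaining residual
        s, i = min(((s, i) for s in (+1, -1) for i in range(max_shift + 1)),
                   key=lambda si: abs(residual - si[0] * math.atan(2.0 ** -si[1])))
        chosen.append((s, i))
        residual -= s * math.atan(2.0 ** -i)
    return chosen, residual

# Example: approximate 58.28 degrees (a hypothetical lattice-filter angle) with 3 micro-rotations.
rotations, err = cordic_angles(math.radians(58.28))
print(rotations, math.degrees(err))
# Each (sign, i) pair corresponds to the shift-and-add update
#   x' = x - sign * (y >> i),   y' = y + sign * (x >> i)
# (up to the known CORDIC scale factor), so the rotation costs only a few adds and shifts.
```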
{"title":"Implementation of orthogonal wavelet transforms and their applications","authors":"P. Rieder, J. Nossek","doi":"10.1109/ASAP.1997.606854","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606854","url":null,"abstract":"In this paper the efficient implementation of different types of orthogonal wavelet transforms with respect to practical applications is discussed. Orthogonal single-wavelet transforms being based on one scaling function and one wavelet function are used for denosing of signals. Orthogonal multiwavelets are based on several scaling functions and several wavelets. Since they allow properties like regularity, orthogonality and symmetry being impossible in the single-wavelet case, multiwavelets are well suited bases for image compression applications. With respect to an efficient implementation of these orthogonal wavelet transforms approximating the exact rotation angles of the corresponding orthogonal wavelet lattice filters by using very few CORDIC-based elementary rotations reduces the number of shift and add operations significantly. The performance of the resulting, computationally cheap, approximated wavelet transforms with respect to practical applications is discussed in this paper.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115861214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On computing with locally-interconnected architectures in atomic/nanoelectronic systems
Pub Date: 1997-07-14 | DOI: 10.1109/ASAP.1997.606808
V. Roychowdhury, M. Anantram
The past decade has seen tremendous experimental and theoretical progress in the field of mesoscopic devices and molecular self-assembly techniques, leading to laboratory demonstrations of many new device concepts. While these studies have been important from a fundamental physics perspective, many have recognized that they may offer new insights into building a future generation of computing machines. This has recently led to a number of proposals for computing machines that use these novel device concepts. In this paper, we explain the physical principles behind the operation of one of these proposals, namely the ground-state computing model. These computational models share some of the characteristics of the well-known systolic-type processor arrays, namely spatial locality and functional uniformity. In particular, we study the effect of metastable states on the relaxation process (and hence information propagation) in locally coupled and boundary-driven structures. We first give a general argument to show that metastable states are inevitable even in the simplest of structures, a wire. At finite temperatures, the relaxation mechanism is a thermally assisted random walk. The time required to reach the ground state and its lifetime are determined by the coupling parameters. These time scales are studied in a model based on an array of quantum dots.
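The quantum-dot array model itself is not reproduced in the abstract. As a generic illustration of thermally assisted escape from a metastable state, the toy sketch below runs a Metropolis random walk on a one-dimensional energy landscape with a local (metastable) minimum separated from the ground state by a barrier, and shows how the relaxation time grows as the temperature drops. The landscape and all parameters are arbitrary, not the paper's model.

```python
import math
import random

# Toy thermally assisted relaxation: a Metropolis random walk on a 1-D energy landscape
# with a shallow local minimum at x = 2 (the metastable state) and the global minimum
# (ground state) at x = 8, separated by a barrier around x = 4.
energy = [3.0, 1.0, 0.5, 1.5, 2.5, 1.8, 1.0, 0.4, 0.0, 0.5]   # arbitrary landscape

def escape_time(temperature, start=2, ground=8, seed=0, max_steps=1_000_000):
    rng = random.Random(seed)
    x = start
    for step in range(1, max_steps + 1):
        nxt = x + rng.choice((-1, 1))
        if 0 <= nxt < len(energy):
            dE = energy[nxt] - energy[x]
            if dE <= 0 or rng.random() < math.exp(-dE / temperature):
                x = nxt                      # thermally assisted (uphill) moves are rare at low T
        if x == ground:
            return step
    return max_steps

for T in (1.0, 0.5, 0.25):
    print(f"T = {T}: reached the ground state after ~{escape_time(T)} steps")
# Escape from the metastable minimum requires climbing the barrier, so the relaxation
# time grows rapidly (roughly Arrhenius-like) as the temperature is lowered.
```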
{"title":"On computing with locally-interconnected architectures in atomic/nanoelectronic systems","authors":"V. Roychowdhury, M. Anantram","doi":"10.1109/ASAP.1997.606808","DOIUrl":"https://doi.org/10.1109/ASAP.1997.606808","url":null,"abstract":"The past decade has seen tremendous experimental and theoretical progress in the field of mesoscopic devices and molecular self assembly techniques, leading to laboratory demonstration of many new device concepts. While these studies have been important from a fundamental physics perspective, it has been recognized by many that they may offer new insights into building a future generation of computing machines. This has recently led to a number of proposals for computing machines which use these new and novel device concepts. In this paper, we explain the physical principles behind the operation of one of these proposals, namely the ground state computing model. These computational models share some of the characteristics of the well-known systolic type processor arrays, namely spatial locality, and functional uniformity. In particular, we study the effect of metastable states on the relaxation process (and hence information propagation) in locally coupled and boundary-driven structures. We first give a general argument to show that metastable states are inevitable even in the simplest of structures, a wire. At finite temperatures, the relaxation mechanism is a thermally assisted random walk. The time required to reach the ground state and its life time are determined by the coupling parameters. These time scales are studied in a model based on an array of quantum dots.","PeriodicalId":368315,"journal":{"name":"Proceedings IEEE International Conference on Application-Specific Systems, Architectures and Processors","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124922891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}