Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)最新文献

英文中文

An ASIP design methodology for embedded systems 嵌入式系统的ASIP设计方法

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1109/HSC.1999.777384

K. Kucukcakar

A well-known challenge during processor design is to obtain the best possible results for a typical target application domain that is generally described as a set of benchmarks. Obtaining the best possible result in turn becomes a complex tradeoff between the generality of the processor and the physical characteristics. A custom instruction to perform a task can result in significant improvements for an application, but generally, at the expense of some overhead for all other applications. In the recent years, Application-Specific Instruction-Set Processors (ASIP) have gained popularity in production chips as well as in the research community. In this paper, we present a unique architecture and methodology to design ASIPs in the embedded controller domain by customizing an existing processor instruction set and architecture rather than creating an entirely new ASIP tuned to a benchmark.

在处理器设计期间，一个众所周知的挑战是为典型的目标应用程序域(通常被描述为一组基准测试)获得最佳结果。在处理器的通用性和物理特性之间进行复杂的权衡，从而获得可能的最佳结果。执行任务的自定义指令可以为应用程序带来显著的改进，但通常会为所有其他应用程序带来一些开销。近年来，专用指令集处理器(Application-Specific Instruction-Set processor, ASIP)在生产芯片和研究领域都得到了广泛的应用。在本文中，我们提出了一种独特的架构和方法，通过定制现有的处理器指令集和架构来设计嵌入式控制器领域的ASIP，而不是创建一个全新的ASIP来调整基准。

引用次数: 37

System synthesis utilizing a layered functional model 利用分层功能模型的系统综合

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1145/301177.301510

I. Sander, A. Jantsch

We propose a system synthesis method which bridges the gap between a highly abstract functional model and an efficient hardware or software implementation. The functional model is based on a formal semantics and the synchrony hypothesis. However, the use of skeletons in conjunction with a proper computational model structures the system description into three layers, the system layer, the skeleton layer, and the elementary layer. The synthesis process takes advantage of this structure and uses a different technique for each layer: (a) connection of components, and processes at the system layer; (b) template based generation of compound entities possibly containing state information, memory, and complex control at the skeleton layer; this layer also determines the communication and timing behaviour; (c) direct translation into combinatorial functions at the elementary layer. Thus, without compromising the formal properties of the abstract system model we provide an efficient synthesis method.

我们提出了一种系统综合方法，它在高度抽象的功能模型和高效的硬件或软件实现之间架起了桥梁。功能模型基于形式语义和同步假设。然而，将骨架与适当的计算模型结合使用，将系统描述分为三层:系统层、骨架层和基本层。合成过程利用了这种结构，并对每一层使用不同的技术:(a)在系统层连接组件和过程;(b)基于模板生成可能包含状态信息、内存和骨架层复杂控制的复合实体;这一层还决定了通信和定时行为;(c)在初等层直接转化为组合函数。因此，在不影响抽象系统模型的形式性质的情况下，我们提供了一种有效的综合方法。

引用次数: 14

Automatic detection of recurring operation patterns 自动检测重复的操作模式

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1145/301177.301192

M. Arnold, H. Corporaal

An important problem in the area of processor design for embedded systems is determining the proper instruction set architecture. Trade-offs have to be made between programmability and reusability of dedicated hardware for special functionality on the one hand, and a high performance dedicated instruction set on the other hand. This paper addresses the question of how to find specialized ISA extensions for a set of applications. We describe the application of a new pattern matching technique to the problem of the identification of recurring patterns of operations. By implementing frequently occurring operation patterns in hardware, and using this hardware as special function units, a fine-grained hardware/software partitioning can be found. The fine granularity, and the fact that patterns are taken from a number of different target applications rather than a single one, increase the opportunities for reuse of the special-purpose hardware. We illustrate our technique with experiments on a number of benchmarks from the DSP domain.

嵌入式系统处理器设计中的一个重要问题是确定合适的指令集体系结构。必须在专用硬件的可编程性和可重用性与高性能专用指令集之间进行权衡。本文解决了如何为一组应用程序找到专门的ISA扩展的问题。我们描述了一种新的模式匹配技术在识别操作重复模式问题上的应用。通过在硬件中实现频繁出现的操作模式，并将这些硬件用作特殊功能单元，可以找到细粒度的硬件/软件分区。精细的粒度，以及模式取自多个不同的目标应用程序而不是单个应用程序的事实，增加了重用专用硬件的机会。我们通过DSP领域的一些基准测试来说明我们的技术。

引用次数: 22

Power estimation for architectural exploration of HW/SW communication on system-level buses 系统级总线上软硬件通信体系结构探索的功率估计

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1109/HSC.1999.777411

W. Fornaciari, D. Sciuto, C. Silvano

The power consumption due to the HW/SW communication on system-level buses represents one of the major contributions to the overall power budget. A model to estimate the switching activity of the on-chip and off-chip buses at the system-level has been defined to evaluate the power dissipation and to compare the effectiveness of power optimization techniques. The paper aims at providing a framework for architectural exploration of a system design, focusing on the power consumption estimation of memory communication. Experimental results, conducted on bus streams generated by a real microprocessor and a stream generator, show how the variation of cache parameters and the introduction of bus encoding at the different levels on the memory hierarchy can affect the system power dissipation. Therefore, the proposed model can be effectively adopted to appropriately configure the memory hierarchy and the system bus architecture from the power standpoint.

系统级总线上的硬件/软件通信造成的功耗是总体功耗预算的主要贡献之一。定义了一个估计片内和片外总线在系统级的开关活动的模型，以评估功耗和比较功率优化技术的有效性。本文旨在为系统设计的架构探索提供一个框架，重点研究内存通信的功耗估计。在实际微处理器和流生成器生成的总线流上进行的实验结果表明，缓存参数的变化和在存储器层次的不同层次上引入总线编码对系统功耗的影响。因此，从功耗的角度出发，可以有效地采用该模型对存储器层次结构和系统总线体系结构进行适当的配置。

引用次数: 57

Development of an optimizing compiler for a Fujitsu fixed-point digital signal processor 富士通定点数字信号处理器优化编译器的开发

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1145/301177.301184

S. Rajan, M. Fujita, A. Sudarsanam, S. Malik

A common design methodology for embedded DSP systems is the integration of one or more digital signal processors (DSPs), program memory, and ASIC circuitry onto a single IC. Consequently, program memory size being limited, the criterion for optimality is that the embedded software must be very dense. We describe the development of an optimizing compiler, based on a retargetable compiler infrastructure, for the Fujitsu Elixir, a fixed-point DSP that is primarily used in cellular telephones. For small DSP benchmark programs (25-90 lines of C code), the average ratio of the size of compiler-generated code to the size of hand-written assembly code is 1.18. For a much larger program (more than 800 lines of C code), the ratio of the size of compiled code to the size of hand-written code is similar (1.14).

嵌入式DSP系统的常见设计方法是将一个或多个数字信号处理器(DSP)、程序存储器和ASIC电路集成到单个IC上。因此，程序存储器的大小受到限制，优化的标准是嵌入式软件必须非常密集。我们描述了一个优化编译器的开发，基于一个可重新定位的编译器基础设施，为富士通Elixir，一个定点DSP，主要用于移动电话。对于小型DSP基准程序(25-90行C代码)，编译器生成的代码大小与手工编写的汇编代码大小的平均比率为1.18。对于一个更大的程序(超过800行C代码)，编译代码的大小与手写代码的大小之比是相似的(1.14)。

引用次数: 8

Compiling Esterel into sequential code 编译Esterel成顺序代码

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1109/HSC.1999.777410

S. Edwards

This paper presents a novel compiler for Esterel, a concurrent synchronous imperative language. It generates fast, small object code by compiling away concurrency, producing a single C function requiring no operating system support for threads. It translates an Esterel program into an acyclic concurrent control-flow graph from which code is synthesized that runs instructions in an order respecting inter-thread communication. Exceptions and preemption constructs become conditional branches. Variables save control state; conditional branches restore it. Although designed for Esterel, this approach could be applied to compiling other synchronous concurrent languages.

本文提出了一种针对并发同步命令式语言Esterel的新型编译器。它通过编译消除并发性来生成快速、小的目标代码，生成一个不需要操作系统支持线程的C函数。它将一个Esterel程序转换成一个无循环并发控制流图，从中合成代码，以尊重线程间通信的顺序运行指令。异常和抢占结构成为条件分支。变量保存控制状态;条件分支恢复它。虽然这种方法是为Esterel设计的，但也可以应用于编译其他同步并发语言。

引用次数: 4

Iterative cache simulation of embedded CPUs with trace stripping 带跟踪剥离的嵌入式cpu迭代缓存仿真

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1145/301177.301496

Z. Wu, W. Wolf

Trace-driven cache simulation is a time-consuming yet valuable procedure for evaluating the performance of embedded memory systems. In this paper we present a novel technique, called iterative cache simulation, to produce a variety of performance metrics for several different cache configurations. Compared with previous work in this field, our approach has the following features. First, it supports a wide range of performance metrics, including miss ratio, write-back counts, bus traffic, et al. Second, unlike estimation-based methods, the results produced by our simulator are accurate. Third, our approach is flexible. It can simulate both uniprocessor and multiprocessor caches, with options of higher level caches, sub-block replacement and prefetching. Last, it is fast. Our simulation results show that it has similar runtime as the fastest one-pass cache simulator.

跟踪驱动的缓存模拟是一种耗时但有价值的过程，用于评估嵌入式存储系统的性能。在本文中，我们提出了一种称为迭代缓存模拟的新技术，用于为几种不同的缓存配置生成各种性能指标。与该领域以往的工作相比，我们的方法具有以下特点。首先，它支持广泛的性能指标，包括遗漏率、回写计数、总线流量等。其次，与基于估计的方法不同，我们的模拟器产生的结果是准确的。三是灵活变通。它可以模拟单处理器和多处理器缓存，具有更高级别缓存，子块替换和预取选项。最后，它很快。我们的仿真结果表明，它的运行时间与最快的单遍缓存模拟器相似。

引用次数: 31

Instruction set selection for ASIP design ASIP设计的指令集选择

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1145/301177.301187

M. Gschwind

We describe an approach for application-specific processor design based on an extendible microprocessor core. Core-based design allows to derive application-specific instruction processors from a common base architecture with low non-recurring engineering cost. The results of this application-specific customization of a common base architecture are families of related and largely compatible processor families. These families can share support tools and even binary compatible code which has been written for the common base architecture. Critical code portions are customized using the application-specific instruction set extensions. We describe a hardware/software co-design methodology which can be used with this design approach. The presented approach uses the processor core to allow early evaluation of ASIP design options using rapid prototyping techniques. We demonstrate this approach with two case studies, based on the implementation and evaluation of application-specific processor extensions for Prolog program execution, and memory prefetching for vector and matrix operations.

我们描述了一种基于可扩展微处理器核心的特定应用处理器设计方法。基于核心的设计允许从公共基础架构中派生特定于应用程序的指令处理器，并且具有较低的非重复性工程成本。这种特定于应用程序的公共基础体系结构定制的结果是一系列相关且基本兼容的处理器家族。这些家族可以共享支持工具，甚至是为通用基础架构编写的二进制兼容代码。关键代码部分使用特定于应用程序的指令集扩展自定义。我们描述了一种硬件/软件协同设计方法，可以与这种设计方法一起使用。提出的方法使用处理器核心，允许使用快速原型技术对ASIP设计选项进行早期评估。我们通过两个案例研究演示了这种方法，基于Prolog程序执行的特定于应用程序的处理器扩展的实现和评估，以及向量和矩阵操作的内存预取。

引用次数: 68

Peer-based multithreaded executable co-specification 基于对等的多线程可执行协同规范

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1109/HSC.1999.777402

D. E. Thomas, J. M. Paul, S. Peffers, S. J. Weber

We are integrating language-based software and hardware behaviors in C/pthreads and Verilog for unrestricted peer execution of the domains, including bounded (finite) and unbounded notions of computer system modeling. Since we do not restrict the modeling currently available in each domain, our co-specification is inclusive of both reactive and data-intensive systems. By viewing all mixed system state as shared memory accessible by threads in each domain, we differentiate domains by system resource inferences. We introduce a unified multithreading model for execution and motivate the need to expand the specification capabilities currently available in each domain for mixed-systems using widely accepted languages as a basis. We discuss specific aspects of our cosimulator, provide examples and results, and indicate future directions of our work.

我们正在C/pthreads和Verilog中集成基于语言的软件和硬件行为，以实现域的不受限制的对等执行，包括计算机系统建模的有界(有限)和无界概念。由于我们没有限制每个领域中当前可用的建模，因此我们的共同规范既包括反应性系统，也包括数据密集型系统。通过将所有混合系统状态视为每个域中线程可访问的共享内存，我们通过系统资源推断来区分域。我们引入了一个统一的多线程执行模型，并以广泛接受的语言为基础，为混合系统扩展当前在每个领域可用的规范功能。我们讨论了我们的协同模拟器的具体方面，提供了示例和结果，并指出了我们工作的未来方向。

引用次数: 3

Timed executable system specification of an ADSL modem using a C++ based design environment: A case study 基于c++设计环境的ADSL调制解调器定时可执行系统规范:一个案例研究

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

Pub Date : 1999-03-01 DOI: 10.1145/301177.301198

D. Desmet, M. Esvelt, P. Avasare, D. Verkest, H. Man

In this paper we propose a C++ based cosimulation and codesign environment, that allows to specify the timing behavior of the components of a complex hardware-software system independently of the functional refinement. While the hardware models are at a high functional abstraction level, thus resulting in a high simulation speed, yet the timing behavior can be specified with sufficient granularity to give relevant feedback concerning the timing of the software tasks. We demonstrate this method on the design of the digital part of an ADSL modem.

在本文中，我们提出了一个基于c++的协同仿真和协同设计环境，它允许独立于功能改进来指定复杂硬件软件系统组件的时序行为。虽然硬件模型处于高功能抽象级别，从而导致高仿真速度，但定时行为可以用足够的粒度指定，以提供有关软件任务定时的相关反馈。我们在ADSL调制解调器数字部分的设计中演示了这种方法。

引用次数: 13

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the Seventh International Workshop on Hardware/Software Codesign (CODES'99) (IEEE Cat. No.99TH8450)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀