15th International Symposium on System Synthesis, 2002.最新文献

英文中文

Efficient simulation of synthesis-oriented system level designs 面向综合的系统级设计的高效仿真

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581237

Rajesh K. Gupta, S. Shukla, N. Savoiu

Modeling for synthesis and modeling for simulation seem to be two competing goals in the context of C++-based modeling frameworks. One of the reasons is while most hardware systems have some inherent parallelism efficiently expressing it depends on whether the target usage is synthesis or simulation. For synthesis, designs are usually described with synthesis tools in mind and are therefore partitioned according to the targeted hardware units. For simulation, runtime efficiency is critical but our previous work has shown that a synthesis-oriented description is not necessarily the most efficient, especially if using multiprocessor simulators. Multiprocessor simulation requires preemptive multithreading but most current C++-based high level system description languages use cooperative multithreading to exploit parallelism to reduce overhead. We have seen that, for synthesis-oriented models, along with adding preemptive threading we need to transform the threading structure for good simulation performance. In this paper we present an algorithm for automatically applying such transformations to C++-based hardware models, ongoing work aimed at proving the equivalence between the original and transformed model, and a 62% to 76% simulation time improvement on a dual processor simulator.

在基于c++的建模框架中，为综合而建模和为仿真而建模似乎是两个相互竞争的目标。其中一个原因是，虽然大多数硬件系统都有一些固有的并行性，但它的有效表达取决于目标用途是综合还是模拟。对于综合，设计通常是用综合工具描述的，因此根据目标硬件单元进行划分。对于仿真，运行时效率是至关重要的，但我们之前的工作表明，面向合成的描述不一定是最有效的，特别是如果使用多处理器模拟器。多处理器仿真需要抢占式多线程，但目前大多数基于c++的高级系统描述语言都使用协作式多线程来利用并行性来减少开销。我们已经看到，对于面向综合的模型，除了添加抢占式线程外，我们还需要转换线程结构以获得良好的仿真性能。在本文中，我们提出了一种自动将这种转换应用于基于c++的硬件模型的算法，正在进行的工作旨在证明原始模型和转换模型之间的等价性，并在双处理器模拟器上将仿真时间提高62%至76%。

{"title":"Efficient simulation of synthesis-oriented system level designs","authors":"Rajesh K. Gupta, S. Shukla, N. Savoiu","doi":"10.1145/581199.581237","DOIUrl":"https://doi.org/10.1145/581199.581237","url":null,"abstract":"Modeling for synthesis and modeling for simulation seem to be two competing goals in the context of C++-based modeling frameworks. One of the reasons is while most hardware systems have some inherent parallelism efficiently expressing it depends on whether the target usage is synthesis or simulation. For synthesis, designs are usually described with synthesis tools in mind and are therefore partitioned according to the targeted hardware units. For simulation, runtime efficiency is critical but our previous work has shown that a synthesis-oriented description is not necessarily the most efficient, especially if using multiprocessor simulators. Multiprocessor simulation requires preemptive multithreading but most current C++-based high level system description languages use cooperative multithreading to exploit parallelism to reduce overhead. We have seen that, for synthesis-oriented models, along with adding preemptive threading we need to transform the threading structure for good simulation performance. In this paper we present an algorithm for automatically applying such transformations to C++-based hardware models, ongoing work aimed at proving the equivalence between the original and transformed model, and a 62% to 76% simulation time improvement on a dual processor simulator.","PeriodicalId":413693,"journal":{"name":"15th International Symposium on System Synthesis, 2002.","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127473324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Optimal code size reduction for software-pipelined and unfolded loops 减少软件流水线和展开循环的最佳代码大小

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581232

Bin Xiao, Z. Shao, C. Phongpensri, E. Sha, Qingfeng Zhuge

Software pipelining and unfolding are commonly used techniques to increase parallelism for DSP applications. However, these techniques expand the code size of the application significantly. For most DSP systems with limited memory resources, code size becomes one of the most critical concerns for the high-performance applications. In this paper, we present the code size reduction theory based on retiming and unfolding concepts. We propose a code size reduction framework to achieve the optimal code size of software-pipelined and unfolded loops by using conditional registers. The experimental results on several well-known benchmarks show the effectiveness of our code size reduction technique in controlling the code size of optimized loops.

软件流水线和软件展开是提高DSP应用并行性的常用技术。然而，这些技术显著地扩展了应用程序的代码大小。对于大多数内存资源有限的DSP系统，代码大小成为高性能应用程序最关键的问题之一。本文提出了基于重定时和展开概念的代码缩减理论。我们提出了一个代码缩减框架，通过使用条件寄存器来实现软件流水线和展开循环的最佳代码大小。在几个著名的基准测试上的实验结果表明，我们的代码缩减技术在控制优化循环的代码大小方面是有效的。

引用次数: 5

System-level modeling of a network switch SoC 网络交换机SoC的系统级建模

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581215

A. Cassidy, Christopher P. Andrews, D. E. Thomas, J. M. Paul

We present the modeling of the high-level design of a next generation network switch from the perspective of a Computer-Aided Design (CAD) team within the larger context of a design team consisting of an experienced network switch designer and an experienced VLSI hardware designer. After facilitating the design process, the CAD team observed how designers approach highlevel designs, beyond RTL. We motivate the need for CAD support that allows designers to effectively manipulate what we refer to as Memory Visualization Level (MVL) design.

我们从计算机辅助设计(CAD)团队的角度，在由经验丰富的网络交换机设计人员和经验丰富的VLSI硬件设计人员组成的设计团队的更大背景下，提出下一代网络交换机的高级设计建模。在促进设计过程之后，CAD团队观察了设计师如何处理超越RTL的高级设计。我们激发了对CAD支持的需求，它允许设计师有效地操作我们所说的内存可视化级别(MVL)设计。

引用次数: 3

Datapath merging and interconnection sharing for reconfigurable architectures 可重构体系结构的数据路径合并和互连共享

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581210

G. Araújo, S. Malik, Zhining Huang, N. Moreano

Recent work in reconfigurable computing research has shown that a substantial performance speedup can be achieved through architectures that map the most relevant application inner-loops to a reconfigurable datapath. Any solution to this problem must be able to synthesize a datapath for each loop and to merge them together into a single reconfigurable datapath. The main contribution of this paper is a novel graph-based technique for the datapath merge problem. This approach is based on the solution of a maximum clique problem that merges datapaths one at a time. A set of experiments, using the MediaBench benchmark, shows that the proposed technique produces 24% fewer datapath interconnections than a previous solution to this problem.

最近在可重构计算研究方面的工作表明，通过将最相关的应用程序内循环映射到可重构数据路径的体系结构，可以实现实质性的性能加速。此问题的任何解决方案都必须能够为每个循环合成数据路径，并将它们合并为单个可重构数据路径。本文的主要贡献是一种新的基于图的数据路径合并技术。这种方法基于一次合并一个数据路径的最大团问题的解决方案。一组使用mediabbench基准测试的实验表明，所提出的技术产生的数据路径互连比以前解决该问题的解决方案少24%。

引用次数: 29

Security-driven exploration of cryptography in DSP cores 安全驱动的DSP核心密码学探索

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581218

C. Gebotys

With the popularity of wireless communication devices a new important dimension of embedded systems design has arisen, that of security. This paper presents for the first time design exploration for secure implementation of cryptographic applications on a complex DSP processor core. A new metric for security, the implementation security index, is introduced for measuring resistance to power attacks. Elliptic curve cryptographic algorithms are used to demonstrate and quantize security, energy, performance and code size tradeoffs. Modification of power traces is performed to maximize security against power attacks which has significant savings in energy dissipation compared to an existing mathematical approach. This research is important for industry since efficient yet secure cryptography is crucial for wireless communication embedded system devices.

随着无线通信设备的普及，嵌入式系统设计中出现了一个新的重要维度，即安全性。本文首次提出了在复杂DSP处理器核心上安全实现加密应用的设计探索。引入了一种新的安全度量，即实现安全指数，用于测量对功率攻击的抵抗力。椭圆曲线密码算法用于演示和量化安全性，能源，性能和代码大小的权衡。修改电源走线是为了最大限度地提高安全性，防止电源攻击，这与现有的数学方法相比，在能量消耗方面有显着节省。该研究对工业具有重要意义，因为高效且安全的加密对于无线通信嵌入式系统设备至关重要。

引用次数: 2

Efficient power reduction techniques for time multiplexed address buses 时间复用地址总线的有效功率降低技术

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581246

N. Dutt, D. Hirschberg, M. Mamidipaka

We address the problem of reducing power dissipation on the time multiplexed address buses employed by contemporary DRAMs in SOC designs. We propose address encoding techniques to reduce the transition activity on the time-multiplexed address buses and hence reduce power dissipation. The reduction in transition activity is achieved by exploiting the principle of locality in address streams in addition to its sequential nature. We consider a realistic processor-memory architecture and apply the proposed techniques on the address streams derived from time-multiplexed DRAM addresses. Although the techniques by themselves axe not new, we show that a judicious combination of the existing techniques yield significant gains in power reductions. Experiments on SPEC95 benchmark programs show that our encoding techniques yield as much as 82% in transition activity compared to binary encoding. We show that these reductions amount to as much 60% reduction in the off-chip address bus power. Also since the encoder/decoder add some power overhead, we calculate the minimum off-chip bus capacitance to the internal node capacitance ratio needed to achieve power reductions.

我们解决了在SOC设计中当代dram采用的时间复用地址总线上降低功耗的问题。我们提出了地址编码技术，以减少时间复用地址总线上的转换活动，从而降低功耗。减少转换活动是通过利用地址流中的局部性原则以及其顺序性来实现的。我们考虑了一个现实的处理器-存储器架构，并将所提出的技术应用于从时间复用的DRAM地址派生的地址流。虽然这些技术本身并不新鲜，但我们表明，明智地结合现有技术，可以在降低功率方面获得显著收益。在SPEC95基准程序上的实验表明，与二进制编码相比，我们的编码技术在转换活动方面的产量高达82%。我们表明，这些减少相当于减少多达60%的芯片外地址总线功率。此外，由于编码器/解码器增加了一些功率开销，我们计算了实现功耗降低所需的最小片外总线电容与内部节点电容比。

{"title":"Efficient power reduction techniques for time multiplexed address buses","authors":"N. Dutt, D. Hirschberg, M. Mamidipaka","doi":"10.1145/581199.581246","DOIUrl":"https://doi.org/10.1145/581199.581246","url":null,"abstract":"We address the problem of reducing power dissipation on the time multiplexed address buses employed by contemporary DRAMs in SOC designs. We propose address encoding techniques to reduce the transition activity on the time-multiplexed address buses and hence reduce power dissipation. The reduction in transition activity is achieved by exploiting the principle of locality in address streams in addition to its sequential nature. We consider a realistic processor-memory architecture and apply the proposed techniques on the address streams derived from time-multiplexed DRAM addresses. Although the techniques by themselves axe not new, we show that a judicious combination of the existing techniques yield significant gains in power reductions. Experiments on SPEC95 benchmark programs show that our encoding techniques yield as much as 82% in transition activity compared to binary encoding. We show that these reductions amount to as much 60% reduction in the off-chip address bus power. Also since the encoder/decoder add some power overhead, we calculate the minimum off-chip bus capacitance to the internal node capacitance ratio needed to achieve power reductions.","PeriodicalId":413693,"journal":{"name":"15th International Symposium on System Synthesis, 2002.","volume":"743 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125658214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

System-level abstraction semantics 系统级抽象语义

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581251

D. Gajski, A. Gerstlauer

Raising the level of abstraction is widely seen as the solution for closing the productivity gap in system design. They key for the success of this approach, however, axe well-defined abstraction levels and models. In this paper, we present such system level semantics to cover the system design process. We define properties and features of each model. Formalization of the flow enables design automation for synthesis and verification to achieve the required productivity gains. Through customization, the semantics allow creation of specific design methodologies. We applied the concepts to system languages SystemC and SpecC. Using the example of a JPEG encoder, we will demonstrate the feasibility and effectiveness of the approach.

提高抽象层次被广泛认为是缩小系统设计中生产力差距的解决方案。然而，它们对于这种方法的成功至关重要，因为它们需要定义良好的抽象层次和模型。在本文中，我们提出了这样的系统级语义来涵盖系统设计过程。我们定义了每个模型的属性和特征。流程的形式化使合成和验证的设计自动化能够实现所需的生产力增益。通过定制，语义允许创建特定的设计方法。我们将这些概念应用到系统语言SystemC和spec中。使用JPEG编码器的示例，我们将演示该方法的可行性和有效性。

引用次数: 60

System-level design of IEEE1394 bus segment bridge IEEE1394总线分段桥的系统级设计

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581217

T. Onoye, Yukihiro Nakamura, A. Shigiya, K. Chikamura, K. Tsujino, T. Izumi, H. Yamamoto

A system simulation environment is constructed dedicatedly for IEEE1394 high-speed digital communication. In this environment, various network transactions inherent in communication systems are taken into account for system simulation, which is indispensable to enable IP (Intellectual Property)-based design of the systems. By using the proposed environment, system-level design of IEEE1394 link layer controller and bus segment bridge is achieved with great ability of network transactions as well as connectivities with physical layer chips. Functionalities of the designed bus segment bridge has been verified according to its FPGA implementation.

构建了IEEE1394高速数字通信系统仿真环境。在这种环境下，考虑到通信系统中固有的各种网络事务进行系统仿真，这对于实现基于IP(知识产权)的系统设计是必不可少的。利用所提出的环境，实现了IEEE1394链路层控制器和总线段桥的系统级设计，具有很强的网络事务处理能力和与物理层芯片的连接能力。通过FPGA的实现，验证了所设计的总线段桥的功能。

引用次数: 1

Timing analysis of embedded software for speculative processors 投机处理器嵌入式软件的时序分析

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581229

Abhik Roychoudhury, Xianfeng Li, T. Mitra

Static timing analysis of embedded software is important for systems with hard real-time constraints. To accurately estimate time bounds, it is essential to model the underlying micro-architecture. In this paper, we study static timing analysis of embedded programs for modern processors with speculative execution. Speculation of conditional branch outcomes significantly improves processor performance, and hence program execution time. Although speculation is used in most modern processors, its effect on software timing has not been systematically studied before. The main contribution of our work is a parameterized framework to model different control flow speculation schemes. The accuracy of our framework is illustrated through tight timing estimates obtained for benchmark programs.

嵌入式软件的静态时序分析对于具有硬实时性约束的系统是非常重要的。为了准确地估计时间范围，必须对底层微体系结构进行建模。本文研究了具有推测性执行的现代处理器嵌入式程序的静态时序分析。推测条件分支结果可以显著提高处理器性能，从而缩短程序执行时间。虽然在大多数现代处理器中都使用了推测，但它对软件时序的影响以前还没有被系统地研究过。我们工作的主要贡献是一个参数化框架来模拟不同的控制流推测方案。通过对基准程序进行严格的时序估计，说明了我们的框架的准确性。

引用次数: 20

Design experience of a chip multiprocessor Merlot and expectation to functional verification 芯片多处理器梅洛的设计经验，并期望功能验证

15th International Symposium on System Synthesis, 2002.

Pub Date : 2002-10-02 DOI: 10.1145/581199.581223

S. Matsushita

We have fabricated a Chip Multiprocessor prototype code-named Merlot to proof our novel speculative multithreading architecture. On Merlot, multiple threads provide wider issue window beyond ordinal instruction level parallel (ILP) processors like superscalar or VLIW. With the architecture, we estimate 3.0 times speedup against single processing elements (PE) on speech recognition code and IDCT code with four PEs. Merlot integrates on-chip devices, PCI interface, and SDRAM interfaces. We have encountered design issues of chip multiprocessor and SoC design. We have successfully run parallelized mpeg3 decoder on the first silicon with several software workarounds, thanks to functional verification environment including system modeling on RTL. However, bugs found in later stage of design have required larger manpower or delay of project. In this paper, we also discuss the methodology to improve functional verification coverage, and expect the solution in formal approaches.

我们制造了一个代号为Merlot的芯片多处理器原型来证明我们的新型推测多线程架构。在Merlot上，多线程提供了比有序指令级并行(ILP)处理器(如超标量或VLIW)更宽的问题窗口。使用该架构，我们估计语音识别代码和具有四个PE的IDCT代码在单处理元素(PE)下的加速速度为3.0倍。Merlot集成了片上器件、PCI接口和SDRAM接口。我们遇到了芯片多处理器和SoC设计的设计问题。由于功能验证环境(包括RTL上的系统建模)，我们已经成功地在第一块芯片上使用几个软件解决方案运行了并行mpeg3解码器。然而，在设计后期发现的bug需要更多的人力或项目的延迟。在本文中，我们还讨论了改进功能验证覆盖率的方法，并期望在正式方法中得到解决方案。

引用次数: 3

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

15th International Symposium on System Synthesis, 2002.

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀