Challenges in automotive cyber-physical systems design
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404199
Dip Goswami, Reinhard Schneider, Alejandro Masrur, M. Lukasiewycz, S. Chakraborty, Harald Voit, A. Annaswamy
Systems with tightly interacting computational (cyber) units and physical systems are generally referred to as cyber-physical systems. They involve an interplay between embedded systems, control theory, real-time systems, and software engineering. A very good example of cyber-physical systems design arises in the context of automotive architectures and software. Modern high-end cars have 50-100 processors or electronic control units (ECUs) that communicate over a network of buses such as CAN and FlexRay. In such complex settings, traditional control-theoretic approaches - where control engineers are only concerned with high-level plant and controller models - start breaking down. This is because implementation-level realities such as message delays, jitter, and task execution times are not adequately considered when designing the controller. Hence, it is becoming necessary to adopt a more holistic, cyber-physical systems design approach in which the semantic gap between high-level control models and their actual implementations on multiprocessor automotive platforms is quantified and consciously closed. In this paper we give several examples of how this may be done and discuss the current research challenges in this area faced by academia and industry.
{"title":"Challenges in automotive cyber-physical systems design","authors":"Dip Goswami, Reinhard Schneider, Alejandro Masrur, M. Lukasiewycz, S. Chakraborty, Harald Voit, A. Annaswamy","doi":"10.1109/SAMOS.2012.6404199","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404199","url":null,"abstract":"Systems with tightly interacting computational (cyber) units and physical systems are generally referred to as cyber-physical systems. They involve an interplay between embedded systems, control theory, real-time systems and software engineering. A very good example of cyber-physical systems design arises in the context of automotive architectures and software. Modern high-end cars have 50-100 processors or electronic control units (ECUs) that communicate over a network of buses such as CAN and FlexRay. In such complex settings, traditional control-theoretic approaches - where control engineers are only concerned with high-level plant and controller models - start breaking down. This is because implementation-level realities such as message delay, jitter, and task execution times are not adequately considered when designing the controller. Hence, it is becoming necessary to adopt a more holistic, cyber-physical systems design approach where the semantic gap between high-level control models and their actual implementations on multiprocessor automotive platforms is quantified and consciously closed. In this paper we give several examples on how this may be done and the current research challenges in this area that are being faced by the academia and the industry.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126278295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A framework for efficient cache resizing
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404160
G. Keramidas, Chrysovalantis Datsios, S. Kaxiras
We present a novel framework to dynamically reconfigure on-chip memory resources according to the changing behavior of the running applications. Our framework enables smooth scaling (i.e., resizing) of the on-chip caches, targeting both performance and power efficiency. In contrast to previous approaches, the resizing decisions in our framework are not tainted by transient events (e.g., misses) caused by downsizing, while at the same time avoiding oscillations in cache size due to trial-and-error resizing decisions. This minimizes both the execution-time penalty induced by resizing and the effective cache size. Furthermore, an inherent property of our approach is that the actual invalidation of cache blocks and the corresponding write-backs of dirty blocks are asynchronous to the resizing decisions, ensuring a smooth transition from one size to another. This makes it possible to apply our framework even to write-back caches. The proposed mechanism is simple to implement and requires minimal additional hardware. Using cycle-accurate simulations, we evaluate our proposal against previously proposed techniques. In all cases, our experimental results show significant benefits in both power and performance.
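To make the resizing decision concrete, below is a minimal software sketch of an interval-based, way-granular resize controller that bases its decisions only on misses not attributable to a recent downsize. The interval scheme, thresholds, and the way resize-induced misses are filtered out are illustrative assumptions, not the mechanism actually proposed in the paper.

```python
# Illustrative sketch of an interval-based, way-granular cache-resizing controller.
# Thresholds, interval length, and the separation of resize-induced misses from
# "intrinsic" misses are assumptions for illustration only.

class ResizeController:
    def __init__(self, total_ways=8, grow_threshold=0.05, shrink_threshold=0.01):
        self.active_ways = total_ways
        self.total_ways = total_ways
        self.grow_threshold = grow_threshold      # miss ratio above which we upsize
        self.shrink_threshold = shrink_threshold  # miss ratio below which we downsize
        self.intrinsic_misses = 0   # misses that would occur at the current size anyway
        self.resize_misses = 0      # misses caused by blocks lost to a recent downsize
        self.accesses = 0

    def record_access(self, hit, lost_to_resize=False):
        self.accesses += 1
        if not hit:
            if lost_to_resize:
                self.resize_misses += 1   # ignored for the next decision
            else:
                self.intrinsic_misses += 1

    def end_of_interval(self):
        """Take one resize decision per interval, using only intrinsic misses."""
        if self.accesses == 0:
            return self.active_ways
        miss_ratio = self.intrinsic_misses / self.accesses
        if miss_ratio > self.grow_threshold and self.active_ways < self.total_ways:
            self.active_ways += 1
        elif miss_ratio < self.shrink_threshold and self.active_ways > 1:
            self.active_ways -= 1   # dirty blocks in the disabled way are written
                                    # back lazily, decoupled from this decision
        self.accesses = self.intrinsic_misses = self.resize_misses = 0
        return self.active_ways

ctrl = ResizeController()
for i in range(1000):
    ctrl.record_access(hit=(i % 200 != 0))   # 0.5% intrinsic miss ratio
print(ctrl.end_of_interval())                # 7: one way released this interval
```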
{"title":"A framework for efficient cache resizing","authors":"G. Keramidas, Chrysovalantis Datsios, S. Kaxiras","doi":"10.1109/SAMOS.2012.6404160","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404160","url":null,"abstract":"We present a novel framework to dynamically reconfigure on-chip memory resources according to the changing behavior of the running applications. Our framework enables smooth scaling (i.e., resizing) of the on-chip caches targeting both performance and power efficiency. In contrast to previous approaches, the resizing decisions in our framework are not tainted by transient events (e.g., misses) that are due to downsizing avoiding at the same time swinging the cache size due to trial-and-error resizing decisions. This minimizes both execution time penalty induced by resizing as well as the effective cache size. Furthermore, an inherent property of our approach is that the actual invalidation of the cache blocks and the corresponding write-backs of the dirty blocks are asynchronous to resizing decisions, ensuring a smooth transition from one size to another. This makes it possible to apply our framework even on write-back caches. The proposed mechanism is simple to implement requiring minimal additional hardware. Using cycle-accurate simulations, we evaluate our proposal against previously proposed techniques. In all cases, our experimental results show significant benefits in both power and performance.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130711526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A co-simulation approach for system-level analysis of embedded control systems
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404200
M. Glaß, J. Teich, Liyuan Zhang
Control applications have become an integral part of modern networked embedded systems. However, there often exists a gap between control engineering and system design. The control engineer has detailed knowledge about the algorithms but abstracts from the system architecture and implementation. On the other hand, the system designer aims at achieving high-quality implementations based on quality constraints specified by the control engineer. This may result in either an overdesigned system when the specifications are pessimistic or unsafe system behavior when the specifications are too optimistic. Thus, future design automation approaches have to consider the quality of control applications both as design objectives and as design constraints to achieve safe yet highly optimized system implementations. The work at hand introduces an automatic tool flow at the Electronic System Level (ESL) that enables the optimization of a system implementation with quality of control introduced as a principal design objective (e.g., the maximum braking distance), while respecting constraints such as the maximum slip that ensures the maneuverability of the car. The gap between mathematically well-defined models for system synthesis and common analysis techniques for control quality is bridged by co-simulation: a SystemC-based virtual prototype of a distributed controller implementation is combined with high-level models of the plants specified in Matlab/Simulink. Through a model transformation, the traditional development process of control applications is combined with state-of-the-art ESL techniques, ensuring model consistency while enabling a high degree of automation.
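As a toy illustration of why a plant model and its controller implementation need to be simulated together, here is a minimal discrete co-simulation loop in which the controller output only becomes effective after an implementation-induced delay. The first-order plant, the gain, the sampling period, and the delay value are illustrative assumptions and are unrelated to the models used in the paper.

```python
# Minimal co-simulation sketch: a continuous plant stepped together with a discrete
# controller whose output reaches the actuator only after an implementation delay.
# Plant, gain, period, and delay are hypothetical values for illustration.

def cosimulate(t_end=5.0, dt=1e-3, period=0.01, delay=0.004, kp=4.0):
    x = 1.0                 # plant state, to be driven to 0
    u_applied = 0.0         # actuation currently seen by the plant
    pending = []            # (release_time, value) pairs waiting out the delay
    t, next_sample = 0.0, 0.0
    while t < t_end:
        if t >= next_sample:                    # controller task released
            pending.append((t + delay, -kp * x))
            next_sample += period
        while pending and pending[0][0] <= t:   # delayed output reaches the actuator
            _, u_applied = pending.pop(0)
        x += dt * (-0.5 * x + u_applied)        # plant: dx/dt = -0.5 x + u
        t += dt
    return x

print(cosimulate())   # residual state after 5 s with a 4 ms actuation delay
```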
{"title":"A co-simulation approach for system-level analysis of embedded control systems","authors":"M. Glaß, J. Teich, Liyuan Zhang","doi":"10.1109/SAMOS.2012.6404200","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404200","url":null,"abstract":"Control applications have become an integral part of modern networked embedded systems. However, there often exists a gap between control engineering and system design. The control engineer has detailed knowledge about the algorithms but is abstracting from the system architecture and implementation. On the other hand, the system designer aims at achieving high-quality implementations based on quality constraints specified by the control engineer. This may result in either an overdesigned system in case the specifications are pessimistic or an unsafe system behavior when specifications are too optimistic. Thus, future design automation approaches have to consider the quality of control applications both as design objectives and design constraints to achieve safe yet highly optimized system implementations. The work at hand introduces an automatic tool flow at the Electronic System Level (ESL) that enables the optimization of a system implementation with quality of control being introduced as a principal design objective, like the maximum braking distance, while respecting constraints like maximum slip to ensure maneuverability of a car. The gap between mathematically well-defined models for system synthesis and common analysis techniques for control quality is bridged by co-simulation: A SystemC-based virtual prototype of a distributed controller implementation is combined with high-level models of the plants specified in Matlab/Simulink. Through a model transformation, the traditional development process of control applications is combined with state-of-the-art ESL techniques, ensuring model consistency while enabling a high degree of automation.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132850054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hardware/software specialization through aspects: The LARA approach
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404183
João MP Cardoso, Tiago Carvalho, J. Teixeira, P. Diniz, F. Gonçalves, Z. Petrov
LARA is a programming language being developed to complement application code written in a host programming language with instrumentation code for monitoring, logging, and debugging; with the user's knowledge about specific characteristics of the application; with non-functional requirements; and with compiler, mapping, and synthesis strategies to guide and control design flows, especially those used to map computations to FPGA-based systems. This paper shows how the aspect-oriented approach provided by LARA allows developers to specify complementary program information that can be used by LARA-aware design flows to promote customized FPGA-based software/hardware implementations. Program and compiler/mapping specialization take advantage of specific properties of applications to optimize and customize specific application modules and software/hardware implementations, e.g., according to usage contexts. We illustrate the concept using a hotspot function from a real-life industrial application. The results show the importance of program specialization in deriving higher-performance hardware/software implementations.
{"title":"Hardware/software specialization through aspects: The LARA approach","authors":"João MP Cardoso, Tiago Carvalho, J. Teixeira, P. Diniz, F. Gonçalves, Z. Petrov","doi":"10.1109/SAMOS.2012.6404183","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404183","url":null,"abstract":"LARA is a programming language being developed to complement application code in a host programming language with instrumentation code, for monitoring, logging, and debugging, user's knowledge about specific characteristics of the application, non-functional requirements, and compiler, mapping and synthesis strategies to guide/control design-flows, especially the ones used to map computations to FPGA-based systems. This paper shows how the aspect-oriented approach provided by LARA allows developers to specify complementary program information that can be used by LARA aware design-flows to promote customized FPGA-based software/hardware implementations. Program and compiler/mapping specialization take advantage of specific properties of applications to optimize and customize specific application modules and software/hardware implementations, e.g., according to usage contexts. We illustrate the concept using a hotspot function from a real-life, industrial, application. The results show the importance of program specialization in deriving hardware/software implementations with higher-performance.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133173969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An FPGA-accelerated testbed for hardware component development in MIMO wireless communication systems
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404187
Filippo Borlenghi, Dominik Auras, E. M. Witte, T. Kempf, G. Ascheid, R. Leupers, H. Meyr
FPGA-based prototyping is nowadays common practice in the functional verification of hardware components, since it covers a large number of test cases in a shorter time than HDL simulation. In addition, an FPGA-based emulator significantly accelerates simulation with respect to bit-true software models. This speed-up is crucial when the statistical properties of a system have to be analyzed by Monte Carlo techniques. In this paper we consider a multiple-input multiple-output (MIMO) wireless communication system and show how integrating an FPGA accelerator into the software simulation framework is key to enabling the development of complex hardware components in the receiver, from the algorithm all the way to chip testing. In particular, we focus on a MIMO detector implementation based on the depth-first sphere decoding algorithm. The speed-up of up to three orders of magnitude achieved by hardware-accelerated simulation compared to a pure software testbed enables an extensive fixed-point exploration. Furthermore, it allows a unique characterization of the system's communication performance and of the MIMO detector's run-time characteristics, which vary for different configuration parameters and operating scenarios and hence require a thorough investigation.
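For context, the following is a small, hedged sketch of the depth-first sphere decoding algorithm family the paper's MIMO detector hardware is based on: after a QR decomposition, candidate symbol vectors are enumerated depth-first, and branches whose partial distance already exceeds the best distance found so far are pruned. The real-valued 2x2 system and the two-element alphabet are simplifications; the actual detector, its symbol ordering, and its fixed-point formats are outside this sketch.

```python
# Hedged software sketch of depth-first sphere decoding (maximum-likelihood MIMO
# detection). The channel matrix, noise level, and alphabet below are illustrative.
import numpy as np

def sphere_decode(H, y, alphabet):
    """Return the ML symbol vector argmin_x ||y - Hx||^2 by depth-first search."""
    n = H.shape[1]
    Q, R = np.linalg.qr(H)
    z = Q.T @ y
    best = {"dist": np.inf, "x": None}

    def descend(level, partial, x):
        if partial >= best["dist"]:          # prune: this subtree cannot improve
            return
        if level < 0:                        # reached a leaf: full candidate vector
            best["dist"], best["x"] = partial, x.copy()
            return
        # interference from the symbols already fixed at deeper levels
        interf = sum(R[level, j] * x[j] for j in range(level + 1, n))
        for s in alphabet:                   # enumerate children of this node
            x[level] = s
            inc = (z[level] - interf - R[level, level] * s) ** 2
            descend(level - 1, partial + inc, x)

    descend(n - 1, 0.0, np.zeros(n))
    return best["x"]

H = np.array([[1.0, 0.4], [0.3, 0.9]])
x_true = np.array([1.0, -1.0])
y = H @ x_true + 0.05 * np.random.randn(2)
print(sphere_decode(H, y, alphabet=(-1.0, 1.0)))   # expected: [ 1., -1.]
```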
{"title":"An FPGA-accelerated testbed for hardware component development in MIMO wireless communication systems","authors":"Filippo Borlenghi, Dominik Auras, E. M. Witte, T. Kempf, G. Ascheid, R. Leupers, H. Meyr","doi":"10.1109/SAMOS.2012.6404187","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404187","url":null,"abstract":"FPGA-based prototyping is nowadays common practice in the functional verification of hardware components since it allows to cover a large number of test cases in a shorter time compared to HDL simulation. In addition, an FPGA-based emulator significantly accelerates the simulation with respect to bit-true software models. This speed-up is crucial when the statistical properties of a system have to be analyzed by Monte Carlo techniques. In this paper we consider a multiple-input multiple-output (MIMO) wireless communication system and show how integrating an FPGA accelerator in the software simulation framework is key to enable the development of complex hardware components in the receiver, from algorithm all the way to chip testing. In particular, we focus on a MIMO detector implementation based on the depth-first sphere decoding algorithm. The speed-up of up to 3 orders of magnitude achieved by hardware-accelerated simulation compared to a pure software testbed enables an extensive fixed-point exploration. Furthermore, it allows a unique characterization of the system communication performance and the MIMO detector run-time characteristics, which vary for different configuration parameters and operating scenarios and hence require a thorough investigation.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"132 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122098487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Efficient computing in cyber-physical systems
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404195
P. Marwedel, M. Engel
Computing in cyber-physical systems has to be efficient in terms of a number of objectives. In particular, computing has to be execution-time and energy efficient. In this paper, we consider optimization techniques aiming at efficiency in terms of these two objectives. In the first part, we consider techniques for the integration of compilers and worst-case execution time (WCET) estimation. We demonstrate how such integration opens the door to WCET-reduction algorithms. For example, an algorithm for WCET-aware compilation reduces the WCET of an automotive application by more than 50% by exploiting scratchpad memories (SPMs). In the second part, we demonstrate techniques for improving the energy efficiency of cyber-physical systems, in particular the use of SPMs. In the third part, we demonstrate how optimization for multiple objectives can be taken into account. This paper provides an overview of work performed at the Chair for Embedded Systems of TU Dortmund and the Informatik Centrum Dortmund, Germany.
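As a rough illustration of how scratchpad memories can reduce the WCET, the sketch below treats SPM allocation as a 0/1 knapsack solved greedily: objects with the largest worst-case-path gain per byte are placed into the SPM until it is full. This is not the paper's WCET-aware compilation algorithm; the object names, sizes, and gain values are hypothetical.

```python
# Minimal sketch of WCET-driven scratchpad (SPM) allocation as a greedy knapsack.
# Objects, sizes, and gains are made up; a real WCET-aware compiler derives them
# from worst-case-path analysis and may use an exact allocation algorithm.

def spm_allocate(objects, capacity):
    """objects: list of (name, size_bytes, wcet_gain_cycles). Greedy by gain density."""
    chosen, used, total_gain = [], 0, 0
    for name, size, gain in sorted(objects, key=lambda o: o[2] / o[1], reverse=True):
        if used + size <= capacity:
            chosen.append(name)
            used += size
            total_gain += gain
    return chosen, total_gain

objects = [
    ("isr_can_rx",   512, 12000),   # hypothetical hot interrupt handler
    ("lookup_table", 2048, 9000),
    ("pid_step",     256, 5000),
    ("logging",      1536, 300),
]
print(spm_allocate(objects, capacity=2048))
# -> (['isr_can_rx', 'pid_step'], 17000)  cycles removed from the worst-case path
```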
{"title":"Efficient computing in cyber-physical systems","authors":"P. Marwedel, M. Engel","doi":"10.1109/SAMOS.2012.6404195","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404195","url":null,"abstract":"Computing in cyber-physical systems has to be efficient in terms of a number of objectives. In particular, computing has to be execution-time and energy efficient. In this paper, we will consider optimization techniques aiming at efficiency in terms of these two objectives. In the first part, we will consider techniques for the integration of compilers and worst-case execution time (WCET) estimation. We will demonstrate, how such integration opens the door to WCET-reduction algorithms. For example, an algorithm for WCET-aware compilation reduces the WCET for an automotive application by more than 50% by exploiting scratch pad memories (SPMs). In the second part, we will demonstrate techniques for improving the energy efficiency of cyber-physical systems, in particular the use of SPMs. In the third part, we demonstrate how the optimization for multiple objectives taken into account. This paper provides an overview of work performed at the Chair for Embedded Systems of TU Dortmund and the Informatik Centrum Dortmund, Germany1.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131827402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Combining on-hardware prototyping and high-level simulation for DSE of multi-ASIP systems
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404191
P. Meloni, Sebastiano Pomata, L. Raffo, R. Piscitelli, A. Pimentel
Modern heterogeneous multi-processor embedded systems very often expose to the designer a large number of degrees of freedom, related to application partitioning/mapping and to the component- and system-level architecture composition. The number is even larger when the designer targets systems based on configurable Application-Specific Instruction-set Processors (ASIPs), due to the fine customizability of their internal architecture. This poses the need for effective and user-friendly design tools, capable of dealing with the extremely wide system-level design space exposed by multi-processor architectures and, at the same time, with an extended variety of processing-element architectural configurations, to be evaluated in detail and in reasonable time. As a possible solution, within the MADNESS project [1], an integrated toolset has been proposed, combining the benefits of novel fast FPGA-based prototyping techniques with those provided by high-level simulation. In the toolset, the resulting evaluation platform serves as an underlying layer for a design space search algorithm. The paper presents the individual tools included in the toolset and their interaction strategy. The approach is then evaluated with a design space exploration case study, taking a video compression kernel as the target application. The integrated toolset has been used to produce a Pareto front of evaluated system-level configurations.
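As a small illustration of the last step, the sketch below extracts the Pareto front from a set of evaluated system-level configurations with two objectives to be minimized. The metric names (latency, area), the configuration labels, and the values are illustrative assumptions, not results from the paper.

```python
# Pareto-front extraction over evaluated design points, as a DSE flow would do
# after collecting metrics from simulation/prototyping. Values are hypothetical.

def pareto_front(points):
    """points: list of (label, latency, area); both objectives are minimized."""
    front = []
    for label, lat, area in points:
        dominated = any(
            (l2 <= lat and a2 <= area) and (l2 < lat or a2 < area)
            for _, l2, a2 in points
        )
        if not dominated:
            front.append((label, lat, area))
    return front

configs = [
    ("2xASIP",        9.1, 3.2),
    ("2xASIP_bigL1",  7.8, 4.4),
    ("4xASIP",        5.9, 6.1),
    ("4xASIP_bigL1",  6.0, 7.3),   # dominated by 4xASIP and dropped
    ("1xASIP",       14.0, 1.9),
]
print(pareto_front(configs))
```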
{"title":"Combining on-hardware prototyping and high-level simulation for DSE of multi-ASIP systems","authors":"P. Meloni, Sebastiano Pomata, L. Raffo, R. Piscitelli, A. Pimentel","doi":"10.1109/SAMOS.2012.6404191","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404191","url":null,"abstract":"Modern heterogeneous multi-processor embedded systems very often expose to the designer a large number of degrees of freedom, related to the application partitioning/mapping and to the component- and system-level architecture composition. The number is even larger when the designer targets systems based on configurable Application Specific Instruction-set Processors, due to the fine customizability of their internal architecture. This poses the need for effective and user-friendly design tools, capable to deal with the extremely wide system-level design space exposed by multi-processor architecture and, at the same time, with an extended variety of processing element architectural configurations, to be evaluated in detail and in reasonable times. As a possible solution, within the MADNESS project [1], an integrated toolset has been proposed, combining the benefits of novel fast FPGA-based prototyping techniques with those provided by high-level simulation. In the toolset, the resulting evaluation platform serves as an underlying layer for a Design Space search algorithm. The paper presents the individual tools included in the toolset and their interaction strategy. The approach is then evaluated with a design space exploration case study, taking as a target application a video compression kernel. The integrated toolset has been used to produce a Pareto front of evaluated system-level configurations.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121215509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Multicore enablement for Cyber Physical Systems
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404198
A. Herkersdorf
Summary form only given. Cyber Physical Systems (CPS) design expands the horizon of traditional hardware/software systems engineering into the specifics of natural sciences such as physics or bio-chemistry. This is a new quality and an inherent challenge of CPS, since interdisciplinary skills, methods, tools, and design flows, which originally weren't even considered for mutual awareness, should now be linked, inter-work with each other, and eventually be merged in order to achieve a holistic system co-optimization. At the same time, the application domains in which CPS are playing increasingly important roles, such as ecological electro-mobility, medical healthcare, ambient assisted living, or all aspects of energy generation, distribution, and control, are characterized by vastly growing demands for compute performance. Multicore and manycore architectures, brought forward by all leading processor vendors, are the natural candidates for compute workhorses (and racehorses) within the cyber domain of CPS. However, the use of multicore processors in CPS requires levels of safety, security, real-time support, and energy efficiency that multicore processors for general-purpose computing applications have typically not been designed for. In order to help reduce the complexity arising from the CPS inter-domain linkage challenges described above, this contribution to the SAMOS CPS special session discusses generic accelerators and hardware/software techniques that provide coverage for critical non-functional requirements of off-the-shelf multicore and MPSoC (Multi-Processor System-on-Chip) architectures. Our main focus is on suitable virtualization and self-organization techniques for CPS.
{"title":"Multicore enablement for Cyber Physical Systems","authors":"A. Herkersdorf","doi":"10.1109/SAMOS.2012.6404198","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404198","url":null,"abstract":"Summary form only given. Cyber Physical Systems (CPS) design expands the horizon of traditional hardware/software systems engineering into the specifics of natural sciences such as physics or bio-chemistry. This is a new quality and inherent challenge of CPS since interdisciplinary skills, methods, tools and design flows, which originally werent even considered for mutual awareness, should now be linked, inter-work with each other and eventually be merged in order to achieve a holistic system co-optimization. At the same time, application domains in which CPS are playing increasingly important roles such as ecological electro-mobility, medical healthcare, ambient assisted living or all aspects around energy generation, distribution and control all can be characterized by vastly growing demands for compute performance. Multicore and manycore architectures, brought forward by all leading processor vendors, are the natural candidates for compute work (/race) horses within the cyber domain of CPS. However, the use of multicore processors in CPS requires levels of safety, security, real-time support and energy efficiency, which multicore processors for general purpose computing applications have typically not been designed for. In order to help reducing the complexity from the above described CPS inter-domain linkage challenges, this contribution to the Samos CPS special session discusses generic accelerators and hardware/software techniques that provide coverage for critical non-functional requirements of off-the-shelf multicore and MPSoC (Multi-Processor System on Chip) architectures. Our main focus is on suitable virtualization and self-organization techniques for CPS.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126834410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An FPGA-based probability-aware fault simulator
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404190
D. May, W. Stechele
A recent approach to dealing with the challenges that come along with the shrinking feature size of CMOS circuits is probabilistic computing. Those challenges, such as noise or process variations, result in a certain probabilistic behavior of the circuit and its gates. Probabilistic computing, also referred to as pCMOS, does not try to avoid the occurrence of errors, but tries to determine the probability of errors at the output of the circuit and to limit it to a value that the specific application can tolerate. Past research has shown that probabilistic computing has the potential to drastically reduce the power consumption of circuits by scaling the supply voltage of gates to a value where they become non-deterministic, while tolerating a certain amount of probabilistic behavior at the output. Therefore, one main task in the design of pCMOS circuits is to determine the error probabilities at the output of the circuit, given a combination of error probabilities at the gates. In earlier work, pCMOS circuits have been characterized by memory-consuming and complex analytical calculations or by time-consuming software-based simulations. Hardware-accelerated emulators exist in large numbers, but they lack support for injecting errors with specified probabilities into as many circuit elements as the user specifies at the same time. In this paper, we propose an FPGA-based fault simulator that allows for fast error-probability classification and injection of errors at gate and RT level, and that is furthermore independent of the target architecture. Moreover, we demonstrate the usefulness of such a simulator by characterizing the probabilistic behavior of two benchmark circuits and revealing their energy-saving capability.
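To illustrate the underlying idea in software, the following Monte Carlo sketch flips each gate output of a small netlist with a per-gate error probability and estimates the resulting error probability at the circuit outputs. The one-bit full-adder netlist and the probability values are assumptions for illustration; the paper performs this kind of characterization in FPGA hardware rather than in a software loop.

```python
# Monte Carlo estimate of circuit output error probability under per-gate error
# injection. Netlist (1-bit full adder) and gate error probabilities are made up.
import random

def full_adder(a, b, cin, p_err):
    """Evaluate a 1-bit full adder, flipping each gate output with probability p_err[gate]."""
    def gate(name, value):
        return value ^ (random.random() < p_err.get(name, 0.0))
    x1 = gate("xor1", a ^ b)
    s  = gate("xor2", x1 ^ cin)
    a1 = gate("and1", a & b)
    a2 = gate("and2", x1 & cin)
    cout = gate("or1", a1 | a2)
    return s, cout

def output_error_probability(p_err, trials=100_000):
    errors = 0
    for _ in range(trials):
        a, b, cin = (random.getrandbits(1) for _ in range(3))
        golden = full_adder(a, b, cin, {})      # fault-free reference run
        faulty = full_adder(a, b, cin, p_err)   # run with probabilistic gates
        errors += golden != faulty
    return errors / trials

print(output_error_probability({"xor1": 0.01, "xor2": 0.01, "and1": 0.01,
                                "and2": 0.01, "or1": 0.01}))
```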
{"title":"An FPGA-based probability-aware fault simulator","authors":"D. May, W. Stechele","doi":"10.1109/SAMOS.2012.6404190","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404190","url":null,"abstract":"A recent approach to deal with the challenges that come along with the shrinking feature size of CMOS circuits is probabilistic computing. Those challenges, such as noise or process variations, result in a certain probabilistic behavior of the circuit and its gates. Probabilistic Computing, also referred to as pCMOS, does not try to avoid the occurrence of errors, but tries to determine the probability of errors at the output of the circuit, and to limit it to a value that the specific application can tolerate. Past research has shown that probabilistic computing has potential to drastically reduce the power consumption of circuits by scaling the supply voltage of gates to a value where they become non-deterministic, while tolerating a certain amount of probabilistic behavior at the output. Therefore, one main task in the design of pCMOS circuits is to determine the error probabilities at the output of the circuit, given a combination of error probabilities at the gates. In earlier work, pCMOS circuits have been characterized by memory-consuming and complex analytical calculations or by time-consuming software-based simulations. Hardware-accelerated emulators exist in large numbers, but miss the support of injecting errors with specified probabilities into as many circuit elements the user specifies at the same time. In this paper, we propose an FPGA-based fault simulator that allows for fast error probability classification, injection of errors at gate- and RT-level, and that is furthermore independent on the target architecture. Moreover, we demonstrate the usefulness of such a simulator by characterizing the probabilistic behavior of two benchmark circuits and reveal their energy-saving capability.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130126670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Is time predictability quantifiable?
Pub Date: 2012-07-16 | DOI: 10.1109/SAMOS.2012.6404196
Martin Schoeberl
Computer architects and researchers in the real-time domain have started to investigate processors and architectures optimized for real-time systems. Optimized for real-time systems means time-predictable, i.e., architectures for which it is possible to statically derive a tight bound on the worst-case execution time. To compare different approaches we would like to quantify time predictability; that means we need to measure time predictability. In this paper we discuss the different approaches to such measurements and conclude that time predictability is practically not quantifiable. We can only compare the worst-case execution time bounds of different architectures.
{"title":"Is time predictability quantifiable?","authors":"Martin Schoeberl","doi":"10.1109/SAMOS.2012.6404196","DOIUrl":"https://doi.org/10.1109/SAMOS.2012.6404196","url":null,"abstract":"Computer architects and researchers in the real-time domain start to investigate processors and architectures optimized for real-time systems. Optimized for real-time systems means time predictable, i.e., architectures where it is possible to statically derive a tight bound of the worst-case execution time. To compare different approaches we would like to quantify time predictability. That means we need to measure time predictability. In this paper we discuss the different approaches for these measurements and conclude that time predictability is practically not quantifiable. We can only compare the worst-case execution time bounds of different architectures.","PeriodicalId":130275,"journal":{"name":"2012 International Conference on Embedded Computer Systems (SAMOS)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117187024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}