首页 > 最新文献

2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)最新文献

英文 中文
Cyber/physical co-design in practice: Case studies in metroII 网络/物理协同设计的实践:都市ii的案例研究
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509408
Luca Rizzon, R. Passerone
To analyze embedded systems, engineers use tools that can simulate the performance of software components executed on hardware architectures. When the embedded system functionality is strongly correlated to physical quantities, as in the case of Cyber-Physical System (CPS), we need to model physical processes to determine the overall behavior of the system. Unfortunately, embedded systems simulators are not generally suitable to evaluate physical processes, and in the same way physical model simulators hardly capture the functionality of computing systems. In this work, we present a methodology to concurrently explore these aspects using the METROII design framework. In this work, we provide guidelines for the implementation of these models in the design environment, and discuss the results gathered with the simulator for two case studies.
为了分析嵌入式系统,工程师使用可以模拟在硬件架构上执行的软件组件性能的工具。当嵌入式系统功能与物理量紧密相关时,如在信息物理系统(CPS)的情况下,我们需要对物理过程建模以确定系统的整体行为。不幸的是,嵌入式系统模拟器通常不适合评估物理过程,并且以同样的方式,物理模型模拟器很难捕获计算系统的功能。在这项工作中,我们提出了一种使用METROII设计框架同时探索这些方面的方法。在这项工作中,我们提供了在设计环境中实现这些模型的指导方针,并讨论了用模拟器收集的两个案例研究的结果。
{"title":"Cyber/physical co-design in practice: Case studies in metroII","authors":"Luca Rizzon, R. Passerone","doi":"10.1109/SIES.2016.7509408","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509408","url":null,"abstract":"To analyze embedded systems, engineers use tools that can simulate the performance of software components executed on hardware architectures. When the embedded system functionality is strongly correlated to physical quantities, as in the case of Cyber-Physical System (CPS), we need to model physical processes to determine the overall behavior of the system. Unfortunately, embedded systems simulators are not generally suitable to evaluate physical processes, and in the same way physical model simulators hardly capture the functionality of computing systems. In this work, we present a methodology to concurrently explore these aspects using the METROII design framework. In this work, we provide guidelines for the implementation of these models in the design environment, and discuss the results gathered with the simulator for two case studies.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"28 10","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120845844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Communication aware multiprocessor binding for shared memory systems 用于共享内存系统的通信感知多处理器绑定
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509438
S. Adyanthaya, M. Geilen, T. Basten, J. Voeten, R. Schiffelers
We present a three-step binding algorithm for applications in the form of directed acyclic graphs (DAGs) of tasks with deadlines, that need to be bound to a shared memory multiprocessor platform. The aim of the algorithm is to obtain a good binding that results in low makespans of the schedules of the DAGs. It first clusters tasks assuming unlimited resources using a deadline-aware shared memory extension of the existing dominant sequence clustering algorithm. Second, the clusters produced are merged based on communication dependencies to fit into the number of available platform resources. As a final step, the clusters are allocated to the available resources by balancing the workload. The approach is compared to the state of the art bounded dominant sequence clustering (BDSC) algorithm that also performs clustering on a limited number of resources. We show that our three-step algorithm makes better use of the shared memory communication structure and produces significantly lower makespans than BDSC on benchmark cases.
我们提出了一种三步绑定算法,用于有向无环图(dag)形式的任务,这些任务需要绑定到共享内存多处理器平台。该算法的目标是获得一个良好的绑定,从而使dag调度的最大完成时间较低。它首先使用现有优势序列聚类算法的截止日期感知共享内存扩展来对假设无限资源的任务进行聚类。其次,根据通信依赖关系合并生成的集群,以适应可用平台资源的数量。作为最后一步,通过平衡工作负载将集群分配给可用资源。该方法与最先进的有界优势序列聚类(BDSC)算法进行了比较,后者也在有限数量的资源上执行聚类。我们表明,我们的三步算法更好地利用了共享内存通信结构,并且在基准测试情况下产生的makespans明显低于BDSC。
{"title":"Communication aware multiprocessor binding for shared memory systems","authors":"S. Adyanthaya, M. Geilen, T. Basten, J. Voeten, R. Schiffelers","doi":"10.1109/SIES.2016.7509438","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509438","url":null,"abstract":"We present a three-step binding algorithm for applications in the form of directed acyclic graphs (DAGs) of tasks with deadlines, that need to be bound to a shared memory multiprocessor platform. The aim of the algorithm is to obtain a good binding that results in low makespans of the schedules of the DAGs. It first clusters tasks assuming unlimited resources using a deadline-aware shared memory extension of the existing dominant sequence clustering algorithm. Second, the clusters produced are merged based on communication dependencies to fit into the number of available platform resources. As a final step, the clusters are allocated to the available resources by balancing the workload. The approach is compared to the state of the art bounded dominant sequence clustering (BDSC) algorithm that also performs clustering on a limited number of resources. We show that our three-step algorithm makes better use of the shared memory communication structure and produces significantly lower makespans than BDSC on benchmark cases.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114315633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Availability analysis for synchronous data-flow graphs in mixed-criticality systems 混合临界系统中同步数据流图的可用性分析
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509431
R. Medina, Etienne Borde, L. Pautet
The safety-critical industry is compelled to continually increase the number of functionalities in embedded systems. These platforms tend to integrate software with various non-functional requirements, in particular different levels of criticality. As a consequence, Mixed-Criticality Systems emerged in order to assure robustness, safety and predictability for these embedded platforms. Although Mixed-Critcality Systems show promising results, formal methods to quantify availability are still missing for this type of systems and will most likely be required for deployment. This paper presents a transformation process that first produces a formal model of a Mixed-Criticality System. From this formal model, it generates a PRISM automaton in order to compute availability.
安全关键行业被迫不断增加嵌入式系统中的功能数量。这些平台倾向于集成带有各种非功能需求的软件,特别是不同级别的关键需求。因此,为了确保这些嵌入式平台的鲁棒性、安全性和可预测性,混合临界系统应运而生。尽管混合临界系统显示出有希望的结果,但是这种类型的系统仍然缺少量化可用性的正式方法,并且很可能需要部署。本文提出了一个转换过程,该过程首先产生混合临界系统的形式化模型。从这个正式模型中,它生成一个PRISM自动机来计算可用性。
{"title":"Availability analysis for synchronous data-flow graphs in mixed-criticality systems","authors":"R. Medina, Etienne Borde, L. Pautet","doi":"10.1109/SIES.2016.7509431","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509431","url":null,"abstract":"The safety-critical industry is compelled to continually increase the number of functionalities in embedded systems. These platforms tend to integrate software with various non-functional requirements, in particular different levels of criticality. As a consequence, Mixed-Criticality Systems emerged in order to assure robustness, safety and predictability for these embedded platforms. Although Mixed-Critcality Systems show promising results, formal methods to quantify availability are still missing for this type of systems and will most likely be required for deployment. This paper presents a transformation process that first produces a formal model of a Mixed-Criticality System. From this formal model, it generates a PRISM automaton in order to compute availability.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131450708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hardware runtime verification of embedded software in SoPC SoPC中嵌入式软件的硬件运行验证
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509425
Dimitry Solet, Jean-Luc Béchennec, M. Briday, S. Faucou, S. Pillement
This paper discusses an implementation of runtime verification for embedded software running on a System-on-Programmable-Chip (SoPC) composed of a micro-controller and a FPGA. The goal is to verify at runtime that the execution of the software on the micro-controller conforms to a set of properties. To do so, a minimal instrumentation of the software is used to send events to a set of monitors implemented in the FPGA. These monitors are synthesised from a formal specification of the expected behavior of the system expressed as a set of past-time linear temporal logic (ptLTL) formulas.
本文讨论了在由微控制器和FPGA组成的可编程芯片系统(SoPC)上运行时验证嵌入式软件的实现方法。目标是在运行时验证微控制器上软件的执行是否符合一组属性。为此,使用最小的软件仪器将事件发送到FPGA中实现的一组监视器。这些监视器由系统预期行为的正式规范合成,表示为一组过去时间线性时间逻辑(ptLTL)公式。
{"title":"Hardware runtime verification of embedded software in SoPC","authors":"Dimitry Solet, Jean-Luc Béchennec, M. Briday, S. Faucou, S. Pillement","doi":"10.1109/SIES.2016.7509425","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509425","url":null,"abstract":"This paper discusses an implementation of runtime verification for embedded software running on a System-on-Programmable-Chip (SoPC) composed of a micro-controller and a FPGA. The goal is to verify at runtime that the execution of the software on the micro-controller conforms to a set of properties. To do so, a minimal instrumentation of the software is used to send events to a set of monitors implemented in the FPGA. These monitors are synthesised from a formal specification of the expected behavior of the system expressed as a set of past-time linear temporal logic (ptLTL) formulas.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"143 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133698444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Splitting tasks for migrating real-time automotive applications to multi-core ECUs 拆分任务,将实时汽车应用程序迁移到多核ecu
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509418
Martin Lowinski, D. Ziegenbein, S. Glesner
Real-time automotive software becomes increasingly complex due to the integration of more functionalities. At the same time, the computation power of electronic control units grows by increasing the number of cores instead of the core performance. Thus, in the near future a single task will require more computation power than a single core can offer. We propose an approach that solves this problem by splitting a task into multiple parallel task partitions with minimal synchronization overhead while maintaining all data dependencies of the functionalities inside the original task. The approach is successfully validated on a real-world engine management system.
由于集成了更多的功能,实时汽车软件变得越来越复杂。与此同时,电子控制单元的计算能力的增长是通过增加核心数量而不是核心性能来实现的。因此,在不久的将来,单个任务将需要比单个核心所能提供的更多的计算能力。我们提出了一种解决这个问题的方法,将一个任务分割成多个并行任务分区,以最小的同步开销,同时保持原始任务内部功能的所有数据依赖关系。该方法在实际发动机管理系统中得到了成功验证。
{"title":"Splitting tasks for migrating real-time automotive applications to multi-core ECUs","authors":"Martin Lowinski, D. Ziegenbein, S. Glesner","doi":"10.1109/SIES.2016.7509418","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509418","url":null,"abstract":"Real-time automotive software becomes increasingly complex due to the integration of more functionalities. At the same time, the computation power of electronic control units grows by increasing the number of cores instead of the core performance. Thus, in the near future a single task will require more computation power than a single core can offer. We propose an approach that solves this problem by splitting a task into multiple parallel task partitions with minimal synchronization overhead while maintaining all data dependencies of the functionalities inside the original task. The approach is successfully validated on a real-world engine management system.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125952239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
End-to-end path delay estimation in embedded software involving heterogeneous models 涉及异构模型的嵌入式软件端到端路径延迟估计
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509427
Padma Iyenghar, Arne Noyer, Joachim Engelhardt, E. Pulvermüller, C. Westerkamp
Extending model-based Non-Functional Property (NFP) analysis approaches to Embedded Software Engineering (ESE) projects cutting across heterogeneous modeling domains is an emerging research challenge. Towards this direction, a generic workflow for timing validation and a methodology for synchronization of timing attributes (before performing a timing analysis) in ESE projects developed using heterogeneous modeling domains is proposed in this paper. An experimental evaluation of the proposed approach, in a state-of-the-art timing analysis tool, using a real life, light-weight ESE project, developed using Unified Modeling Language (UML) and Simulink is presented.
将基于模型的非功能属性(NFP)分析方法扩展到跨异构建模领域的嵌入式软件工程(ESE)项目是一个新兴的研究挑战。朝着这个方向,本文提出了在使用异构建模领域开发的ESE项目中用于时间验证的通用工作流和时间属性同步(在执行时间分析之前)的方法。本文介绍了采用统一建模语言(UML)和Simulink开发的现实生活轻量级ESE项目,在最先进的时序分析工具中对所提出的方法进行了实验评估。
{"title":"End-to-end path delay estimation in embedded software involving heterogeneous models","authors":"Padma Iyenghar, Arne Noyer, Joachim Engelhardt, E. Pulvermüller, C. Westerkamp","doi":"10.1109/SIES.2016.7509427","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509427","url":null,"abstract":"Extending model-based Non-Functional Property (NFP) analysis approaches to Embedded Software Engineering (ESE) projects cutting across heterogeneous modeling domains is an emerging research challenge. Towards this direction, a generic workflow for timing validation and a methodology for synchronization of timing attributes (before performing a timing analysis) in ESE projects developed using heterogeneous modeling domains is proposed in this paper. An experimental evaluation of the proposed approach, in a state-of-the-art timing analysis tool, using a real life, light-weight ESE project, developed using Unified Modeling Language (UML) and Simulink is presented.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129407535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A soft real-time scheduling framework for wireless industrial sensor actuator networks 无线工业传感器执行器网络的软实时调度框架
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509416
G. Cena, S. Scanzio, L. Seno, A. Valenzano
Wireless networks are typically deemed not reliable enough to address real-time requirements typical of industrial distributed control applications. Several solutions were proposed in the past years to overcome their limitations and make them suitable for those (soft) real-time applications where a limited percentage of deadlines can be missed. A promising approach relies on centralized transmission (and retransmission) scheduling and exploits well-known results for feasibility analysis to provide a priori guarantees on performance. This paper focuses on a framework based on non-preemptive EDF transmission scheduling and takes into account two retransmission strategies. A prototype framework implementation, relying on conventional Linux platforms and employing Wi-Fi technology, is then exploited to perform a preliminary performance assessment. To provide useful implementation guidelines, the paper investigates how the choice of retransmission strategy and of the parameters to be used in feasibility analysis may affect performance.
无线网络通常被认为不够可靠,无法满足工业分布式控制应用的实时需求。在过去的几年里,提出了一些解决方案来克服它们的局限性,使它们适合于那些(软)实时应用程序,在这些应用程序中,可能会错过有限比例的截止日期。一种有前途的方法依赖于集中传输(和重传)调度,并利用众所周知的结果进行可行性分析,以提供对性能的先验保证。本文研究了一种基于非抢占式EDF传输调度的框架,并考虑了两种重传策略。然后利用基于传统Linux平台并采用Wi-Fi技术的原型框架实现来执行初步性能评估。为了提供有用的实施指南,本文研究了可行性分析中使用的重传策略和参数的选择如何影响性能。
{"title":"A soft real-time scheduling framework for wireless industrial sensor actuator networks","authors":"G. Cena, S. Scanzio, L. Seno, A. Valenzano","doi":"10.1109/SIES.2016.7509416","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509416","url":null,"abstract":"Wireless networks are typically deemed not reliable enough to address real-time requirements typical of industrial distributed control applications. Several solutions were proposed in the past years to overcome their limitations and make them suitable for those (soft) real-time applications where a limited percentage of deadlines can be missed. A promising approach relies on centralized transmission (and retransmission) scheduling and exploits well-known results for feasibility analysis to provide a priori guarantees on performance. This paper focuses on a framework based on non-preemptive EDF transmission scheduling and takes into account two retransmission strategies. A prototype framework implementation, relying on conventional Linux platforms and employing Wi-Fi technology, is then exploited to perform a preliminary performance assessment. To provide useful implementation guidelines, the paper investigates how the choice of retransmission strategy and of the parameters to be used in feasibility analysis may affect performance.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121486254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Automated FPGA implementations of BIP designs 自动化FPGA实现的BIP设计
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509424
Maya H. Safieddine, F. Zaraket, Mohamad Jaber, R. Kanj, M. Saghir
Embedded system designs have IP components that each may be implemented as either software or realtime accelerated hardware depending on the logic and the available resources. FPGA implementations are desirable hardware implementations for embedded systems since they are reconfigurable and several IP components can be deployed on the same FPGA board. BIP is a framework that facilitates correct-by-construction design of embedded systems by leveraging (1) a component based design paradigm and (2) the separation of behavior, interaction, and priority concerns. In this paper, we present the first automated design flow that takes a BIP design into an efficient FPGA implementation. We first transform the design into a one loop program implemented in C/C++ and then take the program into a sequential circuit implemented on an FPGA. We evaluate the design flow with the ATM and Quorom designs. We compare the results in terms of efficiency and performance of the software realizations of the BIP design such as regular BIP, enhanced flat BIP, and C/C++ simulations. We rely on the BIP engine to simulate regular BIP and flat BIP. The FPGA implementation is shown to be at least 16x-30x faster than the enhanced flat BIP implementation and 7x-10x faster than the C/C++ software realization running on state-of-the-art processors. Our experiments show higher performance improvement for larger design systems.
嵌入式系统设计具有IP组件,每个组件都可以根据逻辑和可用资源作为软件或实时加速硬件实现。FPGA实现是嵌入式系统的理想硬件实现,因为它们是可重构的,并且可以在同一FPGA板上部署多个IP组件。BIP是一个框架,通过利用(1)基于组件的设计范式和(2)行为、交互和优先级关注点的分离,促进了嵌入式系统的按结构正确设计。在本文中,我们提出了第一个自动化设计流程,将BIP设计纳入高效的FPGA实现。我们首先将设计转换为用C/ c++实现的单回路程序,然后将程序转换为在FPGA上实现的顺序电路。我们用ATM和quorum设计来评估设计流程。我们比较了BIP设计的软件实现的效率和性能,如常规BIP、增强平面BIP和C/ c++模拟。我们依靠BIP引擎来模拟常规BIP和平台BIP。FPGA实现比增强的平面BIP实现至少快16 -30倍,比在最先进的处理器上运行的C/ c++软件实现快7 -10倍。我们的实验表明,对于更大的设计系统,性能有更高的提高。
{"title":"Automated FPGA implementations of BIP designs","authors":"Maya H. Safieddine, F. Zaraket, Mohamad Jaber, R. Kanj, M. Saghir","doi":"10.1109/SIES.2016.7509424","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509424","url":null,"abstract":"Embedded system designs have IP components that each may be implemented as either software or realtime accelerated hardware depending on the logic and the available resources. FPGA implementations are desirable hardware implementations for embedded systems since they are reconfigurable and several IP components can be deployed on the same FPGA board. BIP is a framework that facilitates correct-by-construction design of embedded systems by leveraging (1) a component based design paradigm and (2) the separation of behavior, interaction, and priority concerns. In this paper, we present the first automated design flow that takes a BIP design into an efficient FPGA implementation. We first transform the design into a one loop program implemented in C/C++ and then take the program into a sequential circuit implemented on an FPGA. We evaluate the design flow with the ATM and Quorom designs. We compare the results in terms of efficiency and performance of the software realizations of the BIP design such as regular BIP, enhanced flat BIP, and C/C++ simulations. We rely on the BIP engine to simulate regular BIP and flat BIP. The FPGA implementation is shown to be at least 16x-30x faster than the enhanced flat BIP implementation and 7x-10x faster than the C/C++ software realization running on state-of-the-art processors. Our experiments show higher performance improvement for larger design systems.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131824559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient algorithms for memory management in embedded vision systems 嵌入式视觉系统中高效的内存管理算法
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509426
K. H. Salem, Yann Kieffer, S. Mancini
In the field of embedded vision systems, meeting the constraints on design criteria such as performance, area, and power consumption can be a real challenge. In fact, to alleviate the well known “Memory Mall”, it is mandatory to provide efficient memory hierarchies to reach usable performance for the system to be designed when it has to handle non-linear image treatments. To address this problematic, Mancini and Rousseau (Proc.DATE, 2012) have designed a software generator of memory hierarchies for each non-linear image operation. It allows one to improve dramatically the performance of the system, while moderately increasing its area and energy consumption. The trade-offs between these three parameters are then taken to the level of the design of the operation of this memory hierarchy, a problem that can be formalized as a 3-objective optimization problem. In this study, we formalize this problem and give new approaches both for the problem and particular sub-problems. The results on the same real-world data set as used by Mancini and Rousseau (Proc.DATE, 2012) show a very significant improvement and reduce the amount of transferred data up to 30% and a reduction of the computing time up to 15%.
在嵌入式视觉系统领域,满足诸如性能、面积和功耗等设计标准的限制可能是一个真正的挑战。事实上,为了缓解众所周知的“内存中心”,必须提供有效的内存层次结构,以便在必须处理非线性图像处理时为要设计的系统达到可用的性能。为了解决这个问题,Mancini和Rousseau (Proc.DATE, 2012)为每个非线性图像操作设计了一个内存层次的软件生成器。它可以极大地提高系统的性能,同时适度地增加其面积和能耗。然后将这三个参数之间的权衡考虑到该内存层次结构的操作设计层面,这个问题可以形式化为3目标优化问题。在本研究中,我们形式化了这个问题,并给出了这个问题和特定子问题的新方法。Mancini和Rousseau (Proc.DATE, 2012)使用的相同真实世界数据集的结果显示出非常显着的改进,将传输的数据量减少了30%,计算时间减少了15%。
{"title":"Efficient algorithms for memory management in embedded vision systems","authors":"K. H. Salem, Yann Kieffer, S. Mancini","doi":"10.1109/SIES.2016.7509426","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509426","url":null,"abstract":"In the field of embedded vision systems, meeting the constraints on design criteria such as performance, area, and power consumption can be a real challenge. In fact, to alleviate the well known “Memory Mall”, it is mandatory to provide efficient memory hierarchies to reach usable performance for the system to be designed when it has to handle non-linear image treatments. To address this problematic, Mancini and Rousseau (Proc.DATE, 2012) have designed a software generator of memory hierarchies for each non-linear image operation. It allows one to improve dramatically the performance of the system, while moderately increasing its area and energy consumption. The trade-offs between these three parameters are then taken to the level of the design of the operation of this memory hierarchy, a problem that can be formalized as a 3-objective optimization problem. In this study, we formalize this problem and give new approaches both for the problem and particular sub-problems. The results on the same real-world data set as used by Mancini and Rousseau (Proc.DATE, 2012) show a very significant improvement and reduce the amount of transferred data up to 30% and a reduction of the computing time up to 15%.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115492986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Real-time scheduling for 3D GPU rendering 实时调度3D GPU渲染
Pub Date : 2016-05-23 DOI: 10.1109/SIES.2016.7509411
Stephan Schnitzer, Simon Gansel, Frank Dürr, K. Rothermel
3D graphical functions in cars enjoy growing popularity. For instance, analog instruments of the instrument cluster are replaced by digital 3D displays as shown by Mercedes-Benz in the F125 prototype car. The trend to use 3D applications expands into two directions: towards more safety-relevant applications such as the speedometer and towards third-party applications, e.g., from an app store. In order to save cost, energy, and installation space, all these applications should share a single GPU. GPU sharing brings up the problem of providing real-time guarantees for rendering content of time-sensitive applications like the speedometer. To solve this problem, we present a real-time GPU scheduling framework which provides strong guarantees for critical applications while still giving as much GPU resources to less important applications as possible, thus ensuring a high GPU utilization. Since current GPUs are not preemptible, we use the estimated execution time of each GPU rendering job to make the scheduling decisions. Our evaluations show that our scheduler guarantees given real-time constraints, while achieving a high GPU utilization of 97%. Moreover, scheduling is performed highly efficient in real-time with less than 10 μs latency.
汽车上的3D图形功能越来越受欢迎。例如,仪表盘的模拟仪表被数字3D显示器所取代,正如梅赛德斯-奔驰F125原型车所示。使用3D应用程序的趋势扩展到两个方向:更安全相关的应用程序,如速度计和第三方应用程序,如应用程序商店。为了节省成本、能源和安装空间,所有这些应用程序应该共享一个GPU。GPU共享带来了为时间敏感应用程序(如速度计)提供实时渲染保证的问题。为了解决这个问题,我们提出了一个实时GPU调度框架,该框架为关键应用程序提供了强有力的保障,同时仍将尽可能多的GPU资源分配给不太重要的应用程序,从而确保了GPU的高利用率。由于当前GPU是不可抢占的,我们使用每个GPU渲染作业的估计执行时间来做出调度决策。我们的评估表明,我们的调度器保证了给定的实时约束,同时实现了97%的高GPU利用率。此外,调度的实时执行效率非常高,延迟小于10 μs。
{"title":"Real-time scheduling for 3D GPU rendering","authors":"Stephan Schnitzer, Simon Gansel, Frank Dürr, K. Rothermel","doi":"10.1109/SIES.2016.7509411","DOIUrl":"https://doi.org/10.1109/SIES.2016.7509411","url":null,"abstract":"3D graphical functions in cars enjoy growing popularity. For instance, analog instruments of the instrument cluster are replaced by digital 3D displays as shown by Mercedes-Benz in the F125 prototype car. The trend to use 3D applications expands into two directions: towards more safety-relevant applications such as the speedometer and towards third-party applications, e.g., from an app store. In order to save cost, energy, and installation space, all these applications should share a single GPU. GPU sharing brings up the problem of providing real-time guarantees for rendering content of time-sensitive applications like the speedometer. To solve this problem, we present a real-time GPU scheduling framework which provides strong guarantees for critical applications while still giving as much GPU resources to less important applications as possible, thus ensuring a high GPU utilization. Since current GPUs are not preemptible, we use the estimated execution time of each GPU rendering job to make the scheduling decisions. Our evaluations show that our scheduler guarantees given real-time constraints, while achieving a high GPU utilization of 97%. Moreover, scheduling is performed highly efficient in real-time with less than 10 μs latency.","PeriodicalId":185636,"journal":{"name":"2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)","volume":"619 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123322748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2016 11th IEEE Symposium on Industrial Embedded Systems (SIES)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1