Title: Poster Abstract: Scheduling Multi-Threaded Tasks to Reduce Intra-Task Cache Contention
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461352
Corey Tessler, N. Fisher
Summary form only given. Research on hard real-time systems and their models has predominantly focused on single-threaded tasks. When multithreaded tasks are introduced into the classical real-time model, the individual threads are treated as distinct tasks, one per thread. These artificial tasks share the deadline, period, and worst-case execution time of their parent task. In the presence of instruction and data caches this model is overly pessimistic, failing to account for the execution-time benefit of cache hits when multiple threads of execution share a memory address space. This work takes a new perspective on instruction caches, treating the cache as a benefit to schedulability for a single task with m threads. To realize this “inter-thread cache benefit”, a new scheduling algorithm and an accompanying worst-case execution time (WCET) calculation method are proposed. The scheduling algorithm permits threads to execute across conflict-free regions and blocks those threads that would create an unnecessary cache conflict. The WCET bound is determined for the entire set of m threads, rather than treating each thread as a distinct task. Both the scheduler and the WCET method rely on the calculation of conflict-free regions, which are found by a static analysis method that requires no external information from the system designer. By virtue of this perspective, the system's total execution time is reduced, which is reflected in a tighter WCET bound compared to the techniques applied to the classical model. Obtaining this tighter bound requires the integration of three typically independent areas: WCET analysis, schedulability analysis, and cache-related preemption delay analysis.
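To make the gating idea concrete, the following is a minimal C sketch of how a scheduler could block a thread whose next conflict-free region overlaps the cache footprint of a region already in execution. It is not the algorithm of the paper: the region descriptor, the bit-mask footprint encoding, and the function names are all assumptions made for illustration.

    #include <stdbool.h>
    #include <stdint.h>

    #define NUM_THREADS 4

    /* Hypothetical per-region descriptor: the cache sets the region touches,
     * encoded as a 64-bit footprint (one bit per cache set), as produced by
     * an offline static analysis. */
    struct region {
        uint64_t cache_sets;
    };

    /* Footprints of the regions currently executing on each thread
     * (0 means the thread is idle or blocked). */
    static uint64_t active_footprint[NUM_THREADS];

    /* A thread may enter its next region only if that region shares no
     * cache set with any region already in execution. */
    static bool may_dispatch(int thread, const struct region *next)
    {
        for (int t = 0; t < NUM_THREADS; t++) {
            if (t == thread)
                continue;
            if (active_footprint[t] & next->cache_sets)
                return false;   /* would create an avoidable conflict: block */
        }
        active_footprint[thread] = next->cache_sets;
        return true;            /* conflict free: let the thread proceed */
    }

When a thread finishes its region, the corresponding footprint entry would be cleared so that blocked threads can be re-evaluated.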
{"title":"Poster Abstract: Scheduling Multi-Threaded Tasks to Reduce Intra-Task Cache Contention","authors":"Corey Tessler, N. Fisher","doi":"10.1109/RTAS.2016.7461352","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461352","url":null,"abstract":"Summary form only given. Research on hard real-time systems and their models has predominately focused upon single-threaded tasks. When multithreaded tasks are introduced to the classical real-time model the individual threads are treated as distinct tasks, one for each thread. These artificial tasks share the deadline, period, and worst case execution time of their parent task. In the presence of instruction and data caches this model is overly pessimistic, failing to account for the execution time benefit of cache hits when multiple threads of execution share a memory address space. This work takes a new perspective on instruction caches. Treating the cache as a benefit to schedulability for a single task with m threads. To realize the “inter-thread cache benefit” a new scheduling algorithm and accompanying worst-case execution time (WCET) calculation method are proposed. The scheduling algorithm permits threads to execute across conflict free regions, and blocks those threads that would create an unnecessary cache conflict. The WCET bound is determined for the entire set of m threads, rather than treating each thread as a distinct task. Both the scheduler and WCET method rely on the calculation of conflict free regions which are found by a static analysis method that relies on no external information from the system designer. By virtue of this perspective the system's total execution execution time is reduced and is reflected in a tighter WCET bound compared to the techniques applied to the classical model. Obtaining this tighter bound requires the integration of three typically independent areas: WCET, schedulability, and cache-related preemption delay analysis.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"350 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115231188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Trading Cores for Memory Bandwidth in Real-Time Systems
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461364
A. Alhammad, R. Pellizzoni
Federated scheduling has been proposed for parallel tasks. In this scheduling scheme, each parallel task is assigned a private set of cores, whereas sequential tasks share the remaining cores. Since parallel tasks are assigned dedicated cores, they receive no processor interference from other tasks. However, multicore processors are commonly built with shared main memory, whose bandwidth is limited and therefore subject to contention. Consequently, parallel tasks can still interfere with each other through the shared main memory. In this paper, we propose a novel method that is memory-aware when assigning cores to tasks. Our experimental results show a significant advantage of our method over memory-oblivious methods.
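As a rough illustration of what a memory-aware assignment step might look like, here is a C sketch; the task parameters, the reuse of the standard federated core-count formula, the bandwidth model, and all names are assumptions for illustration, not the method proposed in the paper.

    #include <math.h>
    #include <stdbool.h>

    /* Hypothetical parallel-task parameters. */
    struct ptask {
        double work;      /* total execution time C_i on one core */
        double span;      /* critical-path length L_i */
        double deadline;  /* relative deadline D_i */
        double mem_bw;    /* memory bandwidth the task may demand (GB/s) */
    };

    /* Standard federated core count: n_i = ceil((C_i - L_i) / (D_i - L_i)). */
    static int federated_cores(const struct ptask *t)
    {
        return (int)ceil((t->work - t->span) / (t->deadline - t->span));
    }

    /* Memory-aware assignment step: admit the task only if, besides enough
     * free cores, enough of the shared memory bandwidth budget remains;
     * otherwise the task must be rejected or the allocation re-planned. */
    static bool assign_memory_aware(const struct ptask *t,
                                    int *free_cores, double *free_bw)
    {
        int n = federated_cores(t);
        if (n > *free_cores || t->mem_bw > *free_bw)
            return false;
        *free_cores -= n;
        *free_bw   -= t->mem_bw;
        return true;
    }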
{"title":"Trading Cores for Memory Bandwidth in Real-Time Systems","authors":"A. Alhammad, R. Pellizzoni","doi":"10.1109/RTAS.2016.7461364","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461364","url":null,"abstract":"Federated scheduling has been proposed for parallel tasks. In this scheduling scheme, each parallel task is a assigned a private set of cores whereas sequential tasks share the remaining set of cores. Since parallel tasks are assigned dedicated cores, they receive no interference from other tasks. However, multicore processors are commonly built with shared main memory. The memory bandwidth is limited and therefore subject to contention. Consequently, parallel tasks can interfere with each other through the shared main memory. In this paper, we propose a novel method that is memory-aware when assigning cores to tasks. Our experimental results show a significant advantage of our method with respect to memory- oblivious methods.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114425012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Modeling and Verification of Dynamic Command Scheduling for Real-Time Memory Controllers
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461341
Yonghui Li, B. Akesson, Kai Lampka, K. Goossens
In modern multi-core systems with multiple real-time (RT) applications, memory traffic accessing the shared SDRAM is increasingly diverse, e.g., transactions have variable sizes. RT memory controllers with dynamic command scheduling can efficiently address this diversity by issuing appropriate commands subject to the SDRAM timing constraints. However, the scheduling dependencies between commands make it challenging to derive tight bounds on the worst-case response time (WCRT) and the worst-case bandwidth (WCBW) of a memory controller. Existing modeling and analysis techniques either do not provide tight WCRT and WCBW bounds for diverse memory traffic with variable transaction sizes or are difficult to adapt to different RT memory controllers. This paper models a memory controller using Timed Automata (TA), to which model checking is applied for analysis. Our TA model is modular and accurately captures the behavior of an RT memory controller with dynamic command scheduling. We obtain WCRT and WCBW bounds, which are validated by simulating the worst-case transaction traces obtained by model checking with a cycle-accurate model of the memory controller. Our method outperforms three state-of-the-art analysis techniques: we reduce the WCRT bound by up to 20%, with an average improvement of 7.7%, and increase the WCBW bound by up to 25%, with an average improvement of 13.6%. In addition, our modeling is generic enough to extend to memory controllers with different mechanisms.
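For intuition on why dynamic command scheduling is hard to bound, the C sketch below shows the kind of per-bank timing-constraint check a scheduler performs before issuing a command; the timing values, structure, and names are illustrative assumptions, not the controller or the TA model analyzed in the paper.

    #include <stdbool.h>
    #include <stdint.h>

    /* Illustrative SDRAM timing constraints in clock cycles (values assumed). */
    #define T_RCD 5   /* ACT -> RD/WR to the same bank */
    #define T_RP  5   /* PRE -> ACT to the same bank */
    #define T_RC  20  /* ACT -> ACT to the same bank */

    struct bank_state {
        uint64_t last_act;  /* cycle of the last ACT command */
        uint64_t last_pre;  /* cycle of the last PRE command */
        bool     open;      /* a row is currently open */
    };

    /* A read may only be issued once the row has been open for at least
     * tRCD cycles; otherwise the scheduler must wait or serve another bank,
     * and it is this interleaving that makes the timing hard to bound. */
    static bool can_issue_read(const struct bank_state *b, uint64_t now)
    {
        return b->open && now >= b->last_act + T_RCD;
    }

    static bool can_issue_activate(const struct bank_state *b, uint64_t now)
    {
        return !b->open &&
               now >= b->last_pre + T_RP &&
               now >= b->last_act + T_RC;
    }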
{"title":"Modeling and Verification of Dynamic Command Scheduling for Real-Time Memory Controllers","authors":"Yonghui Li, B. Akesson, Kai Lampka, K. Goossens","doi":"10.1109/RTAS.2016.7461341","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461341","url":null,"abstract":"In modern multi-core systems with multiple real-time (RT) applications, memory traffic accessing the shared SDRAM is increasingly diverse, e.g., transactions have variable sizes. RT memory controllers with dynamic command scheduling can efficiently address the diversity by issuing appropriate commands subject to the SDRAM timing constraints. However, the scheduling dependencies between commands make it challenging to derive tight bounds for the worst-case response time (WCRT) and the worst-case bandwidth (WCBW) of a memory controller. Existing modeling and analysis techniques either do not provide tight WCRT and WCBW bounds for diverse memory traffic with variable transaction sizes or are difficult to adapt to different RT memory controllers. This paper models a memory controller using Timed Automata (TA), where model checking is applied for analysis. Our TA model is modular and accurately captures the behavior of a RT memory controller with dynamic command scheduling. We obtain WCRT and WCBW bounds, which are validated by simulating the worst- case transaction traces obtained by model checking with a cycle-accurate model of the memory controller. Our method outperforms three state-of-the-art analysis techniques. We reduce WCRT bound by up to 20%, while the average improvement is 7.7%, and increase the WCBW bound by up to 25% with an average improvement of 13.6%. In addition, our modeling is generic enough to extend to memory controllers with different mechanisms.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128037768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Poster Abstract: Online Semi-Partitioned Multiprocessor Scheduling of Soft Real-Time Periodic Tasks for QoS Optimization
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461350
Behnaz Sanati, A. Cheng
Summary form only given. Multiprocessor real-time scheduling algorithms may follow a partitioned or global approach, or some hybrid of the two called semi-partitioning. Semi-partitioned real-time scheduling algorithms extend partitioned ones by allowing a subset of tasks to migrate. Given the goal of “less overhead”, it is desirable for such a strategy to be boundary-limited, allowing a migrating task to migrate only between successive invocations (job boundaries). Non-boundary-limited schedulers allow jobs themselves to migrate, which can be expensive in practice if jobs maintain much cached state. Previously proposed semi-partitioned algorithms for soft real-time (SRT) tasks, such as EDF-fm and EDF-os, have two phases: an offline assignment phase, where tasks are assigned to processors and fixed tasks (which do not migrate) are distinguished from migrating ones, and an online execution phase. In the execution phase, rules that extend EDF scheduling are used. These strategies aim to minimize tardiness. In this paper, we propose a new online reward-based semi-partitioning approach to schedule periodic soft real-time tasks on homogeneous multiprocessor systems. We use an online choice between two approximation algorithms, Greedy and Load-Balancing, for partitioning, which provides an optimized usage of processing time. In this method, no prior information is needed; hence, there is no offline phase. Our objective is to enhance the QoS by minimizing tardiness and maximizing the total reward obtained by completed tasks within a minimum makespan. Therefore, we allow different jobs of a task to be assigned to different processors (migration at job boundaries) based on their reward-based priorities and the workload of the processors. This method can also be extended to SRT systems with a mixed set of tasks (aperiodic, sporadic, and periodic) by defining their deadlines accordingly. Many real-time applications can benefit from this solution, including but not limited to video streaming servers, multi-player video games, mobile online banking, and medical monitoring systems.
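A minimal C sketch of such a boundary-limited, online job-to-processor assignment follows; the two heuristics shown (a deadline-driven first fit and a least-loaded fallback) only illustrate the idea of choosing between a Greedy and a Load-Balancing rule at each job release and are not the authors' actual rules.

    #define NUM_CPUS 4

    /* Current accumulated workload (in time units) on each processor. */
    static double load[NUM_CPUS];

    /* Load-Balancing rule: pick the processor with the smallest current load. */
    static int least_loaded(void)
    {
        int best = 0;
        for (int p = 1; p < NUM_CPUS; p++)
            if (load[p] < load[best])
                best = p;
        return best;
    }

    /* Greedy rule (illustrative): the first processor on which a job of
     * execution time c, released at time `now`, can still finish by its
     * absolute deadline d given the work already queued there. */
    static int first_fit_by_deadline(double c, double now, double d)
    {
        for (int p = 0; p < NUM_CPUS; p++)
            if (now + load[p] + c <= d)
                return p;
        return -1;
    }

    /* Assignment at a job boundary: different jobs of the same task may land
     * on different processors, i.e. the task migrates only between releases. */
    static int assign_job(double c, double now, double d)
    {
        int p = first_fit_by_deadline(c, now, d);
        if (p < 0)
            p = least_loaded();  /* no processor meets the deadline: balance load */
        load[p] += c;
        return p;
    }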
{"title":"Poster Abstract: Online Semi-Partitioned Multiprocessor Scheduling of Soft Real-Time Periodic Tasks for QoS Optimization","authors":"Behnaz Sanati, A. Cheng","doi":"10.1109/RTAS.2016.7461350","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461350","url":null,"abstract":"Summary form only given. Multiprocessor real-time scheduling algorithms may follow a partitioned or global approach or some hybrid of the two, called semi-partitioning. Semi-partitioned real-time scheduling algorithms extend partitioned ones by allowing a subset of tasks to migrate. Given the goal of “less overhead”, it is desirable for such strategy to be boundary-limited, and allow a migrating task to migrate only between successive invocations (job boundaries). Non-boundary-limited schedulers allow jobs to migrate, which can be expensive in practice, if jobs maintain much cached state. Previously proposed semi-partitioned algorithms for soft real-time (SRT) tasks such as EDF-fm and EDF-os, have two phases: an offline assignment phase, where tasks are assigned to processors and fixed tasks (which do not migrate) are distinguished from migrating ones; and an online execution phase. In their execution phase, rules that extend EDF scheduling are used. These strategies aim to minimize tardiness. In this paper, we propose a new online reward-based semi-partitioning approach to schedule periodic soft real-time tasks in homogeneous multiprocessor systems. We use an online choice of two approximation algorithms, Greedy and Load-Balancing, for partitioning, which provides an optimized usage of processing time. In this method, no prior information is needed. Hence, there is no offline phase. Our objective is to enhance the QoS by minimizing tardiness and maximizing the total reward obtained by completed tasks in minimum makespan. Therefore, we allow different jobs of any task get assigned to different processors (migration at job boundaries) based on their reward-based priorities and workload of the processors. This method can also extend to direct SRT systems with mixed set of tasks (aperiodic, sporadic and periodic) by defining their deadline accordingly. Many real-time applications can benefit from this solution including but not limited to video streaming servers, multi-player video games, mobile online banking and medical monitoring systems.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134371286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Multi-Objective Co-Optimization of FlexRay-Based Distributed Control Systems
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461344
Debayan Roy, Licong Zhang, Wanli Chang, Dip Goswami, S. Chakraborty
Recently, research on control and architecture co-design has been drawing increasing attention. These techniques integrate the design of the controllers and the architecture and exploit the characteristics of both sides to achieve more efficient designs of embedded control systems. However, several challenges remain, such as the large design space and inadequate trade-off opportunities between objectives like control performance and resource utilization. In this paper, we propose a co-optimization approach for FlexRay-based distributed control systems that synthesizes both the controllers and the task and communication schedules. This approach exploits FlexRay-specific protocol characteristics to reduce the complexity of the whole optimization problem, by employing a customized control design and a nested two-layered optimization technique. Compared to existing methods, the proposed approach is therefore more scalable. It also allows multi-objective optimization taking into account both the overall control performance and the bus resource utilization, and it generates a Pareto front representing the trade-off between the two, which allows engineers to make suitable design choices.
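To illustrate the final step of such a multi-objective flow, the following C sketch keeps only the non-dominated (control cost, bus utilization) design points out of a set of evaluated candidates, i.e., it computes the Pareto front; the data layout and names are assumptions, not the paper's tool.

    #include <stdbool.h>
    #include <stddef.h>

    /* One candidate design: a controller/schedule pair evaluated for overall
     * control cost and FlexRay bus utilization (lower is better for both). */
    struct design {
        double control_cost;
        double bus_util;
    };

    static bool dominates(const struct design *a, const struct design *b)
    {
        return a->control_cost <= b->control_cost && a->bus_util <= b->bus_util
            && (a->control_cost < b->control_cost || a->bus_util < b->bus_util);
    }

    /* Copy the non-dominated candidates into `out` and return how many were
     * kept; `out` must have room for at least n entries. */
    static size_t pareto_front(const struct design *cand, size_t n,
                               struct design *out)
    {
        size_t kept = 0;
        for (size_t i = 0; i < n; i++) {
            bool dominated = false;
            for (size_t j = 0; j < n && !dominated; j++)
                if (j != i && dominates(&cand[j], &cand[i]))
                    dominated = true;
            if (!dominated)
                out[kept++] = cand[i];
        }
        return kept;
    }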
{"title":"Multi-Objective Co-Optimization of FlexRay-Based Distributed Control Systems","authors":"Debayan Roy, Licong Zhang, Wanli Chang, Dip Goswami, S. Chakraborty","doi":"10.1109/RTAS.2016.7461344","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461344","url":null,"abstract":"Recently, research on control and architecture co- design has been drawing increasingly more attention. This is because these techniques integrate the design of the controllers and the architecture and explore the characteristics on both sides to achieve more efficient design of embedded control systems. However, there still exist several challenges like the large design space and inadequate trade-off opportunities for different objectives like control performance and resource utilization. In this paper, we propose a co-optimization approach for FlexRay-based distributed control systems, that synthesizes both the controllers and the task and communication schedules. This approach exploits some FlexRay protocol specific characteristics to reduce the complexity of the whole optimization problem. This is done by employing a customized control design and a nested two-layered optimization technique. Therefore, compared to existing methods, the proposed approach is more scalable. It also allows multi-objective optimization taking into account both the overall control performance and the bus resource utilization. This approach generates a Pareto front representing the trade-offs between these two, which allows the engineers to make suitable design choices.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131715467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Demo Abstract: Run-Time Monitoring Environments for Real-Time and Safety Critical Systems
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461333
Geoffrey Nelissen, H. Carvalho, David Pereira, E. Tovar
With the increasing complexity of embedded systems, it becomes unrealistic to formally verify that all the system requirements will be respected under any possible execution scenario. Moreover, the worst-case analyses that are usually performed before system deployment are based on a set of assumptions (e.g., minimum activation period, worst-case execution time, maximum release jitter) that may not always be respected at run-time. For those reasons, run-time monitoring and run-time verification become an interesting alternative to traditional offline verification. Run-time verification is based on the instrumentation of the target applications. Monitors are added to the system to verify at run-time that the system requirements are respected during execution. If a misbehaviour is detected, an alarm can be raised so as to trigger appropriate counter-measures (e.g., an execution mode change, or the reset or deactivation of some functionalities). In this work, we present four different implementations of a run-time monitoring framework suited to real-time and safety-critical systems. Two implementations are written in Ada and follow the Ravenscar profile, which makes them particularly suited to the development of high-integrity systems. The first version is available as a standalone library for Ada programs, while the second has been integrated in the GNAT run-time environment and instruments the ORK+ micro-kernel. Information on task scheduling events, originating directly from the kernel, can thus be used by the monitors to check whether the system follows all its requirements. The third implementation is a standalone library written in C++ that can be used in any POSIX-compliant run-time environment; it is therefore compatible with the vast majority of operating systems used in embedded systems. The last implementation is a loadable kernel module for Linux. Its main advantage is that it enforces complete spatial partitioning between the monitors and the monitored applications, making it impossible for memory faults to propagate and corrupt the state of the monitors.
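As a flavour of what such instrumentation-based monitoring looks like in a POSIX environment, here is a minimal C sketch that checks a minimum inter-arrival time at every job release; it is not the framework's actual API, and the property, constant, and function names are assumptions.

    #include <stdbool.h>
    #include <stdio.h>
    #include <time.h>

    /* Minimum inter-arrival time the monitored task is assumed to respect. */
    #define MIN_PERIOD_NS 10000000L  /* 10 ms */

    static struct timespec last_release;
    static bool have_last;

    /* Called from the instrumented application at every job release.
     * Returns false (and raises an alarm) if two releases are closer
     * together than the assumed minimum inter-arrival time. */
    static bool monitor_release(void)
    {
        struct timespec now;
        clock_gettime(CLOCK_MONOTONIC, &now);
        if (have_last) {
            long long delta = (now.tv_sec - last_release.tv_sec) * 1000000000LL
                            + (now.tv_nsec - last_release.tv_nsec);
            if (delta < MIN_PERIOD_NS) {
                fprintf(stderr, "monitor: inter-arrival %lld ns < %ld ns\n",
                        delta, MIN_PERIOD_NS);
                return false;  /* trigger a counter-measure here */
            }
        }
        last_release = now;
        have_last = true;
        return true;
    }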
{"title":"Demo Abstract: Run-Time Monitoring Environments for Real-Time and Safety Critical Systems","authors":"Geoffrey Nelissen, H. Carvalho, David Pereira, E. Tovar","doi":"10.1109/RTAS.2016.7461333","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461333","url":null,"abstract":"With the increasing complexity of embedded systems, it becomes unrealistic to formally verify that all the system requirements will be respected under any possible execution scenario. Moreover, the worst-case analyses that are usually performed before the system deployment are also based on a set of assumptions (e.g., minimum activation period, worst-case execution time, maximum release jitter) that may not always be respected at run-time. For those reasons, run-time monitoring and run-time verification become an interesting alternative to the traditional offline verification. Run-time verification is based on the instrumentation of the target applications. Monitors are then added to the system to verify at run-time that the system requirements are respected during the execution. If a misbehaviour is detected, an alarm can be raised so as to trigger appropriate counter-measures (e.g., execution mode change, reset or deactivation of some of the functionalities). In this work, we present four different implementations of a run-time monitoring framework suited to real-time and safety critical systems. Two implementations are written in Ada and follow the Ravenscar profile, which make them particularly suited to the development of high integrity systems. The first version is available as a standalone library for Ada programs while the second has been integrated in the GNAT run-time environment and instruments the ORK+ micro-kernel. Information on the task scheduling events, directly originating from the kernel, can thus be used by the monitors to check if the system follows all its requirements. The third implementation is a standalone library written in C++ that can be used in any POSIX compliant run-time environment. It is therefore compatible with the vast majority of operating systems used in embedded systems. The last implementation is a loadable kernel module for Linux. It has for main advantage to be able to enforce complete space partitioning between the monitors and the monitored applications. It is therefore impossible for memory faults to propagate and corrupt the state of the monitors.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133194654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Poster Abstract: Cache Persistence Aware Response Time Analysis for Fixed Priority Preemptive Systems
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461347
Syed Aftab Rashid, Geoffrey Nelissen, E. Tovar
Summary form only given. The gap between processor and main memory operating speeds necessitates the use of intermediate cache memories to accelerate the average-case access time to the instructions and data that the processor must execute or operate on. However, the introduction of cache memories in modern computing platforms causes large variations in the execution time of each instruction, depending on whether the instruction and the data it uses are already loaded in the cache or not. Existing worst-case response time (WCRT) analyses assume that each job released by a preempting task incurs its worst-case memory demand. This is pessimistic, since there is a high chance that a large portion of the instructions and data associated with a preempting task τj are still available in the cache when τj releases its next jobs. We call this content persistent cache blocks (PCBs). In this work, we propose a method to accurately bound the memory overhead incurred by a low-priority task due to high-priority tasks executing during its response time. For this purpose, we first identify the persistent and non-persistent cache blocks (i.e., PCBs and nPCBs) associated with each task. We then show with an example that, due to the existence of PCBs, the memory demand of a task can vary significantly over time. Accounting for PCBs in the memory demand of the preempting task therefore reduces the pessimism in the total memory demand considered by the WCRT analysis. Finally, we propose a refined WCRT analysis for fixed-priority preemptive systems that considers (i) the effect of PCBs on the memory demand of the preempting task, and (ii) the number of PCBs that can be evicted by the preempted tasks between two successive job releases of the preempting task.
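For context, a persistence-aware analysis of this kind can be thought of as refining the classical memory-demand-augmented response-time recurrence. The formulation below is only a sketch of that shape, with assumed notation (C_i: WCET, T_j: period, hp(i): higher-priority tasks, MD_j: worst-case memory demand, MD_j^{res}: its non-persistent residual); it is not the paper's exact analysis. Classical form, with every preempting job charged its full memory demand:

    R_i^{(k+1)} = C_i + MD_i + \sum_{j \in hp(i)} \left\lceil \frac{R_i^{(k)}}{T_j} \right\rceil \left( C_j + MD_j \right)

Persistence-aware refinement, where only the first job of each preempting task pays the full demand and later jobs pay only the residual for the blocks that did not persist:

    R_i^{(k+1)} = C_i + MD_i + \sum_{j \in hp(i)} \left[ \left( C_j + MD_j \right) + \left( \left\lceil \frac{R_i^{(k)}}{T_j} \right\rceil - 1 \right) \left( C_j + MD_j^{res} \right) \right]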
{"title":"Poster Abstract: Cache Persistence Aware Response Time Analysis for Fixed Priority Preemptive Systems","authors":"Syed Aftab Rashid, Geoffrey Nelissen, E. Tovar","doi":"10.1109/RTAS.2016.7461347","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461347","url":null,"abstract":"Summary form only given. The existing gap between the processor and main memory operating speeds necessitates the use of intermediate cache memories to accelerate the average case access time to instructions and data that must be executed or treated on the processor. However, the introduction of cache memories in modern computing platforms is the cause of big variations in the execution time of each instruction depending on whether the instruction and the data it treats are already loaded in the cache or not. During the worst-case response time (WCRT) analysis, the existing works assume that each job released by the preempting tasks will ask for their worst-case memory demand. This is however pessimistic since there is a high chance that a big portion of the instructions and data associated with the preempting task τj , are still available in the cache when τj releases its next jobs. We call this content persistent cache blocks (PCBs). In this work, we propose a method to accurately bound the memory overhead incurred by a low priority task due to high priority tasks executing during its response time. For this purpose, we first identify the existence of persistent and nonpersistent cache blocks (i.e., PCBs and nPCBs) associated with each task. We then show with an example that due to the existence of PCBs, the memory demand of a task can significantly vary over time. Therefore, accounting for PCBs in the memory demand of the preempting task allows to reduce the pessimism on the total memory demand considered by the WCRT analysis. Finally, we propose a refined WCRT analysis for fixed priority preemptive systems considering (i) the effect of PCBs on the memory demand of the preempting task, and (ii) accounting for the number of PCBs that can be evicted by the preempted tasks between two successive job releases of the preempting tasks.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125413542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Taming Non-Blocking Caches to Improve Isolation in Multicore Real-Time Systems
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461361
P. K. Valsan, H. Yun, F. Farshchi
In this paper, we show that cache partitioning does not necessarily ensure predictable cache performance in modern COTS multicore platforms that use non-blocking caches to exploit memory-level parallelism (MLP). Through carefully designed experiments using three real COTS multicore platforms (four distinct CPU architectures) and a cycle-accurate full-system simulator, we show that special hardware registers in non-blocking caches, known as Miss Status Holding Registers (MSHRs), which track the status of outstanding cache misses, can be a significant source of contention; we observe up to a 21x WCET increase on a real COTS multicore platform due to MSHR contention. We propose a collaborative hardware and system software (OS) approach to efficiently eliminate MSHR contention for multicore real-time systems. Our approach includes a low-cost hardware extension that enables dynamic control of per-core MLP by the OS. Using the hardware extension, the OS scheduler globally controls each core's MLP in such a way that MSHR contention is eliminated and the overall throughput of the system is maximized. We implement the hardware extension in a cycle-accurate full-system simulator and the scheduler modification in the Linux 3.14 kernel. We evaluate the effectiveness of our approach using a set of synthetic and macro benchmarks. In a case study, we achieve up to a 19% WCET reduction (average: 13%) for a set of EEMBC benchmarks compared to a baseline cache partitioning setup.
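A sketch of the kind of OS-level policy such a hardware extension enables is shown below; the register interface, MSHR count, and partitioning rule are hypothetical and only illustrate the idea of keeping the per-core MLP limits within the shared MSHR budget, not the mechanism implemented in the paper.

    #include <stdbool.h>

    #define NUM_CORES   4
    #define TOTAL_MSHRS 16   /* MSHRs in the shared last-level cache (assumed) */

    /* Placeholder for the hypothetical per-core MLP-limit register exposed by
     * the proposed hardware extension; a real kernel would write an MSR or a
     * memory-mapped register here. */
    static void set_core_mlp_limit(int core, int limit)
    {
        (void)core;
        (void)limit;
    }

    static int mlp_limit[NUM_CORES];

    /* OS-level policy sketch: give the real-time core the MLP it asks for and
     * split whatever remains among the other cores, so the per-core limits
     * never sum to more than the shared MSHRs and no core can be stalled by
     * another core exhausting them. */
    static bool partition_mlp(int rt_core, int rt_mlp)
    {
        if (rt_mlp > TOTAL_MSHRS)
            return false;
        int rest = (TOTAL_MSHRS - rt_mlp) / (NUM_CORES - 1);
        for (int c = 0; c < NUM_CORES; c++) {
            mlp_limit[c] = (c == rt_core) ? rt_mlp : rest;
            set_core_mlp_limit(c, mlp_limit[c]);
        }
        return true;
    }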
{"title":"Taming Non-Blocking Caches to Improve Isolation in Multicore Real-Time Systems","authors":"P. K. Valsan, H. Yun, F. Farshchi","doi":"10.1109/RTAS.2016.7461361","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461361","url":null,"abstract":"In this paper, we show that cache partitioning does not necessarily ensure predictable cache performance in modern COTS multicore platforms that use non-blocking caches to exploit memory- level-parallelism (MLP). Through carefully designed experiments using three real COTS multicore platforms (four distinct CPU architectures) and a cycle- accurate full system simulator, we show that special hardware registers in non-blocking caches, known as Miss Status Holding Registers (MSHRs), which track the status of outstanding cache-misses, can be a significant source of contention; we observe up to 21X WCET increase in a real COTS multicore platform due to MSHR contention. We propose a hardware and system software (OS) collaborative approach to efficiently eliminate MSHR contention for multicore real-time systems. Our approach includes a low-cost hardware extension that enables dynamic control of per-core MLP by the OS. Using the hardware extension, the OS scheduler then globally controls each core's MLP in such a way that eliminates MSHR contention and maximizes overall throughput of the system. We implement the hardware extension in a cycle- accurate fullsystem simulator and the scheduler modification in Linux 3.14 kernel. We evaluate the effectiveness of our approach using a set of synthetic and macro benchmarks. In a case study, we achieve up to 19% WCET reduction (average: 13%) for a set of EEMBC benchmarks compared to a baseline cache partitioning setup.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132551215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: A Kernel for Energy-Neutral Real-Time Systems with Mixed Criticalities
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461320
Peter Wägemann, T. Distler, Heiko Janker, Phillip Raffeck, V. Sieh
Energy-neutral real-time systems harvest the entire energy they use from their environment, making it essential to treat energy as a resource as important as time. As a result, such systems need to solve a number of problems that so far have not been addressed by traditional real-time systems. In particular, this includes the scheduling of tasks with both time and energy constraints, the monitoring of energy budgets, and surviving blackout periods during which not enough energy is available to keep the system fully operational. In this paper, we address these issues by presenting ENOS, an operating-system kernel for energy-neutral real-time systems. ENOS considers mixed time-criticality levels for different energy-criticality modes, which enables a decoupling of time and energy constraints during phases when one is considered less critical than the other. When switching the energy-criticality mode, the system also changes the set of tasks to be executed and is therefore able to dynamically adapt its energy consumption to external conditions. By keeping track of the available energy budget, ENOS ensures that in case of a blackout the system state is safely stored to persistent memory, allowing operation to resume at a later point when enough energy has been harvested again.
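A minimal C sketch of budget-driven energy-criticality mode switching is shown below; the thresholds, mode names, and checkpoint hook are assumptions for illustration, not the ENOS interface.

    /* Illustrative energy thresholds in millijoules (values assumed). */
    #define BUDGET_NORMAL_MJ   500
    #define BUDGET_CRITICAL_MJ 100

    enum energy_mode { ENERGY_NORMAL, ENERGY_CRITICAL, ENERGY_BLACKOUT };

    static enum energy_mode mode = ENERGY_NORMAL;

    /* Placeholder: a real kernel would checkpoint the run-time state to
     * persistent memory here so execution can resume later. */
    static void checkpoint_to_persistent_memory(void) { }

    /* Called periodically with the currently available energy budget; selects
     * the energy-criticality mode, which in turn selects the task set that is
     * allowed to run. */
    static void update_energy_mode(unsigned budget_mj)
    {
        if (budget_mj >= BUDGET_NORMAL_MJ) {
            mode = ENERGY_NORMAL;          /* run all tasks */
        } else if (budget_mj >= BUDGET_CRITICAL_MJ) {
            mode = ENERGY_CRITICAL;        /* run only energy-critical tasks */
        } else {
            mode = ENERGY_BLACKOUT;        /* about to lose power */
            checkpoint_to_persistent_memory();
        }
    }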
{"title":"A Kernel for Energy-Neutral Real-Time Systems with Mixed Criticalities","authors":"Peter Wägemann, T. Distler, Heiko Janker, Phillip Raffeck, V. Sieh","doi":"10.1109/RTAS.2016.7461320","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461320","url":null,"abstract":"Energy-neutral real-time systems harvest the entire energy they use from their environment, making it essential to treat energy as an equally important resource as time. As a result, such systems need to solve a number of problems that so far have not been addressed by traditional real-time systems. In particular, this includes the scheduling of tasks with both time and energy constraints, the monitoring of energy budgets, as well as the survival of blackout periods during which not enough energy is available to keep the system fully operational. In this paper, we address these issues presenting ENOS, an operating-system kernel for energy-neutral real-time systems. ENOS considers mixed time criticality levels for different energy criticality modes, which enables a decoupling of time and energy constraints during phases when one is considered less critical than the other. When switching the energy criticality mode, the system also changes the set of tasks to be executed and is therefore able to dynamically adapt its energy consumption depending on external conditions. By keeping track of the energy budget available, ENOS ensures that in case of a blackout the system state is safely stored to persistent memory, allowing operations to resume at a later point when enough energy is harvested again.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114526668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Title: Complete, High-Assurance Determination of Loop Bounds and Infeasible Paths for WCET Analysis
Pub Date: 2016-04-11
DOI: 10.1109/RTAS.2016.7461326
Thomas Sewell, Felix Kam, G. Heiser
Worst-case execution time (WCET) analysis of real-time code needs to be performed on the executable binary code for soundness. The determination of loop bounds and the elimination of infeasible paths, both essential for obtaining tight bounds, frequently depend on program state that is difficult to extract from a static analysis of the binary. Obtaining this information generally requires manual intervention, or compiler modifications to preserve more semantic information from the source program. We propose an alternative approach, which leverages an existing translation-validation framework to enable high-assurance, automatic determination of loop bounds and infeasible paths. We show that this approach automatically determines all loop bounds and many (possibly all) infeasible paths in the seL4 microkernel, as well as in standard WCET benchmarks that fall within the language subset of our C parser.
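The small, made-up C example below illustrates the kind of loop whose bound is hard to recover from the binary alone, because it depends on a value established by earlier program state; at the source level the bound is explicit, which is exactly the sort of information a source-to-binary translation-validation link helps recover.

    /* The loop bound depends on `len`, which binary-level analysis only sees
     * as a register value; at the source level the preceding check clearly
     * caps it at 64, so 64 iterations is a sound and tight bound. */
    #define MAX_LEN 64

    static unsigned checksum(const unsigned char *buf, unsigned len)
    {
        unsigned sum = 0;

        if (len > MAX_LEN)      /* establishes the loop bound */
            len = MAX_LEN;

        for (unsigned i = 0; i < len; i++)
            sum += buf[i];

        return sum;
    }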
{"title":"Complete, High-Assurance Determination of Loop Bounds and Infeasible Paths for WCET Analysis","authors":"Thomas Sewell, Felix Kam, G. Heiser","doi":"10.1109/RTAS.2016.7461326","DOIUrl":"https://doi.org/10.1109/RTAS.2016.7461326","url":null,"abstract":"Worst-case execution time (WCET) analysis of real-time code needs to be performed on the executable binary code for soundness. Determination of loop bounds and elimination of infeasible paths, essential for obtaining tight bounds, frequently depends on program state that is difficult to extract from static analysis of the binary. Obtaining this information generally requires manual intervention, or compiler modifications to preserve more semantic information from the source program. We propose an alternative approach, which leverages an existing translation-validation framework, to enable high-assurance, automatic determination of loop bounds and infeasible paths. We show that this approach automatically determines all loop bounds and many (possibly all) infeasible paths in the seL4 microkernel, as well as in standard WCET benchmarks which are in the language subset of our C parser.","PeriodicalId":338179,"journal":{"name":"2016 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121601113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}