SIGBED Rev.: latest articles

Traffic-aware reconfigurable architecture for fault-tolerant 2D mesh NoCs

SIGBED Rev.

Pub Date : 2018-08-15 DOI: 10.1145/3267419.3267423

Poona Bahrebar, D. Stroobandt

With the aggressive scaling of VLSI technology, Networks-on-Chip (NoCs) are becoming more susceptible to faults. Therefore, designing reliable and efficient NoCs is of significant importance. The rerouting approach employed in most fault-tolerant methods degrades network performance considerably, because packets take longer paths and hotspots form around the faults. Moreover, these methods cannot adapt to the dynamic traffic distribution in the network. Considering the increasing demands for real-time systems, the necessity for designing reconfigurable and robust NoCs is even more pronounced. In this paper, a dynamically reconfigurable technique is proposed to address fault-tolerance and minimal routing in mesh NoCs. To accomplish this goal, the router architecture is modified to enable the frequently communicating nodes to bypass the faulty router and communicate through shorter paths. Thus, not only is rerouting minimized, but the connectivity of the network is also maintained in the vicinity of faults. The experimental results validate the performance and reliability of the proposed technique with a small hardware overhead.

Citations: 3

Impact of time-triggered transmission window placement on rate-constrained traffic in TTEthernet networks

SIGBED Rev.

Pub Date : 2018-08-15 DOI: 10.1145/3267419.3267420

Florian Heilmann, G. Fohler

Safety-critical Ethernet-based networks are receiving significant attention in the avionics, automotive and industrial domains. Time-Triggered Ethernet (TTEthernet, AS6802) provides safety-critical transmission guarantees via a high-priority, time-triggered (TT) traffic class and a lower-priority, rate-constrained (RC) traffic class. TT traffic is transmitted between synchronized nodes of a TTEthernet network in offline-scheduled TT transmission windows. In this work, we analyze the impact of different placement strategies for these TT transmission windows on the end-to-end delay and jitter of RC messages on the same path segment. We show that, depending on the placement of TT transmission windows in a schedule, the end-to-end delay and jitter of RC messages can vary significantly. We further introduce link-based offsets, a new placement strategy for TT transmission windows that reduces their impact on RC traffic. In this strategy, offsets are applied to all TT transmission windows in a physical link schedule to reduce the amount of time that an RC message on the same physical link is delayed by TT traffic. The link-based offsets strategy can be implemented in the TTEthernet scheduler and does not require hardware modifications. We show that the link-based offset strategy can reduce the end-to-end delay and jitter of RC traffic, and evaluate our claims using an OMNeT++ simulation.

Citations: 2

Routing algorithms for IEEE802.1Qbv networks

SIGBED Rev.

Pub Date : 2018-08-15 DOI: 10.1145/3267419.3267421

N. Nayak, Frank Dürr, K. Rothermel

The recently published IEEE 802.1Qbv standard specifies enhancements for providing real-time communication guarantees for time-triggered flows while also handling best-effort traffic in a converged Ethernet network. The enhancements include a programmable time-based gating mechanism for stipulating which of the queues of an egress port are available for transmission at any given point in time. By appropriately programming (opening and closing) these gates, the traversal of packets through the network can be controlled to precisely follow a precomputed schedule that satisfies the timing constraints of the time-triggered flows. Computing such transmission schedules requires routing the flows in the first step, followed by the computation of gate schedules for the flows along their respective routes. So far, off-the-shelf algorithms like shortest path routing, which optimize the number of hops over which flows are routed, have been used for computing routes for the time-triggered traffic. In this paper, we explore how the routing of time-triggered flows affects their schedulability. Moreover, we identify additional parameters that must be considered while routing time-triggered traffic and propose ILP-based algorithms for this purpose. Our evaluations show that the proposed routing algorithms could improve the slack in the computed schedules by up to 60% and 30% compared to shortest path routing and equal cost multi-pathing (ECMP), respectively, and thus increase the capacity of the network to accommodate more time-triggered traffic.
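
The time-based gating mechanism described in this abstract can be illustrated with a small sketch. It is not taken from the paper or from the standard's text: the `GateEntry` type, the `open_queues_at` helper, and the 1 ms example cycle are all hypothetical names and values chosen for illustration. The sketch only shows the idea of a cyclic list of entries, each stating which egress queues may transmit for a given duration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GateEntry:
    """One entry of a (hypothetical) cyclic gate control list."""
    duration_ns: int        # how long this entry stays active
    open_queues: frozenset  # indices of queues whose gates are open


def open_queues_at(gcl, t_ns):
    """Return the queues allowed to transmit at time t_ns.

    The list repeats cyclically; its cycle time is the sum of all
    entry durations.
    """
    cycle_ns = sum(entry.duration_ns for entry in gcl)
    offset = t_ns % cycle_ns
    for entry in gcl:
        if offset < entry.duration_ns:
            return entry.open_queues
        offset -= entry.duration_ns
    raise AssertionError("offset always falls inside the cycle")


# Assumed example: a 1 ms cycle in which queue 7 (time-triggered traffic)
# gets an exclusive 200 us window and queues 0-6 (best effort) get the rest.
GCL = [
    GateEntry(200_000, frozenset({7})),
    GateEntry(800_000, frozenset(range(7))),
]

if __name__ == "__main__":
    for t in (0, 199_999, 200_000, 1_000_000):
        print(t, sorted(open_queues_at(GCL, t)))
```

Computing a schedule in the sense of the abstract would mean choosing the entries of such a list for every egress port along each flow's route; the sketch covers only the lookup side, not the routing or scheduling algorithms the paper proposes.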

Citations: 40

Performance enhancement of extended AFDX via bandwidth reservation for TSN/BLS shapers

SIGBED Rev.

Pub Date : 2018-07-03 DOI: 10.1145/3314206.3314209

A. Finzi, A. Mifdaoui, F. Frances, E. Lochin

To support mixed-criticality applications, the AFDX may integrate multiple traffic classes: Safety-Critical Traffic (SCT) with hard real-time constraints, Rate-Constrained (RC) traffic requiring bounded latencies, and Best Effort (BE) traffic with no delivery constraints. These traffic classes are managed by a Non-Preemptive Strict Priority (NP-SP) scheduler, where the highest-priority traffic (SCT) is shaped with a Burst Limiting Shaper (BLS). The latter has been defined by the Time Sensitive Networking (TSN) task group to limit the impact of high-priority flows on lower-priority ones. This paper proposes two bandwidth reservation methods for BLS shapers in AFDX networks. The proposed methods are evaluated on a realistic AFDX configuration. Results show that they noticeably improve the RC delay bounds and the SCT schedulability in comparison to an intuitive method.

Citations: 4

Device microagent for IoT home gateway: a lightweight plug-n-play architecture

SIGBED Rev.

Pub Date : 2018-06-05 DOI: 10.1145/3231535.3231537

D. Chattopadhyay, Abinash Samantaray, Anupam Datta

Smart home implementation in IoT involves practical challenges of management and scalability when connecting various non-IP end-devices, i.e. sensors and actuators, behind the connected home gateway. While there are separate standards for interaction between IoT service and home gateway and between gateway and a variety of end-devices, there remains a disconnect regarding how these two ends meet in an adaptable and scalable way. In this paper we present a lightweight, loosely coupled architecture for an IoT smart home gateway whereby end-devices can be added dynamically on the gateway without disrupting long-haul communication between the IoT cloud service and the gateway. The gateway agent exchanges data through sensor-block or actuator-block with end-devices via device microagents, and the protocol-specific read-write task is offloaded to the individual device microagent. This hybrid approach of integrating MQTT pub/sub flexibility with LWM2M RESTful adaptability results in a plug-n-play modular agent architecture for home gateway management in IoT applications.

Citations: 3

Where is PELE?: pervasive localization using wearable and handheld devices

SIGBED Rev.

Pub Date : 2018-06-05 DOI: 10.1145/3231535.3231536

L. H. John, Chayan Sarkar, R. V. Prasad

Smartphones, or handhelds in general, which are commonly used for indoor localization, are not a viable option in places where people do not carry them all the time, for example at home and in the office. Wearable devices can partially solve this problem but have many limitations with respect to power supply, processing capability, and availability of sensors. These issues prevent the adoption of many common handheld localization solutions. In this work, we present the PErvasive Localization Engine (PELE), a distributed localization system that uses wearable and handheld devices jointly to address the above drawbacks. Using only a magnetometer, an accelerometer, and a Bluetooth radio, localization is performed by means of a particle filter. In addition, a dynamic handoff mechanism is presented, which uses the wearable only when necessary, thus reducing energy consumption on the wearable without affecting the desired localization accuracy. Evaluating the system with ten participants, we achieve a localization accuracy of 90.31% in an indoor environment spanning about 320 m².

Citations: 2

Zero energy visible light communication receiver for embedded applications

SIGBED Rev.

Pub Date : 2018-06-05 DOI: 10.1145/3231535.3231540

T. V. Prabhakar, V. Shashidhar, G. Meghana, R. V. Prasad, Garani Vittal Pranavendra

The Internet of Things is bringing multiple domains and multiple avenues to connect anything and everything. It mainly uses RF connectivity. Recently, however, visible light communication (VLC) is also being explored. VLC has unique properties with respect to the privacy and security that it provides. Although the transmission is broadcast in nature, the receiver needs to be in the vicinity of the transmitter, thus providing secure communications. Further, when low-power receivers need to be constructed, it is important to harness energy from the transmission itself. In this article we propose a novel design for a receiver to be used in VLC for embedded systems. The setup uses a small solar panel (2 mm x 2 mm) as a medium to simultaneously harvest incident light energy and receive data bit streams. The LED source was modulated using On-Off Keying. The receiver works for close-range communications. This paper presents an experimental evaluation of the system. We could detect the signals from the source using energy harvested from the same transmission.
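
As an illustration of the On-Off Keying modulation mentioned in this abstract, the following sketch encodes bits as high or low light levels and recovers them by averaging each bit period and comparing against a threshold. It is a generic, idealized example and not the authors' receiver design: the function names, the four samples per bit, and the 0.5 threshold are assumptions made here, and the energy-harvesting side is not modelled.

```python
def ook_modulate(bits, samples_per_bit=4, high=1.0, low=0.0):
    """Map each bit to a run of 'high' (light on) or 'low' (light off) samples."""
    samples = []
    for bit in bits:
        samples.extend([high if bit else low] * samples_per_bit)
    return samples


def ook_demodulate(samples, samples_per_bit=4, threshold=0.5):
    """Recover bits by averaging each bit period and thresholding the mean."""
    bits = []
    for start in range(0, len(samples) - samples_per_bit + 1, samples_per_bit):
        period = samples[start:start + samples_per_bit]
        mean_level = sum(period) / samples_per_bit
        bits.append(1 if mean_level > threshold else 0)
    return bits


if __name__ == "__main__":
    message = [1, 0, 1, 1, 0]
    received = ook_demodulate(ook_modulate(message))
    print(received == message)
```

In a real receiver the threshold would have to track ambient light and the sampling would have to be synchronized with the transmitter's bit clock; both are left out of this sketch.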

Citations: 0

Lightweight IO virtualization on MPU enabled microcontrollers

SIGBED Rev.

Pub Date : 2018-03-20 DOI: 10.1145/3199610.3199617

F. Paci, D. Brunelli, L. Benini

In the era of the Internet of Things (IoT), millions of devices and embedded platforms based on low-cost, resource-limited microcontroller units (MCUs) will be used in continuous operation. Even though over-the-air firmware update is a common feature today, many applications may need to avoid rebooting or to support hardware resource sharing. In such a context, stopping, updating and rebooting the platform is impractical, and dynamic loading of new user code is required. This in turn requires mechanisms to protect the MCU hardware resources and the continuously executing system tasks from uncontrolled perturbation caused by new user code being dynamically loaded. In this paper, we present a framework which provides a lightweight virtualization of the IO and platform peripherals and permits the dynamic loading of new user code. The aim of this work is to support critical isolation features typical of virtualization-ready CPUs on low-cost, low-power microcontrollers with no MMU (Memory Management Unit), IOMMU or dedicated instruction extensions. Our approach only leverages the Memory Protection Unit (MPU), which is generally available in all ARM Cortex-M3 and Cortex-M4 microcontrollers. Experimental evaluations demonstrate not only the feasibility but also the very low impact of the proposed framework in terms of memory requirements and runtime overhead.

Citations: 7

Aligning Deos and RTEMS with the FACE safety base operating system profile

SIGBED Rev.

Pub Date : 2018-03-20 DOI: 10.1145/3199610.3199612

Gedare Bloom, J. Sherrill, G. Gilliland

The Open Group Future Airborne Capability Environment (FACE™) Consortium has developed a reference architecture and standard for real-time embedded avionics systems. The FACE Technical Standard defines required capabilities for real-time operating systems (RTOS), portable components, and a shared data model to facilitate information exchange between components. FACE RTOS requirements are based on ARINC 653 and POSIX 1003.1b with tailoring to address the safety and security needs of avionics systems. Deos is a safety-certified RTOS that supports ARINC 653 but not POSIX. In contrast, RTEMS is an open source RTOS that supports POSIX but not ARINC 653. Integrating a paravirtualized RTEMS with Deos combines the strengths of both and provides a path to conformance with the FACE Safety Base operating system profile. This paper presents the FACE operating system profiles and discusses the technical challenges of the paravirtualization and integration effort.

Citations: 1

A runtime controller for OpenCL applications on heterogeneous system architectures

SIGBED Rev.

Pub Date : 2018-03-20 DOI: 10.1145/3199610.3199614

C. Bolchini, Stefano Cherubin, Gianluca Durelli, Simone Libutti, A. Miele, M. Santambrogio

Heterogeneous System Architectures (HSAs) are becoming very attractive in the embedded and mobile markets thanks to the possibility of selecting the best computational resource among the available compute units to optimize the performance-per-watt figure of merit. In this scenario, OpenCL is becoming the standard paradigm for heterogeneous computing, supporting the programming of all types of units with a single abstraction level. However, the decision of which resource to use, together with its architectural tuning, is still left to the programmer; this issue is exacerbated by the fact that the choice also depends on the actual conditions in which the system is operating. This work proposes a runtime controller, integrated in the Linux operating system (OS), that optimizes the power efficiency of a running OpenCL application by deciding the system configuration. Our experimental results over a set of applications from the Polybench suite on the Odroid XU3 board show that our controller is able to obtain more than 90% of the power efficiency achievable via offline profiling.

Citations: 15
