2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)最新文献

英文中文

An online overclocking scheme for bursty real-time tasks and an evaluation of its thermal impact 突发实时任务的在线超频方案及其热影响评估

2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)

Pub Date : 2016-10-01 DOI: 10.1145/2993452.2993568

Björn Forsberg, Kai Lampka, Vasileios Spiliopoulos

This paper proposes a scheme which drives a processor beyond its rated operation frequency, e. g., by exploiting Intel's boost technology, to digest the peak workload of the system in time. In the setting of deadline constrained workloads, this is far from trivial: the boost mode can only be used during short time spans, therefore it can only help to digest the peak workload, rather than serving the normal case. A lowered processor frequency, used outside the peak workload time, yields a backlog of not completed jobs. This backlog may result in deadline violations or buffer overflows, if the next burst of job arrivals appears too early. To overcome the above problem, we propose a peak workload aware speed assignment strategy, which only allows the system to build up computation backlog if the absence of high computation demands is assured. Contrasting the existing body of work, we take advantage of bursty arrival patterns of compute jobs, thereby progressing over the standard (non-bursty sporadic) job release model. Together with our scheme, we also present a tool chain and simulations of synthetic workloads for investigating the thermal effects of different speed assignment strategies.

本文提出了一种驱动处理器超过其额定工作频率的方案，如利用英特尔的boost技术，及时消化系统的峰值工作负载。在设置受截止日期限制的工作负载时，这一点非常重要:boost模式只能在短时间内使用，因此它只能帮助消化高峰工作负载，而不能服务于正常情况。在高峰工作负载时间之外使用较低的处理器频率，会产生未完成作业的积压。如果下一批工作到达的时间太早，这种积压可能会导致违反截止日期或缓冲区溢出。为了克服上述问题，我们提出了一种峰值负载感知的速度分配策略，该策略只允许系统在没有高计算需求的情况下建立计算积压。与现有的工作主体相比，我们利用了计算作业的突发到达模式，从而超越了标准的(非突发的零星)作业释放模型。结合我们的方案，我们还提供了一个工具链和模拟的合成工作负载来研究不同的速度分配策略的热效应。

{"title":"An online overclocking scheme for bursty real-time tasks and an evaluation of its thermal impact","authors":"Björn Forsberg, Kai Lampka, Vasileios Spiliopoulos","doi":"10.1145/2993452.2993568","DOIUrl":"https://doi.org/10.1145/2993452.2993568","url":null,"abstract":"This paper proposes a scheme which drives a processor beyond its rated operation frequency, e. g., by exploiting Intel's boost technology, to digest the peak workload of the system in time. In the setting of deadline constrained workloads, this is far from trivial: the boost mode can only be used during short time spans, therefore it can only help to digest the peak workload, rather than serving the normal case. A lowered processor frequency, used outside the peak workload time, yields a backlog of not completed jobs. This backlog may result in deadline violations or buffer overflows, if the next burst of job arrivals appears too early. To overcome the above problem, we propose a peak workload aware speed assignment strategy, which only allows the system to build up computation backlog if the absence of high computation demands is assured. Contrasting the existing body of work, we take advantage of bursty arrival patterns of compute jobs, thereby progressing over the standard (non-bursty sporadic) job release model. Together with our scheme, we also present a tool chain and simulations of synthetic workloads for investigating the thermal effects of different speed assignment strategies.","PeriodicalId":198459,"journal":{"name":"2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124646289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Mobile ultrasound imaging on heterogeneous multi-core platforms 异构多核平台上的移动超声成像

2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)

Pub Date : 2016-10-01 DOI: 10.1145/2993452.2993565

Andreas Kurth, Andreas Tretter, P. Hager, S. Sanabria, Orçun Göksel, L. Thiele, L. Benini

Ultrasound imaging is one of the most important medical diagnostic methods. The bulkiness of state-of-the-art high-quality ultrasound devices, however, drastically limits their usability in important application scenarios. In this paper, we show how a portable medical ultrasound device can be built using many-core technology and programmable logic, combining low power consumption with high flexibility. We discuss a typical ultrasound image reconstruction algorithm and howit can be parallelized using a pipelined design that efficiently partitions theworkload among heterogeneous processing elements. A special focus lies on the limited memory resources and data bandwidth between components. To tackle both problems, we use floating windowbuffers and approximate computations, and we minimize lookup table sizes using on-the-fly calculations. We evaluate the design on the Adapteva Parallella platform, which contains a power-efficient 16-core Epiphany coprocessor and a Zynq SoC including a dual-core ARM A9 processor and programmable logic. Experimental results show that parallel beamforming of 128 input channels to a 288x128 pixel ultrasound image can be achieved on the Parallella at a rate of 5.3 frames per second consuming only 2watt of dynamic power.

超声成像是最重要的医学诊断方法之一。然而，最先进的高质量超声设备的体积极大地限制了它们在重要应用场景中的可用性。在本文中，我们展示了如何使用多核技术和可编程逻辑构建便携式医疗超声设备，将低功耗与高灵活性相结合。我们讨论了一种典型的超声图像重建算法，以及如何使用流水线设计将其并行化，从而有效地在异构处理元素之间划分工作负载。特别关注的是组件之间有限的内存资源和数据带宽。为了解决这两个问题，我们使用浮动窗口缓冲区和近似计算，并使用动态计算最小化查找表大小。我们在Adapteva parallelella平台上评估了该设计，该平台包含一个节能的16核Epiphany协处理器和一个包含双核ARM A9处理器和可编程逻辑的Zynq SoC。实验结果表明，128个输入通道对288x128像素的超声图像可以实现并行波束形成，速率为5.3帧/秒，仅消耗2w的动态功率。

{"title":"Mobile ultrasound imaging on heterogeneous multi-core platforms","authors":"Andreas Kurth, Andreas Tretter, P. Hager, S. Sanabria, Orçun Göksel, L. Thiele, L. Benini","doi":"10.1145/2993452.2993565","DOIUrl":"https://doi.org/10.1145/2993452.2993565","url":null,"abstract":"Ultrasound imaging is one of the most important medical diagnostic methods. The bulkiness of state-of-the-art high-quality ultrasound devices, however, drastically limits their usability in important application scenarios. In this paper, we show how a portable medical ultrasound device can be built using many-core technology and programmable logic, combining low power consumption with high flexibility. We discuss a typical ultrasound image reconstruction algorithm and howit can be parallelized using a pipelined design that efficiently partitions theworkload among heterogeneous processing elements. A special focus lies on the limited memory resources and data bandwidth between components. To tackle both problems, we use floating windowbuffers and approximate computations, and we minimize lookup table sizes using on-the-fly calculations. We evaluate the design on the Adapteva Parallella platform, which contains a power-efficient 16-core Epiphany coprocessor and a Zynq SoC including a dual-core ARM A9 processor and programmable logic. Experimental results show that parallel beamforming of 128 input channels to a 288x128 pixel ultrasound image can be achieved on the Parallella at a rate of 5.3 frames per second consuming only 2watt of dynamic power.","PeriodicalId":198459,"journal":{"name":"2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131220947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Temporal analysis of static priority preemptive scheduled cyclic streaming applications using CSDF models 使用CSDF模型的静态优先级抢占调度循环流应用程序的时间分析

2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)

Pub Date : 2016-09-02 DOI: 10.1145/2993452.2993564

P. Kurtin, M. Bekooij

Real-time streaming applications with cyclic data dependencies that are executed on multiprocessor systems with processor sharing usually require a temporal analysis to give guarantees on their temporal behavior at design time. Current accurate analysis techniques for cyclic applications that are scheduled with Static Priority Preemptive (SPP) schedulers are however limited to the analysis of applications that can be expressed with Homogeneous Synchronous Dataflow (HSDF) models, i.e. in which all tasks operate at a single rate. Moreover, it is required that both input and output buffers synchronize atomically at the beginnings and finishes of task executions, which is difficult to realize on many existing hardware platforms. This paper presents a temporal analysis approach for cyclic real-time streaming applications executed on multiprocessor systems with processor sharing and SPP scheduling that can be expressed using Cyclo-Static Dataflow (CSDF) models. This allows to model tasks with multiple phases and changing rates and furthermore resolves the problematic restriction that buffer synchronization must occur atomically at the boundaries of task executions. For that purpose a joint interference characterization over multiple phases is introduced, which realizes a significant accuracy improvement compared to an isolated consideration of interference. Applicability, efficiency and accuracy of the presented approach are evaluated in a case study using a WLAN 802.11p transceiver application. Thereby different use-cases of CSDF modeling are discussed, including a CSDF model relaxing the requirement of atomic synchronization.

具有循环数据依赖的实时流应用程序在具有处理器共享的多处理器系统上执行，通常需要进行时间分析，以保证其在设计时的时间行为。然而，对于使用静态优先级抢占(SPP)调度器调度的循环应用程序，目前的精确分析技术仅限于可以用同构同步数据流(HSDF)模型表示的应用程序的分析，即所有任务都以单一速率运行。此外，还要求输入和输出缓冲区在任务执行的开始和结束时自动同步，这在许多现有硬件平台上很难实现。本文提出了一种在多处理器系统上运行的具有处理器共享和SPP调度的循环实时流应用程序的时间分析方法，该方法可以用循环静态数据流(CSDF)模型来表示。这允许对具有多个阶段和变化速率的任务进行建模，并且进一步解决了缓冲区同步必须在任务执行边界自动发生的有问题的限制。为此，引入了一种多相位的联合干扰表征，与孤立地考虑干扰相比，它实现了显著的精度提高。在一个使用WLAN 802.11p收发器应用的案例研究中，评估了所提出方法的适用性、效率和准确性。因此，讨论了CSDF建模的不同用例，包括放松原子同步需求的CSDF模型。

{"title":"Temporal analysis of static priority preemptive scheduled cyclic streaming applications using CSDF models","authors":"P. Kurtin, M. Bekooij","doi":"10.1145/2993452.2993564","DOIUrl":"https://doi.org/10.1145/2993452.2993564","url":null,"abstract":"Real-time streaming applications with cyclic data dependencies that are executed on multiprocessor systems with processor sharing usually require a temporal analysis to give guarantees on their temporal behavior at design time. Current accurate analysis techniques for cyclic applications that are scheduled with Static Priority Preemptive (SPP) schedulers are however limited to the analysis of applications that can be expressed with Homogeneous Synchronous Dataflow (HSDF) models, i.e. in which all tasks operate at a single rate. Moreover, it is required that both input and output buffers synchronize atomically at the beginnings and finishes of task executions, which is difficult to realize on many existing hardware platforms. This paper presents a temporal analysis approach for cyclic real-time streaming applications executed on multiprocessor systems with processor sharing and SPP scheduling that can be expressed using Cyclo-Static Dataflow (CSDF) models. This allows to model tasks with multiple phases and changing rates and furthermore resolves the problematic restriction that buffer synchronization must occur atomically at the boundaries of task executions. For that purpose a joint interference characterization over multiple phases is introduced, which realizes a significant accuracy improvement compared to an isolated consideration of interference. Applicability, efficiency and accuracy of the presented approach are evaluated in a case study using a WLAN 802.11p transceiver application. Thereby different use-cases of CSDF modeling are discussed, including a CSDF model relaxing the requirement of atomic synchronization.","PeriodicalId":198459,"journal":{"name":"2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129400698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2016 14th ACM/IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀