2015 13th IEEE Symposium on Embedded Systems For Real-time Multimedia (ESTIMedia): Latest Papers

Adaptive multi-resource end-to-end reservations for component-based distributed real-time systems
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351772
N. Khalilzad, M. Ashjaei, L. Almeida, M. Behnam, Thomas Nolte
Complexity in the real-time embedded software domain has been growing rapidly. The component-based software development approach facilitates the development process of such software systems by dividing a complex system into a number of simpler components. Resource reservation techniques have been widely used for providing resources to real-time software components. In this paper we target real-time components operating on a distributed resource infrastructure. Furthermore, we target a class of software components which demonstrate dynamic resource consumption behavior. A prime example of such components is a multimedia software component. In the paper, we present a framework supporting multi-resource end-to-end resource reservations. We reserve resource bandwidths on both processor resources as well as on the network resources. The proposed framework utilizes a Multiple Input Multiple Output (MIMO) controller which adjusts the sizes of reservations tracking the dynamic resource demands of the software components. Finally, we present a case study using a multimedia component to demonstrate the performance and efficiency of our framework.
Citations: 5
Visual co-occurrence network: using context for large-scale object recognition in retail
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351774
Siddharth Advani, Brigid Smith, Yasuki Tanabe, K. Irick, M. Cotter, J. Sampson, N. Vijaykrishnan
In any visual object recognition system, the classification accuracy will likely determine the usefulness of the system as a whole. In many real-world applications, it is also important to be able to recognize a large number of diverse objects for the system to be robust enough to handle the sort of tasks that the human visual system handles on an average day. These objectives are often at odds with performance, as running too large of a number of detectors on any one scene will be prohibitively slow for use in any real-time scenario. However, visual information has temporal and spatial context that can be exploited to reduce the number of detectors that need to be triggered at any given instance. In this paper, we propose a dynamic approach to encode such context, called Visual Co-occurrence Network (ViCoNet) that establishes relationships between objects observed in a visual scene. We investigate the utility of ViCoNet when integrated into a vision pipeline targeted for retail shopping. When evaluated on a large and deep dataset, we achieve a 50% improvement in performance and a 7% improvement in accuracy in the best case, and a 45% improvement in performance and a 3% improvement in accuracy in the average case over an established baseline. The memory overhead of ViCoNet is around 10KB, highlighting its effectiveness on temporal big data.
Citations: 18
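The ViCoNet abstract above describes using the context of already recognized objects to limit how many detectors are triggered. A minimal sketch of that general idea follows. It is not the authors' implementation: the class name `CooccurrenceNetwork`, its methods, and the simple count-based ranking are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations
from typing import Dict, Iterable, List


class CooccurrenceNetwork:
    """Hypothetical co-occurrence table; not the ViCoNet data structure."""

    def __init__(self) -> None:
        # counts[a][b] = number of scenes in which labels a and b appeared together
        self.counts: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))

    def observe(self, labels: Iterable[str]) -> None:
        """Record one scene by incrementing every pair of distinct labels in it."""
        for a, b in combinations(sorted(set(labels)), 2):
            self.counts[a][b] += 1
            self.counts[b][a] += 1

    def candidates(self, seen: Iterable[str], top_k: int = 2) -> List[str]:
        """Rank unseen labels by how often they co-occurred with the seen ones."""
        seen_set = set(seen)
        scores: Dict[str, int] = defaultdict(int)
        for label in seen_set:
            for other, n in self.counts.get(label, {}).items():
                if other not in seen_set:
                    scores[other] += n
        # Highest count first; ties are broken alphabetically for determinism.
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        return [label for label, _ in ranked[:top_k]]


net = CooccurrenceNetwork()
net.observe(["milk", "cereal"])
net.observe(["milk", "cereal", "bread"])
net.observe(["milk", "bread"])
net.observe(["soap", "shampoo"])
# Only detectors for the returned labels would be run on the next frame.
next_detectors = net.candidates(["milk"], top_k=2)
```

In this sketch, recognizing one product narrows the next frame's detector set to the labels that most often appeared alongside it, which is the kind of reduction the abstract attributes to context.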
Javascript ahead-of-time compilation for embedded web platform
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351768
Hyukwoo Park, Wonki Jung, Soo-Mook Moon
Web applications (apps) programmed using HTML, CSS, and JavaScript have advantages in portability and productivity compared to Android or iOS apps. However, web apps suffer from performance issues, because JavaScript's dynamic typing and prototypes are difficult to execute efficiently. One popular way of accelerating JavaScript is just-in-time compilation (JITC), which translates the JavaScript source code to machine code at runtime. Unfortunately, JavaScript JITC for web apps suffers seriously from parsing and compilation overhead, which offsets the performance gain of executing the compiled code. In this paper, we propose ahead-of-time compilation (AOTC) of JavaScript at the client device. We save the code generated by the JITC at the first run of the web app, so that we can reuse the code in subsequent runs to remove the parsing and compilation overhead. For the JavaScriptCore (JSC) engine of WebKit, we developed three AOTCs. Bytecode-AOTC saves the bytecode, so we can omit the parsing overhead. Native-AOTC saves the machine code, so we can omit the compilation as well as the parsing overhead, yet with a higher space overhead. We also developed a selective-AOTC, which selects between the two AOTCs depending on the performance benefit and the space overhead of a target JavaScript function. We experimented with four web apps on a commercial smart TV. For the JavaScript portion of the app loading time, bytecode-AOTC and native-AOTC achieve 33.5% and 62.1% performance benefit, with a space overhead of 2.3 times and 15.4 times the original JavaScript source code size, respectively. Selective-AOTC achieves a trade-off of 45.9% performance benefit and a space overhead of 4.7 times. Our evaluation on JavaScript benchmarks also shows a tangible performance gain, although it is lower than for web apps. Finally, we compare to the AOTC for the V8 engine and discuss some issues.
Citations: 7
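The abstract above says selective-AOTC picks, per JavaScript function, between saving bytecode and saving machine code based on performance benefit and space overhead, but it does not state the decision rule. The sketch below shows one way such a rule could look. The `FunctionProfile` fields, the `choose_format` helper, and the threshold of 0.5 ms per KiB are hypothetical and are not taken from the paper or from JavaScriptCore.

```python
from dataclasses import dataclass


@dataclass
class FunctionProfile:
    """Hypothetical per-function measurements; all field names are assumptions."""
    name: str
    bytecode_saving_ms: float  # load time saved by reusing cached bytecode
    native_saving_ms: float    # load time saved by reusing cached machine code
    bytecode_size: int         # cached bytecode size in bytes
    native_size: int           # cached machine code size in bytes


def choose_format(f: FunctionProfile, min_gain_ms_per_kb: float = 0.5) -> str:
    """Pick "native" only if its extra saving justifies its extra space."""
    extra_kb = (f.native_size - f.bytecode_size) / 1024
    extra_gain = f.native_saving_ms - f.bytecode_saving_ms
    if extra_kb <= 0:
        # Native code is no larger, so it wins whenever it is not slower.
        return "native" if extra_gain >= 0 else "bytecode"
    return "native" if extra_gain / extra_kb >= min_gain_ms_per_kb else "bytecode"


hot = FunctionProfile("render", 1.0, 5.0, 2048, 6144)    # 4 ms gained for 4 KiB
cold = FunctionProfile("init", 1.0, 1.2, 2048, 12288)    # 0.2 ms gained for 10 KiB
choices = {f.name: choose_format(f) for f in (hot, cold)}
```

A gain-per-size threshold like this is one simple way to land between the two extremes the abstract reports for bytecode-AOTC and native-AOTC.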
Energy-efficient mapping of real-time streaming applications on cluster heterogeneous MPSoCs
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351764
Di Liu, J. Spasić, Gang Chen, T. Stefanov
In this paper, we propose a novel polynomial time algorithm, called Frequency Driven Mapping, to map real-time streaming applications specified as cyclo-static dataflow (CSDF) graphs onto a cluster heterogeneous MPSoC. The objective of our mapping approach is to reduce the energy consumption while guaranteeing latency and throughput constraints. The main novelty in our mapping algorithm is twofold: (1) By using hard real-time scheduling of CSDF graphs, we propose an efficient way to determine a suitable processor type for each task in a CSDF graph, where the energy consumption is minimized and throughput and latency constraints are met; (2) Starting from an initial mapping derived by a first-fit-decreasing heuristic, we propose a remapping approach, where some tasks are remapped to unused clusters in order to further reduce the energy consumption of the system by cluster dynamic voltage/frequency scaling (DVFS). The experimental results show that the proposed algorithm finds more energy-efficient mappings than existing approaches. The energy savings due to our proposed algorithm are up to 34%.
Citations: 27
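The abstract above derives its initial mapping with a first-fit-decreasing heuristic. As a reminder of how that heuristic works in general, here is a minimal sketch that packs tasks onto identical processors by utilization. It is the textbook heuristic only: the paper's processor types, clusters, and DVFS-driven remapping are not modeled, and the task names, utilizations, and unit capacity are assumptions.

```python
from typing import Dict, List


def first_fit_decreasing(utilization: Dict[str, float],
                         capacity: float = 1.0) -> List[List[str]]:
    """Place each task on the first processor that still has room for it."""
    loads: List[float] = []     # current load of each opened processor
    bins: List[List[str]] = []  # tasks assigned to each opened processor
    # Consider the largest tasks first; ties are broken by name.
    for task, u in sorted(utilization.items(), key=lambda kv: (-kv[1], kv[0])):
        if u > capacity:
            raise ValueError(f"task {task!r} does not fit on any processor")
        for i, load in enumerate(loads):
            if load + u <= capacity + 1e-9:  # small tolerance for float rounding
                loads[i] += u
                bins[i].append(task)
                break
        else:
            # No opened processor has room, so open a new one.
            loads.append(u)
            bins.append([task])
    return bins


mapping = first_fit_decreasing({"a": 0.6, "b": 0.5, "c": 0.4, "d": 0.3, "e": 0.2})
```

Here five tasks with a total utilization of 2.0 fit on two processors, so any further processors stay unused. The abstract's second step starts from exactly this kind of situation and moves some tasks onto unused clusters.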
Framework separated migration for web applications
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351767
Jin-woo Kwon, JinSeok Oh, InChang Jeong, Soo-Mook Moon
Web applications (apps) are programs created with web technologies such as HTML, CSS, and JavaScript. Web apps can be executed on any platform that supports a web browser. Such portability allows an interesting user experience called app migration [2, 4, 5], which saves an app's execution state as a snapshot, transmits it to another device, and continues execution on that device. However, existing approaches have a feasibility issue because they save the app's state without separating the JavaScript framework state, resulting in both time and size overhead. A JavaScript framework such as jQuery is a library written in JavaScript to support web app development, and such frameworks are widely used in web apps. An app's JavaScript heap is therefore composed of the app's objects and framework objects, possibly intermingled through their pointers, and the app code can modify framework objects. In this paper, we propose framework separated migration, which separates the framework objects during snapshot creation so that the snapshot does not contain framework objects, but does contain the changes made to them as well as the app objects. After transmission, the browser loads the JavaScript framework first, loads the snapshot, and restores the original state of the framework and app objects to continue execution. With our approach, we could reduce total migration time by 40%, resulting in increased feasibility in a real situation.
Citations: 6
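The abstract above describes a snapshot that leaves out framework objects but keeps the changes the app made to them, with the receiving side loading the framework first and then applying the snapshot. The sketch below illustrates that idea on plain dictionaries. It is an assumption-laden illustration rather than the authors' mechanism: real JavaScript heaps hold object graphs, whereas this compares only top-level keys, and `make_snapshot`, `restore`, and the state layout are hypothetical.

```python
import copy
import json
from typing import Any, Dict, Tuple

State = Dict[str, Any]


def make_snapshot(app: State, framework: State, pristine: State) -> str:
    """Serialize the app state plus only the framework entries that changed."""
    changes = {k: v for k, v in framework.items()
               if k not in pristine or pristine[k] != v}
    removed = [k for k in pristine if k not in framework]
    return json.dumps({"app": app,
                       "framework_changes": changes,
                       "framework_removed": removed})


def restore(snapshot: str, pristine: State) -> Tuple[State, State]:
    """Start from a freshly loaded framework, then replay the recorded changes."""
    data = json.loads(snapshot)
    framework = copy.deepcopy(pristine)
    framework.update(data["framework_changes"])
    for key in data["framework_removed"]:
        framework.pop(key, None)
    return data["app"], framework


pristine = {"version": "1.0", "ajaxTimeout": 0, "cache": {}}   # as loaded
modified = {"version": "1.0", "ajaxTimeout": 5000, "cache": {}}  # after app code ran
app_state = {"counter": 3}

snapshot = make_snapshot(app_state, modified, pristine)
restored_app, restored_framework = restore(snapshot, pristine)
```

Only `ajaxTimeout` travels in the snapshot here. The rest of the framework state is rebuilt by loading the framework on the target device, which is where the size and time savings in the abstract come from.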
Dynamic task mapping of graphics processing applications on many-core architectures through stream rewriting
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351763
Lars Middendorf, C. Haubelt
Although modern graphics processing units (GPU) contain a large number of programmable shader cores, the focus on data parallelism and also the lack of efficient on-chip communication hinder the creation of custom graphics pipelines with arbitrary topologies. Based on the concept of stream rewriting, we propose a novel many-core architecture for graphics processing, which supports dynamic scheduling of recursively expandable task graphs and graphics pipelines. In particular, the tasks and their dependencies are encoded as a token stream, which is iteratively rewritten via pattern matching on multiple cores in parallel. The scalability of the proposed hardware architecture has been evaluated using an FPGA prototype.
Citations: 0
Predictability of image processing algorithms on heterogeneous MPSoC
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351760
Johny Paul, W. Stechele
Multiprocessor System-on-Chip (MPSoC) designs offer a lot of computational power assembled in a compact design. The computing power of MPSoCs can be further augmented by adding heterogeneous processing elements, e.g. massively parallel processor arrays (MPPA) and specialized hardware with instruction-set extensions. However, the presence of multiple processing elements (PEs) with different characteristics raises issues related to programming and application mapping. The conventional approach used for programming heterogeneous MPSoCs results in a static mapping of various parts of the application to different PE types, based on the nature of the algorithm and the structure of the PEs. Yet, such a mapping scheme independent of the instantaneous load on the PEs may lead to under-utilization of some type of PEs while overloading others. We investigate the benefits of a resource-aware programming model called Invasive Computing for dynamically mapping image processing applications to different types of PEs available on a heterogeneous MPSoC. A case study of visual object recognition is presented, including Harris corner detection and SIFT feature matching. Results indicate that resource-aware programming helps to predict the latency of the application program along with better overall workload distribution within the heterogeneous MPSoC.
Citations: 0
Integrated visual security management for video encryption in limited battery devices
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351773
Junhyung Moon, Kyoungwoo Lee
Various video applications in mobile and wearable devices deal with private or important video data. In order to protect important video information, several video encryption techniques have been proposed. Secure video processing, the combination of video compression/decompression and video encryption/decryption, causes considerable computational overhead and therefore consumes a large amount of energy. Mobile and wearable devices, which have limited battery capacity, thus need to manage their power resources efficiently when performing secure video processing. In order to find a configuration of the secure video processing that satisfies both the energy and visual security constraints, examining the tradeoff space in secure video processing is essential. In this paper, we study the tradeoff space between energy consumption and visual security in secure video processing. Consequently, we propose EVS (Energy-bound Visual Security) to find the configuration of the secure video processing that achieves the maximum visual security under the energy budget, by exploring the huge tradeoff space. Moreover, we design an exploration technique to reduce the overhead of investigating the tradeoff space. Thanks to EVS, we improve the visual security by up to about 17% under the same energy budget in our experiments. In addition, our proposed technique reduces the exploration overhead by about 55% in the experiments.
Citations: 1
Invasive computing for predictable stream processing: a simulation-based case study
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351761
Sascha Roloff, S. Wildermann, Frank Hannig, J. Teich
Heterogeneous many-core systems enable the integration of more and more applications into a single system. Executing multiple applications in the same system inevitably leads to resource sharing, e.g., when accessing on-chip communication and memory. This poses a challenge when applications are expected to guarantee user requirements regarding timing, reliability, security, etc. In this paper, we review a design methodology that (a) allows an application designer to model a stream processing application and user requirements, and then (b) automatically generates a set of resource requirements that guarantee the fulfillment of these user requirements. Techniques from the Invasive Computing paradigm enable the program-driven dynamic reservation of resources according to these generated resource requirements. We demonstrate that this provides a means for predictable execution of stream processing applications by evaluating a simulation-based case study.
Citations: 6
WebRTCbench: a benchmark for performance assessment of webRTC implementations
Pub Date : 2015-12-17 DOI: 10.1109/ESTIMedia.2015.7351769
Sajjad Taheri, Laleh Aghababaie Beni, A. Veidenbaum, A. Nicolau, Rosario Cammarota, Jianlin Qiu, Qiang Lu, M. Haghighat
WebRTC is an HTML5 API that allows browsers to establish a peer-to-peer connection for transferring data and media content via JavaScript APIs. This functionality enables a broad range of new applications to emerge and is going to revolutionize Web communication. However, this technology is still undergoing development and standardization. Hence, detecting performance bottlenecks of different implementations across operating systems and architectures can help improve it significantly, and a benchmark suite would be a great help in accomplishing this task. In this paper, we present WebRTCBench, a benchmark that measures WebRTC peer connection establishment and communication performance. We present and discuss a performance evaluation of WebRTC implementations across a range of implementations and devices. This benchmark is publicly available under the GPL license.
Citations: 25