首页 > 最新文献

Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)最新文献

英文 中文
Morpheo: A high-performance processor generator for a FPGA implementation 用于FPGA实现的高性能处理器生成器
Mathieu Rosiere, J. Desbarbieux, Nathalie Drach-Temam, F. Wajsbürt
Complex applications, such as multimedia, telephony or cryptography, in embedded systems must provide more and more performance that can be achieved by using multiple levels of parallelism. Today, FPGA are viable alternatives for these kinds of applications. Unfortunately, the available processors on FPGA do not provide sufficient performance. This work proposes the Morpheo tool that is a generator of configurable high performance processors dedicated to FPGA. As the FPGA architecture is more restrictive than on ASIC, VHDL models produced by Morpheo can also be used for an ASIC implementation. The main advantage is that there is no need for specific components, therefore, processors are easier to generate. Despite the architectural changes related to the FPGA target, the IPC (Instructions Per Cycle) of 2-way and 4-way superscalar processors are, respectively, 0.81 and 0.74 times that of M5 processors (ASIC targeted) with corresponding parameters. These processors can be placed in a Xilinx Virtex-5 xc5vlx330 using 15% and 31% of hardware available resources and perform at, respectively, 79 MHz and 72 MHz.
嵌入式系统中的复杂应用程序,如多媒体、电话或密码学,必须提供越来越多的性能,这些性能可以通过使用多层并行性来实现。今天,FPGA是这类应用的可行替代方案。不幸的是,FPGA上可用的处理器不能提供足够的性能。这项工作提出了Morpheo工具,它是专用于FPGA的可配置高性能处理器的生成器。由于FPGA架构比ASIC更受限制,Morpheo生成的VHDL模型也可用于ASIC实现。其主要优点是不需要特定的组件,因此更容易生成处理器。尽管与FPGA目标相关的架构发生了变化,但具有相应参数的2路和4路超标量处理器的IPC(指令周期)分别是M5处理器(ASIC目标)的0.81和0.74倍。这些处理器可以放置在Xilinx Virtex-5 xc5vlx330中,分别使用15%和31%的硬件可用资源,运行频率分别为79 MHz和72 MHz。
{"title":"Morpheo: A high-performance processor generator for a FPGA implementation","authors":"Mathieu Rosiere, J. Desbarbieux, Nathalie Drach-Temam, F. Wajsbürt","doi":"10.1109/DASIP.2011.6136849","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136849","url":null,"abstract":"Complex applications, such as multimedia, telephony or cryptography, in embedded systems must provide more and more performance that can be achieved by using multiple levels of parallelism. Today, FPGA are viable alternatives for these kinds of applications. Unfortunately, the available processors on FPGA do not provide sufficient performance. This work proposes the Morpheo tool that is a generator of configurable high performance processors dedicated to FPGA. As the FPGA architecture is more restrictive than on ASIC, VHDL models produced by Morpheo can also be used for an ASIC implementation. The main advantage is that there is no need for specific components, therefore, processors are easier to generate. Despite the architectural changes related to the FPGA target, the IPC (Instructions Per Cycle) of 2-way and 4-way superscalar processors are, respectively, 0.81 and 0.74 times that of M5 processors (ASIC targeted) with corresponding parameters. These processors can be placed in a Xilinx Virtex-5 xc5vlx330 using 15% and 31% of hardware available resources and perform at, respectively, 79 MHz and 72 MHz.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126261529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Real-time moving object detection for video surveillance system in FPGA 基于FPGA的视频监控系统实时运动目标检测
T. Kryjak, M. Komorkiewicz, M. Gorgon
FPGA devices are a perfect platform for implementing image processing algorithms. In the article, an advanced video system is presented, which is able to detect moving objects in video sequences. The detection method is using two algorithms. First of all, a multimodal background generation method allows reliable scene modelling in case of rapid changes in lighting conditions and small background movement. Finally a segmentation based on three parameters lightness, colour and texture is applied. This approach allows to remove shadows from the processed image. Authors proposed some improvements and modifications to existing algorithms in order to make them suitable for reconfigurable platforms. In the final system only one, low cost, FPGA device is able to receive data from high speed digital camera, perform a Bayer transform, RGB to CIE Lab colour space conversion, generate a moving object mask and present results to the operator in real-time.
FPGA器件是实现图像处理算法的完美平台。本文介绍了一种能够检测视频序列中运动物体的高级视频系统。检测方法采用两种算法。首先,多模态背景生成方法可以在光照条件快速变化和背景移动较小的情况下实现可靠的场景建模。最后应用基于亮度、颜色和纹理三个参数的分割。这种方法允许从处理后的图像中去除阴影。作者对现有的算法提出了一些改进和修改,以使它们适合于可重构平台。在最后的系统中,只有一个低成本的FPGA器件能够接收高速数码相机的数据,执行拜耳变换,RGB到CIE Lab的色彩空间转换,生成运动对象掩模并实时将结果呈现给操作员。
{"title":"Real-time moving object detection for video surveillance system in FPGA","authors":"T. Kryjak, M. Komorkiewicz, M. Gorgon","doi":"10.1109/DASIP.2011.6136881","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136881","url":null,"abstract":"FPGA devices are a perfect platform for implementing image processing algorithms. In the article, an advanced video system is presented, which is able to detect moving objects in video sequences. The detection method is using two algorithms. First of all, a multimodal background generation method allows reliable scene modelling in case of rapid changes in lighting conditions and small background movement. Finally a segmentation based on three parameters lightness, colour and texture is applied. This approach allows to remove shadows from the processed image. Authors proposed some improvements and modifications to existing algorithms in order to make them suitable for reconfigurable platforms. In the final system only one, low cost, FPGA device is able to receive data from high speed digital camera, perform a Bayer transform, RGB to CIE Lab colour space conversion, generate a moving object mask and present results to the operator in real-time.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133757247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
A unified hardware/software co-synthesis solution for signal processing systems 用于信号处理系统的统一硬件/软件协同合成解决方案
E. Bezati, H. Yviquel, M. Raulet, M. Mattavelli
This paper presents a methodology to specify from a high-level data-flow description an application for both hardware and software synthesis. Firstly, an introduction to RVC-Cal data-flow programming and Orcc framework is presented. Furthermore, an analysis of a close to gate intermediate representation (XLIM) is bestowed. As a proof of concept a JPEG codec was written purely in RVC-Cal to test the co-synthesis tools and then an analysis of the generated hardware and software results are given. Our experience shows that using RVC-Cal can unify the process of creating the same application for software and hardware without modifying a single source code for each solution.
本文提出了一种从高级数据流描述中指定硬件和软件综合应用程序的方法。首先介绍了RVC-Cal数据流编程和Orcc框架。在此基础上,对近门中间表示(XLIM)进行了分析。作为概念验证,我们在RVC-Cal中编写了一个JPEG编解码器来测试协同合成工具,然后对生成的硬件和软件结果进行了分析。我们的经验表明,使用RVC-Cal可以统一为软件和硬件创建相同应用程序的过程,而无需为每个解决方案修改单个源代码。
{"title":"A unified hardware/software co-synthesis solution for signal processing systems","authors":"E. Bezati, H. Yviquel, M. Raulet, M. Mattavelli","doi":"10.1109/DASIP.2011.6136877","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136877","url":null,"abstract":"This paper presents a methodology to specify from a high-level data-flow description an application for both hardware and software synthesis. Firstly, an introduction to RVC-Cal data-flow programming and Orcc framework is presented. Furthermore, an analysis of a close to gate intermediate representation (XLIM) is bestowed. As a proof of concept a JPEG codec was written purely in RVC-Cal to test the co-synthesis tools and then an analysis of the generated hardware and software results are given. Our experience shows that using RVC-Cal can unify the process of creating the same application for software and hardware without modifying a single source code for each solution.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124832717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A new algorithm for realization of FIR filters using multiple constant multiplications 一种利用多次常数乘法实现FIR滤波器的新算法
M. Farahani, Eduardo Castillo Guerra, B. Colpitts
This paper presents a new common subexpression elimination (CSE) algorithm to realize FIR filters based on multiple constant multiplications (MCMs). This algorithm shares the maximum number of partial terms amongst minimal signed digit (MSD)-represented coefficients. It modifies the iterated matching (ITM) algorithm to share more partial terms in MCMs, which yields a significant logic and, consequently, chip area savings. The employment of the proposed algorithm results in efficient realizations of FIR filters with a fewer number of adders compared to the conventional CSE algorithms. Experimental results demonstrate a reduction up to 22% in the complexity of FIR filters over some conventional CSE algorithms. The proposed algorithm also addresses challenges encountered in resource-constrained applications, which require banks of high-order filters, such as in real-time distributed optical fiber sensor.
提出了一种新的通用子表达式消除(CSE)算法来实现基于多次常数乘法的FIR滤波器。该算法在最小符号数(MSD)表示的系数中共享部分项的最大数量。它修改了迭代匹配(ITM)算法,在mcm中共享更多的部分项,这产生了显著的逻辑,从而节省了芯片面积。与传统的CSE算法相比,采用该算法可以用更少的加法器有效地实现FIR滤波器。实验结果表明,与传统的CSE算法相比,FIR滤波器的复杂度降低了22%。该算法还解决了在资源受限的应用中遇到的挑战,例如在实时分布式光纤传感器中需要高阶滤波器。
{"title":"A new algorithm for realization of FIR filters using multiple constant multiplications","authors":"M. Farahani, Eduardo Castillo Guerra, B. Colpitts","doi":"10.1109/DASIP.2011.6136891","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136891","url":null,"abstract":"This paper presents a new common subexpression elimination (CSE) algorithm to realize FIR filters based on multiple constant multiplications (MCMs). This algorithm shares the maximum number of partial terms amongst minimal signed digit (MSD)-represented coefficients. It modifies the iterated matching (ITM) algorithm to share more partial terms in MCMs, which yields a significant logic and, consequently, chip area savings. The employment of the proposed algorithm results in efficient realizations of FIR filters with a fewer number of adders compared to the conventional CSE algorithms. Experimental results demonstrate a reduction up to 22% in the complexity of FIR filters over some conventional CSE algorithms. The proposed algorithm also addresses challenges encountered in resource-constrained applications, which require banks of high-order filters, such as in real-time distributed optical fiber sensor.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121997357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Interfacing and scheduling legacy code within the Canals framework 在运河框架内接口和调度遗留代码
Andreas Dahlin, F. Jokhio, J. Lilius, J. Gorin, M. Raulet
The need for understanding how to distribute computations across multiple cores, have obviously increased in the multi-core era. Scheduling the functional blocks of an application for concurrent execution requires not only a good understanding of data dependencies, but also a structured way to describe the intended scheduling. In this paper we describe how the Canals language and its scheduling framework can be used for the purpose of scheduling and executing legacy code. Additionally a set of translation guidelines for translating RVC-CAL applications into Canals are presented. The proposed approaches are applied to an existing MPEG-4 Simple Profile decoder for evaluation purposes. The inverse discrete cosine transform (IDCT) is accelerated by the means of OpenCL.
在多核时代,理解如何跨多核分配计算的需求明显增加了。为并发执行调度应用程序的功能块不仅需要很好地理解数据依赖关系,还需要一种结构化的方式来描述预期的调度。在本文中,我们描述了如何使用运河语言及其调度框架来调度和执行遗留代码。此外,还提出了一套将RVC-CAL应用程序翻译成canal的翻译指南。将所提出的方法应用于现有的MPEG-4简单配置文件解码器以进行评估。利用OpenCL实现离散余弦反变换(IDCT)的加速。
{"title":"Interfacing and scheduling legacy code within the Canals framework","authors":"Andreas Dahlin, F. Jokhio, J. Lilius, J. Gorin, M. Raulet","doi":"10.1109/DASIP.2011.6136886","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136886","url":null,"abstract":"The need for understanding how to distribute computations across multiple cores, have obviously increased in the multi-core era. Scheduling the functional blocks of an application for concurrent execution requires not only a good understanding of data dependencies, but also a structured way to describe the intended scheduling. In this paper we describe how the Canals language and its scheduling framework can be used for the purpose of scheduling and executing legacy code. Additionally a set of translation guidelines for translating RVC-CAL applications into Canals are presented. The proposed approaches are applied to an existing MPEG-4 Simple Profile decoder for evaluation purposes. The inverse discrete cosine transform (IDCT) is accelerated by the means of OpenCL.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127922270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acceleration of image reconstruction in 3D ultrasound computer tomography: An evaluation of CPU, GPU and FPGA computing 三维超声计算机断层扫描图像重建的加速:CPU、GPU和FPGA计算的评价
M. Birk, Alexander Guth, M. Zapf, M. Balzer, N. Ruiter, M. Hübner, J. Becker
As today's standard screening methods frequently fail to diagnose breast cancer before metastases have developed, earlier breast cancer diagnosis is still a major challenge. Three-dimensional ultrasound computer tomography promises high-quality images of the breast, but is currently limited by a time-consuming synthetic aperture focusing technique based image reconstruction. In this work, we investigate the acceleration of the image reconstruction by a GPU, and by the FPGAs embedded in our custom data acquisition system. We compare the obtained performance results with a recent multi-core CPU and show that both platforms are able to accelerate processing. The GPU reaches the highest performance. Furthermore, we draw conclusions in terms of applicability of the accelerated reconstructions in future clinical application and highlight general principles for speed-up on GPUs and FPGAs.
由于目前的标准筛查方法往往无法在乳腺癌转移发生前诊断出来,因此早期乳腺癌诊断仍然是一项重大挑战。三维超声计算机断层扫描有望获得高质量的乳房图像,但目前受到基于图像重建的耗时合成孔径聚焦技术的限制。在这项工作中,我们研究了GPU和fpga嵌入我们的自定义数据采集系统的图像重建加速。我们将获得的性能结果与最近的多核CPU进行了比较,并表明这两个平台都能够加速处理。GPU达到最高性能。此外,我们还总结了加速重建在未来临床应用中的适用性,并强调了在gpu和fpga上加速的一般原则。
{"title":"Acceleration of image reconstruction in 3D ultrasound computer tomography: An evaluation of CPU, GPU and FPGA computing","authors":"M. Birk, Alexander Guth, M. Zapf, M. Balzer, N. Ruiter, M. Hübner, J. Becker","doi":"10.1109/DASIP.2011.6136856","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136856","url":null,"abstract":"As today's standard screening methods frequently fail to diagnose breast cancer before metastases have developed, earlier breast cancer diagnosis is still a major challenge. Three-dimensional ultrasound computer tomography promises high-quality images of the breast, but is currently limited by a time-consuming synthetic aperture focusing technique based image reconstruction. In this work, we investigate the acceleration of the image reconstruction by a GPU, and by the FPGAs embedded in our custom data acquisition system. We compare the obtained performance results with a recent multi-core CPU and show that both platforms are able to accelerate processing. The GPU reaches the highest performance. Furthermore, we draw conclusions in terms of applicability of the accelerated reconstructions in future clinical application and highlight general principles for speed-up on GPUs and FPGAs.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122217798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Practical monitoring and analysis tool for WSN testing 实用的无线传感器网络测试监控分析工具
M. Hänninen, J. Suhonen, T. Hämäläinen, Marko Hännikäinen
Wireless Sensor Networks (WSN) comprise autonomous embedded nodes that combine sensing, actuation, and distributed computing with small size and low energy. Testing of WSNs is challenging due to distributed functionality, lack of test and debug interfaces, and limited communication, computation and memory resources. This paper presents the design and implementation of a practical network monitoring and analysis tool for identifying the causes of misbehavior. The tool consists of a sniffer node that passively captures WSN traffic, and multiple user interfaces that can run e.g. on a PC or mobile phone. Unlike the related proposals, our tool neither needs setting up an additional monitoring network alongside the actual WSN nor uses any node resources. Practical testing experiences with mul-tihop mesh WSN deployments show that the tool reveals design and implementation defects that are hard to discover with other testing methods such as in-network monitoring systems or debug prints.
无线传感器网络(WSN)由自主嵌入式节点组成,该节点集传感、驱动和分布式计算于一体,具有小尺寸、低能耗的特点。由于分布式功能、缺乏测试和调试接口以及有限的通信、计算和内存资源,无线传感器网络的测试具有挑战性。本文介绍了一种实用的网络监测和分析工具的设计和实现,用于识别不当行为的原因。该工具包括一个嗅探器节点,被动捕获WSN流量,以及可以在PC或移动电话上运行的多个用户界面。与相关建议不同的是,我们的工具既不需要在实际WSN旁边设置额外的监控网络,也不需要使用任何节点资源。多跳网格WSN部署的实际测试经验表明,该工具揭示了其他测试方法(如网内监控系统或调试打印)难以发现的设计和实现缺陷。
{"title":"Practical monitoring and analysis tool for WSN testing","authors":"M. Hänninen, J. Suhonen, T. Hämäläinen, Marko Hännikäinen","doi":"10.1109/DASIP.2011.6136894","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136894","url":null,"abstract":"Wireless Sensor Networks (WSN) comprise autonomous embedded nodes that combine sensing, actuation, and distributed computing with small size and low energy. Testing of WSNs is challenging due to distributed functionality, lack of test and debug interfaces, and limited communication, computation and memory resources. This paper presents the design and implementation of a practical network monitoring and analysis tool for identifying the causes of misbehavior. The tool consists of a sniffer node that passively captures WSN traffic, and multiple user interfaces that can run e.g. on a PC or mobile phone. Unlike the related proposals, our tool neither needs setting up an additional monitoring network alongside the actual WSN nor uses any node resources. Practical testing experiences with mul-tihop mesh WSN deployments show that the tool reveals design and implementation defects that are hard to discover with other testing methods such as in-network monitoring systems or debug prints.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133203558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Analyzing software inter-task communication channels on a clustered shared memory multi processor system-on-chip 集群共享内存多处理器片上系统的软件任务间通信通道分析
Daniela Genius, Nicolas Pouillon
The task graph of telecommunication applications often exhibits massive coarse grained parallelism, which can be exploited by an on-chip multiprocessor. In many cases it can be organized into several subsequent stages, each containing dozens or even hundreds of identical tasks. We implement communications between tasks via software channels mapped to on-chip memory, allowing for multiple readers and writers to access them in arbitrary order. Our architecture is based on the shared memory paradigm. The interconnection network is hierarchical, so that communication latencies vary with the distance between the cluster where the task is located and the cluster on which the channel is placed. Moreover, packet sizes and arrival rates are subject to strong variations. An analytical approach to dimensioning the channels is thus near impossible. Within a purely simulation based approach, we gain insight into the performance of such software channels.
电信应用程序的任务图经常显示大量粗粒度并行性,这可以被片上多处理器利用。在许多情况下,它可以被组织成几个后续阶段,每个阶段包含数十甚至数百个相同的任务。我们通过映射到片上存储器的软件通道实现任务之间的通信,允许多个读取器和写入器以任意顺序访问它们。我们的架构是基于共享内存范式的。互连网络是分层的,因此通信延迟随任务所在集群和信道所在集群之间的距离而变化。此外,包的大小和到达率也有很大的变化。因此,用分析方法来确定通道的尺寸几乎是不可能的。在纯粹基于模拟的方法中,我们深入了解了此类软件通道的性能。
{"title":"Analyzing software inter-task communication channels on a clustered shared memory multi processor system-on-chip","authors":"Daniela Genius, Nicolas Pouillon","doi":"10.1109/DASIP.2011.6136892","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136892","url":null,"abstract":"The task graph of telecommunication applications often exhibits massive coarse grained parallelism, which can be exploited by an on-chip multiprocessor. In many cases it can be organized into several subsequent stages, each containing dozens or even hundreds of identical tasks. We implement communications between tasks via software channels mapped to on-chip memory, allowing for multiple readers and writers to access them in arbitrary order. Our architecture is based on the shared memory paradigm. The interconnection network is hierarchical, so that communication latencies vary with the distance between the cluster where the task is located and the cluster on which the channel is placed. Moreover, packet sizes and arrival rates are subject to strong variations. An analytical approach to dimensioning the channels is thus near impossible. Within a purely simulation based approach, we gain insight into the performance of such software channels.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130293445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Embedded systems security: An evaluation methodology against Side Channel Attacks 嵌入式系统安全:对抗侧信道攻击的评估方法
Youssef Souissi, J. Danger, S. Guilley, S. Bhasin, Maxime Nassar
One of the most redoubtable attacks on modern embedded systems are Side-Channel Analysis. In this paper, we propose a security evaluation framework which aims at organizing the work of the evaluator to reliably assess the robustness of embedded systems against such attacks. Moreover, we highlight common errors made by evaluators and solutions to avoid them.
对现代嵌入式系统最可怕的攻击之一是侧信道分析。在本文中,我们提出了一个安全评估框架,旨在组织评估人员的工作,以可靠地评估嵌入式系统对此类攻击的鲁棒性。此外,我们强调了评价者常犯的错误以及避免这些错误的解决方案。
{"title":"Embedded systems security: An evaluation methodology against Side Channel Attacks","authors":"Youssef Souissi, J. Danger, S. Guilley, S. Bhasin, Maxime Nassar","doi":"10.1109/DASIP.2011.6136885","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136885","url":null,"abstract":"One of the most redoubtable attacks on modern embedded systems are Side-Channel Analysis. In this paper, we propose a security evaluation framework which aims at organizing the work of the evaluator to reliably assess the robustness of embedded systems against such attacks. Moreover, we highlight common errors made by evaluators and solutions to avoid them.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124179164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Graphic rendering application profiling on a shared memory MPSOC architecture 在共享内存MPSOC架构上的图形渲染应用程序分析
Matthieu Texier, R. David, K. B. Chehida, O. Sentieys
This paper describes the implementation of a graphic rendering pipeline on an MPSoC architecture devoted to the dynamic management of static task graphs. It exhibits the highly non stationary workloads of this application domain and provides first useful feedbacks motivating the design of innovative embedded architectures that have to face heterogeneous computation domains such as graphics and telecommunications. Especially these experiments stress the needs for data dependent resource allocation strategies.
本文描述了一个图形绘制流水线在MPSoC架构上的实现,用于静态任务图的动态管理。它展示了这个应用领域的高度非固定工作负载,并提供了第一个有用的反馈,激励了必须面对异构计算领域(如图形和电信)的创新嵌入式架构的设计。这些实验特别强调了对依赖于数据的资源分配策略的需求。
{"title":"Graphic rendering application profiling on a shared memory MPSOC architecture","authors":"Matthieu Texier, R. David, K. B. Chehida, O. Sentieys","doi":"10.1109/DASIP.2011.6136864","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136864","url":null,"abstract":"This paper describes the implementation of a graphic rendering pipeline on an MPSoC architecture devoted to the dynamic management of static task graphs. It exhibits the highly non stationary workloads of this application domain and provides first useful feedbacks motivating the design of innovative embedded architectures that have to face heterogeneous computation domains such as graphics and telecommunications. Especially these experiments stress the needs for data dependent resource allocation strategies.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131824110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1