2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)最新文献

英文中文

Energy consumption characterization of a Massively Parallel Processor Array (MPPA) platform running a hyperspectral SVM classifier 运行高光谱支持向量机分类器的大规模并行处理器阵列(MPPA)平台的能耗表征

2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)

Pub Date : 2017-09-01 DOI: 10.1109/DASIP.2017.8122112

D. Madroñal, R. Lazcano, H. Fabelo, S. Ortega, R. Salvador, G. Callicó, E. Juárez, C. Sanz

In this paper, a Massively Parallel Processor Array platform is characterized in terms of energy consumption using a Support Vector Machine for hyperspectral image classification. This platform gathers 16 clusters composed of 16 cores each, i.e., 256 processors working in parallel. The objective of the work is to associate power dissipation and energy consumed by the platform with the different resources of the architecture. Experimenting with a hyperspectral SVM classifier, this study has been conducted using three strategies: i) modifying the number of processing elements, i.e., clusters and cores, ii) increasing system frequency, and iii) varying the number of active communication links during the analysis, i.e., I/Os and DMAs. As a result, a relationship between the energy consumption and the active platform resources has been exposed using two different parallelization strategies. Finally, the implementation that fully exploits the parallelization possibilities working at 500MHz has been proven to be also the most efficient one, as it reduces the energy consumption by 98% when compared to the sequential version running at 400MHz.

本文利用支持向量机对高光谱图像进行分类，研究了大规模并行处理器阵列平台在能量消耗方面的特点。该平台汇集了16个集群，每个集群由16个核心组成，即256个处理器并行工作。这项工作的目标是将平台的功耗和能量消耗与建筑的不同资源联系起来。本研究使用高光谱SVM分类器进行实验，采用三种策略进行:i)修改处理元素的数量，即集群和核心;ii)增加系统频率;iii)在分析过程中改变主动通信链路的数量，即i / o和dma。因此，使用两种不同的并行化策略暴露了能耗和活动平台资源之间的关系。最后，充分利用在500MHz工作的并行化可能性的实现已被证明也是最有效的实现，因为与在400MHz运行的顺序版本相比，它减少了98%的能耗。

{"title":"Energy consumption characterization of a Massively Parallel Processor Array (MPPA) platform running a hyperspectral SVM classifier","authors":"D. Madroñal, R. Lazcano, H. Fabelo, S. Ortega, R. Salvador, G. Callicó, E. Juárez, C. Sanz","doi":"10.1109/DASIP.2017.8122112","DOIUrl":"https://doi.org/10.1109/DASIP.2017.8122112","url":null,"abstract":"In this paper, a Massively Parallel Processor Array platform is characterized in terms of energy consumption using a Support Vector Machine for hyperspectral image classification. This platform gathers 16 clusters composed of 16 cores each, i.e., 256 processors working in parallel. The objective of the work is to associate power dissipation and energy consumed by the platform with the different resources of the architecture. Experimenting with a hyperspectral SVM classifier, this study has been conducted using three strategies: i) modifying the number of processing elements, i.e., clusters and cores, ii) increasing system frequency, and iii) varying the number of active communication links during the analysis, i.e., I/Os and DMAs. As a result, a relationship between the energy consumption and the active platform resources has been exposed using two different parallelization strategies. Finally, the implementation that fully exploits the parallelization possibilities working at 500MHz has been proven to be also the most efficient one, as it reduces the energy consumption by 98% when compared to the sequential version running at 400MHz.","PeriodicalId":6637,"journal":{"name":"2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)","volume":"8 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82068279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Reconfigurable vision processing system for player tracking in indoor sports 室内运动中运动员跟踪的可重构视觉处理系统

2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)

Pub Date : 2017-09-01 DOI: 10.1109/DASIP.2017.8122114

O. Ibraheem, Arif Irwansyah, J. Hagemeyer, Mario Porrmann, U. Rückert

In recent years, there has been an increasing growth of using vision-based systems for tracking the players in team sports to evaluate and enhance their performance. Vision-based player tracking has high computational demands since it requires processing of a huge amount of video data based on the utilization of multiple cameras with high resolution and high frame rates. In this paper, we present a reconfigurable system to track the players in indoor sports automatically without user interaction. The proposed system can process live video data streams from multiple cameras as well as offline data from recorded video files. FPGA technology is used to accelerate this player tracking system by implementing the video acquisition, video preprocessing, player segmentation, and team identification & player detection modules in hardware, realizing a real-time system. The teams are identified and the players' positions are detected based on the colors of their jerseys. The detection results are sent from the FPGA to the host-PC where the players are tracked. Our results show that the achieved average player detection rate is up to 95.5%. The proposed system can process live video data using two GigE Vision cameras with a resolution of 1392×1040 pixels and 30 fps for each camera. A speed-up of 20 is achieved compared to an OpenCV-based software implementation on a host-PC equipped with a 2.93 GHz Intel i7 CPU.

近年来，越来越多的人使用基于视觉的系统来跟踪团队运动中的运动员，以评估和提高他们的表现。基于视觉的玩家跟踪具有很高的计算需求，因为它需要处理基于多个高分辨率和高帧率的摄像机的大量视频数据。在本文中，我们提出了一个可重构的系统来自动跟踪室内运动中的运动员，而无需用户交互。所提出的系统可以处理来自多个摄像机的实时视频数据流以及来自录制视频文件的离线数据。采用FPGA技术，在硬件上实现视频采集、视频预处理、球员分割、球队识别和球员检测等模块，对该球员跟踪系统进行加速，实现了系统的实时性。根据队服的颜色来识别球队和球员的位置。检测结果从FPGA发送到主机pc，在那里玩家被跟踪。我们的结果表明，实现的平均球员检出率高达95.5%。该系统可以使用两个GigE Vision摄像机处理实时视频数据，每个摄像机的分辨率为1392×1040像素和30 fps。与配备2.93 GHz Intel i7 CPU的主机pc上基于opencv的软件实现相比，实现了20%的速度提升。

{"title":"Reconfigurable vision processing system for player tracking in indoor sports","authors":"O. Ibraheem, Arif Irwansyah, J. Hagemeyer, Mario Porrmann, U. Rückert","doi":"10.1109/DASIP.2017.8122114","DOIUrl":"https://doi.org/10.1109/DASIP.2017.8122114","url":null,"abstract":"In recent years, there has been an increasing growth of using vision-based systems for tracking the players in team sports to evaluate and enhance their performance. Vision-based player tracking has high computational demands since it requires processing of a huge amount of video data based on the utilization of multiple cameras with high resolution and high frame rates. In this paper, we present a reconfigurable system to track the players in indoor sports automatically without user interaction. The proposed system can process live video data streams from multiple cameras as well as offline data from recorded video files. FPGA technology is used to accelerate this player tracking system by implementing the video acquisition, video preprocessing, player segmentation, and team identification & player detection modules in hardware, realizing a real-time system. The teams are identified and the players' positions are detected based on the colors of their jerseys. The detection results are sent from the FPGA to the host-PC where the players are tracked. Our results show that the achieved average player detection rate is up to 95.5%. The proposed system can process live video data using two GigE Vision cameras with a resolution of 1392×1040 pixels and 30 fps for each camera. A speed-up of 20 is achieved compared to an OpenCV-based software implementation on a host-PC equipped with a 2.93 GHz Intel i7 CPU.","PeriodicalId":6637,"journal":{"name":"2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)","volume":"28 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83509331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

An overflow free fixed-point eigenvalue decomposition algorithm: Case study of dimensionality reduction in hyperspectral images 一种无溢出不动点特征值分解算法:高光谱图像降维的实例研究

2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)

Pub Date : 2017-09-01 DOI: 10.1109/DASIP.2017.8122131

Bibek Kabi, Anand S. Sahadevan, Tapan Pradhan

We consider the problem of enabling robust range estimation of eigenvalue decomposition (EVD) algorithm for a reliable fixed-point design. The simplicity of fixed-point circuitry has always been so tempting to implement EVD algorithms in fixed-point arithmetic. Working towards an effective fixed-point design, integer bit-width allocation is a significant step which has a crucial impact on accuracy and hardware efficiency. This paper investigates the shortcomings of the existing range estimation methods while deriving bounds for the variables of the EVD algorithm. In light of the circumstances, we introduce a range estimation approach based on vector and matrix norm properties together with a scaling procedure that maintains all the assets of an analytical method. The method could derive robust and tight bounds for the variables of EVD algorithm. The bounds derived using the proposed approach remain same for any input matrix and are also independent of the number of iterations or size of the problem. Some benchmark hyperspectral data sets have been used to evaluate the efficiency of the proposed technique. It was found that by the proposed range estimation approach, all the variables generated during the computation of Jacobi EVD is bounded within ±1.

我们考虑了一个可靠不动点设计的特征值分解(EVD)算法的鲁棒范围估计问题。定点电路的简单性一直是在定点算法中实现EVD算法的诱惑。为了实现有效的定点设计，整数位宽分配是影响精度和硬件效率的重要步骤。本文分析了现有距离估计方法的不足，推导了EVD算法的变量界。鉴于这种情况，我们引入了一种基于向量和矩阵范数性质的距离估计方法，以及一种保持分析方法所有资产的缩放程序。该方法可以得到EVD算法变量的鲁棒性和紧性边界。使用该方法导出的边界对于任何输入矩阵都是相同的，并且与问题的迭代次数或大小无关。一些基准的高光谱数据集被用来评估所提出的技术的效率。结果表明，采用所提出的极差估计方法，Jacobi EVD计算过程中产生的所有变量都在±1以内有界。

{"title":"An overflow free fixed-point eigenvalue decomposition algorithm: Case study of dimensionality reduction in hyperspectral images","authors":"Bibek Kabi, Anand S. Sahadevan, Tapan Pradhan","doi":"10.1109/DASIP.2017.8122131","DOIUrl":"https://doi.org/10.1109/DASIP.2017.8122131","url":null,"abstract":"We consider the problem of enabling robust range estimation of eigenvalue decomposition (EVD) algorithm for a reliable fixed-point design. The simplicity of fixed-point circuitry has always been so tempting to implement EVD algorithms in fixed-point arithmetic. Working towards an effective fixed-point design, integer bit-width allocation is a significant step which has a crucial impact on accuracy and hardware efficiency. This paper investigates the shortcomings of the existing range estimation methods while deriving bounds for the variables of the EVD algorithm. In light of the circumstances, we introduce a range estimation approach based on vector and matrix norm properties together with a scaling procedure that maintains all the assets of an analytical method. The method could derive robust and tight bounds for the variables of EVD algorithm. The bounds derived using the proposed approach remain same for any input matrix and are also independent of the number of iterations or size of the problem. Some benchmark hyperspectral data sets have been used to evaluate the efficiency of the proposed technique. It was found that by the proposed range estimation approach, all the variables generated during the computation of Jacobi EVD is bounded within ±1.","PeriodicalId":6637,"journal":{"name":"2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)","volume":"15 1","pages":"1-9"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79128930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Demonstrator of a fingerprint recognition algorithm into a low-power microcontroller 在低功耗微控制器上演示指纹识别算法

2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)

Pub Date : 2017-09-01 DOI: 10.1109/DASIP.2017.8122121

Javier Arcenegui, Rosario Arjona, I. Baturone

A demonstrator has been developed to illustrate the performance of a lightweight fingerprint recognition algorithm based on the feature QFingerMap16, which is extracted from a window of the directional image centered at the convex core of the fingerprint. The algorithm has been implemented into a low-power ARM Cortex-M3 microcontroller included in a Texas Instruments LaunchPad CC2650 evaluation kit. It has been also implemented in a Raspberry Pi 2 so as to show the results obtained at the successive steps of the recognition process with the aid of a Graphical User Interface (GUI). The algorithm offers a good tradeoff between power consumption and recognition accuracy, being suitable for authentication on wearables.

本文开发了一种基于QFingerMap16特征的轻量级指纹识别算法，该特征是从指纹凸核为中心的方向图像窗口中提取的。该算法已在德州仪器(Texas Instruments) LaunchPad CC2650评估套件中的低功耗ARM Cortex-M3微控制器中实现。它还在Raspberry Pi 2中实现，以便在图形用户界面(GUI)的帮助下显示识别过程的连续步骤所获得的结果。该算法在功耗和识别精度之间进行了很好的权衡，适用于可穿戴设备的身份验证。

引用次数: 3

Efficient hardware acceleration for approximate inference of bitwise deep neural networks 位深度神经网络近似推理的高效硬件加速

2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)

Pub Date : 2017-09-01 DOI: 10.1109/DASIP.2017.8122127

Sebastian Vogel, A. Guntoro, G. Ascheid

In recent years, Deep Neural Networks (DNNs) have been of special interest in the area of image processing and scene perception. Albeit being effective and accurate, DNNs demand challenging computational resources. Fortunately, dedicated low bitwidth accelerators enable efficient, real-time inference of DNNs. We present an approximate evaluation method and a specialized multiplierless accelerator for the recently proposed bitwise DNNs. Our approximate evaluation method is based on the speculative recomputation of selective parts of a bitwise neural network. The selection is based on the intermediate results of a previous input evaluation. In context with limited energy budgets, our method and accelerator enable a fast, power efficient, first decision. If necessary, a reliable and accurate output is available after reevaluating the input data multiple times in an approximate manner. Our experiments on the GTSRB and CIFAR-10 dataset show that this approach results in no loss of classification performance in comparison with floating-point evaluation. Our work contributes to efficient inference of neural networks on power-constrained embedded devices.

近年来，深度神经网络(dnn)在图像处理和场景感知领域受到了广泛关注。尽管深度神经网络是有效和准确的，但它需要具有挑战性的计算资源。幸运的是，专用的低位宽加速器可以实现dnn的高效实时推断。针对最近提出的位深度神经网络，我们提出了一种近似评估方法和专门的无乘法器加速器。我们的近似评估方法是基于对位神经网络的选择部分的推测性重新计算。选择基于先前输入评估的中间结果。在能源预算有限的情况下，我们的方法和加速器能够快速，节能，首先做出决定。如有必要，在以近似方式多次重新评估输入数据后，可获得可靠和准确的输出。我们在GTSRB和CIFAR-10数据集上的实验表明，与浮点计算相比，这种方法不会导致分类性能的损失。我们的工作有助于神经网络在功率受限的嵌入式设备上的高效推理。

{"title":"Efficient hardware acceleration for approximate inference of bitwise deep neural networks","authors":"Sebastian Vogel, A. Guntoro, G. Ascheid","doi":"10.1109/DASIP.2017.8122127","DOIUrl":"https://doi.org/10.1109/DASIP.2017.8122127","url":null,"abstract":"In recent years, Deep Neural Networks (DNNs) have been of special interest in the area of image processing and scene perception. Albeit being effective and accurate, DNNs demand challenging computational resources. Fortunately, dedicated low bitwidth accelerators enable efficient, real-time inference of DNNs. We present an approximate evaluation method and a specialized multiplierless accelerator for the recently proposed bitwise DNNs. Our approximate evaluation method is based on the speculative recomputation of selective parts of a bitwise neural network. The selection is based on the intermediate results of a previous input evaluation. In context with limited energy budgets, our method and accelerator enable a fast, power efficient, first decision. If necessary, a reliable and accurate output is available after reevaluating the input data multiple times in an approximate manner. Our experiments on the GTSRB and CIFAR-10 dataset show that this approach results in no loss of classification performance in comparison with floating-point evaluation. Our work contributes to efficient inference of neural networks on power-constrained embedded devices.","PeriodicalId":6637,"journal":{"name":"2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)","volume":"104 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83412995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2017 Conference on Design and Architectures for Signal and Image Processing (DASIP)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀