2003 IEEE International Workshop on Computer Architectures for Machine Perception最新文献

英文中文

Evaluating color instruction set extension for real-time vector quantization 评估实时矢量量化的颜色指令集扩展

2003 IEEE International Workshop on Computer Architectures for Machine Perception

Pub Date : 2003-05-12 DOI: 10.1109/CAMP.2003.1598156

J. Kim, D. S. Wills

Vector quantization (VQ) is widely used for color image and video compression. However, its high computational overhead prohibits many applications in real-time systems. This paper presents a novel method to accelerate full-search VQ algorithm by adding quantized color pack extension (QCPX) instruction set architecture (ISA). QCPX not only supports a packed 16-bit YCbCr data format but also obtains performance and code density improvements through three-color pixels in parallel in a 16-bit width. To measure execution performance of the QCPX instruction set architecture (ISA), it is evaluated in a SIMD pixel array platform developed at Georgia Tech. In addition, by varying the grain size (pixel per processing element, PPE), this study can fully measure the impact of QCPX in the presence of different levels of data parallelism. Simulation results indicate that QCPX version achieves speedups from 27% to 297% over non-QCPX with the most impressive improvements >200 % occurring above the communication-bound 16 PPE granularity. QCPX also reduces average PE idle cycles by 45%. QCPX can be incorporated in range of architectures from current ILP processors to future massively data parallel machines

矢量量化在彩色图像和视频压缩中有着广泛的应用。然而，它的高计算开销阻碍了实时系统中的许多应用。本文提出了一种通过添加量化色包扩展(QCPX)指令集架构(ISA)来加速全搜索VQ算法的新方法。QCPX不仅支持封装的16位YCbCr数据格式，而且还通过16位宽度的三色像素并行获得性能和代码密度改进。为了衡量QCPX指令集架构(ISA)的执行性能，在乔治亚理工学院开发的SIMD像素阵列平台上对其进行了评估。此外，通过改变粒度(每个处理元素像素，PPE)，本研究可以充分衡量QCPX在不同数据并行性水平下的影响。仿真结果表明，QCPX版本比非QCPX版本实现了27%到297%的速度提升，其中最令人印象深刻的改进> 200%发生在通信绑定16 PPE粒度以上。QCPX还将PE空闲周期平均减少了45%。QCPX可以集成到从当前的ILP处理器到未来的大规模数据并行机的各种架构中

{"title":"Evaluating color instruction set extension for real-time vector quantization","authors":"J. Kim, D. S. Wills","doi":"10.1109/CAMP.2003.1598156","DOIUrl":"https://doi.org/10.1109/CAMP.2003.1598156","url":null,"abstract":"Vector quantization (VQ) is widely used for color image and video compression. However, its high computational overhead prohibits many applications in real-time systems. This paper presents a novel method to accelerate full-search VQ algorithm by adding quantized color pack extension (QCPX) instruction set architecture (ISA). QCPX not only supports a packed 16-bit YCbCr data format but also obtains performance and code density improvements through three-color pixels in parallel in a 16-bit width. To measure execution performance of the QCPX instruction set architecture (ISA), it is evaluated in a SIMD pixel array platform developed at Georgia Tech. In addition, by varying the grain size (pixel per processing element, PPE), this study can fully measure the impact of QCPX in the presence of different levels of data parallelism. Simulation results indicate that QCPX version achieves speedups from 27% to 297% over non-QCPX with the most impressive improvements >200 % occurring above the communication-bound 16 PPE granularity. QCPX also reduces average PE idle cycles by 45%. QCPX can be incorporated in range of architectures from current ILP processors to future massively data parallel machines","PeriodicalId":443821,"journal":{"name":"2003 IEEE International Workshop on Computer Architectures for Machine Perception","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129065228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Brain-like computer architecture 类脑计算机结构

2003 IEEE International Workshop on Computer Architectures for Machine Perception

Pub Date : 2003-05-12 DOI: 10.1109/CAMP.2003.1598144

A. A. Cohen

This paper is concerned with novel approaches to solve performance and addressing problems inherent in computer systems. The core of the neuron-like processing machine (NLPM) comprised of a unique type of computer architecture for constructing massively parallel processing machines, with unique types of pattern recognition system and addressing mechanisms that enhances the performance for identification and retrieval of known data patterns. The architecture of the NLPM has neuronal nodes bundled as groups fully interconnected and structured similar to the human brain, therefore the machine is named, neuron like processing machine (NLPM). The paper describes some of the novel architecture consisting of hierarchical structures comprising processing nodes called super neurons (SN), positioned on geographical maps in which selected nodes are grouped together to form structures of dedicated pattern units (PU) for solving generic pattern recognition problems. The paper gives some description of the pattern unit and NLPM architecture. The NLPM makes connections between pattern unit processing nodes to solve any type of pattern recognition and identification tasks

本文关注的是解决性能和解决计算机系统固有问题的新方法。神经元样处理机器(NLPM)的核心由一种独特的计算机体系结构组成，用于构建大规模并行处理机器，具有独特的模式识别系统和寻址机制，可增强识别和检索已知数据模式的性能。NLPM的结构将神经元节点捆绑成一组，完全相互连接，结构类似于人类大脑，因此机器被命名为神经元样处理机(NLPM)。本文描述了一些由分层结构组成的新架构，该结构由称为超级神经元(SN)的处理节点组成，这些处理节点位于地理地图上，其中选定的节点被分组在一起，形成专用模式单元(PU)结构，用于解决通用模式识别问题。本文给出了模式单元和NLPM体系结构的一些描述。NLPM在模式单元处理节点之间建立连接，以解决任何类型的模式识别和识别任务

引用次数: 1

Design and implementation of a smart strain gage conditioner 一种智能应变计调节器的设计与实现

2003 IEEE International Workshop on Computer Architectures for Machine Perception

Pub Date : 2003-05-12 DOI: 10.1109/CAMP.2003.1598163

S. Poussier, H. Rabah, S. Weber

This paper presents a new design and implementation of a system on a programmable chip (SOPC) for smart strain gage conditioner. The system is designed to meet flexibility and complex computations required in thermal compensation algorithms of strain gage. To satisfy the real-time processing constraints in one hand, and parameterization in another hand, parts of the algorithms are implemented in hardware and others are implemented in software. Theses architectures are implemented on a field programmable gate array (FPGA) including a core processor. Five methodologies are developed for the thermal compensation. The first is the classical technique usually used. The second is based on Lagrange interpolation. The third is based on the Newton iteration algorithm. The fourth is based on Neville-Aitken recurrence algorithm. The last is based on the spline interpolation algorithm. Implantations techniques and experimental results are given

本文提出了一种基于可编程芯片(SOPC)的智能应变片调节器系统的新设计与实现。该系统是为满足应变片热补偿算法的灵活性和复杂性而设计的。为了满足实时处理约束和参数化要求，部分算法采用硬件实现，其余算法采用软件实现。这些架构是在包含核心处理器的现场可编程门阵列(FPGA)上实现的。提出了热补偿的五种方法。第一种是常用的经典技巧。第二种是基于拉格朗日插值。第三种是基于牛顿迭代算法。第四种是基于Neville-Aitken递归算法。最后一种是基于样条插值算法。给出了植入技术和实验结果

引用次数: 0

The task "template tracking" in a sensor dedicated to active vision 任务“模板跟踪”的传感器专用于主动视觉

2003 IEEE International Workshop on Computer Architectures for Machine Perception

Pub Date : 2003-05-12 DOI: 10.1109/CAMP.2003.1598177

P. Chalimbaud, F. Berry, P. Martinet

In this paper, we present a "visual task" which can be considered as a part of a active vision sensor. This task consists in a tracking of gray levels windows of interest. Our approach is based on an efficient matching between hardware architecture and software algorithm. The notion of active detector is introduced in order to take into account the adaptive and local aspect of the processing. To validate our approach, a high speed tracking method based on a CMOS sensor and FPGA is presented. According to the size of the window, the acquisition rate varies from 200 fr/s to 1000 fr/s

在本文中，我们提出了一个“视觉任务”，它可以被认为是主动视觉传感器的一部分。该任务包括跟踪感兴趣的灰度窗口。我们的方法是基于硬件架构和软件算法之间的有效匹配。为了考虑处理的自适应和局部性，引入了主动检测器的概念。为了验证我们的方法，提出了一种基于CMOS传感器和FPGA的高速跟踪方法。根据窗口的大小，采集速率在200fr /s ~ 1000fr /s之间变化

引用次数: 5

VLSI architecture for video-assisted global positioning 视频辅助全球定位的VLSI架构

2003 IEEE International Workshop on Computer Architectures for Machine Perception

Pub Date : 2003-05-12 DOI: 10.1109/CAMP.2003.1598170

A. Utgikar, G. Seetharaman, H. Le

We design and implement an efficient architecture for geometric computation of the global position of an airborne video camera from images of known landmarks. A solution based on this analysis, a robust Hough transform-like method facilitated by a class of CORDIC-structured computations is implemented within the framework of terrain navigation. It empowers aerial surveillance systems to navigate effectively when the global position and inertial navigation sensors are out of order. This is particularly useful when the GPS functionality is disrupted by jamming and other techniques. Our architecture exploits parallelism among independent operations and uses pipelining of critical components for superior performance. Double precision division being computationally expensive is performed minimally. Correlation between data is tapped to reduce complexity of flash ADCs, at the cost of few clock cycles once to initialize Hough voting

我们设计并实现了一种高效的架构，用于从已知地标图像中计算机载摄像机的全局位置。在此基础上，在地形导航的框架内实现了一种鲁棒的类霍夫变换方法，该方法由一类cordic结构计算实现。它使空中监视系统能够在全球定位和惯性导航传感器失灵时有效导航。当GPS功能被干扰和其他技术破坏时，这是特别有用的。我们的架构利用独立操作之间的并行性，并使用关键组件的流水线来实现卓越的性能。双精度除法在计算上的开销是最小的。利用数据之间的相关性来降低闪存adc的复杂性，以初始化霍夫投票的几个时钟周期为代价

引用次数: 0

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2003 IEEE International Workshop on Computer Architectures for Machine Perception

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀