Motion estimation algorithms on fine grain array processors

Proceedings The International Conference on Application Specific Array Processors Pub Date : 1995-07-24 DOI:10.1109/ASAP.1995.522924

Heung-Nam Kim, M. J. Irwin, R. Owens

{"title":"Motion estimation algorithms on fine grain array processors","authors":"Heung-Nam Kim, M. J. Irwin, R. Owens","doi":"10.1109/ASAP.1995.522924","DOIUrl":null,"url":null,"abstract":"Motion estimation plays a key role in video coding, (e.g., video telephone, MPEG, HDTV). Among the previous motion estimation algorithms, full-search block matching algorithms (BMA) are preferred because of their simplicity and lower control overhead when those algorithms are implemented in VLSI array processors. Previous full-search BMAs have considered one block matching at a time. There exist, however shared data in the search areas for adjacent template blocks. Therefore, if we process adjacent template blocks in parallel, we can reduce the data memory accesses for the shared data. In this paper we propose a new dataflow scheme for the efficient, systolic, full-search BMA on programmable array processors so that we can process as many adjacent template blocks as possible in unison in order to reduce the data memory accesses. We present an efficient implementation of the BMA on the Micro Grained Array Processor (MGAP) which is a fine-grained mesh-connected programmable VLSI array processor being developed at Penn State University. As a result, the BMA for the MPEG SIF video format (352/spl times/240 pixels) with a block size of 16/spl times/16 pixels, displacement range of 16 pixels, frame rate of 30 frames/sec can be computed at a real time processing rate on the MGAP.","PeriodicalId":354358,"journal":{"name":"Proceedings The International Conference on Application Specific Array Processors","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings The International Conference on Application Specific Array Processors","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASAP.1995.522924","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Motion estimation plays a key role in video coding, (e.g., video telephone, MPEG, HDTV). Among the previous motion estimation algorithms, full-search block matching algorithms (BMA) are preferred because of their simplicity and lower control overhead when those algorithms are implemented in VLSI array processors. Previous full-search BMAs have considered one block matching at a time. There exist, however shared data in the search areas for adjacent template blocks. Therefore, if we process adjacent template blocks in parallel, we can reduce the data memory accesses for the shared data. In this paper we propose a new dataflow scheme for the efficient, systolic, full-search BMA on programmable array processors so that we can process as many adjacent template blocks as possible in unison in order to reduce the data memory accesses. We present an efficient implementation of the BMA on the Micro Grained Array Processor (MGAP) which is a fine-grained mesh-connected programmable VLSI array processor being developed at Penn State University. As a result, the BMA for the MPEG SIF video format (352/spl times/240 pixels) with a block size of 16/spl times/16 pixels, displacement range of 16 pixels, frame rate of 30 frames/sec can be computed at a real time processing rate on the MGAP.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于细粒度阵列处理器的运动估计算法

运动估计在视频编码(如视频电话、MPEG、HDTV)中起着关键作用。在先前的运动估计算法中，全搜索块匹配算法(BMA)由于其简单和较低的控制开销而在VLSI阵列处理器中实现。以前的全搜索bma每次只考虑一个块匹配。然而，在相邻模板块的搜索区域中存在共享数据。因此，如果我们并行处理相邻的模板块，我们可以减少对共享数据的数据内存访问。在本文中，我们提出了一种新的数据流方案，用于在可编程阵列处理器上高效、收缩、全搜索的BMA，以便我们可以同时处理尽可能多的相邻模板块，以减少数据内存访问。我们提出了一种在微粒度阵列处理器(MGAP)上有效实现BMA的方法，MGAP是宾夕法尼亚州立大学正在开发的细粒度网格连接可编程VLSI阵列处理器。因此，在MGAP上可以计算出块大小为16/spl次/16像素、位移范围为16像素、帧率为30帧/秒的MPEG SIF视频格式(352/spl次/240像素)的BMA。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings The International Conference on Application Specific Array Processors

自引率

0.00%

发文量

期刊最新文献

Techniques for yield enhancement of VLSI adders The naive execution of affine recurrence equations A solid translation engine using ray representation Synthesis of multirate VLSI arrays Systolic filter for fast DNA similarity search