Pub Date : 2004-04-01DOI: 10.1016/j.rti.2004.02.004
S. Paschalakis, M. Bober
{"title":"Real-time face detection and tracking for mobile videoconferencing","authors":"S. Paschalakis, M. Bober","doi":"10.1016/j.rti.2004.02.004","DOIUrl":"https://doi.org/10.1016/j.rti.2004.02.004","url":null,"abstract":"","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"34 1","pages":"81-94"},"PeriodicalIF":0.0,"publicationDate":"2004-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80286511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2004-04-01DOI: 10.1016/j.rti.2004.02.003
Chinchen Chang, I. Lin
{"title":"Novel full-search schemes for speeding up image coding using vector quantization","authors":"Chinchen Chang, I. Lin","doi":"10.1016/j.rti.2004.02.003","DOIUrl":"https://doi.org/10.1016/j.rti.2004.02.003","url":null,"abstract":"","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"2 1","pages":"95-102"},"PeriodicalIF":0.0,"publicationDate":"2004-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89824504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Complexity localisation in the reference frame is the key process for the derivation of efficient scanning orders for motion estimation. The more localised the complexity is, the more computationally efficient scanning orders can be derived for reduced cost motion estimation algorithms. However, this processes entails serious pre-processing overhead which may render it unsuitable for real time video coding systems. In this paper, we propose three low complexity scanning orders of similar performance that are very competitive in terms of the operation count ratio metric with respect to the MPEG-2 raster scan order, show improvements of 7.14% on the average with respect to the number of examined macroblock rows metric and they also show an increase in the speed-up ratio of 0.12 on the average as compared to the standard. As compared to other work in the literature, the proposed scanning orders require one fourth of the operation count ratio and show an increase in the speed-up ratio of 45 times on the average.
{"title":"Three novel low complexity scanning orders for MPEG-2 full search motion estimation","authors":"Christos Grecos, Azilah Saparon, Vassilis Chouliaras","doi":"10.1016/j.rti.2004.02.001","DOIUrl":"10.1016/j.rti.2004.02.001","url":null,"abstract":"<div><p><span>Complexity localisation in the reference frame is the key process for the derivation of efficient scanning orders for motion estimation. The more localised the complexity is, the more computationally efficient scanning orders can be derived for reduced cost </span>motion estimation algorithms. However, this processes entails serious pre-processing overhead which may render it unsuitable for real time video coding systems. In this paper, we propose three low complexity scanning orders of similar performance that are very competitive in terms of the operation count ratio metric with respect to the MPEG-2 raster scan order, show improvements of 7.14% on the average with respect to the number of examined macroblock rows metric and they also show an increase in the speed-up ratio of 0.12 on the average as compared to the standard. As compared to other work in the literature, the proposed scanning orders require one fourth of the operation count ratio and show an increase in the speed-up ratio of 45 times on the average.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"10 1","pages":"Pages 53-65"},"PeriodicalIF":0.0,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2004.02.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81419034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2004-02-01DOI: 10.1016/j.rti.2003.09.014
Shou-Yi Tseng
This paper proposes two frame-based adaptive thresholding algorithms for reducing the amount of computation involved in block-based motion estimation in the real-time and low-rate video coders. The proposed algorithms use an adaptive threshold for each of the frames to prejudge if each of the block matching computation is worthwhile. Based on the difference between the current and reference frames, the first algorithm determines a threshold in terms of the optimal trade-off between run-time and distortion, and the second algorithm determines a threshold according to a user specified percentage of total number of blocks. The experimental results demonstrate that the proposed algorithms can significantly reduce the amount of computation compared to the previous algorithms, while almost fully maintaining the quality of the reconstructed image.
{"title":"Motion estimation using a frame-based adaptive thresholding approach","authors":"Shou-Yi Tseng","doi":"10.1016/j.rti.2003.09.014","DOIUrl":"10.1016/j.rti.2003.09.014","url":null,"abstract":"<div><p>This paper proposes two frame-based adaptive thresholding algorithms for reducing the amount of computation involved in block-based motion estimation in the real-time and low-rate video coders. The proposed algorithms use an adaptive threshold for each of the frames to prejudge if each of the block matching computation is worthwhile. Based on the difference between the current and reference frames, the first algorithm determines a threshold in terms of the optimal trade-off between run-time and distortion, and the second algorithm determines a threshold according to a user specified percentage of total number of blocks. The experimental results demonstrate that the proposed algorithms can significantly reduce the amount of computation compared to the previous algorithms, while almost fully maintaining the quality of the reconstructed image.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"10 1","pages":"Pages 1-7"},"PeriodicalIF":0.0,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2003.09.014","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85691346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2004-02-01DOI: 10.1016/j.rti.2003.12.001
N. Sudha
This paper presents a new hardware design for a neural network based colour image compression. The compressed image consists of a colour palette containing few best colours and the coded image. Kohonen's map neural network is applied to construct the colour palette and the coded image, both forming the compressed image. The Kohonen's map based compression results in linear time complexity (in the size of the image). It is advantageous over traditional JPEG in colour quantization applications and compression of images with limited colours. The architecture of the hardware unit is based on single instruction multiple data methodology. The architecture has been implemented in an application specific integrated circuit and results show that the proposed design achieves high speed allowing inputs at a video rate for compression of images up to size of 512×512 with low area requirement.
{"title":"An ASIC implementation of Kohonen's map based colour image compression","authors":"N. Sudha","doi":"10.1016/j.rti.2003.12.001","DOIUrl":"10.1016/j.rti.2003.12.001","url":null,"abstract":"<div><p><span><span>This paper presents a new hardware design for a neural network based colour image compression. The compressed image consists of a </span>colour palette<span> containing few best colours and the coded image. Kohonen's map neural network is applied to construct the colour palette and the coded image, both forming the compressed image. The Kohonen's map based compression results in linear time complexity (in the size of the image). It is advantageous over traditional JPEG in colour quantization applications and compression of images with limited colours. The architecture of the hardware unit is based on </span></span>single instruction multiple data methodology. The architecture has been implemented in an application specific integrated circuit and results show that the proposed design achieves high speed allowing inputs at a video rate for compression of images up to size of 512×512 with low area requirement.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"10 1","pages":"Pages 31-39"},"PeriodicalIF":0.0,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2003.12.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122657942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2004-02-01DOI: 10.1016/j.rti.2003.11.001
Maojun Zhang, Nicolas D. Georganas
A color correction method for balancing the color appearances, among a group of images, about a specified object or scene, such as panoramic images and object movies, is developed and tested in this paper. In order to increase the running speed of color correction and reduce the out-of-gamut color pixels, we introduce the selection of principal regions. The average color values of principal regions are used to construct the low-degree (up to degree two) polynomial mapping functions from the source images to the corrected images. The functions are run in the decorrelated color spaces. Our method is tested using real and synthetic images. The results of these tests show the proposed method can get a better performance than other existing methods.
{"title":"Fast color correction using principal regions mapping in different color spaces","authors":"Maojun Zhang, Nicolas D. Georganas","doi":"10.1016/j.rti.2003.11.001","DOIUrl":"10.1016/j.rti.2003.11.001","url":null,"abstract":"<div><p>A color correction method for balancing the color appearances, among a group of images, about a specified object or scene, such as panoramic images and object movies, is developed and tested in this paper. In order to increase the running speed of color correction and reduce the out-of-gamut color pixels, we introduce the selection of principal regions. The average color values of principal regions are used to construct the low-degree (up to degree two) polynomial mapping functions from the source images to the corrected images. The functions are run in the decorrelated color spaces. Our method is tested using real and synthetic images. The results of these tests show the proposed method can get a better performance than other existing methods.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"10 1","pages":"Pages 23-30"},"PeriodicalIF":0.0,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2003.11.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80034584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents two robust algorithms with respect to global contrast changes: one detects changes; and the other detects stationary people or objects in image sequences obtained via a fixed camera. The first one is based on a level set representation of images and exploits their suitable properties under image contrast variation. The second makes use of the first, at different time scales, to allow discriminating between the scene background, the moving parts and stationarities. This latter algorithm is justified by and tested in real-life situations; the detection of abnormal stationarities in public transit settings, e.g. subway corridors, will be presented herein with assessments carried out on a large number of real-life situations.
{"title":"Time-scale change detection applied to real-time abnormal stationarity monitoring","authors":"Didier Aubert , Frédéric Guichard , Samia Bouchafa","doi":"10.1016/j.rti.2003.10.001","DOIUrl":"10.1016/j.rti.2003.10.001","url":null,"abstract":"<div><p>This paper presents two robust algorithms with respect to global contrast changes: one detects changes; and the other detects stationary people or objects in image sequences obtained via a fixed camera. The first one is based on a level set representation of images and exploits their suitable properties under image contrast variation. The second makes use of the first, at different time scales, to allow discriminating between the scene background, the moving parts and stationarities<span>. This latter algorithm is justified by and tested in real-life situations; the detection of abnormal stationarities in public transit settings, e.g. subway corridors, will be presented herein with assessments carried out on a large number of real-life situations.</span></p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"10 1","pages":"Pages 9-22"},"PeriodicalIF":0.0,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2003.10.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76241224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2004-02-01DOI: 10.1016/j.rti.2003.12.002
Andrés Bruhn , Tobias Jakob , Markus Fischer , Timo Kohlberger , Joachim Weickert , Ulrich Brüning , Christoph Schnörr
This paper deals with parallelization and implementation aspects of partial differential equation (PDE)-based image processing models for large cluster environments with distributed memory. As an example we focus on nonlinear diffusion filtering which we discretize by means of an additive operator splitting (AOS). We start by decomposing the algorithm into small modules that shall be parallelized separately. For this purpose image partitioning strategies are discussed and their impact on the communication pattern and volume is analyzed. Based on the results we develop an algorithmic implementation with excellent scaling properties on massively connected low-latency networks. Test runs on two different high-end Myrinet clusters yield almost linear speedup factors up to 209 for 256 processors. This results in typical denoising times of for five iterations on a 256×256×128 data cube.
{"title":"High performance cluster computing with 3-D nonlinear diffusion filters","authors":"Andrés Bruhn , Tobias Jakob , Markus Fischer , Timo Kohlberger , Joachim Weickert , Ulrich Brüning , Christoph Schnörr","doi":"10.1016/j.rti.2003.12.002","DOIUrl":"10.1016/j.rti.2003.12.002","url":null,"abstract":"<div><p><span>This paper deals with parallelization<span> and implementation aspects of partial differential equation (PDE)-based image processing models for large cluster environments with distributed memory. As an example we focus on nonlinear diffusion filtering<span> which we discretize by means of an additive operator splitting (AOS). We start by decomposing the algorithm into small modules that shall be parallelized separately. For this purpose image partitioning strategies are discussed and their impact on the communication pattern and volume is analyzed. Based on the results we develop an algorithmic implementation with excellent scaling properties on massively connected low-latency networks. Test runs on two different high-end Myrinet clusters yield almost linear speedup factors up to 209 for 256 processors. This results in typical denoising times of </span></span></span><span><math><mtext>0.4</mtext><mspace></mspace><mtext>s</mtext></math></span> for five iterations on a 256×256×128 data cube.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"10 1","pages":"Pages 41-51"},"PeriodicalIF":0.0,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2003.12.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84653696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2004-02-01DOI: 10.1016/S1077-2014(04)00021-X
{"title":"Call for papers - use file on ftp site- YRTIM_v10i1_ann.pdf","authors":"","doi":"10.1016/S1077-2014(04)00021-X","DOIUrl":"https://doi.org/10.1016/S1077-2014(04)00021-X","url":null,"abstract":"","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"10 1","pages":"Pages I-II"},"PeriodicalIF":0.0,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1077-2014(04)00021-X","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137422972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2003-12-01DOI: 10.1016/j.rti.2003.09.017
Johannes Fürtler, Konrad J. Mayer, Werner Krattenthaler, Ivan Bajla
Although the hardware platform is often seen as the most important element of real-time imaging systems, software optimization can also provide remarkable reduction of overall computational costs. The recommended code development flow for digital signal processors based on the TMS320C6000TM architecture usually involves three phases: development of C code, refinement of C code, and programming linear assembly code. Each step requires a different level of knowledge of processor internals. The developer is not directly involved in the automatic scheduling process. In some cases, however, this may result in unacceptable code performance. A better solution can be achieved by scheduling the assembly code by hand. Unfortunately, scheduling of software pipelines by hand not only requires expert skills but is also time consuming, and moreover, prone to errors. To overcome these drawbacks we have designed an innovative development tool—the Software Pipeline Optimization Tool (SPOTTM). The SPOT is based on visualization of the scheduled assembly code by a two-dimensional interactive schedule editor, which is equipped with feedback mechanisms deduced from analysis of data dependencies and resource allocation conflicts. The paper addresses optimization techniques available by the application of the SPOT. Furthermore, the benefit of the SPOT is documented by more than 20 optimized image processing algorithms.
{"title":"SPOT—Development tool for software pipeline optimization for VLIW-DSPs used in real-time image processing","authors":"Johannes Fürtler, Konrad J. Mayer, Werner Krattenthaler, Ivan Bajla","doi":"10.1016/j.rti.2003.09.017","DOIUrl":"10.1016/j.rti.2003.09.017","url":null,"abstract":"<div><p><span>Although the hardware platform is often seen as the most important element of real-time imaging systems<span>, software optimization can also provide remarkable reduction of overall computational costs. The recommended code development flow for digital signal processors based on the TMS320C6000</span></span><sup>TM</sup> architecture usually involves three phases: development of C code, refinement of C code, and programming linear assembly code. Each step requires a different level of knowledge of processor internals. The developer is not directly involved in the automatic scheduling process. In some cases, however, this may result in unacceptable code performance. A better solution can be achieved by scheduling the assembly code by hand. Unfortunately, scheduling of software pipelines by hand not only requires expert skills but is also time consuming, and moreover, prone to errors. To overcome these drawbacks we have designed an innovative development tool—the Software Pipeline Optimization Tool (SPOT<sup>TM</sup><span>). The SPOT is based on visualization of the scheduled assembly code by a two-dimensional interactive schedule editor, which is equipped with feedback mechanisms deduced from analysis of data dependencies<span> and resource allocation conflicts. The paper addresses optimization techniques available by the application of the SPOT. Furthermore, the benefit of the SPOT is documented by more than 20 optimized image processing algorithms.</span></span></p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"9 6","pages":"Pages 387-399"},"PeriodicalIF":0.0,"publicationDate":"2003-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2003.09.017","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88488474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}