首页 > 最新文献

Real-Time Imaging最新文献

英文 中文
Volume Contents and Index 卷目目录及索引
Pub Date : 2005-10-01 DOI: 10.1016/S1077-2014(05)00083-5
{"title":"Volume Contents and Index","authors":"","doi":"10.1016/S1077-2014(05)00083-5","DOIUrl":"https://doi.org/10.1016/S1077-2014(05)00083-5","url":null,"abstract":"","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 5","pages":"Pages III-IX"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1077-2014(05)00083-5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138219523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum to the “Special Issue on Video Object Processing” “视频对象处理特刊”的勘误
Pub Date : 2005-10-01 DOI: 10.1016/j.rti.2005.08.001
{"title":"Erratum to the “Special Issue on Video Object Processing”","authors":"","doi":"10.1016/j.rti.2005.08.001","DOIUrl":"https://doi.org/10.1016/j.rti.2005.08.001","url":null,"abstract":"","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 5","pages":"Page 474"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.08.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138219524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast detection and impulsive noise removal in color images 彩色图像的快速检测和脉冲噪声去除
Pub Date : 2005-10-01 DOI: 10.1016/j.rti.2005.07.003
Bogdan Smolka, Andrzej Chydzinski

In this paper, a novel approach to the impulsive noise removal in color images is presented. The proposed technique employs the switching scheme based on the impulse detection mechanism using the so-called peer group concept. Compared to the vector median filter and other commonly used multichannel filters, the proposed technique consistently yields very good results in suppressing both the random and fixed-valued impulsive noise. The main advantage of the proposed noise detection framework is its enormous computational speed, which enables efficient filtering of color images in real-time applications.

本文提出了一种新的彩色图像脉冲噪声去除方法。该技术采用基于脉冲检测机制的交换方案,采用所谓的对等组概念。与矢量中值滤波器和其他常用的多通道滤波器相比,该技术在抑制随机和固定值脉冲噪声方面都取得了很好的效果。所提出的噪声检测框架的主要优点是其巨大的计算速度,能够在实时应用中有效地过滤彩色图像。
{"title":"Fast detection and impulsive noise removal in color images","authors":"Bogdan Smolka,&nbsp;Andrzej Chydzinski","doi":"10.1016/j.rti.2005.07.003","DOIUrl":"10.1016/j.rti.2005.07.003","url":null,"abstract":"<div><p>In this paper, a novel approach to the impulsive noise removal in color images is presented. The proposed technique employs the switching scheme based on the impulse detection mechanism using the so-called <em>peer group</em> concept. Compared to the <span><em>vector </em><em>median filter</em></span><span> and other commonly used multichannel filters, the proposed technique consistently yields very good results in suppressing both the random and fixed-valued impulsive noise. The main advantage of the proposed noise detection framework is its enormous computational speed, which enables efficient filtering of color images in real-time applications.</span></p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 5","pages":"Pages 389-402"},"PeriodicalIF":0.0,"publicationDate":"2005-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.07.003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83926176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 196
Modified intelligent scissors and adaptive frame skipping for video object segmentation 改进的智能剪刀和自适应跳帧视频对象分割
Pub Date : 2005-08-01 DOI: 10.1016/j.rti.2005.06.005
Yang Gaobo , Yu Shengfa

MPEG-4 introduces the concept of video object to support content-based functionalities. Video object segmentation is a crucial step for object-based coding and manipulation. In this paper, a robust semi- automatic video object segmentation scheme is proposed. To efficiently and accurately define the initial object contour, modified intelligent scissors is proposed on the basis of original intelligent scissors. It can improve about 6–8 times the processing speed with only slight sacrifice of accuracy, which just meets the requirements of initial object extraction for semi-automatic approach. To avoid errors accumulating and propagating during object tracking, an adaptive frame skipping scheme is proposed to decompose video sequence into video clips. For rigid and non-rigid video objects, two different image segmentation algorithms are utilized, and then region-based backward projection technique is adopted to interpolate the video object plane (VOPs) of other frames within every video clip. The proposed approach can cope with occlusion/disocclusion problem to most extent. Experimental results demonstrate the effectiveness and robustness of the method.

MPEG-4引入了视频对象的概念来支持基于内容的功能。视频对象分割是基于对象的编码和操作的关键步骤。本文提出了一种鲁棒的半自动视频目标分割方案。为了高效准确地定义初始目标轮廓,在原有智能剪子的基础上提出了改进的智能剪子。在精度稍有牺牲的情况下,可以将处理速度提高6-8倍左右,刚好满足半自动方法初始目标提取的要求。为了避免目标跟踪过程中误差的累积和传播,提出了一种自适应跳帧方案,将视频序列分解为视频片段。针对刚性和非刚性视频对象,采用两种不同的图像分割算法,然后采用基于区域的后向投影技术对每个视频片段内其他帧的视频对象平面(VOPs)进行插值。该方法可以在很大程度上处理咬合/去咬合问题。实验结果证明了该方法的有效性和鲁棒性。
{"title":"Modified intelligent scissors and adaptive frame skipping for video object segmentation","authors":"Yang Gaobo ,&nbsp;Yu Shengfa","doi":"10.1016/j.rti.2005.06.005","DOIUrl":"10.1016/j.rti.2005.06.005","url":null,"abstract":"<div><p>MPEG-4 introduces the concept of video object to support content-based functionalities. Video object segmentation is a crucial step for object-based coding and manipulation. In this paper, a robust semi- automatic video object segmentation scheme is proposed. To efficiently and accurately define the initial object contour<span>, modified intelligent scissors is proposed on the basis of original intelligent scissors. It can improve about 6–8 times the processing speed with only slight sacrifice of accuracy, which just meets the requirements of initial object extraction for semi-automatic approach. To avoid errors accumulating and propagating during object tracking, an adaptive frame skipping scheme is proposed to decompose video sequence into video clips. For rigid and non-rigid video objects, two different image segmentation<span> algorithms are utilized, and then region-based backward projection technique is adopted to interpolate the video object plane (VOPs) of other frames within every video clip. The proposed approach can cope with occlusion/disocclusion problem to most extent. Experimental results demonstrate the effectiveness and robustness of the method.</span></span></p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 4","pages":"Pages 310-322"},"PeriodicalIF":0.0,"publicationDate":"2005-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.06.005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83328241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Smart camera design for intensive embedded computing 针对密集嵌入式计算的智能摄像头设计
Pub Date : 2005-08-01 DOI: 10.1016/j.rti.2005.04.006
Barthélémy Heyrman , Michel Paindavoine , Renaud Schmit , Laurent Letellier , Thierry Collette

Computer-assisted vision plays an important role in our society, in various fields such as personal and goods safety, industrial production, telecommunications, robotics, etc. However, technical developments are still rare and slowed down by various factors linked to sensor cost, lack of system flexibility, difficulty of rapidly developing complex and robust applications, and lack of interaction among these systems themselves, or with their environment. This paper describes our proposal for a smart camera with real-time video processing capabilities. A CMOS sensor, processor and, reconfigurable unit associated in the same chip will allow scalability, flexibility, and high performance.

计算机辅助视觉在我们的社会中发挥着重要的作用,在各个领域,如个人和货物安全,工业生产,电信,机器人等。然而,由于传感器成本、缺乏系统灵活性、难以快速开发复杂和健壮的应用、以及这些系统本身之间或与其环境之间缺乏交互等各种因素的影响,技术发展仍然很少,而且速度很慢。本文介绍了一种具有实时视频处理能力的智能摄像机的设计方案。将CMOS传感器、处理器和可重构单元关联在同一芯片中,将实现可扩展性、灵活性和高性能。
{"title":"Smart camera design for intensive embedded computing","authors":"Barthélémy Heyrman ,&nbsp;Michel Paindavoine ,&nbsp;Renaud Schmit ,&nbsp;Laurent Letellier ,&nbsp;Thierry Collette","doi":"10.1016/j.rti.2005.04.006","DOIUrl":"10.1016/j.rti.2005.04.006","url":null,"abstract":"<div><p>Computer-assisted vision plays an important role in our society, in various fields such as personal and goods safety, industrial production, telecommunications, robotics, etc. However, technical developments are still rare and slowed down by various factors linked to sensor cost, lack of system flexibility, difficulty of rapidly developing complex and robust applications, and lack of interaction among these systems themselves, or with their environment. This paper describes our proposal for a smart camera with real-time video processing capabilities. A CMOS sensor, processor and, reconfigurable unit associated in the same chip will allow scalability, flexibility, and high performance.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 4","pages":"Pages 282-289"},"PeriodicalIF":0.0,"publicationDate":"2005-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.04.006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84137569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Towards real-time 3-D monocular visual tracking of human limbs in unconstrained environments 无约束环境下人体肢体实时三维单目跟踪研究
Pub Date : 2005-08-01 DOI: 10.1016/j.rti.2005.06.004
Dave Bullock , John Zelek

The 3-D visual tracking of human limbs is fundamental to a wide array of computer vision applications including gesture recognition, interactive entertainment, biomechanical analysis, vehicle driver monitoring, and electronic surveillance. The problem of limb tracking is complicated by issues of occlusion, depth ambiguities, rotational ambiguities, and high levels of noise caused by loose fitting clothing. We attempt to solve the 3-D limb tracking problem using only monocular imagery (a single 2-D video source) in largely unconstrained environments. The approach presented is a movement towards full real-time operating capabilities. The described system presents a complete visual tracking system which incorporates target detection, target model acquisition/initialization, and target tracking components into a single, cohesive, probabilistic framework. The presence of a target is detected, using visual cues alone, by recognition of an individual performing a simple pre-defined initialization cue. The physical dimensions of the limb are then learned probabilistically until a statistically stable model estimate has been found. The appearance of the limb is learned in a joint spatial-chromatic domain which incorporates normalized color data with spatial constraints in order to model complex target appearances. The target tracking is performed within a Monte Carlo particle filtering framework which is capable of maintaining multiple state-space hypotheses and propagating ambiguity until less ambiguous data is observed. Multiple image cues are combined within this framework in a principled Bayesian manner. The target detection and model acquisition components are able to perform at near real-time frame rates and are shown to accurately recognize the presence of a target and initialize a target model specific to that user. The target tracking component has demonstrated exceptional resilience to occlusion and temporary target disappearance and contains a natural mechanism for the trade-off between accuracy and speed. At this point, the target tracking component performs at sub real-time frame rates, although several methods to increase the effective operating speed are proposed.

人体肢体的三维视觉跟踪是广泛的计算机视觉应用的基础,包括手势识别、互动娱乐、生物力学分析、车辆驾驶员监控和电子监控。肢体跟踪的问题由于遮挡、深度模糊、旋转模糊以及宽松衣服引起的高水平噪声等问题而变得复杂。我们试图在很大程度上不受约束的环境中,仅使用单眼图像(单个二维视频源)来解决三维肢体跟踪问题。所提出的方法是向完全实时操作能力的转变。所描述的系统提出了一个完整的视觉跟踪系统,它将目标检测、目标模型获取/初始化和目标跟踪组件合并到一个单一的、内聚的、概率框架中。目标的存在是通过单独使用视觉线索,通过执行简单的预定义初始化线索的个体识别来检测的。肢体的物理尺寸,然后学习概率,直到一个统计上稳定的模型估计被发现。肢体的外观是在一个联合的空间-色域中学习的,该域将归一化的颜色数据与空间约束相结合,以模拟复杂的目标外观。目标跟踪在蒙特卡罗粒子滤波框架内进行,该框架能够维持多个状态空间假设并传播模糊性,直到观察到更少的模糊数据。多个图像线索以贝叶斯原则的方式组合在这个框架内。目标检测和模型获取组件能够以接近实时的帧速率执行,并且能够准确地识别目标的存在并初始化特定于该用户的目标模型。目标跟踪组件显示了对遮挡和暂时目标消失的特殊恢复能力,并包含了在精度和速度之间权衡的自然机制。在这一点上,目标跟踪组件执行亚实时帧速率,尽管提出了几种方法来提高有效运行速度。
{"title":"Towards real-time 3-D monocular visual tracking of human limbs in unconstrained environments","authors":"Dave Bullock ,&nbsp;John Zelek","doi":"10.1016/j.rti.2005.06.004","DOIUrl":"10.1016/j.rti.2005.06.004","url":null,"abstract":"<div><p><span>The 3-D visual tracking of human limbs is fundamental to a wide array of computer vision applications<span><span> including gesture recognition, interactive entertainment, </span>biomechanical analysis, vehicle driver monitoring, and electronic surveillance. The problem of limb tracking is complicated by issues of occlusion, depth ambiguities, rotational ambiguities, and high levels of noise caused by loose fitting clothing. We attempt to solve the 3-D limb tracking problem using only monocular imagery (a single 2-D video source) in largely unconstrained environments. The approach presented is a movement towards full real-time operating capabilities. The described system presents a </span></span><em>complete visual tracking system</em><span> which incorporates target detection, target model acquisition/initialization, and target tracking components into a single, cohesive, probabilistic framework. The presence of a target is detected, using visual cues alone, by recognition of an individual performing a simple pre-defined initialization cue. The physical dimensions of the limb are then learned probabilistically until a statistically stable model estimate has been found. The appearance of the limb is learned in a joint spatial-chromatic domain which incorporates normalized color data with spatial constraints in order to model complex target appearances. The target tracking is performed within a Monte Carlo particle filtering framework which is capable of maintaining multiple state-space hypotheses and propagating ambiguity until less ambiguous data is observed. Multiple image cues are combined within this framework in a principled Bayesian manner. The target detection and model acquisition components are able to perform at near real-time frame rates and are shown to accurately recognize the presence of a target and initialize a target model specific to that user. The target tracking component has demonstrated exceptional resilience to occlusion and temporary target disappearance and contains a natural mechanism for the trade-off between accuracy and speed. At this point, the target tracking component performs at sub real-time frame rates, although several methods to increase the effective operating speed are proposed.</span></p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 4","pages":"Pages 323-353"},"PeriodicalIF":0.0,"publicationDate":"2005-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.06.004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85381490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Real-time automated visual inspection system for contaminant removal from wool 用于羊毛污染物去除的实时自动目视检测系统
Pub Date : 2005-08-01 DOI: 10.1016/j.rti.2004.09.003
Liwei Zhang , Abbas Dehghani , Zhenwei Su , Tim King , Barry Greenwood , Martin Levesley

In the textile industry, scoured wool contains different types of foreign materials (contaminants) that need to be separated out before it goes into further processing, so that the textile machines are protected from damage and the quality of the final woollen products is ensured. This paper presents an automated visual inspection (AVI) system for detecting and sorting contaminants from wool in real time. The techniques were first developed in the lab and subsequently applied to a large-scale factory system. The combinative use of image processing algorithms in RGB and HSV colour spaces can segment 96% of contaminant types (minimum size around 4 cm long and 5 mm in diameter) in real-time on the lab test rig. One of the most important aspects of the system is to use the non-linear colour space transformation and merge the threshold algorithm in HSV colour space into the image processing algorithms in RGB colour space to enhance the contaminant identification in real time. The real-time capability of the system is also analysed in detail. The experimental results demonstrate that the factory AVI system could identify and remove the contaminants at a camera speed of around 800 lines/s and the conveyor speed of 20 m/min in real time.

在纺织工业中,经过洗涤的羊毛含有不同类型的异物(污染物),在进入进一步加工之前需要将其分离出来,以保护纺织机器免受损坏,并确保最终羊毛产品的质量。本文介绍了一种用于羊毛中污染物实时检测和分类的自动目视检测系统。这些技术首先在实验室开发,随后应用于大规模的工厂系统。RGB和HSV色彩空间中图像处理算法的组合使用可以在实验室测试台上实时分割96%的污染物类型(最小尺寸约为4厘米长,直径5毫米)。该系统的一个重要方面是利用非线性色彩空间变换,将HSV色彩空间中的阈值算法与RGB色彩空间中的图像处理算法相融合,提高了污染物识别的实时性。对系统的实时性进行了详细的分析。实验结果表明,工厂AVI系统能够以800线/s左右的相机速度和20 m/min的输送速度实时识别和清除污染物。
{"title":"Real-time automated visual inspection system for contaminant removal from wool","authors":"Liwei Zhang ,&nbsp;Abbas Dehghani ,&nbsp;Zhenwei Su ,&nbsp;Tim King ,&nbsp;Barry Greenwood ,&nbsp;Martin Levesley","doi":"10.1016/j.rti.2004.09.003","DOIUrl":"10.1016/j.rti.2004.09.003","url":null,"abstract":"<div><p><span>In the textile industry, scoured wool contains different types of foreign materials (contaminants) that need to be separated out before it goes into further processing, so that the textile machines are protected from damage and the quality of the final woollen products is ensured. This paper presents an automated visual inspection (AVI) system for detecting and sorting contaminants from wool in real time. The techniques were first developed in the lab and subsequently applied to a large-scale factory system. The combinative use of image processing algorithms in RGB and HSV colour spaces can segment 96% of contaminant types (minimum size around 4</span> <!-->cm long and 5<!--> <span>mm in diameter) in real-time on the lab test rig<span><span>. One of the most important aspects of the system is to use the non-linear colour space transformation and merge the threshold algorithm in HSV colour space into the image processing algorithms in RGB colour space to enhance the contaminant identification in real time. The real-time capability of the system is also analysed in detail. The experimental results demonstrate that the factory AVI system could identify and remove the contaminants at a camera speed of around 800 lines/s and the </span>conveyor speed of 20</span></span> <!-->m/min in real time.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 4","pages":"Pages 257-269"},"PeriodicalIF":0.0,"publicationDate":"2005-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2004.09.003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86066682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Improved adaptive vector quantization algorithm using hybrid codebook data structure 基于混合码本数据结构的改进自适应矢量量化算法
Pub Date : 2005-08-01 DOI: 10.1016/j.rti.2005.04.004
Hsiu-Niang Chen , Kuo-Liang Chung

Recently, Shen et al. [IEEE Transactions on Image Processing 2003;12:283–95] presented an efficient adaptive vector quantization (AVQ) algorithm and their proposed AVQ algorithm has a better peak signal-to-noise ratio (PSNR) than that of the previous benchmark AVQ algorithm. This paper presents an improved AVQ algorithm based on the proposed hybrid codebook data structure which consists of three codebooks—the locality codebook, the static codebook, and the history codebook. Due to easy maintenance advantage, the proposed AVQ algorithm leads to a considerable computation-saving effect while preserving the similar PSNR performance as in the previous AVQ algorithm by Shen et al. [IEEE Transactions on Image Processing 2003;12:283–95]. Experimental results show that the proposed AVQ algorithm over the previous AVQ algorithm has about 75% encoding time improvement ratio while both algorithms have the similar PSNR performance.

最近,Shen等[IEEE Transactions on Image Processing 2003; 12:283-95]提出了一种高效的自适应矢量量化(AVQ)算法,该算法的峰值信噪比(PSNR)优于之前的基准AVQ算法。本文在提出的混合码本数据结构的基础上,提出了一种改进的AVQ算法,该混合码本由三个码本组成:局域码本、静态码本和历史码本。由于易于维护的优点,本文提出的AVQ算法在保持与Shen等人先前的AVQ算法相似的PSNR性能的同时,具有相当大的计算节省效果。[IEEE Transactions on Image Processing 2003; 12:283-95]。实验结果表明,本文提出的AVQ算法与之前的AVQ算法相比,编码时间改进率约为75%,两种算法的PSNR性能相近。
{"title":"Improved adaptive vector quantization algorithm using hybrid codebook data structure","authors":"Hsiu-Niang Chen ,&nbsp;Kuo-Liang Chung","doi":"10.1016/j.rti.2005.04.004","DOIUrl":"10.1016/j.rti.2005.04.004","url":null,"abstract":"<div><p>Recently, Shen et al. [IEEE Transactions on Image Processing 2003;12:283–95] presented an efficient adaptive vector quantization<span> (AVQ) algorithm and their proposed AVQ algorithm has a better peak signal-to-noise ratio (PSNR) than that of the previous benchmark AVQ algorithm. This paper presents an improved AVQ algorithm based on the proposed hybrid codebook data structure which consists of three codebooks—the locality codebook, the static codebook, and the history codebook. Due to easy maintenance advantage, the proposed AVQ algorithm leads to a considerable computation-saving effect while preserving the similar PSNR performance as in the previous AVQ algorithm by Shen et al. [IEEE Transactions on Image Processing 2003;12:283–95]. Experimental results show that the proposed AVQ algorithm over the previous AVQ algorithm has about 75% encoding time improvement ratio while both algorithms have the similar PSNR performance.</span></p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 4","pages":"Pages 270-281"},"PeriodicalIF":0.0,"publicationDate":"2005-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.04.004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91497958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Robust real-time transmission of scalable multimedia for heterogeneous client bandwidths 在异构客户端带宽下实现可扩展多媒体的鲁棒实时传输
Pub Date : 2005-08-01 DOI: 10.1016/j.rti.2005.05.001
Longshe Huo , Wen Gao , Qingming Huang

For robust real-time transmission of scalable image and video data over packet-loss networks, a commonly used approach is FEC-based multiple description coding, which protects a scalable bitstream with a fixed number of packets of equal length. In this paper, by considering the problem of applying this approach of multicasting a source to a collection of clients with heterogeneous bandwidths, we propose a novel technique that changes packet length but fixes packet number. In one scenario where different clients access the server via separate links, we study the sensitivity of an optimal solution to the change of packet length, and propose a local search procedure which refines the already computed optimal solutions for other bandwidths. Compared to the scheme that computes an optimal solution individually for each bandwidth, this procedure can achieve comparable performance, however with quite lower time complexity, thus can be used in real-time applications. In another scenario where many clients share a bottleneck link, we present an embedded packetization framework for layered multiple description coding, in which even simple methods with low complexities can achieve good performance. We also propose a local search algorithm to optimize the weighted average performance in case of two layers, and a fast heuristic algorithm which can achieve very good performance tradeoff among all clients in case of more than two layers.

为了在丢包网络上实现可扩展图像和视频数据的鲁棒实时传输,一种常用的方法是基于fec的多重描述编码,它保护具有固定数量等长度数据包的可扩展比特流。在本文中,通过考虑将这种方法应用于具有异构带宽的客户端集合的问题,我们提出了一种改变数据包长度但固定数据包数量的新技术。在一个不同的客户端通过单独的链路访问服务器的场景中,我们研究了最优解对数据包长度变化的敏感性,并提出了一个局部搜索过程,该过程对已经计算的其他带宽的最优解进行了改进。与针对每个带宽单独计算最优解的方案相比,该方法可以获得相当的性能,但时间复杂度较低,因此可以用于实时应用。在另一种客户端共享瓶颈链路的场景中,我们提出了一种用于分层多描述编码的嵌入式分组框架,在该框架中,即使是简单的低复杂度方法也可以获得良好的性能。我们还提出了一种局部搜索算法来优化两层情况下的加权平均性能,以及一种快速启发式算法,该算法在两层以上的情况下可以在所有客户端之间实现很好的性能权衡。
{"title":"Robust real-time transmission of scalable multimedia for heterogeneous client bandwidths","authors":"Longshe Huo ,&nbsp;Wen Gao ,&nbsp;Qingming Huang","doi":"10.1016/j.rti.2005.05.001","DOIUrl":"10.1016/j.rti.2005.05.001","url":null,"abstract":"<div><p><span><span>For robust real-time transmission of scalable image and video data over packet-loss networks, a commonly used approach is FEC-based multiple description coding, which protects a scalable bitstream with a fixed number of packets of equal length. In this paper, by considering the problem of applying this approach of multicasting a source to a collection of clients with heterogeneous bandwidths, we propose a novel technique that changes packet length but fixes packet number. In one scenario where different clients access the server via separate links, we study the sensitivity of an optimal solution to the change of packet length, and propose a local search procedure which refines the already computed optimal solutions for other bandwidths. Compared to the scheme that computes an optimal solution individually for each bandwidth, this procedure can achieve comparable performance, however with quite lower time complexity, thus can be used in real-time applications. In another scenario where many clients share a </span>bottleneck link<span>, we present an embedded packetization framework for layered multiple description coding, in which even simple methods with low complexities can achieve good performance. We also propose a local search algorithm to optimize the weighted average performance in case of two layers, and a fast </span></span>heuristic algorithm which can achieve very good performance tradeoff among all clients in case of more than two layers.</p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 4","pages":"Pages 300-309"},"PeriodicalIF":0.0,"publicationDate":"2005-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.05.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79279244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Robust moving object segmentation on H.264/AVC compressed video using the block-based MRF model 基于块的MRF模型在H.264/AVC压缩视频中的鲁棒运动目标分割
Pub Date : 2005-08-01 DOI: 10.1016/j.rti.2005.04.008
Wei Zeng , Jun Du , Wen Gao , Qingming Huang

Moving object segmentation in compressed domain plays an important role in many real-time applications, e.g. video indexing, video transcoding, video surveillance, etc. Because H.264/AVC is the up-to-date video-coding standard, few literatures have been reported in the area of video analysis on H.264/AVC compressed video. Compared with the former MPEG standard, H.264/AVC employs several new coding tools and provides a different video format. As a consequence, moving object segmentation on H.264/AVC compressed video is a new task and challenging work. In this paper, a robust approach to extract moving objects on H.264/AVC compressed video is proposed. Our algorithm employs a block-based Markov Random Field (MRF) model to segment moving objects from the sparse motion vector field obtained directly from the bitstream. In the proposed method, object tracking is integrated in the uniform MRF model and exploits the object temporal consistency simultaneously. Experiments show that our approach provides the remarkable performance and can extract moving objects efficiently and robustly. The prominent applications of the proposed algorithm are object-based transcoding, fast moving object detection, video analysis on compressed video, etc.

压缩域运动目标分割在视频索引、视频转码、视频监控等实时应用中起着重要的作用。由于H.264/AVC是最新的视频编码标准,对H.264/AVC压缩视频进行视频分析的文献报道很少。与以前的MPEG标准相比,H.264/AVC采用了几种新的编码工具,提供了不同的视频格式。因此,对H.264/AVC压缩视频进行运动目标分割是一项全新的、具有挑战性的工作。针对H.264/AVC压缩视频,提出了一种鲁棒的运动目标提取方法。该算法采用基于块的马尔可夫随机场(MRF)模型,从直接从比特流获得的稀疏运动向量场中分割运动物体。该方法将目标跟踪集成到统一的MRF模型中,同时利用了目标的时间一致性。实验结果表明,该方法能够高效、鲁棒地提取运动目标。该算法的突出应用是基于对象的转码、快速运动目标检测、压缩视频的视频分析等。
{"title":"Robust moving object segmentation on H.264/AVC compressed video using the block-based MRF model","authors":"Wei Zeng ,&nbsp;Jun Du ,&nbsp;Wen Gao ,&nbsp;Qingming Huang","doi":"10.1016/j.rti.2005.04.008","DOIUrl":"10.1016/j.rti.2005.04.008","url":null,"abstract":"<div><p><span>Moving object segmentation in compressed domain plays an important role in many real-time applications, e.g. video indexing, video transcoding<span>, video surveillance, etc. Because H.264/AVC is the up-to-date video-coding standard, few literatures have been reported in the area of video analysis on H.264/AVC compressed video. Compared with the former MPEG standard, H.264/AVC employs several new coding tools and provides a different video format. As a consequence, moving object segmentation on H.264/AVC compressed video is a new task and challenging work. In this paper, a robust approach to extract moving objects on H.264/AVC compressed video is proposed. Our algorithm employs a block-based </span></span>Markov Random Field<span> (MRF) model to segment moving objects from the sparse motion vector<span> field obtained directly from the bitstream. In the proposed method, object tracking is integrated in the uniform MRF model and exploits the object temporal consistency simultaneously. Experiments show that our approach provides the remarkable performance and can extract moving objects efficiently and robustly. The prominent applications of the proposed algorithm are object-based transcoding, fast moving object detection, video analysis on compressed video, etc.</span></span></p></div>","PeriodicalId":101062,"journal":{"name":"Real-Time Imaging","volume":"11 4","pages":"Pages 290-299"},"PeriodicalIF":0.0,"publicationDate":"2005-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.rti.2005.04.008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77726839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 122
期刊
Real-Time Imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1