首页 > 最新文献

Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing最新文献

英文 中文
Identification of mine-shaped objects based on an efficient phase stepped-frequency radar approach 基于高效相位阶跃频率雷达的地雷状目标识别
Helge B. D. Sørensen, K. Jakobsen, Ole Nymann
A computational efficient approach to identify very small mine-shaped plastic objects, e.g. M56 Anti-Personnel (AP) mines buried in the ground, is presented. The size of the objects equals the smallest AP-mines in use today, i.e., the most difficult mines to detect with respect to humanitarian mine clearance. Our approach consists of three stages, the phase stepped-frequency radar method, generation of a quarternary image and template crosscorrelation. The phase stepped-frequency radar method belongs to the class of stepped-frequency radar methods. In a two-dimensional mesh-grid above the ground a radar probe is moved automatically to measure in each grid point a set of reflection coefficients from which phase and amplitude information are extracted. Based on a simple processing of the phase information, quarternary image and template cross-correlation a successful detection of metal- and non-metal mine-shaped objects is possible. Measurements have been performed on loamy soil containing different mine-shaped objects.
本文提出了一种有效的计算方法来识别埋在地下的M56杀伤人员(AP)地雷等非常小的地雷形塑料物体。这些物体的大小相当于目前使用的最小的杀伤地雷,即在人道主义排雷方面最难探测的地雷。我们的方法包括三个阶段,相位步进频率雷达法,四分之一图像的生成和模板互相关。相位步进频率雷达方法属于步进频率雷达方法的范畴。在地面上的二维网格中,雷达探头自动移动,在每个网格点测量一组反射系数,从中提取相位和振幅信息。基于相位信息、四元图像和模板相互关的简单处理,可以成功地检测金属和非金属矿形物体。对含有不同矿形物体的壤土进行了测量。
{"title":"Identification of mine-shaped objects based on an efficient phase stepped-frequency radar approach","authors":"Helge B. D. Sørensen, K. Jakobsen, Ole Nymann","doi":"10.1109/ICIP.1997.632027","DOIUrl":"https://doi.org/10.1109/ICIP.1997.632027","url":null,"abstract":"A computational efficient approach to identify very small mine-shaped plastic objects, e.g. M56 Anti-Personnel (AP) mines buried in the ground, is presented. The size of the objects equals the smallest AP-mines in use today, i.e., the most difficult mines to detect with respect to humanitarian mine clearance. Our approach consists of three stages, the phase stepped-frequency radar method, generation of a quarternary image and template crosscorrelation. The phase stepped-frequency radar method belongs to the class of stepped-frequency radar methods. In a two-dimensional mesh-grid above the ground a radar probe is moved automatically to measure in each grid point a set of reflection coefficients from which phase and amplitude information are extracted. Based on a simple processing of the phase information, quarternary image and template cross-correlation a successful detection of metal- and non-metal mine-shaped objects is possible. Measurements have been performed on loamy soil containing different mine-shaped objects.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"6 1","pages":"142-145 vol.3"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72904125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Digital halftoning as 2-D delta-sigma modulation 数字半调作为2-D δ - σ调制
T. Kite, B. Evans, A. Bovik, T. L. Sculley
The error diffusion algorithm for digital halftoning is equivalent in form to a noise-shaping feedback coder, a class of delta-sigma modulator. The white noise assumption of the quantizer error is known to be false; in fact, the quantizer error is seen to be highly correlated with the input image. To account for this correlation, we use a gain model for the quantizer. This model accurately predicts the edge sharpening and noise shaping caused by all error diffusion schemes. It also permits an extension of error diffusion to oversampled imagery.
数字半调的误差扩散算法在形式上相当于噪声整形反馈编码器,即一类δ - σ调制器。已知量化误差的白噪声假设为假;事实上,量化器误差与输入图像高度相关。为了解释这种相关性,我们对量化器使用增益模型。该模型准确地预测了各种误差扩散方案引起的边缘锐化和噪声整形。它还允许将误差扩散扩展到过采样图像。
{"title":"Digital halftoning as 2-D delta-sigma modulation","authors":"T. Kite, B. Evans, A. Bovik, T. L. Sculley","doi":"10.1109/ICIP.1997.648084","DOIUrl":"https://doi.org/10.1109/ICIP.1997.648084","url":null,"abstract":"The error diffusion algorithm for digital halftoning is equivalent in form to a noise-shaping feedback coder, a class of delta-sigma modulator. The white noise assumption of the quantizer error is known to be false; in fact, the quantizer error is seen to be highly correlated with the input image. To account for this correlation, we use a gain model for the quantizer. This model accurately predicts the edge sharpening and noise shaping caused by all error diffusion schemes. It also permits an extension of error diffusion to oversampled imagery.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"13 1","pages":"799-802 vol.1"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72905724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
Object-scalable mesh-based coding of synthetic and natural image objects 基于对象可伸缩网格的合成和自然图像对象编码
Y. Altunbasak
This paper presents an object-based image coding scheme, where each image object is encoded individually, provided that their boundaries are specified. This allows object-based quality scalability, in addition to mixing lossy and lossless coding modes depending on the requirements of each image object. Furthermore, we propose a new object coding method using 2-D mesh-based image sampling in which the quadrilateral mesh patches are warped into square blocks, and encoded by traditional data/waveform coding methods. Experimental results on test images are provided.
本文提出了一种基于对象的图像编码方案,在指定图像对象边界的情况下,对每个图像对象进行单独编码。除了根据每个图像对象的要求混合有损和无损编码模式外,这还允许基于对象的质量可伸缩性。在此基础上,提出了一种基于二维网格图像采样的目标编码方法,该方法将四边形网格块扭曲成正方形块,并采用传统的数据/波形编码方法进行编码。给出了测试图像的实验结果。
{"title":"Object-scalable mesh-based coding of synthetic and natural image objects","authors":"Y. Altunbasak","doi":"10.1109/ICIP.1997.631996","DOIUrl":"https://doi.org/10.1109/ICIP.1997.631996","url":null,"abstract":"This paper presents an object-based image coding scheme, where each image object is encoded individually, provided that their boundaries are specified. This allows object-based quality scalability, in addition to mixing lossy and lossless coding modes depending on the requirements of each image object. Furthermore, we propose a new object coding method using 2-D mesh-based image sampling in which the quadrilateral mesh patches are warped into square blocks, and encoded by traditional data/waveform coding methods. Experimental results on test images are provided.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"9 1","pages":"94-97 vol.3"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73297008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Signal subspace fusion of uncalibrated sensors with application in SAR, diagnostic medicine and video processing 无标定传感器信号子空间融合及其在SAR、诊断医学和视频处理中的应用
M. Soumekh
This paper addresses the problem of fusing the information content of two uncalibrated sensors. This problem arises in registering images of a scene when it is viewed via two different sensory systems, or detecting change in a scene when it is viewed at two different time points by a sensory system, or via two different sensory systems or observation channels. We are concerned with sensory systems which have not only a relative shift, scaling and rotational calibration error, but also an unknown point spread function (that is time-varying for a single sensor, or different for two sensors). By modeling one image in terms of an unknown linear combination of the other image, its powers and their spatially-transformed (shift, rotation and scaling) versions, a signal subspace processing is developed for fusing uncalibrated sensors. The proposed method is shown to be applicable in moving target detection (MTD) using monopulse synthetic aperture radar (SAR) with uncalibrated radars. Results are shown for video, magnetic resonance images of a human brain, moving target detector monopulse SAR, and registration of SAR images of a target obtained via two different radars or at different coordinates by the same radar for automatic target recognition (ATR).
本文研究了两个未标定传感器信息内容的融合问题。当一个场景通过两个不同的感官系统被观看时,这个问题出现在注册场景图像时,或者当一个场景在两个不同的时间点被一个感官系统观看时,或者通过两个不同的感官系统或观察通道检测场景的变化时。我们关注的传感系统不仅具有相对移位、缩放和旋转校准误差,而且具有未知的点扩展函数(对于单个传感器是时变的,或者对于两个传感器是不同的)。通过根据另一图像的未知线性组合,其功率及其空间变换(移位,旋转和缩放)版本对一幅图像进行建模,开发了一种用于融合未校准传感器的信号子空间处理。结果表明,该方法适用于无标定雷达单脉冲合成孔径雷达的运动目标检测。结果显示了视频,人脑的磁共振图像,运动目标探测器单脉冲SAR,以及通过两个不同的雷达或在不同的坐标由同一雷达获得的目标的SAR图像的配准自动目标识别(ATR)。
{"title":"Signal subspace fusion of uncalibrated sensors with application in SAR, diagnostic medicine and video processing","authors":"M. Soumekh","doi":"10.1109/ICIP.1997.632090","DOIUrl":"https://doi.org/10.1109/ICIP.1997.632090","url":null,"abstract":"This paper addresses the problem of fusing the information content of two uncalibrated sensors. This problem arises in registering images of a scene when it is viewed via two different sensory systems, or detecting change in a scene when it is viewed at two different time points by a sensory system, or via two different sensory systems or observation channels. We are concerned with sensory systems which have not only a relative shift, scaling and rotational calibration error, but also an unknown point spread function (that is time-varying for a single sensor, or different for two sensors). By modeling one image in terms of an unknown linear combination of the other image, its powers and their spatially-transformed (shift, rotation and scaling) versions, a signal subspace processing is developed for fusing uncalibrated sensors. The proposed method is shown to be applicable in moving target detection (MTD) using monopulse synthetic aperture radar (SAR) with uncalibrated radars. Results are shown for video, magnetic resonance images of a human brain, moving target detector monopulse SAR, and registration of SAR images of a target obtained via two different radars or at different coordinates by the same radar for automatic target recognition (ATR).","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"1 1","pages":"280-283 vol.3"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79895929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Check image compression: a comparison of JPEG, wavelet and layered coding methods 检查图像压缩:JPEG,小波和分层编码方法的比较
Jincheng Huang, Yao Wang, E. Wong
An emerging trend in the banking industry is to digitize checks for storage and transmission. An immediate requirement for efficient storage and transmission is check image compression. General purpose compression algorithms such as JPEG and wavelet-based methods produce annoying ringing and blocking artifacts at high compression ratios. A layered approach to check image compression is proposed, based on which a check image is represented in several layers. The first layer describes the foreground map; the second layer specifies the gray levels of foreground pixels; the third layer is a lossy representation of the background image; and the fourth layer describes the error between the original and the reconstructed image based on the first three layers. The layered coding approach produces images of better quality than traditional JPEG and wavelet coding methods, especially in the foreground, i.e., the text and graphics. In addition, this approach allows progressive retrieval or transmission of different image layers.
银行业的一个新兴趋势是将支票数字化,以便存储和传输。检查图像的压缩是有效存储和传输的直接要求。通用的压缩算法,如JPEG和基于小波的方法,在高压缩比下会产生烦人的振铃和阻塞伪影。提出了一种分层的校验图像压缩方法,在此基础上将校验图像分层表示。第一层描述前景图;第二层指定前景像素的灰度级;第三层是背景图像的有损表示;第四层在前三层的基础上描述原始图像与重建图像之间的误差。分层编码方法产生的图像质量优于传统的JPEG和小波编码方法,特别是在前景,即文本和图形。此外,该方法还允许对不同图像层进行渐进式检索或传输。
{"title":"Check image compression: a comparison of JPEG, wavelet and layered coding methods","authors":"Jincheng Huang, Yao Wang, E. Wong","doi":"10.1109/ICIP.1997.632216","DOIUrl":"https://doi.org/10.1109/ICIP.1997.632216","url":null,"abstract":"An emerging trend in the banking industry is to digitize checks for storage and transmission. An immediate requirement for efficient storage and transmission is check image compression. General purpose compression algorithms such as JPEG and wavelet-based methods produce annoying ringing and blocking artifacts at high compression ratios. A layered approach to check image compression is proposed, based on which a check image is represented in several layers. The first layer describes the foreground map; the second layer specifies the gray levels of foreground pixels; the third layer is a lossy representation of the background image; and the fourth layer describes the error between the original and the reconstructed image based on the first three layers. The layered coding approach produces images of better quality than traditional JPEG and wavelet coding methods, especially in the foreground, i.e., the text and graphics. In addition, this approach allows progressive retrieval or transmission of different image layers.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"51 1","pages":"694-697 vol.3"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79988916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A new technique for region of interest tomographic image reconstruction and a comparison of the related algorithms 一种感兴趣区域层析图像重建新技术及相关算法的比较
G. Sankar, Sumana Gupta
It is known that the convolution back-projection (CBP) operator used for reconstruction of images from its 1D projections has a non-local filter that requires global projection data. The exposure time of the object to harmful radiation is thereby increased. It has been proved that the filtering operation done on a chosen wavelet function instead of the projections leads to a localised filter and minimises the number of projections needed to reconstruct the region of interest (ROI) in the image. This concept was extended using two dimensional and one dimensional multiresolution analysis filter banks, in which the filters are combined with the non-local CBP filter to get short length filters. In this paper it is proved that the presence of unwanted information in all previous algorithms is due to aliasing and a new algorithm is proposed for reconstruction without any aliasing. The proposed scheme is implemented using Shepp-Logon phantom head and the performance is compared.
众所周知,用于从其1D投影重建图像的卷积反投影(CBP)算子具有需要全局投影数据的非局部滤波器。因此,物体暴露于有害辐射的时间增加了。已经证明,在选择的小波函数上进行滤波操作,而不是在投影上进行滤波,可以得到局部滤波,并且最小化了重建图像中感兴趣区域(ROI)所需的投影数量。将这一概念扩展到二维和一维多分辨率分析滤波器组,其中滤波器与非局部CBP滤波器结合得到短长度滤波器。本文证明了以往所有算法中不需要的信息都是由于混叠造成的,并提出了一种新的无混叠重建算法。采用Shepp-Logon虚拟头实现了该方案,并对其性能进行了比较。
{"title":"A new technique for region of interest tomographic image reconstruction and a comparison of the related algorithms","authors":"G. Sankar, Sumana Gupta","doi":"10.1109/ICIP.1997.647404","DOIUrl":"https://doi.org/10.1109/ICIP.1997.647404","url":null,"abstract":"It is known that the convolution back-projection (CBP) operator used for reconstruction of images from its 1D projections has a non-local filter that requires global projection data. The exposure time of the object to harmful radiation is thereby increased. It has been proved that the filtering operation done on a chosen wavelet function instead of the projections leads to a localised filter and minimises the number of projections needed to reconstruct the region of interest (ROI) in the image. This concept was extended using two dimensional and one dimensional multiresolution analysis filter banks, in which the filters are combined with the non-local CBP filter to get short length filters. In this paper it is proved that the presence of unwanted information in all previous algorithms is due to aliasing and a new algorithm is proposed for reconstruction without any aliasing. The proposed scheme is implemented using Shepp-Logon phantom head and the performance is compared.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"39 1","pages":"141-144 vol.1"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80311606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized max/median filtering 广义最大/中值滤波
Yuan Zou, W. Dunsmuir
Generalized max/median filtering is defined as an extension of max/median filtering. Properties for these extensions are given. The output distribution of generalized max/median filtering with independent but not identical inputs is derived and applied to the special case of regular max/median filtering thereby providing a new result. Based on these distribution results, it is shown that max/median and generalized max/median filtering can preserve image details.
广义最大/中值滤波是对最大/中值滤波的扩展。给出了这些扩展的属性。导出了输入独立但不相同的广义最大/中值滤波的输出分布,并将其应用于正则最大/中值滤波的特殊情况,从而得到了一个新的结果。基于这些分布结果,表明最大/中值滤波和广义最大/中值滤波可以保留图像细节。
{"title":"Generalized max/median filtering","authors":"Yuan Zou, W. Dunsmuir","doi":"10.1109/ICIP.1997.647798","DOIUrl":"https://doi.org/10.1109/ICIP.1997.647798","url":null,"abstract":"Generalized max/median filtering is defined as an extension of max/median filtering. Properties for these extensions are given. The output distribution of generalized max/median filtering with independent but not identical inputs is derived and applied to the special case of regular max/median filtering thereby providing a new result. Based on these distribution results, it is shown that max/median and generalized max/median filtering can preserve image details.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"28 1","pages":"428-431 vol.1"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80450443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A hierarchical and adaptive deformable model for mouth boundary detection 一种分层自适应变形口腔边界检测模型
A. R. Mirhosseini, Hong Yan, K. Lam, Catherine Chen
An automatic algorithm to extract mouth boundaries in human face images is proposed. The algorithm is based on a hierarchical model adaptation scheme using deformable models. The knowledge about the shape of the object is used to define its initial deformable template. Each mouth boundary curve is initially formed based on three control points whose locations are found through an optimization process using a suitable cost functional. The cost functional captures the essential knowledge about the shape for perceptual organization. Two control points are the mouth corners, which are used as the initial location of the mouth after an approximate mouth window is found based on locating the head boundary. The model is hierarchically improved in the second stage of the algorithm. Each boundary curve is finely tuned using more control points. An old model is adaptively replaced by a new model only if a secondary cost is further reduced. The results show that the model adaptation technique satisfactorily enhances the mouth boundary model in an automated fashion.
提出了一种人脸图像中口部边界的自动提取算法。该算法基于可变形模型的分层模型自适应方案。关于对象形状的知识用于定义其初始可变形模板。每个口边界曲线最初是基于三个控制点形成的,这些控制点的位置是通过使用合适的成本函数优化过程找到的。成本函数捕获了关于感知组织形状的基本知识。两个控制点是嘴角,在定位头部边界的基础上找到近似的嘴巴窗口后,将嘴角作为嘴巴的初始位置。在算法的第二阶段对模型进行分层改进。每个边界曲线都使用更多的控制点进行微调。只有当二次成本进一步降低时,旧模型才会自适应地被新模型所取代。结果表明,模型自适应技术能较好地自动增强口腔边界模型。
{"title":"A hierarchical and adaptive deformable model for mouth boundary detection","authors":"A. R. Mirhosseini, Hong Yan, K. Lam, Catherine Chen","doi":"10.1109/ICIP.1997.638606","DOIUrl":"https://doi.org/10.1109/ICIP.1997.638606","url":null,"abstract":"An automatic algorithm to extract mouth boundaries in human face images is proposed. The algorithm is based on a hierarchical model adaptation scheme using deformable models. The knowledge about the shape of the object is used to define its initial deformable template. Each mouth boundary curve is initially formed based on three control points whose locations are found through an optimization process using a suitable cost functional. The cost functional captures the essential knowledge about the shape for perceptual organization. Two control points are the mouth corners, which are used as the initial location of the mouth after an approximate mouth window is found based on locating the head boundary. The model is hierarchically improved in the second stage of the algorithm. Each boundary curve is finely tuned using more control points. An old model is adaptively replaced by a new model only if a secondary cost is further reduced. The results show that the model adaptation technique satisfactorily enhances the mouth boundary model in an automated fashion.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"15 1","pages":"756-759 vol.2"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82548697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Efficient optimal rate-distortion coding of video sequences under multiple rate constraints 多速率约束下视频序列的高效最优率失真编码
D. Lin, Jiann-Jone Chen
We consider optimal coding of video sequences in the sense of employing an optimal selection of quantization scales under buffer and channel constraints. As a result of the delayed-coding framework, multiple rate constraints emerge, leading to the appearance of multiple Lagrange multipliers in a Lagrangian solution approach. We review the applicable Lagrangian optimization theory developed to-date, and present an efficient new solution algorithm.
我们考虑视频序列的最优编码,即在缓冲区和信道约束下采用最优的量化尺度选择。由于延迟编码框架,出现了多个速率约束,导致拉格朗日解方法中出现多个拉格朗日乘子。本文回顾了拉格朗日优化理论,提出了一种新的求解算法。
{"title":"Efficient optimal rate-distortion coding of video sequences under multiple rate constraints","authors":"D. Lin, Jiann-Jone Chen","doi":"10.1109/ICIP.1997.638665","DOIUrl":"https://doi.org/10.1109/ICIP.1997.638665","url":null,"abstract":"We consider optimal coding of video sequences in the sense of employing an optimal selection of quantization scales under buffer and channel constraints. As a result of the delayed-coding framework, multiple rate constraints emerge, leading to the appearance of multiple Lagrange multipliers in a Lagrangian solution approach. We review the applicable Lagrangian optimization theory developed to-date, and present an efficient new solution algorithm.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"33 1","pages":"29-32 vol.2"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76244543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Multimode video coding for noisy channels 噪声信道的多模视频编码
S. Regunathan, K. Rose
We focus on the design of robust video compression schemes for transmission over noisy channels. A multimode video coding framework is introduced, which enables optimizing the tradeoff between the conflicting objectives of high compression efficiency and error resilience. The starting point is a simple block-based predictive coder, with no motion compensation, for which we develop the robust multimode video compression scheme. The proposed iterative design algorithm for multimode video coders directly minimizes the overall rate-distortion cost. We show that several conventional joint source-channel coding mechanisms can be incorporated within a multimode scheme to further enhance the video coder performance. Simulation results of compressing benchmark video sequences for transmission over noisy channel conditions are presented. They demonstrate that multimode coders outperform conventional fixed length approaches and can achieve substantial gains of more than 6 dB in PSNR of the reconstructed picture.
我们专注于设计在噪声信道上传输的鲁棒视频压缩方案。介绍了一种多模视频编码框架,该框架能够在高压缩效率和高容错性的冲突目标之间进行优化权衡。本文从一个简单的无运动补偿的基于块的预测编码器开始,开发了鲁棒的多模视频压缩方案。所提出的多模视频编码器迭代设计算法直接最小化了总体的率失真代价。我们展示了几种传统的联合源信道编码机制可以在多模方案中合并,以进一步提高视频编码器的性能。给出了压缩基准视频序列在噪声信道条件下传输的仿真结果。他们证明了多模编码器优于传统的固定长度方法,并且可以在重建图像的PSNR中获得超过6 dB的大幅增益。
{"title":"Multimode video coding for noisy channels","authors":"S. Regunathan, K. Rose","doi":"10.1109/ICIP.1997.632148","DOIUrl":"https://doi.org/10.1109/ICIP.1997.632148","url":null,"abstract":"We focus on the design of robust video compression schemes for transmission over noisy channels. A multimode video coding framework is introduced, which enables optimizing the tradeoff between the conflicting objectives of high compression efficiency and error resilience. The starting point is a simple block-based predictive coder, with no motion compensation, for which we develop the robust multimode video compression scheme. The proposed iterative design algorithm for multimode video coders directly minimizes the overall rate-distortion cost. We show that several conventional joint source-channel coding mechanisms can be incorporated within a multimode scheme to further enhance the video coder performance. Simulation results of compressing benchmark video sequences for transmission over noisy channel conditions are presented. They demonstrate that multimode coders outperform conventional fixed length approaches and can achieve substantial gains of more than 6 dB in PSNR of the reconstructed picture.","PeriodicalId":92344,"journal":{"name":"Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing","volume":"56 1","pages":"432-435 vol.3"},"PeriodicalIF":0.0,"publicationDate":"1997-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86826035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Computer analysis of images and patterns : proceedings of the ... International Conference on Automatic Image Processing. International Conference on Automatic Image Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1