首页 > 最新文献

IET Image Process.最新文献

英文 中文
An adaptive level set method based on joint estimation dealing with intensity inhomogeneity 基于联合估计的自适应水平集方法处理强度非均匀性
Pub Date : 2020-12-30 DOI: 10.1049/ipr2.12115
Jiang Zhu, Yan Zeng, Jianqi Li, Shujuan Tian, Haolin Liu
Automatic object segmentation has been a challenging task due to intensity inhomogeneity. The traditional way is to eliminate the intensity inhomogeneity, which causes the object to lose useful intensity information. The authors propose an adaptive level set method for the segmentation of intensity inhomogeneous images. Firstly, global and local features are utilised to collaboratively estimate the image, which devotes to compensating for intensity inhomogeneity. The local estimation retains detailed spatial information, and the global estimation mainly contains the regional information of the partitioned object. Then, during the construction of the energy functional, joint estimation is introduced to create the external energy. To acquire the precise location of the boundary, a weighting factor indicated by the gradient is introduced into the internal energy. Finally, after the numerical calculation of the energy functional by additive operator splitting algorithm, this method achieves the desired performance in terms of accuracy and robustness. Experimental results verify this method outperforms the comparative methods and can be applied to many real-world sce-narios.
由于强度的不均匀性,自动目标分割一直是一项具有挑战性的任务。传统的方法是消除强度不均匀性,这种不均匀性会导致物体失去有用的强度信息。提出了一种用于灰度非均匀图像分割的自适应水平集方法。首先,利用全局特征和局部特征对图像进行协同估计,对图像强度的非均匀性进行补偿;局部估计保留了详细的空间信息,全局估计主要包含了分割对象的区域信息。然后,在构造能量函数的过程中,引入联合估计来产生外部能量。为了获得边界的精确位置,在内能中引入了由梯度表示的加权因子。最后,通过加性算子分裂算法对能量泛函进行数值计算,该方法在精度和鲁棒性方面都达到了预期的效果。实验结果表明,该方法优于其他比较方法,可以应用于许多现实场景。
{"title":"An adaptive level set method based on joint estimation dealing with intensity inhomogeneity","authors":"Jiang Zhu, Yan Zeng, Jianqi Li, Shujuan Tian, Haolin Liu","doi":"10.1049/ipr2.12115","DOIUrl":"https://doi.org/10.1049/ipr2.12115","url":null,"abstract":"Automatic object segmentation has been a challenging task due to intensity inhomogeneity. The traditional way is to eliminate the intensity inhomogeneity, which causes the object to lose useful intensity information. The authors propose an adaptive level set method for the segmentation of intensity inhomogeneous images. Firstly, global and local features are utilised to collaboratively estimate the image, which devotes to compensating for intensity inhomogeneity. The local estimation retains detailed spatial information, and the global estimation mainly contains the regional information of the partitioned object. Then, during the construction of the energy functional, joint estimation is introduced to create the external energy. To acquire the precise location of the boundary, a weighting factor indicated by the gradient is introduced into the internal energy. Finally, after the numerical calculation of the energy functional by additive operator splitting algorithm, this method achieves the desired performance in terms of accuracy and robustness. Experimental results verify this method outperforms the comparative methods and can be applied to many real-world sce-narios.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"29 1","pages":"1424-1438"},"PeriodicalIF":0.0,"publicationDate":"2020-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78649737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A level-set method for inhomogeneous image segmentation with application to breast thermography images 非均匀图像分割的水平集方法及其在乳腺热成像图像中的应用
Pub Date : 2020-12-26 DOI: 10.1049/ipr2.12116
Asma Shamsi Koshki, M. Ahmadzadeh, M. Zekri, S. Sadri, E. Mahmoudzadeh
Various level-set methods have been suggested for segmenting images with intensity inhomogeneity as local region-based models. The challenge in these methods is segmenting the inhomogeneous images with smooth edges. These methods cannot properly segment regions with smooth edges in inhomogeneous images. This paper presents a new local region-based active contour model called local self-weighted active contour model. In the proposed method, a novel different weighting technique is applied. In this model, the weight of each neighbour pixel in the energy function is set by a function of its intensity and not its geometrical distance regarding the central pixel as previous methods. Considering this, the presented approach can segment regions with smooth edges in the presence of inhomogeneity as breast thermography images. The experimental results of applying the model on heterogeneous images containing synthetic images and medical images, especially breast thermography images, are compared with well-known local level-set methods which show the perfect capability of the model. The segmentation results were evaluated using the F-score, accuracy, precision and recall criteria. The results show values of 0.8, 0.62, 0.73 and 0.82 for the average accuracy, F-score, precision and recall criteria on the segmentation of breast thermography images, respectively.
对于灰度不均匀的图像,人们提出了不同的水平集分割方法作为基于局部区域的模型。这些方法的难点在于对边缘光滑的非均匀图像进行分割。这些方法不能很好地分割非均匀图像中边缘光滑的区域。提出了一种新的基于局部区域的活动轮廓线模型——局部自加权活动轮廓线模型。在该方法中,采用了一种新的不同加权技术。在该模型中,能量函数中每个相邻像素的权重由其强度函数设置,而不是像以前的方法那样由其与中心像素的几何距离函数设置。考虑到这一点,本文提出的方法可以在存在不均匀性的情况下分割边缘光滑的区域作为乳房热成像图像。将该模型应用于包含合成图像和医学图像的异构图像,特别是乳房热成像图像的实验结果与已知的局部水平集方法进行了比较,表明了该模型的良好性能。采用f值、准确率、精密度和召回率标准对分割结果进行评价。结果表明,乳腺热成像图像分割的平均准确率为0.8,F-score为0.62,精密度为0.73,召回率为0.82。
{"title":"A level-set method for inhomogeneous image segmentation with application to breast thermography images","authors":"Asma Shamsi Koshki, M. Ahmadzadeh, M. Zekri, S. Sadri, E. Mahmoudzadeh","doi":"10.1049/ipr2.12116","DOIUrl":"https://doi.org/10.1049/ipr2.12116","url":null,"abstract":"Various level-set methods have been suggested for segmenting images with intensity inhomogeneity as local region-based models. The challenge in these methods is segmenting the inhomogeneous images with smooth edges. These methods cannot properly segment regions with smooth edges in inhomogeneous images. This paper presents a new local region-based active contour model called local self-weighted active contour model. In the proposed method, a novel different weighting technique is applied. In this model, the weight of each neighbour pixel in the energy function is set by a function of its intensity and not its geometrical distance regarding the central pixel as previous methods. Considering this, the presented approach can segment regions with smooth edges in the presence of inhomogeneity as breast thermography images. The experimental results of applying the model on heterogeneous images containing synthetic images and medical images, especially breast thermography images, are compared with well-known local level-set methods which show the perfect capability of the model. The segmentation results were evaluated using the F-score, accuracy, precision and recall criteria. The results show values of 0.8, 0.62, 0.73 and 0.82 for the average accuracy, F-score, precision and recall criteria on the segmentation of breast thermography images, respectively.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"36 1","pages":"1439-1458"},"PeriodicalIF":0.0,"publicationDate":"2020-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91236276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fusion-based simultaneous estimation of reflectance and illumination for low-light image enhancement 基于融合的低光图像反射率和照度同时估计
Pub Date : 2020-12-24 DOI: 10.1049/ipr2.12114
A. Parihar, Kavinder Singh, Hrithik Rohilla, G. Asnani
Low-light image enhancement is a challenging field in image processing. Retinex-based methods perform well for low-light images. However, reflectance and illumination estimation is an ill-posed problem. This paper presents a new framework for the simultaneous estimation of reflectance and illumination for low-light image enhancement. The algorithm estimates multiple instances of illumination and reflectance and blends them to estimate the final components. The proposed approach uses multi-scale fusion for illumination estimation and naive fusion for reflectance estimation. Extensive experimentation and analysis with a large set of low-light images validates the performance of the proposed approach. The comparison shows the superiority of the proposed approach over most of the existing low-light image enhancement methods. The proposed method provides colour constancy in low-light image enhancement and preserves the naturalness of the image.
弱光图像增强是图像处理中的一个具有挑战性的领域。基于视黄醇的方法在低光图像中表现良好。然而,反射率和照度估计是一个不适定问题。本文提出了一种同时估计低光图像反射率和照度的新框架。该算法估计光照和反射率的多个实例,并将它们混合以估计最终的分量。该方法采用多尺度融合进行照度估计,朴素融合进行反射率估计。大量低光图像的实验和分析验证了该方法的有效性。对比结果表明,该方法优于现有的大多数弱光图像增强方法。该方法在保证弱光图像增强的色彩稳定性的同时,保留了图像的自然度。
{"title":"Fusion-based simultaneous estimation of reflectance and illumination for low-light image enhancement","authors":"A. Parihar, Kavinder Singh, Hrithik Rohilla, G. Asnani","doi":"10.1049/ipr2.12114","DOIUrl":"https://doi.org/10.1049/ipr2.12114","url":null,"abstract":"Low-light image enhancement is a challenging field in image processing. Retinex-based methods perform well for low-light images. However, reflectance and illumination estimation is an ill-posed problem. This paper presents a new framework for the simultaneous estimation of reflectance and illumination for low-light image enhancement. The algorithm estimates multiple instances of illumination and reflectance and blends them to estimate the final components. The proposed approach uses multi-scale fusion for illumination estimation and naive fusion for reflectance estimation. Extensive experimentation and analysis with a large set of low-light images validates the performance of the proposed approach. The comparison shows the superiority of the proposed approach over most of the existing low-light image enhancement methods. The proposed method provides colour constancy in low-light image enhancement and preserves the naturalness of the image.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"54 1","pages":"1410-1423"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77613794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
A new recognition algorithm for high-voltage lines based on improved LSD and convolutional neural networks 基于改进LSD和卷积神经网络的高压线路识别新算法
Pub Date : 2020-12-07 DOI: 10.1049/ipr2.12031
Yanhong Luo, Xue Yu, Dongsheng Yang
With the development of high-voltage transmission and artificial intelligence technology, unmanned line inspection has become the inevitable trend of current electric power inspection. A new recognition algorithm for high-voltage lines is proposed based on colour (Red, Green, Blue) RGB image to support the unmanned line inspection. Firstly, in order to solve the problem of missing weak edges in image edge detection, an improved Canny algorithm is proposed. Fourier transform Gaussian filter is introduced to enhance the high-frequency signal of the image, which makes the extracted edge information more complete. At the same time, an improved line segment detector (LSD) algorithm is developed to extract the high-voltage line. The complementary edge information of the three channels of the colour RGB image is analyzed, and the calculation formula of the horizontal line angle is improved, which greatly reduces the possibility of false detection and missed detection in the high-voltage line extraction. In addition, the convolution neural network (CNN) is used to accurately recognize the extracted high-voltage lines, which reduces the interference of non–high-voltage lines. Simulation results show that the proposed algorithm has high
随着高压输电和人工智能技术的发展,无人线路巡检已成为当前电力巡检的必然趋势。提出了一种基于彩色(红、绿、蓝)RGB图像的高压线路识别算法,支持无人值守检测。首先,为了解决图像边缘检测中弱边缘缺失的问题,提出了一种改进的Canny算法。引入傅里叶变换高斯滤波器增强图像的高频信号,使提取的边缘信息更加完整。同时,提出了一种改进的线段检测器(LSD)算法来提取高压线路。分析了彩色RGB图像三通道的互补边缘信息,改进了水平线角度的计算公式,大大降低了高压线路提取中误检和漏检的可能性。此外,利用卷积神经网络(CNN)对提取的高压线进行准确识别,减少了非高压线的干扰。仿真结果表明,该算法具有较高的抗干扰性
{"title":"A new recognition algorithm for high-voltage lines based on improved LSD and convolutional neural networks","authors":"Yanhong Luo, Xue Yu, Dongsheng Yang","doi":"10.1049/ipr2.12031","DOIUrl":"https://doi.org/10.1049/ipr2.12031","url":null,"abstract":"With the development of high-voltage transmission and artificial intelligence technology, unmanned line inspection has become the inevitable trend of current electric power inspection. A new recognition algorithm for high-voltage lines is proposed based on colour (Red, Green, Blue) RGB image to support the unmanned line inspection. Firstly, in order to solve the problem of missing weak edges in image edge detection, an improved Canny algorithm is proposed. Fourier transform Gaussian filter is introduced to enhance the high-frequency signal of the image, which makes the extracted edge information more complete. At the same time, an improved line segment detector (LSD) algorithm is developed to extract the high-voltage line. The complementary edge information of the three channels of the colour RGB image is analyzed, and the calculation formula of the horizontal line angle is improved, which greatly reduces the possibility of false detection and missed detection in the high-voltage line extraction. In addition, the convolution neural network (CNN) is used to accurately recognize the extracted high-voltage lines, which reduces the interference of non–high-voltage lines. Simulation results show that the proposed algorithm has high","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"40 1","pages":"260-268"},"PeriodicalIF":0.0,"publicationDate":"2020-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77410191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Automatic Production of Synthetic Labeled OCT Images Using Active Shape Model 利用主动形状模型自动生成合成标记OCT图像
Pub Date : 2020-09-08 DOI: 10.1101/2020.09.05.20181917
H. Danesh, K. Maghooli, R. Kafieh, A. Dehghani
The challenge of limited labeled data in the field of medical imaging and the need for large number of labeled data for training machine learning algorithms, and to measure the performance of image processing algorithms increases the demand to use synthetic images. The purpose of this paper is to construct synthetic and labeled Optical Coherence Tomography (OCT) data to solve the problems like having access to the accurate labeled data and evaluating the processing algorithms. In this study, a modified active shape model is used which considers the anatomical features of available images such as number and thickness of the layers and their associated brightness, the retinal blood vessels, and shadow information with wise consideration of speckle noise. The algorithm is also able to provide different datasets with varying noise level. The validity of our method for synthesis of retinal images is measured by two methods (qualitative assessment and quantitative analysis).
医学成像领域中有限的标记数据的挑战,以及对大量标记数据用于训练机器学习算法和测量图像处理算法性能的需求,增加了对使用合成图像的需求。本文的目的是构建合成和标记的光学相干层析成像(OCT)数据,以解决如何获得准确的标记数据和评估处理算法等问题。在本研究中,使用了一种改进的活动形状模型,该模型考虑了可用图像的解剖特征,如层的数量和厚度及其相关亮度,视网膜血管和阴影信息,并明智地考虑了散斑噪声。该算法还能够提供不同噪声水平的不同数据集。通过定性评价和定量分析两种方法对视网膜图像合成方法的有效性进行了验证。
{"title":"Automatic Production of Synthetic Labeled OCT Images Using Active Shape Model","authors":"H. Danesh, K. Maghooli, R. Kafieh, A. Dehghani","doi":"10.1101/2020.09.05.20181917","DOIUrl":"https://doi.org/10.1101/2020.09.05.20181917","url":null,"abstract":"The challenge of limited labeled data in the field of medical imaging and the need for large number of labeled data for training machine learning algorithms, and to measure the performance of image processing algorithms increases the demand to use synthetic images. The purpose of this paper is to construct synthetic and labeled Optical Coherence Tomography (OCT) data to solve the problems like having access to the accurate labeled data and evaluating the processing algorithms. In this study, a modified active shape model is used which considers the anatomical features of available images such as number and thickness of the layers and their associated brightness, the retinal blood vessels, and shadow information with wise consideration of speckle noise. The algorithm is also able to provide different datasets with varying noise level. The validity of our method for synthesis of retinal images is measured by two methods (qualitative assessment and quantitative analysis).","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"14 1","pages":"3812-3818"},"PeriodicalIF":0.0,"publicationDate":"2020-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87347791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Efficient and robust segmentation and correction model for medical images 高效鲁棒的医学图像分割与校正模型
Pub Date : 2018-10-17 DOI: 10.1049/IET-IPR.2018.5164
Yunyun Yang, W. Jia
Accurate segmentation of medical images plays a very important role in clinical diagnosis so that the segmentation technology for medical images attracts more and more attention. However, most medical images usually suffer from severe intensity inhomogeneity and make accurate segmentation difficult. In this study, the authors propose an efficient and robust active contour model for simultaneous image segmentation and correction. The proposed model not only can accurately segment images with severe intensity inhomogeneity and serious noise but also can eliminate the intensity varying information to get the homogeneous correction images. They first present the level set formulation of the two-phase model, which is then extended to the multi-phase formulation. The split Bregman method is applied to efficiently minimise the proposed energy functionals. The proposed model is tested with lots of synthetic images and medical images with promising results. Experimental results demonstrate that the proposed model can accurately segment and correct the inhomogeneous images with serious noise. Quantitative comparison results of the proposed model and other models illustrate the proposed model is more accurate and more efficient. What's more, the proposed model not only is insensitive to the initial contour, but also is robust to the noise.
医学图像的准确分割在临床诊断中起着非常重要的作用,因此医学图像的分割技术越来越受到人们的关注。然而,大多数医学图像通常存在严重的强度不均匀性,难以准确分割。在这项研究中,作者提出了一种高效、鲁棒的活动轮廓模型,用于同时进行图像分割和校正。该模型不仅能准确分割出具有严重强度不均匀性和严重噪声的图像,还能消除强度变化信息,得到均匀的校正图像。他们首先提出了两阶段模型的水平集公式,然后将其扩展到多阶段公式。采用分裂布雷格曼方法有效地最小化所提出的能量泛函。用大量的合成图像和医学图像对该模型进行了测试,取得了良好的效果。实验结果表明,该模型能够准确地分割和校正带有严重噪声的非均匀图像。与其他模型的定量比较结果表明,本文提出的模型更准确,效率更高。该模型不仅对初始轮廓不敏感,而且对噪声具有较强的鲁棒性。
{"title":"Efficient and robust segmentation and correction model for medical images","authors":"Yunyun Yang, W. Jia","doi":"10.1049/IET-IPR.2018.5164","DOIUrl":"https://doi.org/10.1049/IET-IPR.2018.5164","url":null,"abstract":"Accurate segmentation of medical images plays a very important role in clinical diagnosis so that the segmentation technology for medical images attracts more and more attention. However, most medical images usually suffer from severe intensity inhomogeneity and make accurate segmentation difficult. In this study, the authors propose an efficient and robust active contour model for simultaneous image segmentation and correction. The proposed model not only can accurately segment images with severe intensity inhomogeneity and serious noise but also can eliminate the intensity varying information to get the homogeneous correction images. They first present the level set formulation of the two-phase model, which is then extended to the multi-phase formulation. The split Bregman method is applied to efficiently minimise the proposed energy functionals. The proposed model is tested with lots of synthetic images and medical images with promising results. Experimental results demonstrate that the proposed model can accurately segment and correct the inhomogeneous images with serious noise. Quantitative comparison results of the proposed model and other models illustrate the proposed model is more accurate and more efficient. What's more, the proposed model not only is insensitive to the initial contour, but also is robust to the noise.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"13 1","pages":"2245-2254"},"PeriodicalIF":0.0,"publicationDate":"2018-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85581787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Adaptive deblocking filter for transform domain Wyner-Ziv video coding 变换域Wyner-Ziv视频编码的自适应块化滤波器
Pub Date : 2009-12-04 DOI: 10.1049/IET-IPR.2008.0201
R. Martins, Catarina Brites, J. Ascenso, F. Pereira
Wyner-Ziv (WZ) video coding is a particular case of distributed video coding, the recent video coding paradigm based on the Slepian-Wolf and Wyner-Ziv theorems that exploits the source correlation at the decoder and not at the encoder as in predictive video coding. Although many improvements have been done over the last years, the performance of the state-of-the-art WZ video codecs still did not reach the performance of state-of-the-art predictive video codecs, especially for high and complex motion video content. This is also true in terms of subjective image quality mainly because of a considerable amount of blocking artefacts present in the decoded WZ video frames. This paper proposes an adaptive deblocking filter to improve both the subjective and objective qualities of the WZ frames in a transform domain WZ video codec. The proposed filter is an adaptation of the advanced deblocking filter defined in the H.264/AVC (advanced video coding) standard to a WZ video codec. The results obtained confirm the subjective quality improvement and objective quality gains that can go up to 0.63-dB in the overall for sequences with high motion content when large group of pictures are used.
wner - ziv (WZ)视频编码是分布式视频编码的一种特殊情况,这是一种基于Slepian-Wolf定理和wner - ziv定理的最新视频编码范式,它利用解码器的源相关性,而不是像预测视频编码那样利用编码器的源相关性。尽管在过去几年中已经做了许多改进,但最先进的WZ视频编解码器的性能仍然没有达到最先进的预测视频编解码器的性能,特别是对于高和复杂的运动视频内容。在主观图像质量方面也是如此,这主要是因为在解码的WZ视频帧中存在相当数量的阻塞伪影。为了提高变换域WZ视频编解码器WZ帧的主客观质量,提出了一种自适应分块滤波器。该滤波器将H.264/AVC(高级视频编码)标准中定义的高级块化滤波器改编为WZ视频编解码器。实验结果表明,当使用大组图像时,高运动内容序列的主观质量得到改善,客观质量总体上可提高0.63 db。
{"title":"Adaptive deblocking filter for transform domain Wyner-Ziv video coding","authors":"R. Martins, Catarina Brites, J. Ascenso, F. Pereira","doi":"10.1049/IET-IPR.2008.0201","DOIUrl":"https://doi.org/10.1049/IET-IPR.2008.0201","url":null,"abstract":"Wyner-Ziv (WZ) video coding is a particular case of distributed video coding, the recent video coding paradigm based on the Slepian-Wolf and Wyner-Ziv theorems that exploits the source correlation at the decoder and not at the encoder as in predictive video coding. Although many improvements have been done over the last years, the performance of the state-of-the-art WZ video codecs still did not reach the performance of state-of-the-art predictive video codecs, especially for high and complex motion video content. This is also true in terms of subjective image quality mainly because of a considerable amount of blocking artefacts present in the decoded WZ video frames. This paper proposes an adaptive deblocking filter to improve both the subjective and objective qualities of the WZ frames in a transform domain WZ video codec. The proposed filter is an adaptation of the advanced deblocking filter defined in the H.264/AVC (advanced video coding) standard to a WZ video codec. The results obtained confirm the subjective quality improvement and objective quality gains that can go up to 0.63-dB in the overall for sequences with high motion content when large group of pictures are used.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"13 1","pages":"315-328"},"PeriodicalIF":0.0,"publicationDate":"2009-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84968513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Enhanced reconstruction algorithm for unidirectional distributed video coding 单向分布式视频编码的增强重构算法
Pub Date : 2009-12-04 DOI: 10.1049/IET-IPR.2008.0195
W. Weerakkody, W. Fernando, A. Kondoz
Distributed video coding (DVC) is an emerging video coding technology that utilises the distributed source coding principles to build very low cost video encoders, yet with remarkable error resilience. In the most common DVC framework, the reconstruction function plays a vital role that has a direct impact on the output video quality. In this study, a novel algorithm is proposed for the reconstruction function, particularly focusing on a unidirectional DVC architecture. The proposed technique exploits the variations of the bit error rate of the Wyner-Ziv decoded bit stream and the assumed noise model in the side information stream. The simulation results show that the proposed algorithm yields a significant improvement of the objective and subjective video quality at no additional bit rate cost.
分布式视频编码(DVC)是一种新兴的视频编码技术,它利用分布式源编码原理构建非常低成本的视频编码器,同时具有显著的抗错误性。在最常见的DVC框架中,重构功能起着至关重要的作用,它直接影响到输出的视频质量。在本研究中,提出了一种新的重建函数算法,特别关注单向DVC架构。该技术利用了Wyner-Ziv译码码流误码率的变化和侧信息流中假定的噪声模型。仿真结果表明,该算法在不增加码率成本的情况下,显著提高了主客观视频质量。
{"title":"Enhanced reconstruction algorithm for unidirectional distributed video coding","authors":"W. Weerakkody, W. Fernando, A. Kondoz","doi":"10.1049/IET-IPR.2008.0195","DOIUrl":"https://doi.org/10.1049/IET-IPR.2008.0195","url":null,"abstract":"Distributed video coding (DVC) is an emerging video coding technology that utilises the distributed source coding principles to build very low cost video encoders, yet with remarkable error resilience. In the most common DVC framework, the reconstruction function plays a vital role that has a direct impact on the output video quality. In this study, a novel algorithm is proposed for the reconstruction function, particularly focusing on a unidirectional DVC architecture. The proposed technique exploits the variations of the bit error rate of the Wyner-Ziv decoded bit stream and the assumed noise model in the side information stream. The simulation results show that the proposed algorithm yields a significant improvement of the objective and subjective video quality at no additional bit rate cost.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"77 1","pages":"329-334"},"PeriodicalIF":0.0,"publicationDate":"2009-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78837909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Motion vector refinement in a Wyner-Ziv to H.264 transcoder for mobile telephony 移动电话用Wyner-Ziv转H.264转码器的运动矢量细化
Pub Date : 2009-12-04 DOI: 10.1049/IET-IPR.2008.0202
José Luis Martínez, G. Fernández-Escribano, H. Kalva, P. Cuenca
The authors develop a decoder/encoder system (transcoder) to solve the consumption constraint in the communications between end-user devices, when a new Wyner-Ziv (WZ)/H.264 framework is defined for being used in mobile-to-mobile environments. This approach is based on leaving to the devices only WZ video encoding and traditional video decoding; the lowest complexity algorithms in both paradigms. The system shifts the burden of complexity to the network, where an improved transcoder that reuses information between both paradigms is allocated. The WZ decoding motion vectors are used to reduce the H.264 motion estimation process. The proposed transcoder offers a complexity reduction up to 60- on average, without any rate distortion drop.
作者开发了一种解码器/编码器系统(转码器),以解决终端用户设备之间通信中的消耗限制,当一个新的Wyner-Ziv (WZ)/H。264框架是为在移动到移动环境中使用而定义的。该方法的基础是将WZ视频编码和传统视频解码只留给设备;两种范式中复杂度最低的算法。该系统将复杂性的负担转移到网络上,在网络上分配了一种改进的转码器,可以在两种范式之间重用信息。使用WZ解码运动向量来减少H.264的运动估计过程。所提出的转码器提供的复杂性降低高达60-平均,没有任何速率失真下降。
{"title":"Motion vector refinement in a Wyner-Ziv to H.264 transcoder for mobile telephony","authors":"José Luis Martínez, G. Fernández-Escribano, H. Kalva, P. Cuenca","doi":"10.1049/IET-IPR.2008.0202","DOIUrl":"https://doi.org/10.1049/IET-IPR.2008.0202","url":null,"abstract":"The authors develop a decoder/encoder system (transcoder) to solve the consumption constraint in the communications between end-user devices, when a new Wyner-Ziv (WZ)/H.264 framework is defined for being used in mobile-to-mobile environments. This approach is based on leaving to the devices only WZ video encoding and traditional video decoding; the lowest complexity algorithms in both paradigms. The system shifts the burden of complexity to the network, where an improved transcoder that reuses information between both paradigms is allocated. The WZ decoding motion vectors are used to reduce the H.264 motion estimation process. The proposed transcoder offers a complexity reduction up to 60- on average, without any rate distortion drop.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"94 1","pages":"335-339"},"PeriodicalIF":0.0,"publicationDate":"2009-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73815364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Transform-domain distributed video coding with rate-distortion-based adaptive quantisation 基于率失真自适应量化的变换域分布式视频编码
Pub Date : 2009-12-04 DOI: 10.1049/IET-IPR.2008.0207
Wei-Jung Chien, Lina Karam
This study presents a transform-domain distributed video coding (DVC) system with a rate-distortion (R-D)-based Adaptive QuanTisation (AQT) scheme. In the proposed system, the transform-domain Wyner-Ziv frame is divided into partitions and is adaptively quantised based on estimated local R-D characteristics for each partition. The R-D estimation is performed based on a correlation model between the original source information and the side information and can be applied at the decoder without adding complexity to the encoder. Coding results and comparisons with existing DVC schemes and with H.264/AVC interframe and intraframe coding are presented to illustrate the performance of the proposed system.
提出了一种基于率失真(R-D)自适应量化(AQT)的变换域分布式视频编码(DVC)系统。在该系统中,变换域Wyner-Ziv框架被划分为多个分区,并根据每个分区估计的局部R-D特征自适应量化。R-D估计是基于原始源信息和侧信息之间的相关模型进行的,可以应用于解码器,而不会增加编码器的复杂性。最后给出了编码结果,并与现有的DVC编码和H.264/AVC帧间和帧内编码进行了比较,以说明所提系统的性能。
{"title":"Transform-domain distributed video coding with rate-distortion-based adaptive quantisation","authors":"Wei-Jung Chien, Lina Karam","doi":"10.1049/IET-IPR.2008.0207","DOIUrl":"https://doi.org/10.1049/IET-IPR.2008.0207","url":null,"abstract":"This study presents a transform-domain distributed video coding (DVC) system with a rate-distortion (R-D)-based Adaptive QuanTisation (AQT) scheme. In the proposed system, the transform-domain Wyner-Ziv frame is divided into partitions and is adaptively quantised based on estimated local R-D characteristics for each partition. The R-D estimation is performed based on a correlation model between the original source information and the side information and can be applied at the decoder without adding complexity to the encoder. Coding results and comparisons with existing DVC schemes and with H.264/AVC interframe and intraframe coding are presented to illustrate the performance of the proposed system.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"16 1","pages":"340-354"},"PeriodicalIF":0.0,"publicationDate":"2009-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83148775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
IET Image Process.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1