首页 > 最新文献

2016 IEEE International Conference on Image Processing (ICIP)最新文献

英文 中文
Adaptive residual mapping for an efficient extension layer coding in two-layer HDR video coding 自适应残差映射用于两层HDR视频编码中高效的扩展层编码
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7532587
J. Mir, Dumidu S. Talagala, H. K. Arachchi, W. Fernando
In the absence of a commercial High Dynamic Range (HDR) distribution pipeline, two-layer backward-compatible HDR video coding is a viable solution for the imminent transition from Low Dynamic Range (LDR) to HDR content transmission. However, the performance of a two-layer coding solution is governed by the extension layer coding performance. In this paper, we propose an improved two-layer backward-compatible HDR video coding solution based on an adaptive residual mapping for the extension layer, keeping in view the performance of High Efficiency Video Coding (HEVC) being used to code this information. The proposed solution outperforms the reference method achieving averaged PU-PSNR improvements of up to 5.05 dB. The proposed method also shows potential of achieving the same HDR quality as the single layer coding solution with a minimum bitrate overhead and acceptable LDR quality in the base layer.
在缺乏商用高动态范围(HDR)分发管道的情况下,两层向后兼容的HDR视频编码是即将从低动态范围(LDR)向HDR内容传输过渡的可行解决方案。然而,两层编码解决方案的性能受扩展层编码性能的支配。在本文中,我们提出了一种改进的两层向后兼容HDR视频编码方案,该方案基于扩展层的自适应残差映射,同时考虑到用于编码该信息的高效视频编码(HEVC)的性能。该解决方案优于参考方法,实现了高达5.05 dB的平均PU-PSNR改进。该方法还显示了实现与单层编码解决方案相同的HDR质量的潜力,同时具有最小的比特率开销和基础层可接受的LDR质量。
{"title":"Adaptive residual mapping for an efficient extension layer coding in two-layer HDR video coding","authors":"J. Mir, Dumidu S. Talagala, H. K. Arachchi, W. Fernando","doi":"10.1109/ICIP.2016.7532587","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7532587","url":null,"abstract":"In the absence of a commercial High Dynamic Range (HDR) distribution pipeline, two-layer backward-compatible HDR video coding is a viable solution for the imminent transition from Low Dynamic Range (LDR) to HDR content transmission. However, the performance of a two-layer coding solution is governed by the extension layer coding performance. In this paper, we propose an improved two-layer backward-compatible HDR video coding solution based on an adaptive residual mapping for the extension layer, keeping in view the performance of High Efficiency Video Coding (HEVC) being used to code this information. The proposed solution outperforms the reference method achieving averaged PU-PSNR improvements of up to 5.05 dB. The proposed method also shows potential of achieving the same HDR quality as the single layer coding solution with a minimum bitrate overhead and acceptable LDR quality in the base layer.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"138 1","pages":"1394-1398"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77435113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A perceptual visibility metric for banding artifacts 条带伪影的感知可见性度量
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7532722
Yilin Wang, S. Kum, Chao Chen, A. Kokaram
Banding is a common video artifact caused by compressing low texture regions with coarse quantization. Relatively few previous attempts exist to address banding and none incorporate subjective testing for calibrating the measurement. In this paper, we propose a novel metric that incorporates both edge length and contrast across the edge to measure video banding. We further introduce both reference and non-reference metrics. Our results demonstrate that the new metrics have a very high correlation with subjective assessment and certainly outperforms PSNR, SSIM, and VQM.
条带是对低纹理区域进行粗量化压缩后产生的常见视频伪影。相对较少的先前的尝试存在,以解决分带和没有纳入主观测试校准测量。在本文中,我们提出了一种结合边缘长度和边缘对比度的新度量来测量视频带。我们进一步介绍参考指标和非参考指标。我们的结果表明,新的指标与主观评估有非常高的相关性,并且肯定优于PSNR、SSIM和VQM。
{"title":"A perceptual visibility metric for banding artifacts","authors":"Yilin Wang, S. Kum, Chao Chen, A. Kokaram","doi":"10.1109/ICIP.2016.7532722","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7532722","url":null,"abstract":"Banding is a common video artifact caused by compressing low texture regions with coarse quantization. Relatively few previous attempts exist to address banding and none incorporate subjective testing for calibrating the measurement. In this paper, we propose a novel metric that incorporates both edge length and contrast across the edge to measure video banding. We further introduce both reference and non-reference metrics. Our results demonstrate that the new metrics have a very high correlation with subjective assessment and certainly outperforms PSNR, SSIM, and VQM.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"31 1","pages":"2067-2071"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77183986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Automatic detection of 3D lighting inconsistencies via a facial landmark based morphable model 通过基于面部地标的变形模型自动检测3D光照不一致
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7533097
Bo Peng, Wei Wang, Jing Dong, T. Tan
Existing 3D lighting consistency based forensic methods have some practical problems. They usually require additional images and human labor to reconstruct the 3D face model for lighting estimation, and furthermore, they cannot deal with expressional faces effectively. These drawbacks make them unusable in many practical cases. In this paper, we propose a more practical 3D lighting based forensic method by incorporating a facial landmark based 3D morphable model to efficiently fit the face shape. We also introduce a residual error based algorithm to automatically exclude outliers in lighting estimation. Our proposed method is fully automatic and very efficient compared to previous ones. Also, it does not depend on additional images and has better performance for expressional faces. Experiments on a realistic face dataset with variational lighting conditions indicate the efficacy and superiority of our method.
现有的基于3D光照一致性的取证方法存在一些实际问题。它们通常需要额外的图像和人工来重建三维人脸模型进行光照估计,而且它们不能有效地处理表情人脸。这些缺点使它们在许多实际情况下无法使用。在本文中,我们提出了一种更实用的基于3D照明的法医方法,通过结合基于面部地标的3D变形模型来有效地拟合面部形状。我们还介绍了一种基于残差的算法来自动排除光照估计中的异常值。与以往的方法相比,我们提出的方法是全自动的,效率很高。此外,它不依赖于额外的图像,对表情面部有更好的表现。在不同光照条件下的真实人脸数据集上的实验表明了该方法的有效性和优越性。
{"title":"Automatic detection of 3D lighting inconsistencies via a facial landmark based morphable model","authors":"Bo Peng, Wei Wang, Jing Dong, T. Tan","doi":"10.1109/ICIP.2016.7533097","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7533097","url":null,"abstract":"Existing 3D lighting consistency based forensic methods have some practical problems. They usually require additional images and human labor to reconstruct the 3D face model for lighting estimation, and furthermore, they cannot deal with expressional faces effectively. These drawbacks make them unusable in many practical cases. In this paper, we propose a more practical 3D lighting based forensic method by incorporating a facial landmark based 3D morphable model to efficiently fit the face shape. We also introduce a residual error based algorithm to automatically exclude outliers in lighting estimation. Our proposed method is fully automatic and very efficient compared to previous ones. Also, it does not depend on additional images and has better performance for expressional faces. Experiments on a realistic face dataset with variational lighting conditions indicate the efficacy and superiority of our method.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"7 1","pages":"3932-3936"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76265745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Generic statistical multiplexer with a parametrized bitrate allocation criteria 具有参数化比特率分配标准的通用统计多路复用器
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7532734
Médéric Blestel, M. Ropert, W. Hamidouche
In this paper, we address the problem of the statistical multiplexing of video streams. Dynamic bitrate allocation is used to improve the overall video quality of a pool of channels. The balance is obtained by providing more bits to complex channels, while deprivations are applied to non-complex ones. In this study, the error minimization optimization of several compressed video is considered along with different metrics in order to exhibit a repartition key for bitrate sharing among all the channels. The goal of this approach is to introduce a reactivity parameter able to manage the bit transfer between channels. The validity of the parametric model is verified on two particular values, and compared to a static repartition solution.
本文主要研究视频流的统计复用问题。动态比特率分配用于提高信道池的整体视频质量。平衡是通过向复杂通道提供更多的比特来实现的,而对非复杂通道进行剥夺。在本研究中,考虑了几个压缩视频的误差最小化优化以及不同的指标,以展示在所有通道之间共享比特率的重分区密钥。这种方法的目标是引入一个能够管理通道之间的位传输的反应性参数。在两个特定值上验证了参数化模型的有效性,并与静态重划分解进行了比较。
{"title":"Generic statistical multiplexer with a parametrized bitrate allocation criteria","authors":"Médéric Blestel, M. Ropert, W. Hamidouche","doi":"10.1109/ICIP.2016.7532734","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7532734","url":null,"abstract":"In this paper, we address the problem of the statistical multiplexing of video streams. Dynamic bitrate allocation is used to improve the overall video quality of a pool of channels. The balance is obtained by providing more bits to complex channels, while deprivations are applied to non-complex ones. In this study, the error minimization optimization of several compressed video is considered along with different metrics in order to exhibit a repartition key for bitrate sharing among all the channels. The goal of this approach is to introduce a reactivity parameter able to manage the bit transfer between channels. The validity of the parametric model is verified on two particular values, and compared to a static repartition solution.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"131 1","pages":"2127-2131"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76713237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unsupervised person re-identification with locality-constrained Earth Mover's distance 无监督人员再识别与位置约束的土移者的距离
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7533169
Dan Wang, Canxiang Yan, S. Shan, Xilin Chen
The difficult acquisition of labeled data and the misalignment of local matching are major obstacles to apply person re-identification in real scenarios. To alleviate these problems, we propose an unsupervised method, called locality-constrained Earth Mover's Distance (LC-EMD), to learn the optimal measure between image pairs. Specifically, Gaussian mixture models (GMMs) are learned as signatures. By imposing locality constraints, LC-EMD can naturally achieve partial matching between Gaussian components. Moreover, LC-EMD has the analytical solution which can be efficiently computed. Experiments on two public datasets demonstrate LC-EMD is robust to misalignment and performs better than other unsupervised methods.
标记数据的获取困难和局部匹配的不匹配是在真实场景中应用人员再识别的主要障碍。为了解决这些问题,我们提出了一种无监督的方法,称为位置约束的地球移动者距离(LC-EMD),以学习图像对之间的最佳度量。具体来说,高斯混合模型(gmm)作为签名学习。通过施加局部性约束,LC-EMD可以很自然地实现高斯分量之间的部分匹配。此外,LC-EMD具有解析解,可以有效地进行计算。在两个公共数据集上的实验表明,LC-EMD对不对准具有鲁棒性,并且优于其他无监督方法。
{"title":"Unsupervised person re-identification with locality-constrained Earth Mover's distance","authors":"Dan Wang, Canxiang Yan, S. Shan, Xilin Chen","doi":"10.1109/ICIP.2016.7533169","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7533169","url":null,"abstract":"The difficult acquisition of labeled data and the misalignment of local matching are major obstacles to apply person re-identification in real scenarios. To alleviate these problems, we propose an unsupervised method, called locality-constrained Earth Mover's Distance (LC-EMD), to learn the optimal measure between image pairs. Specifically, Gaussian mixture models (GMMs) are learned as signatures. By imposing locality constraints, LC-EMD can naturally achieve partial matching between Gaussian components. Moreover, LC-EMD has the analytical solution which can be efficiently computed. Experiments on two public datasets demonstrate LC-EMD is robust to misalignment and performs better than other unsupervised methods.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"115 1","pages":"4289-4293"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77926931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Person re-identification based on hierarchical bipartite graph matching 基于层次二部图匹配的人物再识别
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7533162
Yan Huang, Hao Sheng, Z. Xiong
This work proposes a novel person re-identification method based on Hierarchical Bipartite Graph Matching. Because human eyes observe person appearance roughly first and then goes further into the details gradually, our method abstracts person image from coarse to fine granularity, and finally into a three layer tree structure. Then, three bipartite graph matching methods are proposed for the matching of each layer between the trees. At the bottom layer Non-complete Bipartite Graph matching is proposed to collect matching pairs among small local regions. At the middle layer Semi-complete Bipartite Graph matching is used to deal with the problem of spatial misalignment between two person bodies. Complete Bipartite Graph matching is presented to refine the ranking result at the top layer. The effectiveness of our method is validated on the CAVIAR4REID and VIPeR datasets, and competitive results are achieved on both datasets.
本文提出了一种基于层次二部图匹配的人物再识别方法。由于人眼先对人的外表进行粗略的观察,然后逐渐深入到细节,因此我们的方法将人的图像从粗粒度抽象到细粒度,最后形成三层树状结构。然后,提出了三种二部图匹配方法,用于树间各层的匹配。在底层提出非完全二部图匹配,收集小局部区域之间的匹配对。在中间层,采用半完全二部图匹配来处理两个人体之间的空间不对齐问题。采用完全二部图匹配的方法对顶层的排序结果进行细化。在CAVIAR4REID和VIPeR数据集上验证了该方法的有效性,并在两个数据集上取得了具有竞争力的结果。
{"title":"Person re-identification based on hierarchical bipartite graph matching","authors":"Yan Huang, Hao Sheng, Z. Xiong","doi":"10.1109/ICIP.2016.7533162","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7533162","url":null,"abstract":"This work proposes a novel person re-identification method based on Hierarchical Bipartite Graph Matching. Because human eyes observe person appearance roughly first and then goes further into the details gradually, our method abstracts person image from coarse to fine granularity, and finally into a three layer tree structure. Then, three bipartite graph matching methods are proposed for the matching of each layer between the trees. At the bottom layer Non-complete Bipartite Graph matching is proposed to collect matching pairs among small local regions. At the middle layer Semi-complete Bipartite Graph matching is used to deal with the problem of spatial misalignment between two person bodies. Complete Bipartite Graph matching is presented to refine the ranking result at the top layer. The effectiveness of our method is validated on the CAVIAR4REID and VIPeR datasets, and competitive results are achieved on both datasets.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"13 1","pages":"4255-4259"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78841906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Joint denoising / compression of image contours via geometric prior and variable-length context tree 基于几何先验和变长上下文树的图像轮廓联合去噪/压缩
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7532618
Amin Zheng, Gene Cheung, D. Florêncio
The advent of depth sensing technologies has eased the detection of object contours in images. For efficient image compression, coded contours can enable edge-adaptive coding techniques such as graph Fourier transform (GFT) and arbitrarily shaped sub-block motion prediction. However, acquisition noise in captured depth images means that detected contours also suffer from errors. In this paper, we propose to jointly denoise and compress detected contours in an image. Specifically, we first propose a burst error model that models typical errors encountered in an observed string y of directional edges. We then formulate a rate-constrained maximum a posteriori (MAP) problem that trades off the posterior probability P(x|y) of an estimated string x given y with its code rate R(x). Given our burst error model, we show that the negative log of the likelihood P(y|x) can be written as a simple sum of burst error events, error symbols and burst lengths, while the geometric prior P(x) states intuitively that contours are more likely straight than curvy. We design a dynamic programming (DP) algorithm that solves the posed problem optimally. Experimental results show that our joint denoising / compression scheme outperformed a competing separate scheme in rate-distortion performance noticeably.
随着深度传感技术的出现,图像中物体轮廓的检测变得更加简单。为了有效地压缩图像,编码轮廓可以启用边缘自适应编码技术,如图傅里叶变换(GFT)和任意形状的子块运动预测。然而,捕获深度图像中的采集噪声意味着检测到的轮廓也会受到误差的影响。在本文中,我们提出对图像中检测到的轮廓进行联合去噪和压缩。具体来说,我们首先提出了一个突发误差模型,该模型模拟了在观察到的方向边的字符串y中遇到的典型误差。然后,我们制定了一个速率约束的最大后验(MAP)问题,该问题将给定y的估计字符串x的后验概率P(x|y)与其码率R(x)相权衡。给定我们的突发错误模型,我们证明了似然P(y|x)的负对数可以写成突发错误事件、错误符号和突发长度的简单总和,而几何先验P(x)直观地表明轮廓更可能是直线而不是曲线。我们设计了一种动态规划(DP)算法来最优地解决所提出的问题。实验结果表明,我们的联合去噪/压缩方案在率失真性能上明显优于独立方案。
{"title":"Joint denoising / compression of image contours via geometric prior and variable-length context tree","authors":"Amin Zheng, Gene Cheung, D. Florêncio","doi":"10.1109/ICIP.2016.7532618","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7532618","url":null,"abstract":"The advent of depth sensing technologies has eased the detection of object contours in images. For efficient image compression, coded contours can enable edge-adaptive coding techniques such as graph Fourier transform (GFT) and arbitrarily shaped sub-block motion prediction. However, acquisition noise in captured depth images means that detected contours also suffer from errors. In this paper, we propose to jointly denoise and compress detected contours in an image. Specifically, we first propose a burst error model that models typical errors encountered in an observed string y of directional edges. We then formulate a rate-constrained maximum a posteriori (MAP) problem that trades off the posterior probability P(x|y) of an estimated string x given y with its code rate R(x). Given our burst error model, we show that the negative log of the likelihood P(y|x) can be written as a simple sum of burst error events, error symbols and burst lengths, while the geometric prior P(x) states intuitively that contours are more likely straight than curvy. We design a dynamic programming (DP) algorithm that solves the posed problem optimally. Experimental results show that our joint denoising / compression scheme outperformed a competing separate scheme in rate-distortion performance noticeably.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"1 1","pages":"1549-1553"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79798097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Using node relationships for hierarchical classification 使用节点关系进行分层分类
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7532410
Tien-Dung Mai, T. Ngo, Duy-Dinh Le, D. Duong, Kiem Hoang, S. Satoh
Hierarchical classification is a computational efficient approach for large-scale image classification. The main challenging issue of this approach is to deal with error propagation. Irrelevant branching decision made at a parent node cannot be corrected at its child nodes in traversing the tree for classification. This paper presents a novel approach to reduce branching error at a node by taking its relative relationship into account. Given a node on the tree, we model each candidate branch by considering classification response of its child nodes, grandchild nodes and their differences with siblings. A maximum margin classifier is then applied to select the most discriminating candidate. Our proposed approach outperforms related approaches on Caltech-256, SUN-397 and ILSVRC2010-1K.
分层分类是一种计算效率高的大规模图像分类方法。这种方法的主要挑战问题是处理错误传播。在遍历树进行分类时,在父节点上做出的不相关分支决策不能在其子节点上得到纠正。本文提出了一种通过考虑节点的相对关系来减少节点分支误差的新方法。给定树上的一个节点,我们通过考虑其子节点、孙子节点及其与兄弟节点的差异的分类响应来建模每个候选分支。然后应用最大边际分类器来选择最具判别性的候选对象。我们提出的方法优于Caltech-256, SUN-397和ILSVRC2010-1K上的相关方法。
{"title":"Using node relationships for hierarchical classification","authors":"Tien-Dung Mai, T. Ngo, Duy-Dinh Le, D. Duong, Kiem Hoang, S. Satoh","doi":"10.1109/ICIP.2016.7532410","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7532410","url":null,"abstract":"Hierarchical classification is a computational efficient approach for large-scale image classification. The main challenging issue of this approach is to deal with error propagation. Irrelevant branching decision made at a parent node cannot be corrected at its child nodes in traversing the tree for classification. This paper presents a novel approach to reduce branching error at a node by taking its relative relationship into account. Given a node on the tree, we model each candidate branch by considering classification response of its child nodes, grandchild nodes and their differences with siblings. A maximum margin classifier is then applied to select the most discriminating candidate. Our proposed approach outperforms related approaches on Caltech-256, SUN-397 and ILSVRC2010-1K.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"4 1","pages":"514-518"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80957645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Membrane segmentation via active learning with deep networks 基于深度网络主动学习的膜分割
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7532697
Utkarsh Gaur, M. Kourakis, E. Newman-Smith, William C. Smith, B. S. Manjunath
Segmentation is a key component of several bio-medical image processing systems. Recently, segmentation methods based on supervised learning such as deep convolutional networks have enjoyed immense success for natural image datasets and biological datasets alike. These methods require large volumes of data to avoid overfitting which limits their applicability. In this work, we present a transfer learning mechanism based on active learning which allows us to utilize pre-trained deep networks for segmenting new domains with limited labelled data. We introduce a novel optimization criterion to allow feedback on the most uncertain, yet abundant image patterns thus provisioning for an expert in the loop albeit with minimum amount of guidance. Our experiments demonstrate the effectiveness of the proposed method in improving segmentation performance with very limited labelled data.
分割是生物医学图像处理系统的关键组成部分。最近,基于监督学习的分割方法,如深度卷积网络,在自然图像数据集和生物数据集上都取得了巨大的成功。这些方法需要大量的数据,以避免过度拟合,从而限制了它们的适用性。在这项工作中,我们提出了一种基于主动学习的迁移学习机制,该机制允许我们利用预训练的深度网络来分割具有有限标记数据的新域。我们引入了一种新的优化准则,允许对最不确定的、但丰富的图像模式进行反馈,从而为回路中的专家提供最少的指导。我们的实验证明了该方法在非常有限的标记数据下提高分割性能的有效性。
{"title":"Membrane segmentation via active learning with deep networks","authors":"Utkarsh Gaur, M. Kourakis, E. Newman-Smith, William C. Smith, B. S. Manjunath","doi":"10.1109/ICIP.2016.7532697","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7532697","url":null,"abstract":"Segmentation is a key component of several bio-medical image processing systems. Recently, segmentation methods based on supervised learning such as deep convolutional networks have enjoyed immense success for natural image datasets and biological datasets alike. These methods require large volumes of data to avoid overfitting which limits their applicability. In this work, we present a transfer learning mechanism based on active learning which allows us to utilize pre-trained deep networks for segmenting new domains with limited labelled data. We introduce a novel optimization criterion to allow feedback on the most uncertain, yet abundant image patterns thus provisioning for an expert in the loop albeit with minimum amount of guidance. Our experiments demonstrate the effectiveness of the proposed method in improving segmentation performance with very limited labelled data.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"81 1","pages":"1943-1947"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83140227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A framework of single-image deraining method based on analysis of rain characteristics 基于降雨特征分析的单幅图像脱轨方法框架
Pub Date : 2016-09-01 DOI: 10.1109/ICIP.2016.7533128
Yinglong Wang, Chen Chen, Shuyuan Zhu, B. Zeng
In this paper, we propose an algorithm to remove rain streaks from single color image. Firstly, the guided filter, cooperated with rain pixels detection are used to separate a color image into low-frequency and high-frequency parts so that most rain components exist in the high-frequency part. Then, we focus on the high-frequency part to extract the non-rain details according to the characteristics of the rain in which a dictionary learning method is used. Meanwhile, to enhance the quality of the rain-removed image, the proposed principal direction of an image patch (PDIP) and the sensitivity of variance of color channels (SVCC) are employed in our work to help extract more non-rain details. Compared with the state-of-the-art works, our proposed method can remove the rain (especially heavy rain) from color images more efficiently.
本文提出了一种从单色图像中去除雨纹的算法。首先,利用引导滤波器配合雨像点检测,将彩色图像分离为低频和高频部分,使大部分雨成分存在于高频部分;然后,我们根据雨的特征,重点提取高频部分的非雨细节,其中使用字典学习方法。同时,为了提高去雨图像的质量,我们采用了图像补丁主方向(PDIP)和颜色通道方差灵敏度(SVCC)来提取更多的非雨细节。与现有的方法相比,我们的方法可以更有效地去除彩色图像中的雨(特别是大雨)。
{"title":"A framework of single-image deraining method based on analysis of rain characteristics","authors":"Yinglong Wang, Chen Chen, Shuyuan Zhu, B. Zeng","doi":"10.1109/ICIP.2016.7533128","DOIUrl":"https://doi.org/10.1109/ICIP.2016.7533128","url":null,"abstract":"In this paper, we propose an algorithm to remove rain streaks from single color image. Firstly, the guided filter, cooperated with rain pixels detection are used to separate a color image into low-frequency and high-frequency parts so that most rain components exist in the high-frequency part. Then, we focus on the high-frequency part to extract the non-rain details according to the characteristics of the rain in which a dictionary learning method is used. Meanwhile, to enhance the quality of the rain-removed image, the proposed principal direction of an image patch (PDIP) and the sensitivity of variance of color channels (SVCC) are employed in our work to help extract more non-rain details. Compared with the state-of-the-art works, our proposed method can remove the rain (especially heavy rain) from color images more efficiently.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"4 1","pages":"4087-4091"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79626284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
期刊
2016 IEEE International Conference on Image Processing (ICIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1