首页 > 最新文献

2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)最新文献

英文 中文
Automated quantification of retinal vessel morphometry in the UK biobank cohort 英国生物库队列中视网膜血管形态测量的自动量化
R. Welikala, M. Fraz, M. Habib, S. Daniel-Tong, M. Yates, P. J. Foster, P. Whincup, A. Rudnicka, C. Owen, D. Strachan, S. Barman
The morphometric characteristics of the retinal vascular network have been associated with risk markers of many systemic and vascular diseases. However, analysis of data from large population based studies is needed to help resolve uncertainties in these associations. QUARTZ (QUantitative Analysis of Retinal vessel Topology and siZe) is a fully automated retinal image analysis system that has been designed to process large numbers of retinal images and obtains quantitative measures of vessel morphology to be used in epidemiological studies. QUARTZ has been used to process retinal images from UK Biobank which is a large population-based cohort study. In this paper, we address issues of robustness with respect to processing large datasets and validate QUARTZ using a subset of 4,692 UK Biobank retinal images. Ground truth data produced by human observers for validation have been made available online. Following validation, 135,867 retinal images (68,549 participants) from the UK Biobank study were processed by QUARTZ. 71.53% of these images were classified as being of adequate quality, which equated to 80.90% participants with at least one image of adequate quality. The vessel morphometric data are currently being used in epidemiological studies. The intention of the UK Biobank Eye and Vision Consortium is to include these derived measures in the UK Biobank data archive.
视网膜血管网的形态特征与许多系统性和血管疾病的危险标志物有关。然而,需要对基于大量人口的研究的数据进行分析,以帮助解决这些关联中的不确定性。QUARTZ (QUantitative Analysis of Retinal vessel Topology and siZe)是一个全自动的视网膜图像分析系统,用于处理大量的视网膜图像,并获得用于流行病学研究的血管形态的定量测量。QUARTZ已被用于处理来自英国生物银行的视网膜图像,这是一项基于人群的大型队列研究。在本文中,我们解决了处理大型数据集的鲁棒性问题,并使用4,692个UK Biobank视网膜图像子集验证QUARTZ。人类观察员为验证而产生的地面真实数据已在网上提供。经过验证后,QUARTZ处理了来自英国生物银行研究的135,867张视网膜图像(68,549名参与者)。71.53%的这些图像被归类为质量良好,这相当于80.90%的参与者至少有一张质量良好的图像。血管形态测量数据目前被用于流行病学研究。英国生物银行眼科和视力协会的意图是将这些衍生措施纳入英国生物银行数据档案。
{"title":"Automated quantification of retinal vessel morphometry in the UK biobank cohort","authors":"R. Welikala, M. Fraz, M. Habib, S. Daniel-Tong, M. Yates, P. J. Foster, P. Whincup, A. Rudnicka, C. Owen, D. Strachan, S. Barman","doi":"10.1109/IPTA.2017.8310108","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310108","url":null,"abstract":"The morphometric characteristics of the retinal vascular network have been associated with risk markers of many systemic and vascular diseases. However, analysis of data from large population based studies is needed to help resolve uncertainties in these associations. QUARTZ (QUantitative Analysis of Retinal vessel Topology and siZe) is a fully automated retinal image analysis system that has been designed to process large numbers of retinal images and obtains quantitative measures of vessel morphology to be used in epidemiological studies. QUARTZ has been used to process retinal images from UK Biobank which is a large population-based cohort study. In this paper, we address issues of robustness with respect to processing large datasets and validate QUARTZ using a subset of 4,692 UK Biobank retinal images. Ground truth data produced by human observers for validation have been made available online. Following validation, 135,867 retinal images (68,549 participants) from the UK Biobank study were processed by QUARTZ. 71.53% of these images were classified as being of adequate quality, which equated to 80.90% participants with at least one image of adequate quality. The vessel morphometric data are currently being used in epidemiological studies. The intention of the UK Biobank Eye and Vision Consortium is to include these derived measures in the UK Biobank data archive.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130096936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Illumination-robust multispectral demosaicing 光照鲁棒多光谱去马赛克
Sofiane Mihoubi, B. Mathon, Jean-Baptiste Thomas, O. Losson, L. Macaire
Snapshot multispectral cameras that are equipped with filter arrays acquire a raw image that represents the radiance of a scene over the electromagnetic spectrum at video rate. These cameras require a demosaicing procedure to estimate a multispectral image with full spatio-spectral definition. Such a procedure is based on spectral correlation properties that are sensitive to illumination. In this paper, we first highlight the influence of illumination on demosaicing performances. Then we propose camera-, illumination-, and raw image-based normalisations that make demosaicing robust to illumination. Experimental results on state-of-the-art demosaicing algorithms show that such normalisations improve the quality of multispectral images estimated from raw images acquired under various illuminations.
配备滤光片阵列的快照多光谱相机以视频速率获取代表电磁频谱上场景亮度的原始图像。这些相机需要一个去马赛克程序来估计具有完整空间光谱清晰度的多光谱图像。这种方法是基于对光照敏感的光谱相关特性。在本文中,我们首先强调了光照对反马赛克性能的影响。然后,我们提出了基于相机、照明和原始图像的归一化,使去马赛克对照明具有鲁棒性。对最先进的去马赛克算法的实验结果表明,这种归一化提高了从不同照明下获得的原始图像估计的多光谱图像的质量。
{"title":"Illumination-robust multispectral demosaicing","authors":"Sofiane Mihoubi, B. Mathon, Jean-Baptiste Thomas, O. Losson, L. Macaire","doi":"10.1109/IPTA.2017.8310135","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310135","url":null,"abstract":"Snapshot multispectral cameras that are equipped with filter arrays acquire a raw image that represents the radiance of a scene over the electromagnetic spectrum at video rate. These cameras require a demosaicing procedure to estimate a multispectral image with full spatio-spectral definition. Such a procedure is based on spectral correlation properties that are sensitive to illumination. In this paper, we first highlight the influence of illumination on demosaicing performances. Then we propose camera-, illumination-, and raw image-based normalisations that make demosaicing robust to illumination. Experimental results on state-of-the-art demosaicing algorithms show that such normalisations improve the quality of multispectral images estimated from raw images acquired under various illuminations.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"502 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116549915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Reversible data hiding in encrypted images based on adaptive local entropy analysis 基于自适应局部熵分析的加密图像可逆数据隐藏
Pauline Puteaux, W. Puech
With the development of cloud computing, the growth in information technology has led to serious security issues. For this reason, a lot of multimedia files are stored in encrypted forms. Methods of reversible data hiding in encrypted images (RDHEI) have been designed to provide authentication and integrity in the encrypted domain. The original image is firstly encrypted to ensure confidentiality, by making the content unreadable. A secret message is then embedded in the encrypted image, without the need of the encryption key or any access to the clear content. The challenge lies in finding the best trade-off between embedding capacity and quality of the reconstructed image. In 2008, Puech et al. suggested using the AES algorithm to encrypt an original image and to embed one bit in each block of 16 pixels (payload = 0.0625 bpp) [12]. During the decryption phase, the original image is reconstructed by measuring the standard deviation into each block. In this paper, we propose an improvement to this method, by performing an adaptive local entropy measurement. We can achieve a larger payload without altering the recovered image quality. Our obtained results are very good and better than most of the modern state-of-the-art methods, whilst offering an improved security level with the use of the AES algorithm, defined as the encryption standard by the NIST.
随着云计算的发展,信息技术的发展带来了严重的安全问题。由于这个原因,许多多媒体文件都以加密的形式存储。为了在加密域内提供身份验证和完整性,设计了加密图像中可逆数据隐藏方法(RDHEI)。首先对原始图像进行加密,通过使内容不可读来确保机密性。然后,在加密图像中嵌入一个秘密消息,而不需要加密密钥或对清晰内容的任何访问。难点在于如何在重构图像的嵌入能力和质量之间找到最佳平衡点。2008年,Puech等人建议使用AES算法对原始图像进行加密,并在16像素(payload = 0.0625 bpp)的每个块中嵌入1位。在解密阶段,通过测量每个块的标准差来重建原始图像。在本文中,我们提出了一种改进方法,通过执行自适应局部熵测量。我们可以在不改变恢复图像质量的情况下获得更大的有效载荷。我们获得的结果非常好,比大多数现代最先进的方法更好,同时通过使用AES算法提供改进的安全级别,AES算法被NIST定义为加密标准。
{"title":"Reversible data hiding in encrypted images based on adaptive local entropy analysis","authors":"Pauline Puteaux, W. Puech","doi":"10.1109/IPTA.2017.8310143","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310143","url":null,"abstract":"With the development of cloud computing, the growth in information technology has led to serious security issues. For this reason, a lot of multimedia files are stored in encrypted forms. Methods of reversible data hiding in encrypted images (RDHEI) have been designed to provide authentication and integrity in the encrypted domain. The original image is firstly encrypted to ensure confidentiality, by making the content unreadable. A secret message is then embedded in the encrypted image, without the need of the encryption key or any access to the clear content. The challenge lies in finding the best trade-off between embedding capacity and quality of the reconstructed image. In 2008, Puech et al. suggested using the AES algorithm to encrypt an original image and to embed one bit in each block of 16 pixels (payload = 0.0625 bpp) [12]. During the decryption phase, the original image is reconstructed by measuring the standard deviation into each block. In this paper, we propose an improvement to this method, by performing an adaptive local entropy measurement. We can achieve a larger payload without altering the recovered image quality. Our obtained results are very good and better than most of the modern state-of-the-art methods, whilst offering an improved security level with the use of the AES algorithm, defined as the encryption standard by the NIST.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"20 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132026405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Completed local structure patterns on three orthogonal planes for dynamic texture recognition 完成了三个正交平面上的局部结构模式,用于动态纹理识别
Thanh Tuan Nguyen, T. Nguyen, F. Bouchara
Dynamic texture (DT) is a challenging problem in computer vision because of the chaotic motion of textures. We address in this paper a new dynamic texture operator by considering local structure patterns (LSP) and completed local binary patterns (CLBP) for static images in three orthogonal planes to capture spatial-temporal texture structures. Since the typical operator of local binary patterns (LBP), which uses center pixel for thresholding, has some limitations such as sensitivity to noise and near uniform regions, the proposed approach can deal with these drawbacks by using global and local texture information for adaptive thresholding and CLBP for exploiting complementary texture information in three orthogonal planes. Evaluations on different datasets of dynamic textures (UCLA, DynTex, DynTex++) show that our proposal significantly outperforms recent results in the state-of-the-art approaches.
由于纹理的混沌运动,动态纹理是计算机视觉中的一个具有挑战性的问题。本文提出了一种新的动态纹理算子,利用局部结构模式(LSP)和完全局部二值模式(CLBP)对三个正交平面上的静态图像进行时空纹理捕获。针对典型的局部二值模式算子(LBP)使用中心像素进行阈值分割,存在对噪声和近均匀区域敏感等局限性,本文提出的方法利用全局和局部纹理信息进行自适应阈值分割,利用CLBP在三个正交平面上利用互补纹理信息进行自适应阈值分割。对不同动态纹理数据集(UCLA, DynTex, DynTex++)的评估表明,我们的建议在最先进的方法中显着优于最近的结果。
{"title":"Completed local structure patterns on three orthogonal planes for dynamic texture recognition","authors":"Thanh Tuan Nguyen, T. Nguyen, F. Bouchara","doi":"10.1109/IPTA.2017.8310130","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310130","url":null,"abstract":"Dynamic texture (DT) is a challenging problem in computer vision because of the chaotic motion of textures. We address in this paper a new dynamic texture operator by considering local structure patterns (LSP) and completed local binary patterns (CLBP) for static images in three orthogonal planes to capture spatial-temporal texture structures. Since the typical operator of local binary patterns (LBP), which uses center pixel for thresholding, has some limitations such as sensitivity to noise and near uniform regions, the proposed approach can deal with these drawbacks by using global and local texture information for adaptive thresholding and CLBP for exploiting complementary texture information in three orthogonal planes. Evaluations on different datasets of dynamic textures (UCLA, DynTex, DynTex++) show that our proposal significantly outperforms recent results in the state-of-the-art approaches.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122530336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Global visual saliency: Geometric and colorimetrie saliency fusion and its applications for 3D colored meshes 全局视觉显著性:几何和色度显著性融合及其在3D彩色网格中的应用
Anass Nouri, C. Charrier, O. Lézoray
Many computer graphics applications use visual saliency information to guide their treatments such as adaptive compression, viewpoint-selection, segmentation, etc. However, all these applications rest on a partial estimation of visual saliency insofar that only geometric properties of the considered 3D mesh are taken into account leaving aside the colorimetric ones. As humans, our visual attention is sensitive to both geometric and colorimetric informations. Indeed, colorimetric information modifies the eye mouvements while visualizing a multimedia content. We propose in this paper an innovative approach for the detection of global saliency that takes into account both geometric and colorimetric features of a 3D mesh simulating hence the Human Visual System (HVS). For this, we generate two multi-scale saliency maps based on local geometric and colorimetric patch descriptors. These saliency maps are pooled using the Evidence Theory. We show the contribution and the benefit of our proposed global saliency approach for two applications: automatic optimal viewpoint selection and adaptive denoising of 3D colored meshes.
许多计算机图形应用程序使用视觉显著性信息来指导它们的处理,如自适应压缩、视点选择、分割等。然而,所有这些应用都依赖于视觉显著性的部分估计,到目前为止,只考虑所考虑的3D网格的几何属性,而不考虑色度属性。作为人类,我们的视觉注意力对几何和色度信息都很敏感。的确,在视觉化多媒体内容时,色度信息会改变眼球运动。我们在本文中提出了一种创新的方法来检测全局显著性,该方法考虑了模拟人类视觉系统(HVS)的三维网格的几何和色度特征。为此,我们基于局部几何和色度斑块描述符生成了两个多尺度显著性地图。这些显著性图使用证据理论进行汇总。我们展示了我们提出的全局显著性方法在两种应用中的贡献和好处:自动最佳视点选择和3D彩色网格的自适应去噪。
{"title":"Global visual saliency: Geometric and colorimetrie saliency fusion and its applications for 3D colored meshes","authors":"Anass Nouri, C. Charrier, O. Lézoray","doi":"10.1109/IPTA.2017.8310142","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310142","url":null,"abstract":"Many computer graphics applications use visual saliency information to guide their treatments such as adaptive compression, viewpoint-selection, segmentation, etc. However, all these applications rest on a partial estimation of visual saliency insofar that only geometric properties of the considered 3D mesh are taken into account leaving aside the colorimetric ones. As humans, our visual attention is sensitive to both geometric and colorimetric informations. Indeed, colorimetric information modifies the eye mouvements while visualizing a multimedia content. We propose in this paper an innovative approach for the detection of global saliency that takes into account both geometric and colorimetric features of a 3D mesh simulating hence the Human Visual System (HVS). For this, we generate two multi-scale saliency maps based on local geometric and colorimetric patch descriptors. These saliency maps are pooled using the Evidence Theory. We show the contribution and the benefit of our proposed global saliency approach for two applications: automatic optimal viewpoint selection and adaptive denoising of 3D colored meshes.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134531244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Deep learning for automatic sale receipt understanding 深度学习自动销售收据理解
Rizlène Raoui-Outach, Cécile Million-Rousseau, A. Benoît, P. Lambert
As a general rule, data analytics are now mandatory for companies. Scanned document analysis brings additional challenges introduced by paper damages and scanning quality. In an industrial context, this work focuses on the automatic understanding of sale receipts which enable access to essential and accurate consumption statistics. Given an image acquired with a smart-phone, the proposed work mainly focuses on the first steps of the full tool chain which aims at providing essential information such as the store brand, purchased products and related prices with the highest possible confidence. To get this high confidence level, even if scanning is not perfectly controlled, we propose a double check processing tool-chain using Deep Convolutional Neural Networks (DCNNs) on one hand and more classical image and text processings on another hand. The originality of this work relates in this double check processing and in the joint use of DCNNs for different applications and text analysis.
作为一般规则,数据分析现在对公司来说是强制性的。扫描文档分析给纸张损伤和扫描质量带来了新的挑战。在工业环境中,这项工作的重点是自动理解销售收据,从而能够获得基本和准确的消费统计数据。给定用智能手机获取的图像,建议的工作主要集中在完整工具链的第一步,旨在以尽可能高的置信度提供基本信息,如商店品牌,购买的产品和相关价格。为了获得这种高置信度,即使扫描不是完全控制的,我们提出了一个双重检查处理工具链,一方面使用深度卷积神经网络(DCNNs),另一方面使用更经典的图像和文本处理。这项工作的独创性在于这种双重检查处理和DCNNs在不同应用和文本分析中的联合使用。
{"title":"Deep learning for automatic sale receipt understanding","authors":"Rizlène Raoui-Outach, Cécile Million-Rousseau, A. Benoît, P. Lambert","doi":"10.1109/IPTA.2017.8310088","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310088","url":null,"abstract":"As a general rule, data analytics are now mandatory for companies. Scanned document analysis brings additional challenges introduced by paper damages and scanning quality. In an industrial context, this work focuses on the automatic understanding of sale receipts which enable access to essential and accurate consumption statistics. Given an image acquired with a smart-phone, the proposed work mainly focuses on the first steps of the full tool chain which aims at providing essential information such as the store brand, purchased products and related prices with the highest possible confidence. To get this high confidence level, even if scanning is not perfectly controlled, we propose a double check processing tool-chain using Deep Convolutional Neural Networks (DCNNs) on one hand and more classical image and text processings on another hand. The originality of this work relates in this double check processing and in the joint use of DCNNs for different applications and text analysis.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114352620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Distanceless label propagation: An efficient direct connected component labeling algorithm for GPUs 无距离标签传播:一种高效的gpu直连组件标签算法
Laurent Cabaret, L. Lacassagne, D. Etiemble
Modern computer architectures are mainly composed of multi-core processors and GPUs. Consequently, solely providing a sequential implementation of algorithms or comparing algorithm performance without regard to architecture is no longer pertinent. Today, algorithms have to address parallelism, multithreading and memory topology (private/shared memory, cache or scratchpad, …). Most Connected Component Labeling (CCL) algorithms are sequential, direct and optimized for processors. Few were designed specifically for GPU architectures and none were designed to be adapted to different architectures. The most efficient GPU implementations are iterative; in order to manage synchronizations between processing units, but the number of iterations depends on the image shape and density. This paper describes the DLP (Distanceless Label Propagation) algorithms, an adaptable set of algorithms usable both on GPU and multi-core architectures, and DLP-GPU, an efficient direct CCL algorithm for GPU based on DLP mechanisms.
现代计算机体系结构主要由多核处理器和图形处理器组成。因此,仅仅提供算法的顺序实现或不考虑体系结构而比较算法性能不再相关。今天,算法必须解决并行性、多线程和内存拓扑(私有/共享内存、缓存或刮擦板等)。大多数连接组件标记(CCL)算法是顺序的,直接的和优化的处理器。很少有专门为GPU架构设计的,没有一个是为了适应不同的架构而设计的。最有效的GPU实现是迭代的;为了管理处理单元之间的同步,但迭代的次数取决于图像的形状和密度。本文介绍了DLP (distance - eless Label Propagation)算法和基于DLP机制的高效GPU直接CCL算法DLP-GPU。DLP算法是一套可用于GPU和多核架构的自适应算法。
{"title":"Distanceless label propagation: An efficient direct connected component labeling algorithm for GPUs","authors":"Laurent Cabaret, L. Lacassagne, D. Etiemble","doi":"10.1109/IPTA.2017.8310147","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310147","url":null,"abstract":"Modern computer architectures are mainly composed of multi-core processors and GPUs. Consequently, solely providing a sequential implementation of algorithms or comparing algorithm performance without regard to architecture is no longer pertinent. Today, algorithms have to address parallelism, multithreading and memory topology (private/shared memory, cache or scratchpad, …). Most Connected Component Labeling (CCL) algorithms are sequential, direct and optimized for processors. Few were designed specifically for GPU architectures and none were designed to be adapted to different architectures. The most efficient GPU implementations are iterative; in order to manage synchronizations between processing units, but the number of iterations depends on the image shape and density. This paper describes the DLP (Distanceless Label Propagation) algorithms, an adaptable set of algorithms usable both on GPU and multi-core architectures, and DLP-GPU, an efficient direct CCL algorithm for GPU based on DLP mechanisms.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130265756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A joint snake and atlas-based segmentation of plantar foot thermal images 基于联合蛇和地图集的足底热图像分割
Asma Bougrine, R. Harba, R. Canals, R. Lédée, M. Jabloun
The aim of the present study is to propose a new joint segmentation method dedicated to plantar foot thermal images. The proposed method is based on a modified active contour method (Snake) that includes a prior shape information, namely an atlas of the plantar foot contour, as an extra term in the Snake energy function. This term guides the Snake to the targeted contours during the deformation process, by calculating a curvature difference between the Snake curve and the atlas curve of the plantar foot surface. The proposed method was validated using a database of 50 plantar foot thermal images. Results showed the proposed method to outperform the classical Snake method and seven other recent methods. The comparison was done using two evaluation metrics, the root-mean-square error (RMSE) and the dice similarity coefficient (DSC). When compared to ground truth, the best average RMSE of 6 pixels and DSC score of 93% were obtained using the proposed method.
本研究的目的是提出一种新的针对足底热图像的关节分割方法。该方法基于一种改进的主动轮廓法(Snake),该方法在Snake能量函数中增加了一个先验形状信息,即足底轮廓图谱。这个术语通过计算Snake曲线和足底足表面的地图集曲线之间的曲率差,在变形过程中引导Snake到达目标轮廓。利用50张足底热图像数据库对该方法进行了验证。结果表明,该方法优于经典的Snake方法和其他7种新方法。比较采用两个评价指标,均方根误差(RMSE)和骰子相似系数(DSC)。与地面真实值相比,该方法的平均RMSE为6像素,DSC得分为93%。
{"title":"A joint snake and atlas-based segmentation of plantar foot thermal images","authors":"Asma Bougrine, R. Harba, R. Canals, R. Lédée, M. Jabloun","doi":"10.1109/IPTA.2017.8310081","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310081","url":null,"abstract":"The aim of the present study is to propose a new joint segmentation method dedicated to plantar foot thermal images. The proposed method is based on a modified active contour method (Snake) that includes a prior shape information, namely an atlas of the plantar foot contour, as an extra term in the Snake energy function. This term guides the Snake to the targeted contours during the deformation process, by calculating a curvature difference between the Snake curve and the atlas curve of the plantar foot surface. The proposed method was validated using a database of 50 plantar foot thermal images. Results showed the proposed method to outperform the classical Snake method and seven other recent methods. The comparison was done using two evaluation metrics, the root-mean-square error (RMSE) and the dice similarity coefficient (DSC). When compared to ground truth, the best average RMSE of 6 pixels and DSC score of 93% were obtained using the proposed method.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131350991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Single object tracking using offline trained deep regression networks 单目标跟踪使用离线训练深度回归网络
B. Mocanu, Ruxandra Tapu, T. Zaharia
In this paper we introduce a novel single object tracker based on two convolutional neural networks (CNNs) trained offline using data from large videos repositories. The key principle consists of alternating between tracking using motion information and adjusting the predicted location based on visual similarity. First, we construct a deep regression network architecture able to learn generic relations between the object appearance models and its associated motion patterns. Then, based on visual similarity constraints, the objects bounding box position, size and shape are continuously updated in order to maximize a patch similarity function designed using CNN. Finally, a multi-resolution fusion between the outputs of the two CNNs is performed for accurate object localization. The experimental evaluation performed on challenging datasets, proposed in the visual object tracking (VOT) international contest, validates the proposed method when compared with state-of-the-art systems. In terms of computational speed our tracker runs at 20fps.
本文介绍了一种基于两个卷积神经网络(cnn)的新型单目标跟踪器,该网络使用大型视频库中的数据进行离线训练。关键原理是在使用运动信息跟踪和基于视觉相似性调整预测位置之间交替进行。首先,我们构建了一个深度回归网络架构,能够学习物体外观模型与其相关运动模式之间的一般关系。然后,基于视觉相似性约束,不断更新物体边界框的位置、大小和形状,以最大化利用CNN设计的patch相似函数。最后,在两个cnn的输出之间进行多分辨率融合,以实现精确的目标定位。在视觉目标跟踪(VOT)国际竞赛中提出的具有挑战性的数据集上进行的实验评估,与最先进的系统相比,验证了所提出的方法。在计算速度方面,我们的跟踪器以20fps运行。
{"title":"Single object tracking using offline trained deep regression networks","authors":"B. Mocanu, Ruxandra Tapu, T. Zaharia","doi":"10.1109/IPTA.2017.8310091","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310091","url":null,"abstract":"In this paper we introduce a novel single object tracker based on two convolutional neural networks (CNNs) trained offline using data from large videos repositories. The key principle consists of alternating between tracking using motion information and adjusting the predicted location based on visual similarity. First, we construct a deep regression network architecture able to learn generic relations between the object appearance models and its associated motion patterns. Then, based on visual similarity constraints, the objects bounding box position, size and shape are continuously updated in order to maximize a patch similarity function designed using CNN. Finally, a multi-resolution fusion between the outputs of the two CNNs is performed for accurate object localization. The experimental evaluation performed on challenging datasets, proposed in the visual object tracking (VOT) international contest, validates the proposed method when compared with state-of-the-art systems. In terms of computational speed our tracker runs at 20fps.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124263304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Two-Stage volumetric texture synthesis based on structural information 基于结构信息的两阶段体积纹理合成
A. Akl, C. Yaacoub, M. Donias, Jean-Pierre Da Costa, C. Germain
Volumetric texture synthesis is mainly used in computer graphics for texturing objects in order to increase the realism of the 3D scenario. It is also of particular interest in many application domains such as studying the three-dimensional internal structure of materials and modelling volumetric data obtained by 3D imaging techniques for medical purposes. Based on a previously proposed 2D structure/texture synthesis algorithm, this paper proposes a two-stage 3D texture synthesis approach where the volumetric structure layer of the input texture is first synthesized, then used to help the synthesis of the volumetric texture. Results show that, using the structural information helps the synthesis of the volumetric texture and can outperform the synthesis based only on intensity information.
体积纹理合成主要用于计算机图形学中对物体进行纹理合成,以增加3D场景的真实感。它在许多应用领域也特别有趣,例如研究材料的三维内部结构和为医学目的通过3D成像技术获得的体积数据建模。本文在已有的二维结构/纹理合成算法的基础上,提出了一种两阶段的三维纹理合成方法,首先合成输入纹理的体积结构层,然后利用体积结构层帮助合成体积纹理。结果表明,使用结构信息有助于体积纹理的合成,并且优于仅基于强度信息的合成。
{"title":"Two-Stage volumetric texture synthesis based on structural information","authors":"A. Akl, C. Yaacoub, M. Donias, Jean-Pierre Da Costa, C. Germain","doi":"10.1109/IPTA.2017.8310141","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310141","url":null,"abstract":"Volumetric texture synthesis is mainly used in computer graphics for texturing objects in order to increase the realism of the 3D scenario. It is also of particular interest in many application domains such as studying the three-dimensional internal structure of materials and modelling volumetric data obtained by 3D imaging techniques for medical purposes. Based on a previously proposed 2D structure/texture synthesis algorithm, this paper proposes a two-stage 3D texture synthesis approach where the volumetric structure layer of the input texture is first synthesized, then used to help the synthesis of the volumetric texture. Results show that, using the structural information helps the synthesis of the volumetric texture and can outperform the synthesis based only on intensity information.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125627687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1