2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA): Latest Papers

Deconvolution for slowly time-varying systems 3D cases
S. Zenati, A. Boukrouche, P. Neveux
In the present work, we discuss an extension of the deconvolution techniques of Sekko [20] and Neveux [18] to 3D signals. The signals are assumed to be degraded by electronic linear systems, such as sensors or storage systems, whose parameters are slowly time-varying. For this purpose, Sekko et al. [20] developed a structure that was later adapted to time-varying systems [18] in order to produce an inverse filter with constant gain. The latter method was applied successfully to ordinary images [23]. The treatment of omnidirectional images requires working on the unit sphere, so the problem should be cast in 3D. In the 3D case, the deconvolution method [18] can be applied after some manipulations. The Hopf fibration makes it possible to treat the sphere as similar to a torus. The advantage of this approach is that Kalman filtering can be applied, so that omnidirectional images projected on the sphere can be deconvolved.
DOI: https://doi.org/10.1109/IPTA.2012.6469552 (published 2012-10-01)
Citations: 3
Towards automatic reconstruction of axonal structures in volumetric microscopy images depicting only active synapses
S. Sokoll, Hagen Beelitz, M. Heine, Klaus D. Tönnies
We propose an algorithm for the three-dimensional (3D) reconstruction of axonal structures to allow for the correlation of axonal structure, individual synaptic activity and single-molecule tracking. In contrast to related work, only active synapses are stained in our acquisitions, and the axonal structure is visible only through autofluorescence. We tackle this problem by detecting the medial axis line in the two-dimensional (2D) intensity projection of the 3D image and reconstructing the 3D structure by axial interpolation between connected active synapses. Because the staining is non-continuous, the detection of the medial axis line cannot rely on a global tree-like structure. Instead, we compute an initial skeleton by global segmentation and expand it by iteratively adding line segments that are locally optimal according to model knowledge of an axon. We evaluate our algorithm against a ground truth computed from co-transfection of surface molecules, which results in reliable continuous staining of the axonal structure.
DOI: https://doi.org/10.1109/IPTA.2012.6469549 (published 2012-10-01)
Citations: 0
High performance automatic number plate recognition in video streams
Arkadiusz Pawlik
We present a range of image and video analysis techniques that we have developed in connection with license plate recognition. Our methods focus on two areas: efficient image preprocessing to improve the detection rate on low-quality input, and combining the detection results from multiple frames to improve the accuracy of the recognized license plates. To evaluate our algorithms, we have implemented a complete ANPR system that detects and reads license plates. The system can process up to 110 frames per second on a single CPU core and scales well to at least 4 cores. The recognition rate varies with the quality of the video stream (amount of motion blur, resolution), but approaches 100% for clear, sharp license plate input data. The software is currently marketed commercially as CarID1. Some of our methods are more general and may have applications outside of the ANPR domain.
DOI: https://doi.org/10.1109/IPTA.2012.6469554 (published 2012-10-01)
Citations: 1
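The ANPR abstract above mentions combining detection results from multiple frames but does not say how. One simple way to picture the idea is a per-character majority vote over the strings read in successive frames. The sketch below is only an illustration under that assumption; the function name `combine_plate_readings` and the length-filtering rule are hypothetical and are not taken from the paper.

```python
from collections import Counter


def combine_plate_readings(readings):
    """Combine per-frame plate strings by per-position majority vote.

    Hypothetical helper: only readings that share the most common length
    are used, so that character positions line up.
    """
    if not readings:
        return ""
    # Keep only readings whose length matches the most frequent length.
    common_len = Counter(len(r) for r in readings).most_common(1)[0][0]
    aligned = [r for r in readings if len(r) == common_len]
    # Vote independently at each character position.
    return "".join(
        Counter(chars).most_common(1)[0][0] for chars in zip(*aligned)
    )


frames = ["ABC123", "A8C123", "ABC1Z3", "ABC12"]
print(combine_plate_readings(frames))
```

A vote like this tolerates isolated misreads (such as `8` for `B`) as long as most frames agree at each position; a production system would more likely weight each reading by its recognition confidence.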
Automatic oil spill detection in TerraSAR-X data using multi-contextual Markov modeling on irregular graphs
S. Martinis
This paper describes the workflow of an automatic near-real-time oil spill detection approach using single-polarized, high-resolution X-band Synthetic Aperture Radar satellite data. Dark formations on the water surface are classified in a completely unsupervised way using an automatic tile-based thresholding procedure. The derived global threshold value is used to initialize a hybrid multi-contextual Markov image model, which integrates scale-dependent and spatial contextual information on irregular hierarchical graph structures into the segment-based labeling of slick-covered and slick-free water surfaces. Experimental investigations performed on TerraSAR-X ScanSAR data acquired during the large-scale oil pollution in the Gulf of Mexico in May 2010 confirm the effectiveness of the proposed method with respect to accuracy and computational effort.
DOI: https://doi.org/10.1109/IPTA.2012.6469508 (published 2012-10-01)
Citations: 1
A discrete Hidden Markov models recognition module for temporal series: Application to real-time 3D hand gestures
Yannick Dennemont, Guillaume Bouyer, S. Otmane, M. Mallem
This work studies, implements and evaluates a gesture recognition module based on discrete Hidden Markov Models. The module is implemented in Matlab and used from Virtools. It can be used with different inputs and therefore serves different recognition purposes. We focus on 3D positions, the information common to our devices, as inputs for gesture recognition. Experiments are carried out with an infrared-tracked flystick. The recognition rate exceeds 90% with a personalized learning base; otherwise it exceeds 70% in an evaluation of 8 users on a real-time mini-game. The rates are roughly 80% for simple gestures and 60% for complex ones.
DOI: https://doi.org/10.1109/IPTA.2012.6469509 (published 2012-10-01)
Citations: 11
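The gesture recognition abstract above relies on discrete Hidden Markov Models. A common way to recognize with such models is to train one HMM per gesture and pick the model that gives the observed symbol sequence the highest likelihood, computed with the forward algorithm. The sketch below assumes that setup and assumes the 3D positions have already been quantized into integer symbols; the function names and the toy models are hypothetical, and the paper's own Matlab module may differ.

```python
import math


def forward_log_likelihood(obs, start, trans, emit):
    """Log-likelihood of a non-empty discrete sequence under one HMM.

    start[i]    : probability of starting in state i
    trans[i][j] : probability of moving from state i to state j
    emit[i][k]  : probability of emitting symbol k in state i
    Uses the scaled forward algorithm to avoid numerical underflow.
    """
    n = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_lik = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate one step, then weight by the emission probability.
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        log_lik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return log_lik


def classify(obs, models):
    """Return the name of the model with the highest log-likelihood."""
    return max(models, key=lambda name: forward_log_likelihood(obs, *models[name]))


# Toy one-state models over the symbols {0, 1}; values are illustrative only.
models = {
    "up": ([1.0], [[1.0]], [[0.1, 0.9]]),
    "down": ([1.0], [[1.0]], [[0.9, 0.1]]),
}
print(classify([1, 1, 0, 1], models))
```

Working with log-likelihoods and per-step scaling keeps long gesture sequences from underflowing, which matters once sequences reach a few hundred symbols.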
Multi-resolution patch and window-based priority for digital image inpainting problem
T. T. Dang, M. Larabi, Azeddine Beghdadi
Recently, digital image inpainting has attracted strong research interest because of its extensive real-life applications. The term “inpainting” refers to the automatic restoration of image defects such as scratches or blotches, as well as the removal of unwanted objects such as subtitles or logos, in such a way that the change is undetectable by viewers who have no access to the original image. Many works on this subject have been published in recent years. This paper introduces a novel unsupervised image completion framework using a modified exemplar-based method in conjunction with a pyramidal representation of an image. A top-down iterative completion is performed gradually with multi-resolution patches and a window-based priority. The proposed approach is verified on different natural images. A comparison with some existing methods from the literature is also carried out, and the results show an improvement in favor of our approach.
DOI: https://doi.org/10.1109/IPTA.2012.6469544 (published 2012-10-01)
Citations: 15
Real time watermarking to authenticate the WSQ bitstream
K. A. Saadi, Khalil Zebbiche, M. Laadjel, M. Morsli
Fingerprints are becoming popular in automated systems and for IT system user authentication. They are unique to each person and allow instant establishment of personal identity in real-time applications. Enhancing their security in terms of fidelity and integrity therefore becomes paramount. Since fingerprint images are usually compressed using Wavelet-packet Scalar Quantization (WSQ) before they are transmitted over networks, in this paper we apply a fragile watermarking algorithm operating directly in the compressed domain to protect the evidentiary integrity of the WSQ bitstream. This work is motivated by results obtained with earlier video watermarking methods that work in the variable length codeword (VLC) domain to provide real-time detection. The method maps codewords outside the used codespace, so that the watermark is embedded into the stream as forced bit errors. The developed algorithm achieves high capacity and preserves the file size of the WSQ bitstream while maintaining high perceptual quality.
DOI: https://doi.org/10.1109/IPTA.2012.6469556 (published 2012-10-01)
Citations: 1
An automatic level set based liver segmentation from MRI data sets
E. Goceri, M. Z. Unlu, C. Guzelis, O. Dicle
Fast and accurate liver segmentation is a challenging task in medical image analysis. Liver segmentation is an important step in computer-assisted diagnosis, pre-evaluation of liver transplantation and therapy planning for liver tumors. Magnetic resonance imaging has several advantages, such as freedom from ionizing radiation and good contrast visualization of soft tissue. In addition, recent innovations in technology and image acquisition techniques have made magnetic resonance imaging a major tool in modern medicine. However, the adoption of magnetic resonance images for liver segmentation has been slow compared with applications to the central nervous system and the musculoskeletal system. The reasons are the irregular shape, size and position of the liver, contrast agent effects, and the similarity of the gray values of neighboring organs. Therefore, in this study, we present a fully automatic liver segmentation method that uses an approximation of level set based contour evolution on T2-weighted magnetic resonance data sets. The method avoids solving partial differential equations and applies only integer operations with a two-cycle segmentation algorithm. The efficiency of the proposed approach is achieved by applying the algorithm to all slices with a constant number of iterations and performing the contour evolution without any user-defined initial contour. The obtained results are evaluated with four different similarity measures, and they show that the automatic segmentation approach gives successful results.
DOI: https://doi.org/10.1109/IPTA.2012.6469551 (published 2012-10-01)
Citations: 22
Frequency component extraction from color images for specific sound transformation and analysis
Gizem Akti, Dionysis Goularas
This paper presents a method for converting images into sound. First, frequency components are extracted from the original image. At this stage, the image is divided into windows that represent consecutive time periods, using the STFT. Then, the dominant frequencies of each window are mapped to corresponding sound frequencies through Fourier analysis. This procedure is applied twice and produces two series of sound frequency components: the first originates from the brightness of the image, the second from the dominant RGB layer. The visual impression of the image is connected to the psychoacoustic effect of the sound mapping by using different musical scales according to the dominant color of the image. The results revealed that the melody extracted from this analysis produces a certain psychoacoustic impression, as reported by several volunteers. Although the volunteers could not always make the association between image and sound, they could hardly believe that the music was produced by an algorithmic procedure.
DOI: https://doi.org/10.1109/IPTA.2012.6469573 (published 2012-10-01)
Citations: 7
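The image-to-sound abstract above describes two steps that lend themselves to a small illustration: finding the dominant frequency of a window, and mapping it onto a musical scale. The sketch below is a rough reading of those steps, not the authors' implementation. It assumes a window has been flattened into a 1D list of samples, uses a naive DFT instead of a real STFT library, and maps bins linearly onto scale degrees; `dominant_bin`, `bin_to_note_hz` and the choice of scales are all hypothetical.

```python
import cmath
import math

MAJOR = [0, 2, 4, 5, 7, 9, 11]  # semitone offsets of a major scale
MINOR = [0, 2, 3, 5, 7, 8, 10]  # semitone offsets of a natural minor scale


def dominant_bin(samples):
    """Return the index of the strongest non-DC bin of a naive DFT."""
    n = len(samples)
    best_k, best_mag = 1, -1.0
    for k in range(1, n // 2 + 1):
        coeff = sum(
            s * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, s in enumerate(samples)
        )
        if abs(coeff) > best_mag:
            best_k, best_mag = k, abs(coeff)
    return best_k


def bin_to_note_hz(bin_index, n_bins, scale, base_midi=60, octaves=2):
    """Map a dominant frequency bin onto a note of the given scale (in Hz)."""
    steps = len(scale) * octaves
    # Spread the bins evenly over the available scale degrees.
    degree = min(bin_index * steps // n_bins, steps - 1)
    octave, pos = divmod(degree, len(scale))
    midi = base_midi + 12 * octave + scale[pos]
    # Equal temperament: A4 (MIDI note 69) is 440 Hz.
    return 440.0 * 2.0 ** ((midi - 69) / 12)


# One window of a synthetic signal with 3 cycles over 16 samples.
window = [math.cos(2 * math.pi * 3 * t / 16) for t in range(16)]
k = dominant_bin(window)
print(k, round(bin_to_note_hz(k, 8, MINOR), 2))
```

Switching between `MAJOR` and `MINOR` is where a dominant-color rule, as described in the abstract, could plug in; which color selects which scale is not stated in the source.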
Flickr-based semantic context to refine automatic photo annotation
Amel Ksibi, Mouna Dammak, A. Ammar, M. Mejdoub, C. Amar
The automatic photo annotation task aims to describe semantic content by detecting high-level concepts in order to further facilitate concept-based video retrieval. Most existing approaches are based on independent semantic concept detectors and do not consider the contextual correlation between concepts. This drawback affects the efficiency of such systems. Recently, harnessing contextual information to improve the effectiveness of concept detection has become a promising direction in this field. In this paper, we propose a new context-based annotation refinement process. For this purpose, we define a new semantic measure called “Second Order Co-occurrence Flickr context similarity” (SOCFCS), which aims to extract the semantic context correlation between two concepts by exploring Flickr resources (Flickr related-tags). Our measure extends the FCS measure by taking into consideration the FCS values of the common Flickr related-tags of the two target concepts. The proposed measure is used to build a concept network that models the semantic context inter-relationships among concepts. A Random Walk with Restart process is performed over this network to refine the annotation results by exploring the contextual correlation among concepts. Experimental studies are conducted on the ImageCLEF 2011 collection, which contains 10000 images and 99 concepts. The results demonstrate the effectiveness of our proposed approach.
照片自动标注任务旨在通过检测高级概念来描述语义内容,从而进一步促进基于概念的视频检索。现有的方法大多是基于独立的语义概念检测器,没有考虑概念之间的上下文相关性。这一缺点影响了这类系统的效率。近年来,利用上下文信息来提高概念检测的有效性成为该领域一个很有前途的方向。在本文中,我们提出了一种新的基于上下文的标注改进过程。为此,我们定义了一个新的语义度量,称为“二阶共现Flickr上下文相似度”(SOCFCS),旨在通过探索Flickr资源(Flickr相关标签)来提取两个概念之间的语义上下文相关性。我们的度量是FCS度量的扩展,考虑了两个目标概念的常见Flickr相关标签的FCS值。我们提出的方法被用于建立一个概念网络,该网络对概念之间的语义上下文相互关系进行建模。在该网络上执行随机行走(Random Walk with Restart)过程,通过探索概念之间的上下文相关性来改进注释结果。在包含10000张图片和99个概念的ImageCLEF 2011 Collection上进行实验研究。结果证明了我们所提出的方法的有效性。
DOI: 10.1109/IPTA.2012.6469550 (https://doi.org/10.1109/IPTA.2012.6469550). Published 2012-10-01 in 2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA).
Citations: 9
Journal
2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)