首页 > 最新文献

2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)最新文献

英文 中文
Texture segmentation using globally active contours model and Cauchy-Schwarz distance 基于全局活动轮廓模型和Cauchy-Schwarz距离的纹理分割
F. Derraz, L. Peyrodie, A. Taleb-Ahmed, G. Forzy
We present a new unsupervised segmentation based active contours model and local region texture descriptor. The proposed local region texture descriptor intrinsically describes the geometry of textural regions using the shape operator defined in Beltrami framework. The local texture descriptor is incorporated in the active contours using the Cauchy-Schwarz distance. The texture is discriminated by maximizing distance between the probability density functions which leads to distinguish textural objects of interest and background. We propose a fast Bregman split implementation of our segmentation algorithm based on the dual formulation of the Total Variation norm. Finally, we show results on some challenging images to illustrate segmentations that are possible.
提出了一种新的基于活动轮廓模型和局部区域纹理描述符的无监督分割方法。本文提出的局部区域纹理描述符使用Beltrami框架中定义的形状算子本质上描述了纹理区域的几何形状。利用Cauchy-Schwarz距离将局部纹理描述符合并到活动轮廓中。通过最大化概率密度函数之间的距离来区分纹理,从而区分感兴趣的纹理对象和背景。我们提出了一种基于总变异范数对偶公式的分割算法的快速Bregman分裂实现。最后,我们展示了一些具有挑战性的图像的结果,以说明可能的分割。
{"title":"Texture segmentation using globally active contours model and Cauchy-Schwarz distance","authors":"F. Derraz, L. Peyrodie, A. Taleb-Ahmed, G. Forzy","doi":"10.1109/IPTA.2012.6469562","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469562","url":null,"abstract":"We present a new unsupervised segmentation based active contours model and local region texture descriptor. The proposed local region texture descriptor intrinsically describes the geometry of textural regions using the shape operator defined in Beltrami framework. The local texture descriptor is incorporated in the active contours using the Cauchy-Schwarz distance. The texture is discriminated by maximizing distance between the probability density functions which leads to distinguish textural objects of interest and background. We propose a fast Bregman split implementation of our segmentation algorithm based on the dual formulation of the Total Variation norm. Finally, we show results on some challenging images to illustrate segmentations that are possible.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114542337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A discrete Hidden Markov models recognition module for temporal series: Application to real-time 3D hand gestures 一个离散的隐马尔可夫模型识别模块的时间序列:应用于实时三维手势
Yannick Dennemont, Guillaume Bouyer, S. Otmane, M. Mallem
This work studies, implements and evaluates a gestures recognition module based on discrete Hidden Markov Models. The module is implemented on Matlab and used from Virtools. It can be used with different inputs therefore serves different recognition purposes. We focus on the 3D positions, our devices common information, as inputs for gesture recognition. Experiments are realized with an infra-red tracked flystick. Finally, the recognition rate is more than 90% with a personalized learning base. Otherwise, the results are beyond 70%, for an evaluation of 8 users on a real time mini-game. The rates are basically 80% for simple gestures and 60% for complex ones.
本文研究、实现并评估了一个基于离散隐马尔可夫模型的手势识别模块。该模块是在Matlab上实现的,使用的是Virtools。它可以用于不同的输入,因此服务于不同的识别目的。我们专注于3D位置,我们的设备通用信息,作为手势识别的输入。实验是用红外跟踪飞杆实现的。最后,在个性化学习基础上,识别率达到90%以上。否则,在一款实时迷你游戏中,对8名用户的评估结果将超过70%。简单手势的识别率基本上是80%,复杂手势的识别率是60%。
{"title":"A discrete Hidden Markov models recognition module for temporal series: Application to real-time 3D hand gestures","authors":"Yannick Dennemont, Guillaume Bouyer, S. Otmane, M. Mallem","doi":"10.1109/IPTA.2012.6469509","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469509","url":null,"abstract":"This work studies, implements and evaluates a gestures recognition module based on discrete Hidden Markov Models. The module is implemented on Matlab and used from Virtools. It can be used with different inputs therefore serves different recognition purposes. We focus on the 3D positions, our devices common information, as inputs for gesture recognition. Experiments are realized with an infra-red tracked flystick. Finally, the recognition rate is more than 90% with a personalized learning base. Otherwise, the results are beyond 70%, for an evaluation of 8 users on a real time mini-game. The rates are basically 80% for simple gestures and 60% for complex ones.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129105058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
High performance automatic number plate recognition in video streams 高性能自动车牌识别视频流
Arkadiusz Pawlik
We present a range of image and video analysis techniques that we have developed in connection with license plate recognition. Our methods focus on two areas - efficient image preprocessing to improve low-quality detection rate and combining the detection results from multiple frames to improve the accuracy of the recognized license plates. To evaluate our algorithms, we have implemented a complete ANPR system that detects and reads license plates. The system can process up to 110 frames per second on single CPU core and scales well to at least 4 cores. The recognition rate varies depending on the quality of video streams (amount of motion blur, resolution), but approaches 100% for clear, sharp license plate input data. The software is currently marketed commercially as CarID1. Some of our methods are more general and may have applications outside of the ANPR domain.
我们提出了一系列的图像和视频分析技术,我们已经开发与车牌识别。我们的方法主要集中在两个方面:高效的图像预处理,以提高低质量的检测率;结合多帧的检测结果,以提高识别车牌的准确性。为了评估我们的算法,我们实现了一个完整的ANPR系统来检测和读取车牌。该系统可以在单个CPU核心上每秒处理高达110帧,并且可以很好地扩展到至少4核。识别率取决于视频流的质量(运动模糊量、分辨率),但对于清晰、清晰的车牌输入数据,识别率接近100%。该软件目前在商业上以CarID1的名称销售。我们的一些方法更通用,可能在ANPR领域之外也有应用。
{"title":"High performance automatic number plate recognition in video streams","authors":"Arkadiusz Pawlik","doi":"10.1109/IPTA.2012.6469554","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469554","url":null,"abstract":"We present a range of image and video analysis techniques that we have developed in connection with license plate recognition. Our methods focus on two areas - efficient image preprocessing to improve low-quality detection rate and combining the detection results from multiple frames to improve the accuracy of the recognized license plates. To evaluate our algorithms, we have implemented a complete ANPR system that detects and reads license plates. The system can process up to 110 frames per second on single CPU core and scales well to at least 4 cores. The recognition rate varies depending on the quality of video streams (amount of motion blur, resolution), but approaches 100% for clear, sharp license plate input data. The software is currently marketed commercially as CarID1. Some of our methods are more general and may have applications outside of the ANPR domain.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"200 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125559339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fuzzy Rule-Based Image Segmentation technique for rock thin section images 基于模糊规则的岩石薄片图像分割技术
R. Samet, S. E. Amrahov, Ali Hikmet Ziroglu
Image segmentation is a process of partitioning the images into meaningful regions that are ready to analyze. Segmentation of rock thin section images is not trivial task due to the unpredictable structures and features of minerals. In this paper, we propose Fuzzy Rule-Based Image Segmentation technique to segment rock thin section images. Proposed technique uses RGB images of rock thin sections as input and gives segmented into minerals images as output. In order to show an advantage of proposed technique the rock thin section images were also segmented by known Fuzzy C-Means technique. Both techniques were applied to many different rock thin section images. The obtained results of proposed Fuzzy Rule-Based Image Segmentation and Fuzzy C-Means techniques were compared. Implementation results showed that proposed image segmentation technique has better accuracy than known ones.
图像分割是将图像划分为有意义的区域以供分析的过程。由于矿物的结构和特征难以预测,岩石薄片图像的分割是一项艰巨的任务。本文提出了一种基于模糊规则的岩石薄片图像分割技术。该技术使用岩石薄片的RGB图像作为输入,并给出分割成矿物的图像作为输出。为了显示该方法的优势,还对岩石薄片图像进行了模糊c均值分割。这两种技术都应用于许多不同的岩石薄片图像。比较了基于模糊规则和模糊c均值的图像分割方法的分割结果。实现结果表明,本文提出的图像分割方法具有较好的分割精度。
{"title":"Fuzzy Rule-Based Image Segmentation technique for rock thin section images","authors":"R. Samet, S. E. Amrahov, Ali Hikmet Ziroglu","doi":"10.1109/IPTA.2012.6469555","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469555","url":null,"abstract":"Image segmentation is a process of partitioning the images into meaningful regions that are ready to analyze. Segmentation of rock thin section images is not trivial task due to the unpredictable structures and features of minerals. In this paper, we propose Fuzzy Rule-Based Image Segmentation technique to segment rock thin section images. Proposed technique uses RGB images of rock thin sections as input and gives segmented into minerals images as output. In order to show an advantage of proposed technique the rock thin section images were also segmented by known Fuzzy C-Means technique. Both techniques were applied to many different rock thin section images. The obtained results of proposed Fuzzy Rule-Based Image Segmentation and Fuzzy C-Means techniques were compared. Implementation results showed that proposed image segmentation technique has better accuracy than known ones.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"292 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114720497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Automatic oil spill detection in TerraSAR-X data using multi-contextual Markov modeling on irregular graphs 在不规则图形上使用多上下文马尔可夫建模的TerraSAR-X数据中的自动溢油检测
S. Martinis
This paper describes the workflow of an automatic near-real time oil spill detection approach using single-polarized high resolution X-Band Synthetic Aperture Radar satellite data. Dark formations on the water surface are classified in a completely unsupervised way using an automatic tile-based thresholding procedure. The derived global threshold value is used for the initialization of a hybrid multi-contextual Markov image model which integrates scale-dependent and spatial contextual information on irregular hierarchical graph structures into the segment-based labeling process of slick-covered and slick-free water surfaces. Experimental investigations performed on TerraSAR-X ScanSAR data acquired during large-scale oil pollutions in the Gulf of Mexico in May 2010 confirm the effectiveness of the proposed method with respect to accuracy and computational effort.
本文介绍了利用单偏振高分辨率x波段合成孔径雷达卫星数据实现近实时溢油自动检测的工作流程。水面上的暗地层使用基于瓷砖的自动阈值程序以完全无监督的方式进行分类。将得到的全局阈值用于初始化混合多上下文马尔可夫图像模型,该模型将不规则分层图结构的尺度相关信息和空间上下文信息集成到基于分段的光滑水面和无光滑水面标记过程中。对2010年5月墨西哥湾大规模石油污染期间获得的TerraSAR-X ScanSAR数据进行的实验研究证实了所提出方法在准确性和计算量方面的有效性。
{"title":"Automatic oil spill detection in TerraSAR-X data using multi-contextual Markov modeling on irregular graphs","authors":"S. Martinis","doi":"10.1109/IPTA.2012.6469508","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469508","url":null,"abstract":"This paper describes the workflow of an automatic near-real time oil spill detection approach using single-polarized high resolution X-Band Synthetic Aperture Radar satellite data. Dark formations on the water surface are classified in a completely unsupervised way using an automatic tile-based thresholding procedure. The derived global threshold value is used for the initialization of a hybrid multi-contextual Markov image model which integrates scale-dependent and spatial contextual information on irregular hierarchical graph structures into the segment-based labeling process of slick-covered and slick-free water surfaces. Experimental investigations performed on TerraSAR-X ScanSAR data acquired during large-scale oil pollutions in the Gulf of Mexico in May 2010 confirm the effectiveness of the proposed method with respect to accuracy and computational effort.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126962804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Glacier flow monitoring by digital camera and space-borne SAR images 利用数码相机和星载SAR图像监测冰川流量
F. Vernier, Renaud Fallourd, J. Friedt, Yajing Yan, E. Trouvé, J. Nicolas, L. Moreau
Most of the image processing techniques have been first proposed and developed on small size images and progressively applied to larger and larger data sets resulting from new sensors and application requirements. In geosciences, digital cameras and remote sensing images can be used to monitor glaciers and to measure their surface velocity by different techniques. However, the image size and the number of acquisitions to be processed to analyze time series become a critical issue to derive displacement fields by the conventional correlation technique. In this paper, an efficient correlation software is used to compute from optical images the motion of a serac fall and from Synthetic Aperture Radar (SAR) images the motion of Alpine glaciers. The optical images are acquired by a digital camera installed near the Argentière glacier (Chamonix, France) and the SAR images are acquired by the high resolution TerraSAR-X satellite over the Mont-Blanc area. The results illustrate the potential of this software to monitor the glacier flow with camera images acquired every 2 h and with the size of the TerraSAR-X scenes covering 30 × 50 km2.
大多数图像处理技术最初是在小尺寸图像上提出和发展的,并逐渐应用于由于新的传感器和应用需求而产生的越来越大的数据集。在地球科学领域,数码相机和遥感图像可用于监测冰川,并通过不同的技术测量冰川的表面速度。然而,传统的相关技术想得到位移场时,图像的大小和需要处理的采集数据的数量是一个关键问题。本文利用一种高效的相关软件,分别从光学图像和合成孔径雷达(SAR)图像计算高山冰川的运动。光学图像由安装在argenti冰川(法国夏蒙尼)附近的数码相机获取,SAR图像由高分辨率TerraSAR-X卫星在勃朗峰地区获取。结果表明,该软件具有监测冰川流动的潜力,每2小时采集一次相机图像,TerraSAR-X场景的大小为30 × 50 km2。
{"title":"Glacier flow monitoring by digital camera and space-borne SAR images","authors":"F. Vernier, Renaud Fallourd, J. Friedt, Yajing Yan, E. Trouvé, J. Nicolas, L. Moreau","doi":"10.1109/IPTA.2012.6469541","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469541","url":null,"abstract":"Most of the image processing techniques have been first proposed and developed on small size images and progressively applied to larger and larger data sets resulting from new sensors and application requirements. In geosciences, digital cameras and remote sensing images can be used to monitor glaciers and to measure their surface velocity by different techniques. However, the image size and the number of acquisitions to be processed to analyze time series become a critical issue to derive displacement fields by the conventional correlation technique. In this paper, an efficient correlation software is used to compute from optical images the motion of a serac fall and from Synthetic Aperture Radar (SAR) images the motion of Alpine glaciers. The optical images are acquired by a digital camera installed near the Argentière glacier (Chamonix, France) and the SAR images are acquired by the high resolution TerraSAR-X satellite over the Mont-Blanc area. The results illustrate the potential of this software to monitor the glacier flow with camera images acquired every 2 h and with the size of the TerraSAR-X scenes covering 30 × 50 km2.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"181 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124525511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Real time watermarking to authenticate the WSQ bitstream 实时水印验证WSQ位流
K. A. Saadi, Khalil Zebbiche, M. Laadjel, M. Morsli
Fingerprints are becoming popular in automated systems and for IT system user authentication. They are unique to each person and are designed to allow instant establishment personal identity in real time application. Enhancing their security in terms of fidelity and integrity becomes paramount. Since fingerprint images are usually compressed using Wavelet-packet Scalar Quantization (WSQ) before they are transmitted over networks, in this paper, we apply a fragile watermarking algorithm operating directly in compressed domain for protecting the evidentiary integrity of the WSQ bitstream. This work is motivated by the results obtained in previous video watermarking methods working in variable length codeword (VLC) domain to provide real time detection. The principle of the method is based on mapping the codewords to the outside of the used codespace, the watermark is embedded into stream as forced bit errors. The developed algorithm achieves high capacity and preserves the file size of WSQ bitstream while maintaining high perceptible quality.
指纹在自动化系统和IT系统用户认证中越来越受欢迎。它们对每个人都是独一无二的,旨在允许在实时应用中即时建立个人身份。加强他们在忠诚和诚信方面的安全变得至关重要。由于指纹图像在网络上传输之前通常使用小波包标量量化(WSQ)进行压缩,因此本文采用直接在压缩域中操作的脆弱水印算法来保护WSQ比特流的证据完整性。本研究的灵感来自于以往视频水印方法在变长码字(VLC)域中工作的结果,以提供实时检测。该方法的原理是将码字映射到所使用的码空间的外部,将水印作为强制误码嵌入到流中。该算法在保持高感知质量的同时,实现了WSQ码流的大容量和文件大小。
{"title":"Real time watermarking to authenticate the WSQ bitstream","authors":"K. A. Saadi, Khalil Zebbiche, M. Laadjel, M. Morsli","doi":"10.1109/IPTA.2012.6469556","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469556","url":null,"abstract":"Fingerprints are becoming popular in automated systems and for IT system user authentication. They are unique to each person and are designed to allow instant establishment personal identity in real time application. Enhancing their security in terms of fidelity and integrity becomes paramount. Since fingerprint images are usually compressed using Wavelet-packet Scalar Quantization (WSQ) before they are transmitted over networks, in this paper, we apply a fragile watermarking algorithm operating directly in compressed domain for protecting the evidentiary integrity of the WSQ bitstream. This work is motivated by the results obtained in previous video watermarking methods working in variable length codeword (VLC) domain to provide real time detection. The principle of the method is based on mapping the codewords to the outside of the used codespace, the watermark is embedded into stream as forced bit errors. The developed algorithm achieves high capacity and preserves the file size of WSQ bitstream while maintaining high perceptible quality.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"75 15","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114005467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An automatic level set based liver segmentation from MRI data sets 基于自动水平集的MRI数据集肝脏分割
E. Goceri, M. Z. Unlu, C. Guzelis, O. Dicle
A fast and accurate liver segmentation method is a challenging work in medical image analysis area. Liver segmentation is an important process for computer-assisted diagnosis, pre-evaluation of liver transplantation and therapy planning of liver tumors. There are several advantages of magnetic resonance imaging such as free form ionizing radiation and good contrast visualization of soft tissue. Also, innovations in recent technology and image acquisition techniques have made magnetic resonance imaging a major tool in modern medicine. However, the use of magnetic resonance images for liver segmentation has been slow when we compare applications with the central nervous systems and musculoskeletal. The reasons are irregular shape, size and position of the liver, contrast agent effects and similarities of the gray values of neighbor organs. Therefore, in this study, we present a fully automatic liver segmentation method by using an approximation of the level set based contour evolution from T2 weighted magnetic resonance data sets. The method avoids solving partial differential equations and applies only integer operations with a two-cycle segmentation algorithm. The efficiency of the proposed approach is achieved by applying the algorithm to all slices with a constant number of iteration and performing the contour evolution without any user defined initial contour. The obtained results are evaluated with four different similarity measures and they show that the automatic segmentation approach gives successful results.
快速准确的肝脏分割方法是医学图像分析领域的一项具有挑战性的工作。肝脏分割是肝脏肿瘤计算机辅助诊断、肝移植预评估和治疗规划的重要环节。磁共振成像有几个优点,如游离电离辐射和良好的软组织对比可视化。此外,最近技术和图像采集技术的创新使磁共振成像成为现代医学的主要工具。然而,当我们比较中枢神经系统和肌肉骨骼的应用时,使用磁共振图像进行肝脏分割的速度很慢。其原因与肝脏形状、大小、位置不规则、造影剂作用及邻近脏器灰度值相似有关。因此,在本研究中,我们提出了一种全自动肝脏分割方法,该方法使用基于水平集的轮廓进化近似,来自T2加权磁共振数据集。该方法避免了求解偏微分方程,只采用整数运算,采用两周期分割算法。该方法通过对所有切片进行等次迭代,并在不使用用户自定义初始轮廓的情况下进行轮廓演化,从而提高了算法的效率。用四种不同的相似度度量对得到的结果进行了评价,结果表明自动分割方法是成功的。
{"title":"An automatic level set based liver segmentation from MRI data sets","authors":"E. Goceri, M. Z. Unlu, C. Guzelis, O. Dicle","doi":"10.1109/IPTA.2012.6469551","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469551","url":null,"abstract":"A fast and accurate liver segmentation method is a challenging work in medical image analysis area. Liver segmentation is an important process for computer-assisted diagnosis, pre-evaluation of liver transplantation and therapy planning of liver tumors. There are several advantages of magnetic resonance imaging such as free form ionizing radiation and good contrast visualization of soft tissue. Also, innovations in recent technology and image acquisition techniques have made magnetic resonance imaging a major tool in modern medicine. However, the use of magnetic resonance images for liver segmentation has been slow when we compare applications with the central nervous systems and musculoskeletal. The reasons are irregular shape, size and position of the liver, contrast agent effects and similarities of the gray values of neighbor organs. Therefore, in this study, we present a fully automatic liver segmentation method by using an approximation of the level set based contour evolution from T2 weighted magnetic resonance data sets. The method avoids solving partial differential equations and applies only integer operations with a two-cycle segmentation algorithm. The efficiency of the proposed approach is achieved by applying the algorithm to all slices with a constant number of iteration and performing the contour evolution without any user defined initial contour. The obtained results are evaluated with four different similarity measures and they show that the automatic segmentation approach gives successful results.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121295351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Frequency component extraction from color images for specific sound transformation and analysis 从彩色图像中提取频率分量,用于特定声音的变换和分析
Gizem Akti, Dionysis Goularas
This paper presents a method allowing the conversion of images into sound. Initially, a frequency component extraction is realized from the original image. At this stage, the image is divided into windows in order to represent consecutive different time periods using STFT. Then, the dominant frequencies of each window are mapped into corresponding sound frequencies through Fourier analysis. This procedure is applied twice and two series of sound frequency components are produced: The first is originated from the brightness of the image, the second from the dominant RGB layer. The connection between the visual impression of the image and the psychoacoustic effect of the sound mapping is done by using different musical scales according to the dominant color of the image. The results revealed that the melody extracted from this analysis produces a certain psychoacoustic impression, as it has reported by several volunteers. Despite the fact that volunteers could not always do the association between image and sound, they could hardly believe that the music was produced by an algorithmic procedure.
本文提出了一种将图像转换为声音的方法。首先,从原始图像中提取频率成分。在这个阶段,图像被分割成窗口,以便使用STFT表示连续的不同时间段。然后,通过傅里叶分析将每个窗口的主导频率映射为相应的声音频率。这个过程被应用两次,产生了两个系列的声音频率分量:第一个来自图像的亮度,第二个来自主导RGB层。图像的视觉印象与声音映射的心理声学效果之间的联系是根据图像的主色使用不同的音阶来完成的。结果显示,从这种分析中提取的旋律产生了某种心理声学印象,正如几名志愿者所报告的那样。尽管志愿者并不总是能将图像和声音联系起来,但他们很难相信音乐是由算法程序产生的。
{"title":"Frequency component extraction from color images for specific sound transformation and analysis","authors":"Gizem Akti, Dionysis Goularas","doi":"10.1109/IPTA.2012.6469573","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469573","url":null,"abstract":"This paper presents a method allowing the conversion of images into sound. Initially, a frequency component extraction is realized from the original image. At this stage, the image is divided into windows in order to represent consecutive different time periods using STFT. Then, the dominant frequencies of each window are mapped into corresponding sound frequencies through Fourier analysis. This procedure is applied twice and two series of sound frequency components are produced: The first is originated from the brightness of the image, the second from the dominant RGB layer. The connection between the visual impression of the image and the psychoacoustic effect of the sound mapping is done by using different musical scales according to the dominant color of the image. The results revealed that the melody extracted from this analysis produces a certain psychoacoustic impression, as it has reported by several volunteers. Despite the fact that volunteers could not always do the association between image and sound, they could hardly believe that the music was produced by an algorithmic procedure.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125510566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Iris recognition using 2-D elliptical-support wavelet filter bank 基于二维椭圆支持小波滤波器组的虹膜识别
J. Abdul-Jabbar, Zena N. Abdulkader
In this paper, a new identification method for iris recognition is presented. Among the four main steps of iris recognition, traditional segmentation and normalization steps are utilized in the proposed method. A non-traditional step for feature extraction is applied where a new bank of two-dimensional (2-D) elliptical-support wavelet Haar filter bank is used to capture the iris characteristics. The idea is based on a new geometrical image transform called 2-D elliptical-support wavelet transform (2-D ESWT). A five-level 2-D elliptical-support wavelet decomposition is needed to form a reduced fixed length quantized feature vector with improved performance. The efficient approach of Hamming distance is then applied as a final step for iris matching. Experimental results show that the proposed method is reliable with rapid recognition, since it achieves good recognition rate with reduced feature vector length. Thus, a less complex-implementation can be obtained for this identification method.
本文提出了一种新的虹膜识别方法。在虹膜识别的四个主要步骤中,该方法利用了传统的分割和归一化步骤。采用了一种非传统的特征提取步骤,其中使用了一组新的二维(2-D)椭圆支持小波Haar滤波器组来捕获虹膜特征。该思想是基于一种新的几何图像变换,称为二维椭圆支持小波变换(2d ESWT)。采用五阶二维椭圆支持小波分解,得到了性能较好的约简定长量化特征向量。然后将有效的汉明距离方法作为虹膜匹配的最后一步。实验结果表明,该方法在减少特征向量长度的情况下取得了较好的识别率,具有快速识别的特点。因此,该识别方法可以获得较简单的实现。
{"title":"Iris recognition using 2-D elliptical-support wavelet filter bank","authors":"J. Abdul-Jabbar, Zena N. Abdulkader","doi":"10.1109/IPTA.2012.6469520","DOIUrl":"https://doi.org/10.1109/IPTA.2012.6469520","url":null,"abstract":"In this paper, a new identification method for iris recognition is presented. Among the four main steps of iris recognition, traditional segmentation and normalization steps are utilized in the proposed method. A non-traditional step for feature extraction is applied where a new bank of two-dimensional (2-D) elliptical-support wavelet Haar filter bank is used to capture the iris characteristics. The idea is based on a new geometrical image transform called 2-D elliptical-support wavelet transform (2-D ESWT). A five-level 2-D elliptical-support wavelet decomposition is needed to form a reduced fixed length quantized feature vector with improved performance. The efficient approach of Hamming distance is then applied as a final step for iris matching. Experimental results show that the proposed method is reliable with rapid recognition, since it achieves good recognition rate with reduced feature vector length. Thus, a less complex-implementation can be obtained for this identification method.","PeriodicalId":267290,"journal":{"name":"2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122489557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1