首页 > 最新文献

2010 International Conference on Digital Image Computing: Techniques and Applications最新文献

英文 中文
Pulse Repetition Interval Modulation Recognition Using Symbolization 使用符号化的脉冲重复间隔调制识别
Kyu-Ha Song, Dong-Weon Lee, Jin-Woo Han, Byung-Koo Park
Information on the pulse repetition interval (PRI) modulation of a radar signal plays an important role in detecting and identifying each radar signal in an electronic warfare support (ES) system. In this paper, we present a new method for recognizing the PRI modulation type of a radar signal using symbolization. The proposed method uses three key feature parameters extracted from symbol sequences in order to discriminate each PRI modulation type. The recognition capability of the method presented is verified through extensive simulations.
在电子战保障(ES)系统中,雷达信号的脉冲重复间隔(PRI)调制信息在探测和识别每个雷达信号中起着重要作用。本文提出了一种利用符号化识别雷达信号PRI调制类型的新方法。该方法使用从符号序列中提取的三个关键特征参数来区分每种PRI调制类型。通过大量的仿真验证了该方法的识别能力。
{"title":"Pulse Repetition Interval Modulation Recognition Using Symbolization","authors":"Kyu-Ha Song, Dong-Weon Lee, Jin-Woo Han, Byung-Koo Park","doi":"10.1109/DICTA.2010.96","DOIUrl":"https://doi.org/10.1109/DICTA.2010.96","url":null,"abstract":"Information on the pulse repetition interval (PRI) modulation of a radar signal plays an important role in detecting and identifying each radar signal in an electronic warfare support (ES) system. In this paper, we present a new method for recognizing the PRI modulation type of a radar signal using symbolization. The proposed method uses three key feature parameters extracted from symbol sequences in order to discriminate each PRI modulation type. The recognition capability of the method presented is verified through extensive simulations.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"276 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113998626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Bone Segmentation of Magnetic Resonance Images by Gradient Vector Flow Active Contour with Atlas Based Centroid Forces 基于图集质心力梯度矢量流活动轮廓的磁共振图像骨分割
T. K. Chuah, C. W. Lim, C. Poh, K. Sheah
This paper presents a segmentation technique which utilizes atlas based centroid forces coupled with Gradient Vector Flow (GVF) parametric active contour for the segmentation of femoral cancellous bone. The atlas used in our study provides prior information to constraint contours at regions where edge based forces are missing and to initialize the active contours. GVF external force field is padded with the centroid force derived from the atlas. In our implementation, once the atlas is registered with the target image to be segmented, the segmentation process is fully automatic. Analysis of segmentation accuracy of twenty one slices at the intercondylar location of sagittal slices provides sensitivity of 97.4±1.9%; specificity of 99.6±0.1%, Dice similarity coefficient of 96.7±1.1%. From the inspection of external force fields and the accuracy results, the study suggests that the centroid force formulation is effective in approximating missing boundaries in GVF and in facilitating automatic initialization.
本文提出了一种利用基于图谱的质心力与梯度矢量流(GVF)参数化活动轮廓相结合的股骨松质骨分割技术。在我们的研究中使用的地图集提供了先验信息,以约束轮廓的区域,其中基于边缘的力缺失和初始化活动轮廓。由地图集导出的质心力填充GVF外力场。在我们的实现中,一旦地图集与待分割的目标图像注册,分割过程是全自动的。矢状面切片在髁间位置的21片分割精度分析,灵敏度为97.4±1.9%;特异性为99.6±0.1%,Dice相似系数为96.7±1.1%。从外力场检测和精度结果来看,质心力公式可以有效地逼近梯度矢量流场中缺失的边界,便于自动初始化。
{"title":"Bone Segmentation of Magnetic Resonance Images by Gradient Vector Flow Active Contour with Atlas Based Centroid Forces","authors":"T. K. Chuah, C. W. Lim, C. Poh, K. Sheah","doi":"10.1109/DICTA.2010.31","DOIUrl":"https://doi.org/10.1109/DICTA.2010.31","url":null,"abstract":"This paper presents a segmentation technique which utilizes atlas based centroid forces coupled with Gradient Vector Flow (GVF) parametric active contour for the segmentation of femoral cancellous bone. The atlas used in our study provides prior information to constraint contours at regions where edge based forces are missing and to initialize the active contours. GVF external force field is padded with the centroid force derived from the atlas. In our implementation, once the atlas is registered with the target image to be segmented, the segmentation process is fully automatic. Analysis of segmentation accuracy of twenty one slices at the intercondylar location of sagittal slices provides sensitivity of 97.4±1.9%; specificity of 99.6±0.1%, Dice similarity coefficient of 96.7±1.1%. From the inspection of external force fields and the accuracy results, the study suggests that the centroid force formulation is effective in approximating missing boundaries in GVF and in facilitating automatic initialization.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124665067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Web-Based Learning of Naturalized Color Models for Human-Machine Interaction 基于web的人机交互自然色彩模型学习
Boris Schauerte, G. Fink
In recent years, natural verbal and non-verbal human-robot interaction has attracted an increasing interest. Therefore, models for robustly detecting and describing visual attributes of objects such as, e.g., colors are of great importance. However, in order to learn robust models of visual attributes, large data sets are required. Based on the idea to overcome the shortage of annotated training data by acquiring images from the Internet, we propose a method for robustly learning natural color models. Its novel aspects with respect to prior art are: firstly, a randomized HSL transformation that reflects the slight variations and noise of colors observed in real-world imaging sensors, secondly, a probabilistic ranking and selection of the training samples, which removes a considerable amount of outliers from the training data. These two techniques allow us to estimate robust color models that better resemble the variances seen in real world images. The advantages of the proposed method over the current state-of-the-art technique using the training data without proper transformation and selection are confirmed in experimental evaluations. In combination, for models learned with pLSA-bg and HSL, the proposed techniques reduce the amount of mislabeled objects by 19.87% on the well-known E-Bay data set.
近年来,自然语言和非语言人机交互引起了人们越来越多的兴趣。因此,鲁棒检测和描述物体视觉属性(如颜色)的模型是非常重要的。然而,为了学习视觉属性的鲁棒模型,需要大量的数据集。基于从互联网获取图像来克服带注释的训练数据不足的思想,提出了一种鲁棒学习自然颜色模型的方法。相对于现有技术,它的新颖之处在于:首先,一个随机的HSL变换,反映了在现实世界的成像传感器中观察到的颜色的细微变化和噪声;其次,一个训练样本的概率排序和选择,从训练数据中去除了相当数量的异常值。这两种技术使我们能够估计健壮的颜色模型,使其更像真实世界图像中看到的差异。在实验评估中证实了该方法相对于目前最先进的使用未经适当转换和选择的训练数据的技术的优势。结合使用pls -bg和HSL学习的模型,所提出的技术在众所周知的E-Bay数据集上减少了19.87%的错误标记对象数量。
{"title":"Web-Based Learning of Naturalized Color Models for Human-Machine Interaction","authors":"Boris Schauerte, G. Fink","doi":"10.1109/DICTA.2010.90","DOIUrl":"https://doi.org/10.1109/DICTA.2010.90","url":null,"abstract":"In recent years, natural verbal and non-verbal human-robot interaction has attracted an increasing interest. Therefore, models for robustly detecting and describing visual attributes of objects such as, e.g., colors are of great importance. However, in order to learn robust models of visual attributes, large data sets are required. Based on the idea to overcome the shortage of annotated training data by acquiring images from the Internet, we propose a method for robustly learning natural color models. Its novel aspects with respect to prior art are: firstly, a randomized HSL transformation that reflects the slight variations and noise of colors observed in real-world imaging sensors, secondly, a probabilistic ranking and selection of the training samples, which removes a considerable amount of outliers from the training data. These two techniques allow us to estimate robust color models that better resemble the variances seen in real world images. The advantages of the proposed method over the current state-of-the-art technique using the training data without proper transformation and selection are confirmed in experimental evaluations. In combination, for models learned with pLSA-bg and HSL, the proposed techniques reduce the amount of mislabeled objects by 19.87% on the well-known E-Bay data set.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121246832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Chromosome Classification Based on Wavelet Neural Network 基于小波神经网络的染色体分类
Baharak Choudari Oskouei, J. Shanbehzadeh
Karyotyping, manual chromosome classification is a difficult and time consuming process. Many automated classifiers have been developed to overcome this problem. These classifiers either have high classification accuracy or high training speed. This paper proposes a classifier that performs well in both areas based on wavelet neural network (WNN), combining the wavelet into neural network for classification of chromosomes in group E (chromosomes 16, 17 and 18). The nonlinear characteristic of the network which is derived from wavelet specification improves the training speed and accuracy of the nonlinear chromosome classification. The network inputs are nine dimensional feature space extracted from the chromosome images and the outputs are three classes. The simulation result on the chromosomes in the Laboratory of Biomedical Imaging shows that the success rate of WNN was 0.93%, that is comparable to the traditional neural network (ANN) with 0.85% success rate. The number of iterations for training to reach 0.04% error rate is only 200 where it is 3500 iterations for ANN. According to the experimental results WNN achieves high accuracy with minimum training time, which makes it suitable for real-time chromosome classification in the laboratory.
人工染色体核型分类是一个困难而耗时的过程。为了克服这个问题,已经开发了许多自动分类器。这些分类器要么分类精度高,要么训练速度快。本文提出了一种基于小波神经网络(WNN)的分类器,将小波与神经网络相结合,对E组(16、17、18号染色体)的染色体进行分类。基于小波规范的神经网络的非线性特性提高了非线性染色体分类的训练速度和准确率。网络输入是从染色体图像中提取的九维特征空间,输出是三类特征空间。生物医学成像实验室对染色体的模拟结果表明,小波神经网络的成功率为0.93%,与传统神经网络(ANN)的0.85%成功率相当。训练达到0.04%错误率的迭代次数只有200次,而人工神经网络需要3500次迭代。实验结果表明,小波神经网络以最小的训练时间获得了较高的准确率,适用于实验室的实时染色体分类。
{"title":"Chromosome Classification Based on Wavelet Neural Network","authors":"Baharak Choudari Oskouei, J. Shanbehzadeh","doi":"10.1109/DICTA.2010.107","DOIUrl":"https://doi.org/10.1109/DICTA.2010.107","url":null,"abstract":"Karyotyping, manual chromosome classification is a difficult and time consuming process. Many automated classifiers have been developed to overcome this problem. These classifiers either have high classification accuracy or high training speed. This paper proposes a classifier that performs well in both areas based on wavelet neural network (WNN), combining the wavelet into neural network for classification of chromosomes in group E (chromosomes 16, 17 and 18). The nonlinear characteristic of the network which is derived from wavelet specification improves the training speed and accuracy of the nonlinear chromosome classification. The network inputs are nine dimensional feature space extracted from the chromosome images and the outputs are three classes. The simulation result on the chromosomes in the Laboratory of Biomedical Imaging shows that the success rate of WNN was 0.93%, that is comparable to the traditional neural network (ANN) with 0.85% success rate. The number of iterations for training to reach 0.04% error rate is only 200 where it is 3500 iterations for ANN. According to the experimental results WNN achieves high accuracy with minimum training time, which makes it suitable for real-time chromosome classification in the laboratory.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"258 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115010273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Non-cooperative Object Detection in Sea Using Acoustic Sensors 基于声传感器的海洋非合作目标检测
E. Cheng, S. Challa, X. Tang, Xiaohu Liu
With the increase of illegal boat arrivals, the border security is becoming more and more important for Australian government. This paper is exploring the early warnings by detecting of boat generated signals received by a hydrophone. We focus on algorithm development and real boat generated signal tests. Our experiments have proved that the developed median CFAR and post integration algorithms are very robust for various different acoustic signals, which have a high detection rate while keeping a low false alarm rate.
随着非法船只入境人数的增加,边境安全对澳大利亚政府来说变得越来越重要。本文对水听器接收到的船舶产生的信号进行探测预警进行了探讨。我们专注于算法开发和实船生成信号测试。实验证明,所开发的中值CFAR和后积分算法对各种不同的声信号具有很强的鲁棒性,在保持低虚警率的同时具有较高的检测率。
{"title":"Non-cooperative Object Detection in Sea Using Acoustic Sensors","authors":"E. Cheng, S. Challa, X. Tang, Xiaohu Liu","doi":"10.1109/DICTA.2010.58","DOIUrl":"https://doi.org/10.1109/DICTA.2010.58","url":null,"abstract":"With the increase of illegal boat arrivals, the border security is becoming more and more important for Australian government. This paper is exploring the early warnings by detecting of boat generated signals received by a hydrophone. We focus on algorithm development and real boat generated signal tests. Our experiments have proved that the developed median CFAR and post integration algorithms are very robust for various different acoustic signals, which have a high detection rate while keeping a low false alarm rate.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130946186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
High Throughput Variable Size Non-square Gabor Engine with Feature Pooling Based on GPU 基于GPU特征池的高吞吐量变大小非平方Gabor引擎
Ali Emami, A. Bigdeli, A. Postula
Increasing application of Gabor feature space in various computer vision tasks and its high computational demand, encourages using parallel computing technologies. In this work we have designed a high throughput GPU based Gabor kernel that mimics the function of initial biological visual cortex layers namely ‘Simple’ and ‘Complex’ cells. The kernel is basically a Gabor filter bank with adjustable number of orientations and scales, supporting ‘Non-Square’ and ‘Variable Size’ filter masks on different channels. Consequently our GPU based Gabor kernel tends to be adjustably more accurate, more flexible for different applications, with optimum computational cost at lower resources. The second important task of our high throughput engine is ‘Gabor Feature Pooling’ with Max and Histogram methods, similar to biological visual ‘Complex cells’. This part of our ‘Gabor Engine’ makes it very practical for computer vision applications, since in addition to massive Gabor features, it also provides more abstract spatial invariant orientational information based on image Gabor features. We have optimised the Engine design to take maximum advantage of all GPU parallel resources and maximum bandwidths of all memories.
Gabor特征空间在各种计算机视觉任务中的应用越来越多,其高计算需求鼓励使用并行计算技术。在这项工作中,我们设计了一个基于GPU的高通量Gabor内核,它模仿了初始生物视觉皮层层的功能,即“简单”和“复杂”细胞。内核基本上是一个Gabor滤波器组,具有可调数量的方向和尺度,支持不同通道上的“非正方形”和“可变大小”滤波器蒙版。因此,我们基于GPU的Gabor内核往往更准确,更灵活地适应不同的应用程序,以更低的资源获得最佳的计算成本。我们高通量引擎的第二个重要任务是使用Max和直方图方法的“Gabor特征池”,类似于生物视觉的“复杂细胞”。我们的“Gabor引擎”的这一部分使得它在计算机视觉应用中非常实用,因为除了大量的Gabor特征之外,它还提供了基于图像Gabor特征的更抽象的空间不变方向信息。我们优化了引擎设计,以最大限度地利用所有GPU并行资源和所有内存的最大带宽。
{"title":"High Throughput Variable Size Non-square Gabor Engine with Feature Pooling Based on GPU","authors":"Ali Emami, A. Bigdeli, A. Postula","doi":"10.1109/DICTA.2010.73","DOIUrl":"https://doi.org/10.1109/DICTA.2010.73","url":null,"abstract":"Increasing application of Gabor feature space in various computer vision tasks and its high computational demand, encourages using parallel computing technologies. In this work we have designed a high throughput GPU based Gabor kernel that mimics the function of initial biological visual cortex layers namely ‘Simple’ and ‘Complex’ cells. The kernel is basically a Gabor filter bank with adjustable number of orientations and scales, supporting ‘Non-Square’ and ‘Variable Size’ filter masks on different channels. Consequently our GPU based Gabor kernel tends to be adjustably more accurate, more flexible for different applications, with optimum computational cost at lower resources. The second important task of our high throughput engine is ‘Gabor Feature Pooling’ with Max and Histogram methods, similar to biological visual ‘Complex cells’. This part of our ‘Gabor Engine’ makes it very practical for computer vision applications, since in addition to massive Gabor features, it also provides more abstract spatial invariant orientational information based on image Gabor features. We have optimised the Engine design to take maximum advantage of all GPU parallel resources and maximum bandwidths of all memories.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128866708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Video Metrology without the Image-to-Ground Homography 没有像地同形的视频计量
T. Scoleri
The ability to measure metric information from videos is an essential functionality in security software packages that populate databases with human morphometric and gait descriptors. The configuration of such security systems often involves calibrating the cameras in the surveillance network. For this purpose, markers are placed on the ground in the vicinity of each camera location in order to compute a critical image-to-ground homography. This paper shows how the homography computation can be avoided by describing the key components of a system designed for view-independent video metrology. Experiments on measuring the height of subjects and ground distances from different camera views demonstrate the viability of the approach. In addition, it is shown that the calibration model of the proposed method yields more accurate measurements than the frequently used square pixel camera model.
从视频中测量度量信息的能力是安全软件包的基本功能,这些软件包填充了包含人体形态测量和步态描述符的数据库。这种安全系统的配置通常涉及对监控网络中的摄像机进行校准。为此,在每个摄像机位置附近的地面上放置标记,以便计算关键的图像到地面的同形性。本文通过描述视点无关视频计量系统的关键组件,说明了如何避免单应性计算。在不同视角下测量目标高度和地面距离的实验证明了该方法的可行性。此外,该方法的标定模型比常用的方形像素相机模型产生更精确的测量结果。
{"title":"Video Metrology without the Image-to-Ground Homography","authors":"T. Scoleri","doi":"10.1109/DICTA.2010.64","DOIUrl":"https://doi.org/10.1109/DICTA.2010.64","url":null,"abstract":"The ability to measure metric information from videos is an essential functionality in security software packages that populate databases with human morphometric and gait descriptors. The configuration of such security systems often involves calibrating the cameras in the surveillance network. For this purpose, markers are placed on the ground in the vicinity of each camera location in order to compute a critical image-to-ground homography. This paper shows how the homography computation can be avoided by describing the key components of a system designed for view-independent video metrology. Experiments on measuring the height of subjects and ground distances from different camera views demonstrate the viability of the approach. In addition, it is shown that the calibration model of the proposed method yields more accurate measurements than the frequently used square pixel camera model.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123920337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Robust Image Hashing Using Higher Order Spectral Features 基于高阶谱特征的鲁棒图像哈希
Brenden Chen, V. Chandran
Robust image hashing seeks to transform a given input image into a shorter hashed version using a key-dependent non-invertible transform. These image hashes can be used for watermarking, image integrity authentication or image indexing for fast retrieval. This paper introduces a new method of generating image hashes based on extracting Higher Order Spectral features from the Radon projection of an input image. The feature extraction process is non-invertible, non-linear and different hashes can be produced from the same image through the use of random permutations of the input. We show that the transform is robust to typical image transformations such as JPEG compression, noise, scaling, rotation, smoothing and cropping. We evaluate our system using a verification-style framework based on calculating false match, false non-match likelihoods using the publicly available Uncompressed Colour Image database (UCID) of 1320 images. We also compare our results to Swaminathan’s Fourier-Mellin based hashing method with at least 1% EER improvement under noise, scaling and sharpening.
鲁棒图像哈希寻求使用依赖键的不可逆变换将给定的输入图像转换为更短的哈希版本。这些图像哈希可以用于水印,图像完整性认证或快速检索图像索引。介绍了一种基于从输入图像的Radon投影中提取高阶谱特征来生成图像哈希的新方法。特征提取过程是不可逆的,非线性的,通过使用输入的随机排列可以从同一图像产生不同的哈希值。我们表明,该变换对典型的图像变换(如JPEG压缩、噪声、缩放、旋转、平滑和裁剪)具有鲁棒性。我们使用基于计算假匹配的验证式框架来评估我们的系统,假不匹配的可能性使用1320张图像的公开可用的未压缩彩色图像数据库(UCID)。我们还将我们的结果与Swaminathan的基于Fourier-Mellin的哈希方法进行了比较,在噪声,缩放和锐化下,我们的EER至少提高了1%。
{"title":"Robust Image Hashing Using Higher Order Spectral Features","authors":"Brenden Chen, V. Chandran","doi":"10.1109/DICTA.2010.26","DOIUrl":"https://doi.org/10.1109/DICTA.2010.26","url":null,"abstract":"Robust image hashing seeks to transform a given input image into a shorter hashed version using a key-dependent non-invertible transform. These image hashes can be used for watermarking, image integrity authentication or image indexing for fast retrieval. This paper introduces a new method of generating image hashes based on extracting Higher Order Spectral features from the Radon projection of an input image. The feature extraction process is non-invertible, non-linear and different hashes can be produced from the same image through the use of random permutations of the input. We show that the transform is robust to typical image transformations such as JPEG compression, noise, scaling, rotation, smoothing and cropping. We evaluate our system using a verification-style framework based on calculating false match, false non-match likelihoods using the publicly available Uncompressed Colour Image database (UCID) of 1320 images. We also compare our results to Swaminathan’s Fourier-Mellin based hashing method with at least 1% EER improvement under noise, scaling and sharpening.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"28 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120901616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Camera Ego-Motion Estimation Using Phase Correlation under Planar Motion Constraint 平面运动约束下基于相位相关的摄像机自运动估计
S. Effendi, R. Jarvis
An intelligent robotic living assistive system has become a popular research in the last decade. One of the important topics in that research area is 3D object reconstruction from multiple views. This process may depend on motion estimation using vision. However, often a domestic robot on an electric wheel chair has to move in a steep rotational angle that causes motion estimation from vision to become inaccurate. In addition, an oblique viewing angle creates a perspective distortion to the captured images, which further worsens the estimation result. Hence, in this paper, we propose a new approach by altering the motion estimation problem into a 2D image registration problem. Our method’s accuracy is very close to that of the Scale Invariant Feature Transform (SIFT) features tracker, whereas the Kanade-Lucas-Tomasi (KLT) tracker’s drops as soon as the rotational angle reaches about 40¿. Although our method is 2.7 times slower than the KLT tracker, it is 19 times faster than the SIFT tracker.
智能机器人生活辅助系统是近十年来研究的热点。该研究领域的一个重要课题是多视角三维物体重建。这个过程可能依赖于使用视觉的运动估计。然而,电动轮椅上的家用机器人通常必须以一个陡峭的旋转角度移动,这导致视觉上的运动估计变得不准确。此外,倾斜的视角会对捕获的图像产生透视畸变,从而进一步恶化估计结果。因此,在本文中,我们提出了一种新的方法,将运动估计问题转化为二维图像配准问题。我们的方法的精度非常接近尺度不变特征变换(SIFT)特征跟踪器的精度,而Kanade-Lucas-Tomasi (KLT)跟踪器的精度在旋转角度达到40°左右时就会下降。虽然我们的方法比KLT跟踪器慢2.7倍,但比SIFT跟踪器快19倍。
{"title":"Camera Ego-Motion Estimation Using Phase Correlation under Planar Motion Constraint","authors":"S. Effendi, R. Jarvis","doi":"10.1109/DICTA.2010.38","DOIUrl":"https://doi.org/10.1109/DICTA.2010.38","url":null,"abstract":"An intelligent robotic living assistive system has become a popular research in the last decade. One of the important topics in that research area is 3D object reconstruction from multiple views. This process may depend on motion estimation using vision. However, often a domestic robot on an electric wheel chair has to move in a steep rotational angle that causes motion estimation from vision to become inaccurate. In addition, an oblique viewing angle creates a perspective distortion to the captured images, which further worsens the estimation result. Hence, in this paper, we propose a new approach by altering the motion estimation problem into a 2D image registration problem. Our method’s accuracy is very close to that of the Scale Invariant Feature Transform (SIFT) features tracker, whereas the Kanade-Lucas-Tomasi (KLT) tracker’s drops as soon as the rotational angle reaches about 40¿. Although our method is 2.7 times slower than the KLT tracker, it is 19 times faster than the SIFT tracker.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124558421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Learn Concepts in Multiple-Instance Learning with Diverse Density Framework Using Supervised Mean Shift 利用有监督均值移位在不同密度框架下学习概念
Ruo Du, Sheng Wang, Qiang Wu, Xiangjian He
Many machine learning tasks can be achieved by using Multiple-instance learning (MIL) when the target features are ambiguous. As a general MIL framework, Diverse Density (DD) provides a way to learn those ambiguous features by maxmising the DD estimator, and the maximum of DD estimator is called a concept. However, modeling and finding multiple concepts is often difficult especially without prior knowledge of concept number, i.e., every positive bag may contain multiple coexistent and heterogeneous concepts but we do not know how many concepts exist. In this work, we present a new approach to find multiple concepts of DD by using an supervised mean shift algorithm. Unlike classic mean shift (an unsupervised clustering algorithm), our approach for the first time introduces the class label to feature point and each point differently contributes the mean shift iterations according to its label and position. A feature point derives from an MIL instance and takes corresponding bag label. Our supervised mean shift starts from positive points and converges to the local maxima that are close to the positive points and far away from the negative points. Experiments qualitatively indicate that our approach has better properties than other DD methods.
当目标特征不明确时,许多机器学习任务可以通过多实例学习来实现。作为一种通用的MIL框架,DD提供了一种通过最大化DD估计量来学习这些模糊特征的方法,DD估计量的最大值被称为概念。然而,建模和发现多个概念往往是困难的,特别是在没有概念数量的先验知识的情况下,即每个正袋可能包含多个共存和异构的概念,但我们不知道有多少概念存在。在这项工作中,我们提出了一种新的方法,通过使用监督均值移位算法来找到DD的多个概念。与经典的mean shift(一种无监督聚类算法)不同,我们的方法首次为特征点引入了类标签,每个点根据其标签和位置不同,对mean shift迭代的贡献不同。特征点派生自MIL实例并采用相应的袋标签。我们的监督均值漂移从正的点开始,收敛到离正的点很近,离负的点很远的局部最大值。定性实验表明,该方法比其他DD方法具有更好的性能。
{"title":"Learn Concepts in Multiple-Instance Learning with Diverse Density Framework Using Supervised Mean Shift","authors":"Ruo Du, Sheng Wang, Qiang Wu, Xiangjian He","doi":"10.1109/DICTA.2010.111","DOIUrl":"https://doi.org/10.1109/DICTA.2010.111","url":null,"abstract":"Many machine learning tasks can be achieved by using Multiple-instance learning (MIL) when the target features are ambiguous. As a general MIL framework, Diverse Density (DD) provides a way to learn those ambiguous features by maxmising the DD estimator, and the maximum of DD estimator is called a concept. However, modeling and finding multiple concepts is often difficult especially without prior knowledge of concept number, i.e., every positive bag may contain multiple coexistent and heterogeneous concepts but we do not know how many concepts exist. In this work, we present a new approach to find multiple concepts of DD by using an supervised mean shift algorithm. Unlike classic mean shift (an unsupervised clustering algorithm), our approach for the first time introduces the class label to feature point and each point differently contributes the mean shift iterations according to its label and position. A feature point derives from an MIL instance and takes corresponding bag label. Our supervised mean shift starts from positive points and converges to the local maxima that are close to the positive points and far away from the negative points. Experiments qualitatively indicate that our approach has better properties than other DD methods.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132641974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2010 International Conference on Digital Image Computing: Techniques and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1