首页 > 最新文献

12th International Conference on Image Analysis and Processing, 2003.Proceedings.最新文献

英文 中文
Face detection for visual surveillance 用于视觉监控的人脸检测
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234036
G. Foresti, C. Micheloni, L. Snidaro, C. Marchiol
In this paper, a real-time face detection system for color image sequences is presented. The system applies three different face detection methods and integrates the obtained results to achieve a greater location accuracy. The first method localizes the human head through outline analysis, focusing the attention of the system on a small image area. The second, a skin color method, is applied to the blobs to find skin regions (e.g., faces, hands, etc.). The third. principal component analysis, is used to reduce the dimensionality of the data set and to detect face patterns. Finally. the obtained face locations are fused to increase the detection reliability and to avoid false detections due to occlusions or unfavorable human poses. The proposed approach is used by a video-based surveillance system for monitoring indoor scenes.
提出了一种基于彩色图像序列的实时人脸检测系统。该系统应用了三种不同的人脸检测方法,并将得到的结果进行整合,以达到更高的定位精度。第一种方法是通过轮廓分析来定位人的头部,将系统的注意力集中在一个小的图像区域上。第二种方法是皮肤颜色方法,将其应用于斑点以查找皮肤区域(例如,脸,手等)。第三层。主成分分析用于降低数据集的维数并检测人脸模式。最后。将获得的人脸位置进行融合,以提高检测可靠性,并避免由于遮挡或不利的人体姿势而导致的错误检测。所提出的方法被一个基于视频的监控系统用于监控室内场景。
{"title":"Face detection for visual surveillance","authors":"G. Foresti, C. Micheloni, L. Snidaro, C. Marchiol","doi":"10.1109/ICIAP.2003.1234036","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234036","url":null,"abstract":"In this paper, a real-time face detection system for color image sequences is presented. The system applies three different face detection methods and integrates the obtained results to achieve a greater location accuracy. The first method localizes the human head through outline analysis, focusing the attention of the system on a small image area. The second, a skin color method, is applied to the blobs to find skin regions (e.g., faces, hands, etc.). The third. principal component analysis, is used to reduce the dimensionality of the data set and to detect face patterns. Finally. the obtained face locations are fused to increase the detection reliability and to avoid false detections due to occlusions or unfavorable human poses. The proposed approach is used by a video-based surveillance system for monitoring indoor scenes.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"491 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123059709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Trajectories extraction from image sequences based on kinematic 基于运动学的图像序列轨迹提取
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234089
Ghilès Mostafaoui, C. Achard, M. Milgram
The problem of moving person tracking, without knowledge about the number of persons in the scene, and by taking into account occlusion, under-segmentation and over-segmentation, is challenging. A first motion detection gives us regions with several segmentation problems due to bad acquisition conditions. The tracking step, which has to manage all these problems, is realized with the EM algorithm (expectation maximization). It uses a kinematic model: we suppose a rectilinear and uniform apparent motion, this hypothesis seems very restrictive but remains locally accurate in most applications. Good results are obtained with this approach on several sequences, without any initialization.
在不知道场景中有多少人的情况下,并考虑到遮挡、分割不足和过度分割的情况下,移动人员跟踪的问题是具有挑战性的。第一次运动检测给了我们一些由于采集条件不好而存在分割问题的区域。跟踪步骤采用期望最大化算法(EM)实现,该算法需要处理所有这些问题。它使用一个运动学模型:我们假设一个直线和均匀的表观运动,这个假设似乎非常严格,但在大多数应用中仍然是局部准确的。该方法在不进行任何初始化的情况下,对多个序列都得到了较好的结果。
{"title":"Trajectories extraction from image sequences based on kinematic","authors":"Ghilès Mostafaoui, C. Achard, M. Milgram","doi":"10.1109/ICIAP.2003.1234089","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234089","url":null,"abstract":"The problem of moving person tracking, without knowledge about the number of persons in the scene, and by taking into account occlusion, under-segmentation and over-segmentation, is challenging. A first motion detection gives us regions with several segmentation problems due to bad acquisition conditions. The tracking step, which has to manage all these problems, is realized with the EM algorithm (expectation maximization). It uses a kinematic model: we suppose a rectilinear and uniform apparent motion, this hypothesis seems very restrictive but remains locally accurate in most applications. Good results are obtained with this approach on several sequences, without any initialization.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127377264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Color correction for digital photographs 数码照片的色彩校正
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234123
F. Gasparini, R. Schettini
The paper describes a reliable and rapid method for detecting and removing a color cast (i.e. a superimposed dominant color) in a digital image without any a priori knowledge of its semantic content. A multi-step algorithm classifies the input images as having no cast, evident cast, ambiguous cast, or intrinsic cast (images presenting a cast due to a predominant color that must be preserved). If an evident or ambiguous cast is found, a cast remover step, a modified version of the white balance algorithm, is then applied in the two cases of evident or ambiguous casts. The method we propose has been tuned and tested with positive results on a data set of over 650 images.
本文描述了一种可靠而快速的方法来检测和去除数字图像中的色偏(即叠加的主色),而无需先验地了解其语义内容。多步算法将输入图像分类为无强制转换、明显强制转换、模糊强制转换或内在强制转换(由于必须保留主色而呈现强制转换的图像)。如果发现明显或不明确的强制转换,则在明显或不明确强制转换的两种情况下应用强制转换移除步骤,即白平衡算法的修改版本。我们提出的方法已经在超过650张图像的数据集上进行了调整和测试,并取得了积极的结果。
{"title":"Color correction for digital photographs","authors":"F. Gasparini, R. Schettini","doi":"10.1109/ICIAP.2003.1234123","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234123","url":null,"abstract":"The paper describes a reliable and rapid method for detecting and removing a color cast (i.e. a superimposed dominant color) in a digital image without any a priori knowledge of its semantic content. A multi-step algorithm classifies the input images as having no cast, evident cast, ambiguous cast, or intrinsic cast (images presenting a cast due to a predominant color that must be preserved). If an evident or ambiguous cast is found, a cast remover step, a modified version of the white balance algorithm, is then applied in the two cases of evident or ambiguous casts. The method we propose has been tuned and tested with positive results on a data set of over 650 images.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126076058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 65
An efficient algorithm for exhaustive template matching based on normalized cross correlation 一种基于归一化互相关的穷举模板匹配算法
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234070
L. D. Stefano, S. Mattoccia, M. Mola
This work proposes a novel technique aimed at improving the performance of exhaustive template matching based on the normalized cross correlation (NCC). An effective sufficient condition, capable of rapidly pruning those match candidates that could not provide a better cross correlation score with respect to the current best candidate, can be obtained exploiting an upper bound of the NCC function. This upper bound relies on partial evaluation of the crosscorrelation and can be computed efficiently, yielding a significant reduction of operations compared to the NCC function and allows for reducing the overall number of operations required to carry out exhaustive searches. However, the bounded partial correlation (BPC) algorithm turns out to be significantly data dependent. In this paper we propose a novel algorithm that improves the overall performance of BPC thanks to the deployment of a more selective sufficient condition which allows for rendering the algorithm significantly less data dependent. Experimental results with real images and actual CPU time are reported.
本文提出了一种基于归一化互相关(NCC)的新技术,旨在提高穷举模板匹配的性能。利用NCC函数的上界,可以获得一个有效的充分条件,能够快速修剪那些相对于当前最佳候选者不能提供更好的交叉相关分数的匹配候选者。这个上界依赖于相互关系的部分求值,可以有效地计算,与NCC函数相比,大大减少了操作,并允许减少执行穷举搜索所需的操作总数。然而,有界偏相关(BPC)算法具有明显的数据依赖性。在本文中,我们提出了一种新的算法,该算法通过部署更具选择性的充分条件来提高BPC的整体性能,从而使算法显着减少对数据的依赖。给出了真实图像和实际CPU时间下的实验结果。
{"title":"An efficient algorithm for exhaustive template matching based on normalized cross correlation","authors":"L. D. Stefano, S. Mattoccia, M. Mola","doi":"10.1109/ICIAP.2003.1234070","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234070","url":null,"abstract":"This work proposes a novel technique aimed at improving the performance of exhaustive template matching based on the normalized cross correlation (NCC). An effective sufficient condition, capable of rapidly pruning those match candidates that could not provide a better cross correlation score with respect to the current best candidate, can be obtained exploiting an upper bound of the NCC function. This upper bound relies on partial evaluation of the crosscorrelation and can be computed efficiently, yielding a significant reduction of operations compared to the NCC function and allows for reducing the overall number of operations required to carry out exhaustive searches. However, the bounded partial correlation (BPC) algorithm turns out to be significantly data dependent. In this paper we propose a novel algorithm that improves the overall performance of BPC thanks to the deployment of a more selective sufficient condition which allows for rendering the algorithm significantly less data dependent. Experimental results with real images and actual CPU time are reported.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126111947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 79
A new algorithm for bit rate allocation in JPEG2000 tile encoding 一种新的JPEG2000编码码率分配算法
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234125
E. Ardizzone, M. Cascia, Fabio Testa
A new algorithm for allocating a given bit rate to different image tiles in the JPEG2000 encoding system is proposed. The algorithm outperforms other approaches commonly used in implementations. The new algorithm is suitable when information content is not equally distributed across the image. It is based on the computation of an index of the information content of each tile. To implement the proposed approach, we modified JasPer, a free software-based JPEG2000 coder implementation (Adams, M.D. and Kossentini, F., Proc. IEEE Int. Conf. on Image Process., vol.2, p.53-6, 2000). The experimentation was carried out on a subset of the JPEG2000 test images. Experimental results are reported, showing the PSNR of the decompressed images to be better than the one produced with the traditional approach, when the information content is not equally distributed across the image, and to be comparable to that of the traditional approach, when the information distribution is quite uniform.
在JPEG2000编码系统中,提出了一种将给定比特率分配给不同图像块的新算法。该算法优于实现中常用的其他方法。新算法适用于信息内容不均匀分布的情况。它基于对每个tile的信息内容的索引的计算。为了实现所提出的方法,我们修改了JasPer,一个基于JPEG2000编码器的免费软件实现(Adams, M.D.和Kossentini, F., Proc. IEEE Int.)。关于图像处理。,第2卷,第53-6页,2000)。实验是在JPEG2000测试图像的一个子集上进行的。实验结果表明,在信息内容不均匀分布的情况下,解压缩后的图像的PSNR优于传统方法;在信息分布较为均匀的情况下,解压缩后的图像的PSNR与传统方法相当。
{"title":"A new algorithm for bit rate allocation in JPEG2000 tile encoding","authors":"E. Ardizzone, M. Cascia, Fabio Testa","doi":"10.1109/ICIAP.2003.1234125","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234125","url":null,"abstract":"A new algorithm for allocating a given bit rate to different image tiles in the JPEG2000 encoding system is proposed. The algorithm outperforms other approaches commonly used in implementations. The new algorithm is suitable when information content is not equally distributed across the image. It is based on the computation of an index of the information content of each tile. To implement the proposed approach, we modified JasPer, a free software-based JPEG2000 coder implementation (Adams, M.D. and Kossentini, F., Proc. IEEE Int. Conf. on Image Process., vol.2, p.53-6, 2000). The experimentation was carried out on a subset of the JPEG2000 test images. Experimental results are reported, showing the PSNR of the decompressed images to be better than the one produced with the traditional approach, when the information content is not equally distributed across the image, and to be comparable to that of the traditional approach, when the information distribution is quite uniform.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"261 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122714893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A complete and stable set of affine-invariant Fourier descriptors 一个完整且稳定的仿射不变傅里叶描述子集
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234112
F. Chaker, M. Bannour, F. Ghorbel
We propose here a study of a new affine-invariant Fourier descriptors (Ghorbel (1998)) which are computed on the projection of a given curve that is assumed to be evolving on three dimensional space and supposed to be far enough from the camera. This set of descriptors is compared to the well known affine curvature. These invariants satisfy the completeness and stability properties.
我们在这里提出一种新的仿射不变傅立叶描述子(Ghorbel(1998))的研究,它是在给定曲线的投影上计算的,假设该曲线在三维空间上进化,并且应该离相机足够远。这组描述符与众所周知的仿射曲率进行了比较。这些不变量满足完备性和稳定性。
{"title":"A complete and stable set of affine-invariant Fourier descriptors","authors":"F. Chaker, M. Bannour, F. Ghorbel","doi":"10.1109/ICIAP.2003.1234112","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234112","url":null,"abstract":"We propose here a study of a new affine-invariant Fourier descriptors (Ghorbel (1998)) which are computed on the projection of a given curve that is assumed to be evolving on three dimensional space and supposed to be far enough from the camera. This set of descriptors is compared to the well known affine curvature. These invariants satisfy the completeness and stability properties.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122524696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Face recognition committee machines: dynamic vs. static structures 人脸识别委员会机器:动态与静态结构
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234037
Ho-Man Tang, Michael R. Lyu, Irwin King
We propose a dynamic face recognition committee machine (DFRCM) consisting of five well-known state-of-the-art algorithms in this paper. In previous work, we have developed a static committee machine which outperforms all the individual algorithms in the experiments. However, the weight for each expert in the committee is fixed and cannot be changed once the system is trained. We propose a dynamic architecture on the committee machine which uses the input face image in the gating network to improve the overall performance. In addition, we adopt a feedback mechanism on the committee machine to adjust the weight of an individual algorithm according to the performance of the algorithm. Detailed experimental results of different algorithms and the committee machine are given to demonstrate the effectiveness of the proposed system.
本文提出了一种由五种著名算法组成的动态人脸识别委员会机(DFRCM)。在之前的工作中,我们开发了一种静态委员会机,它在实验中优于所有单独的算法。然而,委员会中每个专家的权重是固定的,一旦系统经过培训就不能改变。我们在委员会机上提出了一种动态架构,在门控网络中使用输入的人脸图像来提高委员会机的整体性能。此外,我们在委员会机上采用反馈机制,根据算法的性能调整单个算法的权重。给出了不同算法和委员会机的详细实验结果,验证了所提系统的有效性。
{"title":"Face recognition committee machines: dynamic vs. static structures","authors":"Ho-Man Tang, Michael R. Lyu, Irwin King","doi":"10.1109/ICIAP.2003.1234037","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234037","url":null,"abstract":"We propose a dynamic face recognition committee machine (DFRCM) consisting of five well-known state-of-the-art algorithms in this paper. In previous work, we have developed a static committee machine which outperforms all the individual algorithms in the experiments. However, the weight for each expert in the committee is fixed and cannot be changed once the system is trained. We propose a dynamic architecture on the committee machine which uses the input face image in the gating network to improve the overall performance. In addition, we adopt a feedback mechanism on the committee machine to adjust the weight of an individual algorithm according to the performance of the algorithm. Detailed experimental results of different algorithms and the committee machine are given to demonstrate the effectiveness of the proposed system.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123027040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Revisiting image splitting 重新审视图像分割
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234069
A. Mérigot
The paper presents a modified version of the classical split and merge algorithm (Horowitz, S. and Pavlidis, T., 1976). Instead of performing a regular decomposition of the image, it relies on a split at an optimal position that makes a good interregion separation. The implementation of the algorithm uses an initial image preprocessing to speed-up computation. Experimental results show that the number of regions generated by the split phase is largely reduced and that the distortion of the segmented image is smaller, while the execution time is slightly increased.
本文提出了经典分割和合并算法的改进版本(Horowitz, S. and Pavlidis, T., 1976)。它不是对图像进行常规分解,而是依赖于在最佳位置进行分割,从而实现良好的区域间分离。该算法的实现采用初始图像预处理来加快计算速度。实验结果表明,分割相位产生的区域数量大大减少,分割图像的失真较小,而执行时间略有增加。
{"title":"Revisiting image splitting","authors":"A. Mérigot","doi":"10.1109/ICIAP.2003.1234069","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234069","url":null,"abstract":"The paper presents a modified version of the classical split and merge algorithm (Horowitz, S. and Pavlidis, T., 1976). Instead of performing a regular decomposition of the image, it relies on a split at an optimal position that makes a good interregion separation. The implementation of the algorithm uses an initial image preprocessing to speed-up computation. Experimental results show that the number of regions generated by the split phase is largely reduced and that the distortion of the segmented image is smaller, while the execution time is slightly increased.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124611131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Multiscale Fourier descriptor for shape classification 形状分类的多尺度傅里叶描述子
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234105
I. Kunttu, Leena Lepistö, J. Rauhamaa, A. Visa
The description of object shape is an important characteristic of an image. In image processing and pattern recognition, several different shape descriptors are used. In human visual perception, shapes are processed in multiple resolutions. Therefore, multiscale shape representation is essential in shape based image classification and retrieval. In the description of an object shape, the multiresolution representation provides also additional accuracy to the shape classification. We introduce a new descriptor for shape classification. This descriptor is called the multiscale Fourier descriptor, and it combines the benefits of a Fourier descriptor and multiscale shape representation. This descriptor is formed by applying a Fourier transform to the coefficients of the wavelet transform of the object boundary. In this way, the Fourier descriptor can be presented in multiple resolutions. We performed classification experiments using three image databases. The classification results of our method are compared to those of Fourier descriptors.
物体形状的描述是图像的一个重要特征。在图像处理和模式识别中,使用了几种不同的形状描述符。在人类的视觉感知中,形状以多种分辨率进行处理。因此,在基于形状的图像分类和检索中,多尺度形状表示至关重要。在物体形状的描述中,多分辨率表示还为形状分类提供了额外的准确性。提出了一种新的形状分类描述符。这个描述符被称为多尺度傅里叶描述符,它结合了傅里叶描述符和多尺度形状表示的优点。这个描述符是通过对目标边界的小波变换的系数进行傅里叶变换而形成的。通过这种方式,傅里叶描述符可以以多种分辨率表示。我们使用三个图像数据库进行分类实验。将本文方法的分类结果与傅立叶描述子的分类结果进行了比较。
{"title":"Multiscale Fourier descriptor for shape classification","authors":"I. Kunttu, Leena Lepistö, J. Rauhamaa, A. Visa","doi":"10.1109/ICIAP.2003.1234105","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234105","url":null,"abstract":"The description of object shape is an important characteristic of an image. In image processing and pattern recognition, several different shape descriptors are used. In human visual perception, shapes are processed in multiple resolutions. Therefore, multiscale shape representation is essential in shape based image classification and retrieval. In the description of an object shape, the multiresolution representation provides also additional accuracy to the shape classification. We introduce a new descriptor for shape classification. This descriptor is called the multiscale Fourier descriptor, and it combines the benefits of a Fourier descriptor and multiscale shape representation. This descriptor is formed by applying a Fourier transform to the coefficients of the wavelet transform of the object boundary. In this way, the Fourier descriptor can be presented in multiple resolutions. We performed classification experiments using three image databases. The classification results of our method are compared to those of Fourier descriptors.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129910543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
Pattern spaces from graph polynomials 图多项式的模式空间
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234096
Richard C. Wilson, E. Hancock
Although graph structures have proved useful in high level vision for object recognition and matching, they can prove computationally cumbersome because of the need to establish reliable correspondences between nodes. Hence, standard pattern recognition techniques cannot be easily applied to graphs since feature vectors are not easily constructed. To overcome this problem, we turn to the spectral matrix. We show how the elements of this matrix can be used to construct symmetric polynomials that are permutation invariant. The coefficients of these polynomials can be used as graph-features which can be encoded in a vectorial manner. Hence, the symmetric polynomials lead to a representation which is invariant under node permutations and so represents the graph structure without the need for labelling or correspondence operations. We demonstrate that these features are complete and continuous for 'simple' graphs (those without repeated eigenvalues in their spectrum). The notions of stability and discrimination are discussed, and we present experimental evaluation of these properties. Finally, we show that these graph characterizations can be used to cluster graphs from real datasets.
尽管图结构已被证明在高级视觉中用于对象识别和匹配,但由于需要在节点之间建立可靠的对应关系,它们可能被证明在计算上很麻烦。因此,标准的模式识别技术不能很容易地应用于图,因为特征向量不容易构造。为了克服这个问题,我们转向谱矩阵。我们展示了如何使用这个矩阵的元素来构造排列不变的对称多项式。这些多项式的系数可以用作图形特征,可以用向量方式编码。因此,对称多项式导致在节点置换下不变的表示,因此不需要标记或对应操作就可以表示图结构。我们证明了这些特征对于“简单”图(在其谱中没有重复特征值的图)是完整和连续的。讨论了稳定性和鉴别性的概念,并给出了这些性质的实验评价。最后,我们证明了这些图特征可以用于来自真实数据集的图聚类。
{"title":"Pattern spaces from graph polynomials","authors":"Richard C. Wilson, E. Hancock","doi":"10.1109/ICIAP.2003.1234096","DOIUrl":"https://doi.org/10.1109/ICIAP.2003.1234096","url":null,"abstract":"Although graph structures have proved useful in high level vision for object recognition and matching, they can prove computationally cumbersome because of the need to establish reliable correspondences between nodes. Hence, standard pattern recognition techniques cannot be easily applied to graphs since feature vectors are not easily constructed. To overcome this problem, we turn to the spectral matrix. We show how the elements of this matrix can be used to construct symmetric polynomials that are permutation invariant. The coefficients of these polynomials can be used as graph-features which can be encoded in a vectorial manner. Hence, the symmetric polynomials lead to a representation which is invariant under node permutations and so represents the graph structure without the need for labelling or correspondence operations. We demonstrate that these features are complete and continuous for 'simple' graphs (those without repeated eigenvalues in their spectrum). The notions of stability and discrimination are discussed, and we present experimental evaluation of these properties. Finally, we show that these graph characterizations can be used to cluster graphs from real datasets.","PeriodicalId":218076,"journal":{"name":"12th International Conference on Image Analysis and Processing, 2003.Proceedings.","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124138903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
12th International Conference on Image Analysis and Processing, 2003.Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1