首页 > 最新文献

Proceedings 11th International Conference on Image Analysis and Processing最新文献

英文 中文
Minimum entropy transform using Gabor wavelets for image compression 最小熵变换使用Gabor小波图像压缩
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957047
S. Fischer, G. Cristóbal
Most image compression methods are based on the use of the DCT or (bi-)orthogonal wavelets. However, in many cases improved performance in terms of visual quality can be expected if we consider a human visual system based model. The aim of this paper is to explore the potential of image compression techniques based on the use of nonorthogonal filters such as Gabor wavelets. The compression scheme is performed by a linear wavelet transform with filters similar to 2D Gabor functions through a quantizer based on measurements of the contrast sensitivity function of the human visual system (HVS). The compression performance is evaluated by entropy and error measures. Because of the non-orthogonality property, different image decompositions will have the same reconstruction. Thus, between all possible decompositions, one can be interested specifically in a minimum entropy wavelet transform that minimizes the information redundancy. This process can be considered as a nonlinear Gabor-wavelet transform that can be employed for compression applications. The overall optimization procedure has been implemented as an iterative algorithm producing a significant reduction in the information redundancy.
大多数图像压缩方法是基于DCT或(双)正交小波的使用。然而,在许多情况下,如果我们考虑基于人类视觉系统的模型,则可以期望在视觉质量方面提高性能。本文的目的是探索基于非正交滤波器(如Gabor小波)的图像压缩技术的潜力。压缩方案由线性小波变换和类似于二维Gabor函数的滤波器通过基于人类视觉系统(HVS)对比敏感度函数测量的量化器来执行。通过熵和误差度量来评价压缩性能。由于图像的非正交性,不同的图像分解会产生相同的重构。因此,在所有可能的分解之间,人们可以特别对最小化信息冗余的最小熵小波变换感兴趣。这个过程可以看作是一个非线性的gabor -小波变换,可以用于压缩应用。整体优化过程已被实现为一个迭代算法,产生显著减少信息冗余。
{"title":"Minimum entropy transform using Gabor wavelets for image compression","authors":"S. Fischer, G. Cristóbal","doi":"10.1109/ICIAP.2001.957047","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957047","url":null,"abstract":"Most image compression methods are based on the use of the DCT or (bi-)orthogonal wavelets. However, in many cases improved performance in terms of visual quality can be expected if we consider a human visual system based model. The aim of this paper is to explore the potential of image compression techniques based on the use of nonorthogonal filters such as Gabor wavelets. The compression scheme is performed by a linear wavelet transform with filters similar to 2D Gabor functions through a quantizer based on measurements of the contrast sensitivity function of the human visual system (HVS). The compression performance is evaluated by entropy and error measures. Because of the non-orthogonality property, different image decompositions will have the same reconstruction. Thus, between all possible decompositions, one can be interested specifically in a minimum entropy wavelet transform that minimizes the information redundancy. This process can be considered as a nonlinear Gabor-wavelet transform that can be employed for compression applications. The overall optimization procedure has been implemented as an iterative algorithm producing a significant reduction in the information redundancy.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127062386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
3D biological object detection and labeling in multidimensional microscopy imaging 三维生物目标检测和标记在多维显微镜成像
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957011
Juhui Wang, A. Trubuil, C. Graffigne
One essential assumption used in object detection and labeling by imaging is that the photometric properties of the object are homogeneous. This homogeneity requirement is often violated in microscopy imaging. Classical methods are usually of high computational cost and fail to give a stable solution. This paper presents a low computational complexity and robust method for 3D biological object detection and labeling. The developed approach is based on a statistical, non-parametric framework. The image is first divided into regular non-overlapped regions and each region is evaluated according to a general photometric variability model. The regions not consistent with this model are considered as aberrations in the data and excluded from the analysis procedure. Simultaneously, the interior parts of the object are detected. They correspond to regions where the supposed model is valid. In the second stage, the valid regions from the same object are merged under a set of hypotheses. These hypotheses are generated by taking into account photometric and geometric properties of objects and the merging is realized according to an iterative algorithm. The approach has been applied in investigations of the spatial distribution of nuclei on colonic glands of rats observed with with help of confocal fluorescence microscopy.
在物体检测和标记成像中使用的一个基本假设是物体的光度特性是均匀的。这种均匀性要求在显微镜成像中经常被违反。经典方法通常计算成本高,且不能给出稳定的解。提出了一种计算复杂度低、鲁棒性好的三维生物目标检测与标记方法。开发的方法是基于统计的非参数框架。首先将图像划分为规则的非重叠区域,并根据一般的光度变异性模型对每个区域进行评估。与该模型不一致的区域被认为是数据中的畸变,并被排除在分析程序之外。同时,检测物体的内部部分。它们对应于假定的模型有效的区域。在第二阶段,将同一目标的有效区域合并到一组假设下。这些假设是通过考虑物体的光度和几何特性产生的,并通过迭代算法实现合并。该方法已应用于共聚焦荧光显微镜观察大鼠结肠腺细胞核的空间分布。
{"title":"3D biological object detection and labeling in multidimensional microscopy imaging","authors":"Juhui Wang, A. Trubuil, C. Graffigne","doi":"10.1109/ICIAP.2001.957011","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957011","url":null,"abstract":"One essential assumption used in object detection and labeling by imaging is that the photometric properties of the object are homogeneous. This homogeneity requirement is often violated in microscopy imaging. Classical methods are usually of high computational cost and fail to give a stable solution. This paper presents a low computational complexity and robust method for 3D biological object detection and labeling. The developed approach is based on a statistical, non-parametric framework. The image is first divided into regular non-overlapped regions and each region is evaluated according to a general photometric variability model. The regions not consistent with this model are considered as aberrations in the data and excluded from the analysis procedure. Simultaneously, the interior parts of the object are detected. They correspond to regions where the supposed model is valid. In the second stage, the valid regions from the same object are merged under a set of hypotheses. These hypotheses are generated by taking into account photometric and geometric properties of objects and the merging is realized according to an iterative algorithm. The approach has been applied in investigations of the spatial distribution of nuclei on colonic glands of rats observed with with help of confocal fluorescence microscopy.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126981243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Image analysis of crosswalk 人行横道图像分析
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957003
T. Shioyama, Haiyuan Wu, Y. Nishibe, Naoki Nakamura, Suguru Kitawaki
This paper proposes a method for image analysis of a crosswalk and a traffic light. The method provides information not only about the length of a crosswalk, but also about the colour of the traffic light. The length of a crosswalk is estimated by projective geometry using white lines painted on the road at a crosswalk. Furthermore, the state of the traffic light, that is, the colour of green (walk signal) or red (stop signal), is detected by searching the green (or red) traffic light using affine moment invariants. In order to evaluate the performance, experimental results estimating the length and detecting the traffic light are presented.
提出了一种人行横道和交通灯图像分析方法。该方法不仅提供了人行横道长度的信息,还提供了交通灯颜色的信息。人行横道的长度是用投影几何在人行横道的道路上画上白线来估计的。此外,通过使用仿射矩不变量搜索绿色(或红色)交通灯来检测交通灯的状态,即绿色(步行信号)或红色(停车信号)的颜色。为了评价该算法的性能,给出了估计交通灯长度和检测交通灯长度的实验结果。
{"title":"Image analysis of crosswalk","authors":"T. Shioyama, Haiyuan Wu, Y. Nishibe, Naoki Nakamura, Suguru Kitawaki","doi":"10.1109/ICIAP.2001.957003","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957003","url":null,"abstract":"This paper proposes a method for image analysis of a crosswalk and a traffic light. The method provides information not only about the length of a crosswalk, but also about the colour of the traffic light. The length of a crosswalk is estimated by projective geometry using white lines painted on the road at a crosswalk. Furthermore, the state of the traffic light, that is, the colour of green (walk signal) or red (stop signal), is detected by searching the green (or red) traffic light using affine moment invariants. In order to evaluate the performance, experimental results estimating the length and detecting the traffic light are presented.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124148593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Maximum likelihood motion segmentation using eigendecomposition 基于特征分解的最大似然运动分割
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956986
A. Robles-Kelly, E. Hancock
This paper presents an iterative maximum likelihood framework for motion segmentation. Our representation of the segmentation problem is based on a similarity matrix for the motion vectors for pairs of pixel blocks. By applying eigendecomposition to the similarity matrix, we develop a maximum likelihood method for grouping the pixel blocks into objects which share a common motion vector. We experiment with the resulting clustering method on a number of real-world motion sequences. Here ground truth data indicates that the method can result in motion classification errors as low as 3%.
提出了一种用于运动分割的迭代极大似然框架。我们的分割问题的表示是基于对像素块的运动向量的相似矩阵。通过将特征分解应用于相似矩阵,我们开发了一种最大似然方法,将像素块分组为共享共同运动向量的对象。我们在许多真实世界的运动序列上实验了得到的聚类方法。这里的地面真实数据表明,该方法可以导致低至3%的运动分类误差。
{"title":"Maximum likelihood motion segmentation using eigendecomposition","authors":"A. Robles-Kelly, E. Hancock","doi":"10.1109/ICIAP.2001.956986","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.956986","url":null,"abstract":"This paper presents an iterative maximum likelihood framework for motion segmentation. Our representation of the segmentation problem is based on a similarity matrix for the motion vectors for pairs of pixel blocks. By applying eigendecomposition to the similarity matrix, we develop a maximum likelihood method for grouping the pixel blocks into objects which share a common motion vector. We experiment with the resulting clustering method on a number of real-world motion sequences. Here ground truth data indicates that the method can result in motion classification errors as low as 3%.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121594012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
2D shape recognition by hidden Markov models 基于隐马尔可夫模型的二维形状识别
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956980
M. Bicego, Vittorio Murino
In computer vision, two-dimensional shape classification is a complex and well-studied topic, often basic for three-dimensional object recognition. Object contours are a widely chosen feature for representing objects, useful in many respects for classification problems. We address the use of hidden Markov models (HMM) for shape analysis, based on chain code representation of object contours. HMM represent a widespread approach to the modeling of sequences, and are largely used for many applications, but unfortunately are poorly considered in the literature concerning shape analysis, and in any case, without reference to noise or occlusion sensitivity. The HMM approach to shape modeling is tested, probing good invariance of this method in terms of noise, occlusions, and object scaling.
在计算机视觉中,二维形状分类是一个复杂而深入研究的课题,通常是三维物体识别的基础。对象轮廓是一种广泛使用的用于表示对象的特征,在许多方面对分类问题都很有用。我们解决使用隐马尔可夫模型(HMM)的形状分析,基于链码表示的对象轮廓。HMM代表了一种广泛的序列建模方法,并且在许多应用中被广泛使用,但不幸的是,在关于形状分析的文献中,没有考虑到噪声或遮挡敏感性。对形状建模的HMM方法进行了测试,探测了该方法在噪声、遮挡和对象缩放方面的良好不变性。
{"title":"2D shape recognition by hidden Markov models","authors":"M. Bicego, Vittorio Murino","doi":"10.1109/ICIAP.2001.956980","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.956980","url":null,"abstract":"In computer vision, two-dimensional shape classification is a complex and well-studied topic, often basic for three-dimensional object recognition. Object contours are a widely chosen feature for representing objects, useful in many respects for classification problems. We address the use of hidden Markov models (HMM) for shape analysis, based on chain code representation of object contours. HMM represent a widespread approach to the modeling of sequences, and are largely used for many applications, but unfortunately are poorly considered in the literature concerning shape analysis, and in any case, without reference to noise or occlusion sensitivity. The HMM approach to shape modeling is tested, probing good invariance of this method in terms of noise, occlusions, and object scaling.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121959085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Using contextual information for image retrieval 使用上下文信息进行图像检索
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957014
L. Gregory, J. Kittler
Visual information retrieval presents many challenges for the computer vision community. The terabytes of visual information stored in digital image and video libraries will remain inaccessible if the problems of indexing and retrieval are not addressed. We present techniques for content based image retrieval using higher level contextual information. The content is represented and queried using attributed relational graphs, with colour attributes and relaxation labelling techniques. We present retrieval examples using both synthetic and real images of national flags. This, although a simplistic problem, highlights the shortcomings and difficulties associated with content based retrieval systems.
视觉信息检索对计算机视觉界提出了许多挑战。如果不解决索引和检索的问题,存储在数字图像和视频库中的tb级视觉信息将仍然无法访问。我们提出了使用更高层次上下文信息的基于内容的图像检索技术。使用带有颜色属性和松弛标记技术的属性关系图来表示和查询内容。我们给出了使用合成和真实国旗图像的检索示例。这虽然是一个简单的问题,但却突出了与基于内容的检索系统相关的缺点和困难。
{"title":"Using contextual information for image retrieval","authors":"L. Gregory, J. Kittler","doi":"10.1109/ICIAP.2001.957014","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957014","url":null,"abstract":"Visual information retrieval presents many challenges for the computer vision community. The terabytes of visual information stored in digital image and video libraries will remain inaccessible if the problems of indexing and retrieval are not addressed. We present techniques for content based image retrieval using higher level contextual information. The content is represented and queried using attributed relational graphs, with colour attributes and relaxation labelling techniques. We present retrieval examples using both synthetic and real images of national flags. This, although a simplistic problem, highlights the shortcomings and difficulties associated with content based retrieval systems.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129761098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Real-time tracking and reproduction of 3D human body motion 实时跟踪和再现三维人体运动
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956993
C. Colombo, A. Bimbo, A. Valli
Non-intrusive human body tracking is a key issue in advanced human-computer interaction, with key applications ranging from virtual reality to videoconferencing and telepresence. This paper describes a system for vision-based tracking of body posture. The system is explicitly designed to provide a robust yet simple and inexpensive solution to real-time body tracking through a careful choice of visual and kinematic models. Human posture representation is fully compatible with the MPEG-4 standard. Results of system application to a computer graphics scenario (animation of 3D avatars) are presented and discussed.
非侵入式人体跟踪是高级人机交互中的一个关键问题,其关键应用范围从虚拟现实到视频会议和远程呈现。本文描述了一种基于视觉的人体姿态跟踪系统。该系统旨在通过精心选择视觉和运动学模型,为实时身体跟踪提供强大而简单且廉价的解决方案。人体姿势表示与MPEG-4标准完全兼容。最后给出并讨论了该系统在计算机图形场景(三维人物动画)中的应用结果。
{"title":"Real-time tracking and reproduction of 3D human body motion","authors":"C. Colombo, A. Bimbo, A. Valli","doi":"10.1109/ICIAP.2001.956993","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.956993","url":null,"abstract":"Non-intrusive human body tracking is a key issue in advanced human-computer interaction, with key applications ranging from virtual reality to videoconferencing and telepresence. This paper describes a system for vision-based tracking of body posture. The system is explicitly designed to provide a robust yet simple and inexpensive solution to real-time body tracking through a careful choice of visual and kinematic models. Human posture representation is fully compatible with the MPEG-4 standard. Results of system application to a computer graphics scenario (animation of 3D avatars) are presented and discussed.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131043690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Weighted distance transforms in rectangular grids
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957029
I. Sintorn, G. Borgefors
We investigate weighted distance transforms in 2D images in rectangular grids. We use a local neighborhood of size 3/spl times/3 and assume a rectangular grid with arbitrary ratio between the sides. The weights (local distances) are optimized by minimizing the maximum error over linear trajectories, which is an all-digital approach. General solutions for all ratios are presented. We also present numeric results for the cases when the ratio between the sides equals 1 (comparable with studies of weighted distance transforms in the square grid), 4/3 and 3. Integer solutions for both real and integer scale factors are presented.
我们研究了矩形网格中二维图像的加权距离变换。我们使用大小为3/spl * /3的局部邻域,并假设一个矩形网格,两边之间具有任意比例。权重(局部距离)通过最小化线性轨迹上的最大误差来优化,这是一种全数字方法。给出了所有比率的通解。我们还提供了边之间的比率等于1(与在正方形网格中加权距离变换的研究相比较)、4/3和3的情况下的数值结果。给出了实尺度因子和整数尺度因子的整数解。
{"title":"Weighted distance transforms in rectangular grids","authors":"I. Sintorn, G. Borgefors","doi":"10.1109/ICIAP.2001.957029","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957029","url":null,"abstract":"We investigate weighted distance transforms in 2D images in rectangular grids. We use a local neighborhood of size 3/spl times/3 and assume a rectangular grid with arbitrary ratio between the sides. The weights (local distances) are optimized by minimizing the maximum error over linear trajectories, which is an all-digital approach. General solutions for all ratios are presented. We also present numeric results for the cases when the ratio between the sides equals 1 (comparable with studies of weighted distance transforms in the square grid), 4/3 and 3. Integer solutions for both real and integer scale factors are presented.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131061681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Text enhancement with asymmetric filter for video OCR 文本增强与视频OCR的非对称滤波器
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957007
Datong Chen, K. Shearer, H. Bourlard
Stripes are common sub-structures of text characters, and the scale of these stripes varies little within a word. This scale consistency thus provides us with a useful feature for text detection and segmentation. A new form of filter is derived from the Gabor filter, and it is shown that this filter can efficiently estimate the scales of these stripes. The contrast of text in video can then be increased by enhancing the edges of only those stripes found to correspond to a suitable scale. More specifically the algorithm presented here enhances the stripes in three pre-selected scale ranges. The resulting enhancement yields much better performance from the binarization process, which is the step required before character recognition.
条纹是文本字符的常见子结构,这些条纹的大小在一个单词内变化不大。因此,这种尺度一致性为文本检测和分割提供了一个有用的特性。在Gabor滤波器的基础上提出了一种新的滤波器形式,并证明了该滤波器能有效地估计条纹的尺度。视频中文本的对比度可以通过增强那些条纹的边缘来增加,这些条纹对应于一个合适的比例。更具体地说,本文提出的算法在三个预先选择的尺度范围内增强条纹。由此产生的增强从二值化过程中产生了更好的性能,这是字符识别之前需要的步骤。
{"title":"Text enhancement with asymmetric filter for video OCR","authors":"Datong Chen, K. Shearer, H. Bourlard","doi":"10.1109/ICIAP.2001.957007","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957007","url":null,"abstract":"Stripes are common sub-structures of text characters, and the scale of these stripes varies little within a word. This scale consistency thus provides us with a useful feature for text detection and segmentation. A new form of filter is derived from the Gabor filter, and it is shown that this filter can efficiently estimate the scales of these stripes. The contrast of text in video can then be increased by enhancing the edges of only those stripes found to correspond to a suitable scale. More specifically the algorithm presented here enhances the stripes in three pre-selected scale ranges. The resulting enhancement yields much better performance from the binarization process, which is the step required before character recognition.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134219410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 74
Detection and description of human running behaviour in sports video multimedia database 体育视频多媒体数据库中人体跑步行为的检测与描述
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957037
F. Cheng, W. Christmas, J. Kittler
Motion description is an example of high-level video processing. It is attracting increasing interest in the computer vision community, due to its wide spectrum of applications. In such applications as multimedia database systems, motion descriptors act as a high-level query tool. We propose a periodic motion detection and description algorithm. We demonstrate that the descriptor extracted by the algorithm can characterise the human running behaviour. It can also serve as a basis for the classification of the human running activity. Experimental results based on Barcelona Olympic Games image sequences show the benefits of the proposed algorithm.
运动描述是高级视频处理的一个例子。由于其广泛的应用范围,它正在吸引计算机视觉社区越来越多的兴趣。在多媒体数据库系统等应用程序中,运动描述符充当高级查询工具。提出了一种周期运动检测和描述算法。我们证明了该算法提取的描述符可以描述人类的跑步行为。它也可以作为人类跑步活动分类的依据。基于巴塞罗那奥运会图像序列的实验结果表明了该算法的有效性。
{"title":"Detection and description of human running behaviour in sports video multimedia database","authors":"F. Cheng, W. Christmas, J. Kittler","doi":"10.1109/ICIAP.2001.957037","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957037","url":null,"abstract":"Motion description is an example of high-level video processing. It is attracting increasing interest in the computer vision community, due to its wide spectrum of applications. In such applications as multimedia database systems, motion descriptors act as a high-level query tool. We propose a periodic motion detection and description algorithm. We demonstrate that the descriptor extracted by the algorithm can characterise the human running behaviour. It can also serve as a basis for the classification of the human running activity. Experimental results based on Barcelona Olympic Games image sequences show the benefits of the proposed algorithm.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134139081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
Proceedings 11th International Conference on Image Analysis and Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1