12th International Conference on Image Analysis and Processing, 2003.Proceedings.最新文献

英文中文

View-invariant face detection method based on local PCA cells

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234025

K. Hotta

The paper presents a view-invariant face detection method based on local PCA cells. In order to extract the general features of faces at each view and position, Gabor filters and local PCA are used. Local PCA cells specialized to each view and position are made by applying a Gaussian to the outputs of the local PCA of Gabor features. By applying the Gaussian, only the local PCA cells which are a similar view to an input give large values. This decreases the bad influence of the local PCA cells of other views. As a result, only one classifier can treat multi-view faces well by integrating the outputs of local PCA cells. It is confirmed that the proposed method can detect multi-view faces. Generalization ability is improved by selecting the local PCA cells using a reconstruction error of local PCA.

提出了一种基于局部主成分分析单元的视图不变人脸检测方法。为了在每个视图和位置提取人脸的一般特征，使用了Gabor滤波器和局部PCA。通过对Gabor特征的局部主成分分析的输出应用高斯函数，得到每个视图和位置的局部主成分分析单元。通过应用高斯，只有与输入视图相似的局部PCA单元才会给出大的值。这减少了其他视图的局部PCA单元的不良影响。因此，只有一个分类器可以通过整合局部主成分分析单元的输出来处理多视图人脸。实验结果表明，该方法可以检测多视图人脸。利用局部主成分的重构误差选择局部主成分单元，提高了泛化能力。

引用次数: 18

A real-time text-independent speaker identification system 一种与文本无关的实时说话人识别系统

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234121

L. Cordella, P. Foggia, Carlo Sansone, M. Vento

The paper presents a real-time speaker identification system based on the analysis of the audio track of a video stream. The system has been employed in the context of automatic video segmentation. It uses features evaluated in both the time and frequency domains. Their combined use significantly improve the performance of the system. Experiments have been carried on a database extracted from over one hour of television news, including 10 speakers. The obtained results confirm the effectiveness of the approach, showing an error rate less then 1% when the time interval used for identifying a speaker is about 1.5 seconds.

提出了一种基于视频流音轨分析的实时说话人识别系统。该系统已应用于视频自动分割中。它使用在时域和频域评估的特征。它们的组合使用显著提高了系统的性能。实验是在一个从一个多小时的电视新闻中提取的数据库中进行的，其中包括10位发言者。实验结果证实了该方法的有效性，当识别说话人的时间间隔约为1.5秒时，错误率小于1%。

引用次数: 30

Object segmentation using feature based conditional morphology 基于特征的条件形态学的目标分割

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234107

M. R. Hamid, Aijaz A. Baloch, A. Bilal, Nauman Zaffar

This paper presents a new technique to segment objects of interest from cluttered background with varying edge densities and illumination conditions from gray scale imagery. An optimal background model is generated and an index of disparity of the objects from this model is computed. This index estimates the disparity, both in terms of edge densities and edge orientation. We introduce feature based conditional morphology to process the representations that are most likely to belong to the object of interest and obtain a distilled edge map. These edges are linked using N/sup th/ order interpolation to get the final outline of the object. We compare our approach with 9 contemporary background subtraction algorithms (Toyama et al. (1999)). Our approach shows significant performance advantages and uses only the gray scale images, while the other approaches also need the color images for their algorithms. A comparison with the conventional morphological techniques is also made to highlight the advantages of our algorithms.

提出了一种从灰度图像中具有不同边缘密度和光照条件的杂乱背景中分割感兴趣目标的新技术。生成最优背景模型，并以此模型计算出目标的视差指数。该指数从边缘密度和边缘方向两方面来估计差异。我们引入基于特征的条件形态学来处理最有可能属于感兴趣对象的表示，并获得一个蒸馏的边缘映射。使用N/sup /阶插值将这些边连接起来，以获得对象的最终轮廓。我们将我们的方法与9种当代背景减法算法(Toyama et al.(1999))进行了比较。我们的方法显示出明显的性能优势，并且只使用灰度图像，而其他方法的算法也需要彩色图像。并与传统形态学技术进行了比较，以突出我们的算法的优势。

引用次数: 9

Class-oriented recognizer design by weighting local decisions 基于加权局部决策的面向类识别器设计

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234128

S. Impedovo, G. Pirlo

The paper presents a new technique for the design of class-oriented recognizer. For each recognizer, a generic technique is used to determine, in an optimal way, the weights to balance the local decisions obtained from the analysis by parts of the patterns of the specific class. The experimental results, that have been obtained in the field of handwritten numeral and character recognition, demonstrate the superiority of the new technique with respect to other traditional approaches.

提出了一种面向类识别器设计的新技术。对于每个识别器，使用通用技术以最优方式确定权重，以平衡由特定类的部分模式分析获得的局部决策。在手写体数字和字符识别领域取得的实验结果表明，该方法相对于其他传统方法具有优越性。

引用次数: 11

Human detection and tracking within hostile aquatic environments 人类在恶劣的水生环境中的探测和跟踪

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234039

H. Eng, A. H. Kam, Junxian Wang, W. Yau, Lijuan Jiang

Many deployed systems for human motion tracking and detection are found inadequate when applied on hostile outdoor environments. This paper provides insights into this problem by developing an outdoor aquatic surveillance system, which detects swimmers within the hostile environment of an outdoor public swimming pool. A novel block-based background model and thresholding-with-hysteresis methodology is proposed to extract swimmers amid reflections, ripples, splashes and lighting changes. The problem of partial occlusion between swimmers is resolved based on a proposed Markov random field framework. The algorithm has been incorporated into a live system with robust results for different challenging outdoor pool conditions.

许多已部署的人体运动跟踪和检测系统在恶劣的室外环境中应用不足。本文通过开发一种室外水生监测系统提供了对这一问题的见解，该系统可以检测室外公共游泳池恶劣环境中的游泳者。提出了一种新的基于块的背景模型和迟滞阈值方法，用于在反射、涟漪、飞溅和光照变化中提取游泳者。基于提出的马尔可夫随机场框架，解决了游泳者之间的局部遮挡问题。该算法已被纳入一个实时系统，对不同具有挑战性的室外游泳池条件具有鲁棒性结果。

引用次数: 4

Spatial data structures for version management of engineering drawings in CAD database CAD数据库中工程图纸版本管理的空间数据结构

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234053

Yasuaki Nakamura, H. Dekihara

In the engineering database system, multiple versions of a design, including engineering drawings, should be managed efficiently. Spatial data structures can manage spatial objects in a drawing efficiently. The paper proposes extended spatial data structures for efficient management of multiversion engineering drawings. The R-tree is adapted as a basic data structure. The efficient mechanism to manage the difference between drawings is introduced to the R-tree to eliminate redundant duplications and to reduce the amount of storage required for the data structure. Extended data structures of the R-tree, called MVR and MVR* trees, are developed and the performances of these trees are evaluated. A series of simulation tests shows that, compared with the basic R-tree, the amounts of storage required for the MVR and MVR* trees are reduced to 50% and 30%, respectively. The search efficiencies of the R, MVR, and MVR* trees are almost the same.

在工程数据库系统中，需要对包括工程图纸在内的多个版本的设计进行有效的管理。空间数据结构可以有效地管理绘图中的空间对象。为实现多版本工程图纸的高效管理，提出了扩展的空间数据结构。r树是一种基本的数据结构。在r树中引入了有效的机制来管理图纸之间的差异，以消除冗余重复并减少数据结构所需的存储量。开发了r -树的扩展数据结构，称为MVR和MVR*树，并对这些树的性能进行了评估。一系列仿真试验表明，与基本r树相比，MVR树和MVR*树所需的存储空间分别减少了50%和30%。R树、MVR树和MVR*树的搜索效率几乎相同。

引用次数: 7

An empirical performance evaluation technique for discrete second derivative edge detectors 离散二阶导数边缘检测器的经验性能评价技术

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234115

S. Coleman, B. Scotney, M. G. Herron

The problem of edge evaluation in relation to image gradient-based edge detectors has been widely studied, and there exist a range of edge evaluation techniques that are appropriate to such edge detectors. Although discrete second derivative operators often form the basis of edge detection methods, whereby zero-crossings are used to locate edge pixels, rather less attention has been paid to the development of edge evaluation techniques that are directly appropriate to zero-crossing methods. We propose a new evaluation technique that performs edge sensitivity analysis with respect to angular orientation and displacement errors for edges located by such discrete second derivative operators. The technique applies a finite element interpolation to the output values of the second derivative operator. Hence the method is used to directly evaluate edges located by a second derivative operator without the need to use a supplementary first derivative operator for gradient approximation.

基于图像梯度的边缘检测器的边缘评估问题已经得到了广泛的研究，并且存在一系列适用于这类边缘检测器的边缘评估技术。虽然离散二阶导数算子经常构成边缘检测方法的基础，其中使用过零来定位边缘像素，但很少有人关注直接适用于过零方法的边缘评估技术的发展。我们提出了一种新的评估技术，对由这种离散二阶导数算子定位的边缘进行角取向和位移误差的边缘灵敏度分析。该技术将有限元插值应用于二阶导数算子的输出值。因此，该方法可以直接求出由二阶导数算子定位的边，而不需要使用补充的一阶导数算子进行梯度逼近。

引用次数: 4

Using hidden Markov models and wavelets for face recognition 利用隐马尔可夫模型和小波进行人脸识别

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234024

M. Bicego, U. Castellani, Vittorio Murino

In this paper, a new system for face recognition is proposed, based on hidden Markov models (HMM) and wavelet coding. A sequence of overlapping sub-images is extracted from each face image, computing the wavelet coefficients for each of them. The whole sequence is then modelled by using hidden Markov models. The proposed method is compared with a DCT coefficient-based approach (Kohir et al. (1998)), showing comparable results. By using an accurate model selection procedure, we show that results proposed in Kohir can be improved even more. The obtained results outperform all results presented in the literature on the Olivetti Research Laboratory (ORL) face database, reaching a 100% recognition rate. This performance proves the suitability of HMM to deal with the new JPEG2000 image compression standard.

本文提出了一种基于隐马尔可夫模型和小波编码的人脸识别系统。从每个人脸图像中提取重叠子图像序列，计算每个子图像的小波系数。然后用隐马尔可夫模型对整个序列进行建模。将所提出的方法与基于DCT系数的方法(Kohir et al.(1998))进行了比较，结果具有可比性。通过使用精确的模型选择程序，我们表明Kohir提出的结果可以得到更大的改进。获得的结果优于Olivetti研究实验室(ORL)人脸数据库上的所有文献结果，达到100%的识别率。这一性能证明了隐马尔可夫算法在JPEG2000图像压缩标准下的适用性。

引用次数: 97

Improving shape recovery by estimating properties of slightly-rough surfaces 通过估算微粗糙表面的特性来提高形状恢复

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234021

H. Ragheb, E. Hancock

We illustrate the use of the Beckmann formulation of the Kirchhoff theory for surface analysis problems in computer vision. The Beckmann model is a physical model that describes the reflectance of light from rough surfaces. Here, we use the modified form of the Beckmann model for slightly-rough surfaces using the modification of C.L. Vernold and J.E. Harvey (see Proc. SPIE, vol.3426, p.51-6, 1998). The parameters of the model are the surface roughness and the correlation length. We show how the surface roughness can be estimated using the specular reflectance properties. We also propose a technique for estimating the correlation length using pairs of surface images, subject to different illumination directions. With these parameters to hand, the Beckmann model may be used to perform photometric correction, and hence shape-from-shading may be applied to the corrected Lambertian image to recover improved shape. This model may also be used to re-illuminate the recovered surface. We present experiments to illustrate the utility of the method for each of these tasks.

我们说明了在计算机视觉表面分析问题中使用贝克曼公式的基尔霍夫理论。贝克曼模型是描述粗糙表面的光反射率的物理模型。在这里，我们使用贝克曼模型的修改形式，使用C.L. Vernold和J.E. Harvey的修改(见Proc. SPIE, vol.3426, p.51-6, 1998)。模型的参数为表面粗糙度和相关长度。我们展示了如何使用镜面反射特性来估计表面粗糙度。我们还提出了一种利用不同照明方向的表面图像对估计相关长度的技术。有了这些参数，贝克曼模型可以用来进行光度校正，因此可以将阴影形状应用于校正后的兰伯特图像，以恢复改进的形状。这个模型也可以用来重新照亮恢复的表面。我们提供实验来说明该方法对这些任务的效用。

引用次数: 1

Finding cavities and tunnels in 3D complex objects 在3D复杂物体中寻找空洞和隧道

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234073

S. Svensson, C. Arcelli, G. S. D. Baja

Topological properties are global features that can be useful for recognition of digital objects. For example, this is the case for objects having a complex shape without being decomposable into meaningful simple parts. In the case of 3D binary images, topological features are the object components, cavities, and tunnels. While object components and cavities are easy to define and identify, to our knowledge, no computationally convenient way to find tunnels is available. The aim of the paper is to fill this gap by presenting a convenient procedure to detect and represent tunnels in 3D objects.

拓扑属性是对数字对象识别有用的全局特征。例如，对于具有复杂形状而不能分解为有意义的简单部分的对象，情况就是如此。在三维二值图像的情况下，拓扑特征是对象组件、空腔和隧道。虽然物体组件和空腔很容易定义和识别，但据我们所知，没有计算上方便的方法可以找到隧道。本文的目的是通过提供一种方便的方法来检测和表示三维物体中的隧道，从而填补这一空白。

引用次数: 5

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

12th International Conference on Image Analysis and Processing, 2003.Proceedings.

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀