Proceedings 11th International Conference on Image Analysis and Processing最新文献

英文中文

Learning and caricaturing the face space using self-organization and Hebbian learning for face processing 利用自组织和Hebbian学习进行人脸空间的学习和漫画化

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957021

Albert Pujol, J. Villanueva, H. Wechsler

This paper shows a self-organized system designed to obtain compressed representations of instances of a population of visual forms. It is shown how, when applied to face shape information, the system evolves into a prototype of the population and induces automatic warping, or caricaturing, transformations where geometrical differences between forms are increased, improving, as a consequence, recognition performance. In this way, the proposed system provides a unified account for the whole chain of face processing tasks including data compression, detection, and recognition. Experimental data is presented to show the feasibility of our approach in terms of performance and robustness to changes in illumination and face expressions.

这篇论文展示了一个自组织系统，它被设计用来获得一群视觉形式实例的压缩表示。当应用于面部形状信息时，该系统如何演变成人口的原型，并诱导自动扭曲或漫画化，其中形状之间的几何差异增加，从而提高识别性能。通过这种方式，该系统为包括数据压缩、检测和识别在内的整个人脸处理任务链提供了一个统一的帐户。实验数据显示了我们的方法在性能和对光照和面部表情变化的鲁棒性方面的可行性。

引用次数: 6

Eigenspace-based recognition of faces: comparisons and a new approach 基于特征空间的人脸识别:比较与新方法

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956983

P. Navarrete, Javier Ruiz-del-Solar

Different eigenspace-based approaches have been proposed for the recognition of faces. They differ mostly in the kind of projection method used and in the similarity matching criterion employed. A first goal of this paper is to present a comparison between some of these different approaches. A second goal is to outline an adaptive, neural-based security access control system.

人们提出了不同的基于特征空间的人脸识别方法。它们的主要区别在于所使用的投影方法和所采用的相似度匹配准则。本文的第一个目标是对这些不同的方法进行比较。第二个目标是概述一个自适应的，基于神经的安全访问控制系统。

引用次数: 26

Interactive texture synthesis 交互纹理合成

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957048

P. Parada, J. Ruiz-del-Solar, W. Plagges, M. Koppen

The TEXRET (texture retrieval) system is a new texture database retrieval system, which is based on soft-computing technologies and that is under development. One of its main features is the generation of the requested textures when they are not found in the database, which allows a continuous growing of the database. The texture generation process, implemented using causal autoregressive models and interactive genetic algorithms, is described.

TEXRET(纹理检索)系统是一种基于软计算技术的正在发展中的新型纹理数据库检索系统。它的主要特性之一是当数据库中没有找到请求的纹理时生成请求的纹理，这允许数据库不断增长。描述了使用因果自回归模型和交互式遗传算法实现的纹理生成过程。

引用次数: 7

Vanishing point detection: representation analysis and new approaches 消失点检测:表征分析与新方法

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956990

V. Cantoni, L. Lombardi, M. Porta, Nicolas Sicard

We introduce two different representation approaches and propose two techniques to estimate the position of vanishing points in an image, one bused on a probabilistic strategy and the other focused on a deterministic analysis. Unlike most of the methods so far developed, which exploit the Gaussian sphere, the new techniques operate in the (/spl rho/, /spl theta/) polar parameter space and in the (x, y) image plane coordinate space. Both the solutions are described and compared, through the discussion of the results obtained from their application to real images.

我们介绍了两种不同的表示方法，并提出了两种技术来估计图像中消失点的位置，一种基于概率策略，另一种侧重于确定性分析。与迄今为止开发的大多数利用高斯球的方法不同，新技术在(/spl rho/， /spl theta/)极参数空间和(x, y)像平面坐标空间中操作。通过讨论两种方法在实际图像中的应用结果，对两种方法进行了描述和比较。

引用次数: 96

Classifying audio of movies by a multi-expert system 基于多专家系统的电影音频分类

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957040

M. D. Santo, G. Percannella, Carlo Sansone, M. Vento

The paper presents a system for the automatic MPEG format. In contrast to the approaches proposed up to now, it employs a multi-expert classification system arranged according to a multi-stage architecture. The system is able to recognize not only four pure classes (music, speech, silence and noise) but also confused audio signals, such as the ones resulting from the overlap of pure audio components (for example, speech overlapped with music or noise, etc.). An extensive experimental analysis has been carried on a large audio database extracted from about 30 moving pictures recorded on low-quality magnetic media. Results confirm the effectiveness of the approach, with an average improvement of about 45% with respect to single classifier solutions.

本文介绍了一个自动生成MPEG格式的系统。与目前提出的方法相比，该方法采用多专家分类系统，按照多阶段体系结构进行分类。该系统不仅能够识别四种纯粹的音频信号(音乐、语音、静音和噪声)，还能够识别混淆的音频信号，例如纯音频成分重叠产生的音频信号(例如语音与音乐或噪声重叠等)。对从低质量磁介质上录制的约30幅运动图像中提取的大型音频数据库进行了广泛的实验分析。结果证实了该方法的有效性，相对于单一分类器解决方案，平均提高了约45%。

引用次数: 10

Digital geometry fundaments: application to plane recognition 数字几何基础:在平面识别中的应用

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957079

J. Chassery, F. Dupont, Isabelle Sivignon, Joëlle Vittone

Triangulation, quadrangulation problems and more generally 3D object polyhedrization are an important subject of research. In digital geometry, a 3D object is seen as a set of voxels placed in a representation space only constituted of integers. The objective of the polyhedrization is to obtain a complete description of the object with faces, edges and vertices. The recognition of digital planes is a first step which is very important. We focus on digital naive planes that have been studied through their configurations of tricubes: of (n,m)-cubes and connected or not connected voxels set. The link between the normal equation of a plane and configuration of voxels set has been studied by the construction of the corresponding Farey net. We can find many references about the recognition of digital planes. Some algorithms were related to the construction of the convex hull of the studied voxels set. Other approaches use linear programming, mean square approximation or Fourier-Motzkin transform. The first algorithms entirely discrete recognized rectangular pieces of naive planes. Wwe describe an incremental algorithm to recognize any coplanar voxels set as a digital naive plane by using Farey nets. Then we propose a polyhedrization method able to give all the digital naive planes of the surface of the 3D object.

三角剖分、四边形问题以及更普遍的三维物体多面体化是一个重要的研究课题。在数字几何中，3D对象被视为一组体素，放置在仅由整数组成的表示空间中。多面体化的目的是获得对象的面、边和顶点的完整描述。数字平面的识别是非常重要的第一步。我们关注的是数字幼稚平面，这些平面已经通过三立方体的配置进行了研究:(n,m)个立方体和连接或不连接的体素集。通过构造相应的Farey网，研究了平面法向方程与体素集构型之间的联系。我们可以找到很多关于数字平面识别的参考文献。一些算法与所研究体素集的凸包构造有关。其他方法使用线性规划、均方近似或傅立叶-莫兹金变换。第一个算法完全分离了原始平面的可识别矩形块。本文描述了一种利用Farey网络将任意共面体素集识别为数字幼稚平面的增量算法。然后，我们提出了一种能够给出三维物体表面所有数字原始平面的多面体化方法。

{"title":"Digital geometry fundaments: application to plane recognition","authors":"J. Chassery, F. Dupont, Isabelle Sivignon, Joëlle Vittone","doi":"10.1109/ICIAP.2001.957079","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957079","url":null,"abstract":"Triangulation, quadrangulation problems and more generally 3D object polyhedrization are an important subject of research. In digital geometry, a 3D object is seen as a set of voxels placed in a representation space only constituted of integers. The objective of the polyhedrization is to obtain a complete description of the object with faces, edges and vertices. The recognition of digital planes is a first step which is very important. We focus on digital naive planes that have been studied through their configurations of tricubes: of (n,m)-cubes and connected or not connected voxels set. The link between the normal equation of a plane and configuration of voxels set has been studied by the construction of the corresponding Farey net. We can find many references about the recognition of digital planes. Some algorithms were related to the construction of the convex hull of the studied voxels set. Other approaches use linear programming, mean square approximation or Fourier-Motzkin transform. The first algorithms entirely discrete recognized rectangular pieces of naive planes. Wwe describe an incremental algorithm to recognize any coplanar voxels set as a digital naive plane by using Farey nets. Then we propose a polyhedrization method able to give all the digital naive planes of the surface of the 3D object.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122241362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Integrated tracking with vision and sound 集成跟踪与视觉和声音

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957034

A. Blake, Michel Gangnet, P. Pérez, J. Vermaak

The research summarised here is working towards automatic control systems for cameras, in support of remote meetings. Progress is reported on several fronts: use of active contours to track heads, stereo sound analysis applying particle filtering to handle both visual and aural clutter, and the use of exemplars for stabilisation of inter-frame matching.

这里总结的研究是为了支持远程会议的摄像机自动控制系统。在几个方面取得了进展:使用活动轮廓来跟踪头部，立体声分析应用粒子滤波来处理视觉和听觉杂波，以及使用示例来稳定帧间匹配。

引用次数: 5

Lip detection and tracking 唇形检测与跟踪

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956978

A. Caplier

Seeing the talker's lips in addition to audition can improve speech understanding which is rather based on lip shape temporal evolution than on absolute mouth shape. We propose a totally automatic algorithm which can extract lip shape over an image sequence. The algorithm does not require any make-up or markers and works under natural lighting conditions. The lip detection algorithm uses an active shape model to describe the mouth. After a training step, the mouth model is iteratively deformed under constraints according to spatiotemporal energies. The robust prior detection of mouth corners and Cupidon's arch yields the automatic positioning of the initial shape which is very difficult and must be as accurate as possible. Temporal information integration comes from the definition of Kalman filters on the independent mouth parameters. Such filtering gives an initial shape close to the final one which speeds up the convergence rate. We point out on the behaviour of our algorithm when a transition open mouth/closed mouth or closed mouth/open mouth occurs.

在听音的基础上看说话人的嘴唇可以提高对言语的理解，这是基于唇形的时间演变，而不是绝对的嘴型。提出了一种完全自动的唇形提取算法。该算法不需要任何化妆或标记，并在自然光条件下工作。唇形检测算法采用主动形状模型来描述口腔。经过一个训练步骤，嘴巴模型在时空能量约束下迭代变形。对嘴角和丘比顿弧度的鲁棒先验检测产生了初始形状的自动定位，这是非常困难的，必须尽可能准确。时间信息集成来自于对独立口参数的卡尔曼滤波的定义。这种滤波使初始形状接近最终形状，从而加快了收敛速度。我们指出了我们的算法在张嘴/闭口或闭口/张嘴发生转换时的行为。

引用次数: 34

Road signs recognition using a dynamic pixel aggregation technique in the HSV color space 基于HSV色彩空间的动态像素聚合技术的道路标志识别

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957071

S. Vitabile, G. Pollaccia, G. Pilato, F. Sorbello

We present a system for the whole road sign detection and recognition task. Road sign regions are detected and extracted from real-world scenes on the basis of their color and shape features. Color segmentation is performed introducing a dynamic threshold in the pixel aggregation process on the HSV color space. The dynamic threshold allows the reduction of hue instability in real scenes depending on external brightness variation. Experimental results, using real road images in different environment conditions, are also reported.

提出了一种完整的道路标志检测与识别系统。道路标志区域是根据其颜色和形状特征从现实场景中进行检测和提取的。在HSV色彩空间的像素聚合过程中引入动态阈值进行颜色分割。动态阈值允许减少现实场景中依赖于外部亮度变化的色调不稳定性。本文还报道了不同环境条件下真实道路图像的实验结果。

引用次数: 128

View synthesis from two uncalibrated images 从两个未校准的图像查看合成

Proceedings 11th International Conference on Image Analysis and Processing

Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957023

F. Dornaika

View synthesis becomes a focus of attention of both the computer graphics and computer vision communities. We present a new approach for synthesizing novel views from two uncalibrated images. The two reference images as well as the novel view do not share the same viewpoint. The developed approach incorporates computer vision methods. It consists of two stages. First, the parallax field between the reference images is recovered. Second, novel images are directly synthesized by exploiting the parallax invariance, using forward warping. Solutions to the visibility problem are proposed. Constructing realistic synthesized views from real image pairs are presented.

视图合成已成为计算机图形学和计算机视觉界关注的焦点。我们提出了一种从两个未校准图像合成新视图的新方法。两种参考图像和新视角并不具有相同的观点。所开发的方法结合了计算机视觉方法。它包括两个阶段。首先，恢复参考图像之间的视差场;其次，利用视差不变性，利用前向扭曲直接合成新图像;针对可见性问题提出了解决方案。提出了利用实像对构建逼真的合成视图的方法。

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings 11th International Conference on Image Analysis and Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀