{"title":"A Nonlinear-Shift Approach to Object Tracking Based on Shape Information","authors":"M. Asadi, A. Beoldo, C. Regazzoni","doi":"10.1109/ICIAP.2007.14","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.14","url":null,"abstract":"This paper presents a corner-based method for tracking objects in video frames. The method uses a vectorial shape representation based on the relative positions of the object's main corners, together with a non-linear voting method to evaluate the new object position at each iteration. Initialization consists of identifying an area that includes the object to be tracked. Information about the distribution of corners around a reference point is used to find the most probable target position in the next frame. The method can be used with both fixed and mobile cameras, for both vehicles and pedestrians.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132640965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
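The shape-and-voting idea in the abstract above can be sketched in a generalised-Hough style. This is an editorial illustration, not the authors' non-linear voting scheme: the integer corner coordinates, the offset model and the accumulator grid are all simplifying assumptions.

```python
import numpy as np

def vote_position(model_offsets, corners, grid_shape):
    """Generalised-Hough-style voting: every detected corner,
    tentatively matched to every model corner, votes for the
    reference-point position it would imply; the accumulator
    peak is the estimated object position."""
    votes = np.zeros(grid_shape, dtype=int)
    for cx, cy in corners:
        for ox, oy in model_offsets:
            rx, ry = cx - ox, cy - oy     # implied reference point
            if 0 <= rx < grid_shape[1] and 0 <= ry < grid_shape[0]:
                votes[ry, rx] += 1
    ry, rx = np.unravel_index(votes.argmax(), votes.shape)
    return (int(rx), int(ry)), votes
```

Corners consistent with the shape model reinforce one accumulator cell, while spurious corners scatter their votes, which is what makes such voting tolerant of partial occlusion.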
{"title":"A Novel Anchorperson Detection Algorithm Based on Spatio-temporal Slice","authors":"Anan Liu, Sheng Tang, Yongdong Zhang, Jintao Li, Zhaoxuan Yang","doi":"10.1109/ICIAP.2007.15","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.15","url":null,"abstract":"To navigate and edit news programs conveniently, it is important to segment the video into meaningful units. Effective indexing of news videos can be achieved through anchorperson shots, since they indicate the start of upcoming news stories. This paper presents a novel anchorperson detection algorithm based on the spatio-temporal slice (STS). With STS pattern analysis, clustering and decision fusion, anchorperson shots can be detected for browsing news video. Large-scale experimental results demonstrate that the algorithm is accurate, robust and effective.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132920622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
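A spatio-temporal slice is simply one image row (or column) stacked over time. The sketch below illustrates the idea with a hypothetical static-shot test based on per-column temporal variance; the threshold and the variance criterion are assumptions, not the paper's STS pattern analysis, clustering and decision-fusion pipeline.

```python
import numpy as np

def spatio_temporal_slice(frames, row):
    """Stack a single image row across T frames into a (T, W) slice.
    Static content, such as an anchorperson shot, produces vertical
    stripes, i.e. columns with low temporal variance."""
    return np.stack([np.asarray(f)[row] for f in frames])

def is_static_segment(sts, var_thresh=1.0):
    """Flag a segment as static when the mean per-column temporal
    variance of its slice falls below a threshold."""
    return bool(sts.var(axis=0).mean() < var_thresh)
```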
{"title":"Optical Flow Computation on Compute Unified Device Architecture","authors":"Y. Mizukami, Katsumi Tadamura","doi":"10.1109/ICIAP.2007.97","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.97","url":null,"abstract":"In this study, the implementation of an image processing technique on compute unified device architecture (CUDA) is discussed. CUDA is a new hardware and software architecture developed by NVIDIA Corporation for the general-purpose computation on graphics processing units. CUDA features an on-chip shared memory with very fast general read and write access, which enables threads in a block to share their data effectively. CUDA also provides a user-friendly development environment through an extension to the C programming language. This study focused on CUDA implementation of a representative optical flow computation proposed by Horn and Schunck in 1981. Their method produces the dense displacement field and has a straightforward processing procedure. A CUDA implementation of Horn and Schunck's method is proposed and investigated based on simulation results.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132537340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
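Horn and Schunck's method iterates a Jacobi-style update derived from the brightness-constancy and smoothness terms. A minimal CPU-only NumPy sketch follows (not the paper's CUDA implementation; the derivative and averaging stencils are simplified assumptions):

```python
import numpy as np

def horn_schunck(im1, im2, alpha=1.0, n_iter=100):
    """Minimal Horn-Schunck optical flow: returns a dense (u, v)
    displacement field minimising the brightness-constancy error
    plus an alpha-weighted smoothness term."""
    im1 = im1.astype(np.float64)
    im2 = im2.astype(np.float64)

    # Image derivatives, averaged over the two frames for symmetry.
    Ix = (np.gradient(im1, axis=1) + np.gradient(im2, axis=1)) / 2
    Iy = (np.gradient(im1, axis=0) + np.gradient(im2, axis=0)) / 2
    It = im2 - im1

    def local_mean(f):
        # 4-neighbour average used in the Jacobi update.
        out = np.zeros_like(f)
        out[:-1] += f[1:];  out[1:] += f[:-1]
        out[:, :-1] += f[:, 1:];  out[:, 1:] += f[:, :-1]
        return out / 4.0

    u = np.zeros_like(im1)
    v = np.zeros_like(im1)
    for _ in range(n_iter):
        u_bar, v_bar = local_mean(u), local_mean(v)
        # Closed-form Jacobi step from the Euler-Lagrange equations.
        num = Ix * u_bar + Iy * v_bar + It
        den = alpha ** 2 + Ix ** 2 + Iy ** 2
        u = u_bar - Ix * num / den
        v = v_bar - Iy * num / den
    return u, v
```

Each pixel's update touches only its four neighbours, which is exactly why the scheme maps well onto CUDA thread blocks with shared memory.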
{"title":"Colour and Geometric based Model for Lip Localisation: Application for Lip-reading System","authors":"S. Werda, W. Mahdi, A. B. Hamadou","doi":"10.1109/ICIAP.2007.42","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.42","url":null,"abstract":"Motivated by humans' ability to lip-read, the visual component is considered a source of information for speech recognition systems. Lip-reading is the perception of speech based purely on observing the talker's lip movements. The major difficulty of a lip-reading system is the extraction of the visual speech descriptors. To ensure this task, it is necessary to carry out automatic localization and tracking of the labial gestures. We present in this paper a new automatic approach for localizing the lips and points of interest on a speaker's face, based on both the color information of the mouth and a geometric model of the lips. This hybrid solution makes our method more tolerant to noise and artifacts in the image. Experiments revealed that our lip POI localization approach for lip-reading purposes is promising. The presented results show that our system recognizes 94.64% of French visemes.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116729714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fast extraction of multi-resolution Gabor features","authors":"J. Ilonen, J. Kämäräinen, H. Kälviäinen","doi":"10.1109/ICIAP.2007.67","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.67","url":null,"abstract":"Gabor filter responses are general-purpose features for computer vision and image processing and have been very successful in many application areas, for example in biometric authentication (fingerprint matching, face detection, face recognition and iris recognition). In a typical feature construction, filters are utilised in a multi-resolution structure of several filters tuned to different frequencies and orientations. The multi-resolution structure is similar to wavelets, but the non-orthogonality of Gabor functions is responsible for their main weakness: high computational cost. This complexity prevents their use in many real-time or near real-time tasks. In this study, an efficient sequential computation method for multi-resolution Gabor features is presented.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133790201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
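A multi-resolution Gabor feature bank of the kind described above can be sketched with FFT-based convolution. The kernel parametrisation and the filter-bank layout below are standard textbook choices, not the paper's optimised sequential method:

```python
import numpy as np
from numpy.fft import fft2, ifft2

def gabor_kernel(freq, theta, sigma, size=31):
    """Complex Gabor kernel tuned to spatial frequency `freq`
    (cycles/pixel) and orientation `theta` (radians)."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    xr = x * np.cos(theta) + y * np.sin(theta)   # carrier axis
    envelope = np.exp(-(x ** 2 + y ** 2) / (2 * sigma ** 2))
    return envelope * np.exp(2j * np.pi * freq * xr)

def gabor_bank(image, freqs, n_orient, sigma=4.0):
    """One complex response map per (frequency, orientation) pair,
    computed by FFT convolution (circular boundary handling)."""
    H, W = image.shape
    spectrum = fft2(image)   # image transformed once, reused per filter
    responses = []
    for f in freqs:
        for k in range(n_orient):
            g = gabor_kernel(f, k * np.pi / n_orient, sigma)
            responses.append(ifft2(spectrum * fft2(g, s=(H, W))))
    return np.stack(responses)
```

The cost the abstract refers to is visible here: every (frequency, orientation) pair needs its own convolution, since the non-orthogonal filters cannot share work the way an orthogonal wavelet pyramid does.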
{"title":"Panoramic mosaicing optimization","authors":"Lionel Robinault, S. Bres, S. Miguet","doi":"10.1109/ICIAP.2007.100","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.100","url":null,"abstract":"Motorized dome-type cameras, also called PTZ cameras, allow the creation of panoramas that represent the whole scene visible to the camera. For a PTZ camera, under certain constraints, the scene seen by the camera can be considered a sphere, and creating a panorama consists of traversing this sphere exhaustively. The acquired images are then projected onto a support such as a cylinder, a cube or another surface. The projection of the rectangular images onto a sphere inevitably involves partial overlap between images, and these overlaps lead to redundant computation. In order to limit the number of images, we propose the calculation of an optimal trajectory for the camera according to intrinsic and extrinsic constraints.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133833544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
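The trade-off being optimised, covering the view sphere with as few overlapping images as possible, can be illustrated for the pan axis alone. The constant-overlap model below is a simplifying assumption, not the paper's trajectory optimisation:

```python
import math

def pan_steps(pan_range_deg, hfov_deg, overlap):
    """Number of images needed to cover `pan_range_deg` of pan with
    horizontal field of view `hfov_deg`, keeping the fraction
    `overlap` (0 <= overlap < 1) of each image shared with the next."""
    step = hfov_deg * (1.0 - overlap)   # useful angular advance per shot
    return math.ceil(pan_range_deg / step)
```

For instance, a 60° field of view with 20% overlap advances 48° per shot, so a full 360° pan needs eight images; cutting unnecessary overlap directly cuts acquisition and stitching cost.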
{"title":"Hybrid Stereo Sensor with Omnidirectional Vision Capabilities: Overview and Calibration Procedures","authors":"S. Cagnoni, M. Mordonini, Luca Mussi, G. Adorni","doi":"10.1109/ICIAP.2007.77","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.77","url":null,"abstract":"In this paper, we present a compact hybrid video sensor that combines perspective and omnidirectional vision to achieve a 360° field of view, as well as high-resolution images. Those characteristics, in association with 3D metric reconstruction capabilities, are suitable for vision tasks such as surveillance and obstacle detection for autonomous robot navigation. We describe the sensor calibration procedure, with particular regard to mirror-to-camera positioning. We also present some results obtained in testing the accuracy of 3D reconstruction, which have confirmed the correctness of the calibration.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129320358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rectification of 3D Data Obtained from Moving Range Sensors by using Multiple View Geometry","authors":"K. Kozuka, J. Sato","doi":"10.1109/ICIAP.2007.110","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.110","url":null,"abstract":"For measuring the 3D shape of large objects, scanning with a moving range sensor is one of the most efficient methods. However, if we use moving range sensors, the obtained data have distortions due to the movement of the sensor during the scanning process. In this paper, we propose a method for recovering correct 3D range data from a moving range sensor by using multiple view geometry. We assume that the range sensor emits laser beams in raster-scan order and that they are observed by a static camera. We first show that range data can be treated as 3D space-time images, and that extended multiple view geometry can represent the relationship between the 3D space-time of the camera images and the 3D space-time of the range data. We next show that multiple view geometry under extended projections can be used for rectifying 3D data obtained by the moving range sensor. The method is implemented and tested on synthetic images and range data. The stability of the recovered 3D shape is also evaluated.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123803381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A robust measure for visual correspondence","authors":"Federico Tombari, L. D. Stefano, S. Mattoccia","doi":"10.1109/ICIAP.2007.16","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.16","url":null,"abstract":"In this paper a novel measure for visual correspondence is proposed, to be adopted for common computer vision tasks such as pattern matching, stereo vision and change detection. The proposed measure implicitly exploits the concept of order preservation between neighbouring pixels and is suitable for cases where disturbance factors such as photometric distortions and occlusions occur between the images to be matched. Furthermore, the measure tends to be robust in the presence of significant amounts of noise, which can be introduced, e.g., by cheap camera sensors. Experimental results demonstrate the effectiveness of the proposed approach in a typical template matching scenario as well as in an application dealing with secure gate access control.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124070300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
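The order-preservation idea can be illustrated with the census transform, a well-known ordering-based descriptor (the paper's measure is a different construction, but it shares this kind of invariance): comparisons against the centre pixel depend only on the local ranking of intensities, so the codes survive any monotonic photometric distortion.

```python
import numpy as np

def census_transform(img):
    """8-bit census transform: encode, for each interior pixel, which
    of its 8 neighbours is darker. The code depends only on the local
    ordering of intensities, so it is invariant to any monotonic
    photometric distortion."""
    img = np.asarray(img, dtype=np.float64)
    h, w = img.shape
    centre = img[1:-1, 1:-1]
    code = np.zeros((h - 2, w - 2), dtype=np.uint16)
    bit = 0
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            if dr == 0 and dc == 0:
                continue
            neigh = img[1 + dr:h - 1 + dr, 1 + dc:w - 1 + dc]
            code |= (neigh < centre).astype(np.uint16) << bit
            bit += 1
    return code

def hamming_cost(c1, c2):
    """Matching cost between two census images: total Hamming distance."""
    x = np.bitwise_xor(c1, c2)
    return int(np.unpackbits(x.view(np.uint8)).sum())
```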
{"title":"Grey Weighted Polar Distance Transform for Outlining Circular and Approximately Circular Objects","authors":"K. Norell, Joakim Lindblad, S. Svensson","doi":"10.1109/ICIAP.2007.74","DOIUrl":"https://doi.org/10.1109/ICIAP.2007.74","url":null,"abstract":"We introduce the polar distance transform and the grey weighted polar distance transform for computation of minimum cost paths preferring circular shape, as well as give algorithms for implementations in a digital setting. An alternative to the polar distance transform is to transform the image to polar coordinates, and then apply a Cartesian distance transform. By using the polar distance transform, resampling of the image and interpolation of new pixel values are avoided. We also handle the case of grey weighted distance transform in a 5 × 5 neighbourhood, which, to our knowledge, is new. Initial results of using the grey weighted polar distance transform to outline annual rings in images of log end faces are presented.","PeriodicalId":118466,"journal":{"name":"14th International Conference on Image Analysis and Processing (ICIAP 2007)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127787761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
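The grey-weighted (Cartesian) distance transform that the polar variants build on can be sketched with Dijkstra's algorithm over the pixel grid. The seed initialisation and the step-cost convention below are assumptions; the paper's polar transforms additionally bias step costs to favour circular paths.

```python
import heapq
import numpy as np

def grey_weighted_dt(cost, seeds):
    """Grey-weighted distance transform: minimum accumulated grey
    value over 8-connected paths from any seed pixel, computed with
    Dijkstra's algorithm. A step costs the mean of the two pixel
    values scaled by the Euclidean step length."""
    H, W = cost.shape
    dist = np.full((H, W), np.inf)
    heap = []
    for r, c in seeds:
        dist[r, c] = cost[r, c]          # seed initialised to its own value
        heapq.heappush(heap, (dist[r, c], r, c))
    while heap:
        d, r, c = heapq.heappop(heap)
        if d > dist[r, c]:
            continue                      # stale queue entry
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                if dr == 0 and dc == 0:
                    continue
                rr, cc = r + dr, c + dc
                if 0 <= rr < H and 0 <= cc < W:
                    nd = d + np.hypot(dr, dc) * (cost[r, c] + cost[rr, cc]) / 2
                    if nd < dist[rr, cc]:
                        dist[rr, cc] = nd
                        heapq.heappush(heap, (nd, rr, cc))
    return dist
```

Running this on a polar-resampled image is the "alternative" the abstract mentions; the polar distance transform avoids that resampling by redefining the step costs directly on the original grid.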