Object recognition supported by user interaction for service robots最新文献

英文中文

Hierarchical recognition of intentional human gestures for sports video annotation 体育视频注释中有意人类手势的层次识别

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048493

Graeme S. Chambers, S. Venkatesh, G. West, H. Bui

We present a novel technique for the recognition of complex human gestures for video annotation using accelerometers and the hidden Markov model. Our extension to the standard hidden Markov model allows us to consider gestures at different levels of abstraction through a hierarchy of hidden states. Accelerometers in the form of wrist bands are attached to humans performing intentional gestures, such as umpires in sports. Video annotation is then performed by populating the video with time stamps indicating significant events, where a particular gesture occurs. The novelty of the technique lies in the development of a probabilistic hierarchical framework for complex gesture recognition and the use of accelerometers to extract gestures and significant events for video annotation.

我们提出了一种利用加速度计和隐马尔可夫模型来识别视频注释中复杂的人类手势的新技术。我们对标准隐马尔可夫模型的扩展允许我们通过隐藏状态的层次结构在不同的抽象层次上考虑手势。腕带形式的加速计附着在人类有意做出的手势上，比如体育比赛中的裁判。然后，通过在视频中填充时间戳来执行视频注释，这些时间戳表明在特定手势发生的地方发生了重要事件。该技术的新颖之处在于开发了用于复杂手势识别的概率层次框架，并使用加速度计提取手势和视频注释的重要事件。

引用次数: 99

The analysis of a stochastic differential approach for Langevin competitive learning algorithm Langevin竞争学习算法的一种随机微分方法分析

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048242

Jinwuk Seok, Jeun-Woo Lee

Recently, various types of neural network models have been used successfully to applications in pattern recognition, control, signal processing, and so on. However, the previous models are not suitable for hardware implementation due to their complexity. In this paper, we present a survey of the stochastic analysis for the Langevin competitive learning algorithm, known for its easy hardware implementation. Since the Langevin competitive learning algorithm uses a time-invariant learning rate and a stochastic reinforcement term, it is necessary to analyze with stochastic differential or difference equation. The result of the analysis verifies that the Langevin competitive learning process is equal to the standard Ornstein-Uhlenback process and has a weak convergence property. The experimental results for Gaussian distributed data confirm the analysis provided in this paper.

近年来，各种类型的神经网络模型已成功地应用于模式识别、控制、信号处理等领域。但是，以前的模型由于其复杂性而不适合硬件实现。在本文中，我们提出了随机分析的朗之万竞争学习算法，以其易于硬件实现而闻名。由于Langevin竞争学习算法使用时不变学习率和随机强化项，因此有必要使用随机微分或差分方程进行分析。分析结果验证了Langevin竞争学习过程与标准的Ornstein-Uhlenback过程相等，并且具有弱收敛性。对高斯分布数据的实验结果证实了本文的分析。

引用次数: 0

Auto-calibration via the absolute quadric and scene constraints 通过绝对二次曲线和场景约束进行自动校准

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048381

A. Heyden, D. Huynh

A scheme is described for incorporation of scene constraints into the structure from motion problem. Specifically, the absolute quadric is recovered with constraints imposed by orthogonal scene planes. The scheme involves a number of steps. A projective reconstruction is first obtained, followed by a linear technique to form an initial estimate of the absolute quadric. A nonlinear iteration then refines this quadric and the camera intrinsic parameters to upgrade the projective reconstruction to Euclidean. Finally, a bundle adjustment algorithm optimizes the Euclidean reconstruction to give a statistically optimal result. This chain of algorithms is essentially the same as used in auto-calibration and the novelty of this paper is the inclusion of orthogonal scene plane constraints in each step. The algorithms involved are demonstrated on both simulated and real data showing the performance and usability of the proposed scheme.

从运动问题出发，提出了一种将场景约束引入结构的方案。具体来说，在正交场景平面的约束下恢复绝对二次曲线。该计划包括若干步骤。首先得到投影重建，然后用线性技术形成绝对二次曲线的初始估计。然后进行非线性迭代，对该二次曲面和摄像机的固有参数进行细化，将投影重建升级为欧几里得重建。最后，采用束平差算法对欧几里得重构进行优化，得到统计上最优的结果。该算法链本质上与自动校准中使用的算法链相同，本文的新颖之处在于在每一步中都包含了正交场景平面约束。所涉及的算法在模拟和实际数据上进行了验证，证明了所提出方案的性能和可用性。

引用次数: 6

Coupled Markov chains for video contents characterization 视频内容表征的耦合马尔可夫链

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048338

J. Sánchez, Xavier Binefa, J. Kender

We propose a compact descriptor of video contents based on modeling the temporal behavior of image features using coupled Markov chains. The framework allows us to combine multiple features within the same model, including the representation of the dependencies and relationships between them. The Kullback-Leibler divergence stands out as the base of a perceptually significant distance measure for our descriptor Our experiments show that complex highlevel visual contents in different domains can be characterized using very simple low-level features, such as motion and color.

基于对图像特征的时间行为进行耦合马尔可夫链建模，提出了一种紧凑的视频内容描述符。该框架允许我们在同一个模型中组合多个特性，包括依赖关系的表示和它们之间的关系。我们的实验表明，不同领域的复杂高级视觉内容可以使用非常简单的低级特征(如运动和颜色)来表征。

引用次数: 8

A neural architecture for fast and robust face detection 一种快速鲁棒人脸检测的神经网络结构

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048232

Christophe Garcia, M. Delakis

In this paper, we present a connectionist approach for detecting and precisely localizing semi-frontal human faces in complex images, making no assumption about the content or the lighting conditions of the scene, or about the size or the appearance of the faces. We propose a convolutional neural network architecture designed to recognize strongly variable face patterns directly from pixel images with no preprocessing, by automatically synthesizing its own set of feature extractors from a large training set of faces. We present in details the optimized design of our architecture, our learning strategy and the resulting process of face detection. We also provide experimental results to demonstrate the robustness of our approach and its capability to precisely detect extremely variable faces in uncontrolled environments.

在本文中，我们提出了一种连接主义方法，用于检测和精确定位复杂图像中的半正面人脸，不假设场景的内容或照明条件，也不假设人脸的大小或外观。我们提出了一种卷积神经网络架构，旨在通过从大量人脸训练集中自动合成自己的一组特征提取器，直接从像素图像中识别强变量的人脸模式，而无需预处理。我们详细介绍了我们的架构的优化设计，我们的学习策略和最终的人脸检测过程。我们还提供了实验结果来证明我们的方法的鲁棒性及其在不受控制的环境中精确检测极端可变人脸的能力。

引用次数: 99

Object segmentation and tracking using video locales 对象分割和跟踪使用视频场所

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048360

J. Au, Ze-Nian Li, M. S. Drew

In this paper, we present a new technique based on feature localization for segmenting and tracking objects in videos. A video locale is a sequence of image feature locales that share similar features (color, texture, shape, and motion) in the spatio-temporal domain of videos. Image feature locales are grown from tiles (blocks of pixels) and can be non-disjoint and non-connected. To exploit the temporal redundancy in digital videos, two algorithms (intra-frame and inter-frame) are used to grow locales efficiently. Multiple motion tracking is achieved by tracking and performing tile-based dominant motion estimation for each locale separately.

本文提出了一种基于特征定位的视频目标分割与跟踪技术。视频区域设置是一系列图像特征区域设置，它们在视频的时空域中共享相似的特征(颜色、纹理、形状和运动)。图像特征区域是从块(像素块)中生长出来的，可以是不相交和不连接的。为了利用数字视频中的时间冗余，采用帧内和帧间两种算法来有效地增长区域。多运动跟踪是通过对每个区域分别跟踪和执行基于tile的主导运动估计来实现的。

引用次数: 2

Combined color and texture segmentation by parametric distributional clustering 基于参数分布聚类的颜色和纹理结合分割

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048380

Thomas Zöller, L. Hermes, J. Buhmann

Unsupervised image segmentation can be formulated as a clustering problem in which pixels or small image patches are grouped together based on local feature information. In this contribution, parametric distributional clustering (PDC) is presented as a novel approach to image segmentation based on color and texture clues. The objective function of the PDC model is derived from the recently proposed Information Bottleneck framework (Tishby et al., 1999), but it can equivalently be formulated in terms of a maximum likelihood solution. Its optimization is performed by deterministic annealing. Segmentation results are shown for natural wildlife imagery.

无监督图像分割可以表述为基于局部特征信息将像素或小图像块分组在一起的聚类问题。在这篇贡献中，参数分布聚类(PDC)是一种基于颜色和纹理线索的图像分割新方法。PDC模型的目标函数来源于最近提出的信息瓶颈框架(Tishby et al.， 1999)，但它可以等效地用最大似然解来表示。采用确定性退火方法对其进行优化。自然野生动物图像的分割结果。

引用次数: 10

Improved MSEL and its medical application 改进MSEL及其医学应用

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048373

Huiguang He, Jie Tian, Jing Wang, Hong Chen, X. P. Zhang

Edge detection is the basic operation in image processing and analysis. Multiresolution sequential edge linking (MSEL) Cook and Delp (1995) has a number of advantages over other edge detection schemes, such as lower false alarm rates while maintaining full connectivity of the edge. However, it is not reasonable in the initial value selection and is time consuming. For this problem, we first use anisotropic diffusion to smooth the image while keeping the edge, and then use the feedback method to optimize the initial value. We apply our method to a medical image, and experiments show that our method is more efficient and accurate than the old MSEL.

边缘检测是图像处理和分析的基本操作。Cook和Delp(1995)与其他边缘检测方案相比，多分辨率顺序边缘连接(MSEL)具有许多优点，例如在保持边缘完全连接的同时降低误报率。但是，它在初始值选择上不合理，且耗时。针对该问题，我们首先使用各向异性扩散在保持边缘的情况下对图像进行平滑处理，然后使用反馈方法对初始值进行优化。将该方法应用于医学图像，实验结果表明，该方法比传统的MSEL方法更有效、更准确。

引用次数: 0

Empirical evaluation of MPEG-7 XM color descriptors in content-based retrieval of semantic image categories MPEG-7 XM颜色描述符在基于内容的语义图像分类检索中的实证评价

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048479

T. Ojala, Markus Aittola, Esa Matinmikko

This paper conducts an empirical evaluation of MPEG-7 visual part of experimentation model (XM) color descriptors in a challenging problem of content-based retrieval of semantic image categories. The performance of the four color descriptors provided in the current XM reference implementation, Color Layout, Color Structure, Dominant Color and Scalable Color is compared to that of HSV autocorrelogram, which has done well in recent empirical studies. Experimental results show that Color Structure provides best retrieval accuracy, whereas the computationally most expensive descriptor Dominant Color is worst in this problem.

本文针对基于内容的语义图像类别检索这一具有挑战性的问题，对MPEG-7实验模型视觉部分(XM)颜色描述符进行了实证评价。将当前XM参考实现中提供的四种颜色描述符(颜色布局、颜色结构、主色和可扩展色)的性能与HSV自相关图的性能进行了比较，HSV自相关图在最近的实证研究中取得了不错的成绩。实验结果表明，Color Structure提供了最好的检索精度，而计算代价最高的描述符Dominant Color在该问题中是最差的。

引用次数: 37

A novel approach for single view based plane metrology 一种基于单视图的平面测量新方法

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048364

Guanghui Wang, Yihong Wu, Zhanyi Hu

An approach is proposed for single view based plane metrology. The approach is based on a pair of vanishing points from two orthogonal sets of space parallel lines. Extensive experiments on simulated data as well as on real images showed that our new approach can achieve as good result as that of the homography based one which is widely used in the literature, but our new approach does not need any explicit specifications of space control points. Since in many real applications, particularly, in indoor environment, orthogonal lines are not rare, for example, a frame of window or a door, our new approach is of widely applicable.

提出了一种基于单视图的平面测量方法。该方法基于两个空间平行线正交集合的一对消失点。在模拟数据和真实图像上的大量实验表明，该方法可以达到与文献中广泛使用的基于单应性的方法相同的效果，但该方法不需要任何明确的空间控制点规范。由于在许多实际应用中，特别是在室内环境中，正交线并不罕见，例如窗框或门框，因此我们的新方法具有广泛的适用性。

引用次数: 8

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Object recognition supported by user interaction for service robots

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀