Object recognition supported by user interaction for service robots最新文献

英文中文

Tangent distance kernels for support vector machines 支持向量机的切距离核

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048439

B. Haasdonk, Daniel Keysers

When dealing with pattern recognition problems one encounters different types of a-priori knowledge. It is important to incorporate such knowledge into the classification method at hand. A very common type of a-priori knowledge is transformation invariance of the input data, e.g. geometric transformations of image-data like shifts, scaling etc. Distance based classification methods can make use of this by a modified distance measure called tangent distance. We introduce a new class of kernels for support vector machines which incorporate tangent distance and therefore are applicable in cases where such transformation invariances are known. We report experimental results which show that the performance of our method is comparable to other state-of-the-art methods, while problems of existing ones are avoided.

在处理模式识别问题时，人们会遇到不同类型的先验知识。将这些知识纳入手边的分类方法是很重要的。一种非常常见的先验知识是输入数据的变换不变性，例如图像数据的几何变换，如移位、缩放等。基于距离的分类方法可以通过一种称为切线距离的改进距离度量来利用这一点。我们为支持向量机引入了一类新的核，它包含了切线距离，因此适用于已知这种变换不变性的情况。我们报告的实验结果表明，我们的方法的性能可与其他最先进的方法相媲美，同时避免了现有方法的问题。

引用次数: 129

Painter identification using local features and naive Bayes 基于局部特征和朴素贝叶斯的画家识别

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/icpr.2002.1048341

D. Keren

The goal of this paper is to offer a framework for image classification "by type". For example, one may want to classify an image of a certain office as man-made - as opposed to outdoor scene, even if no image of a similar office exists in the training set. This is accomplished by using local features, and by using the naive Bayes classifier. The application presented here is classification of paintings; after the system is presented with a sample of paintings of various artists, it tries to determine who was the painter who painted it. The result is local - each small image block is assigned a painter, and a majority vote determines the painter. The results are roughly visually consistent with human perception of various artists' style.

本文的目标是为“按类型”的图像分类提供一个框架。例如，人们可能想要将某个办公室的图像分类为人造的，而不是室外场景，即使训练集中不存在类似办公室的图像。这是通过使用局部特征和朴素贝叶斯分类器来实现的。这里介绍的应用是绘画分类;在向系统展示了不同艺术家的画作样本后，它试图确定谁是画这幅画的画家。结果是局部的-每个小图像块被分配一个画家，并且大多数投票决定画家。结果在视觉上与人类对各种艺术家风格的感知大致一致。

引用次数: 68

Hexagonal image representation for 3-D photorealistic reconstruction 三维真实感重建的六边形图像表示

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048391

Hidenori Sato, H. Matsuoka, A. Onozawa, H. Kitazawa

A new data structure for representing the color of rays from multiple-view images is presented. The structure is a hexagonal tessellation generated from a buckyball. Using the structure, the captured colors are represented as pixel values of a hexagonal image, and the image is finally saved as a compressed normal image after a simple transformation without loss of connectivity. A surface light field data generating algorithm based on image-based scheme is also presented It assigns color for each vertex of the reconstructed surface using the structure. The experimental results show that the algorithm yields the surface light field data in a short time. In addition, photorealistic rendered views are obtained from arbitrary viewpoints.

提出了一种表示多视图图像光线颜色的新数据结构。结构是由巴基球生成的六边形镶嵌体。利用该结构，将捕获的颜色表示为六边形图像的像素值，并在不丢失连通性的情况下进行简单变换，最终将图像保存为压缩后的正常图像。提出了一种基于图像的表面光场数据生成算法，利用该结构为重建表面的每个顶点分配颜色。实验结果表明，该算法能在短时间内得到表面光场数据。此外，从任意视点获得逼真的渲染视图。

引用次数: 4

A new approach to solving Kruppa equations for camera self-calibration 求解相机自标定Kruppa方程的新方法

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048301

Lei Cheng, Fuchao Wu, Zhanyi Hu, H. Tsui

We propose an approach to solving the Kruppa equations for camera self-calibration. Traditionally, the unknown scale factors in the Kruppa equations are eliminated first, leading to a set of nonlinear constraints. Instead, we determine the scale factors by a Levenberg-Marquardt optimization or genetic optimization technique first. Then, the camera's intrinsic parameters are derived from the resulting linear constraints. Extensive simulations as well as experiments with real images verify that the above technique is both accurate and robust.

提出了一种求解相机自标定Kruppa方程的方法。传统的方法是先消除Kruppa方程中的未知尺度因子，从而得到一组非线性约束。相反，我们首先通过Levenberg-Marquardt优化或遗传优化技术确定比例因子。然后，由得到的线性约束导出相机的固有参数。大量的仿真和真实图像实验验证了上述技术的准确性和鲁棒性。

引用次数: 21

Movie genre classification by exploiting audio-visual features of previews 利用预告片的视听特征进行电影类型分类

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048494

Z. Rasheed, M. Shah

We present a method to classify movies on the basis of audio-visual cues present in previews. A preview summarizes the main idea of a movie providing a suitable amount of information to perform genre classification. In our approach movies are initially classified into action and non-action by computing the visual disturbance feature and average shot length of every movie. Visual disturbance is defined as a measure of motion content in a clip. Next we use color, audio and cinematic principles for further classification into comedy, horror drama/other and movies containing explosions and gunfire. This work is a step towards automatically building and updating a video database, thus resulting in minimum human intervention. Other potential applications include browsing and retrieval of videos on the Internet (video-on-demand), video libraries, and rating of movies.

我们提出了一种基于预告片中出现的视听线索对电影进行分类的方法。预告片总结了电影的主要思想，提供了适当数量的信息来进行类型分类。在我们的方法中，首先通过计算视觉干扰特征和每部电影的平均镜头长度将电影分为动作和非动作。视觉干扰被定义为片段中运动内容的度量。接下来，我们使用颜色，音频和电影原则来进一步分类为喜剧，恐怖戏剧/其他和包含爆炸和枪声的电影。这项工作是朝着自动建立和更新视频数据库迈出的一步，从而减少了人为干预。其他潜在的应用包括浏览和检索互联网上的视频(视频点播)、视频库和电影评级。

引用次数: 98

Improved estimation of hidden Markov model parameters from multiple observation sequences 多观测序列隐马尔可夫模型参数的改进估计

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048264

Richard I. A. Davis, B. Lovell, T. Caelli

The huge popularity of hidden Markov models (HMMs) in pattern recognition is due to the ability to "learn" model parameters from an observation sequence through Baum-Welch and other re-estimation procedures. In the case of HMM parameter estimation from an ensemble of observation sequences, rather than a single sequence, we require techniques for finding the parameters which maximize the likelihood of the estimated model given the entire set of observation sequences. The importance of this study is that HMMs with parameters estimated from multiple observations are shown to be many orders of magnitude more probable than HMM models learned from any single observation sequence - thus the effectiveness of HMM "learning" is greatly enhanced. In this paper we present techniques that usually find models significantly more likely than Rabiner's well-known method on both seen and unseen sequences.

隐马尔可夫模型(hmm)在模式识别中的巨大流行是由于能够通过Baum-Welch和其他重新估计过程从观察序列中“学习”模型参数。在从一组观测序列而不是单个序列中进行HMM参数估计的情况下，我们需要找到在给定整个观测序列集的情况下使估计模型的似然值最大化的参数的技术。本研究的重要性在于，从多个观测值中估计参数的HMM模型比从任何单个观测序列中学习的HMM模型的可能性要高许多个数量级，从而大大提高了HMM“学习”的有效性。在本文中，我们提出的技术通常比Rabiner的著名方法在可见和未见序列上更有可能找到模型。

引用次数: 48

Interactive visual pattern recognition 交互式视觉模式识别

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048342

G. Nagy, Jie Zou

Computer Assisted Visual Interactive Recognition (CAVIAR) draws on sequential pattern recognition, image database, expert systems, pen computing, and digital camera technology. It is designed to recognize wildflowers and other families of similar objects more accurately than machine vision and faster than most laypersons. The novelty of the approach is that human perceptual ability is exploited through interaction with the image of the unknown object. The computer remembers the characteristics of all previously seen classes, suggests possible operator actions, and displays confidence scores based on already detected features. In one application, consisting of 80 test images of wildflowers, 10 laypersons averaged 80% recognition accuracy at 12 seconds per flower.

计算机辅助视觉交互识别(CAVIAR)利用顺序模式识别、图像数据库、专家系统、笔式计算和数码相机技术。它被设计用来识别野花和其他类似的物体，比机器视觉更准确，比大多数外行人更快。该方法的新颖之处在于，人类的感知能力是通过与未知物体的图像交互来开发的。计算机会记住之前看到的所有类别的特征，建议可能的操作，并根据已经检测到的特征显示置信度分数。在一个由80张野花测试图像组成的应用程序中，10名外行人在每张花的12秒内平均识别准确率为80%。

引用次数: 26

The economics of classification: error vs. complexity 分类经济学:错误vs.复杂性

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048284

D. Ridder, E. Pekalska, R. Duin

Although usually classifier error is the main concern in publications, in real applications classifier evaluation complexity may play a large role as well. In the paper, a simple economic model is proposed with which a trade-off between classifier error and calculated evaluation complexity can be formulated. This trade-off can then be used to judge the necessity of increasing sample size or number of features to decrease classification error or, conversely, feature extraction or prototype selection to decrease evaluation complexity. The model is applied to the benchmark problem of handwritten digit recognition and is shown to lead to interesting conclusions, given certain assumptions.

尽管分类器错误通常是出版物中主要关注的问题，但在实际应用中，分类器评估复杂性也可能起很大作用。本文提出了一个简单的经济模型，该模型可以在分类器误差和计算的评估复杂性之间进行权衡。然后，这种权衡可以用来判断增加样本量或特征数量以减少分类错误的必要性，或者相反，特征提取或原型选择以降低评估复杂性。该模型应用于手写数字识别的基准问题，并在给定某些假设的情况下显示出有趣的结论。

引用次数: 14

Unsupervised active regions for multiresolution image segmentation 多分辨率图像分割的无监督活动区域

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048451

X. Muñoz, J. Martí, X. Cufí, J. Freixenet

An unsupervised approach to image segmentation which fuses region and boundary information is presented. The proposed approach takes advantage of the combined use of 3 different strategies: the guidance of seed placement, the control of decision criterion, and the boundary refinement. The new algorithm uses the boundary information to initialize a set of active regions which compete for the pixels in order to segment the whole image. The method is implemented on a multiresolution representation which ensures noise robustness as well as computation efficiency. The accuracy of the segmentation results has been proven through an objective comparative evaluation of the method.

提出了一种融合区域和边界信息的无监督图像分割方法。该方法综合利用了种子放置指导、决策准则控制和边界细化三种策略。该算法利用边界信息初始化一组竞争像素的活动区域，以分割整个图像。该方法采用多分辨率表示，保证了噪声的鲁棒性和计算效率。通过对该方法的客观对比评价，证明了分割结果的准确性。

引用次数: 13

Video fade detection by discrete line identification 视频褪色检测的离散线识别

Object recognition supported by user interaction for service robots

Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048477

S. Guimarães, A. Araújo, M. Couprie, N. J. Leite

The video segmentation problem can be regarded as a problem of detecting the fundamental video units (shots). Due to different ways of linking two consecutive shots this task turns out to be difficult. In this work, we propose a method to detect a type of gradual transition, the fade, by image segmentation tools instead of using dissimilarity measures or mathematical models. Firstly, the video is transformed into a 2D image considering the histogram information, called visual rhythm by histogram. Afterwards, we apply image processing tools to detect specified patterns in this image.

视频分割问题可以看作是一个基本视频单元(镜头)的检测问题。由于连接两个连续镜头的方法不同，这个任务变得很困难。在这项工作中，我们提出了一种方法来检测一种逐渐过渡，褪色，通过图像分割工具，而不是使用不相似的措施或数学模型。首先，根据直方图信息将视频转换成二维图像，称为视觉节奏直方图。然后，我们应用图像处理工具来检测该图像中的特定模式。

引用次数: 10

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Object recognition supported by user interaction for service robots

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀