Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048318
S. Marcel, Samy Bengio
The performance of face verification systems has steadily improved over the last few years, with effort focused mainly on models rather than on feature processing. State-of-the-art methods often use the gray-scale face image as input. We propose to use an additional feature of the face image: the skin color. The new feature set is tested on a benchmark database, namely XM2VTS, using a simple discriminant artificial neural network. Results show that the skin color information improves the performance.
Title: Improving face verification using skin color information
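The idea of augmenting grey-level input with a skin-colour feature can be sketched roughly as follows (a toy illustration with synthetic "faces", a normalized r-g chromaticity histogram as the colour cue, and a single-layer discriminant standing in for the paper's neural network; all names and numbers are invented):

```python
import numpy as np

rng = np.random.default_rng(0)

def face_features(rgb):
    """Grayscale pixels plus a skin-colour cue: a normalised (r, g)
    chromaticity histogram appended to the grey-level vector."""
    rgb = rgb.astype(float)
    gray = rgb.mean(axis=2).ravel() / 255.0
    s = rgb.sum(axis=2) + 1e-8
    r, g = rgb[..., 0] / s, rgb[..., 1] / s
    hist, _, _ = np.histogram2d(r.ravel(), g.ravel(), bins=8,
                                range=[[0, 1], [0, 1]])
    return np.concatenate([gray, hist.ravel() / hist.sum()])

def toy_face(client):
    """Synthetic stand-in: 'client' crops are brighter than impostor crops."""
    lo, hi = (120, 200) if client else (30, 120)
    return rng.integers(lo, hi, (16, 16, 3)).astype(np.uint8)

X = np.stack([face_features(toy_face(c)) for c in [True] * 40 + [False] * 40])
y = np.array([1.0] * 40 + [0.0] * 40)

# Single-layer logistic discriminant trained by gradient descent.
w, b = np.zeros(X.shape[1]), 0.0
for _ in range(300):
    g = 1 / (1 + np.exp(-(X @ w + b))) - y
    w -= 0.5 * X.T @ g / len(y); b -= 0.5 * g.mean()

acc = ((1 / (1 + np.exp(-(X @ w + b))) > 0.5) == y).mean()
print(f"training accuracy: {acc:.2f}")
```

On real data the colour histogram would be computed from a detected skin region rather than the whole crop.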
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048243
Yanlai Li, Kuanquan Wang, David Zhang
This paper presents a very fast step-acceleration-based training algorithm (SATA) for multilayer feedforward neural networks. The most outstanding virtue of this algorithm is that it does not need to calculate the gradient of the target function; in each iteration step, computation concentrates only on the part that has varied. The proposed algorithm is simple, flexible, and feasible, and it converges quickly. In many simulations it was compared with other methods, including conventional backpropagation (BP), conjugate gradient, and weight-extrapolation-based BP, and its superiority in convergence speed and required computation time was confirmed.
Title: Step acceleration based training algorithm for feedforward neural networks
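The gradient-free, one-weight-at-a-time flavour of a step-acceleration scheme might look like this (a speculative sketch on a toy XOR network, not the published SATA algorithm; the growth and shrink factors are invented):

```python
import numpy as np

rng = np.random.default_rng(1)

# Tiny 2-2-1 network for XOR; the loss is evaluated directly, no gradients.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], float)
y = np.array([0.0, 1.0, 1.0, 0.0])

def loss(w):
    W1, b1 = w[:4].reshape(2, 2), w[4:6]
    W2, b2 = w[6:8], w[8]
    h = np.tanh(X @ W1 + b1)
    p = 1 / (1 + np.exp(-(h @ W2 + b2)))
    return ((p - y) ** 2).mean()

w = rng.normal(0, 1.0, 9)
steps = np.full(9, 0.5)              # one adaptive step size per weight
loss0 = best = loss(w)
for _ in range(300):
    for i in range(9):               # only one weight varies at a time
        for s in (steps[i], -steps[i]):
            w[i] += s
            if loss(w) < best:
                best = loss(w)
                steps[i] *= 1.2      # accelerate after a successful step
                break
            w[i] -= s                # undo the failed trial move
        else:
            steps[i] *= 0.5          # decelerate after two failures

print(f"MSE: {loss0:.3f} -> {best:.3f}")
```

Each trial changes a single weight, so a real implementation would recompute only the activations that depend on it, which is the "computation concentrates on the varied part" idea.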
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048260
A. Erçil, Burak Büke
When the number of objects in the training set is too small for the number of features used, most classification procedures cannot find good classification boundaries. In this paper, we introduce a new technique for the one-class classification problem: an implicit polynomial surface is fitted to the point cloud of features to model the one class that we are trying to separate from the others.
Title: One class classification using implicit polynomial surface fitting
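The implicit-polynomial idea can be illustrated with a degree-2 surface (a curve, in 2-D) fitted by least squares (a minimal sketch; the authors' fitting procedure and polynomial degree may differ):

```python
import numpy as np

rng = np.random.default_rng(2)

def monomials(pts):
    """Degree-2 monomial basis [1, x, y, x^2, xy, y^2] for 2-D feature points."""
    x, y = pts[:, 0], pts[:, 1]
    return np.column_stack([np.ones_like(x), x, y, x * x, x * y, y * y])

def fit_implicit(pts):
    """Least-squares implicit fit: the coefficients are the right singular
    vector of the monomial matrix with the smallest singular value, so that
    f(x) is approximately 0 on the training cloud."""
    _, _, Vt = np.linalg.svd(monomials(pts), full_matrices=False)
    return Vt[-1]

# One class: noisy points on the unit circle.
t = rng.uniform(0, 2 * np.pi, 200)
inliers = np.column_stack([np.cos(t), np.sin(t)]) + rng.normal(0, 0.02, (200, 2))
coef = fit_implicit(inliers)

def score(pts):
    """|f(x)| is small near the modelled class surface."""
    return np.abs(monomials(pts) @ coef)

tau = np.percentile(score(inliers), 99)   # acceptance threshold from training data
t2 = rng.uniform(0, 2 * np.pi, 200)
outliers = 1.7 * np.column_stack([np.cos(t2), np.sin(t2)])
print(f"inliers accepted: {(score(inliers) <= tau).mean():.2f}, "
      f"outliers accepted: {(score(outliers) <= tau).mean():.2f}")
```

Points of the modelled class lie near the zero set of the polynomial, so thresholding |f(x)| gives a one-class decision without any negative training examples.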
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048325
P. Perner, Horst Perner, Bernd Müller
We investigated the Boolean model for the classification of textures. We were interested in three issues: 1. What are the best features for classification? 2. How does the number of Boolean models created from the original image influence the accuracy of the classifier? 3. Is decision tree induction the right method for classification? We are working on a real-world application, the classification of HEp-2 cells. These cells are used in medicine for the identification of antinuclear autoantibodies. Human experts describe the characteristics of these cells by symbolic texture features. We apply the Boolean model to this problem and assume that the primary grains are regions of random size and shape. We use decision tree induction to learn the relevant classification knowledge and the structure of the classifier.
Title: Texture classification based on the Boolean model and its application to HEp-2 cells
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048479
T. Ojala, Markus Aittola, Esa Matinmikko
This paper conducts an empirical evaluation of the MPEG-7 visual part of experimentation model (XM) color descriptors on a challenging problem: content-based retrieval of semantic image categories. The performance of the four color descriptors provided in the current XM reference implementation (Color Layout, Color Structure, Dominant Color, and Scalable Color) is compared to that of the HSV autocorrelogram, which has done well in recent empirical studies. Experimental results show that Color Structure provides the best retrieval accuracy, whereas Dominant Color, the computationally most expensive descriptor, performs worst on this problem.
Title: Empirical evaluation of MPEG-7 XM color descriptors in content-based retrieval of semantic image categories
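For reference, a colour autocorrelogram of the kind compared against can be sketched like this (assuming a pre-quantized colour-label image; the distance set, quantization, and neighbourhood handling are illustrative, not the exact descriptor used in the paper):

```python
import numpy as np

rng = np.random.default_rng(3)

def autocorrelogram(labels, K, dists=(1, 3)):
    """P(neighbour at chessboard distance d has colour c | pixel has colour c),
    estimated over 8 directions; out-of-image neighbours are excluded."""
    H, W = labels.shape
    feats = []
    for d in dists:
        pad = np.full((H + 2 * d, W + 2 * d), -1)
        pad[d:d + H, d:d + W] = labels
        offs = [(-d, 0), (d, 0), (0, -d), (0, d),
                (-d, -d), (-d, d), (d, -d), (d, d)]
        for c in range(K):
            mask = labels == c
            hits = valid = 0
            for dy, dx in offs:
                nb = pad[d + dy:d + dy + H, d + dx:d + dx + W]
                hits += np.sum(mask & (nb == c))
                valid += np.sum(mask & (nb != -1))
            feats.append(hits / max(valid, 1))
    return np.array(feats)

# A coherent blob vs. salt-and-pepper noise with the same colour histogram:
# a plain histogram cannot tell them apart, the correlogram can.
blob = np.zeros((32, 32), int); blob[8:24, 8:24] = 1
noise = rng.permutation(blob.ravel()).reshape(32, 32)
ac_blob, ac_noise = autocorrelogram(blob, 2), autocorrelogram(noise, 2)
print(f"colour 1, d=1: blob {ac_blob[1]:.2f} vs noise {ac_noise[1]:.2f}")
```

An HSV autocorrelogram would first quantize the HSV colour space into the K labels assumed here.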
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048463
M. Naphade, S. Basu, John R. Smith, Ching-Yung Lin, Belle L. Tseng
Statistical modeling for content-based retrieval is examined in the context of the recent TREC Video benchmark exercise. The TREC Video exercise can be viewed as a test bed for evaluating and comparing a variety of algorithms on a set of high-level queries for multimedia retrieval. We report on the use of techniques adopted from statistical learning theory. Our method depends on training models on large data sets. In particular, we use statistical models such as Gaussian mixture models to build computational representations for a variety of semantic concepts, including rocket launch, outdoor greenery, and sky. Training requires a large amount of annotated (labeled) data, so we explore the use of active learning in the annotation engine to minimize the number of training samples that must be labeled for satisfactory performance.
Title: A statistical modeling approach to content based video retrieval
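Modelling each concept with a GMM and labelling by likelihood can be sketched as follows (toy colour features with invented concept means; diagonal covariances and a minimal EM loop for brevity, not the authors' implementation):

```python
import numpy as np

rng = np.random.default_rng(4)

def log_gauss(X, pi, mu, var):
    """Per-component log densities (diagonal covariances) plus log weights."""
    return (-0.5 * (((X[:, None] - mu) ** 2 / var).sum(-1)
                    + np.log(2 * np.pi * var).sum(-1))
            + np.log(pi + 1e-12))

def fit_gmm(X, K, iters=30):
    """Diagonal-covariance GMM fitted with a minimal EM loop."""
    n, _ = X.shape
    mu = X[rng.choice(n, K, replace=False)]
    var = np.tile(X.var(axis=0), (K, 1))
    pi = np.full(K, 1.0 / K)
    for _ in range(iters):
        lp = log_gauss(X, pi, mu, var)        # E-step: responsibilities
        lp -= lp.max(1, keepdims=True)
        r = np.exp(lp); r /= r.sum(1, keepdims=True)
        nk = r.sum(0) + 1e-9                  # M-step: reweight, re-centre
        pi = nk / n
        mu = (r.T @ X) / nk[:, None]
        var = (r.T @ X ** 2) / nk[:, None] - mu ** 2 + 1e-6
    return pi, mu, var

def loglik(X, model):
    lp = log_gauss(X, *model)
    m = lp.max(1, keepdims=True)
    return (m + np.log(np.exp(lp - m).sum(1, keepdims=True))).ravel()

# Hypothetical colour features for two concepts (means are invented).
sky = rng.normal([0.5, 0.6, 0.9], 0.05, (200, 3))
green = rng.normal([0.2, 0.6, 0.2], 0.05, (200, 3))
m_sky, m_green = fit_gmm(sky, 2), fit_gmm(green, 2)

test = rng.normal([0.5, 0.6, 0.9], 0.05, (50, 3))
frac = (loglik(test, m_sky) > loglik(test, m_green)).mean()
print(f"test frames labelled 'sky': {frac:.2f}")
```

Active learning would then ask annotators to label only the frames whose likelihoods under the competing concept models are closest, where a label is most informative.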
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048456
M. Loog, B. Ginneken
We propose a general iterative contextual pixel classifier for supervised image segmentation. The iterative procedure is statistically well-founded and can be considered a variation on the iterated conditional modes (ICM) of Besag (1983). Starting from an initial segmentation, the algorithm iteratively updates it by reclassifying every pixel based on the original features and, additionally, contextual information. This contextual information consists of the class labels of pixels in the neighborhood of the pixel to be reclassified. Three essential differences with the original ICM are: (1) our update step is based merely on a classification result, hence avoiding the explicit calculation of conditional probabilities; (2) the clique formalism of the Markov random field framework is not required; (3) no assumption is made w.r.t. the conditional independence of the observed pixel values given the segmented image. The important consequence of properties 1 and 2 is that one can easily incorporate common pattern recognition tools in our segmentation algorithm. Examples are different classifiers (e.g. the Fisher linear discriminant, the nearest-neighbor classifier, or support vector machines) and dimension-reduction techniques like LDA or PCA.
We experimentally compare a specific instance of our general method to pixel classification, using simulated data and chest radiographs, and show that the former outperforms the latter.
Title: Supervised segmentation by iterated contextual pixel classification
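The iterated contextual update, with neighbour labels fed back as extra features, can be sketched like this (a toy binary segmentation in which logistic regression stands in for the classifier; image size, noise level, and features are invented):

```python
import numpy as np

rng = np.random.default_rng(5)

def neighbor_mean(lab):
    """Mean label of the 4-neighbourhood: the contextual feature."""
    p = np.pad(lab.astype(float), 1, mode='edge')
    return (p[:-2, 1:-1] + p[2:, 1:-1] + p[1:-1, :-2] + p[1:-1, 2:]) / 4

def make_image():
    truth = np.zeros((32, 32), int); truth[8:24, 8:24] = 1
    return truth, truth + rng.normal(0, 0.8, truth.shape)   # noisy observation

def train_logreg(F, y, iters=500, lr=0.5):
    w, b = np.zeros(F.shape[1]), 0.0
    for _ in range(iters):
        g = 1 / (1 + np.exp(-(F @ w + b))) - y
        w -= lr * F.T @ g / len(y); b -= lr * g.mean()
    return w, b

# Train on one labelled image: intensity plus context from an initial segmentation.
truth, obs = make_image()
init = (obs > 0.5).astype(int)
F = np.column_stack([obs.ravel(), neighbor_mean(init).ravel()])
w, b = train_logreg(F, truth.ravel().astype(float))

# Iterated contextual classification of a fresh noisy image.
truth2, obs2 = make_image()
lab = (obs2 > 0.5).astype(int)                   # initial segmentation
acc0 = (lab == truth2).mean()
for _ in range(10):                              # reclassify every pixel
    F2 = np.column_stack([obs2.ravel(), neighbor_mean(lab).ravel()])
    lab = ((F2 @ w + b) > 0).astype(int).reshape(lab.shape)
acc1 = (lab == truth2).mean()
print(f"pixel accuracy: {acc0:.2f} -> {acc1:.2f}")
```

Because the update is just a classification, the logistic regression here could be swapped for any of the classifiers the abstract lists, which is the point of properties 1 and 2.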
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048377
E. Michaelsen, U. Soergel, Uwe Stilla
InSAR data are used to recognise large industrial building complexes. Such buildings often show salient regular patterns of strong scatterers on their roofs. A preceding segmentation, which uses the intensity, height and coherence information, extracts building cues. Strong scatterers are filtered by a spot detector and localised by cluster formation. They are then grouped into rows by a process that uses the contours of the building cues as context. Such buildings are labelled as industrial buildings and serve as seeds to assemble adjacent buildings into complex structured building aggregates. The structure of the grouping process is depicted by a production net.
Title: Grouping salient scatterers in InSAR data for recognition of industrial buildings
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048337
S. Mahmoudi, M. Daoudi
In this work we introduce a new method for indexing 3D models. The method characterizes a 3D object by a set of seven characteristic views: three principal and four secondary views. The primary, secondary, and tertiary viewing directions are determined by eigenvector analysis of the covariance matrix of the 3D object, and the secondary views are deduced from the principal views. We propose an index based on the "curvature scale space", organized in a tree structure named M-Tree, which is parameterized by a distance function and considerably decreases the computation time by saving intermediate distances.
Title: 3D models retrieval by using characteristic views
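The eigenvector analysis behind the principal viewing directions can be sketched as follows (a synthetic point cloud stands in for a 3D model's vertices, and the projection step is simplified; not the authors' rendering pipeline):

```python
import numpy as np

rng = np.random.default_rng(6)

# Toy 3-D object: an elongated ellipsoidal point cloud, arbitrarily rotated.
pts = rng.normal(0, 1, (500, 3)) * np.array([4.0, 2.0, 1.0])
theta = 0.7
R = np.array([[np.cos(theta), -np.sin(theta), 0],
              [np.sin(theta),  np.cos(theta), 0],
              [0, 0, 1]])
pts = pts @ R.T

# Principal viewing directions: eigenvectors of the covariance matrix,
# ordered by decreasing eigenvalue (primary, secondary, tertiary).
C = np.cov(pts, rowvar=False)
evals, evecs = np.linalg.eigh(C)
axes = evecs[:, np.argsort(evals)[::-1]]   # columns: 1st, 2nd, 3rd direction

def view(pts, axes, drop):
    """Characteristic view along direction `drop`: project the points onto
    the plane spanned by the other two eigen-directions."""
    keep = [i for i in range(3) if i != drop]
    return pts @ axes[:, keep]

primary_view = view(pts, axes, 0)          # looking along the primary axis
print(primary_view.shape)
```

Because the eigenbasis is intrinsic to the object, the same model produces the same characteristic views regardless of its pose in the database.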
Pub Date: 2002-12-10 | DOI: 10.1109/ICPR.2002.1048431
Gu Xu, Yu-Fei Ma, HongJiang Zhang, Shiqiang Yang
Motion is an important cue for video understanding and is widely used in many semantic video analyses. We present a new motion representation scheme in which the motion in a video is represented by the responses of its frames to a set of motion filters, each designed to be most responsive to one type of dominant motion. We then employ hidden Markov models (HMMs) to characterize the motion patterns based on these features and thus classify basketball video into 16 events. In a human evaluation, 75% of the classification results were judged satisfactory, demonstrating the effectiveness of the proposed approach for recognizing semantic events in video.
Title: Motion based event recognition using HMM
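Scoring a motion-symbol sequence under per-event HMMs can be sketched with the forward algorithm (hand-built toy models; the event names, symbols, and probabilities are invented, and the paper's HMMs are trained rather than hand-set):

```python
import numpy as np

rng = np.random.default_rng(7)

def forward_loglik(obs, pi, A, B):
    """Log-likelihood of a discrete observation sequence under an HMM,
    computed with the scaled forward algorithm."""
    alpha = pi * B[:, obs[0]]
    ll = np.log(alpha.sum()); alpha /= alpha.sum()
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]
        s = alpha.sum(); ll += np.log(s); alpha /= s
    return ll

# Two toy event HMMs over 3 motion-filter responses
# (symbols: 0 = pan, 1 = zoom, 2 = still).
pi = np.array([1.0, 0.0])
A_ev = np.array([[0.9, 0.1],            # state 0 eventually hands over
                 [0.0, 1.0]])           # to an absorbing state 1
B_fastbreak = np.array([[0.8, 0.1, 0.1],   # mostly pan, then mostly zoom
                        [0.1, 0.8, 0.1]])
B_freethrow = np.array([[0.1, 0.1, 0.8],   # mostly still throughout
                        [0.1, 0.1, 0.8]])

def sample(pi, A, B, T):
    s, obs = rng.choice(len(pi), p=pi), []
    for _ in range(T):
        obs.append(rng.choice(B.shape[1], p=B[s]))
        s = rng.choice(len(pi), p=A[s])
    return obs

obs = sample(pi, A_ev, B_fastbreak, 30)
scores = {name: forward_loglik(obs, pi, A_ev, B)
          for name, B in [("fast_break", B_fastbreak),
                          ("free_throw", B_freethrow)]}
pred = max(scores, key=scores.get)
print(pred)
```

With one trained HMM per event, a clip is assigned to whichever of the 16 models gives its motion-filter response sequence the highest likelihood.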