首页 > 最新文献

Object recognition supported by user interaction for service robots最新文献

英文 中文
Robust contrast-invariant eigen detection 鲁棒对比不变特征检测
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048410
C. Chennubhotla, A. Jepson, J. Midgley
We achieve two goals in this paper: (1) to build a novel appearance-based object representation that takes into account variations in contrast often found in training images; (2) to develop a robust appearance-based detection scheme that can handle outliers such as occlusion and structured noise. To build the representation, we decompose the input ensemble into two subspaces: a principal subspace (within-subspace) and its orthogonal complement (out-of-subspace). Before computing the principal subspace, we remove any dependency on contrast that the training set might exhibit. To account for pixel outliers in test images, we model the residual signal in the out-of-subspace by a probabilistic mixture model of an inlier distribution and a uniform outlier distribution. The mixture model, in turn, facilitates the robust estimation of the within-subspace coefficients. We show our methodology leads to an effective classifier for separating images of eyes from non-eyes extracted from the FERET dataset.
我们在本文中实现了两个目标:(1)建立一种新的基于外观的对象表示,该表示考虑了训练图像中经常发现的对比度变化;(2)开发一种鲁棒的基于外观的检测方案,该方案可以处理遮挡和结构化噪声等异常值。为了构建表示,我们将输入集合分解为两个子空间:主子空间(子空间内)和它的正交补(子空间外)。在计算主子空间之前,我们消除了训练集可能表现出的对对比度的依赖。为了考虑测试图像中的像素异常值,我们通过一个初始分布和均匀异常分布的概率混合模型来模拟子空间外的残差信号。混合模型反过来又有助于子空间内系数的鲁棒估计。我们展示了我们的方法导致了一个有效的分类器,用于从FERET数据集中提取眼睛图像和非眼睛图像。
{"title":"Robust contrast-invariant eigen detection","authors":"C. Chennubhotla, A. Jepson, J. Midgley","doi":"10.1109/ICPR.2002.1048410","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048410","url":null,"abstract":"We achieve two goals in this paper: (1) to build a novel appearance-based object representation that takes into account variations in contrast often found in training images; (2) to develop a robust appearance-based detection scheme that can handle outliers such as occlusion and structured noise. To build the representation, we decompose the input ensemble into two subspaces: a principal subspace (within-subspace) and its orthogonal complement (out-of-subspace). Before computing the principal subspace, we remove any dependency on contrast that the training set might exhibit. To account for pixel outliers in test images, we model the residual signal in the out-of-subspace by a probabilistic mixture model of an inlier distribution and a uniform outlier distribution. The mixture model, in turn, facilitates the robust estimation of the within-subspace coefficients. We show our methodology leads to an effective classifier for separating images of eyes from non-eyes extracted from the FERET dataset.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125072537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Variational image segmentation by unifying region and boundary information 统一区域和边界信息的变分图像分割
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048445
O. Ecabert, J. Thiran
This paper presents a novel variational image segmentation technique that unifies both geodesic active contours and geodesic active regions. The originality of the method is the automatic and dynamic global weighting of the respective local equations of motion. A new stopping function for the geodesic active contours is also introduced, which proves to have a better behavior in the vicinity of the object boundaries. Instead of minimizing the standard energy functional, we use a normalized version, which strongly reduces the shortening effect, improving thus the coupling with the region model. Results and method effectiveness are shown on real and medical images.
提出了一种结合测地线活动轮廓和测地线活动区域的变分图像分割方法。该方法的创新之处在于对各自的局部运动方程进行自动和动态的全局加权。本文还引入了一种新的测地线活动轮廓停止函数,该停止函数在目标边界附近具有较好的性能。而不是最小化标准的能量泛函,我们使用一个标准化的版本,这大大减少了缩短效应,从而改善了与区域模型的耦合。在真实图像和医学图像上显示了结果和方法的有效性。
{"title":"Variational image segmentation by unifying region and boundary information","authors":"O. Ecabert, J. Thiran","doi":"10.1109/ICPR.2002.1048445","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048445","url":null,"abstract":"This paper presents a novel variational image segmentation technique that unifies both geodesic active contours and geodesic active regions. The originality of the method is the automatic and dynamic global weighting of the respective local equations of motion. A new stopping function for the geodesic active contours is also introduced, which proves to have a better behavior in the vicinity of the object boundaries. Instead of minimizing the standard energy functional, we use a normalized version, which strongly reduces the shortening effect, improving thus the coupling with the region model. Results and method effectiveness are shown on real and medical images.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126043547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Near-optimal regularization parameters for applications in computer vision 近最优正则化参数在计算机视觉中的应用
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048367
Changjiang Yang, R. Duraiswami, L. Davis
Computer vision requires the solution of many ill-posed problems such as optical flow, structure from motion, shape from shading, surface reconstruction, image restoration and edge detection. Regularization is a popular method to solve ill-posed problems, in which the solution is sought by minimization of a sum of two weighted terms, one measuring the error arising from the ill-posed model, the other indicating the distance between the solution and some class of solutions chosen on the basis of prior knowledge (smoothness, or other prior information). One of important issues in regularization is choosing optimal weight (or regularization parameter). Existing methods for choosing regularization parameters either require the prior information on noise in the data, or are heuristic graphical methods. We apply a method for choosing near-optimal regularization parameters by approximately minimizing the distance between the true solution and the family of regularized solutions. We demonstrate the effectiveness of this approach for the regularization on two examples: edge detection and image restoration.
计算机视觉需要解决许多病态问题,如光流、运动的结构、阴影的形状、表面重建、图像恢复和边缘检测。正则化是一种解决病态问题的流行方法,其中通过最小化两个加权项的和来寻求解决方案,一个衡量由病态模型引起的误差,另一个表示解决方案与基于先验知识(平滑性或其他先验信息)选择的某类解决方案之间的距离。正则化中的一个重要问题是选择最优权值(或正则化参数)。现有的正则化参数选择方法要么需要数据中噪声的先验信息,要么是启发式图方法。我们通过近似最小化真解与正则化解族之间的距离,应用了一种选择近最优正则化参数的方法。我们通过边缘检测和图像恢复两个例子证明了该方法在正则化方面的有效性。
{"title":"Near-optimal regularization parameters for applications in computer vision","authors":"Changjiang Yang, R. Duraiswami, L. Davis","doi":"10.1109/ICPR.2002.1048367","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048367","url":null,"abstract":"Computer vision requires the solution of many ill-posed problems such as optical flow, structure from motion, shape from shading, surface reconstruction, image restoration and edge detection. Regularization is a popular method to solve ill-posed problems, in which the solution is sought by minimization of a sum of two weighted terms, one measuring the error arising from the ill-posed model, the other indicating the distance between the solution and some class of solutions chosen on the basis of prior knowledge (smoothness, or other prior information). One of important issues in regularization is choosing optimal weight (or regularization parameter). Existing methods for choosing regularization parameters either require the prior information on noise in the data, or are heuristic graphical methods. We apply a method for choosing near-optimal regularization parameters by approximately minimizing the distance between the true solution and the family of regularized solutions. We demonstrate the effectiveness of this approach for the regularization on two examples: edge detection and image restoration.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125381939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Beam search for feature selection in automatic SVM defect classification 基于波束搜索的SVM缺陷自动分类特征选择
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048275
Puneet Gupta, D. Doermann, D. DeMenthon
Often in pattern classification problems, one tries to extract a large number of features and base the classifier decision on as much information as possible. This yields an array of features that are 'potentially' useful. Most of the time however, large feature sets are sub-optimal in describing the samples since they tend to over-represent the data and model noise along with the useful information in the data. Selecting relevant features from the available set of features is, therefore, a challenging task. In this paper, we present an innovative feature selection algorithm called Smart Beam Search (SBS), which is used with a support vector machine (SVM) based classifier for automatic defect classification. This feature selection approach not only reduces the dimensionality of the feature space substantially, but also improves the classifier performance.
通常在模式分类问题中,人们试图提取大量的特征,并根据尽可能多的信息做出分类器决策。这就产生了一系列“潜在”有用的特性。然而,大多数时候,大型特征集在描述样本时不是最优的,因为它们倾向于过度表示数据和模型噪声以及数据中的有用信息。因此,从可用的特性集中选择相关的特性是一项具有挑战性的任务。在本文中,我们提出了一种创新的特征选择算法——智能波束搜索(SBS),该算法与基于支持向量机(SVM)的分类器一起用于缺陷自动分类。这种特征选择方法不仅大大降低了特征空间的维数,而且提高了分类器的性能。
{"title":"Beam search for feature selection in automatic SVM defect classification","authors":"Puneet Gupta, D. Doermann, D. DeMenthon","doi":"10.1109/ICPR.2002.1048275","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048275","url":null,"abstract":"Often in pattern classification problems, one tries to extract a large number of features and base the classifier decision on as much information as possible. This yields an array of features that are 'potentially' useful. Most of the time however, large feature sets are sub-optimal in describing the samples since they tend to over-represent the data and model noise along with the useful information in the data. Selecting relevant features from the available set of features is, therefore, a challenging task. In this paper, we present an innovative feature selection algorithm called Smart Beam Search (SBS), which is used with a support vector machine (SVM) based classifier for automatic defect classification. This feature selection approach not only reduces the dimensionality of the feature space substantially, but also improves the classifier performance.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126714665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Modeling object classes in aerial images using texture motifs 使用纹理图案在航拍图像中建模对象类
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048470
S. Bhagavathy, S. Newsam, B. S. Manjunath
We propose a canonical model for object classes in aerial images. This model is motivated by the observation that geographic regions of interest are characterized by collections of texture motifs corresponding to the geographic processes that generate them. We show that this model is effective in learning the common texture themes, or motifs, of the object classes.
我们提出了航空图像中对象类的规范模型。该模型的动机是观察到感兴趣的地理区域具有与产生它们的地理过程相对应的纹理图案集合的特征。我们表明,该模型在学习对象类的共同纹理主题或主题方面是有效的。
{"title":"Modeling object classes in aerial images using texture motifs","authors":"S. Bhagavathy, S. Newsam, B. S. Manjunath","doi":"10.1109/ICPR.2002.1048470","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048470","url":null,"abstract":"We propose a canonical model for object classes in aerial images. This model is motivated by the observation that geographic regions of interest are characterized by collections of texture motifs corresponding to the geographic processes that generate them. We show that this model is effective in learning the common texture themes, or motifs, of the object classes.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122421228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Learning from negative example in relevance feedback for content-based image retrieval 基于内容的图像检索中相关反馈的负例学习
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048458
M. L. Kherfi, D. Ziou, A. Bernardi
In this paper, we address some issues related to the combination of positive and negative examples to perform more efficient image retrieval. We analyze the relevance of negative example and how it can be interpreted. Then we propose a new relevance feedback model that integrates both positive and negative examples. First, a query is formulated using positive example, then negative example is used to refine the system's response. Mathematically, relevance feedback is formulated as an optimization of intra and inter variances of positive and negative examples.
在本文中,我们解决了一些与正样例和负样例相结合的问题,以执行更有效的图像检索。我们分析了负面例子的相关性以及它是如何被解释的。在此基础上,我们提出了一种新的正例与负例相结合的关联反馈模型。首先,使用正例来制定查询,然后使用负例来改进系统的响应。在数学上,相关反馈被表述为正例和负例的内方差和间方差的优化。
{"title":"Learning from negative example in relevance feedback for content-based image retrieval","authors":"M. L. Kherfi, D. Ziou, A. Bernardi","doi":"10.1109/ICPR.2002.1048458","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048458","url":null,"abstract":"In this paper, we address some issues related to the combination of positive and negative examples to perform more efficient image retrieval. We analyze the relevance of negative example and how it can be interpreted. Then we propose a new relevance feedback model that integrates both positive and negative examples. First, a query is formulated using positive example, then negative example is used to refine the system's response. Mathematically, relevance feedback is formulated as an optimization of intra and inter variances of positive and negative examples.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122737238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Discrete approach for automatic knowledge extraction from precedent large-scale data, and classification 离散方法用于从前例大规模数据中自动提取知识,并进行分类
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048269
V. Ryazanov, Victor A. Vorontchikhin
The proposed method for automatic knowledge extraction from large-scale data is based on the idea of analysing neighborhoods of "supporting" objects and construction of data covered by sets of hyper parallelepipeds. A simple procedure to choose the supporting objects is applied. Knowledge extraction (logical regularities search) is based on the solution of special discrete linear optimization tasks associated with supporting objects. Two practical tasks are considered for method illustration.
该方法基于对“支持”对象的邻域分析和超平行六面体集所覆盖的数据的构造思想,从大规模数据中自动提取知识。应用了一个简单的程序来选择支撑对象。知识提取(逻辑规律搜索)是基于与支持对象相关联的特殊离散线性优化任务的求解。为了说明方法,考虑了两个实际任务。
{"title":"Discrete approach for automatic knowledge extraction from precedent large-scale data, and classification","authors":"V. Ryazanov, Victor A. Vorontchikhin","doi":"10.1109/ICPR.2002.1048269","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048269","url":null,"abstract":"The proposed method for automatic knowledge extraction from large-scale data is based on the idea of analysing neighborhoods of \"supporting\" objects and construction of data covered by sets of hyper parallelepipeds. A simple procedure to choose the supporting objects is applied. Knowledge extraction (logical regularities search) is based on the solution of special discrete linear optimization tasks associated with supporting objects. Two practical tasks are considered for method illustration.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122934552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How many classifiers do I need? 我需要多少个分类器?
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048266
B. Schiele
Combining multiple classifiers promises to increase performance and robustness of a classification task. Currently, the understanding which combination scheme should be used and the ability to quantify the expected benefit is inadequate. This paper attempts to quantify the performance and robustness gain for different combination schemes and for two classifier types. The results indicate that the combination of a small number of classifiers may already result in a substantial performance gain. Also, the increase in robustness can be substantial by combining an adequate number of classifiers.
组合多个分类器有望提高分类任务的性能和鲁棒性。目前,对应采用何种组合方案的认识和量化预期效益的能力不足。本文试图量化不同组合方案和两种分类器类型的性能和鲁棒性增益。结果表明,少量分类器的组合可能已经产生了实质性的性能增益。此外,通过组合足够数量的分类器,鲁棒性可以得到实质性的提高。
{"title":"How many classifiers do I need?","authors":"B. Schiele","doi":"10.1109/ICPR.2002.1048266","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048266","url":null,"abstract":"Combining multiple classifiers promises to increase performance and robustness of a classification task. Currently, the understanding which combination scheme should be used and the ability to quantify the expected benefit is inadequate. This paper attempts to quantify the performance and robustness gain for different combination schemes and for two classifier types. The results indicate that the combination of a small number of classifiers may already result in a substantial performance gain. Also, the increase in robustness can be substantial by combining an adequate number of classifiers.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122971942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Variable neighborhood search for geometrically deformable templates 可变邻域搜索几何可变形模板
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048395
M. Lalonde, L. Gagnon
This paper proposes two modifications to the geometrically deformable template model. First, the optimization stage originally based on simulated annealing is replaced with a meta-heuristic called Variable Neighborhood Search that treats simulated annealing as a local search tool. Second, an affine deformation energy is introduced to improve the quality of the search. An example of optic disc segmentation in an ophthalmic image is given.
本文对几何可变形模板模型提出了两种修正方法。首先,将最初基于模拟退火的优化阶段替换为将模拟退火作为局部搜索工具的元启发式变量邻域搜索。其次,引入仿射变形能,提高搜索质量;给出了眼科图像视盘分割的一个实例。
{"title":"Variable neighborhood search for geometrically deformable templates","authors":"M. Lalonde, L. Gagnon","doi":"10.1109/ICPR.2002.1048395","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048395","url":null,"abstract":"This paper proposes two modifications to the geometrically deformable template model. First, the optimization stage originally based on simulated annealing is replaced with a meta-heuristic called Variable Neighborhood Search that treats simulated annealing as a local search tool. Second, an affine deformation energy is introduced to improve the quality of the search. An example of optic disc segmentation in an ophthalmic image is given.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"105 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128259427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Video-based sign recognition using self-organizing subunits 基于自组织子单元的视频符号识别
Pub Date : 2002-12-10 DOI: 10.1109/ICPR.2002.1048332
Britta Bauer, K. Kraiss
This paper deals with the automatic recognition of German signs. The statistical approach is based on the Bayes decision rule for minimum error rate. Following speech recognition system designs, which are in general based on phonemes, here the idea of an automatic sign language recognition system using subunits rather than models for whole signs is outlined. The advantage of such a system will be a future reduction of necessary training material. Furthermore, a simplified enlargement of the existing vocabulary is expected, as new signs can be added to the vocabulary database without re-training the existing hidden Markov models (HMMs) for subunits. Since it is difficult to define subunits for sign language, this approach employs totally self-organized subunits. In first experiences a recognition accuracy of 92,5% was achieved for 100 signs, which were previously trained. For 50 new signs an accuracy of 81% was achieved without retraining of subunit-HMMs.
本文研究了德语符号的自动识别问题。统计方法是基于最小错误率的贝叶斯决策规则。下面的语音识别系统设计,一般是基于音素的,这里概述了使用亚单位而不是整个符号模型的自动手语识别系统的想法。这种制度的优点是将来减少必要的训练材料。此外,期望简化现有词汇表的扩展,因为新符号可以添加到词汇库中,而无需重新训练子单元的现有隐马尔可夫模型(hmm)。由于很难定义手语的子单位,这种方法采用完全自组织的子单位。在最初的经验中,对100个先前训练过的符号的识别准确率达到了92.5%。在未对亚单位hmm进行再训练的情况下,对50个新体征的准确率达到81%。
{"title":"Video-based sign recognition using self-organizing subunits","authors":"Britta Bauer, K. Kraiss","doi":"10.1109/ICPR.2002.1048332","DOIUrl":"https://doi.org/10.1109/ICPR.2002.1048332","url":null,"abstract":"This paper deals with the automatic recognition of German signs. The statistical approach is based on the Bayes decision rule for minimum error rate. Following speech recognition system designs, which are in general based on phonemes, here the idea of an automatic sign language recognition system using subunits rather than models for whole signs is outlined. The advantage of such a system will be a future reduction of necessary training material. Furthermore, a simplified enlargement of the existing vocabulary is expected, as new signs can be added to the vocabulary database without re-training the existing hidden Markov models (HMMs) for subunits. Since it is difficult to define subunits for sign language, this approach employs totally self-organized subunits. In first experiences a recognition accuracy of 92,5% was achieved for 100 signs, which were previously trained. For 50 new signs an accuracy of 81% was achieved without retraining of subunit-HMMs.","PeriodicalId":159502,"journal":{"name":"Object recognition supported by user interaction for service robots","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128666859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 83
期刊
Object recognition supported by user interaction for service robots
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1