Scalable classifiers for Internet vision tasks

2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Pub Date : 2008-06-23 DOI:10.1109/CVPRW.2008.4562958

Tom Yeh, John J. Lee, Trevor Darrell

{"title":"Scalable classifiers for Internet vision tasks","authors":"Tom Yeh, John J. Lee, Trevor Darrell","doi":"10.1109/CVPRW.2008.4562958","DOIUrl":null,"url":null,"abstract":"Object recognition systems designed for Internet applications typically need to adapt to userspsila needs in a flexible fashion and scale up to very large data sets. In this paper, we analyze the complexity of several multiclass SVM-based algorithms and highlight the computational bottleneck they suffer at test time: comparing the input image to every training image. We propose an algorithm that overcomes this bottleneck; it offers not only the efficiency of a simple nearest-neighbor classifier, by voting on class labels based on the k nearest neighbors quickly determined by a vocabulary tree, but also the recognition accuracy comparable to that of a complex SVM classifier, by incorporating SVM parameters into the voting scores incrementally accumulated from individual image features. Empirical results demonstrate that adjusting votes by relevant support vector weights can improve the recognition accuracy of a nearest-neighbor classifier without sacrificing speed. Compared to existing methods, our algorithm achieves a ten-fold speed increase while incurring an acceptable accuracy loss that can be easily offset by showing about two more labels in the result. The speed, scalability, and adaptability of our algorithm makes it suitable for Internet vision applications.","PeriodicalId":102206,"journal":{"name":"2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPRW.2008.4562958","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Object recognition systems designed for Internet applications typically need to adapt to userspsila needs in a flexible fashion and scale up to very large data sets. In this paper, we analyze the complexity of several multiclass SVM-based algorithms and highlight the computational bottleneck they suffer at test time: comparing the input image to every training image. We propose an algorithm that overcomes this bottleneck; it offers not only the efficiency of a simple nearest-neighbor classifier, by voting on class labels based on the k nearest neighbors quickly determined by a vocabulary tree, but also the recognition accuracy comparable to that of a complex SVM classifier, by incorporating SVM parameters into the voting scores incrementally accumulated from individual image features. Empirical results demonstrate that adjusting votes by relevant support vector weights can improve the recognition accuracy of a nearest-neighbor classifier without sacrificing speed. Compared to existing methods, our algorithm achieves a ten-fold speed increase while incurring an acceptable accuracy loss that can be easily offset by showing about two more labels in the result. The speed, scalability, and adaptability of our algorithm makes it suitable for Internet vision applications.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于互联网视觉任务的可扩展分类器

为Internet应用程序设计的对象识别系统通常需要以灵活的方式适应用户的需求，并扩展到非常大的数据集。在本文中，我们分析了几种基于svm的多类算法的复杂性，并强调了它们在测试时遇到的计算瓶颈:将输入图像与每个训练图像进行比较。我们提出了一种克服这一瓶颈的算法;它不仅提供了简单的最近邻分类器的效率，通过基于词汇树快速确定的k个最近邻对类标签进行投票，而且通过将SVM参数纳入从单个图像特征逐渐积累的投票分数中，其识别精度可与复杂的SVM分类器相媲美。实证结果表明，通过相关支持向量权重调整投票可以在不牺牲速度的情况下提高最近邻分类器的识别精度。与现有方法相比，我们的算法实现了十倍的速度提升，同时产生了可接受的精度损失，可以通过在结果中显示大约两个标签来轻松抵消。该算法的速度、可扩展性和适应性使其适合于互联网视觉应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

自引率

0.00%

发文量