Kernel Discriminant Analysis Using Triangular Kernel for Semantic Scene Classification

M. Tahir, J. Kittler, F. Yan, K. Mikolajczyk
{"title":"Kernel Discriminant Analysis Using Triangular Kernel for Semantic Scene Classification","authors":"M. Tahir, J. Kittler, F. Yan, K. Mikolajczyk","doi":"10.1109/CBMI.2009.47","DOIUrl":null,"url":null,"abstract":"Semantic scene classification is a challenging research problem that aims to categorise images into semantic classes such as beaches, sunsets or mountains. This problem can be formulated as multi-labeled classification problem where an image can belong to more than one conceptual class such as sunsets and beaches at the same time. Recently, Kernel Discriminant Analysis combined with spectral regression (SR-KDA) has been successfully used for face, text and spoken letter recognition. But SR-KDA method works only with positive definite symmetric matrices. In this paper, we have modified this method to support both definite and indefinite symmetric matrices. The main idea is to use LDLT decomposition instead of Cholesky decomposition. The modified SR-KDA is applied to scene database involving 6 concepts. We validate the advocated approach and demonstrate that it yields significant performance gains when conditionally positive definite triangular kernel is used instead of positive definite symmetric kernels such as linear, polynomial or RBF. The results also indicate performance gains when compared with the state-of-the art multi-label methods for semantic scene classification.","PeriodicalId":417012,"journal":{"name":"2009 Seventh International Workshop on Content-Based Multimedia Indexing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Seventh International Workshop on Content-Based Multimedia Indexing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMI.2009.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Semantic scene classification is a challenging research problem that aims to categorise images into semantic classes such as beaches, sunsets or mountains. This problem can be formulated as multi-labeled classification problem where an image can belong to more than one conceptual class such as sunsets and beaches at the same time. Recently, Kernel Discriminant Analysis combined with spectral regression (SR-KDA) has been successfully used for face, text and spoken letter recognition. But SR-KDA method works only with positive definite symmetric matrices. In this paper, we have modified this method to support both definite and indefinite symmetric matrices. The main idea is to use LDLT decomposition instead of Cholesky decomposition. The modified SR-KDA is applied to scene database involving 6 concepts. We validate the advocated approach and demonstrate that it yields significant performance gains when conditionally positive definite triangular kernel is used instead of positive definite symmetric kernels such as linear, polynomial or RBF. The results also indicate performance gains when compared with the state-of-the art multi-label methods for semantic scene classification.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于三角核的语义场景分类核判别分析
语义场景分类是一个具有挑战性的研究问题,其目的是对图像进行语义分类,如海滩、日落或山脉。这个问题可以表述为多标签分类问题,其中图像可以同时属于多个概念类,例如日落和海滩。近年来,核判别分析结合谱回归(SR-KDA)已成功地应用于人脸、文本和语音字母识别。但SR-KDA方法只适用于正定对称矩阵。在本文中,我们对该方法进行了改进,使其同时支持定对称矩阵和不定对称矩阵。主要思想是使用LDLT分解而不是Cholesky分解。将改进后的SR-KDA应用于涉及6个概念的场景数据库。我们验证了所提倡的方法,并证明当使用条件正定三角形核而不是正定对称核(如线性,多项式或RBF)时,它会产生显着的性能提升。结果还表明,与最先进的多标签语义场景分类方法相比,性能有所提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Motion Vector Based Moving Object Detection and Tracking in the MPEG Compressed Domain A Comparison of L_1 Norm and L_2 Norm Multiple Kernel SVMs in Image and Video Classification Monophony vs Polyphony: A New Method Based on Weibull Bivariate Models Kernel Discriminant Analysis Using Triangular Kernel for Semantic Scene Classification Biometric Responses to Music-Rich Segments in Films: The CDVPlex
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1