基于概率潜在语义分析的图像标注无监督分类

2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2014-11-01 DOI:10.1109/DICTA.2014.7008133

Abass A. Olaode, G. Naghdy, Catherine A. Todd

{"title":"基于概率潜在语义分析的图像标注无监督分类","authors":"Abass A. Olaode, G. Naghdy, Catherine A. Todd","doi":"10.1109/DICTA.2014.7008133","DOIUrl":null,"url":null,"abstract":"Image annotation has been identified to be a suitable means by which the semantic gap which has made the accuracy of Content-based image retrieval unsatisfactory be eliminated. However existing methods of automatic annotation of images depends on supervised learning, which can be difficult to implement due to the need for manually annotated training samples which are not always readily available. This paper argues that the unsupervised learning via Probabilistic Latent Semantic Analysis provides a more suitable machine learning approach for image annotation especially due to its potential to based categorisation on the latent semantic content of the image samples, which can bridge the semantic gap present in Content Based Image Retrieval. This paper therefore proposes an unsupervised image categorisation model in which the semantic content of images are discovered using Probabilistic Latent Semantic Analysis, after which they are clustered into unique groups based on semantic content similarities using K-means algorithm, thereby providing suitable annotation exemplars. A common problem with categorisation algorithms based on Bag-of-Visual Words modelling is the loss of accuracy due to spatial incoherency of the Bag-of-Visual Word modelling, this paper also examines the effectiveness of Spatial pyramid as a means of eliminating spatial incoherency in Probabilistic Latent Semantic Analysis classification.","PeriodicalId":146695,"journal":{"name":"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"Unsupervised Image Classification by Probabilistic Latent Semantic Analysis for the Annotation of Images\",\"authors\":\"Abass A. Olaode, G. Naghdy, Catherine A. Todd\",\"doi\":\"10.1109/DICTA.2014.7008133\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Image annotation has been identified to be a suitable means by which the semantic gap which has made the accuracy of Content-based image retrieval unsatisfactory be eliminated. However existing methods of automatic annotation of images depends on supervised learning, which can be difficult to implement due to the need for manually annotated training samples which are not always readily available. This paper argues that the unsupervised learning via Probabilistic Latent Semantic Analysis provides a more suitable machine learning approach for image annotation especially due to its potential to based categorisation on the latent semantic content of the image samples, which can bridge the semantic gap present in Content Based Image Retrieval. This paper therefore proposes an unsupervised image categorisation model in which the semantic content of images are discovered using Probabilistic Latent Semantic Analysis, after which they are clustered into unique groups based on semantic content similarities using K-means algorithm, thereby providing suitable annotation exemplars. A common problem with categorisation algorithms based on Bag-of-Visual Words modelling is the loss of accuracy due to spatial incoherency of the Bag-of-Visual Word modelling, this paper also examines the effectiveness of Spatial pyramid as a means of eliminating spatial incoherency in Probabilistic Latent Semantic Analysis classification.\",\"PeriodicalId\":146695,\"journal\":{\"name\":\"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2014.7008133\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2014.7008133","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 16

摘要

图像标注是消除基于内容的图像检索精度不理想的语义缺口的一种合适的方法。然而，现有的图像自动标注方法依赖于监督学习，由于需要手动标注的训练样本，而这些样本并不总是现成的，因此很难实现。本文认为，通过概率潜在语义分析的无监督学习为图像标注提供了一种更合适的机器学习方法，特别是由于它有可能基于图像样本的潜在语义内容进行分类，这可以弥补基于内容的图像检索中存在的语义差距。因此，本文提出了一种无监督图像分类模型，该模型使用概率潜在语义分析方法发现图像的语义内容，然后使用K-means算法根据语义内容相似度将图像聚类成唯一的组，从而提供合适的注释范例。基于Bag-of-Visual Words建模的分类算法的一个常见问题是由于Bag-of-Visual Word建模的空间不相干而导致准确性损失，本文还研究了空间金字塔作为消除概率潜在语义分析分类中空间不相干的一种手段的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Unsupervised Image Classification by Probabilistic Latent Semantic Analysis for the Annotation of Images

Image annotation has been identified to be a suitable means by which the semantic gap which has made the accuracy of Content-based image retrieval unsatisfactory be eliminated. However existing methods of automatic annotation of images depends on supervised learning, which can be difficult to implement due to the need for manually annotated training samples which are not always readily available. This paper argues that the unsupervised learning via Probabilistic Latent Semantic Analysis provides a more suitable machine learning approach for image annotation especially due to its potential to based categorisation on the latent semantic content of the image samples, which can bridge the semantic gap present in Content Based Image Retrieval. This paper therefore proposes an unsupervised image categorisation model in which the semantic content of images are discovered using Probabilistic Latent Semantic Analysis, after which they are clustered into unique groups based on semantic content similarities using K-means algorithm, thereby providing suitable annotation exemplars. A common problem with categorisation algorithms based on Bag-of-Visual Words modelling is the loss of accuracy due to spatial incoherency of the Bag-of-Visual Word modelling, this paper also examines the effectiveness of Spatial pyramid as a means of eliminating spatial incoherency in Probabilistic Latent Semantic Analysis classification.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

自引率

0.00%

发文量

期刊最新文献

3D Reconstruction of Planar Patches Seen by Omnidirectional Cameras A Blind and Robust Video Watermarking Scheme Using Chrominance Embedding Multi-Focus Image Fusion via Boundary Finding and Multi-Scale Morphological Focus-Measure Effect of Smoothing on Sparsity Prior CT Reconstruction Discriminative Key Pose Extraction Using Extended LC-KSVD for Action Recognition