Multi-Modal Video Concept Extraction Using Co-Training

2005 IEEE International Conference on Multimedia and Expo. Pub Date: 2005-07-06. DOI: 10.1109/ICME.2005.1521473

Rong Yan, M. Naphade

Citations: 14

Abstract

Large-scale automatic semantic video characterization requires learning and modeling a large number of semantic concepts. A major obstacle is the shortage of labeled training samples. Semi-supervised learning algorithms such as co-training may help by incorporating a large amount of unlabeled data, so that redundant information across views can improve learning performance. Although co-training has been applied successfully in several domains, it has not previously been used to detect video concepts. In this paper, we extend co-training to video concept detection and investigate different co-training strategies and their effects on detection accuracy. We demonstrate performance following the guidelines of the TRECVID '03 semantic concept extraction task.
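For readers unfamiliar with co-training, the following is a minimal sketch of the generic idea: two classifiers, one per view, take turns pseudo-labeling the unlabeled samples they are most confident about. It is not the authors' implementation. The abstract does not specify the per-view classifiers, the features, or the selection strategies studied, so everything here is an illustrative assumption: the toy `NearestCentroid` model, the `co_train` function, the `rounds` and `per_round` parameters, and the synthetic two-view data.

```python
import random

def centroid(rows):
    """Mean vector of a list of equal-length feature lists."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

class NearestCentroid:
    """Toy binary classifier; a hypothetical stand-in for a real per-view model."""
    def fit(self, X, y):
        # Assumes both classes appear in the labeled set.
        self.c0 = centroid([x for x, t in zip(X, y) if t == 0])
        self.c1 = centroid([x for x, t in zip(X, y) if t == 1])
        return self

    def margin(self, x):
        # Positive means closer to class 1; the magnitude serves as confidence.
        return sq_dist(x, self.c0) - sq_dist(x, self.c1)

    def predict(self, x):
        return 1 if self.margin(x) > 0 else 0

def fit_views(views, labels):
    """Train one classifier per view on the currently labeled indices."""
    idx = sorted(labels)
    y = [labels[i] for i in idx]
    return [NearestCentroid().fit([v[i] for i in idx], y) for v in views]

def co_train(view_a, view_b, labels, rounds=5, per_round=2):
    """labels maps sample index -> 0/1 for the initially labeled samples.
    Returns the two per-view classifiers and the grown label dict."""
    labels = dict(labels)
    views = (view_a, view_b)
    for _ in range(rounds):
        clfs = fit_views(views, labels)
        unlabeled = [i for i in range(len(view_a)) if i not in labels]
        if not unlabeled:
            break
        # Each view pseudo-labels the samples it is most confident about,
        # which then become training data for both views in the next round.
        for clf, v in zip(clfs, views):
            ranked = sorted(unlabeled, key=lambda i: abs(clf.margin(v[i])),
                            reverse=True)
            for i in ranked[:per_round]:
                labels.setdefault(i, clf.predict(v[i]))
    return fit_views(views, labels), labels

# Synthetic two-view data (an assumption for illustration only):
# class 0 clusters near 0 and class 1 near 5 in both views.
rng = random.Random(0)
truth = [i % 2 for i in range(40)]
view_a = [[5 * t + rng.gauss(0, 0.3), 5 * t + rng.gauss(0, 0.3)] for t in truth]
view_b = [[5 * t + rng.gauss(0, 0.3)] for t in truth]
seed_labels = {0: 0, 1: 1, 2: 0, 3: 1}

(clf_a, clf_b), grown = co_train(view_a, view_b, seed_labels)
print(len(seed_labels), "labeled samples grew to", len(grown))
```

In a video setting the two views might be, for example, visual features and text from speech transcripts, but that pairing is also an assumption; the abstract only says that redundant information across views is exploited.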


