Toward an Application of Content-Based Video Indexing to Computer-Assisted Descriptive Video

The 3rd Canadian Conference on Computer and Robot Vision (CRV'06). Pub Date: 2006-06-07. DOI: 10.1109/CRV.2006.78

L. Gagnon, F. Laliberté, M. Lalonde, M. Beaulieu


Citations: 12

Abstract



Toward an Application of Content-Based Video Indexing to Computer-Assisted Descriptive Video

This paper presents the status of a project targeting the development of content-based video indexing tools to assist a human in generating descriptive video for visually impaired people. We describe three main elements: (1) the video content that is pertinent for computer-assisted descriptive video, (2) the system dataflow, based on a light plug-in architecture of an open-source video processing software, and (3) the first version of the plug-ins developed to date. Plug-ins under development include shot transition detection, key-frame identification, key-face detection, key-text spotting, visual motion mapping, face recognition, facial characterization, story segmentation, gait/gesture characterization, key-place recognition, key-object spotting and image categorization. Some of these tools are adapted from our previous work on video surveillance, audiovisual speech recognition and content-based video indexing of documentary films. In this paper we focus neither on the algorithmic details nor on the overall performance, since the integration is not yet complete. Instead, we concentrate on application issues and usability aspects of automatic descriptive video.
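To make the idea of a light plug-in dataflow more concrete, the following is a minimal Python sketch, not the authors' system. It assumes that each plug-in is a function taking a list of frames and returning annotations, that frames are flattened grayscale pixel lists, and that shot transitions can be approximated by a histogram difference between consecutive frames. All names (`Annotation`, `detect_shot_transitions`, `identify_key_frames`, `run_pipeline`) and the threshold value are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Frame = List[int]  # assumption: one flattened grayscale frame, pixel values 0..255


@dataclass
class Annotation:
    plugin: str       # name of the plug-in that produced the annotation
    frame_index: int  # frame the annotation refers to
    label: str        # short human-readable description


def histogram(frame: Frame, bins: int = 8) -> List[float]:
    """Normalized intensity histogram of one frame."""
    counts = [0] * bins
    for pixel in frame:
        counts[min(pixel * bins // 256, bins - 1)] += 1
    total = len(frame) or 1
    return [c / total for c in counts]


def detect_shot_transitions(frames: List[Frame], threshold: float = 0.5) -> List[Annotation]:
    """Flag frame i as a cut when its histogram differs strongly from frame i-1.

    Histogram differencing is a common baseline, used here only as a stand-in.
    """
    annotations = []
    for i in range(1, len(frames)):
        prev, curr = histogram(frames[i - 1]), histogram(frames[i])
        # Half the L1 distance between normalized histograms lies in [0, 1].
        distance = sum(abs(a - b) for a, b in zip(prev, curr)) / 2
        if distance > threshold:
            annotations.append(Annotation("shot_transition", i, "cut"))
    return annotations


def identify_key_frames(frames: List[Frame]) -> List[Annotation]:
    """Hypothetical key-frame rule: take the first frame of every detected shot."""
    if not frames:
        return []
    starts = [0] + [a.frame_index for a in detect_shot_transitions(frames)]
    return [Annotation("key_frame", s, "first frame of shot") for s in starts]


Plugin = Callable[[List[Frame]], List[Annotation]]

# Registry of plug-ins; new tools would be added here without touching the runner.
PLUGINS: Dict[str, Plugin] = {
    "shot_transition": detect_shot_transitions,
    "key_frame": identify_key_frames,
}


def run_pipeline(frames: List[Frame], plugins: Dict[str, Plugin] = PLUGINS) -> List[Annotation]:
    """Run every registered plug-in and merge the annotations in frame order."""
    results: List[Annotation] = []
    for plugin in plugins.values():
        results.extend(plugin(frames))
    return sorted(results, key=lambda a: (a.frame_index, a.plugin))
```

In a sketch like this, the merged annotation list is what a human describer would review: the plug-ins only propose candidate events, and the registry keeps each tool independent of the others.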
