基于内容的视频标引在计算机辅助描述视频中的应用

L. Gagnon, F. Laliberté, M. Lalonde, M. Beaulieu
{"title":"基于内容的视频标引在计算机辅助描述视频中的应用","authors":"L. Gagnon, F. Laliberté, M. Lalonde, M. Beaulieu","doi":"10.1109/CRV.2006.78","DOIUrl":null,"url":null,"abstract":"This paper presents the status of a project targeting the development of content-based video indexing tools, to assist a human in the generation of descriptive video for the hard of seeing people. We describe three main elements: (1) the video content that is pertinent for computer-assisted descriptive video, (2) the system dataflow, based on a light plug-in architecture of an open-source video processing software and (3) the first version of the plug-ins developed to date. Plugs-ins that are under development include shot transition detection, key-frames identification, keyface detection, key-text spotting, visual motion mapping, face recognition, facial characterization, story segmentation, gait/gesture characterization, keyplace recognition, key-object spotting and image categorization. Some of these tools are adapted from our previous works on video surveillance, audiovisual speech recognition and content-based video indexing of documentary films. We do not focus on the algorithmic details in this paper neither on the global performance since the integration is done yet. We rather concentrate on discussing application issues of automatic descriptive video usability aspects.","PeriodicalId":369170,"journal":{"name":"The 3rd Canadian Conference on Computer and Robot Vision (CRV'06)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Toward an Application of Content-Based Video Indexing to Computer- Assisted Descriptive Video\",\"authors\":\"L. Gagnon, F. Laliberté, M. Lalonde, M. Beaulieu\",\"doi\":\"10.1109/CRV.2006.78\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents the status of a project targeting the development of content-based video indexing tools, to assist a human in the generation of descriptive video for the hard of seeing people. We describe three main elements: (1) the video content that is pertinent for computer-assisted descriptive video, (2) the system dataflow, based on a light plug-in architecture of an open-source video processing software and (3) the first version of the plug-ins developed to date. Plugs-ins that are under development include shot transition detection, key-frames identification, keyface detection, key-text spotting, visual motion mapping, face recognition, facial characterization, story segmentation, gait/gesture characterization, keyplace recognition, key-object spotting and image categorization. Some of these tools are adapted from our previous works on video surveillance, audiovisual speech recognition and content-based video indexing of documentary films. We do not focus on the algorithmic details in this paper neither on the global performance since the integration is done yet. We rather concentrate on discussing application issues of automatic descriptive video usability aspects.\",\"PeriodicalId\":369170,\"journal\":{\"name\":\"The 3rd Canadian Conference on Computer and Robot Vision (CRV'06)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-06-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The 3rd Canadian Conference on Computer and Robot Vision (CRV'06)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CRV.2006.78\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 3rd Canadian Conference on Computer and Robot Vision (CRV'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CRV.2006.78","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

摘要

本文介绍了一个以开发基于内容的视频索引工具为目标的项目的现状,以帮助人类为弱视人群生成描述性视频。我们描述了三个主要元素:(1)与计算机辅助描述性视频相关的视频内容;(2)基于开源视频处理软件的轻型插件架构的系统数据流;(3)迄今为止开发的插件的第一个版本。正在开发的插件包括镜头转换检测、关键帧识别、关键人脸检测、关键文本识别、视觉运动映射、人脸识别、面部特征、故事分割、步态/手势特征、关键位置识别、关键对象识别和图像分类。其中一些工具改编自我们之前在视频监控、视听语音识别和基于内容的纪录片视频索引方面的工作。在本文中,我们不关注算法的细节,也不关注全局性能,因为集成还没有完成。我们更专注于讨论自动描述视频可用性方面的应用问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Toward an Application of Content-Based Video Indexing to Computer- Assisted Descriptive Video
This paper presents the status of a project targeting the development of content-based video indexing tools, to assist a human in the generation of descriptive video for the hard of seeing people. We describe three main elements: (1) the video content that is pertinent for computer-assisted descriptive video, (2) the system dataflow, based on a light plug-in architecture of an open-source video processing software and (3) the first version of the plug-ins developed to date. Plugs-ins that are under development include shot transition detection, key-frames identification, keyface detection, key-text spotting, visual motion mapping, face recognition, facial characterization, story segmentation, gait/gesture characterization, keyplace recognition, key-object spotting and image categorization. Some of these tools are adapted from our previous works on video surveillance, audiovisual speech recognition and content-based video indexing of documentary films. We do not focus on the algorithmic details in this paper neither on the global performance since the integration is done yet. We rather concentrate on discussing application issues of automatic descriptive video usability aspects.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Image Classification and Retrieval using Correlation Photometric Stereo with Nearby Planar Distributed Illuminants Evolving a Vision-Based Line-Following Robot Controller Line Extraction with Composite Background Subtract The Nomad 200 and the Nomad SuperScout: Reverse engineered and resurrected
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1