通过图像和语言理解相结合的视频浏览和表征

Michael A. Smith, T. Kanade
{"title":"通过图像和语言理解相结合的视频浏览和表征","authors":"Michael A. Smith, T. Kanade","doi":"10.1109/CAIVD.1998.646034","DOIUrl":null,"url":null,"abstract":"Digital video is rapidly becoming important for education, entertainment and a host of multimedia applications. With the size of the video collections growing to thousands of hours, technology is needed to effectively browse segments in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a skim video which represents a very short synopsis of the original. The goal of this work is to show the utility of integrating language and image understanding techniques for video skimming by extraction of significant information, such as specific objects, audio keywords and relevant video structure. The resulting skim video is much shorter; where compaction is as high as 20:1, and yet retains the essential content of the original segment. We have conducted a user-study to test the content summarization and effectiveness of the skim as a browsing tool.","PeriodicalId":360087,"journal":{"name":"Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"202","resultStr":"{\"title\":\"Video skimming and characterization through the combination of image and language understanding\",\"authors\":\"Michael A. Smith, T. Kanade\",\"doi\":\"10.1109/CAIVD.1998.646034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Digital video is rapidly becoming important for education, entertainment and a host of multimedia applications. With the size of the video collections growing to thousands of hours, technology is needed to effectively browse segments in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a skim video which represents a very short synopsis of the original. The goal of this work is to show the utility of integrating language and image understanding techniques for video skimming by extraction of significant information, such as specific objects, audio keywords and relevant video structure. The resulting skim video is much shorter; where compaction is as high as 20:1, and yet retains the essential content of the original segment. We have conducted a user-study to test the content summarization and effectiveness of the skim as a browsing tool.\",\"PeriodicalId\":360087,\"journal\":{\"name\":\"Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"202\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAIVD.1998.646034\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAIVD.1998.646034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 202

摘要

数字视频正迅速成为教育、娱乐和大量多媒体应用的重要工具。随着视频集合的规模增长到数千小时,需要技术在短时间内有效地浏览片段而不丢失视频的内容。我们提出了一种方法来提取重要的音频和视频信息,并创建一个略读视频,它代表了一个非常简短的原始摘要。这项工作的目标是通过提取重要信息(如特定对象、音频关键字和相关视频结构)来展示集成语言和图像理解技术在视频浏览中的实用性。由此产生的略读视频要短得多;其中压实率高达20:1,但保留了原始段的基本内容。我们进行了一项用户研究,以测试内容摘要和略读作为浏览工具的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Video skimming and characterization through the combination of image and language understanding
Digital video is rapidly becoming important for education, entertainment and a host of multimedia applications. With the size of the video collections growing to thousands of hours, technology is needed to effectively browse segments in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a skim video which represents a very short synopsis of the original. The goal of this work is to show the utility of integrating language and image understanding techniques for video skimming by extraction of significant information, such as specific objects, audio keywords and relevant video structure. The resulting skim video is much shorter; where compaction is as high as 20:1, and yet retains the essential content of the original segment. We have conducted a user-study to test the content summarization and effectiveness of the skim as a browsing tool.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Selecting good keys for triangle-inequality-based pruning algorithms Viewpoint-invariant indexing for content-based image retrieval Image organization and retrieval using a flexible shape model Commercial video retrieval by induced semantics Video skimming and characterization through the combination of image and language understanding
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1