Interpreting Camera Operations in the Context of Content-based Video Indexing and Retrieval

The 3rd Canadian Conference on Computer and Robot Vision (CRV'06). Pub Date: 2006-06-07. DOI: 10.1109/CRV.2006.44

Wei Pan, F. Deschênes


Citations: 6

Abstract

In this work, we aim to go one step further in bridging the gap between low-level media features (e.g. color, texture, motion) and high-level concepts, so that content-based indexing and retrieval can be performed reliably. More specifically, we propose a new way to relate geometric and radiometric deformations to camera operations. Based on both the apparent motion and the defocus blur (low-level features), we estimate changes in the extrinsic and intrinsic camera parameters, and then deduce 3D camera operations (i.e. mid-level features), such as panning/tracking, tilting/booming, zooming/dollying and rolling, as well as focus changes. Finally, the camera operations are recorded in an index, which is then used for video retrieval. Experiments confirm that the proposed mid-level features can be accurately deduced from low-level features and that they can be used for indexing and retrieval purposes.
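The abstract does not give the estimation procedure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the apparent motion of each frame has already been summarized by a 2D affine flow model, labels each frame with coarse camera operations, and stores runs of identical labels as index entries that can be queried. The function names (`label_frame`, `build_index`, `query`) and the thresholds are hypothetical, and focus changes derived from defocus blur are left out.

```python
from itertools import groupby


def label_frame(affine, t_thresh=1.0, s_thresh=0.01):
    """Label one frame from an assumed affine flow model.

    affine = (a0, a1, a2, b0, b1, b2) for
        u = a0 + a1*x + a2*y,  v = b0 + b1*x + b2*y
    with (x, y) measured from the image centre. Thresholds are arbitrary.
    """
    a0, a1, a2, b0, b1, b2 = affine
    divergence = 0.5 * (a1 + b2)  # isotropic scale change: zoom or dolly
    curl = 0.5 * (b1 - a2)        # in-plane rotation: roll
    labels = []
    if abs(a0) > t_thresh:
        labels.append("pan/track")   # dominant horizontal translation
    if abs(b0) > t_thresh:
        labels.append("tilt/boom")   # dominant vertical translation
    if abs(divergence) > s_thresh:
        labels.append("zoom/dolly")
    if abs(curl) > s_thresh:
        labels.append("roll")
    return tuple(labels) or ("static",)


def build_index(per_frame_affine):
    """Collapse per-frame labels into (first_frame, last_frame, labels) runs."""
    labels = [label_frame(a) for a in per_frame_affine]
    index, frame = [], 0
    for label, run in groupby(labels):
        n = len(list(run))
        index.append((frame, frame + n - 1, label))
        frame += n
    return index


def query(index, operation):
    """Return the frame ranges whose labels include the given operation."""
    return [(s, e) for s, e, labels in index if operation in labels]


# Synthetic affine parameters: 2 static frames, 3 panning, 2 zooming, 1 rolling.
frames = (
    [(0.0, 0.0, 0.0, 0.0, 0.0, 0.0)] * 2
    + [(3.0, 0.0, 0.0, 0.0, 0.0, 0.0)] * 3
    + [(0.0, 0.05, 0.0, 0.0, 0.0, 0.05)] * 2
    + [(0.0, 0.0, -0.03, 0.0, 0.03, 0.0)]
)
index = build_index(frames)
pan_segments = query(index, "pan/track")
```

A 2D model of this kind cannot by itself separate panning from tracking, or zooming from dollying, which is why the labels are kept as paired names here.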
