Authors: Wei Pan, F. Deschênes
Venue: The 3rd Canadian Conference on Computer and Robot Vision (CRV'06)
Publication date: 2006-06-07
DOI: 10.1109/CRV.2006.44 (https://doi.org/10.1109/CRV.2006.44)
Citation count: 6
Interpreting Camera Operations in the Context of Content-based Video Indexing and Retrieval
In this work, we take a step toward bridging the gap between low-level media features (e.g., color, texture, motion) and high-level concepts, with the aim of reliable content-based indexing and retrieval. More specifically, we propose a new way to connect geometric and radiometric deformations with their characterization in terms of camera operations. Based on both apparent motion and defocus blur (low-level features), we estimate changes in extrinsic and intrinsic camera parameters and then deduce 3D camera operations (i.e., mid-level features) such as panning/tracking, tilting/booming, zooming/dollying, and rolling, as well as focus changes. Finally, the camera operations are recorded in an index that is then used for video retrieval. Experiments confirm that the proposed mid-level features can be accurately deduced from low-level features and that they can be used for indexing and retrieval purposes.
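The last two stages of this pipeline (deducing an operation per frame, then recording operations in a searchable index) can be illustrated with a minimal Python sketch. It is not the authors' method: the `FrameMotion` fields, their units, the `THRESHOLDS` values, and the dominant-ratio labeling rule are all assumptions made for illustration, and the estimation of camera parameter changes from apparent motion and defocus blur is taken as already done. Each label stands for one of the paired operations named in the abstract (for example, `pan` for panning/tracking).

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass
class FrameMotion:
    """Hypothetical per-frame parameter changes (names and units assumed)."""
    pan: float    # horizontal image shift
    tilt: float   # vertical image shift
    zoom: float   # relative scale change
    roll: float   # in-plane rotation
    focus: float  # change in defocus blur


# Assumed noise thresholds; a change below its threshold is ignored.
THRESHOLDS = {"pan": 1.0, "tilt": 1.0, "zoom": 0.01, "roll": 0.5, "focus": 0.2}


def label_frame(motion: FrameMotion) -> str:
    """Return the operation that most exceeds its threshold, else 'static'."""
    best, best_ratio = "static", 1.0
    for name, threshold in THRESHOLDS.items():
        ratio = abs(getattr(motion, name)) / threshold
        if ratio > best_ratio:
            best, best_ratio = name, ratio
    return best


def build_index(motions):
    """Merge runs of equal labels into (label, first_frame, last_frame)."""
    labels = [label_frame(m) for m in motions]
    index, position = [], 0
    for label, run in groupby(labels):
        length = len(list(run))
        index.append((label, position, position + length - 1))
        position += length
    return index


def query(index, operation):
    """Return the frame ranges recorded for one camera operation."""
    return [(start, end) for label, start, end in index if label == operation]


if __name__ == "__main__":
    still = FrameMotion(0.0, 0.0, 0.0, 0.0, 0.0)
    panning = FrameMotion(3.0, 0.0, 0.0, 0.0, 0.0)
    zooming = FrameMotion(0.0, 0.0, 0.05, 0.0, 0.0)
    index = build_index([still] * 2 + [panning] * 3 + [zooming] * 2)
    print(index)
    print(query(index, "pan"))
```

Storing segments rather than per-frame labels keeps the index compact and lets a retrieval query such as "find all panning shots" return frame ranges directly.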