{"title":"Summarizing Surveillance Video by Saliency Transition and Moving Object Information","authors":"M. Salehin, M. Paul","doi":"10.1109/DICTA.2015.7371311","DOIUrl":null,"url":null,"abstract":"Everyday an enormous amount of video is captured by surveillance system for various purposes around the whole world. However, this is almost impossible for human to analyze the vast majority of video data. In this paper, a video summarization method is introduced combining foreground object, motion, and visual attention cue. Foreground objects typically provide important information about video contents. Additionally, object motion is naturally more attractive to human being. Moreover, visual attention cue indicates the human's attraction label for key frame determination. Using these features, supervised classifier support vector machine (SVM) is applied to obtain the key frames from a surveillance video. Extensive experimental results show that the proposed method performs superior to the state-of-the-art method using publicly available BL-7F surveillance video dataset.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"58 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2015.7371311","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Everyday an enormous amount of video is captured by surveillance system for various purposes around the whole world. However, this is almost impossible for human to analyze the vast majority of video data. In this paper, a video summarization method is introduced combining foreground object, motion, and visual attention cue. Foreground objects typically provide important information about video contents. Additionally, object motion is naturally more attractive to human being. Moreover, visual attention cue indicates the human's attraction label for key frame determination. Using these features, supervised classifier support vector machine (SVM) is applied to obtain the key frames from a surveillance video. Extensive experimental results show that the proposed method performs superior to the state-of-the-art method using publicly available BL-7F surveillance video dataset.