{"title":"基于层次局部区域分类的视频目标分割","authors":"Chenguang Zhang, H. Ai","doi":"10.1109/ACPR.2011.6166545","DOIUrl":null,"url":null,"abstract":"Video Object Segmentation (VOS) is to cut out a selected object from video sequences, where the main difficulties are shape deformation, appearance variations and background clutter. To cope with these difficulties, we propose a novel method, named as Hierarchical Localized Classification of Regions (HLCR). We suggest that appearance models as well as the spatial and temporal coherence between frames are the keys to break through bottleneck. Locally, in order to identify foreground regions, we propose to use Hierarchial Localized Classifiers, which organize regional features as decision trees. In global, we adopt Gaussian Mixture Color Models (GMMs). After integrating the local and global results into a probability mask, we can achieve the final segmentation result by graph cut. Experiments on various challenging video sequences demonstrate the efficiency and adaptability of the proposed method.","PeriodicalId":287232,"journal":{"name":"The First Asian Conference on Pattern Recognition","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Video Object Segmentation by Hierarchical Localized Classification of Regions\",\"authors\":\"Chenguang Zhang, H. Ai\",\"doi\":\"10.1109/ACPR.2011.6166545\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Video Object Segmentation (VOS) is to cut out a selected object from video sequences, where the main difficulties are shape deformation, appearance variations and background clutter. To cope with these difficulties, we propose a novel method, named as Hierarchical Localized Classification of Regions (HLCR). We suggest that appearance models as well as the spatial and temporal coherence between frames are the keys to break through bottleneck. Locally, in order to identify foreground regions, we propose to use Hierarchial Localized Classifiers, which organize regional features as decision trees. In global, we adopt Gaussian Mixture Color Models (GMMs). After integrating the local and global results into a probability mask, we can achieve the final segmentation result by graph cut. Experiments on various challenging video sequences demonstrate the efficiency and adaptability of the proposed method.\",\"PeriodicalId\":287232,\"journal\":{\"name\":\"The First Asian Conference on Pattern Recognition\",\"volume\":\"87 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The First Asian Conference on Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACPR.2011.6166545\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The First Asian Conference on Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACPR.2011.6166545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Video Object Segmentation by Hierarchical Localized Classification of Regions
Video Object Segmentation (VOS) is to cut out a selected object from video sequences, where the main difficulties are shape deformation, appearance variations and background clutter. To cope with these difficulties, we propose a novel method, named as Hierarchical Localized Classification of Regions (HLCR). We suggest that appearance models as well as the spatial and temporal coherence between frames are the keys to break through bottleneck. Locally, in order to identify foreground regions, we propose to use Hierarchial Localized Classifiers, which organize regional features as decision trees. In global, we adopt Gaussian Mixture Color Models (GMMs). After integrating the local and global results into a probability mask, we can achieve the final segmentation result by graph cut. Experiments on various challenging video sequences demonstrate the efficiency and adaptability of the proposed method.