{"title":"Action Recognition for Videos by Long-Term Point Trajectory Analysis with Background Removal","authors":"Yuze Xiang, Y. Okada, Kosuke Kaneko","doi":"10.1109/SITIS.2016.13","DOIUrl":null,"url":null,"abstract":"Recently, dense trajectories were shown to be an efficient video motion representation for action recognition and achieved state-of-the-art results on a variety of video datasets. This paper improves their performance by taking into account camera motion. To estimate camera motion, the authors use long-term point trajectory analysis to cluster image points and propose an algorithm to find possible background cluster from these clusters according to background nature in a video. Considering the original clusters could not segment the foreground and background very well. The authors optimize the background cluster, and use the cluster to rectify the trajectory. Experimental results on three challenging action datasets (i.e., Hollywood2, Olympic Sports and UCF50) show that the rectified trajectories significantly outperform original dense trajectories.","PeriodicalId":403704,"journal":{"name":"2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2016.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Recently, dense trajectories were shown to be an efficient video motion representation for action recognition and achieved state-of-the-art results on a variety of video datasets. This paper improves their performance by taking into account camera motion. To estimate camera motion, the authors use long-term point trajectory analysis to cluster image points and propose an algorithm to find possible background cluster from these clusters according to background nature in a video. Considering the original clusters could not segment the foreground and background very well. The authors optimize the background cluster, and use the cluster to rectify the trajectory. Experimental results on three challenging action datasets (i.e., Hollywood2, Olympic Sports and UCF50) show that the rectified trajectories significantly outperform original dense trajectories.