{"title":"动作识别的改进秩池策略","authors":"Fengqian Pang, Yue Li","doi":"10.1109/ISPDS56360.2022.9874009","DOIUrl":null,"url":null,"abstract":"In the field of action recognition, it is crucial to capture the temporal evolution of video content compactly and effectively. One of the solutions is the rank pooling method that enables acquiring the evolution of video content. To further enhance temporal discrimination of the rank pooling, we proposed two improved rank pooling strategies, named the Minimum Volume Enclosing Ellipsoids (MVEE) and the Temporal Minimum Volume Enclosing Ellipsoids (TMVEE). The proposed methods are compatible with rank pooling and characterize the data distribution in other orthogonal directions to improve the temporal discrimination. We performed experiments on the ChaLearn gesture recognition and HMDB51 database, the results reveal that our proposed methods outperform other mainstreaming methos.","PeriodicalId":280244,"journal":{"name":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Rank Pooling Strategy for Action Recognition\",\"authors\":\"Fengqian Pang, Yue Li\",\"doi\":\"10.1109/ISPDS56360.2022.9874009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the field of action recognition, it is crucial to capture the temporal evolution of video content compactly and effectively. One of the solutions is the rank pooling method that enables acquiring the evolution of video content. To further enhance temporal discrimination of the rank pooling, we proposed two improved rank pooling strategies, named the Minimum Volume Enclosing Ellipsoids (MVEE) and the Temporal Minimum Volume Enclosing Ellipsoids (TMVEE). The proposed methods are compatible with rank pooling and characterize the data distribution in other orthogonal directions to improve the temporal discrimination. We performed experiments on the ChaLearn gesture recognition and HMDB51 database, the results reveal that our proposed methods outperform other mainstreaming methos.\",\"PeriodicalId\":280244,\"journal\":{\"name\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPDS56360.2022.9874009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPDS56360.2022.9874009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Rank Pooling Strategy for Action Recognition
In the field of action recognition, it is crucial to capture the temporal evolution of video content compactly and effectively. One of the solutions is the rank pooling method that enables acquiring the evolution of video content. To further enhance temporal discrimination of the rank pooling, we proposed two improved rank pooling strategies, named the Minimum Volume Enclosing Ellipsoids (MVEE) and the Temporal Minimum Volume Enclosing Ellipsoids (TMVEE). The proposed methods are compatible with rank pooling and characterize the data distribution in other orthogonal directions to improve the temporal discrimination. We performed experiments on the ChaLearn gesture recognition and HMDB51 database, the results reveal that our proposed methods outperform other mainstreaming methos.