Yie-Tarng Chen, Wen-Hsien Fang, Shimeng Dai, Choa-Chuan Lu
{"title":"基于稀疏编码和时间金字塔池的骨骼运动姿态人体跌倒检测","authors":"Yie-Tarng Chen, Wen-Hsien Fang, Shimeng Dai, Choa-Chuan Lu","doi":"10.1109/ICASI52993.2021.9568464","DOIUrl":null,"url":null,"abstract":"This paper presents an efficient, yet high-accurate skeleton-based approach for human fall detection, which combines the sparse coding and temporal pyramid pooling techniques. The new method first separates the skeleton joints into five different parts, for each of which a moving pose descriptor is extracted to represent the human sub-actions. The principal component analysis is then employed to reduce the dimensions of the descriptors. Afterwards, sparse coding is invoked to encode each descriptor separately. Finally, these encoded descriptors are treated as a set of time series and then aggregated into the final video descriptors by temporal pyramid pooling, which can acquire temporal tendency to further boost the performance. Experimental results show that the new approach outperforms the state-of-the-art works on some commonly used datasets.","PeriodicalId":103254,"journal":{"name":"2021 7th International Conference on Applied System Innovation (ICASI)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Skeleton Moving Pose-based Human Fall Detection with Sparse Coding and Temporal Pyramid Pooling\",\"authors\":\"Yie-Tarng Chen, Wen-Hsien Fang, Shimeng Dai, Choa-Chuan Lu\",\"doi\":\"10.1109/ICASI52993.2021.9568464\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an efficient, yet high-accurate skeleton-based approach for human fall detection, which combines the sparse coding and temporal pyramid pooling techniques. The new method first separates the skeleton joints into five different parts, for each of which a moving pose descriptor is extracted to represent the human sub-actions. The principal component analysis is then employed to reduce the dimensions of the descriptors. Afterwards, sparse coding is invoked to encode each descriptor separately. Finally, these encoded descriptors are treated as a set of time series and then aggregated into the final video descriptors by temporal pyramid pooling, which can acquire temporal tendency to further boost the performance. Experimental results show that the new approach outperforms the state-of-the-art works on some commonly used datasets.\",\"PeriodicalId\":103254,\"journal\":{\"name\":\"2021 7th International Conference on Applied System Innovation (ICASI)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th International Conference on Applied System Innovation (ICASI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASI52993.2021.9568464\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Applied System Innovation (ICASI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASI52993.2021.9568464","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Skeleton Moving Pose-based Human Fall Detection with Sparse Coding and Temporal Pyramid Pooling
This paper presents an efficient, yet high-accurate skeleton-based approach for human fall detection, which combines the sparse coding and temporal pyramid pooling techniques. The new method first separates the skeleton joints into five different parts, for each of which a moving pose descriptor is extracted to represent the human sub-actions. The principal component analysis is then employed to reduce the dimensions of the descriptors. Afterwards, sparse coding is invoked to encode each descriptor separately. Finally, these encoded descriptors are treated as a set of time series and then aggregated into the final video descriptors by temporal pyramid pooling, which can acquire temporal tendency to further boost the performance. Experimental results show that the new approach outperforms the state-of-the-art works on some commonly used datasets.