Arafat Rahman, Nazmun Nahid, I. Hassan, Md Atiqur Rahman Ahad
Published in: Adjunct Proceedings of the 2020 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2020 ACM International Symposium on Wearable Computers, 2020-09-10. DOI: 10.1145/3410530.3414334 (https://doi.org/10.1145/3410530.3414334)
Nurse care activity recognition: using random forest to handle imbalanced class problem
Nurse care activity recognition is a new and challenging research field in human activity recognition (HAR) because, unlike other activity recognition tasks, it suffers from a severe class imbalance problem and from intra-class variability that depends on both the subject and the care receiver. In this paper, we apply a Random Forest-based resampling method to address the class imbalance problem in the Heiseikai data, a nurse care activity dataset. The method consists of resampling, feature selection based on Gini impurity, and model training and validation with Stratified KFold cross-validation. Using the Random Forest classifier, we achieved 65.9% average cross-validation accuracy in classifying 12 activities conducted by nurses in both lab and real-life settings. Our team, "Britter Baire", developed this algorithmic pipeline for "The 2nd Nurse Care Activity Recognition Challenge Using Lab and Field Data".
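The building blocks named in the abstract can be illustrated with a minimal, library-free Python sketch. It is not the authors' implementation. The abstract does not say which resampling scheme was used, so random oversampling is an assumption here, as is applying it only to the training fold. The function names, the toy labels (`class_a`, `class_b`, `class_c`), and the fold count are hypothetical, and the Random Forest classifier itself is left out.

```python
import random
from collections import Counter, defaultdict


def gini_impurity(labels):
    """Gini impurity 1 - sum(p_k^2) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def random_oversample(samples, labels, seed=0):
    """Assumed resampling scheme: duplicate randomly chosen samples of each
    smaller class until every class matches the size of the largest one."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for x, y in zip(samples, labels):
        by_class[y].append(x)
    target = max(len(xs) for xs in by_class.values())
    out_x, out_y = [], []
    for y, xs in by_class.items():
        extra = [rng.choice(xs) for _ in range(target - len(xs))]
        out_x.extend(xs + extra)
        out_y.extend([y] * target)
    return out_x, out_y


def stratified_folds(labels, k=5, seed=0):
    """Assign sample indices to k folds so that each fold keeps roughly
    the class proportions of the full label list."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    folds = [[] for _ in range(k)]
    for idxs in by_class.values():
        rng.shuffle(idxs)
        for j, i in enumerate(idxs):
            folds[j % k].append(i)
    return folds


# Hypothetical imbalanced toy data: three classes with 20, 6 and 4 samples.
labels = ["class_a"] * 20 + ["class_b"] * 6 + ["class_c"] * 4
samples = [[float(i)] for i in range(len(labels))]

folds = stratified_folds(labels, k=2)
for held_out in range(len(folds)):
    # Train on every fold except the held-out one.
    train_idx = [i for f, fold in enumerate(folds) if f != held_out for i in fold]
    train_x, train_y = random_oversample(
        [samples[i] for i in train_idx], [labels[i] for i in train_idx]
    )
    # Class counts and label impurity of the balanced training fold.
    print(held_out, dict(Counter(train_y)), round(gini_impurity(train_y), 3))
```

Resampling inside the cross-validation loop, rather than before splitting, keeps duplicated samples out of the held-out fold. Whether the original pipeline did this is not stated in the abstract. In a full pipeline, a Random Forest would be fitted on each balanced training fold, and its impurity-based feature importances would drive the feature selection step.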