Benn Henderson, P. Yogarajah, B. Gardiner, M. McGinnity, Kitty Forster, B. Nicholas, D. Wimpory, J. Wanigasinghe
{"title":"Effects of Intra-Subject Variation in Gait Analysis on ASD Classification Performance in Machine Learning Models","authors":"Benn Henderson, P. Yogarajah, B. Gardiner, M. McGinnity, Kitty Forster, B. Nicholas, D. Wimpory, J. Wanigasinghe","doi":"10.1109/ISSC49989.2020.9180201","DOIUrl":null,"url":null,"abstract":"Autism Spectrum Disorder (ASD) is a developmental disorder that is prevalent globally. Research into detecting autism traditionally focused on behavioural aspects of the condition, however, more recently, focus has shifted to more objective alternatives using techniques such as machine learning and gait analysis. Gait measurements, having been used for person identification, varies from person to person, introducing a lot of intra-subject variance. This applies to the 8 spatial-temporal features used in this study, representing the time that an individual spends in each phase of a gait cycle, collected using a Vicon motion tracking system. The features were averaged across each gait trial that the subjects performed, producing a second set of features with reduced intra-subject variance. Four common classifiers, a Support Vector Machine (SVM), K-Nearest Neighbour (KNN), Random Forests (RF) and a Decision Tree (DT) classifier, were all trained using the two feature sets and their classification rates were compared. The results show that for the RF classifier, reducing the intra-subject variance, was able to successfully increase the classification power. The KNN and DT classifiers experienced a minimal decrease in accuracy, where the SVM suffered the greatest loss when intra-subject variance was reduced. Results overall show that the effect intra-subject variance has on classification power depends heavily on the suitability of the classifier to the initial problem as well as size and class balance of the data.","PeriodicalId":351013,"journal":{"name":"2020 31st Irish Signals and Systems Conference (ISSC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 31st Irish Signals and Systems Conference (ISSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSC49989.2020.9180201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Autism Spectrum Disorder (ASD) is a developmental disorder that is prevalent globally. Research into detecting autism traditionally focused on behavioural aspects of the condition, however, more recently, focus has shifted to more objective alternatives using techniques such as machine learning and gait analysis. Gait measurements, having been used for person identification, varies from person to person, introducing a lot of intra-subject variance. This applies to the 8 spatial-temporal features used in this study, representing the time that an individual spends in each phase of a gait cycle, collected using a Vicon motion tracking system. The features were averaged across each gait trial that the subjects performed, producing a second set of features with reduced intra-subject variance. Four common classifiers, a Support Vector Machine (SVM), K-Nearest Neighbour (KNN), Random Forests (RF) and a Decision Tree (DT) classifier, were all trained using the two feature sets and their classification rates were compared. The results show that for the RF classifier, reducing the intra-subject variance, was able to successfully increase the classification power. The KNN and DT classifiers experienced a minimal decrease in accuracy, where the SVM suffered the greatest loss when intra-subject variance was reduced. Results overall show that the effect intra-subject variance has on classification power depends heavily on the suitability of the classifier to the initial problem as well as size and class balance of the data.