{"title":"Impact of Subject-specific Training Data in Anxiety Level Classification from Physiologic Data","authors":"R. Selzler, A. Chan, J. Green","doi":"10.1109/MeMeA52024.2021.9478757","DOIUrl":null,"url":null,"abstract":"The autonomic nervous system is known for the fight or flight response. Anxiety affects the autonomic nervous system, causing heightened heart rate and electrodermal activity. This paper explores machine learning methods to predict two- and three-level anxiety in spider fearful individuals watching spider video clips in a controlled trial. Features are extracted from electrocardiogram and electrodermal time-series signals. Specifically, this paper explores the performance of such models as the amount of data pertaining to the test subject increases in the training set. Standard K-fold cross-validation is here compared to leaky group-fold cross-validation with sample imputation, where we systematically vary the the number of samples from the test subject that are included in the training set. While it is possible to reach 78% and 60% k-fold accuracy for a two- and three-level anxiety prediction, respectively, excluding all test subject data from the training set causes the accuracy to drop to 73% and 45%. The results demonstrate that the features and models used here do not generalize for inter-subject classification tasks and that care should be taken when splitting subject data between training and test data. Furthermore, our results address the \"cold start problem\" by providing an indication of how much data would be required from a new subject before accurate prediction of anxiety is possible from physiologic data.","PeriodicalId":429222,"journal":{"name":"2021 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MeMeA52024.2021.9478757","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The autonomic nervous system is known for the fight or flight response. Anxiety affects the autonomic nervous system, causing heightened heart rate and electrodermal activity. This paper explores machine learning methods to predict two- and three-level anxiety in spider fearful individuals watching spider video clips in a controlled trial. Features are extracted from electrocardiogram and electrodermal time-series signals. Specifically, this paper explores the performance of such models as the amount of data pertaining to the test subject increases in the training set. Standard K-fold cross-validation is here compared to leaky group-fold cross-validation with sample imputation, where we systematically vary the the number of samples from the test subject that are included in the training set. While it is possible to reach 78% and 60% k-fold accuracy for a two- and three-level anxiety prediction, respectively, excluding all test subject data from the training set causes the accuracy to drop to 73% and 45%. The results demonstrate that the features and models used here do not generalize for inter-subject classification tasks and that care should be taken when splitting subject data between training and test data. Furthermore, our results address the "cold start problem" by providing an indication of how much data would be required from a new subject before accurate prediction of anxiety is possible from physiologic data.