Deep learning has achieved significant success on intelligent medical treatments, such as automatic diagnosis and analysis of medical data. To train an automatic diagnosis system with high accuracy and strong robustness in healthcare, sufficient training data are required when using deep learning-based methods. However, given that the data collected by sensors that are embedded in medical or mobile devices are inadequate, it is challenging to train an effective and efficient classification model with state-of-the-art performance. Inspired by generative adversarial networks (GANs), we propose TS-GAN, a Time-series GAN architecture based on long short-term memory (LSTM) networks for sensor-based health data augmentation, thereby improving the performance of deep learning-based classification models. TS-GAN aims to learn a generative model that creates time-series data with the same space and time dependence as the real data. Specifically, we design an LSTM-based generator for creating realistic data and an LSTM-based discriminator for determining how similar the generated data are to real data. In particular, we design a sequential-squeeze-and-excitation module in the LSTM-based discriminator to better understand space dependence of real data, and apply the gradient penalty originated from Wasserstein GANs in the training process to stabilize the optimization. We conduct comparative experiments to evaluate the performance of TS-GAN with TimeGAN, C-RNN-GAN and Conditional Wasserstein GANs through discriminator loss, maximum mean discrepancy, visualization methods and classification accuracy on health datasets of ECG_200, NonInvasiveFatalECG_Thorax1, and mHealth, respectively. The experimental results show that TS-GAN exceeds other state-of-the-art time-series GANs in almost all the evaluation metrics, and the classifier trained on synthetic datasets generated by TS-GAN achieves the highest classification accuracy of 97.50% on ECG_200, 94.12% on NonInvasiveFatalECG_Thorax1, and 98.12% on mHealth, respectively.
{"title":"TS-GAN: Time-series GAN for Sensor-based Health Data Augmentation","authors":"Zhenyu Yang, Yantao Li, Gang Zhou","doi":"10.1145/3583593","DOIUrl":"https://doi.org/10.1145/3583593","url":null,"abstract":"Deep learning has achieved significant success on intelligent medical treatments, such as automatic diagnosis and analysis of medical data. To train an automatic diagnosis system with high accuracy and strong robustness in healthcare, sufficient training data are required when using deep learning-based methods. However, given that the data collected by sensors that are embedded in medical or mobile devices are inadequate, it is challenging to train an effective and efficient classification model with state-of-the-art performance. Inspired by generative adversarial networks (GANs), we propose TS-GAN, a Time-series GAN architecture based on long short-term memory (LSTM) networks for sensor-based health data augmentation, thereby improving the performance of deep learning-based classification models. TS-GAN aims to learn a generative model that creates time-series data with the same space and time dependence as the real data. Specifically, we design an LSTM-based generator for creating realistic data and an LSTM-based discriminator for determining how similar the generated data are to real data. In particular, we design a sequential-squeeze-and-excitation module in the LSTM-based discriminator to better understand space dependence of real data, and apply the gradient penalty originated from Wasserstein GANs in the training process to stabilize the optimization. We conduct comparative experiments to evaluate the performance of TS-GAN with TimeGAN, C-RNN-GAN and Conditional Wasserstein GANs through discriminator loss, maximum mean discrepancy, visualization methods and classification accuracy on health datasets of ECG_200, NonInvasiveFatalECG_Thorax1, and mHealth, respectively. The experimental results show that TS-GAN exceeds other state-of-the-art time-series GANs in almost all the evaluation metrics, and the classifier trained on synthetic datasets generated by TS-GAN achieves the highest classification accuracy of 97.50% on ECG_200, 94.12% on NonInvasiveFatalECG_Thorax1, and 98.12% on mHealth, respectively.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 21"},"PeriodicalIF":0.0,"publicationDate":"2023-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43220955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Population health monitoring is a fundamental component of the public health system. Due to the high-cost nature of traditional population-wise health-data collection methods, a class of sparse-sampling-completion algorithms are proposed to exploit the spatio-temporal correlation buried under the observed examples. However, for the population health data, a huge challenge for the state-of-the-art completion methods is the unstationary environment. Specifically, the underlying temporal correlation of the population health data are evolving from year to year. To this end, we propose a GAN-based year-by-year completion framework: uncertainty-aware augmented generative adversarial imputation nets (UAA-GAIN), to address the problem of unstationary environment. To further restrain the error accumulation, we develop a stronger generator as well as a stronger discriminator in the min-max equilibrium. A by-product of the augmented GAIN model allows weighting the difficulty of examples. Inspired by the idea of curriculum learning, a better training schedule is implemented in the proposed framework. We evaluate the proposed method on three real-world chronic disease datasets and the results show that UAA-GAIN outperforms other baseline methods in various settings.
{"title":"Towards Sustainable Compressive Population Health: A GAN-based Year-By-Year Imputation Method","authors":"Yujie Feng, Jiangtao Wang, Yasha Wang, Xu Chu","doi":"10.1145/3571159","DOIUrl":"https://doi.org/10.1145/3571159","url":null,"abstract":"Population health monitoring is a fundamental component of the public health system. Due to the high-cost nature of traditional population-wise health-data collection methods, a class of sparse-sampling-completion algorithms are proposed to exploit the spatio-temporal correlation buried under the observed examples. However, for the population health data, a huge challenge for the state-of-the-art completion methods is the unstationary environment. Specifically, the underlying temporal correlation of the population health data are evolving from year to year. To this end, we propose a GAN-based year-by-year completion framework: uncertainty-aware augmented generative adversarial imputation nets (UAA-GAIN), to address the problem of unstationary environment. To further restrain the error accumulation, we develop a stronger generator as well as a stronger discriminator in the min-max equilibrium. A by-product of the augmented GAIN model allows weighting the difficulty of examples. Inspired by the idea of curriculum learning, a better training schedule is implemented in the proposed framework. We evaluate the proposed method on three real-world chronic disease datasets and the results show that UAA-GAIN outperforms other baseline methods in various settings.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 18"},"PeriodicalIF":0.0,"publicationDate":"2022-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41685727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Argyris Constantinides, Marios Belk, C. Fidas, R. Beumers, David Vidal, Wanting Huang, J. Bowles, Thais Webber, Agastya Silvina, A. Pitsillides
This article proposes a user-adaptable and personalized authentication paradigm for healthcare organizations, which anticipates to seamlessly reflect patients’ episodic and autobiographical memories to graphical and textual passwords aiming to improve the security strength of user-selected passwords and provide a positive user experience. We report on a longitudinal study that spanned over 3 years in which three public European healthcare organizations participated to design and evaluate the aforementioned paradigm. Three studies were conducted (n = 169) with different stakeholders: (1) a verification study aiming to identify existing authentication practices of the three healthcare organizations with diverse stakeholders (n = 9), (2) a patient-centric feasibility study during which users interacted with the proposed authentication system (n = 68), and (3) a human guessing attack study focusing on vulnerabilities among people sharing common experiences within location-aware images used for graphical passwords (n = 92). Results revealed that the suggested paradigm scored high with regard to users’ likeability, perceived security, usability, and trust, but more importantly it assists the creation of more secure passwords. On the downside, the suggested paradigm introduces password guessing vulnerabilities by individuals sharing common experiences with the end users. Findings are expected to scaffold the design of more patient-centric knowledge-based authentication mechanisms within today's dynamic computation realms.
{"title":"Security and Usability of a Personalized User Authentication Paradigm: Insights from a Longitudinal Study with Three Healthcare Organizations","authors":"Argyris Constantinides, Marios Belk, C. Fidas, R. Beumers, David Vidal, Wanting Huang, J. Bowles, Thais Webber, Agastya Silvina, A. Pitsillides","doi":"10.1145/3564610","DOIUrl":"https://doi.org/10.1145/3564610","url":null,"abstract":"This article proposes a user-adaptable and personalized authentication paradigm for healthcare organizations, which anticipates to seamlessly reflect patients’ episodic and autobiographical memories to graphical and textual passwords aiming to improve the security strength of user-selected passwords and provide a positive user experience. We report on a longitudinal study that spanned over 3 years in which three public European healthcare organizations participated to design and evaluate the aforementioned paradigm. Three studies were conducted (n = 169) with different stakeholders: (1) a verification study aiming to identify existing authentication practices of the three healthcare organizations with diverse stakeholders (n = 9), (2) a patient-centric feasibility study during which users interacted with the proposed authentication system (n = 68), and (3) a human guessing attack study focusing on vulnerabilities among people sharing common experiences within location-aware images used for graphical passwords (n = 92). Results revealed that the suggested paradigm scored high with regard to users’ likeability, perceived security, usability, and trust, but more importantly it assists the creation of more secure passwords. On the downside, the suggested paradigm introduces password guessing vulnerabilities by individuals sharing common experiences with the end users. Findings are expected to scaffold the design of more patient-centric knowledge-based authentication mechanisms within today's dynamic computation realms.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 40"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42631776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-10-01Epub Date: 2022-11-03DOI: 10.1145/3531231
Gina Sprint, Diane J Cook, Maureen Schmitter-Edgecombe, Lawrence B Holder
New modes of technology are offering unprecedented opportunities to unobtrusively collect data about people's behavior. While there are many use cases for such information, we explore its utility for predicting multiple clinical assessment scores. Because clinical assessments are typically used as screening tools for impairment and disease, such as mild cognitive impairment (MCI), automatically mapping behavioral data to assessment scores can help detect changes in health and behavior across time. In this paper, we aim to extract behavior markers from two modalities, a smart home environment and a custom digital memory notebook app, for mapping to ten clinical assessments that are relevant for monitoring MCI onset and changes in cognitive health. Smart home-based behavior markers reflect hourly, daily, and weekly activity patterns, while app-based behavior markers reflect app usage and writing content/style derived from free-form journal entries. We describe machine learning techniques for fusing these multimodal behavior markers and utilizing joint prediction. We evaluate our approach using three regression algorithms and data from 14 participants with MCI living in a smart home environment. We observed moderate to large correlations between predicted and ground-truth assessment scores, ranging from r = 0.601 to r = 0.871 for each clinical assessment.
新的技术模式为不露痕迹地收集人们的行为数据提供了前所未有的机会。虽然此类信息有很多用例,但我们要探讨的是它在预测多种临床评估分数方面的效用。由于临床评估通常被用作损伤和疾病(如轻度认知障碍(MCI))的筛查工具,因此将行为数据自动映射到评估分数有助于检测不同时期的健康和行为变化。在本文中,我们旨在从智能家居环境和定制数字记忆笔记本应用程序这两种模式中提取行为标记,并将其映射到与监测 MCI 发病和认知健康变化相关的十项临床评估中。基于智能家居的行为标记反映了每小时、每天和每周的活动模式,而基于应用程序的行为标记则反映了应用程序的使用情况以及从自由形式的日记条目中提取的写作内容/风格。我们介绍了融合这些多模态行为标记并利用联合预测的机器学习技术。我们使用三种回归算法和 14 名生活在智能家居环境中的 MCI 患者的数据对我们的方法进行了评估。我们观察到预测得分和地面实况评估得分之间存在中等到较大的相关性,每项临床评估的相关性从 r = 0.601 到 r = 0.871 不等。
{"title":"Multimodal Fusion of Smart Home and Text-based Behavior Markers for Clinical Assessment Prediction.","authors":"Gina Sprint, Diane J Cook, Maureen Schmitter-Edgecombe, Lawrence B Holder","doi":"10.1145/3531231","DOIUrl":"10.1145/3531231","url":null,"abstract":"<p><p>New modes of technology are offering unprecedented opportunities to unobtrusively collect data about people's behavior. While there are many use cases for such information, we explore its utility for predicting multiple clinical assessment scores. Because clinical assessments are typically used as screening tools for impairment and disease, such as mild cognitive impairment (MCI), automatically mapping behavioral data to assessment scores can help detect changes in health and behavior across time. In this paper, we aim to extract behavior markers from two modalities, a smart home environment and a custom digital memory notebook app, for mapping to ten clinical assessments that are relevant for monitoring MCI onset and changes in cognitive health. Smart home-based behavior markers reflect hourly, daily, and weekly activity patterns, while app-based behavior markers reflect app usage and writing content/style derived from free-form journal entries. We describe machine learning techniques for fusing these multimodal behavior markers and utilizing joint prediction. We evaluate our approach using three regression algorithms and data from 14 participants with MCI living in a smart home environment. We observed moderate to large correlations between predicted and ground-truth assessment scores, ranging from <i>r</i> = 0.601 to <i>r</i> = 0.871 for each clinical assessment.</p>","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9645787/pdf/nihms-1822476.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10608022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Due to the increase in life expectancy in advanced societies leading to an increase in population age, data-driven systems are receiving more attention to support the older people by monitoring their health. Intelligent sensor networks provide the ability to monitor their activities without interfering with routine life. Data collected from smart homes can be used in a variety of data-driven analyses, including behavior prediction. Due to privacy concerns and the cost and time required to collect data, synthetic data generation methods have been considered seriously by the research community. In this article, we introduce a new Generative Adversarial Network (GAN) algorithm, namely, BehavGAN, that applies GAN to the problem of behavior sequence generation. This is achieved by learning the features of a target dataset and utilizing a new application for GANs in the simulation of older people’s behaviors. We also propose an effective reward function for GAN back-propagation by incorporating n-gram-based similarity measures in the reinforcement mechanism. We evaluate our proposed algorithm by generating a dataset of human behavior sequences. Our results show that BehavGAN is more effective in generating behavior sequences compared to MLE, LeakGAN, and the original SeqGAN algorithms in terms of both similarity and diversity of generated data. Our proposed algorithm outperforms current state-of-the-art methods when it comes to generating behavior sequences consisting of limited-space sequence tokens.
{"title":"Synthetic Behavior Sequence Generation Using Generative Adversarial Networks","authors":"Fatemeh Akbari, K. Sartipi, Norm Archer","doi":"10.1145/3563950","DOIUrl":"https://doi.org/10.1145/3563950","url":null,"abstract":"Due to the increase in life expectancy in advanced societies leading to an increase in population age, data-driven systems are receiving more attention to support the older people by monitoring their health. Intelligent sensor networks provide the ability to monitor their activities without interfering with routine life. Data collected from smart homes can be used in a variety of data-driven analyses, including behavior prediction. Due to privacy concerns and the cost and time required to collect data, synthetic data generation methods have been considered seriously by the research community. In this article, we introduce a new Generative Adversarial Network (GAN) algorithm, namely, BehavGAN, that applies GAN to the problem of behavior sequence generation. This is achieved by learning the features of a target dataset and utilizing a new application for GANs in the simulation of older people’s behaviors. We also propose an effective reward function for GAN back-propagation by incorporating n-gram-based similarity measures in the reinforcement mechanism. We evaluate our proposed algorithm by generating a dataset of human behavior sequences. Our results show that BehavGAN is more effective in generating behavior sequences compared to MLE, LeakGAN, and the original SeqGAN algorithms in terms of both similarity and diversity of generated data. Our proposed algorithm outperforms current state-of-the-art methods when it comes to generating behavior sequences consisting of limited-space sequence tokens.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 23"},"PeriodicalIF":0.0,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46099276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Amaral, Zihan Li, W. Ding, S. Crouter, Ping Chen
SummerTime seeks to summarize global time-series signals and provides a fixed-length, robust representation of the variable-length time series. Many machine learning methods depend on data instances with a fixed number of features. As a result, those methods cannot be directly applied to variable-length time series data. Existing methods such as sliding windows can lose minority local information. Summarization conducted by the SummerTime method will be a fixed-length feature vector which can be used in place of the time series dataset for use with classical machine learning methods. We use Gaussian Mixture models (GMM) over small same-length disjoint windows in the time series to group local data into clusters. The time series’ rate of membership for each cluster will be a feature in the summarization. By making use of variational methods, GMM converges to a more robust mixture, meaning the clusters are more resistant to noise and overfitting. Further, the model is naturally capable of converging to an appropriate cluster count. We validate our method on a challenging real-world dataset, an imbalanced physical activity dataset with a variable-length time series structure. We compare our results to state-of-the-art studies and show high-quality improvement by classifying with only the summarization.
{"title":"SummerTime: Variable-length Time Series Summarization with Application to Physical Activity Analysis","authors":"K. Amaral, Zihan Li, W. Ding, S. Crouter, Ping Chen","doi":"10.1145/3532628","DOIUrl":"https://doi.org/10.1145/3532628","url":null,"abstract":"SummerTime seeks to summarize global time-series signals and provides a fixed-length, robust representation of the variable-length time series. Many machine learning methods depend on data instances with a fixed number of features. As a result, those methods cannot be directly applied to variable-length time series data. Existing methods such as sliding windows can lose minority local information. Summarization conducted by the SummerTime method will be a fixed-length feature vector which can be used in place of the time series dataset for use with classical machine learning methods. We use Gaussian Mixture models (GMM) over small same-length disjoint windows in the time series to group local data into clusters. The time series’ rate of membership for each cluster will be a feature in the summarization. By making use of variational methods, GMM converges to a more robust mixture, meaning the clusters are more resistant to noise and overfitting. Further, the model is naturally capable of converging to an appropriate cluster count. We validate our method on a challenging real-world dataset, an imbalanced physical activity dataset with a variable-length time series structure. We compare our results to state-of-the-art studies and show high-quality improvement by classifying with only the summarization.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 1","pages":"1 - 15"},"PeriodicalIF":0.0,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48949440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hieu Hanh Le, Tatsuhiro Yamada, Yuichi Honda, T. Sakamoto, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, H. Yokota
Electronic medical record systems have been adopted by many large hospitals worldwide, enabling the recorded data to be analyzed by various computer-based techniques to gain a better understanding of hospital-based disease treatments. Among such techniques, sequential pattern mining, already widely used for data mining and knowledge discovery in other application domains, has shown great potential for discovering frequent patterns in sequences of disease treatments. However, studies have yet to evaluate the use of medical-order sequence variants, where a “frequent pattern” can include some limited variations to the pattern, or have considered the factors that lead to these variants. Such a study would be meaningful for medical tasks such as improving the quality of a particular treatment method, comparing treatments with multiple hospitals, recommending the best-suited treatment for each patient, and optimizing the running costs in hospitals. This article proposes methods for evaluating medical-order sequence variants and understanding variant factors based on a statistical approach. We consider the safety and efficiency of sequences and related information about the variants, such as gender, age, and test results from hospitals. Our proposal has been demonstrated as effective by experimentally evaluating an electronic medical record system’s real dataset and obtaining feedback from medical workers. The experimental results indicate that the medical treatment history and specimen test results after hospitalization are significant in identifying the factors that lead to variants.
{"title":"Methods for Analyzing Medical-Order Sequence Variants in Sequential Pattern Mining for Electronic Medical Record Systems","authors":"Hieu Hanh Le, Tatsuhiro Yamada, Yuichi Honda, T. Sakamoto, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, H. Yokota","doi":"10.1145/3561825","DOIUrl":"https://doi.org/10.1145/3561825","url":null,"abstract":"Electronic medical record systems have been adopted by many large hospitals worldwide, enabling the recorded data to be analyzed by various computer-based techniques to gain a better understanding of hospital-based disease treatments. Among such techniques, sequential pattern mining, already widely used for data mining and knowledge discovery in other application domains, has shown great potential for discovering frequent patterns in sequences of disease treatments. However, studies have yet to evaluate the use of medical-order sequence variants, where a “frequent pattern” can include some limited variations to the pattern, or have considered the factors that lead to these variants. Such a study would be meaningful for medical tasks such as improving the quality of a particular treatment method, comparing treatments with multiple hospitals, recommending the best-suited treatment for each patient, and optimizing the running costs in hospitals. This article proposes methods for evaluating medical-order sequence variants and understanding variant factors based on a statistical approach. We consider the safety and efficiency of sequences and related information about the variants, such as gender, age, and test results from hospitals. Our proposal has been demonstrated as effective by experimentally evaluating an electronic medical record system’s real dataset and obtaining feedback from medical workers. The experimental results indicate that the medical treatment history and specimen test results after hospitalization are significant in identifying the factors that lead to variants.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 28"},"PeriodicalIF":0.0,"publicationDate":"2022-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43050142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yiyuan Wang, Guillaume Wattelez, S. Frayon, C. Caillaud, O. Galy, K. Yacef
Obesity is a global health concern associated with various demographic and lifestyle factors including physical activity (PA). Research studies generally used self-reported PA data or, when accelerometer-based activity trackers were used, highly aggregated data (e.g., daily average). This suggests that the rich potential of detailed activity tracker data is largely under-exploited and that deeper analyses may help better understand such relationships. This is particularly true in children and adolescents who are distinct and engage more in bursts of PA. This article presents ABIPA, a machine learning-based methodology that integrates various aspects of accelerometer-based PA data into weight status prediction for adolescents. We propose a method to derive features regarding the structure of different PA time series using Auto-Regressive Integrated Moving Average (ARIMA). The ARIMA-based PA features are combined with other individual attributes to predict weight status and the importance of these features is further unveiled. We apply ABIPA to a dataset about young adolescents (N = 206) containing, for each participant, a 7-day continuous accelerometer dataset (60 Hz, GENEActiv tracker from ActivInsights) and a range of their socio-demographic, anthropometric, and lifestyle information. The results indicate that our method provides a practical approach for integrating accelerometer-based PA patterns into weight status prediction and paves the way for validating their importance in understanding obesity factors.
{"title":"ABIPA: ARIMA-Based Integration of Accelerometer-Based Physical Activity for Adolescent Weight Status Prediction","authors":"Yiyuan Wang, Guillaume Wattelez, S. Frayon, C. Caillaud, O. Galy, K. Yacef","doi":"10.1145/3561611","DOIUrl":"https://doi.org/10.1145/3561611","url":null,"abstract":"Obesity is a global health concern associated with various demographic and lifestyle factors including physical activity (PA). Research studies generally used self-reported PA data or, when accelerometer-based activity trackers were used, highly aggregated data (e.g., daily average). This suggests that the rich potential of detailed activity tracker data is largely under-exploited and that deeper analyses may help better understand such relationships. This is particularly true in children and adolescents who are distinct and engage more in bursts of PA. This article presents ABIPA, a machine learning-based methodology that integrates various aspects of accelerometer-based PA data into weight status prediction for adolescents. We propose a method to derive features regarding the structure of different PA time series using Auto-Regressive Integrated Moving Average (ARIMA). The ARIMA-based PA features are combined with other individual attributes to predict weight status and the importance of these features is further unveiled. We apply ABIPA to a dataset about young adolescents (N = 206) containing, for each participant, a 7-day continuous accelerometer dataset (60 Hz, GENEActiv tracker from ActivInsights) and a range of their socio-demographic, anthropometric, and lifestyle information. The results indicate that our method provides a practical approach for integrating accelerometer-based PA patterns into weight status prediction and paves the way for validating their importance in understanding obesity factors.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 19"},"PeriodicalIF":0.0,"publicationDate":"2022-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43367433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Keqin Shi, Zhen Chen, Xuejing Li, Z. Xiao, Weiqiang Sun, Weisheng Hu
As an accurate measurement of physical activity, step counts data can be collected expediently by smartphones and wearable devices. Complete and high time-resolution step counts data record the time and intensity of individuals’ physical activity in a day, and can be used to mine activity habits or to recommend customized workout plans. However, sparse step counts data are common in practice due to hardware and software limitations. Understanding the value of sparse step counts data can contribute to its application in healthcare, and also can help us design cost-effective hardware and software. In this article, we aim to infer activity patterns from sparse step counts data. We design a deep learning model based on recurrent neural networks, namely MLP-GRU, which considers bidirectional short-term dependency and long-term regularity of sparse step counts data, and implements data-driven imputation and classification. We also develop an interpretable and elastic method to obtain sparse step counts data labeled with multi-granular activity patterns to train MLP-GRU. Evaluations on real-world datasets reveal that MLP-GRU outperforms other strong baseline methods. The results also show that activity patterns can be inferred from extremely sparse step counts data with high accuracy, provided that proper granularity is used for data of different sparsity.
{"title":"Inferring Activity Patterns from Sparse Step Counts Data with Recurrent Neural Networks","authors":"Keqin Shi, Zhen Chen, Xuejing Li, Z. Xiao, Weiqiang Sun, Weisheng Hu","doi":"10.1145/3560468","DOIUrl":"https://doi.org/10.1145/3560468","url":null,"abstract":"As an accurate measurement of physical activity, step counts data can be collected expediently by smartphones and wearable devices. Complete and high time-resolution step counts data record the time and intensity of individuals’ physical activity in a day, and can be used to mine activity habits or to recommend customized workout plans. However, sparse step counts data are common in practice due to hardware and software limitations. Understanding the value of sparse step counts data can contribute to its application in healthcare, and also can help us design cost-effective hardware and software. In this article, we aim to infer activity patterns from sparse step counts data. We design a deep learning model based on recurrent neural networks, namely MLP-GRU, which considers bidirectional short-term dependency and long-term regularity of sparse step counts data, and implements data-driven imputation and classification. We also develop an interpretable and elastic method to obtain sparse step counts data labeled with multi-granular activity patterns to train MLP-GRU. Evaluations on real-world datasets reveal that MLP-GRU outperforms other strong baseline methods. The results also show that activity patterns can be inferred from extremely sparse step counts data with high accuracy, provided that proper granularity is used for data of different sparsity.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 20"},"PeriodicalIF":0.0,"publicationDate":"2022-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43316831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hashini Senaratne, S. Oviatt, K. Ellis, Glenn Melvin
Recently, interest has grown in the assessment of anxiety that leverages human physiological and behavioral data to address the drawbacks of current subjective clinical assessments. Complex experiences of anxiety vary on multiple characteristics, including triggers, responses, duration and severity, and impact differently on the risk of anxiety disorders. This article reviews the past decade of studies that objectively analyzed various anxiety characteristics related to five common anxiety disorders in adults utilizing features of cardiac, electrodermal, blood pressure, respiratory, vocal, posture, movement, and eye metrics. Its originality lies in the synthesis and interpretation of consistently discovered heterogeneous predictors of anxiety and multimodal-multisensor analytics based on them. We reveal that few anxiety characteristics have been evaluated using multimodal-multisensor metrics, and many of the identified predictive features are confounded. As such, objective anxiety assessments are not yet complete or precise. That said, few multimodal-multisensor systems evaluated indicate an approximately 11.73% performance gain compared to unimodal systems, highlighting a promising powerful tool. We suggest six high-priority future directions to address the current gaps and limitations in infrastructure, basic knowledge, and application areas. Action in these directions will expedite the discovery of rich, accurate, continuous, and objective assessments and their use in impactful end-user applications.
{"title":"A Critical Review of Multimodal-multisensor Analytics for Anxiety Assessment","authors":"Hashini Senaratne, S. Oviatt, K. Ellis, Glenn Melvin","doi":"10.1145/3556980","DOIUrl":"https://doi.org/10.1145/3556980","url":null,"abstract":"Recently, interest has grown in the assessment of anxiety that leverages human physiological and behavioral data to address the drawbacks of current subjective clinical assessments. Complex experiences of anxiety vary on multiple characteristics, including triggers, responses, duration and severity, and impact differently on the risk of anxiety disorders. This article reviews the past decade of studies that objectively analyzed various anxiety characteristics related to five common anxiety disorders in adults utilizing features of cardiac, electrodermal, blood pressure, respiratory, vocal, posture, movement, and eye metrics. Its originality lies in the synthesis and interpretation of consistently discovered heterogeneous predictors of anxiety and multimodal-multisensor analytics based on them. We reveal that few anxiety characteristics have been evaluated using multimodal-multisensor metrics, and many of the identified predictive features are confounded. As such, objective anxiety assessments are not yet complete or precise. That said, few multimodal-multisensor systems evaluated indicate an approximately 11.73% performance gain compared to unimodal systems, highlighting a promising powerful tool. We suggest six high-priority future directions to address the current gaps and limitations in infrastructure, basic knowledge, and application areas. Action in these directions will expedite the discovery of rich, accurate, continuous, and objective assessments and their use in impactful end-user applications.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 1","pages":"1 - 42"},"PeriodicalIF":0.0,"publicationDate":"2022-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49657787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}