ACM transactions on computing for healthcare最新文献_第4页

Multimodal Fusion of Smart Home and Text-based Behavior Markers for Clinical Assessment Prediction. 多模式融合智能家居和基于文本的行为标记，用于临床评估预测。

ACM transactions on computing for healthcare

Pub Date : 2022-10-01 Epub Date: 2022-11-03 DOI: 10.1145/3531231

Gina Sprint, Diane J Cook, Maureen Schmitter-Edgecombe, Lawrence B Holder

New modes of technology are offering unprecedented opportunities to unobtrusively collect data about people's behavior. While there are many use cases for such information, we explore its utility for predicting multiple clinical assessment scores. Because clinical assessments are typically used as screening tools for impairment and disease, such as mild cognitive impairment (MCI), automatically mapping behavioral data to assessment scores can help detect changes in health and behavior across time. In this paper, we aim to extract behavior markers from two modalities, a smart home environment and a custom digital memory notebook app, for mapping to ten clinical assessments that are relevant for monitoring MCI onset and changes in cognitive health. Smart home-based behavior markers reflect hourly, daily, and weekly activity patterns, while app-based behavior markers reflect app usage and writing content/style derived from free-form journal entries. We describe machine learning techniques for fusing these multimodal behavior markers and utilizing joint prediction. We evaluate our approach using three regression algorithms and data from 14 participants with MCI living in a smart home environment. We observed moderate to large correlations between predicted and ground-truth assessment scores, ranging from r = 0.601 to r = 0.871 for each clinical assessment.

新的技术模式为不露痕迹地收集人们的行为数据提供了前所未有的机会。虽然此类信息有很多用例，但我们要探讨的是它在预测多种临床评估分数方面的效用。由于临床评估通常被用作损伤和疾病（如轻度认知障碍（MCI））的筛查工具，因此将行为数据自动映射到评估分数有助于检测不同时期的健康和行为变化。在本文中，我们旨在从智能家居环境和定制数字记忆笔记本应用程序这两种模式中提取行为标记，并将其映射到与监测 MCI 发病和认知健康变化相关的十项临床评估中。基于智能家居的行为标记反映了每小时、每天和每周的活动模式，而基于应用程序的行为标记则反映了应用程序的使用情况以及从自由形式的日记条目中提取的写作内容/风格。我们介绍了融合这些多模态行为标记并利用联合预测的机器学习技术。我们使用三种回归算法和 14 名生活在智能家居环境中的 MCI 患者的数据对我们的方法进行了评估。我们观察到预测得分和地面实况评估得分之间存在中等到较大的相关性，每项临床评估的相关性从 r = 0.601 到 r = 0.871 不等。

{"title":"Multimodal Fusion of Smart Home and Text-based Behavior Markers for Clinical Assessment Prediction.","authors":"Gina Sprint, Diane J Cook, Maureen Schmitter-Edgecombe, Lawrence B Holder","doi":"10.1145/3531231","DOIUrl":"10.1145/3531231","url":null,"abstract":"New modes of technology are offering unprecedented opportunities to unobtrusively collect data about people's behavior. While there are many use cases for such information, we explore its utility for predicting multiple clinical assessment scores. Because clinical assessments are typically used as screening tools for impairment and disease, such as mild cognitive impairment (MCI), automatically mapping behavioral data to assessment scores can help detect changes in health and behavior across time. In this paper, we aim to extract behavior markers from two modalities, a smart home environment and a custom digital memory notebook app, for mapping to ten clinical assessments that are relevant for monitoring MCI onset and changes in cognitive health. Smart home-based behavior markers reflect hourly, daily, and weekly activity patterns, while app-based behavior markers reflect app usage and writing content/style derived from free-form journal entries. We describe machine learning techniques for fusing these multimodal behavior markers and utilizing joint prediction. We evaluate our approach using three regression algorithms and data from 14 participants with MCI living in a smart home environment. We observed moderate to large correlations between predicted and ground-truth assessment scores, ranging from r = 0.601 to r = 0.871 for each clinical assessment.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9645787/pdf/nihms-1822476.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10608022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Synthetic Behavior Sequence Generation Using Generative Adversarial Networks 基于生成对抗网络的综合行为序列生成

ACM transactions on computing for healthcare

Pub Date : 2022-09-29 DOI: 10.1145/3563950

Fatemeh Akbari, K. Sartipi, Norm Archer

Due to the increase in life expectancy in advanced societies leading to an increase in population age, data-driven systems are receiving more attention to support the older people by monitoring their health. Intelligent sensor networks provide the ability to monitor their activities without interfering with routine life. Data collected from smart homes can be used in a variety of data-driven analyses, including behavior prediction. Due to privacy concerns and the cost and time required to collect data, synthetic data generation methods have been considered seriously by the research community. In this article, we introduce a new Generative Adversarial Network (GAN) algorithm, namely, BehavGAN, that applies GAN to the problem of behavior sequence generation. This is achieved by learning the features of a target dataset and utilizing a new application for GANs in the simulation of older people’s behaviors. We also propose an effective reward function for GAN back-propagation by incorporating n-gram-based similarity measures in the reinforcement mechanism. We evaluate our proposed algorithm by generating a dataset of human behavior sequences. Our results show that BehavGAN is more effective in generating behavior sequences compared to MLE, LeakGAN, and the original SeqGAN algorithms in terms of both similarity and diversity of generated data. Our proposed algorithm outperforms current state-of-the-art methods when it comes to generating behavior sequences consisting of limited-space sequence tokens.

由于发达社会的预期寿命增加导致人口老龄化，数据驱动系统越来越受到关注，通过监测老年人的健康状况来支持老年人。智能传感器网络提供了在不干扰日常生活的情况下监控他们活动的能力。从智能家居收集的数据可以用于各种数据驱动的分析，包括行为预测。由于隐私问题以及收集数据所需的成本和时间，合成数据生成方法已被研究界认真考虑。在本文中，我们介绍了一种新的生成对抗网络(GAN)算法，即BehavGAN，它将GAN应用于行为序列生成问题。这是通过学习目标数据集的特征和利用gan在老年人行为模拟中的新应用来实现的。我们还通过在强化机制中结合基于n-gram的相似性度量，提出了GAN反向传播的有效奖励函数。我们通过生成人类行为序列的数据集来评估我们提出的算法。我们的研究结果表明，在生成数据的相似性和多样性方面，与MLE、LeakGAN和原始SeqGAN算法相比，BehavGAN在生成行为序列方面更有效。当涉及到生成由有限空间序列令牌组成的行为序列时，我们提出的算法优于当前最先进的方法。

{"title":"Synthetic Behavior Sequence Generation Using Generative Adversarial Networks","authors":"Fatemeh Akbari, K. Sartipi, Norm Archer","doi":"10.1145/3563950","DOIUrl":"https://doi.org/10.1145/3563950","url":null,"abstract":"Due to the increase in life expectancy in advanced societies leading to an increase in population age, data-driven systems are receiving more attention to support the older people by monitoring their health. Intelligent sensor networks provide the ability to monitor their activities without interfering with routine life. Data collected from smart homes can be used in a variety of data-driven analyses, including behavior prediction. Due to privacy concerns and the cost and time required to collect data, synthetic data generation methods have been considered seriously by the research community. In this article, we introduce a new Generative Adversarial Network (GAN) algorithm, namely, BehavGAN, that applies GAN to the problem of behavior sequence generation. This is achieved by learning the features of a target dataset and utilizing a new application for GANs in the simulation of older people’s behaviors. We also propose an effective reward function for GAN back-propagation by incorporating n-gram-based similarity measures in the reinforcement mechanism. We evaluate our proposed algorithm by generating a dataset of human behavior sequences. Our results show that BehavGAN is more effective in generating behavior sequences compared to MLE, LeakGAN, and the original SeqGAN algorithms in terms of both similarity and diversity of generated data. Our proposed algorithm outperforms current state-of-the-art methods when it comes to generating behavior sequences consisting of limited-space sequence tokens.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 23"},"PeriodicalIF":0.0,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46099276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SummerTime: Variable-length Time Series Summarization with Application to Physical Activity Analysis SummerTime：变长时间序列综述及其在体育活动分析中的应用

ACM transactions on computing for healthcare

Pub Date : 2022-09-14 DOI: 10.1145/3532628

K. Amaral, Zihan Li, W. Ding, S. Crouter, Ping Chen

SummerTime seeks to summarize global time-series signals and provides a fixed-length, robust representation of the variable-length time series. Many machine learning methods depend on data instances with a fixed number of features. As a result, those methods cannot be directly applied to variable-length time series data. Existing methods such as sliding windows can lose minority local information. Summarization conducted by the SummerTime method will be a fixed-length feature vector which can be used in place of the time series dataset for use with classical machine learning methods. We use Gaussian Mixture models (GMM) over small same-length disjoint windows in the time series to group local data into clusters. The time series’ rate of membership for each cluster will be a feature in the summarization. By making use of variational methods, GMM converges to a more robust mixture, meaning the clusters are more resistant to noise and overfitting. Further, the model is naturally capable of converging to an appropriate cluster count. We validate our method on a challenging real-world dataset, an imbalanced physical activity dataset with a variable-length time series structure. We compare our results to state-of-the-art studies and show high-quality improvement by classifying with only the summarization.

SummerTime试图总结全局时间序列信号，并提供可变长度时间序列的固定长度、稳健表示。许多机器学习方法依赖于具有固定数量特征的数据实例。因此，这些方法不能直接应用于可变长度的时间序列数据。现有的方法，如滑动窗口可能会丢失少数局部信息。SummerTime方法进行的汇总将是一个固定长度的特征向量，它可以用来代替与经典机器学习方法一起使用的时间序列数据集。我们在时间序列中的小的相同长度的不相交窗口上使用高斯混合模型（GMM）将局部数据分组到聚类中。每个集群的时间序列成员率将是摘要中的一个特征。通过使用变分方法，GMM收敛到更稳健的混合，这意味着聚类更能抵抗噪声和过拟合。此外，该模型自然能够收敛到适当的聚类计数。我们在一个具有挑战性的真实世界数据集上验证了我们的方法，该数据集是一个具有可变长度时间序列结构的不平衡体力活动数据集。我们将我们的结果与最先进的研究进行了比较，并通过仅使用摘要进行分类来显示高质量的改进。

{"title":"SummerTime: Variable-length Time Series Summarization with Application to Physical Activity Analysis","authors":"K. Amaral, Zihan Li, W. Ding, S. Crouter, Ping Chen","doi":"10.1145/3532628","DOIUrl":"https://doi.org/10.1145/3532628","url":null,"abstract":"SummerTime seeks to summarize global time-series signals and provides a fixed-length, robust representation of the variable-length time series. Many machine learning methods depend on data instances with a fixed number of features. As a result, those methods cannot be directly applied to variable-length time series data. Existing methods such as sliding windows can lose minority local information. Summarization conducted by the SummerTime method will be a fixed-length feature vector which can be used in place of the time series dataset for use with classical machine learning methods. We use Gaussian Mixture models (GMM) over small same-length disjoint windows in the time series to group local data into clusters. The time series’ rate of membership for each cluster will be a feature in the summarization. By making use of variational methods, GMM converges to a more robust mixture, meaning the clusters are more resistant to noise and overfitting. Further, the model is naturally capable of converging to an appropriate cluster count. We validate our method on a challenging real-world dataset, an imbalanced physical activity dataset with a variable-length time series structure. We compare our results to state-of-the-art studies and show high-quality improvement by classifying with only the summarization.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 1","pages":"1 - 15"},"PeriodicalIF":0.0,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48949440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Methods for Analyzing Medical-Order Sequence Variants in Sequential Pattern Mining for Electronic Medical Record Systems 电子病历系统序列模式挖掘中医嘱序列变异的分析方法

ACM transactions on computing for healthcare

Pub Date : 2022-09-12 DOI: 10.1145/3561825

Hieu Hanh Le, Tatsuhiro Yamada, Yuichi Honda, T. Sakamoto, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, H. Yokota

Electronic medical record systems have been adopted by many large hospitals worldwide, enabling the recorded data to be analyzed by various computer-based techniques to gain a better understanding of hospital-based disease treatments. Among such techniques, sequential pattern mining, already widely used for data mining and knowledge discovery in other application domains, has shown great potential for discovering frequent patterns in sequences of disease treatments. However, studies have yet to evaluate the use of medical-order sequence variants, where a “frequent pattern” can include some limited variations to the pattern, or have considered the factors that lead to these variants. Such a study would be meaningful for medical tasks such as improving the quality of a particular treatment method, comparing treatments with multiple hospitals, recommending the best-suited treatment for each patient, and optimizing the running costs in hospitals. This article proposes methods for evaluating medical-order sequence variants and understanding variant factors based on a statistical approach. We consider the safety and efficiency of sequences and related information about the variants, such as gender, age, and test results from hospitals. Our proposal has been demonstrated as effective by experimentally evaluating an electronic medical record system’s real dataset and obtaining feedback from medical workers. The experimental results indicate that the medical treatment history and specimen test results after hospitalization are significant in identifying the factors that lead to variants.

电子病历系统已被世界各地的许多大型医院采用，使记录的数据能够通过各种基于计算机的技术进行分析，以更好地了解基于医院的疾病治疗。在这些技术中，序列模式挖掘已经广泛用于其他应用领域的数据挖掘和知识发现，在发现疾病治疗序列中的频繁模式方面显示出巨大的潜力。然而，研究尚未评估医疗顺序序列变体的使用，其中“频繁模式”可能包括模式的一些有限变体，或者已经考虑了导致这些变体的因素。这样的研究对医疗任务有意义，例如提高特定治疗方法的质量，比较多家医院的治疗方法，为每位患者推荐最适合的治疗方法以及优化医院的运营成本。本文提出了基于统计学方法评估医嘱序列变异和理解变异因素的方法。我们考虑序列的安全性和有效性以及有关变异的相关信息，如性别、年龄和医院的检测结果。通过实验评估电子病历系统的真实数据集并从医务工作者那里获得反馈，我们的建议被证明是有效的。实验结果表明，住院后的医疗史和标本检测结果在识别导致变异的因素方面具有重要意义。

{"title":"Methods for Analyzing Medical-Order Sequence Variants in Sequential Pattern Mining for Electronic Medical Record Systems","authors":"Hieu Hanh Le, Tatsuhiro Yamada, Yuichi Honda, T. Sakamoto, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, H. Yokota","doi":"10.1145/3561825","DOIUrl":"https://doi.org/10.1145/3561825","url":null,"abstract":"Electronic medical record systems have been adopted by many large hospitals worldwide, enabling the recorded data to be analyzed by various computer-based techniques to gain a better understanding of hospital-based disease treatments. Among such techniques, sequential pattern mining, already widely used for data mining and knowledge discovery in other application domains, has shown great potential for discovering frequent patterns in sequences of disease treatments. However, studies have yet to evaluate the use of medical-order sequence variants, where a “frequent pattern” can include some limited variations to the pattern, or have considered the factors that lead to these variants. Such a study would be meaningful for medical tasks such as improving the quality of a particular treatment method, comparing treatments with multiple hospitals, recommending the best-suited treatment for each patient, and optimizing the running costs in hospitals. This article proposes methods for evaluating medical-order sequence variants and understanding variant factors based on a statistical approach. We consider the safety and efficiency of sequences and related information about the variants, such as gender, age, and test results from hospitals. Our proposal has been demonstrated as effective by experimentally evaluating an electronic medical record system’s real dataset and obtaining feedback from medical workers. The experimental results indicate that the medical treatment history and specimen test results after hospitalization are significant in identifying the factors that lead to variants.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 28"},"PeriodicalIF":0.0,"publicationDate":"2022-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43050142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

ABIPA: ARIMA-Based Integration of Accelerometer-Based Physical Activity for Adolescent Weight Status Prediction ABIPA:基于arima的基于加速度计的体育活动对青少年体重状况预测的集成

ACM transactions on computing for healthcare

Pub Date : 2022-09-06 DOI: 10.1145/3561611

Yiyuan Wang, Guillaume Wattelez, S. Frayon, C. Caillaud, O. Galy, K. Yacef

Obesity is a global health concern associated with various demographic and lifestyle factors including physical activity (PA). Research studies generally used self-reported PA data or, when accelerometer-based activity trackers were used, highly aggregated data (e.g., daily average). This suggests that the rich potential of detailed activity tracker data is largely under-exploited and that deeper analyses may help better understand such relationships. This is particularly true in children and adolescents who are distinct and engage more in bursts of PA. This article presents ABIPA, a machine learning-based methodology that integrates various aspects of accelerometer-based PA data into weight status prediction for adolescents. We propose a method to derive features regarding the structure of different PA time series using Auto-Regressive Integrated Moving Average (ARIMA). The ARIMA-based PA features are combined with other individual attributes to predict weight status and the importance of these features is further unveiled. We apply ABIPA to a dataset about young adolescents (N = 206) containing, for each participant, a 7-day continuous accelerometer dataset (60 Hz, GENEActiv tracker from ActivInsights) and a range of their socio-demographic, anthropometric, and lifestyle information. The results indicate that our method provides a practical approach for integrating accelerometer-based PA patterns into weight status prediction and paves the way for validating their importance in understanding obesity factors.

肥胖是一个全球性的健康问题，与包括身体活动在内的各种人口和生活方式因素有关。研究通常使用自我报告的PA数据，或者当使用基于加速度计的活动跟踪器时，使用高度汇总的数据(例如，每日平均值)。这表明详细的活动跟踪数据的丰富潜力在很大程度上没有得到充分利用，而更深入的分析可能有助于更好地理解这种关系。这在儿童和青少年中尤其如此，他们性格鲜明，更多地参与到PA的爆发中。本文介绍了ABIPA，一种基于机器学习的方法，将基于加速度计的PA数据的各个方面集成到青少年体重状态预测中。我们提出了一种利用自回归综合移动平均(ARIMA)来推导不同PA时间序列结构特征的方法。基于arima的PA特征与其他个体属性相结合，以预测体重状态，并进一步揭示这些特征的重要性。我们将ABIPA应用于一个关于青少年的数据集(N = 206)，该数据集包含每个参与者连续7天的加速度计数据集(60 Hz，来自ActivInsights的GENEActiv跟踪器)以及他们的一系列社会人口统计学、人体测量学和生活方式信息。结果表明，我们的方法为将基于加速度计的PA模式整合到体重状态预测中提供了一种实用的方法，并为验证其在理解肥胖因素中的重要性铺平了道路。

{"title":"ABIPA: ARIMA-Based Integration of Accelerometer-Based Physical Activity for Adolescent Weight Status Prediction","authors":"Yiyuan Wang, Guillaume Wattelez, S. Frayon, C. Caillaud, O. Galy, K. Yacef","doi":"10.1145/3561611","DOIUrl":"https://doi.org/10.1145/3561611","url":null,"abstract":"Obesity is a global health concern associated with various demographic and lifestyle factors including physical activity (PA). Research studies generally used self-reported PA data or, when accelerometer-based activity trackers were used, highly aggregated data (e.g., daily average). This suggests that the rich potential of detailed activity tracker data is largely under-exploited and that deeper analyses may help better understand such relationships. This is particularly true in children and adolescents who are distinct and engage more in bursts of PA. This article presents ABIPA, a machine learning-based methodology that integrates various aspects of accelerometer-based PA data into weight status prediction for adolescents. We propose a method to derive features regarding the structure of different PA time series using Auto-Regressive Integrated Moving Average (ARIMA). The ARIMA-based PA features are combined with other individual attributes to predict weight status and the importance of these features is further unveiled. We apply ABIPA to a dataset about young adolescents (N = 206) containing, for each participant, a 7-day continuous accelerometer dataset (60 Hz, GENEActiv tracker from ActivInsights) and a range of their socio-demographic, anthropometric, and lifestyle information. The results indicate that our method provides a practical approach for integrating accelerometer-based PA patterns into weight status prediction and paves the way for validating their importance in understanding obesity factors.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 19"},"PeriodicalIF":0.0,"publicationDate":"2022-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43367433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Inferring Activity Patterns from Sparse Step Counts Data with Recurrent Neural Networks 用递归神经网络从稀疏步数数据推断活动模式

ACM transactions on computing for healthcare

Pub Date : 2022-08-30 DOI: 10.1145/3560468

Keqin Shi, Zhen Chen, Xuejing Li, Z. Xiao, Weiqiang Sun, Weisheng Hu

As an accurate measurement of physical activity, step counts data can be collected expediently by smartphones and wearable devices. Complete and high time-resolution step counts data record the time and intensity of individuals’ physical activity in a day, and can be used to mine activity habits or to recommend customized workout plans. However, sparse step counts data are common in practice due to hardware and software limitations. Understanding the value of sparse step counts data can contribute to its application in healthcare, and also can help us design cost-effective hardware and software. In this article, we aim to infer activity patterns from sparse step counts data. We design a deep learning model based on recurrent neural networks, namely MLP-GRU, which considers bidirectional short-term dependency and long-term regularity of sparse step counts data, and implements data-driven imputation and classification. We also develop an interpretable and elastic method to obtain sparse step counts data labeled with multi-granular activity patterns to train MLP-GRU. Evaluations on real-world datasets reveal that MLP-GRU outperforms other strong baseline methods. The results also show that activity patterns can be inferred from extremely sparse step counts data with high accuracy, provided that proper granularity is used for data of different sparsity.

作为对身体活动的精确测量，步数数据可以通过智能手机和可穿戴设备方便地收集。完整且高时间分辨率的步数数据记录了个人一天中身体活动的时间和强度，可用于挖掘活动习惯或推荐定制的锻炼计划。然而，由于硬件和软件的限制，稀疏步数数据在实践中很常见。了解稀疏步数数据的价值有助于其在医疗保健中的应用，也有助于我们设计具有成本效益的硬件和软件。在本文中，我们的目标是从稀疏的步数数据中推断活动模式。我们设计了一个基于递归神经网络的深度学习模型，即MLP-GRU，该模型考虑了稀疏步数数据的双向短期依赖性和长期规律性，并实现了数据驱动的插补和分类。我们还开发了一种可解释和弹性的方法来获得用多粒度活动模式标记的稀疏步数数据，以训练MLP-GRU。对真实世界数据集的评估表明，MLP-GRU优于其他强基线方法。结果还表明，只要对不同稀疏度的数据使用适当的粒度，就可以从极稀疏的步数数据中高精度地推断出活动模式。

{"title":"Inferring Activity Patterns from Sparse Step Counts Data with Recurrent Neural Networks","authors":"Keqin Shi, Zhen Chen, Xuejing Li, Z. Xiao, Weiqiang Sun, Weisheng Hu","doi":"10.1145/3560468","DOIUrl":"https://doi.org/10.1145/3560468","url":null,"abstract":"As an accurate measurement of physical activity, step counts data can be collected expediently by smartphones and wearable devices. Complete and high time-resolution step counts data record the time and intensity of individuals’ physical activity in a day, and can be used to mine activity habits or to recommend customized workout plans. However, sparse step counts data are common in practice due to hardware and software limitations. Understanding the value of sparse step counts data can contribute to its application in healthcare, and also can help us design cost-effective hardware and software. In this article, we aim to infer activity patterns from sparse step counts data. We design a deep learning model based on recurrent neural networks, namely MLP-GRU, which considers bidirectional short-term dependency and long-term regularity of sparse step counts data, and implements data-driven imputation and classification. We also develop an interpretable and elastic method to obtain sparse step counts data labeled with multi-granular activity patterns to train MLP-GRU. Evaluations on real-world datasets reveal that MLP-GRU outperforms other strong baseline methods. The results also show that activity patterns can be inferred from extremely sparse step counts data with high accuracy, provided that proper granularity is used for data of different sparsity.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"4 1","pages":"1 - 20"},"PeriodicalIF":0.0,"publicationDate":"2022-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43316831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Critical Review of Multimodal-multisensor Analytics for Anxiety Assessment 用于焦虑评估的多模式多传感器分析综述

ACM transactions on computing for healthcare

Pub Date : 2022-08-17 DOI: 10.1145/3556980

Hashini Senaratne, S. Oviatt, K. Ellis, Glenn Melvin

Recently, interest has grown in the assessment of anxiety that leverages human physiological and behavioral data to address the drawbacks of current subjective clinical assessments. Complex experiences of anxiety vary on multiple characteristics, including triggers, responses, duration and severity, and impact differently on the risk of anxiety disorders. This article reviews the past decade of studies that objectively analyzed various anxiety characteristics related to five common anxiety disorders in adults utilizing features of cardiac, electrodermal, blood pressure, respiratory, vocal, posture, movement, and eye metrics. Its originality lies in the synthesis and interpretation of consistently discovered heterogeneous predictors of anxiety and multimodal-multisensor analytics based on them. We reveal that few anxiety characteristics have been evaluated using multimodal-multisensor metrics, and many of the identified predictive features are confounded. As such, objective anxiety assessments are not yet complete or precise. That said, few multimodal-multisensor systems evaluated indicate an approximately 11.73% performance gain compared to unimodal systems, highlighting a promising powerful tool. We suggest six high-priority future directions to address the current gaps and limitations in infrastructure, basic knowledge, and application areas. Action in these directions will expedite the discovery of rich, accurate, continuous, and objective assessments and their use in impactful end-user applications.

最近，人们对利用人类生理和行为数据来解决当前主观临床评估的缺陷的焦虑评估越来越感兴趣。焦虑的复杂体验有多种特征，包括触发因素、反应、持续时间和严重程度，对焦虑症风险的影响也不同。本文回顾了过去十年的研究，这些研究利用心脏、皮肤电、血压、呼吸、声音、姿势、运动和眼睛指标的特征，客观分析了与成人五种常见焦虑症相关的各种焦虑特征。它的独创性在于综合和解释了一致发现的焦虑的异质预测因子，以及基于它们的多模态多传感器分析。我们发现，很少有人使用多模式多传感器指标来评估焦虑特征，而且许多已识别的预测特征都是混淆的。因此，客观的焦虑评估尚不完整或准确。也就是说，与单峰系统相比，很少有多模态多传感器系统的性能增益约为11.73%，这突出了一个有前景的强大工具。我们提出了六个高度优先的未来方向，以解决当前基础设施、基础知识和应用领域的差距和局限性。这些方向的行动将加快发现丰富、准确、连续和客观的评估，并将其用于有影响力的最终用户应用程序。

{"title":"A Critical Review of Multimodal-multisensor Analytics for Anxiety Assessment","authors":"Hashini Senaratne, S. Oviatt, K. Ellis, Glenn Melvin","doi":"10.1145/3556980","DOIUrl":"https://doi.org/10.1145/3556980","url":null,"abstract":"Recently, interest has grown in the assessment of anxiety that leverages human physiological and behavioral data to address the drawbacks of current subjective clinical assessments. Complex experiences of anxiety vary on multiple characteristics, including triggers, responses, duration and severity, and impact differently on the risk of anxiety disorders. This article reviews the past decade of studies that objectively analyzed various anxiety characteristics related to five common anxiety disorders in adults utilizing features of cardiac, electrodermal, blood pressure, respiratory, vocal, posture, movement, and eye metrics. Its originality lies in the synthesis and interpretation of consistently discovered heterogeneous predictors of anxiety and multimodal-multisensor analytics based on them. We reveal that few anxiety characteristics have been evaluated using multimodal-multisensor metrics, and many of the identified predictive features are confounded. As such, objective anxiety assessments are not yet complete or precise. That said, few multimodal-multisensor systems evaluated indicate an approximately 11.73% performance gain compared to unimodal systems, highlighting a promising powerful tool. We suggest six high-priority future directions to address the current gaps and limitations in infrastructure, basic knowledge, and application areas. Action in these directions will expedite the discovery of rich, accurate, continuous, and objective assessments and their use in impactful end-user applications.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 1","pages":"1 - 42"},"PeriodicalIF":0.0,"publicationDate":"2022-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49657787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

NHS Number Open Source Software: Implications for Digital Health Regulation and Development NHS数字开源软件：对数字健康监管和发展的启示

ACM transactions on computing for healthcare

Pub Date : 2022-08-05 DOI: 10.1145/3538382

H. Thimbleby

A national example of open source digital healthcare is critiqued. The code for implementing numeric patient identifiers is surprisingly naïve and bug-ridden, despite patient identifiers being computationally trivial and a critical component of reliable healthcare. The issues raised are shown to be widespread, long term, and apparently unrecognized. Problems are traced back to inadequacies in the relevant standards, and, at every stage, regulation through to development, inadequate Software Engineering input. An important finding is that the relevant healthcare standards are inconsistent and written without sufficient rigor to be at all constructive for implementing digital systems. The widely recognized problems of interoperability may be traced back to diverse (and buggy) interpretations of vague standards.

开放源码数字医疗保健的一个国家例子受到了批评。尽管患者标识符在计算上是微不足道的，并且是可靠医疗保健的关键组成部分，但实现数字患者标识符的代码令人惊讶地是naïve并且充满了bug。所提出的问题被证明是广泛的、长期的，而且显然没有被认识到。问题可以追溯到相关标准的不足，并且在每个阶段，从规则到开发，软件工程输入不足。一个重要的发现是，相关的医疗保健标准是不一致的，并且没有足够的严谨性来实施数字系统。广泛认可的互操作性问题可以追溯到对模糊标准的不同(和错误的)解释。

引用次数: 0

Evaluating Alarm Classifiers with High-confidence Data Programming 用高置信度数据编程评估报警分类器

ACM transactions on computing for healthcare

Pub Date : 2022-07-22 DOI: 10.1145/3549942

Sydney Pugh, I. Ruchkin, Christopher P. Bonafide, S. Demauro, O. Sokolsky, Insup Lee, James Weimer

Classification of clinical alarms is at the heart of prioritization, suppression, integration, postponement, and other methods of mitigating alarm fatigue. Since these methods directly affect clinical care, alarm classifiers, such as intelligent suppression systems, need to be evaluated in terms of their sensitivity and specificity, which is typically calculated on a labeled dataset of alarms. Unfortunately, the collection and particularly labeling of such datasets requires substantial effort and time, thus deterring hospitals from investigating mitigations of alarm fatigue. This article develops a lightweight method for evaluating alarm classifiers without perfect alarm labels. The method relies on probabilistic labels obtained from data programming—a labeling paradigm based on combining noisy and cheap-to-obtain labeling heuristics. Based on these labels, the method produces confidence bounds for the sensitivity/specificity values from a hypothetical evaluation with manual labeling. Our experiments on five alarm datasets collected at Children’s Hospital of Philadelphia show that the proposed method provides accurate bounds on the classifier’s sensitivity/specificity, appropriately reflecting the uncertainty from noisy labeling and limited sample sizes.

临床警报的分类是优先级、抑制、整合、延迟和其他缓解警报疲劳方法的核心。由于这些方法直接影响临床护理，因此需要根据其灵敏度和特异性来评估警报分类器，如智能抑制系统，这通常是在标记的警报数据集上计算的。不幸的是，这些数据集的收集，特别是标记需要大量的精力和时间，因此阻碍了医院调查警报疲劳的缓解措施。本文开发了一种轻量级的方法来评估没有完美警报标签的警报分类器。该方法依赖于从数据编程中获得的概率标签——这是一种基于将噪声和廉价相结合来获得标签启发式的标签范式。基于这些标签，该方法通过手动标签的假设评估产生灵敏度/特异性值的置信界限。我们在费城儿童医院收集的五个警报数据集上的实验表明，所提出的方法为分类器的灵敏度/特异性提供了准确的界限，适当地反映了噪声标记和有限样本量的不确定性。

{"title":"Evaluating Alarm Classifiers with High-confidence Data Programming","authors":"Sydney Pugh, I. Ruchkin, Christopher P. Bonafide, S. Demauro, O. Sokolsky, Insup Lee, James Weimer","doi":"10.1145/3549942","DOIUrl":"https://doi.org/10.1145/3549942","url":null,"abstract":"Classification of clinical alarms is at the heart of prioritization, suppression, integration, postponement, and other methods of mitigating alarm fatigue. Since these methods directly affect clinical care, alarm classifiers, such as intelligent suppression systems, need to be evaluated in terms of their sensitivity and specificity, which is typically calculated on a labeled dataset of alarms. Unfortunately, the collection and particularly labeling of such datasets requires substantial effort and time, thus deterring hospitals from investigating mitigations of alarm fatigue. This article develops a lightweight method for evaluating alarm classifiers without perfect alarm labels. The method relies on probabilistic labels obtained from data programming—a labeling paradigm based on combining noisy and cheap-to-obtain labeling heuristics. Based on these labels, the method produces confidence bounds for the sensitivity/specificity values from a hypothetical evaluation with manual labeling. Our experiments on five alarm datasets collected at Children’s Hospital of Philadelphia show that the proposed method provides accurate bounds on the classifier’s sensitivity/specificity, appropriately reflecting the uncertainty from noisy labeling and limited sample sizes.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 1","pages":"1 - 24"},"PeriodicalIF":0.0,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46915880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Leveraging Mobile Sensing and Bayesian Change Point Analysis to Monitor Community-scale Behavioral Interventions: A Case Study on COVID-19 利用移动传感和贝叶斯变化点分析监测社区行为干预：新冠肺炎病例研究

ACM transactions on computing for healthcare

Pub Date : 2022-07-20 DOI: 10.1145/3524886

Shashwat Kumar, Debajyoti Datta, Guimin Dong, Lihua Cai, Laura E. Barnes, M. Boukhechba

During pandemics, effective interventions require monitoring the problem at different scales and understanding the various tradeoffs between efficacy, privacy, and economic burden. To address these challenges, we propose a framework where we perform Bayesian change-point analysis on aggregate behavior markers extracted from mobile sensing data collected during the COVID-19 pandemic. Results generated by 598 participants for up to four months reveal rich insights: We observe an increase in smartphone usage around February 10th, followed by an increase in email usage around February 27th and, finally, a large reduction in participant’s mobility around March 13th. These behavior changes overlapped with important news events and government directives such as the naming of COVID-19, a spike in the number of reported cases in Europe, and the declaration of national emergency by President Trump. We also show that our detected change points align with changes in large scale external sources, including number of COVID-19 tweets, COVID-19 search traffic, and a large-scale foot traffic data collected by SafeGraph, providing further validation of our method. Our results show promise towards the feasibility of using mobile sensing to understand communities’ responses to public health interventions.

在流行病期间，有效的干预措施需要在不同规模上监测问题，并了解疗效、隐私和经济负担之间的各种权衡。为了应对这些挑战，我们提出了一个框架，在该框架中，我们对从新冠肺炎大流行期间收集的移动传感数据中提取的聚合行为标记进行贝叶斯变化点分析。598名参与者在长达四个月的时间里得出的结果揭示了丰富的见解：我们观察到，在2月10日左右，智能手机的使用量有所增加，随后在2月27日左右，电子邮件的使用量也有所增加，最后，在3月13日左右，参与者的行动能力大幅下降。这些行为变化与重要的新闻事件和政府指令重叠，如新冠肺炎的命名、欧洲报告病例数的激增以及特朗普总统宣布国家紧急状态。我们还表明，我们检测到的变化点与大规模外部来源的变化一致，包括新冠肺炎推文数量、新冠肺炎搜索流量和SafeGraph收集的大规模步行流量数据，为我们的方法提供了进一步的验证。我们的研究结果表明，使用移动传感来了解社区对公共卫生干预措施的反应是可行的。

{"title":"Leveraging Mobile Sensing and Bayesian Change Point Analysis to Monitor Community-scale Behavioral Interventions: A Case Study on COVID-19","authors":"Shashwat Kumar, Debajyoti Datta, Guimin Dong, Lihua Cai, Laura E. Barnes, M. Boukhechba","doi":"10.1145/3524886","DOIUrl":"https://doi.org/10.1145/3524886","url":null,"abstract":"During pandemics, effective interventions require monitoring the problem at different scales and understanding the various tradeoffs between efficacy, privacy, and economic burden. To address these challenges, we propose a framework where we perform Bayesian change-point analysis on aggregate behavior markers extracted from mobile sensing data collected during the COVID-19 pandemic. Results generated by 598 participants for up to four months reveal rich insights: We observe an increase in smartphone usage around February 10th, followed by an increase in email usage around February 27th and, finally, a large reduction in participant’s mobility around March 13th. These behavior changes overlapped with important news events and government directives such as the naming of COVID-19, a spike in the number of reported cases in Europe, and the declaration of national emergency by President Trump. We also show that our detected change points align with changes in large scale external sources, including number of COVID-19 tweets, COVID-19 search traffic, and a large-scale foot traffic data collected by SafeGraph, providing further validation of our method. Our results show promise towards the feasibility of using mobile sensing to understand communities’ responses to public health interventions.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"3 1","pages":"1 - 13"},"PeriodicalIF":0.0,"publicationDate":"2022-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45574105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2