首页 > 最新文献

ACM transactions on computing for healthcare最新文献

英文 中文
A method for comparing time series by untangling time-dependent and independent variations in biological processes 通过消除生物过程中与时间相关的独立变化来比较时间序列的方法
Pub Date : 2024-07-26 DOI: 10.1145/3681795
A. J. Thottupattu, J. Sivaswamy
Biological processes like growth, aging, and disease progression are generally studied with follow-up scans taken at different time points, i.e., image time series (TS) based analysis. Image time series represents the evolution of anatomy over time, but different anatomies may have different structural characteristics and temporal paths. Therefore, separating the time-dependent path difference and time-independent basic anatomy/shape changes is important when comparing two image time series to understand the causes of the observed differences better. A method to untangle and quantify the path and shape difference between the TS is presented in this paper. The proposed method is evaluated with simulated and adult and fetal neuro templates. Results show that the metric can separate and quantify the path and shape differences between TS.
对生长、衰老和疾病进展等生物过程的研究通常采用在不同时间点进行的随访扫描,即基于图像时间序列(TS)的分析。图像时间序列代表了解剖结构随时间的演变,但不同的解剖结构可能具有不同的结构特征和时间路径。因此,在比较两个图像时间序列时,必须将与时间相关的路径差异和与时间无关的基本解剖/形状变化区分开来,以便更好地理解观察到的差异的原因。本文提出了一种对 TS 之间的路径和形状差异进行分离和量化的方法。本文使用模拟的成人和胎儿神经模板对所提出的方法进行了评估。结果表明,该指标可以分离和量化 TS 之间的路径和形状差异。
{"title":"A method for comparing time series by untangling time-dependent and independent variations in biological processes","authors":"A. J. Thottupattu, J. Sivaswamy","doi":"10.1145/3681795","DOIUrl":"https://doi.org/10.1145/3681795","url":null,"abstract":"Biological processes like growth, aging, and disease progression are generally studied with follow-up scans taken at different time points, i.e., image time series (TS) based analysis. Image time series represents the evolution of anatomy over time, but different anatomies may have different structural characteristics and temporal paths. Therefore, separating the time-dependent path difference and time-independent basic anatomy/shape changes is important when comparing two image time series to understand the causes of the observed differences better. A method to untangle and quantify the path and shape difference between the TS is presented in this paper. The proposed method is evaluated with simulated and adult and fetal neuro templates. Results show that the metric can separate and quantify the path and shape differences between TS.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"45 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141800023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-assisted Diagnosing, Monitoring, and Treatment of Mental Disorders: A Survey 人工智能辅助诊断、监测和治疗精神障碍:一项调查
Pub Date : 2024-07-25 DOI: 10.1145/3681794
Faustino Muetunda, Soumaya Sabry, M. Jamil, Sebastião Pais, Gael Dias, João Cordeiro
Globally, 1 in 7 people has some kind of mental or substance use disorder that affects their thinking, feelings, and behaviour in everyday life. People with mental health disorders can continue their normal lives with proper treatment and support. Mental well-being is vital for physical health. The use of AI in mental health areas has grown exponentially in the last decade. However, mental disorders are still complex to diagnose due to similar and common symptoms for numerous mental illnesses, with a minute difference. Intelligent systems can help us identify mental diseases precisely, which is a critical step in diagnosing. Using these systems efficiently can improve the treatment and rapid recovery of patients. We survey different artificial intelligence systems used in mental healthcare, such as mobile applications, machine learning and deep learning methods, and multimodal systems and draw comparisons from recent developments and related challenges. Also, we discuss types of mental disorders and how these different techniques can support the therapist in diagnosing, monitoring, and treating patients with mental disorders.
在全球范围内,每 7 人中就有 1 人患有某种精神障碍或药物使用障碍,影响着他们在日常生活中的思维、情感和行为。有精神障碍的人只要得到适当的治疗和支持,就可以继续正常生活。心理健康对身体健康至关重要。近十年来,人工智能在精神健康领域的应用呈指数级增长。然而,由于众多精神疾病的症状相似且常见,但又存在细微差别,因此精神障碍的诊断仍然十分复杂。智能系统可以帮助我们精确识别精神疾病,这是诊断的关键一步。有效利用这些系统可以提高治疗效果,使患者迅速康复。我们调查了用于精神医疗的各种人工智能系统,如移动应用、机器学习和深度学习方法以及多模态系统,并对近期的发展和相关挑战进行了比较。此外,我们还讨论了精神障碍的类型,以及这些不同的技术如何支持治疗师诊断、监控和治疗精神障碍患者。
{"title":"AI-assisted Diagnosing, Monitoring, and Treatment of Mental Disorders: A Survey","authors":"Faustino Muetunda, Soumaya Sabry, M. Jamil, Sebastião Pais, Gael Dias, João Cordeiro","doi":"10.1145/3681794","DOIUrl":"https://doi.org/10.1145/3681794","url":null,"abstract":"Globally, 1 in 7 people has some kind of mental or substance use disorder that affects their thinking, feelings, and behaviour in everyday life. People with mental health disorders can continue their normal lives with proper treatment and support. Mental well-being is vital for physical health. The use of AI in mental health areas has grown exponentially in the last decade. However, mental disorders are still complex to diagnose due to similar and common symptoms for numerous mental illnesses, with a minute difference. Intelligent systems can help us identify mental diseases precisely, which is a critical step in diagnosing. Using these systems efficiently can improve the treatment and rapid recovery of patients. We survey different artificial intelligence systems used in mental healthcare, such as mobile applications, machine learning and deep learning methods, and multimodal systems and draw comparisons from recent developments and related challenges. Also, we discuss types of mental disorders and how these different techniques can support the therapist in diagnosing, monitoring, and treating patients with mental disorders.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"16 17","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141803001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HEalthRecordBERT (HERBERT): leveraging transformers on electronic health records for chronic kidney disease risk stratification HEalthRecordBERT (HERBERT):利用电子健康记录转换器进行慢性肾病风险分层
Pub Date : 2024-07-19 DOI: 10.1145/3665899
Alex Moore, B. Orset, A. Yassaee, Benjamin Irving, Davide Morelli
Risk stratification is an essential tool in the fight against many diseases, including chronic kidney disease. Recent work has focused on applying techniques from machine learning and leveraging the information contained in a patient’s electronic health record (EHR). Irregular intervals between data entries and the large number of variables tracked in EHR datasets can make them challenging to work with. Many of the difficulties associated with these datasets can be overcome by using large language models, such as bidirectional encoder representations from transformers (BERT). Previous attempts to apply BERT to EHR for risk stratification have shown promise. In this work we propose HERBERT, a novel application of BERT to EHR data. We identify two key areas where BERT models must be modified to adapt them to EHR data, namely: the embedding layer and the pretraining task. We show how changes to these can lead to improved performance, relative to the previous state of the art. We evaluate our model by predicting the transition of chronic kidney disease patients to end stage renal disease. The strong performance of our model justifies our architectural changes and suggests that large language models could play an important role in future renal risk stratification.
风险分层是防治包括慢性肾病在内的多种疾病的重要工具。近期的工作重点是应用机器学习技术和利用患者电子健康记录(EHR)中包含的信息。电子病历数据集的数据输入间隔不规则,跟踪的变量数量庞大,这些都给工作带来了挑战。与这些数据集相关的许多困难都可以通过使用大型语言模型来克服,例如转换器的双向编码器表示法(BERT)。之前将 BERT 应用于电子病历进行风险分层的尝试已显示出良好的前景。在这项工作中,我们提出了将 BERT 应用于电子病历数据的新方法 HERBERT。我们确定了 BERT 模型必须修改以适应电子病历数据的两个关键领域,即:嵌入层和预训练任务。我们展示了与之前的技术水平相比,对这两个方面的修改如何提高性能。我们通过预测慢性肾病患者向终末期肾病的转变来评估我们的模型。我们模型的强大性能证明了我们的架构改变是正确的,并表明大型语言模型在未来的肾脏风险分层中可以发挥重要作用。
{"title":"HEalthRecordBERT (HERBERT): leveraging transformers on electronic health records for chronic kidney disease risk stratification","authors":"Alex Moore, B. Orset, A. Yassaee, Benjamin Irving, Davide Morelli","doi":"10.1145/3665899","DOIUrl":"https://doi.org/10.1145/3665899","url":null,"abstract":"Risk stratification is an essential tool in the fight against many diseases, including chronic kidney disease. Recent work has focused on applying techniques from machine learning and leveraging the information contained in a patient’s electronic health record (EHR). Irregular intervals between data entries and the large number of variables tracked in EHR datasets can make them challenging to work with. Many of the difficulties associated with these datasets can be overcome by using large language models, such as bidirectional encoder representations from transformers (BERT). Previous attempts to apply BERT to EHR for risk stratification have shown promise. In this work we propose HERBERT, a novel application of BERT to EHR data. We identify two key areas where BERT models must be modified to adapt them to EHR data, namely: the embedding layer and the pretraining task. We show how changes to these can lead to improved performance, relative to the previous state of the art. We evaluate our model by predicting the transition of chronic kidney disease patients to end stage renal disease. The strong performance of our model justifies our architectural changes and suggests that large language models could play an important role in future renal risk stratification.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"115 46","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141822246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Computation Model to Estimate Interaction Intensity through Non-verbal Behavioral Cues: A Case Study of Intimate Couples under the Impact of Acute Alcohol Consumption 通过非语言行为线索估计互动强度的计算模型:急性酒精中毒影响下亲密伴侣的案例研究
Pub Date : 2024-07-19 DOI: 10.1145/3664826
Zhiwei, Z.Y. Yu, Cory, C.C. Crane, Linlin, L.C. Chen, Maria, M.T. Testa, Zhi, Z.Z. Zheng
This work introduced a novel analysis method to estimate interaction intensity, i.e., the level of positivity/negativity of an interaction, for intimate couples (married and heterosexual) under the impact of alcohol, which has great influences on behavioral health. Non-verbal behaviors are critical in interpersonal interactions. However, whether computer vision-detected non-verbal behaviors can effectively estimate interaction intensity of intimate couples is still unexplored. In this work, we proposed novel measurements and investigated their feasibility to estimate interaction intensities through machine learning regression models. Analyses were conducted based on a conflict-resolution conversation video dataset of intimate couples before and after acute alcohol consumption. Results showed the estimation error was at the lowest in the no-alcohol state but significantly increased if the model trained using no-alcohol data was applied to after-alcohol data, indicating that alcohol altered the interaction data in the feature space. While training a model using rich after-alcohol data is ideal to address the performance decrease, data collection in such a risky state is challenging in real life. Thus, we proposed a new State-Induced Domain Adaptation (SIDA) framework, which allows for improving estimation performance using only a small after-alcohol training dataset, pointing to a future direction of addressing data scarcity issues.
这项工作引入了一种新颖的分析方法,用于估算酒精影响下亲密伴侣(已婚和异性恋)的互动强度,即互动的积极/消极程度,酒精对行为健康有很大影响。非语言行为在人际交往中至关重要。然而,计算机视觉检测到的非语言行为是否能有效估计亲密情侣的互动强度,目前仍有待探索。在这项工作中,我们提出了新的测量方法,并研究了其通过机器学习回归模型估计互动强度的可行性。我们基于亲密情侣在急性饮酒前后的冲突解决对话视频数据集进行了分析。结果表明,在未饮酒状态下,估计误差最小,但如果将使用未饮酒数据训练的模型应用于饮酒后数据,则估计误差会显著增加,这表明酒精改变了特征空间中的互动数据。虽然使用丰富的酒后数据训练模型是解决性能下降问题的理想方法,但在这种危险状态下收集数据在现实生活中具有挑战性。因此,我们提出了一种新的状态诱导领域适应(SIDA)框架,只需使用少量酒后训练数据集即可提高估计性能,为解决数据稀缺问题指明了未来的方向。
{"title":"A Computation Model to Estimate Interaction Intensity through Non-verbal Behavioral Cues: A Case Study of Intimate Couples under the Impact of Acute Alcohol Consumption","authors":"Zhiwei, Z.Y. Yu, Cory, C.C. Crane, Linlin, L.C. Chen, Maria, M.T. Testa, Zhi, Z.Z. Zheng","doi":"10.1145/3664826","DOIUrl":"https://doi.org/10.1145/3664826","url":null,"abstract":"This work introduced a novel analysis method to estimate interaction intensity, i.e., the level of positivity/negativity of an interaction, for intimate couples (married and heterosexual) under the impact of alcohol, which has great influences on behavioral health. Non-verbal behaviors are critical in interpersonal interactions. However, whether computer vision-detected non-verbal behaviors can effectively estimate interaction intensity of intimate couples is still unexplored. In this work, we proposed novel measurements and investigated their feasibility to estimate interaction intensities through machine learning regression models. Analyses were conducted based on a conflict-resolution conversation video dataset of intimate couples before and after acute alcohol consumption. Results showed the estimation error was at the lowest in the no-alcohol state but significantly increased if the model trained using no-alcohol data was applied to after-alcohol data, indicating that alcohol altered the interaction data in the feature space. While training a model using rich after-alcohol data is ideal to address the performance decrease, data collection in such a risky state is challenging in real life. Thus, we proposed a new State-Induced Domain Adaptation (SIDA) framework, which allows for improving estimation performance using only a small after-alcohol training dataset, pointing to a future direction of addressing data scarcity issues.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":" 1092","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141823338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping Distributed Ledger Technology Characteristics to Use Cases in Healthcare: A Structured Literature Review 将分布式账本技术的特点映射到医疗保健领域的用例:结构化文献综述
Pub Date : 2024-07-19 DOI: 10.1145/3653076
Shanshan Hu, Manuel Schmidt-Kraepelin, Scott Thiebes, A. Sunyaev
Following the success of the Bitcoin blockchain, distributed ledger technology (DLT) has received extensive attention in health informatics research. Yet, the healthcare industry is highly complex with many different stakeholders, information systems, regulations, and challenges. Thus, DLT may be used in various settings and for different purposes. First surveys have started to synthesize our knowledge of the different use cases, in which healthcare may benefit from DLT implementations. However, an in-depth understanding of whether and how these use cases differ concerning their requirements of DLT characteristics (i.e., technical or administrative design features) is still lacking. In this work, we conducted a structured review of 185 studies on DLT-based applications in healthcare. The results reveal six pertinent use cases, each with its own combination of different purposes that DLT is used for. Furthermore, our study shows that each of these use cases has a unique set of requirements with regard to the most important DLT characteristics. In doing so, we seek to guide practitioners in the development of highly effective DLT-based applications in various healthcare settings and pave the way for future research to investigate the understudied areas of DLT-based applications in healthcare.
继比特币区块链取得成功后,分布式账本技术(DLT)在医疗信息学研究中受到广泛关注。然而,医疗保健行业非常复杂,有许多不同的利益相关者、信息系统、法规和挑战。因此,DLT 可用于各种环境和不同目的。首次调查已开始综合我们对不同用例的了解,在这些用例中,医疗保健可能会受益于 DLT 的实施。然而,对于这些使用案例对数字签名技术特征(即技术或管理设计特征)的要求是否不同以及如何不同,我们还缺乏深入的了解。在这项工作中,我们对基于 DLT 的医疗保健应用的 185 项研究进行了结构化回顾。研究结果显示了六种相关的使用案例,每种案例都结合了 DLT 的不同用途。此外,我们的研究还表明,这些用例中的每一种都对最重要的 DLT 特性有一套独特的要求。这样,我们就能指导从业人员在各种医疗保健环境中开发基于数字签名技术的高效应用,并为未来研究医疗保健中基于数字签名技术的应用铺平道路。
{"title":"Mapping Distributed Ledger Technology Characteristics to Use Cases in Healthcare: A Structured Literature Review","authors":"Shanshan Hu, Manuel Schmidt-Kraepelin, Scott Thiebes, A. Sunyaev","doi":"10.1145/3653076","DOIUrl":"https://doi.org/10.1145/3653076","url":null,"abstract":"Following the success of the Bitcoin blockchain, distributed ledger technology (DLT) has received extensive attention in health informatics research. Yet, the healthcare industry is highly complex with many different stakeholders, information systems, regulations, and challenges. Thus, DLT may be used in various settings and for different purposes. First surveys have started to synthesize our knowledge of the different use cases, in which healthcare may benefit from DLT implementations. However, an in-depth understanding of whether and how these use cases differ concerning their requirements of DLT characteristics (i.e., technical or administrative design features) is still lacking. In this work, we conducted a structured review of 185 studies on DLT-based applications in healthcare. The results reveal six pertinent use cases, each with its own combination of different purposes that DLT is used for. Furthermore, our study shows that each of these use cases has a unique set of requirements with regard to the most important DLT characteristics. In doing so, we seek to guide practitioners in the development of highly effective DLT-based applications in various healthcare settings and pave the way for future research to investigate the understudied areas of DLT-based applications in healthcare.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":" November","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141823614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
iScan: Detection of Colorectal Cancer From CT Scan Images Using Deep Learning iScan:利用深度学习从 CT 扫描图像中检测结直肠癌
Pub Date : 2024-07-19 DOI: 10.1145/3676282
Sagnik Ghosal, Debanjan Das, Jay Kumar Rai, Akanksha Singh Pandaw, Sakshi Verma
Colorectal cancer, a highly lethal form of cancer, can be treated effectively if detected early. However, the current diagnosis process involves a time-consuming and manual review of CT scans to identify cancerous regions and behavior, leading to resource consumption, subjectivity, and dependency on manual assessment. We propose a 3-phase deep neural system for automated colorectal cancer detection using CT scan images to address these challenges. It includes a SegNet network to identify tumor locations, an InceptionResNet V2 network to classify tumors as benign or malignant, and an analysis of tumor area cum perimeter to predict the cancer stage. The proposed model offers a fully automated solution by combining these functionalities under a single umbrella. In real-life CT scans from 37 patients, the proposed model achieved 95.8 (%) ROI segmentation accuracy, a dice coefficient of 0.6214, 69.75 (%) IoU score, and 95.83 (%) tumor classification accuracy. The unique approach using Radial Length (RL) and Circularity (C) parameters predicted the T-stage with close to 85 (%) accuracy. Based on these outcomes, the proposed system establishes itself as a reliable and suitable alternative to traditional cancer diagnosis techniques by leveraging the power of automation, deep learning, and innovative parameter analysis.
大肠癌是一种致死率极高的癌症,如果能及早发现,就能得到有效治疗。然而,目前的诊断过程需要对 CT 扫描图像进行耗时的人工检查,以确定癌变区域和癌变行为,这导致了资源消耗、主观性和对人工评估的依赖。为了应对这些挑战,我们提出了一种利用 CT 扫描图像自动检测结直肠癌的三阶段深度神经系统。该系统包括用于识别肿瘤位置的 SegNet 网络、用于将肿瘤分为良性和恶性的 InceptionResNet V2 网络,以及用于预测癌症分期的肿瘤面积和周长分析。所提出的模型将这些功能整合在一起,提供了一个全自动的解决方案。在37名患者的真实CT扫描中,所提出的模型达到了95.8的ROI分割准确率、0.6214的骰子系数、69.75的IoU得分和95.83的肿瘤分类准确率。使用径向长度(RL)和圆周率(C)参数的独特方法预测T期的准确率接近85%。基于这些结果,所提出的系统通过利用自动化、深度学习和创新参数分析的力量,成为传统癌症诊断技术的可靠和合适的替代方案。
{"title":"iScan: Detection of Colorectal Cancer From CT Scan Images Using Deep Learning","authors":"Sagnik Ghosal, Debanjan Das, Jay Kumar Rai, Akanksha Singh Pandaw, Sakshi Verma","doi":"10.1145/3676282","DOIUrl":"https://doi.org/10.1145/3676282","url":null,"abstract":"\u0000 Colorectal cancer, a highly lethal form of cancer, can be treated effectively if detected early. However, the current diagnosis process involves a time-consuming and manual review of CT scans to identify cancerous regions and behavior, leading to resource consumption, subjectivity, and dependency on manual assessment. We propose a 3-phase deep neural system for automated colorectal cancer detection using CT scan images to address these challenges. It includes a SegNet network to identify tumor locations, an InceptionResNet V2 network to classify tumors as benign or malignant, and an analysis of tumor area cum perimeter to predict the cancer stage. The proposed model offers a fully automated solution by combining these functionalities under a single umbrella. In real-life CT scans from 37 patients, the proposed model achieved 95.8\u0000 \u0000 (%)\u0000 \u0000 ROI segmentation accuracy, a dice coefficient of 0.6214, 69.75\u0000 \u0000 (%)\u0000 \u0000 IoU score, and 95.83\u0000 \u0000 (%)\u0000 \u0000 tumor classification accuracy. The unique approach using Radial Length (RL) and Circularity (C) parameters predicted the T-stage with close to 85\u0000 \u0000 (%)\u0000 \u0000 accuracy. Based on these outcomes, the proposed system establishes itself as a reliable and suitable alternative to traditional cancer diagnosis techniques by leveraging the power of automation, deep learning, and innovative parameter analysis.\u0000","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"8 23","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141822359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Loss Relaxation Strategy for Noisy Facial Video-based Automatic Depression Recognition 基于噪声面部视频的自动抑郁识别的损失松弛策略
Pub Date : 2024-03-04 DOI: 10.1145/3648696
Siyang Song, Yi-Xiang Luo, Tugba Tumer, Michel Valstar, Hatice Gunes
Automatic depression analysis has been widely investigated on face videos that have been carefully collected and annotated in lab conditions. However, videos collected under real-world conditions may suffer from various types of noises due to challenging data acquisition conditions and lack of annotators. Although deep learning (DL) models frequently show excellent depression analysis performances on datasets collected in controlled lab conditions, such noise may degrade their generalization abilities for real-world depression analysis tasks. In this paper, we uncovered that noisy facial data and annotations consistently change the distribution of training losses for facial depression DL models, i.e., noisy data-label pairs cause larger loss values compared to clean data-label pairs. Since different loss functions could be applied depending on the employed model and task, we propose a generic loss function relaxation strategy that can jointly reduce the negative impact of various noisy data and annotation problems occurring in both classification and regression loss functions, for face video-based depression analysis, where the parameters of the proposed strategy can be automatically adapted during depression model training. The experimental results on 25 different artificially created noisy depression conditions (i.e., five noise types with five different noise levels) show that our loss relaxation strategy can clearly enhance both classification and regression loss functions, enabling the generation of superior face video-based depression analysis models under almost all noisy conditions. Our approach is robust to its main variable settings, and can adaptively and automatically obtain its parameters during training.
自动抑郁分析已在实验室条件下仔细采集和标注的人脸视频中得到广泛研究。然而,由于数据采集条件具有挑战性且缺乏注释者,在真实世界条件下采集的视频可能会受到各种噪音的影响。虽然深度学习(DL)模型经常在受控实验室条件下收集的数据集上显示出出色的抑郁分析性能,但这些噪声可能会降低它们在真实世界抑郁分析任务中的泛化能力。在本文中,我们发现有噪声的面部数据和注释会持续改变面部抑郁深度学习模型的训练损失分布,也就是说,与干净的数据标签对相比,有噪声的数据标签对会导致更大的损失值。由于不同的模型和任务可以使用不同的损失函数,我们提出了一种通用的损失函数松弛策略,可以共同减少分类和回归损失函数中出现的各种噪声数据和标注问题对基于人脸视频的抑郁分析的负面影响,该策略的参数可以在抑郁模型训练过程中自动调整。在 25 种不同的人为噪声抑郁条件(即五种噪声类型和五种不同的噪声水平)下的实验结果表明,我们的损失松弛策略可以明显增强分类和回归损失函数,从而在几乎所有噪声条件下生成卓越的基于人脸视频的抑郁分析模型。我们的方法对其主要变量设置具有鲁棒性,并能在训练过程中自适应地自动获取参数。
{"title":"Loss Relaxation Strategy for Noisy Facial Video-based Automatic Depression Recognition","authors":"Siyang Song, Yi-Xiang Luo, Tugba Tumer, Michel Valstar, Hatice Gunes","doi":"10.1145/3648696","DOIUrl":"https://doi.org/10.1145/3648696","url":null,"abstract":"Automatic depression analysis has been widely investigated on face videos that have been carefully collected and annotated in lab conditions. However, videos collected under real-world conditions may suffer from various types of noises due to challenging data acquisition conditions and lack of annotators. Although deep learning (DL) models frequently show excellent depression analysis performances on datasets collected in controlled lab conditions, such noise may degrade their generalization abilities for real-world depression analysis tasks. In this paper, we uncovered that noisy facial data and annotations consistently change the distribution of training losses for facial depression DL models, i.e., noisy data-label pairs cause larger loss values compared to clean data-label pairs. Since different loss functions could be applied depending on the employed model and task, we propose a generic loss function relaxation strategy that can jointly reduce the negative impact of various noisy data and annotation problems occurring in both classification and regression loss functions, for face video-based depression analysis, where the parameters of the proposed strategy can be automatically adapted during depression model training. The experimental results on 25 different artificially created noisy depression conditions (i.e., five noise types with five different noise levels) show that our loss relaxation strategy can clearly enhance both classification and regression loss functions, enabling the generation of superior face video-based depression analysis models under almost all noisy conditions. Our approach is robust to its main variable settings, and can adaptively and automatically obtain its parameters during training.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"12 s2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140266193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Interpretable Trend Analysis Neural Networks for Longitudinal Data Analysis 用于纵向数据分析的可解释趋势分析神经网络
Pub Date : 2024-02-19 DOI: 10.1145/3648105
Zhenjie Yao, Yixin Chen, Jinwei Wang, Junjuan Li, Shuohua Chen, Shouling Wu, Yanhui Tu, Ming-Hui Zhao, Luxia Zhang
Cohort study is one of the most commonly used study methods in medical and public health researches, which result in longitudinal data. Conventional statistical models and machine learning methods are not capable of modeling the evolution trend of the variables in longitudinal data. In this paper, we propose a Trend Analysis Neural Networks (TANN), which models the evolution trend of the variables by adaptive feature learning. TANN was tested on dataset of Kaiuan research. The task was to predict occurrence of cardiovascular events within 2 and 5 years, with 3 repeated medical examinations during 2008 and 2013. For 2-year prediction, The AUC of the TANN is 0.7378, which is a significant improvement than that of conventional methods, while that of TRNS, RNN, DNN, GBDT, RF, and LR are 0.7222, 0.7034, 0.7054, 0.7136, 0.7160 and 0.7024, respectively. For 5-year prediction, TANN also shows improvement. The experimental results show that the proposed TANN achieves better prediction performance on cardiovascular events prediction than conventional models. Furthermore, by analyzing the weights of TANN, we could find out important trends of the indicators, which are ignored by conventional machine learning models. The trend discovery mechanism interprets the model well. TANN is an appropriate balance between high performance and interpretability.
队列研究是医学和公共卫生研究中最常用的研究方法之一,其结果是纵向数据。传统的统计模型和机器学习方法无法对纵向数据中变量的演变趋势进行建模。本文提出了一种趋势分析神经网络(TANN),它通过自适应特征学习对变量的演变趋势进行建模。我们在开元研究的数据集上对 TANN 进行了测试。任务是通过 2008 年和 2013 年期间的 3 次重复体检预测 2 年和 5 年内心血管事件的发生率。在 2 年预测中,TANN 的 AUC 为 0.7378,比传统方法显著提高,而 TRNS、RNN、DNN、GBDT、RF 和 LR 的 AUC 分别为 0.7222、0.7034、0.7054、0.7136、0.7160 和 0.7024。在 5 年期预测方面,TANN 也有所改进。实验结果表明,与传统模型相比,所提出的 TANN 在心血管事件预测方面取得了更好的预测效果。此外,通过分析 TANN 的权重,我们可以发现指标的重要趋势,而传统的机器学习模型会忽略这些趋势。趋势发现机制很好地解释了模型。TANN 在高性能和可解释性之间取得了适当的平衡。
{"title":"An Interpretable Trend Analysis Neural Networks for Longitudinal Data Analysis","authors":"Zhenjie Yao, Yixin Chen, Jinwei Wang, Junjuan Li, Shuohua Chen, Shouling Wu, Yanhui Tu, Ming-Hui Zhao, Luxia Zhang","doi":"10.1145/3648105","DOIUrl":"https://doi.org/10.1145/3648105","url":null,"abstract":"Cohort study is one of the most commonly used study methods in medical and public health researches, which result in longitudinal data. Conventional statistical models and machine learning methods are not capable of modeling the evolution trend of the variables in longitudinal data. In this paper, we propose a Trend Analysis Neural Networks (TANN), which models the evolution trend of the variables by adaptive feature learning. TANN was tested on dataset of Kaiuan research. The task was to predict occurrence of cardiovascular events within 2 and 5 years, with 3 repeated medical examinations during 2008 and 2013. For 2-year prediction, The AUC of the TANN is 0.7378, which is a significant improvement than that of conventional methods, while that of TRNS, RNN, DNN, GBDT, RF, and LR are 0.7222, 0.7034, 0.7054, 0.7136, 0.7160 and 0.7024, respectively. For 5-year prediction, TANN also shows improvement. The experimental results show that the proposed TANN achieves better prediction performance on cardiovascular events prediction than conventional models. Furthermore, by analyzing the weights of TANN, we could find out important trends of the indicators, which are ignored by conventional machine learning models. The trend discovery mechanism interprets the model well. TANN is an appropriate balance between high performance and interpretability.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"22 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139958360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WalkingWizard - A truly wearable EEG headset for everyday use WalkingWizard - 适合日常使用的真正可佩戴脑电图耳机
Pub Date : 2024-02-15 DOI: 10.1145/3648106
Teck Lun Goh, L. Peh
Electroencephalography (EEG) provides an opportunity to gain insights to electrocortical activity without the need for invasive technology. While increasingly used in various application areas, EEG headsets tend to be suited only to a laboratory environment due to the long preparation time to don the headset and the need for users to remain stationary. We present our design of a dry, dual-electrodes flexible PCB assembly that realizes accurate sensing in face of practical motion artifacts. Using it, we present WalkingWizard, our prototype dry-electrode EEG baseball cap that can be used under motion in everyday scenarios. We first evaluated its hardware performance by comparing its electrode-scalp impedance and ability to capture alpha rhythm against both wet EEG, and commercially available dry EEG headsets. We then tested WalkingWizard using SSVEP experiments, achieving high classification accuracy of 87% for walking speeds up to 5.0km/hr, beating state-of-the-art. Expanding on WalkingWizard, we integrated all necessary electronic components into a flexible PCB assembly - realizing WalkingWizard Integrated , in a truly wearable form-factor. Utilizing WalkingWizard Integrated, we demonstrated several applications as proof-of-concept: Classification of SSVEP in VR environment while walking, Real-time acquisition of emotional state of users while moving around the neighbourhood, and Understanding the effect of guided meditation for relaxation.
脑电图(EEG)提供了一个无需侵入性技术即可深入了解皮层电活动的机会。虽然脑电图耳机越来越多地应用于各个领域,但由于佩戴耳机的准备时间较长,而且用户需要保持静止不动,因此往往只适用于实验室环境。我们介绍了我们设计的干式双电极柔性 PCB 组件,它能在实际运动伪影面前实现精确传感。利用它,我们推出了 WalkingWizard,这是我们的干电极脑电图棒球帽原型,可在日常运动场景下使用。我们首先评估了它的硬件性能,将其电极鳞片阻抗和捕捉α节律的能力与湿式脑电图和市售干式脑电图耳机进行了比较。然后,我们使用 SSVEP 实验对 WalkingWizard 进行了测试,在步行速度高达 5.0km/hr 的情况下,分类准确率高达 87%,超过了最先进的水平。在 WalkingWizard 的基础上,我们将所有必要的电子元件集成到一个灵活的印刷电路板组件中--实现了 WalkingWizard Integrated,具有真正的可穿戴外形。利用 WalkingWizard Integrated,我们展示了几个应用作为概念验证:步行时在 VR 环境中对 SSVEP 进行分类、在社区中移动时实时获取用户的情绪状态,以及了解引导式冥想对放松的影响。
{"title":"WalkingWizard - A truly wearable EEG headset for everyday use","authors":"Teck Lun Goh, L. Peh","doi":"10.1145/3648106","DOIUrl":"https://doi.org/10.1145/3648106","url":null,"abstract":"\u0000 Electroencephalography (EEG) provides an opportunity to gain insights to electrocortical activity without the need for invasive technology. While increasingly used in various application areas, EEG headsets tend to be suited only to a laboratory environment due to the long preparation time to don the headset and the need for users to remain stationary. We present our design of a dry, dual-electrodes flexible PCB assembly that realizes accurate sensing in face of practical motion artifacts. Using it, we present WalkingWizard, our prototype dry-electrode EEG baseball cap that can be used under motion in everyday scenarios. We first evaluated its hardware performance by comparing its electrode-scalp impedance and ability to capture alpha rhythm against both wet EEG, and commercially available dry EEG headsets. We then tested WalkingWizard using SSVEP experiments, achieving high classification accuracy of 87% for walking speeds up to 5.0km/hr, beating state-of-the-art. Expanding on WalkingWizard, we integrated all necessary electronic components into a flexible PCB assembly - realizing\u0000 WalkingWizard Integrated\u0000 , in a truly wearable form-factor. Utilizing WalkingWizard Integrated, we demonstrated several applications as proof-of-concept: Classification of SSVEP in VR environment while walking, Real-time acquisition of emotional state of users while moving around the neighbourhood, and Understanding the effect of guided meditation for relaxation.\u0000","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"118 19","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139776668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WalkingWizard - A truly wearable EEG headset for everyday use WalkingWizard - 适合日常使用的真正可佩戴脑电图耳机
Pub Date : 2024-02-15 DOI: 10.1145/3648106
Teck Lun Goh, L. Peh
Electroencephalography (EEG) provides an opportunity to gain insights to electrocortical activity without the need for invasive technology. While increasingly used in various application areas, EEG headsets tend to be suited only to a laboratory environment due to the long preparation time to don the headset and the need for users to remain stationary. We present our design of a dry, dual-electrodes flexible PCB assembly that realizes accurate sensing in face of practical motion artifacts. Using it, we present WalkingWizard, our prototype dry-electrode EEG baseball cap that can be used under motion in everyday scenarios. We first evaluated its hardware performance by comparing its electrode-scalp impedance and ability to capture alpha rhythm against both wet EEG, and commercially available dry EEG headsets. We then tested WalkingWizard using SSVEP experiments, achieving high classification accuracy of 87% for walking speeds up to 5.0km/hr, beating state-of-the-art. Expanding on WalkingWizard, we integrated all necessary electronic components into a flexible PCB assembly - realizing WalkingWizard Integrated , in a truly wearable form-factor. Utilizing WalkingWizard Integrated, we demonstrated several applications as proof-of-concept: Classification of SSVEP in VR environment while walking, Real-time acquisition of emotional state of users while moving around the neighbourhood, and Understanding the effect of guided meditation for relaxation.
脑电图(EEG)提供了一个无需侵入性技术即可深入了解皮层电活动的机会。虽然脑电图耳机越来越多地应用于各个领域,但由于佩戴耳机的准备时间较长,而且用户需要保持静止不动,因此往往只适用于实验室环境。我们介绍了我们设计的干式双电极柔性 PCB 组件,它能在实际运动伪影面前实现精确传感。利用它,我们推出了 WalkingWizard,这是我们的干电极脑电图棒球帽原型,可在日常运动场景下使用。我们首先评估了它的硬件性能,将其电极鳞片阻抗和捕捉α节律的能力与湿式脑电图和市售干式脑电图耳机进行了比较。然后,我们使用 SSVEP 实验对 WalkingWizard 进行了测试,在步行速度高达 5.0km/hr 的情况下,分类准确率高达 87%,超过了最先进的水平。在 WalkingWizard 的基础上,我们将所有必要的电子元件集成到一个灵活的印刷电路板组件中--实现了 WalkingWizard Integrated,具有真正的可穿戴外形。利用 WalkingWizard Integrated,我们展示了几个应用作为概念验证:步行时在 VR 环境中对 SSVEP 进行分类、在社区中移动时实时获取用户的情绪状态,以及了解引导式冥想对放松的影响。
{"title":"WalkingWizard - A truly wearable EEG headset for everyday use","authors":"Teck Lun Goh, L. Peh","doi":"10.1145/3648106","DOIUrl":"https://doi.org/10.1145/3648106","url":null,"abstract":"\u0000 Electroencephalography (EEG) provides an opportunity to gain insights to electrocortical activity without the need for invasive technology. While increasingly used in various application areas, EEG headsets tend to be suited only to a laboratory environment due to the long preparation time to don the headset and the need for users to remain stationary. We present our design of a dry, dual-electrodes flexible PCB assembly that realizes accurate sensing in face of practical motion artifacts. Using it, we present WalkingWizard, our prototype dry-electrode EEG baseball cap that can be used under motion in everyday scenarios. We first evaluated its hardware performance by comparing its electrode-scalp impedance and ability to capture alpha rhythm against both wet EEG, and commercially available dry EEG headsets. We then tested WalkingWizard using SSVEP experiments, achieving high classification accuracy of 87% for walking speeds up to 5.0km/hr, beating state-of-the-art. Expanding on WalkingWizard, we integrated all necessary electronic components into a flexible PCB assembly - realizing\u0000 WalkingWizard Integrated\u0000 , in a truly wearable form-factor. Utilizing WalkingWizard Integrated, we demonstrated several applications as proof-of-concept: Classification of SSVEP in VR environment while walking, Real-time acquisition of emotional state of users while moving around the neighbourhood, and Understanding the effect of guided meditation for relaxation.\u0000","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"61 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139836174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM transactions on computing for healthcare
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1