首页 > 最新文献

IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics最新文献

英文 中文
Attention-based Imputation of Missing Values in Electronic Health Records Tabular Data. 基于注意力的电子健康记录表格数据缺失值估算。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00030
Ibna Kowsar, Shourav B Rabbani, Manar D Samad

The imputation of missing values (IMV) in electronic health records tabular data is crucial to enable machine learning for patient-specific predictive modeling. While IMV methods are developed in biostatistics and recently in machine learning, deep learning-based solutions have shown limited success in learning tabular data. This paper proposes a novel attention-based missing value imputation framework that learns to reconstruct data with missing values leveraging between-feature (self-attention) or between-sample attentions. We adopt data manipulation methods used in contrastive learning to improve the generalization of the trained imputation model. The proposed self-attention imputation method outperforms state-of-the-art statistical and machine learning-based (decision-tree) imputation methods, reducing the normalized root mean squared error by 18.4% to 74.7% on five tabular data sets and 52.6% to 82.6% on two electronic health records data sets. The proposed attention-based missing value imputation method shows superior performance across a wide range of missingness (10% to 50%) when the values are missing completely at random.

电子健康记录表格数据中缺失值的估算(IMV)对于机器学习进行特定患者预测建模至关重要。虽然生物统计学和最近的机器学习领域都开发了缺失值估算方法,但基于深度学习的解决方案在学习表格数据方面的成功率有限。本文提出了一种新颖的基于注意力的缺失值估算框架,它能利用特征间(自我注意力)或样本间注意力学习重建缺失值数据。我们采用了对比学习中使用的数据处理方法,以提高训练有素的估算模型的泛化能力。所提出的自我注意力估算方法优于最先进的统计和基于机器学习(决策树)的估算方法,在五个表格数据集上将归一化均方根误差降低了 18.4% 到 74.7%,在两个电子健康记录数据集上将归一化均方根误差降低了 52.6% 到 82.6%。当数值完全随机缺失时,所提出的基于注意力的缺失值估算方法在很大的缺失率范围(10% 到 50%)内都表现出了卓越的性能。
{"title":"Attention-based Imputation of Missing Values in Electronic Health Records Tabular Data.","authors":"Ibna Kowsar, Shourav B Rabbani, Manar D Samad","doi":"10.1109/ichi61247.2024.00030","DOIUrl":"10.1109/ichi61247.2024.00030","url":null,"abstract":"<p><p>The imputation of missing values (IMV) in electronic health records tabular data is crucial to enable machine learning for patient-specific predictive modeling. While IMV methods are developed in biostatistics and recently in machine learning, deep learning-based solutions have shown limited success in learning tabular data. This paper proposes a novel attention-based missing value imputation framework that learns to reconstruct data with missing values leveraging between-feature (self-attention) or between-sample attentions. We adopt data manipulation methods used in contrastive learning to improve the generalization of the trained imputation model. The proposed self-attention imputation method outperforms state-of-the-art statistical and machine learning-based (decision-tree) imputation methods, reducing the normalized root mean squared error by 18.4% to 74.7% on five tabular data sets and 52.6% to 82.6% on two electronic health records data sets. The proposed attention-based missing value imputation method shows superior performance across a wide range of missingness (10% to 50%) when the values are missing completely at random.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"177-182"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11463999/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142395730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Professional Radiologists' Expertise to Enhance LLMs' Evaluation for AI-generated Radiology Reports. 利用专业放射学专家的专业知识,加强法律硕士对人工智能生成的放射学报告的评估。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00058
Qingqing Zhu, Xiuying Chen, Qiao Jin, Benjamin Hou, Tejas Sudharshan Mathai, Pritam Mukherjee, Xin Gao, Ronald M Summers, Zhiyong Lu

In radiology, Artificial Intelligence (AI) has significantly advanced report generation, but automatic evaluation of these AI-produced reports remains challenging. Current metrics, such as Conventional Natural Language Generation (NLG) and Clinical Efficacy (CE), often fall short in capturing the semantic intricacies of clinical contexts or overemphasize clinical details, undermining report clarity. To overcome these issues, our proposed method synergizes the expertise of professional radiologists with Large Language Models (LLMs), like GPT-3.5 and GPT-4. Utilizing In-Context Instruction Learning (ICIL) and Chain of Thought (CoT) reasoning, our approach aligns LLM evaluations with radiologist standards, enabling detailed comparisons between human and AI-generated reports. This is further enhanced by a Regression model that aggregates sentence evaluation scores. Experimental results show that our "Detailed GPT-4 (5-shot)" model achieves a correlation that is 0.48, outperforming the METEOR metric by 0.19, while our "Regressed GPT-4" model shows even greater alignment(0.64) with expert evaluations, exceeding the best existing metric by a 0.35 margin. Moreover, the robustness of our explanations has been validated through a thorough iterative strategy. We plan to publicly release annotations from radiology experts, setting a new standard for accuracy in future assessments. This underscores the potential of our approach in enhancing the quality assessment of AI-driven medical reports.

{"title":"Leveraging Professional Radiologists' Expertise to Enhance LLMs' Evaluation for AI-generated Radiology Reports.","authors":"Qingqing Zhu, Xiuying Chen, Qiao Jin, Benjamin Hou, Tejas Sudharshan Mathai, Pritam Mukherjee, Xin Gao, Ronald M Summers, Zhiyong Lu","doi":"10.1109/ichi61247.2024.00058","DOIUrl":"10.1109/ichi61247.2024.00058","url":null,"abstract":"<p><p>In radiology, Artificial Intelligence (AI) has significantly advanced report generation, but automatic evaluation of these AI-produced reports remains challenging. Current metrics, such as Conventional Natural Language Generation (NLG) and Clinical Efficacy (CE), often fall short in capturing the semantic intricacies of clinical contexts or overemphasize clinical details, undermining report clarity. To overcome these issues, our proposed method synergizes the expertise of professional radiologists with Large Language Models (LLMs), like GPT-3.5 and GPT-4. Utilizing In-Context Instruction Learning (ICIL) and Chain of Thought (CoT) reasoning, our approach aligns LLM evaluations with radiologist standards, enabling detailed comparisons between human and AI-generated reports. This is further enhanced by a Regression model that aggregates sentence evaluation scores. Experimental results show that our \"Detailed GPT-4 (5-shot)\" model achieves a correlation that is 0.48, outperforming the METEOR metric by 0.19, while our \"Regressed GPT-4\" model shows even greater alignment(0.64) with expert evaluations, exceeding the best existing metric by a 0.35 margin. Moreover, the robustness of our explanations has been validated through a thorough iterative strategy. We plan to publicly release annotations from radiology experts, setting a new standard for accuracy in future assessments. This underscores the potential of our approach in enhancing the quality assessment of AI-driven medical reports.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"402-411"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11651630/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142848618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An average-case efficient two-stage algorithm for enumerating all longest common substrings of minimum length k between genome pairs. 一种平均情况下高效的两阶段算法,用于枚举基因组对之间最小长度为 k 的所有最长公共子串。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00020
Mattia Prosperi, Simone Marini, Christina Boucher

A problem extension of the longest common substring (LCS) between two texts is the enumeration of all LCSs given a minimum length k (ALCS- k ), along with their positions in each text. In bioinformatics, an efficient solution to the ALCS- k for very long texts -genomes or metagenomes- can provide useful insights to discover genetic signatures responsible for biological mechanisms. The ALCS- k problem has two additional requirements compared to the LCS problem: one is the minimum length k , and the other is that all common strings longer than k must be reported. We present an efficient, two-stage ALCS- k algorithm exploiting the spectrum of text substrings of length k ( k -mers). Our approach yields a worst-case time complexity loglinear in the number of k -mers for the first stage, and an average-case loglinear in the number of common k -mers for the second stage (several orders of magnitudes smaller than the total k -mer spectrum). The space complexity is linear in the first phase (disk-based), and on average linear in the second phase (disk- and memory-based). Tests performed on genomes for different organisms (including viruses, bacteria and animal chromosomes) show that run times are consistent with our theoretical estimates; further, comparisons with MUMmer4 show an asymptotic advantage with divergent genomes.

两个文本之间最长公共子串(LCS)问题的扩展是枚举给定最小长度 k 的所有 LCS(ALCS- k)以及它们在每个文本中的位置。在生物信息学中,针对超长文本--基因组或元基因组--的 ALCS- k 的有效解决方案可以为发现生物机制的遗传特征提供有用的见解。与 LCS 问题相比,ALCS- k 问题有两个额外的要求:一个是最小长度 k,另一个是必须报告所有长于 k 的普通字符串。我们提出了一种高效的两阶段 ALCS- k 算法,该算法利用了长度为 k 的文本子串谱(k -mers)。我们的方法在最坏情况下,第一阶段的时间复杂度与 k -mers 的数量成对数线性关系,在平均情况下,第二阶段的时间复杂度与常见 k -mers 的数量成对数线性关系(比总 k -mers 频谱小几个数量级)。空间复杂度在第一阶段(基于磁盘)是线性的,在第二阶段(基于磁盘和内存)平均是线性的。在不同生物体(包括病毒、细菌和动物染色体)基因组上进行的测试表明,运行时间与我们的理论估计值一致;此外,与 MUMmer4 的比较显示,在不同基因组上具有渐进优势。
{"title":"<ArticleTitle xmlns:ns0=\"http://www.w3.org/1998/Math/MathML\">An average-case efficient two-stage algorithm for enumerating all longest common substrings of minimum length <ns0:math><ns0:mi>k</ns0:mi></ns0:math> between genome pairs.","authors":"Mattia Prosperi, Simone Marini, Christina Boucher","doi":"10.1109/ichi61247.2024.00020","DOIUrl":"10.1109/ichi61247.2024.00020","url":null,"abstract":"<p><p>A problem extension of the longest common substring (LCS) between two texts is the enumeration of all LCSs given a minimum length <math><mi>k</mi></math> (ALCS- <math><mi>k</mi></math> ), along with their positions in each text. In bioinformatics, an efficient solution to the ALCS- <math><mi>k</mi></math> for very long texts -genomes or metagenomes- can provide useful insights to discover genetic signatures responsible for biological mechanisms. The ALCS- <math><mi>k</mi></math> problem has two additional requirements compared to the LCS problem: one is the minimum length <math><mi>k</mi></math> , and the other is that all common strings longer than <math><mi>k</mi></math> must be reported. We present an efficient, two-stage ALCS- <math><mi>k</mi></math> algorithm exploiting the spectrum of text substrings of length <math><mi>k</mi></math> ( <math><mi>k</mi></math> -mers). Our approach yields a worst-case time complexity loglinear in the number of <math><mi>k</mi></math> -mers for the first stage, and an average-case loglinear in the number of common <math><mi>k</mi></math> -mers for the second stage (several orders of magnitudes smaller than the total <math><mi>k</mi></math> -mer spectrum). The space complexity is linear in the first phase (disk-based), and on average linear in the second phase (disk- and memory-based). Tests performed on genomes for different organisms (including viruses, bacteria and animal chromosomes) show that run times are consistent with our theoretical estimates; further, comparisons with MUMmer4 show an asymptotic advantage with divergent genomes.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"93-102"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11412151/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142302596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing a computational representation of human physical activity and exercise using open ontology-based approach: a Tai Chi use case. 使用基于开放本体的方法开发人类体育活动和运动的计算表征:太极使用案例。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00012
Eloisa Nguyen, Rebecca Z Lin, Yang Gong, Cui Tao, Muhammad Tuan Amith

Many studies have examined the impact of exercise and other physical activities in influencing the health outcomes of individuals. These physical activities entail an intricate sequence and series of physical anatomy, physiological movement, movement of the anatomy, etc. To better understand how these components interact with one another and their downstream impact on health outcomes, there needs to be an information model that conceptualizes all entities involved. In this study, we introduced our early development of an ontology model to computationally describe human physical activities and the various entities that compose each activity. We developed an open-sourced biomedical ontology called the Kinetic Human Movement Ontology that reused OBO Foundry terminologies and encoded in OWL2. We applied this ontology in modeling and linking a specific Tai Chi movement. The contribution of this work could enable modeling of information relating to human physical activity, like exercise, and lead towards information standardization of human movement for analysis. Future work will include expanding our ontology to include more expressive information and completely modeling entire sets of movement from human physical activity.

许多研究都探讨了运动和其他体育活动对个人健康结果的影响。这些体能活动包含一系列错综复杂的物理解剖、生理运动、解剖运动等。为了更好地理解这些组成部分之间如何相互作用以及它们对健康结果的下游影响,需要有一个信息模型来概念化所涉及的所有实体。在本研究中,我们介绍了我们早期开发的本体模型,该模型用于计算描述人类的身体活动以及构成每项活动的各种实体。我们开发了一个开源的生物医学本体,名为 "人体运动本体"(Kinetic Human Movement Ontology),该本体重复使用了 OBO Foundry 术语,并用 OWL2 进行了编码。我们将该本体应用于特定太极运动的建模和链接。这项工作的贡献在于能够对与人类身体活动(如运动)相关的信息进行建模,并实现人类运动分析的信息标准化。未来的工作将包括扩展我们的本体,以包含更具表现力的信息,并对人类体育活动的整套动作进行完全建模。
{"title":"Developing a computational representation of human physical activity and exercise using open ontology-based approach: a Tai Chi use case.","authors":"Eloisa Nguyen, Rebecca Z Lin, Yang Gong, Cui Tao, Muhammad Tuan Amith","doi":"10.1109/ichi61247.2024.00012","DOIUrl":"10.1109/ichi61247.2024.00012","url":null,"abstract":"<p><p>Many studies have examined the impact of exercise and other physical activities in influencing the health outcomes of individuals. These physical activities entail an intricate sequence and series of physical anatomy, physiological movement, movement of the anatomy, etc. To better understand how these components interact with one another and their downstream impact on health outcomes, there needs to be an information model that conceptualizes all entities involved. In this study, we introduced our early development of an ontology model to computationally describe human physical activities and the various entities that compose each activity. We developed an open-sourced biomedical ontology called the Kinetic Human Movement Ontology that reused OBO Foundry terminologies and encoded in OWL2. We applied this ontology in modeling and linking a specific Tai Chi movement. The contribution of this work could enable modeling of information relating to human physical activity, like exercise, and lead towards information standardization of human movement for analysis. Future work will include expanding our ontology to include more expressive information and completely modeling entire sets of movement from human physical activity.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"31-39"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11503552/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing Social Factors to Enhance Suicide Prevention Across Population Groups. 分析社会因素,加强不同人群的自杀预防。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00032
Richard Li Xu, Song Wang, Zewei Wang, Yuhan Zhang, Yunyu Xiao, Jyotishman Pathak, David Hodge, Yan Leng, S Craig Watkins, Ying Ding, Yifan Peng

Social factors like family background, education level, financial status, and stress can impact public health outcomes, such as suicidal ideation. However, the analysis of social factors for suicide prevention has been limited by the lack of up-to-date suicide reporting data, variations in reporting practices, and small sample sizes. In this study, we analyzed 172,629 suicide incidents from 2014 to 2020 utilizing the National Violent Death Reporting System Restricted Access Database (NVDRS-RAD). Logistic regression models were developed to examine the relationships between demographics and suicide-related circumstances. Trends over time were assessed, and Latent Dirichlet Allocation (LDA) was used to identify common suicide-related social factors. Mental health, interpersonal relationships, mental health treatment and disclosure, and school/work-related stressors were identified as the main themes of suicide-related social factors. This study also identified systemic disparities across various population groups, particularly concerning Black individuals, young people aged under 24, healthcare practitioners, and those with limited education backgrounds, which shed light on potential directions for demographic-specific suicidal interventions.

家庭背景、教育水平、经济状况和压力等社会因素会对自杀意念等公共卫生结果产生影响。然而,由于缺乏最新的自杀报告数据、报告方法的差异以及样本量较小,对预防自杀的社会因素的分析一直受到限制。在本研究中,我们利用国家暴力死亡报告系统限制访问数据库(NVDRS-RAD)分析了 2014 年至 2020 年期间的 172629 起自杀事件。我们建立了逻辑回归模型来研究人口统计学与自杀相关情况之间的关系。对随时间变化的趋势进行了评估,并使用 Latent Dirichlet Allocation (LDA) 来识别常见的自杀相关社会因素。心理健康、人际关系、心理健康治疗和披露以及与学校/工作相关的压力因素被确定为与自杀相关的社会因素的主要主题。这项研究还发现了不同人群中存在的系统性差异,尤其是黑人、24 岁以下的年轻人、医护人员和教育背景有限的人群,这为针对特定人群的自杀干预措施提供了潜在的方向。
{"title":"Analyzing Social Factors to Enhance Suicide Prevention Across Population Groups.","authors":"Richard Li Xu, Song Wang, Zewei Wang, Yuhan Zhang, Yunyu Xiao, Jyotishman Pathak, David Hodge, Yan Leng, S Craig Watkins, Ying Ding, Yifan Peng","doi":"10.1109/ichi61247.2024.00032","DOIUrl":"10.1109/ichi61247.2024.00032","url":null,"abstract":"<p><p>Social factors like family background, education level, financial status, and stress can impact public health outcomes, such as suicidal ideation. However, the analysis of social factors for suicide prevention has been limited by the lack of up-to-date suicide reporting data, variations in reporting practices, and small sample sizes. In this study, we analyzed 172,629 suicide incidents from 2014 to 2020 utilizing the National Violent Death Reporting System Restricted Access Database (NVDRS-RAD). Logistic regression models were developed to examine the relationships between demographics and suicide-related circumstances. Trends over time were assessed, and Latent Dirichlet Allocation (LDA) was used to identify common suicide-related social factors. Mental health, interpersonal relationships, mental health treatment and disclosure, and school/work-related stressors were identified as the main themes of suicide-related social factors. This study also identified systemic disparities across various population groups, particularly concerning Black individuals, young people aged under 24, healthcare practitioners, and those with limited education backgrounds, which shed light on potential directions for demographic-specific suicidal interventions.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"189-199"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11450796/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142382637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Generative Models in Medical Imaging. 评估医学成像中的生成模型。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00084
Liyue Fan, Ashley Bang, Luca Bonomi

Data synthesis can address important data availability challenges in biomedical informatics. Quantitative evaluation of generative models may help understand their applications to synthesizing biomedical data. This poster paper examines state-of-the-art generative models used in medical imaging, such as StyleGAN and DDPM models, and evaluates their performance in learning data manifolds and in the visible features of generated samples. Results show that existing generative models have much to improve based on the studied measures.

数据合成可以解决生物医学信息学中重要的数据可用性挑战。对生成模型进行定量评估有助于了解它们在生物医学数据合成中的应用。这篇海报论文研究了医学成像中使用的最先进的生成模型,如 StyleGAN 和 DDPM 模型,并评估了它们在学习数据流形和生成样本的可见特征方面的性能。结果表明,根据所研究的指标,现有的生成模型还有很多需要改进的地方。
{"title":"Evaluating Generative Models in Medical Imaging.","authors":"Liyue Fan, Ashley Bang, Luca Bonomi","doi":"10.1109/ichi61247.2024.00084","DOIUrl":"10.1109/ichi61247.2024.00084","url":null,"abstract":"<p><p>Data synthesis can address important data availability challenges in biomedical informatics. Quantitative evaluation of generative models may help understand their applications to synthesizing biomedical data. This poster paper examines state-of-the-art generative models used in medical imaging, such as StyleGAN and DDPM models, and evaluates their performance in learning data manifolds and in the visible features of generated samples. Results show that existing generative models have much to improve based on the studied measures.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"553-555"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11508590/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fine-grained Patient Similarity Measuring using Contrastive Graph Similarity Networks.
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00009
Yuxi Liu, Zhenhao Zhang, Shaowen Qin, Flora D Salim, Jiang Bian, Antonio Jimeno Yepes

Predictive analytics using Electronic Health Records (EHRs) have become an active research area in recent years, especially with the development of deep learning techniques. A popular EHR data analysis paradigm in deep learning is patient representation learning, which aims to learn a condensed mathematical representation of individual patients. However, EHR data are often inherently irregular, i.e., data entries were captured at different times as well as with different contents due to the individualized needs of each patient. Most of the work focused on the provision of deep neural networks with attention mechanisms that generate complete patient representations that can be readily used for downstream prediction tasks. However, such approaches fail to take patient similarity into account, which is generally used in clinical reasoning scenarios. This study presents a new Contrastive Graph Similarity Network for similarity calculation among patients in large EHR datasets. Particularly, we apply graph-based similarity analysis that explicitly extracts the clinical characteristics of each patient and aggregates the information of similar patients to generate rich patient representations. Experimental results on real-world EHR databases demonstrate the effectiveness and superiority of our method for the task of vital signs imputation and ICU patient deterioration prediction.

{"title":"Fine-grained Patient Similarity Measuring using Contrastive Graph Similarity Networks.","authors":"Yuxi Liu, Zhenhao Zhang, Shaowen Qin, Flora D Salim, Jiang Bian, Antonio Jimeno Yepes","doi":"10.1109/ichi61247.2024.00009","DOIUrl":"https://doi.org/10.1109/ichi61247.2024.00009","url":null,"abstract":"<p><p>Predictive analytics using Electronic Health Records (EHRs) have become an active research area in recent years, especially with the development of deep learning techniques. A popular EHR data analysis paradigm in deep learning is patient representation learning, which aims to learn a condensed mathematical representation of individual patients. However, EHR data are often inherently irregular, i.e., data entries were captured at different times as well as with different contents due to the individualized needs of each patient. Most of the work focused on the provision of deep neural networks with attention mechanisms that generate complete patient representations that can be readily used for downstream prediction tasks. However, such approaches fail to take patient similarity into account, which is generally used in clinical reasoning scenarios. This study presents a new Contrastive Graph Similarity Network for similarity calculation among patients in large EHR datasets. Particularly, we apply graph-based similarity analysis that explicitly extracts the clinical characteristics of each patient and aggregates the information of similar patients to generate rich patient representations. Experimental results on real-world EHR databases demonstrate the effectiveness and superiority of our method for the task of vital signs imputation and ICU patient deterioration prediction.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"1-10"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11654828/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142857143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitigating Membership Inference in Deep Survival Analyses with Differential Privacy. 利用差异隐私减轻深度生存分析中的成员推断。
Pub Date : 2023-06-01 Epub Date: 2023-12-11 DOI: 10.1109/ichi57859.2023.00022
Liyue Fan, Luca Bonomi

Deep neural networks have been increasingly integrated in healthcare applications to enable accurate predicative analyses. Sharing trained deep models not only facilitates knowledge integration in collaborative research efforts but also enables equitable access to computational intelligence. However, recent studies have shown that an adversary may leverage a shared model to learn the participation of a target individual in the training set. In this work, we investigate privacy-protecting model sharing for survival studies. Specifically, we pose three research questions. (1) Do deep survival models leak membership information? (2) How effective is differential privacy in defending against membership inference in deep survival analyses? (3) Are there other effects of differential privacy on deep survival analyses? Our study assesses the membership leakage in emerging deep survival models and develops differentially private training procedures to provide rigorous privacy protection. The experimental results show that deep survival models leak membership information and our approach effectively reduces membership inference risks. The results also show that differential privacy introduces a limited performance loss, and may improve the model robustness in the presence of noisy data, compared to non-private models.

深度神经网络已越来越多地集成到医疗保健应用中,以实现准确的预测分析。共享训练有素的深度模型不仅能促进合作研究工作中的知识整合,还能实现对计算智能的公平获取。然而,最近的研究表明,对手可能会利用共享模型来了解目标个体在训练集中的参与情况。在这项工作中,我们研究了用于生存研究的隐私保护模型共享。具体来说,我们提出了三个研究问题。(1) 深度生存模型会泄露成员信息吗?(2) 在深度生存分析中,差异隐私对防御成员推断的效果如何?(3) 差异隐私对深度生存分析是否有其他影响?我们的研究评估了新兴深度生存模型中的成员信息泄露,并开发了差异化隐私训练程序,以提供严格的隐私保护。实验结果表明,深度生存模型会泄露成员信息,而我们的方法能有效降低成员推断风险。实验结果还表明,与非隐私模型相比,差异化隐私会带来有限的性能损失,并可能提高模型在高噪声数据存在时的鲁棒性。
{"title":"Mitigating Membership Inference in Deep Survival Analyses with Differential Privacy.","authors":"Liyue Fan, Luca Bonomi","doi":"10.1109/ichi57859.2023.00022","DOIUrl":"10.1109/ichi57859.2023.00022","url":null,"abstract":"<p><p>Deep neural networks have been increasingly integrated in healthcare applications to enable accurate predicative analyses. Sharing trained deep models not only facilitates knowledge integration in collaborative research efforts but also enables equitable access to computational intelligence. However, recent studies have shown that an adversary may leverage a shared model to learn the participation of a target individual in the training set. In this work, we investigate privacy-protecting model sharing for survival studies. Specifically, we pose three research questions. (1) Do deep survival models leak membership information? (2) How effective is differential privacy in defending against membership inference in deep survival analyses? (3) Are there other effects of differential privacy on deep survival analyses? Our study assesses the membership leakage in emerging deep survival models and develops differentially private training procedures to provide rigorous privacy protection. The experimental results show that deep survival models leak membership information and our approach effectively reduces membership inference risks. The results also show that differential privacy introduces a limited performance loss, and may improve the model robustness in the presence of noisy data, compared to non-private models.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2023 ","pages":"81-90"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10751041/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139049861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An LSTM-based Gesture-to-Speech Recognition System. 基于 LSTM 的手势语音识别系统
Pub Date : 2023-06-01 Epub Date: 2023-12-11 DOI: 10.1109/ichi57859.2023.00062
Riyad Bin Rafiq, Syed Araib Karim, Mark V Albert

Fast and flexible communication options are limited for speech-impaired people. Hand gestures coupled with fast, generated speech can enable a more natural social dynamic for those individuals - particularly individuals without the fine motor skills to type on a keyboard or tablet reliably. We created a mobile phone application prototype that generates audible responses associated with trained hand movements and collects and organizes the accelerometer data for rapid training to allow tailored models for individuals who may not be able to perform standard movements such as sign language. Six participants performed 11 distinct gestures to produce the dataset. A mobile application was developed that integrated a bidirectional LSTM network architecture which was trained from this data. After evaluation using nested subject-wise cross-validation, our integrated bidirectional LSTM model demonstrates an overall recall of 91.8% in recognition of these pre-selected 11 hand gestures, with recall at 95.8% when two commonly confused gestures were not assessed. This prototype is a step in creating a mobile phone system capable of capturing new gestures and developing tailored gesture recognition models for individuals in speech-impaired populations. Further refinement of this prototype can enable fast and efficient communication with the goal of further improving social interaction for individuals unable to speak.

对于有语言障碍的人来说,快速灵活的交流方式非常有限。手势加上快速生成的语音,可以为这些人提供更自然的社交动态,尤其是没有精细运动技能在键盘或平板电脑上打字的人。我们创建了一个手机应用原型,它能生成与训练有素的手部动作相关的声音反应,并收集和整理加速度计数据以进行快速训练,从而为那些可能无法完成手语等标准动作的人提供量身定制的模型。六名参与者做出了 11 种不同的手势,从而产生了数据集。开发的移动应用程序集成了双向 LSTM 网络架构,该架构根据这些数据进行了训练。在使用嵌套主体交叉验证进行评估后,我们的集成双向 LSTM 模型在识别预选的 11 种手势方面的总体召回率为 91.8%,在不评估两种常见混淆手势的情况下,召回率为 95.8%。这个原型是创建能够捕捉新手势的手机系统和为语言障碍人群开发定制手势识别模型的一个步骤。对这一原型的进一步改进可以实现快速高效的交流,从而进一步改善无法说话人群的社交互动。
{"title":"An LSTM-based Gesture-to-Speech Recognition System.","authors":"Riyad Bin Rafiq, Syed Araib Karim, Mark V Albert","doi":"10.1109/ichi57859.2023.00062","DOIUrl":"10.1109/ichi57859.2023.00062","url":null,"abstract":"<p><p>Fast and flexible communication options are limited for speech-impaired people. Hand gestures coupled with fast, generated speech can enable a more natural social dynamic for those individuals - particularly individuals without the fine motor skills to type on a keyboard or tablet reliably. We created a mobile phone application prototype that generates audible responses associated with trained hand movements and collects and organizes the accelerometer data for rapid training to allow tailored models for individuals who may not be able to perform standard movements such as sign language. Six participants performed 11 distinct gestures to produce the dataset. A mobile application was developed that integrated a bidirectional LSTM network architecture which was trained from this data. After evaluation using nested subject-wise cross-validation, our integrated bidirectional LSTM model demonstrates an overall recall of 91.8% in recognition of these pre-selected 11 hand gestures, with recall at 95.8% when two commonly confused gestures were not assessed. This prototype is a step in creating a mobile phone system capable of capturing new gestures and developing tailored gesture recognition models for individuals in speech-impaired populations. Further refinement of this prototype can enable fast and efficient communication with the goal of further improving social interaction for individuals unable to speak.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2023 ","pages":"430-438"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10894657/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139974844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking Transformer-Based Models for Identifying Social Determinants of Health in Clinical Notes. 在临床笔记中识别健康的社会决定因素的基于变压器的模型基准。
Pub Date : 2023-06-01 Epub Date: 2023-12-11 DOI: 10.1109/ichi57859.2023.00102
Xiaoyu Wang, Dipankar Gupta, Michael Killian, Zhe He

Electronic health records (EHR) have been widely used in building machine learning models for health outcomes prediction. However, many EHR-based models are inherently biased due to lack of risk factors on social determinants of health (SDoH), which are responsible for up to 40% preventive deaths. As SDoH information is often captured in clinical notes, recent efforts have been made to extract such information from notes with natural language processing and append it to other structured data. In this work, we benchmark 7 pre-trained transformer-based models, including BERT, ALBERT, BioBERT, BioClinicalBERT, RoBERTa, ELECTRA, and RoBERTa-MIMIC-Trial, for recognizing SDoH terms using a previously annotated corpus of MIMIC-III clinical notes. Our study shows that BioClinicalBERT model performs best on F-1 scores (0.911, 0.923) under both strict and relaxed criteria. This work shows the promise of using transformer-based models for recognizing SDoH information from clinical notes.

电子健康记录(EHR)已被广泛用于建立健康结果预测的机器学习模型。然而,由于缺乏社会健康决定因素(SDoH)方面的风险因素,许多基于 EHR 的模型本身就存在偏差,而社会健康决定因素是造成高达 40% 预防性死亡的原因。由于 SDoH 信息通常记录在临床病历中,因此最近人们努力通过自然语言处理从病历中提取此类信息,并将其附加到其他结构化数据中。在这项工作中,我们使用先前注释的 MIMIC-III 临床笔记语料库,对 7 个基于转换器的预训练模型(包括 BERT、ALBERT、BioBERT、BioClinicalBERT、RoBERTa、ELECTRA 和 RoBERTa-MIMIC-Trial)进行了基准测试,以识别 SDoH 术语。我们的研究表明,在严格和宽松标准下,BioClinicalBERT 模型在 F-1 分数(0.911,0.923)上表现最佳。这项工作表明,使用基于转换器的模型识别临床笔记中的 SDoH 信息大有可为。
{"title":"Benchmarking Transformer-Based Models for Identifying Social Determinants of Health in Clinical Notes.","authors":"Xiaoyu Wang, Dipankar Gupta, Michael Killian, Zhe He","doi":"10.1109/ichi57859.2023.00102","DOIUrl":"10.1109/ichi57859.2023.00102","url":null,"abstract":"<p><p>Electronic health records (EHR) have been widely used in building machine learning models for health outcomes prediction. However, many EHR-based models are inherently biased due to lack of risk factors on social determinants of health (SDoH), which are responsible for up to 40% preventive deaths. As SDoH information is often captured in clinical notes, recent efforts have been made to extract such information from notes with natural language processing and append it to other structured data. In this work, we benchmark 7 pre-trained transformer-based models, including BERT, ALBERT, BioBERT, BioClinicalBERT, RoBERTa, ELECTRA, and RoBERTa-MIMIC-Trial, for recognizing SDoH terms using a previously annotated corpus of MIMIC-III clinical notes. Our study shows that BioClinicalBERT model performs best on F-1 scores (0.911, 0.923) under both strict and relaxed criteria. This work shows the promise of using transformer-based models for recognizing SDoH information from clinical notes.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2023 ","pages":"570-574"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10795706/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139492901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1