首页 > 最新文献

AMIA ... Annual Symposium proceedings. AMIA Symposium最新文献

英文 中文
A Computationally-guided Qualitative Analysis to Understand User Experiences with Different Types of Mobile Personal Health Records. 以计算为导向的定性分析来了解不同类型移动个人健康记录的用户体验。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Zainab A Balogun, Pronob K Barman, Bianka K Onwumbiko, Tera L Reynolds

Mobile personal health records (mPHR) are smartphone apps that grant patients portable and continuous access to their medical records, thereby increasing the potential for patients to play an active role in managing their health. An extensive body of literature has focused on understanding user experiences with web-based tethered PHRs (i.e., patient portals) offered by healthcare organizations. However, patients' opinions of smartphone-based PHRs have received less attention. To address this gap, we used a computationally-guided qualitative analysis approach to analyze user reviews of six tethered and four interconnected mPHR apps available on both Google Play and Apple app stores. This approach resulted in identifying dimensions of user experiences related to usability, usefulness, and important features to users. Our findings reveal many similarities in user experiences for HCO-tethered and HCO-independent interconnected PHRs. However, there are some differences in user experiences between the types of PHRs and the different devices and platforms.

移动个人健康记录(mPHR)是智能手机应用程序,允许患者便携式和连续访问他们的医疗记录,从而增加患者在管理自己的健康方面发挥积极作用的潜力。大量文献关注于理解医疗保健组织提供的基于web的捆绑phrr(即患者门户)的用户体验。然而,患者对基于智能手机的phrr的意见却很少受到关注。为了解决这一差距,我们使用了一种计算导向的定性分析方法来分析b谷歌Play和Apple应用商店中6款捆绑和4款相互连接的mPHR应用的用户评论。这种方法确定了与可用性、有用性和重要功能相关的用户体验维度。我们的研究结果揭示了HCO-tethered和HCO-independent互连phrr的用户体验有许多相似之处。然而,不同类型的phrr以及不同的设备和平台在用户体验上存在一些差异。
{"title":"A Computationally-guided Qualitative Analysis to Understand User Experiences with Different Types of Mobile Personal Health Records.","authors":"Zainab A Balogun, Pronob K Barman, Bianka K Onwumbiko, Tera L Reynolds","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Mobile personal health records (mPHR) are smartphone apps that grant patients portable and continuous access to their medical records, thereby increasing the potential for patients to play an active role in managing their health. An extensive body of literature has focused on understanding user experiences with web-based tethered PHRs (i.e., patient portals) offered by healthcare organizations. However, patients' opinions of smartphone-based PHRs have received less attention. To address this gap, we used a computationally-guided qualitative analysis approach to analyze user reviews of six tethered and four interconnected mPHR apps available on both Google Play and Apple app stores. This approach resulted in identifying dimensions of user experiences related to usability, usefulness, and important features to users. Our findings reveal many similarities in user experiences for HCO-tethered and HCO-independent interconnected PHRs. However, there are some differences in user experiences between the types of PHRs and the different devices and platforms.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"162-171"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099338/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-Task Learning Approach for Segmentation of Breast Arterial Calcifications in Screening Mammograms. 乳房x光筛查中乳腺动脉钙化分割的多任务学习方法。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Aisha Urooj, Theo Dapamede, Bhavika Patel, Chadi Ayoub, Reza Arsanjani, William Charles O'Neill, Hari Trivedi, Imon Banerjee

Screening mammogram is a standard and cost-efficient imaging procedure to measure breast cancer risk among 45+ year old women. Quantifying breast arterial calcification (BAC) from screening mammograms is a non-invasive and cost-efficient approach to assess the future risk of adverse cardiovascular events among women, such as heart attack and stroke. However, segmentation of breast arterial calcification is an involved task and poses several technical challenges such as extremely small BAC finding, low breast arteries to breast area ratio in the mammogram images, tissue features such as breast folds and heterogeneous density, have very similar imaging appearance. In this work, we aim to address the shortcomings of existing SOTA methods, e.g., SCUNet, and analyze the comparative performance. Given the fact that we will not be able to simply resize mammogram to preserve the microscopic BAC details, we adopted a patch-based methodology for segmentation using the original resolution which may hinder the model understanding of whole mammogram. We propose a multi-task learning approach for patch-based BAC segmentation by adding an auxiliary task of patch position prediction which forces the model to learn breast anatomy to comprehend the locations where BAC will not occur, such as breast boundary. The proposed method achieves state-of-the-art performance compared to the baselines. To demonstrate the utility, we also validate our method on external data and provide survival analysis for adverse cardiac events based on difference in BAC score and provide a comparison with coronary calcium score (CAC).

乳房x光筛查是衡量45岁以上女性患乳腺癌风险的一种标准且经济有效的成像程序。通过筛查乳房x线照片量化乳腺动脉钙化(BAC)是一种无创且经济有效的方法,可评估女性未来发生不良心血管事件(如心脏病发作和中风)的风险。然而,乳腺动脉钙化的分割是一项复杂的任务,并提出了一些技术挑战,如极小的BAC发现,乳房x线照片中乳腺动脉与乳房面积之比低,乳腺褶皱和非均匀密度等组织特征具有非常相似的成像外观。在这项工作中,我们的目标是解决现有SOTA方法的缺点,例如SCUNet,并分析比较性能。考虑到我们无法简单地调整乳房x光片的大小以保留微观BAC细节,我们采用了基于补丁的方法,使用原始分辨率进行分割,这可能会阻碍对整个乳房x光片的模型理解。我们提出了一种基于补片的BAC分割的多任务学习方法,通过添加补片位置预测的辅助任务,迫使模型学习乳房解剖以理解不会发生BAC的位置,如乳房边界。与基线相比,所提出的方法实现了最先进的性能。为了证明其实用性,我们还在外部数据上验证了我们的方法,并提供了基于BAC评分差异的不良心脏事件的生存分析,并提供了与冠状动脉钙评分(CAC)的比较。
{"title":"A Multi-Task Learning Approach for Segmentation of Breast Arterial Calcifications in Screening Mammograms.","authors":"Aisha Urooj, Theo Dapamede, Bhavika Patel, Chadi Ayoub, Reza Arsanjani, William Charles O'Neill, Hari Trivedi, Imon Banerjee","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Screening mammogram is a standard and cost-efficient imaging procedure to measure breast cancer risk among 45+ year old women. Quantifying breast arterial calcification (BAC) from screening mammograms is a non-invasive and cost-efficient approach to assess the future risk of adverse cardiovascular events among women, such as heart attack and stroke. However, segmentation of breast arterial calcification is an involved task and poses several technical challenges such as extremely small BAC finding, low breast arteries to breast area ratio in the mammogram images, tissue features such as breast folds and heterogeneous density, have very similar imaging appearance. In this work, we aim to address the shortcomings of existing SOTA methods, e.g., SCUNet, and analyze the comparative performance. Given the fact that we will not be able to simply resize mammogram to preserve the microscopic BAC details, we adopted a patch-based methodology for segmentation using the original resolution which may hinder the model understanding of whole mammogram. We propose a multi-task learning approach for patch-based BAC segmentation by adding an auxiliary task of patch position prediction which forces the model to learn breast anatomy to comprehend the locations where BAC will not occur, such as breast boundary. The proposed method achieves state-of-the-art performance compared to the baselines. To demonstrate the utility, we also validate our method on external data and provide survival analysis for adverse cardiac events based on difference in BAC score and provide a comparison with coronary calcium score (CAC).</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1139-1148"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099439/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial Intelligence-assisted Biomedical Literature Knowledge Synthesis to Support Decision-making in Precision Oncology. 人工智能辅助生物医学文献知识综合支持精准肿瘤学决策。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Ting He, Kory Kreimeyer, Mimi Najjar, Jonathan Spiker, Maria Fatteh, Valsamo Anagnostou, Taxiarchis Botsis

The delivery of effective targeted therapies requires comprehensive analyses of the molecular profiling of tumors and matching with clinical phenotypes in the context of existing knowledge described in biomedical literature, registries, and knowledge bases. We evaluated the performance of natural language processing (NLP) approaches in supporting knowledge retrieval and synthesis from the biomedical literature. We tested PubTator 3.0, Bidirectional Encoder Representations from Transformers (BERT), and Large Language Models (LLMs) and evaluated their ability to support named entity recognition (NER) and relation extraction (RE) from biomedical texts. PubTator 3.0 and the BioBERT model performed best in the NER task (best F1-score 0.93 and 0.89, respectively), while BioBERT outperformed all other solutions in the RE task (best F1-score 0.79) and a specific use case it was applied to by recognizing nearly all entity mentions and most of the relations. Our findings support the use of AI-assisted approaches in facilitating precision oncology decision-making.

提供有效的靶向治疗需要在生物医学文献、注册表和知识库中描述的现有知识背景下,对肿瘤分子谱进行全面分析,并与临床表型相匹配。我们评估了自然语言处理(NLP)方法在支持生物医学文献知识检索和合成方面的性能。我们测试了PubTator 3.0、来自变形器的双向编码器表示(BERT)和大型语言模型(llm),并评估了它们支持生物医学文本的命名实体识别(NER)和关系提取(RE)的能力。PubTator 3.0和BioBERT模型在NER任务中表现最好(最佳f1得分分别为0.93和0.89),而BioBERT在RE任务中表现优于所有其他解决方案(最佳f1得分为0.79),并且通过识别几乎所有实体提及和大多数关系,它应用于一个特定的用例。我们的研究结果支持使用人工智能辅助方法来促进精确的肿瘤学决策。
{"title":"Artificial Intelligence-assisted Biomedical Literature Knowledge Synthesis to Support Decision-making in Precision Oncology.","authors":"Ting He, Kory Kreimeyer, Mimi Najjar, Jonathan Spiker, Maria Fatteh, Valsamo Anagnostou, Taxiarchis Botsis","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The delivery of effective targeted therapies requires comprehensive analyses of the molecular profiling of tumors and matching with clinical phenotypes in the context of existing knowledge described in biomedical literature, registries, and knowledge bases. We evaluated the performance of natural language processing (NLP) approaches in supporting knowledge retrieval and synthesis from the biomedical literature. We tested PubTator 3.0, Bidirectional Encoder Representations from Transformers (BERT), and Large Language Models (LLMs) and evaluated their ability to support named entity recognition (NER) and relation extraction (RE) from biomedical texts. PubTator 3.0 and the BioBERT model performed best in the NER task (best F1-score 0.93 and 0.89, respectively), while BioBERT outperformed all other solutions in the RE task (best F1-score 0.79) and a specific use case it was applied to by recognizing nearly all entity mentions and most of the relations. Our findings support the use of AI-assisted approaches in facilitating precision oncology decision-making.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"513-522"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099343/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic Graphical Models for Evaluating the Utility of Data-Driven ICD Code Categories in Pediatric Sepsis. 评估数据驱动的ICD代码类别在儿童败血症中的效用的概率图形模型。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Lourdes A Valdez, Edgar Javier Hernandez, O'Connor Matthews, Matthew Mulvey, Hillary Crandall, Karen Eilbeck

Electronic health records (EHRs) are information systems designed to collect and manage clinical data in order to support various clinical activities. They have emerged as valuable sources of data for outcomes research, offering vast repositories of patient information for analysis. Definitions for pediatric sepsis diagnosis are ambiguous, resulting in delayed diagnosis and treatment, highlighting the need for precise and efficient patient categorizing techniques. Nevertheless, the use of EHRs in research poses challenges. Although EHRs were originally created to document patient encounters, the medical coding was designed to satisfy billing requirements. As a result, EHR data may lack granularity, potentially leading to misclassification and incomplete representation of patient conditions. We compared data-driven ICD code categories to chart review using probabilistic graphical models (PGMs) due to their ability to handle uncertainty and incorporate prior knowledge. Overall, this paper demonstrates the potential of using PGMs to address these challenges and improve the analysis of ICD codes for sepsis outcomes research.

电子健康记录(EHRs)是一种信息系统,旨在收集和管理临床数据,以支持各种临床活动。它们已经成为结果研究的宝贵数据来源,为分析提供了大量的患者信息。儿童败血症诊断的定义不明确,导致诊断和治疗延迟,强调需要精确和有效的患者分类技术。然而,在研究中使用电子病历带来了挑战。虽然最初创建电子病历是为了记录患者就诊情况,但医疗编码的设计是为了满足计费需求。因此,EHR数据可能缺乏粒度,可能导致错误分类和对患者病情的不完整表示。我们将数据驱动的ICD代码类别与使用概率图形模型(PGMs)的图表审查进行了比较,因为它们具有处理不确定性和合并先验知识的能力。总的来说,本文展示了使用pgm来解决这些挑战并改进败血症结局研究中ICD代码分析的潜力。
{"title":"Probabilistic Graphical Models for Evaluating the Utility of Data-Driven ICD Code Categories in Pediatric Sepsis.","authors":"Lourdes A Valdez, Edgar Javier Hernandez, O'Connor Matthews, Matthew Mulvey, Hillary Crandall, Karen Eilbeck","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Electronic health records (EHRs) are information systems designed to collect and manage clinical data in order to support various clinical activities. They have emerged as valuable sources of data for outcomes research, offering vast repositories of patient information for analysis. Definitions for pediatric sepsis diagnosis are ambiguous, resulting in delayed diagnosis and treatment, highlighting the need for precise and efficient patient categorizing techniques. Nevertheless, the use of EHRs in research poses challenges. Although EHRs were originally created to document patient encounters, the medical coding was designed to satisfy billing requirements. As a result, EHR data may lack granularity, potentially leading to misclassification and incomplete representation of patient conditions. We compared data-driven ICD code categories to chart review using probabilistic graphical models (PGMs) due to their ability to handle uncertainty and incorporate prior knowledge. Overall, this paper demonstrates the potential of using PGMs to address these challenges and improve the analysis of ICD codes for sepsis outcomes research.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1149-1158"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099341/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning. 修剪路径到最优护理:识别系统次优医疗决策与逆强化学习。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Inko Bovenzi, Adi Carmel, Michael Hu, Rebecca Hurwitz, Fiona McBride, Leo Benac, José Roberto Tello Ayala, Finale Doshi-Velez

In aims to uncover insights into medical decision-making embedded within observational data from clinical settings, we present a novel application of Inverse Reinforcement Learning (IRL) that identifies suboptimal clinician actions based on the actions of their peers. This approach centers two stages of IRL with an intermediate step to prune trajectories displaying behavior that deviates significantly from the consensus. This enables us to effectively identify clinical priorities and values from ICU data containing both optimal and suboptimal clinician decisions. We observe that the benefits of removing suboptimal actions vary by disease and differentially impact certain demographic groups.

为了揭示对临床环境观察数据中嵌入的医疗决策的见解,我们提出了一种新的应用逆强化学习(IRL),该应用基于同行的行为识别次优临床医生的行为。该方法以IRL的两个阶段为中心,中间步骤是修剪显示明显偏离共识的行为轨迹。这使我们能够有效地从ICU数据中识别临床优先级和价值,包括最佳和次优的临床医生决策。我们观察到,取消次优行为的益处因疾病而异,对某些人口群体的影响也不同。
{"title":"Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning.","authors":"Inko Bovenzi, Adi Carmel, Michael Hu, Rebecca Hurwitz, Fiona McBride, Leo Benac, José Roberto Tello Ayala, Finale Doshi-Velez","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>In aims to uncover insights into medical decision-making embedded within observational data from clinical settings, we present a novel application of Inverse Reinforcement Learning (IRL) that identifies suboptimal clinician actions based on the actions of their peers. This approach centers two stages of IRL with an intermediate step to prune trajectories displaying behavior that deviates significantly from the consensus. This enables us to effectively identify clinical priorities and values from ICU data containing both optimal and suboptimal clinician decisions. We observe that the benefits of removing suboptimal actions vary by disease and differentially impact certain demographic groups.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"202-211"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099440/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OCTOPUS: Disk-based, Multiplatform, Mobile-friendly Metagenomics Classifier. 章鱼:基于磁盘,多平台,移动友好的宏基因组分类器。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Simone Marini, Alexander Barquero, Anisha Ashok Wadhwani, Jiang Bian, Jaime Ruiz, Christina Boucher, Mattia Prosperi

Portable genomic sequencers such as Oxford Nanopore's MinION enable real-time applications in clinical and environmental health. However, there is a bottleneck in the downstream analytics when bioinformatics pipelines are unavailable, e.g., when cloud processing is unreachable due to absence of Internet connection, or only low-end computing devices can be carried on site. Here we present a platform-friendly software for portable metagenomic analysis of Nanopore data, the Oligomer-based Classifier of Taxonomic Operational and Pan-genome Units via Singletons (OCTOPUS). OCTOPUS is written in Java, reimplements several features of the popular Kraken2 and KrakenUniq software, with original components for improving metagenomics classification on incomplete/sampled reference databases, making it ideal for running on smartphones or tablets. OCTOPUS obtains sensitivity and precision comparable to Kraken2, while dramatically decreasing (4- to 16-fold) the false positive rate, and yielding high correlation on real-word data. OCTOPUS is available along with customized databases at https://github.com/DataIntellSystLab/OCTOPUS and https://github.com/Ruiz-HCI-Lab/OctopusMobile.

便携式基因组测序仪,如牛津纳米孔公司的MinION,可以实时应用于临床和环境健康。然而,当生物信息学管道不可用时,下游分析存在瓶颈,例如,由于没有互联网连接而无法进行云处理,或者只能在现场进行低端计算设备。在这里,我们提出了一个平台友好的软件,用于便携式宏基因组分析的纳米孔数据,寡聚物为基础的分类操作和泛基因组单位分类器(章鱼)。OCTOPUS是用Java编写的,重新实现了流行的Kraken2和KrakenUniq软件的几个功能,具有原始组件,用于改进在不完整/采样参考数据库上的元基因组分类,使其非常适合在智能手机或平板电脑上运行。OCTOPUS获得了与Kraken2相当的灵敏度和精度,同时大大降低了误报率(4- 16倍),并对真实世界的数据产生了高相关性。OCTOPUS与定制数据库一起可在https://github.com/DataIntellSystLab/OCTOPUS和https://github.com/Ruiz-HCI-Lab/OctopusMobile上获得。
{"title":"OCTOPUS: Disk-based, Multiplatform, Mobile-friendly Metagenomics Classifier.","authors":"Simone Marini, Alexander Barquero, Anisha Ashok Wadhwani, Jiang Bian, Jaime Ruiz, Christina Boucher, Mattia Prosperi","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Portable genomic sequencers such as Oxford Nanopore's MinION enable real-time applications in clinical and environmental health. However, there is a bottleneck in the downstream analytics when bioinformatics pipelines are unavailable, e.g., when cloud processing is unreachable due to absence of Internet connection, or only low-end computing devices can be carried on site. Here we present a platform-friendly software for portable metagenomic analysis of Nanopore data, the Oligomer-based Classifier of Taxonomic Operational and Pan-genome Units via Singletons (OCTOPUS). OCTOPUS is written in Java, reimplements several features of the popular Kraken2 and KrakenUniq software, with original components for improving metagenomics classification on incomplete/sampled reference databases, making it ideal for running on smartphones or tablets. OCTOPUS obtains sensitivity and precision comparable to Kraken2, while dramatically decreasing (4- to 16-fold) the false positive rate, and yielding high correlation on real-word data. OCTOPUS is available along with customized databases at https://github.com/DataIntellSystLab/OCTOPUS and https://github.com/Ruiz-HCI-Lab/OctopusMobile.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"798-807"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099329/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated Multiple Imputation for Variables that Are Missing Not At Random in Distributed Electronic Health Records. 分布式电子健康记录中非随机缺失变量的联邦多重代入。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Yi Lian, Xiaoqian Jiang, Qi Long

Large electronic health records (EHR) have been widely implemented and are available for research activities. The magnitude of such databases often requires storage and computing infrastructure that are distributed at different sites. Restrictions on data-sharing due to privacy concerns have been another driving force behind the development of a large class of distributed and/or federated machine learning methods. While missing data problem is also present in distributed EHRs, albeit potentially more complex, distributed multiple imputation (MI) methods have not received as much attention. An important advantage of distributed MI, as well as distributed analysis, is that it allows researchers to borrow information across data sites, mitigating potential fairness issues for minority groups that do not have enough volume at certain sites. In this paper, we propose a communication-efficient and privacy-preserving distributed MI algorithms for variables that are missing not at random.

大型电子健康记录(EHR)已得到广泛实施,并可用于研究活动。这种数据库的规模往往需要分布在不同地点的存储和计算基础设施。由于隐私问题而对数据共享的限制是开发大量分布式和/或联合机器学习方法背后的另一个推动力。虽然分布式电子病历中也存在数据缺失问题,但分布式多重输入(MI)方法可能更复杂,但没有受到太多关注。分布式人工智能和分布式分析的一个重要优势是,它允许研究人员跨数据站点借用信息,减轻少数群体在某些站点没有足够容量的潜在公平问题。在本文中,我们提出了一种针对非随机缺失变量的高效通信和保护隐私的分布式MI算法。
{"title":"Federated Multiple Imputation for Variables that Are Missing Not At Random in Distributed Electronic Health Records.","authors":"Yi Lian, Xiaoqian Jiang, Qi Long","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Large electronic health records (EHR) have been widely implemented and are available for research activities. The magnitude of such databases often requires storage and computing infrastructure that are distributed at different sites. Restrictions on data-sharing due to privacy concerns have been another driving force behind the development of a large class of distributed and/or federated machine learning methods. While missing data problem is also present in distributed EHRs, albeit potentially more complex, distributed multiple imputation (MI) methods have not received as much attention. An important advantage of distributed MI, as well as distributed analysis, is that it allows researchers to borrow information across data sites, mitigating potential fairness issues for minority groups that do not have enough volume at certain sites. In this paper, we propose a communication-efficient and privacy-preserving distributed MI algorithms for variables that are missing not at random.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"703-712"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099382/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Visual Identification of Under-resourced Dermatological Diagnoses with Classifier-Steered Background Masking. 基于分类器导向背景掩蔽的资源不足皮肤病诊断的鲁棒视觉识别。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Miguel Dominguez, Julie Ryan Wolf, Paritosh Prasad, Wendemagegn Enbiale, Michael Gottlieb, Carl T Berdahl, Art Papier

Collecting images of rare dermatological diseases for machine learning detection applications is a costly, laborious task. It is difficult to collect enough images of these diagnoses to avoid the risk of low accuracy "in the wild". One of the sources of bias in these networks is irrelevant background pixel data. These pixels necessarily have no clinical significance, yet Deep Neural Networks will make weak correlations based on that information. To reduce their ability to do this, we introduce a masking augmentation algorithm, InfoMax-Cutout. It employs unsupervised Information Maximization losses to mask out background pixels. InfoMax-Cutout increased accuracy on classifying 319 diagnoses by 0.76%. These features generalized to an unseen diagnosis task (Fitzpatrick 17k), improving accuracy over a baseline by 43.3% and reducing Gini inequality by 20.9%. This approach of learning to separate out background pixels can increase accuracy in detecting diseases in Lower and Middle Income Countries.

为机器学习检测应用收集罕见皮肤病的图像是一项昂贵而费力的任务。很难收集到足够的这些诊断图像,以避免“在野外”出现低准确率的风险。这些网络中偏差的来源之一是不相关的背景像素数据。这些像素必然没有临床意义,但深度神经网络将根据这些信息建立弱相关性。为了降低它们这样做的能力,我们引入了一个掩蔽增强算法,InfoMax-Cutout。它采用无监督信息最大化损失来掩盖背景像素。InfoMax-Cutout对319种诊断的分类准确率提高了0.76%。这些特征推广到一个看不见的诊断任务(Fitzpatrick 17k),在基线上提高了43.3%的准确性,减少了20.9%的基尼不平等。这种学习分离背景像素的方法可以提高中低收入国家检测疾病的准确性。
{"title":"Robust Visual Identification of Under-resourced Dermatological Diagnoses with Classifier-Steered Background Masking.","authors":"Miguel Dominguez, Julie Ryan Wolf, Paritosh Prasad, Wendemagegn Enbiale, Michael Gottlieb, Carl T Berdahl, Art Papier","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Collecting images of rare dermatological diseases for machine learning detection applications is a costly, laborious task. It is difficult to collect enough images of these diagnoses to avoid the risk of low accuracy \"in the wild\". One of the sources of bias in these networks is irrelevant background pixel data. These pixels necessarily have no clinical significance, yet Deep Neural Networks will make weak correlations based on that information. To reduce their ability to do this, we introduce a masking augmentation algorithm, InfoMax-Cutout. It employs unsupervised Information Maximization losses to mask out background pixels. InfoMax-Cutout increased accuracy on classifying 319 diagnoses by 0.76%. These features generalized to an unseen diagnosis task (Fitzpatrick 17k), improving accuracy over a baseline by 43.3% and reducing Gini inequality by 20.9%. This approach of learning to separate out background pixels can increase accuracy in detecting diseases in Lower and Middle Income Countries.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"368-377"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099365/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLMs-based Few-Shot Disease Predictions using EHR: A Novel Approach Combining Predictive Agent Reasoning and Critical Agent Instruction. 基于llms的基于EHR的少针疾病预测:一种结合预测代理推理和关键代理指导的新方法。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Hejie Cui, Zhuocheng Shen, Jieyu Zhang, Hui Shao, Lianhui Qin, Joyce C Ho, Carl Yang

Electronic health records (EHRs) contain valuable patient data for health-related prediction tasks, such as disease prediction. Traditional approaches rely on supervised learning methods that require large labeled datasets, which can be expensive and challenging to obtain. In this study, we investigate the feasibility of applying Large Language Models (LLMs) to convert structured patient visit data (e.g., diagnoses, labs, prescriptions) into natural language narratives. We evaluate the zero-shot and few-shot performance of LLMs using various EHR-prediction-oriented prompting strategies. Furthermore, we propose a novel approach that utilizes LLM agents with different roles: a predictor agent that makes predictions and generates reasoning processes and a critic agent that analyzes incorrect predictions and provides guidance for improving the reasoning of the predictor agent. Our results demonstrate that with the proposed approach, LLMs can achieve decent few-shot performance compared to traditional supervised learning methods in EHR-based disease predictions, suggesting its potential for health-oriented applications.

电子健康记录(EHRs)包含有价值的患者数据,用于与健康相关的预测任务,如疾病预测。传统的方法依赖于需要大量标记数据集的监督学习方法,这可能是昂贵且具有挑战性的。在本研究中,我们探讨了应用大语言模型(LLMs)将结构化患者访问数据(如诊断、实验室、处方)转换为自然语言叙述的可行性。我们使用各种面向ehr预测的提示策略来评估llm的零射击和少射击性能。此外,我们提出了一种利用具有不同角色的LLM代理的新方法:一个预测代理进行预测并生成推理过程,一个批评代理分析不正确的预测并为改进预测代理的推理提供指导。我们的研究结果表明,在基于ehr的疾病预测中,与传统的监督学习方法相比,llm可以获得不错的少数射击性能,这表明它具有面向健康的应用潜力。
{"title":"LLMs-based Few-Shot Disease Predictions using EHR: A Novel Approach Combining Predictive Agent Reasoning and Critical Agent Instruction.","authors":"Hejie Cui, Zhuocheng Shen, Jieyu Zhang, Hui Shao, Lianhui Qin, Joyce C Ho, Carl Yang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Electronic health records (EHRs) contain valuable patient data for health-related prediction tasks, such as disease prediction. Traditional approaches rely on supervised learning methods that require large labeled datasets, which can be expensive and challenging to obtain. In this study, we investigate the feasibility of applying Large Language Models (LLMs) to convert structured patient visit data (e.g., diagnoses, labs, prescriptions) into natural language narratives. We evaluate the zero-shot and few-shot performance of LLMs using various EHR-prediction-oriented prompting strategies. Furthermore, we propose a novel approach that utilizes LLM agents with different roles: a predictor agent that makes predictions and generates reasoning processes and a critic agent that analyzes incorrect predictions and provides guidance for improving the reasoning of the predictor agent. Our results demonstrate that with the proposed approach, LLMs can achieve decent few-shot performance compared to traditional supervised learning methods in EHR-based disease predictions, suggesting its potential for health-oriented applications.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"319-328"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099430/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting Social Determinants of Health Extraction with Semantic Knowledge Augmented Large Language Model. 基于语义知识增强的大型语言模型促进健康提取的社会决定因素。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Lei Gong, Jaren Bresnick, Aidong Zhang, Cathy Wu, Kishlay Jha

Social determinants of health (SDoH) significantly impacts health outcomes and contributes to perpetuating health disparities across healthcare applications. However, automatic extraction of SDoH information from Electronic Health Records (EHRs) is challenging due to the unstructured nature of clinical narratives that contain SDoH related information. Recent advances in Large Language Models (LLMs) have shown great promise for automated SDoH extraction. However, their performance suffers for the imbalanced SDoH categories due to the data scarcity issues. To address this, we propose an innovative approach that augments LLMs with semantic knowledge obtained from the Unified Medical Language Systems (UMLS). This strategy enriches the feature representations of imbalanced SDoH classes, leading to accurate SDoH extraction. More specifically, our proposed data augmentation strategy generates semantically enriched clinical narratives at the LLM pre-finetuning stage. This approach enables the LLM to better adapt to the target data and leads to a good initialization for the finetuning stage. Through extensive experiments using publicly available MIMIC-SDoH data, the proposed approach demonstrates significant improvement in results for the SDoH extraction, especially for the imbalanced classes.

健康的社会决定因素(SDoH)显著影响健康结果,并有助于在医疗保健应用中持续存在健康差异。然而,由于包含SDoH相关信息的临床叙述的非结构化性质,从电子健康记录(EHRs)中自动提取SDoH信息是具有挑战性的。大型语言模型(llm)的最新进展显示了自动化SDoH提取的巨大前景。然而,由于数据稀缺性问题,它们的性能在不平衡的SDoH类别中受到影响。为了解决这个问题,我们提出了一种创新的方法,通过从统一医学语言系统(UMLS)获得的语义知识来增强法学硕士。该策略丰富了不平衡SDoH类的特征表示,实现了准确的SDoH提取。更具体地说,我们提出的数据增强策略在LLM预微调阶段生成语义丰富的临床叙述。这种方法使LLM能够更好地适应目标数据,并为调优阶段提供良好的初始化。通过使用公开可用的MIMIC-SDoH数据进行大量实验,所提出的方法在SDoH提取结果方面有了显着改善,特别是对于不平衡类。
{"title":"Boosting Social Determinants of Health Extraction with Semantic Knowledge Augmented Large Language Model.","authors":"Lei Gong, Jaren Bresnick, Aidong Zhang, Cathy Wu, Kishlay Jha","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Social determinants of health (SDoH) significantly impacts health outcomes and contributes to perpetuating health disparities across healthcare applications. However, automatic extraction of SDoH information from Electronic Health Records (EHRs) is challenging due to the unstructured nature of clinical narratives that contain SDoH related information. Recent advances in Large Language Models (LLMs) have shown great promise for automated SDoH extraction. However, their performance suffers for the imbalanced SDoH categories due to the data scarcity issues. To address this, we propose an innovative approach that augments LLMs with semantic knowledge obtained from the Unified Medical Language Systems (UMLS). This strategy enriches the feature representations of imbalanced SDoH classes, leading to accurate SDoH extraction. More specifically, our proposed data augmentation strategy generates semantically enriched clinical narratives at the LLM pre-finetuning stage. This approach enables the LLM to better adapt to the target data and leads to a good initialization for the finetuning stage. Through extensive experiments using publicly available MIMIC-SDoH data, the proposed approach demonstrates significant improvement in results for the SDoH extraction, especially for the imbalanced classes.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"453-462"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099417/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA ... Annual Symposium proceedings. AMIA Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1