Santosh Purja Pun, Oliver Obst, Jim Basilakis, Jeewani Anupama Ginige
Objectives: Mapping between clinical classification systems, such as versions of the International Classification of Diseases (ICD), is essential yet challenging. Manual mapping is labor-intensive and does not scale, while existing embedding-based automatic mapping methods, particularly those leveraging transformer-based pretrained encoders, face 2 persistent challenges: (1) linguistic variation and (2) varying levels of granularity in clinical conditions.
Materials and methods: We introduce an automatic mapping method that combines the representational power of pretrained encoders with the reasoning capability of large language models (LLMs). For each ICD code, we generate (1) hierarchy-augmented (HA) and (2) LLM-generated (LG) descriptions to capture rich semantic nuances, addressing linguistic variation. We further introduce a prompting framework (PR) that leverages LLM reasoning to handle granularity mismatches, including source-to-parent mappings.
Results: Chapterwise mappings were performed between ICD versions (ICD-9-CM↔ICD-10-CM and ICD-10-AM↔ICD-11) using multiple LLMs. The proposed approach consistently outperformed the baseline across all ICD pairs and chapters. For example, combining HA descriptions with Qwen3-8B-generated descriptions yielded an average top-1 accuracy improvement of 6.5% (0.065) across the mapping cases. A small-scale pilot study further indicated that HA+LG remains effective in more challenging one-to-many mappings.
Conclusions: Our findings demonstrate that integrating the representational power of pretrained encoders with LLM reasoning offers a robust, scalable strategy for automatic ICD mapping.
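The retrieval step underlying embedding-based mapping methods like this one can be sketched as nearest-neighbor search over code-description embeddings. The function name, toy vectors, and the idea of precomputed embeddings below are illustrative assumptions, not the authors' implementation:

```python
import numpy as np

def top1_map(source_vecs, target_vecs):
    """For each source-code embedding, return the index of the most
    similar target-code embedding by cosine similarity."""
    s = source_vecs / np.linalg.norm(source_vecs, axis=1, keepdims=True)
    t = target_vecs / np.linalg.norm(target_vecs, axis=1, keepdims=True)
    return (s @ t.T).argmax(axis=1)

# Toy embeddings: 2 source codes, 3 candidate target codes.
src = np.array([[1.0, 0.0], [0.0, 1.0]])
tgt = np.array([[0.9, 0.1], [0.1, 0.9], [0.5, 0.5]])
print(top1_map(src, tgt))  # → [0 1]
```

In the paper's setting, each code would contribute several embeddings (original, HA, and LG descriptions), with the LLM prompting stage applied on top of the retrieved candidates.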
"On embedding-based automatic mapping of clinical classification system: handling linguistic variations and granular inconsistencies." Journal of the American Medical Informatics Association (2026-01-28). doi:10.1093/jamia/ocag004
Mahanazuddin Syed, Muayad Hamidi, Manju Bikkanuri, Nicole Adele Dierschke, Haritha Vardhini Katragadda, Meredith Zozus, Antonio Lucio Teixeira
Objectives: To evaluate the performance of a locally deployed adaptation of TrialGPT, a large language model (LLM) system for identifying trial-eligible patients from unstructured electronic health record (EHR) data.
Materials and methods: TrialGPT was re-engineered for secure deployment at UT Health San Antonio using a locally hosted LLM. It was optimized for real-world data needs through a longitudinal patient-encounter-note hierarchy mirroring EHR documentation. Performance was evaluated in two stages: (1) benchmarking against an expert-adjudicated gold corpus (n = 149) and (2) comparative validation against manual screening (n = 55).
Results: Against the expert-adjudicated corpus, the system achieved 81.8% sensitivity, 97.8% specificity, and a positive predictive value of 75.0%. Compared with manual screening, it identified more than twice as many truly eligible patients (81.8% vs 36.4%) while preserving equivalent specificity.
Conclusion: The adapted TrialGPT framework operationalizes trial matching, translating EHR data into actionable screening intelligence for efficient, scalable clinical trial recruitment.
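The three reported metrics follow directly from a screening confusion matrix. A minimal sketch, with hypothetical counts chosen only to illustrate the formulas (they roughly reproduce the reported rates but are not the study's data):

```python
def screening_metrics(tp, fp, tn, fn):
    """Confusion-matrix summary for an eligibility screener."""
    return {
        "sensitivity": tp / (tp + fn),  # share of eligible patients found
        "specificity": tn / (tn + fp),  # ineligible correctly excluded
        "ppv": tp / (tp + fp),          # precision among flagged patients
    }

# Hypothetical counts, for illustration only.
m = screening_metrics(tp=9, fp=3, tn=135, fn=2)
print(round(m["sensitivity"], 3), round(m["ppv"], 3))  # → 0.818 0.75
```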
"Translating evidence into practice: adapting TrialGPT for real-world clinical trial eligibility screening." Journal of the American Medical Informatics Association (2026-01-27). doi:10.1093/jamia/ocag006
Huixue Zhou, Lisa Chow, Lisa Harnack, Satchidananda Panda, Emily N C Manoogian, Mingchen Li, Yongkang Xiao, Rui Zhang
Objectives: This study explores the use of advanced natural language processing (NLP) techniques to enhance food classification and dietary analysis using raw text input from a diet tracking app.
Materials and methods: The study was conducted in 3 stages: data collection, framework development, and application. Data were collected from a 12-week randomized controlled trial (RCT: NCT04259632), in which participants recorded their meals in free-text format using the myCircadianClock app. Only de-identified data were used. We developed nutrition-focused retrieval-augmented generation (NutriRAG), an NLP framework that uses a retrieval-augmented generation approach to enhance food classification from free-text inputs. The framework retrieves relevant examples from a curated database and then leverages large language models, such as GPT-4, to classify user-recorded food items into predefined categories without fine-tuning. NutriRAG was then applied to data from the RCT, which included 77 adults with obesity recruited from the Twin Cities metro area and randomized into 3 intervention groups: time-restricted eating (TRE, 8-hour eating window), caloric restriction (CR, 15% reduction), and unrestricted eating.
Results: NutriRAG significantly enhanced classification accuracy and helped to analyze dietary habits, with the retrieval-augmented GPT-4 model achieving a micro-F1 score of 82.24. Both interventions showed dietary alterations: CR participants ate fewer snacks and sugary foods, while TRE participants reduced nighttime eating.
Conclusion: By using artificial intelligence, NutriRAG marks a substantial advance in food classification and dietary analysis for nutritional assessment. The findings highlight NLP's potential to personalize nutrition and manage diet-related health issues, suggesting further research to expand these models for wider use.
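The micro-F1 metric reported above (82.24 on a 0-100 scale) pools true positives, false positives, and false negatives across all food categories before computing precision and recall. A minimal sketch for multi-label food classification; the label sets are invented examples, not study data:

```python
def micro_f1(gold, pred):
    """Micro-averaged F1 over multi-label predictions, where each item
    maps to a set of category labels."""
    tp = sum(len(g & p) for g, p in zip(gold, pred))
    fp = sum(len(p - g) for g, p in zip(gold, pred))
    fn = sum(len(g - p) for g, p in zip(gold, pred))
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

gold = [{"snack"}, {"fruit", "beverage"}]
pred = [{"snack"}, {"fruit"}]
print(round(micro_f1(gold, pred), 3))  # → 0.8
```

Micro-averaging weights each label instance equally, so frequent food categories dominate the score; macro-averaging would instead weight each category equally.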
"NutriRAG: unleashing the power of large language models for food identification and classification through retrieval methods." Journal of the American Medical Informatics Association (2026-01-23). doi:10.1093/jamia/ocag003
Raja Mazumder, Jonathon Keeney, Luke Johnson, Lori Krammer, Patrick McNeely, Jorge Sepulveda, Danielle Hangen, Maria Martin, Dushyanth Jyothi, Jonas De Almeida, Peter McGarvey, Adil Alaoui, Sarah Cha, Art Sedrakyan, Evan Shoelle, Michael Matheny, Michele LeNoue-Newton, Robert Winter, Stephen Deppen, Vahan Simonyan, Anelia Horvath
Objectives: Federated Ecosystems for Analytics and Standardized Technologies (FEAST) is a modular, cloud-based platform developed through the ARPA-H Biomedical Data Fabric initiative to enable secure, federated analysis of real-world biomedical data. To guide and iteratively refine its modular design, the FEAST team conducted a cross-institutional survey to systematically identify and prioritize research needs related to authorized-access data across diverse biomedical domains. This study presents a structured synthesis of submitted use cases to uncover infrastructure gaps, data integration challenges, and translational opportunities. The results from the survey inform both front-end user-facing functionality and backend data requirements, shaping how the interface supports user interactions, data types, and compliance with security and interoperability standards.
Materials and methods: A structured survey form was distributed to researchers affiliated with participating institutions, including DNA-HIVE, The George Washington University (GW-FEAST), Weill Cornell Medicine, Vanderbilt University Medical Center, Georgetown University, European Bioinformatics Institute, and Kaiser Permanente. Respondents completed standardized fields describing the data types of interest, project goals, analytic methods, and perceived technical barriers. The collected responses were curated and analyzed to identify common needs related to privacy, interoperability, scalability, and workflow reproducibility.
Results: The survey compiled 61 use cases spanning genomics, imaging, clinical phenotyping, EHR-driven analytics, and precision medicine. Common themes included the need for multi-modal data integration, HL7 FHIR-based secure access, federated model training without PII retention, and containerized microservices for scalable deployment. Convergent needs across institutions emphasized consistent demand for FAIR-compliant infrastructure and readiness for real-world data analytics.
Conclusion: The FEAST Use Cases survey provides a cross-sectional view of biomedical informatics priorities grounded in real-world data needs. The findings offer a strategic blueprint for developing federated, privacy-preserving infrastructure to support secure, collaborative, and scalable biomedical research.
"From use cases to infrastructure: a cross-institutional survey of priorities in data-driven biomedical research." Journal of the American Medical Informatics Association (2026-01-20). doi:10.1093/jamia/ocag001
Purpose: To highlight the importance of reporting negative results in large language model (LLM) research, particularly as these systems are increasingly integrated into healthcare.
Potential: LLMs offer transformative capabilities in text generation, summarization, and clinical decision support. Transparent documentation of both successes and failures can accelerate innovation, improve reproducibility, and guide safe deployment.
Caution: Publication bias toward positive findings conceals model limitations, biases, and reproducibility challenges. In healthcare, underreporting failures risks patient safety, ethical lapses, and wasted resources. Structural barriers, including a lack of standards and limited funding for failure analysis, perpetuate this cycle.
Conclusions: Negative results should be recognized as valuable contributions that delineate the boundaries of LLM applicability. Structured reporting, educational initiatives, and stronger incentives for transparency are essential to ensure responsible, equitable, and trustworthy use of LLMs in healthcare.
Satvik Tripathi, Dana Alkhulaifat, Tessa S Cook. "Positive act of reporting negative results in large language model research: a call for transparency." Journal of the American Medical Informatics Association (2026-01-19). doi:10.1093/jamia/ocaf221
Aparajita Kashyap, Christopher J Allsman, Elizabeth A Campbell, Pooja M Desai, Salvatore G Volpe, Bria P Massey, Tiffani J Bright, Suzanne Bakken, Oliver J Bear Don't Walk Iv, Adrienne Pichon
Objectives: Advancing health through informatics requires attending to justice. Recent policy changes in the United States have introduced significant barriers to promoting justice within informatics due to targeted funding cuts and hostility to science, especially science that prioritizes justice.
Materials and methods: We present five key principles for advancing a justice-oriented informatics agenda, synthesized from our workshop held at the American Medical Informatics Association 2022 Annual Symposium.
Results: These principles are: (1) Recognize knowledge and methodologies across communities; (2) Acknowledge historical and cultural contexts of interactions; (3) Facilitate transparency and accountability through clear measures and metrics; (4) Foster trust and sustainability; and (5) Equitably allocate compensation and resources.
Discussion and conclusion: We discuss barriers to implementing these principles that have arisen since the 2022 workshop and provide recommendations for moving towards justice-oriented informatics. We offer examples of how these principles may be used to frame challenges and adapt to new barriers within biomedical informatics.
"Contextualizing key principles to promote a justice-oriented informatics research agenda: proceedings and reflections from an American Medical Informatics Association workshop." Journal of the American Medical Informatics Association (2026-01-19). doi:10.1093/jamia/ocaf210
Xinsong Du, Zhengyang Zhou, Yifei Wang, Ya-Wen Chuang, Yiming Li, Richard Yang, Wenyu Zhang, Xinyi Wang, Xinyu Chen, Hao Guan, John Lian, Pengyu Hong, David W Bates, Li Zhou
Background: The use of generative large language models (LLMs) with electronic health record (EHR) data is rapidly expanding to support clinical and research tasks. This systematic review characterizes the clinical fields and use cases that have been studied and evaluated to date.
Methods: We followed the Preferred Reporting Items for Systematic Review and Meta-Analyses guidelines to conduct a systematic review of articles from PubMed and Web of Science published between January 1, 2023, and November 9, 2024. Studies were included if they used generative LLMs to analyze real-world EHR data and reported quantitative performance evaluations. Through data extraction, we identified clinical specialties and tasks for each included article, and summarized evaluation methods.
Results: Of the 18 735 articles retrieved, 196 met our criteria. Most studies focused on radiology (26.0%), oncology (10.7%), and emergency medicine (6.6%). Regarding clinical tasks, clinical decision support made up the largest proportion of studies (62.2%), while summarizations and patient communications made up the smallest, at 5.6% and 5.1%, respectively. In addition, GPT-4 and GPT-3.5 were the most commonly used generative LLMs, appearing in 60.2% and 57.7% of studies, respectively. Across these studies, we identified 22 unique non-NLP metrics and 35 unique NLP metrics. While NLP metrics offer greater scalability, none demonstrated a strong correlation with gold-standard human evaluations.
Conclusion: Our findings highlight the need to evaluate generative LLMs on EHR data across a broader range of clinical specialties and tasks, as well as the urgent need for standardized, scalable, and clinically meaningful evaluation frameworks.
"Testing and evaluation of generative large language models in electronic health record applications: a systematic review." Journal of the American Medical Informatics Association (2026-01-13). doi:10.1093/jamia/ocaf233
Dori A Cross, Josh Weiner, Hannah T Neprash, Genevieve B Melton, Andrew Olson
Objective: To characterize the nature and consequence(s) of interdependent physician electronic health record (EHR) work across inpatient shifts.
Materials and methods: Pooled cross-sectional analysis of EHR metadata associated with hospital medicine patients at an academic medical center, January-June 2022. Using patient-day observations, we fit a mixed effects regression model with daytime physician random effects to examine nightshift behavior (handoff time, total EHR time) as a function of behaviors by the preceding daytime team. We also assess whether nighttime patient deterioration is predicted by team coordination behaviors across shifts.
Results: We observed 19 671 patient days (N = 2708 encounters). Physicians used the handoff tool consistently, generally spending 8-12 minutes per shift editing patient information. When the day service team was more activated (highest tercile of handoff time and overall EHR time), the nightshift experienced increased levels of EHR work and patient risk of overnight decline was elevated (ie, busy predicts busy). However, lower levels of dayshift activation were also associated with nightshift spillovers, including higher overnight EHR work and increased likelihood of patient clinical decline. Patient-days in the lowest and highest terciles of dayshift EHR time had a 1 percentage point higher relative risk of overnight decline (baseline prevalence of 4.4%) compared with the middle tercile (P = .04).
Discussion: We find evidence of spillovers in EHR work from dayshift to nightshift. Additionally, the lowest and highest levels of dayshift EHR activity are associated with increased risk of overnight patient decline. Results are associational and motivate further examination of additional confounding factors.
Conclusion: Analyses reveal opportunities to address task interdependence across shifts, using technology to flexibly shape and support collaborative teaming practices in complex clinical environments.
{"title":"Digital interdependence: impact of work spillover during clinical team handoffs.","authors":"Dori A Cross, Josh Weiner, Hannah T Neprash, Genevieve B Melton, Andrew Olson","doi":"10.1093/jamia/ocaf212","DOIUrl":"https://doi.org/10.1093/jamia/ocaf212","url":null,"abstract":"<p><strong>Objective: </strong>To characterize the nature and consequence(s) of interdependent physician electronic health record (EHR) work across inpatient shifts.</p><p><strong>Materials and methods: </strong>Pooled cross-sectional analysis of EHR metadata associated with hospital medicine patients at an academic medical center, January-June 2022. Using patient-day observation data, we use a mixed effects regression model with daytime physician random effects to examine nightshift behavior (handoff time, total EHR time) as a function of behaviors by the preceding daytime team. We also assess whether nighttime patient deterioration is predicted by team coordination behaviors across shifts.</p><p><strong>Results: </strong>We observed 19 671 patient days (N = 2708 encounters). Physicians used the handoff tool consistently, generally spending 8-12 minutes per shift editing patient information. When the day service team was more activated (highest tercile of handoff time, overall EHR time), nightshift experienced increased levels of EHR work and patient risk of overnight decline was elevated. (ie, Busy predicts busy). However, lower levels of dayshift activation were also associated with nightshift spillovers, including higher overnight EHR work and increased likelihood of patient clinical decline. Patient-days in the lowest and highest terciles of dayshift EHR time had a 1 percentage point increased relative risk of overnight decline (baseline prevalence of 4.4%) compared to the middle tercile (P = .04).</p><p><strong>Discussion: </strong>We find evidence of spillovers in EHR work from dayshift to nightshift. 
Additionally, the lowest and highest levels of dayshift EHR activity are associated with increased risk of overnight patient decline. Results are associational and motivate further examination of additional confounding factors.</p><p><strong>Conclusion: </strong>Analyses reveal opportunities to address task interdependence across shifts, using technology to flexibly shape and support collaborative teaming practices in complex clinical environments.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
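The cross-shift regression described in the Cross et al. abstract can be sketched with statsmodels' MixedLM. This is a minimal illustration on simulated data: the variable names (night_ehr_min, day_handoff_min, physician_id) and all coefficients are assumptions for demonstration, not the study's actual schema or estimates.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
n = 600  # simulated patient-day observations
df = pd.DataFrame({
    "physician_id": rng.integers(0, 30, n),            # daytime physician (random effect)
    "day_tercile": rng.choice(["low", "mid", "high"], n),
    "day_handoff_min": rng.gamma(4.0, 2.5, n),         # dayshift handoff editing time
})
df["day_low"] = (df["day_tercile"] == "low").astype(float)
df["day_high"] = (df["day_tercile"] == "high").astype(float)
# Simulate the U-shaped spillover the abstract reports: both the lowest and the
# highest dayshift-activation terciles raise nightshift EHR time.
df["night_ehr_min"] = (40 + 5 * df["day_low"] + 8 * df["day_high"]
                       + 0.5 * df["day_handoff_min"] + rng.normal(0, 5, n))

# Mixed effects regression of nightshift EHR time on dayshift behavior,
# with a random intercept for each daytime physician.
res = smf.mixedlm("night_ehr_min ~ day_low + day_high + day_handoff_min",
                  data=df, groups=df["physician_id"]).fit()
print(res.params[["day_low", "day_high", "day_handoff_min"]])
```

With both tercile dummies in the model, the middle tercile serves as the reference group, matching the abstract's comparison of the lowest and highest terciles against the middle one.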
John Baierl, Yi-Wen Hsiao, Michelle R Jones, Pei-Chen Peng, Paul D P Pharoah
Objective: Accurate phenotyping is an essential task for researchers utilizing electronic health record (EHR)-linked biobank programs like the All of Us Research Program to study human genetics. However, little guidance is available on how to select an EHR-based phenotyping procedure that maximizes downstream statistical power. This study aims to estimate the accuracy of three phenotype definitions of ovarian, female breast, and colorectal cancers in All of Us (v7 release) and determine which is most likely to optimize downstream statistical power for genetic association testing.
Materials and methods: We used empirical carrier frequencies of deleterious variants in known risk genes to estimate the accuracy of each phenotype definition and compute statistical power after accounting for the probability of outcome misclassification.
Results: We found that the choice of phenotype definition can have a substantial impact on statistical power for association testing and that no approach was optimal across all tested diseases. The impact on power was particularly acute for rarer diseases and target risk alleles of moderate penetrance or low frequency. Additionally, our results suggest that the accuracy of higher-complexity phenotyping algorithms is inconsistent across Black and non-Hispanic White participants in All of Us, highlighting the potential for case ascertainment biases to impact downstream association testing.
Discussion: EHR-based phenotyping presents a bottleneck for maximizing power to detect novel risk alleles in All of Us, as well as a potential source of differential outcome misclassification that researchers should be aware of. We discuss the implications of this as well as potential mitigation strategies.
{"title":"Measuring the accuracy of electronic health record-based phenotyping in the All of Us Research Program to optimize statistical power for genetic association testing.","authors":"John Baierl, Yi-Wen Hsiao, Michelle R Jones, Pei-Chen Peng, Paul D P Pharoah","doi":"10.1093/jamia/ocaf234","DOIUrl":"https://doi.org/10.1093/jamia/ocaf234","url":null,"abstract":"<p><strong>Objective: </strong>Accurate phenotyping is an essential task for researchers utilizing electronic health record (EHR)-linked biobank programs like the All of Us Research Program to study human genetics. However, little guidance is available on how to select an EHR-based phenotyping procedure that maximizes downstream statistical power. This study aims to estimate accuracy of three phenotype definitions of ovarian, female breast, and colorectal cancers in All of Us (v7 release) and determine which is most likely to optimize downstream statistical power for genetic association testing.</p><p><strong>Materials and methods: </strong>We used empirical carrier frequencies of deleterious variants in known risk genes to estimate the accuracy of each phenotype definition and compute statistical power after accounting for the probability of outcome misclassification.</p><p><strong>Results: </strong>We found that the choice of phenotype definition can have a substantial impact on statistical power for association testing and that no approach was optimal across all tested diseases. The impact on power was particularly acute for rarer diseases and target risk alleles of moderate penetrance or low frequency. 
Additionally, our results suggest that the accuracy of higher-complexity phenotyping algorithms is inconsistent across Black and non-Hispanic White participants in All of Us, highlighting the potential for case ascertainment biases to impact downstream association testing.</p><p><strong>Discussion: </strong>EHR-based phenotyping presents a bottleneck for maximizing power to detect novel risk alleles in All of Us, as well as a potential source of differential outcome misclassification that researchers should be aware of. We discuss the implications of this as well as potential mitigation strategies.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
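The core idea in the Baierl et al. abstract — that outcome misclassification from imperfect phenotyping attenuates power for association testing — can be illustrated with a normal-approximation power calculation for comparing carrier frequencies between cases and controls. All numbers here (carrier frequencies, PPV, sample sizes) are illustrative assumptions, not values from the study.

```python
from scipy.stats import norm

def power_two_prop(p1, p0, n1, n0, alpha=0.05):
    """Normal-approximation power for a two-sample comparison of carrier frequencies."""
    se = (p1 * (1 - p1) / n1 + p0 * (1 - p0) / n0) ** 0.5
    z = abs(p1 - p0) / se
    return norm.cdf(z - norm.ppf(1 - alpha / 2))

p_case, p_ctrl = 0.04, 0.005   # deleterious-variant carrier frequencies (assumed)
n_case, n_ctrl = 500, 5000
ppv = 0.7                      # phenotype definition's positive predictive value (assumed)
# Non-differential outcome misclassification mixes true controls into the
# observed case group, diluting the observed carrier-frequency difference:
p_case_obs = ppv * p_case + (1 - ppv) * p_ctrl

power_full = power_two_prop(p_case, p_ctrl, n_case, n_ctrl)
power_att = power_two_prop(p_case_obs, p_ctrl, n_case, n_ctrl)
print(f"power with perfect phenotyping: {power_full:.3f}")
print(f"power with PPV={ppv}:          {power_att:.3f}")
```

The attenuation grows as the phenotype's PPV drops or as the variant becomes rarer, consistent with the abstract's observation that the impact is most acute for rarer diseases and low-frequency risk alleles.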
Miguel Linares, Jorge A Rodriguez, Lauren E Wisk, Douglas S Bell, Arleen Brown, Alejandra Casillas
Using 2023-2024 U.S. National Health Interview Survey data, we found that digital health literacy (dHL) mediated nearly half of the difference in telehealth use between Latino adults with non-English and English language preference. These findings identify dHL as a modifiable mechanism linking linguistic and digital access barriers, underscoring the need for multilingual, inclusive, and equitable telehealth design.
{"title":"Digital health literacy as mediator between language preference and telehealth use among Latinos in the United States.","authors":"Miguel Linares, Jorge A Rodriguez, Lauren E Wisk, Douglas S Bell, Arleen Brown, Alejandra Casillas","doi":"10.1093/jamia/ocaf232","DOIUrl":"https://doi.org/10.1093/jamia/ocaf232","url":null,"abstract":"<p><p>Using 2023-2024 U.S. National Health Interview Survey data, we found that digital health literacy (dHL) mediated nearly half of the difference in telehealth use between Latino adults with non-English and English language preference. These findings identify dHL as a modifiable mechanism linking linguistic and digital access barriers, underscoring the need for multilingual, inclusive, and equitable telehealth design.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
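The mediation result summarized in the Linares et al. abstract — the share of the language-preference gap in telehealth use explained by digital health literacy — can be sketched with the difference-of-coefficients method. The data below are simulated, the outcome is treated as continuous (linear-probability style) for simplicity, and the coefficients are assumptions chosen so the proportion mediated lands near one half.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)
n = 5000
non_english = rng.integers(0, 2, n)                      # 1 = non-English language preference
dhl = 0.6 - 0.3 * non_english + rng.normal(0, 0.2, n)    # dHL score, lower for non-English
telehealth = 0.2 + 0.5 * dhl - 0.15 * non_english + rng.normal(0, 0.1, n)
df = pd.DataFrame({"non_english": non_english, "dhl": dhl, "telehealth": telehealth})

# Total effect (c): gap in telehealth use by language preference, ignoring dHL.
total = smf.ols("telehealth ~ non_english", df).fit().params["non_english"]
# Direct effect (c'): the gap remaining after adjusting for the mediator.
direct = smf.ols("telehealth ~ non_english + dhl", df).fit().params["non_english"]
prop_mediated = (total - direct) / total
print(f"proportion mediated by dHL: {prop_mediated:.2f}")
```

In practice a binary telehealth-use outcome would call for logistic models and a mediation method that handles noncollapsibility (eg, counterfactual-based mediation analysis); this sketch only shows the arithmetic of the proportion-mediated estimand.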