首页 > 最新文献

Journal of Biomedical Informatics最新文献

英文 中文
Deep survival analysis for interpretable time-varying prediction of preeclampsia risk 深度生存分析可解释子痫前期风险的时变预测。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-11 DOI: 10.1016/j.jbi.2024.104688
Braden W. Eberhard , Kathryn J. Gray , David W. Bates , Vesela P. Kovacheva

Objective

Survival analysis is widely utilized in healthcare to predict the timing of disease onset. Traditional methods of survival analysis are usually based on Cox Proportional Hazards model and assume proportional risk for all subjects. However, this assumption is rarely true for most diseases, as the underlying factors have complex, non-linear, and time-varying relationships. This concern is especially relevant for pregnancy, where the risk for pregnancy-related complications, such as preeclampsia, varies across gestation. Recently, deep learning survival models have shown promise in addressing the limitations of classical models, as the novel models allow for non-proportional risk handling, capturing nonlinear relationships, and navigating complex temporal dynamics.

Methods

We present a methodology to model the temporal risk of preeclampsia during pregnancy and investigate the associated clinical risk factors. We utilized a retrospective dataset including 66,425 pregnant individuals who delivered in two tertiary care centers from 2015 to 2023. We modeled the preeclampsia risk by modifying DeepHit, a deep survival model, which leverages neural network architecture to capture time-varying relationships between covariates in pregnancy. We applied time series k-means clustering to DeepHit’s normalized output and investigated interpretability using Shapley values.

Results

We demonstrate that DeepHit can effectively handle high-dimensional data and evolving risk hazards over time with performance similar to the Cox Proportional Hazards model, achieving an area under the curve (AUC) of 0.78 for both models. The deep survival model outperformed traditional methodology by identifying time-varied risk trajectories for preeclampsia, providing insights for early and individualized intervention. K-means clustering resulted in patients delineating into low-risk, early-onset, and late-onset preeclampsia groups—notably, each of those has distinct risk factors.

Conclusion

This work demonstrates a novel application of deep survival analysis in time-varying prediction of preeclampsia risk. Our results highlight the advantage of deep survival models compared to Cox Proportional Hazards models in providing personalized risk trajectory and demonstrating the potential of deep survival models to generate interpretable and meaningful clinical applications in medicine.

目的:生存分析被广泛应用于医疗保健领域,以预测疾病的发病时间。传统的生存分析方法通常基于 Cox 比例危害模型,并假设所有受试者的风险成正比。然而,对于大多数疾病来说,这一假设很少成立,因为潜在的因素具有复杂、非线性和时变的关系。这一问题与妊娠尤其相关,因为妊娠相关并发症(如子痫前期)的风险在不同的妊娠期是不同的。最近,深度学习生存模型有望解决经典模型的局限性,因为新型模型可以处理非比例风险,捕捉非线性关系,并驾驭复杂的时间动态:我们提出了一种方法来模拟孕期子痫前期的时间风险,并研究相关的临床风险因素。我们利用了一个回顾性数据集,其中包括 2015 年至 2023 年期间在两个三级医疗中心分娩的 66425 名孕妇。我们通过修改深度生存模型 DeepHit 对子痫前期风险进行建模,该模型利用神经网络架构捕捉妊娠期协变量之间的时变关系。我们对 DeepHit 的归一化输出进行了时间序列 k-means 聚类,并使用 Shapley 值研究了可解释性:我们证明,DeepHit 能有效处理高维数据和随时间演变的风险危害,其性能与 Cox 比例危害模型相似,两个模型的曲线下面积 (AUC) 均为 0.78。深度生存模型通过识别子痫前期随时间变化的风险轨迹,为早期和个体化干预提供了见解,其性能优于传统方法。K均值聚类将患者划分为低风险、早发和晚发子痫前期群体,值得注意的是,每个群体都有不同的风险因素:这项研究展示了深度生存分析在子痫前期风险时变预测中的新应用。我们的研究结果凸显了深度生存模型与 Cox 比例危害模型相比在提供个性化风险轨迹方面的优势,并展示了深度生存模型在医学领域产生可解释且有意义的临床应用的潜力。
{"title":"Deep survival analysis for interpretable time-varying prediction of preeclampsia risk","authors":"Braden W. Eberhard ,&nbsp;Kathryn J. Gray ,&nbsp;David W. Bates ,&nbsp;Vesela P. Kovacheva","doi":"10.1016/j.jbi.2024.104688","DOIUrl":"10.1016/j.jbi.2024.104688","url":null,"abstract":"<div><h3>Objective</h3><p>Survival analysis is widely utilized in healthcare to predict the timing of disease onset. Traditional methods of survival analysis are usually based on Cox Proportional Hazards model and assume proportional risk for all subjects. However, this assumption is rarely true for most diseases, as the underlying factors have complex, non-linear, and time-varying relationships. This concern is especially relevant for pregnancy, where the risk for pregnancy-related complications, such as preeclampsia, varies across gestation. Recently, deep learning survival models have shown promise in addressing the limitations of classical models, as the novel models allow for non-proportional risk handling, capturing nonlinear relationships, and navigating complex temporal dynamics.</p></div><div><h3>Methods</h3><p>We present a methodology to model the temporal risk of preeclampsia during pregnancy and investigate the associated clinical risk factors. We utilized a retrospective dataset including 66,425 pregnant individuals who delivered in two tertiary care centers from 2015 to 2023. We modeled the preeclampsia risk by modifying DeepHit, a deep survival model, which leverages neural network architecture to capture time-varying relationships between covariates in pregnancy. We applied time series k-means clustering to DeepHit’s normalized output and investigated interpretability using Shapley values.</p></div><div><h3>Results</h3><p>We demonstrate that DeepHit can effectively handle high-dimensional data and evolving risk hazards over time with performance similar to the Cox Proportional Hazards model, achieving an area under the curve (AUC) of 0.78 for both models. The deep survival model outperformed traditional methodology by identifying time-varied risk trajectories for preeclampsia, providing insights for early and individualized intervention. K-means clustering resulted in patients delineating into low-risk, early-onset, and late-onset preeclampsia groups—notably, each of those has distinct risk factors.</p></div><div><h3>Conclusion</h3><p>This work demonstrates a novel application of deep survival analysis in time-varying prediction of preeclampsia risk. Our results highlight the advantage of deep survival models compared to Cox Proportional Hazards models in providing personalized risk trajectory and demonstrating the potential of deep survival models to generate interpretable and meaningful clinical applications in medicine.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104688"},"PeriodicalIF":4.0,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141603697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Utilization of geospatial distribution in the measurement of study cohort representativeness 利用地理空间分布测量研究队列的代表性。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-08 DOI: 10.1016/j.jbi.2024.104687

Objective

The ability to apply results from a study to a broader population remains a primary objective in translational science. Distinct from intrinsic elements of scientific rigor, the extrinsic concept of generalization requires there be alignment between a study cohort and population in which results are expected to be applied. Widespread efforts have been made to quantify representativeness of study cohorts. These techniques, however, often consider the study and target cohorts as monolithic collections that can be directly compared. Overlooking known impacts to health from socio-demographic and environmental factors tied to individual’s geographical location, and potentially obfuscating misalignment in underrepresented population subgroups. This manuscript introduces several measures to account for geographic information in the assessment of cohort representation.

Methods

Metrics were defined across two themes: First, measures of recruitment, to assess if a study cohort is drawn at an expected rate and in an expected geographical pattern with respect to individuals in a reference cohort. Second, measures of individual characteristics, to assess if the individuals in the study cohort accurately reflect the sociodemographic, clinical, and geographic diversity observed across a reference cohort while accounting for the geospatial proximity of individuals.

Results

As an empirical demonstration, methods are applied to an active clinical study examining asthma in Black/African American patients at a US Midwestern pediatric hospital. Results illustrate how areas of over- and under-recruitment can be identified and contextualized in light of study recruitment patterns at an individual-level, highlighting the ability to identify a subset of features for which the study cohort closely resembled the broader population. In addition they provide an opportunity to dive deeper into misalignments, to identify study cohort members that are in some way distinct from the communities for which they are expected to represent.

Conclusion

Together, these metrics provide a comprehensive spatial assessment of a study cohort with respect to a broader target population. Such an approach offers researchers a toolset by which to target expected generalization of results derived from a given study.

目的:将研究结果应用于更广泛人群的能力仍是转化科学的首要目标。与科学严谨性的内在要素不同,"推广 "这一外在概念要求研究队列与预期应用研究结果的人群保持一致。为了量化研究队列的代表性,人们做出了广泛的努力。然而,这些技术通常将研究队列和目标队列视为可以直接比较的单一集合。这种方法忽视了与个人地理位置相关的社会人口和环境因素对健康的已知影响,并有可能掩盖代表性不足的人口亚群中的错位。本手稿介绍了几种测量方法,以便在测量群组代表性时考虑地理信息:方法:我们定义了两个主题的衡量标准。首先是招募指标,用于评估研究队列是否以预期的速度和预期的地理模式招募参照队列中的个体。量化目标人群在不同地理区域的覆盖率和分布情况。第二,测量个体特征,以评估研究队列是否准确反映了在参照队列中观察到的社会人口、临床和地理多样性。采用个体内部距离测量法和总体排列测量法,旨在考虑个体的地理空间接近性:结果:作为实证演示,我们将这些方法应用于一项正在进行的临床研究中,研究对象是美国中西部一家儿科医院的黑人和非裔美国人哮喘患者。研究结果表明了如何根据研究招募模式来确定过度招募和招募不足的区域,并对其进行背景分析。在个体层面上,突出了确定研究队列与更广泛人群密切相关的特征子集的能力。此外,还有机会深入研究错位问题,以确定研究队列成员在某种程度上有别于其预期代表的社区:这些指标结合在一起,为研究队列与更广泛的目标人群提供了全面的空间评估。这种方法为研究人员提供了一个工具集,可以据此对特定研究得出的结果进行预期推广。
{"title":"Utilization of geospatial distribution in the measurement of study cohort representativeness","authors":"","doi":"10.1016/j.jbi.2024.104687","DOIUrl":"10.1016/j.jbi.2024.104687","url":null,"abstract":"<div><h3>Objective</h3><p>The ability to apply results from a study to a broader population remains a primary objective in translational science. Distinct from intrinsic elements of scientific rigor, the extrinsic concept of generalization requires there be alignment between a study cohort and population in which results are expected to be applied. Widespread efforts have been made to quantify representativeness of study cohorts. These techniques, however, often consider the study and target cohorts as monolithic collections that can be directly compared. Overlooking known impacts to health from socio-demographic and environmental factors tied to individual’s geographical location, and potentially obfuscating misalignment in underrepresented population subgroups. This manuscript introduces several measures to account for geographic information in the assessment of cohort representation.</p></div><div><h3>Methods</h3><p>Metrics were defined across two themes: First, <em>measures of recruitment,</em> to assess if a study cohort is drawn at an expected rate and in an expected geographical pattern with respect to individuals in a reference cohort. Second, <em>measures of individual characteristics,</em> to assess if the individuals in the study cohort accurately reflect the sociodemographic, clinical, and geographic diversity observed across a reference cohort while accounting for the geospatial proximity of individuals.</p></div><div><h3>Results</h3><p>As an empirical demonstration, methods are applied to an active clinical study examining asthma in Black/African American patients at a US Midwestern pediatric hospital. Results illustrate how areas of over- and under-recruitment can be identified and contextualized in light of study recruitment patterns at an individual-level, highlighting the ability to identify a subset of features for which the study cohort closely resembled the broader population. In addition they provide an opportunity to dive deeper into misalignments, to identify study cohort members that are in some way distinct from the communities for which they are expected to represent.</p></div><div><h3>Conclusion</h3><p>Together, these metrics provide a comprehensive spatial assessment of a study cohort with respect to a broader target population. Such an approach offers researchers a toolset by which to target expected generalization of results derived from a given study.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104687"},"PeriodicalIF":4.0,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424001059/pdfft?md5=d8afd893b376d6738e0d2c0c882fac22&pid=1-s2.0-S1532046424001059-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141579806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementation and evaluation of a system for assessment of the quality of long-term management of patients at a geriatric hospital 老年病医院病人长期管理质量评估系统的实施与评估。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-06 DOI: 10.1016/j.jbi.2024.104686
Erez Shalom , Ayelet Goldstein , Rony Weiss , Maya Selivanova , Nogah Melamed Cohen , Yuval Shahar

Background

The increasing aging population presents a significant challenge, accompanied by a shortage of professional caregivers, adding to the therapeutic burden. Clinical decision support systems, utilizing computerized clinical guidelines, can improve healthcare quality, reduce expenses, save time, and boost caregiver efficiency.

Objectives

1) Develop and evaluate an automated quality assessment (QA) system for retrospective longitudinal care quality analysis, focusing on clinical staff adherence to evidence-based guidelines (GLs). 2) Assess the system’s technical feasibility and functional capability for senior nurse use in geriatric pressure-ulcer management.

Methods

A computational QA system using our Quality Assessment Temporal Patterns (QATP) methodology was designed and implemented. Our methodology transforms the GL’s procedural-knowledge into declarative-knowledge temporal-abstraction patterns representing the expected execution trace in the patient’s data for correct therapy application. Fuzzy temporal logic allows for partial compliance, reflecting individual and grouped action performance considering their values and temporal aspects. The system was tested using a pressure ulcer treatment GL and data from 100 geriatric patients’ Electronic Medical Records (EMR). After technical evaluation for accuracy and feasibility, an extensive functional evaluation was conducted by an experienced nurse, comparing QA scores with and without system support, and versus automated system scores. Time efficiency was also measured.

Results

QA scores from the geriatric nurse, with and without system’s support, did not significantly differ from those provided by the automated system (p < 0.05), demonstrating the effectiveness and reliability of both manual and automated methods. The system-supported manual QA process reduced scoring time by approximately two-thirds, from an average of 17.3 min per patient manually to about 5.9 min with the system’s assistance, highlighting the system’s efficiency potential in clinical practice.

Conclusion

The QA system based on QATP, produces scores consistent with an experienced nurse’s assessment for complex care over extended periods. It enables quick and accurate quality care evaluation for multiple patients after brief training. Such automated QA systems may empower nursing staff, enabling them to manage more patients, accurately and consistently, while reducing costs due to saved time and effort, and enhanced compliance with evidence-based guidelines.

背景:人口老龄化的加剧带来了巨大的挑战,同时专业护理人员的短缺也加重了治疗负担。利用计算机临床指南的临床决策支持系统可以提高医疗质量、减少开支、节省时间并提高护理人员的工作效率:1)开发并评估一个自动质量评估(QA)系统,用于回顾性纵向护理质量分析,重点关注临床工作人员对循证指南(GL)的遵守情况。2)评估该系统在老年压疮管理中用于高级护士的技术可行性和功能能力:方法:采用我们的质量评估时间模式(QATP)方法设计并实施了一个计算质量评估系统。我们的方法将 GL 的程序性知识转化为陈述性知识的时间抽象模式,代表了患者数据中的预期执行轨迹,从而实现正确的治疗应用。模糊时间逻辑允许部分遵从,反映出考虑到其价值和时间方面的个人和分组行动性能。该系统使用压疮治疗 GL 和 100 名老年病人的电子病历(EMR)数据进行了测试。在对准确性和可行性进行技术评估后,由一名经验丰富的护士进行了广泛的功能评估,比较了有系统支持和无系统支持的 QA 分数,以及自动系统分数。此外,还对时间效率进行了测量:结果:在有系统支持和没有系统支持的情况下,老年病科护士提供的 QA 分数与自动系统提供的分数没有显著差异(p 结论:老年病科护士提供的 QA 分数与自动系统提供的分数没有显著差异(p 结论):以 QATP 为基础的质量评估系统得出的分数与经验丰富的护士对长期复杂护理的评估结果一致。经过简短培训后,该系统可对多名患者进行快速、准确的优质护理评估。这种自动化质量评估系统可以增强护理人员的能力,使他们能够准确、一致地管理更多病人,同时由于节省了时间和精力而降低了成本,并能更好地遵守循证指南。
{"title":"Implementation and evaluation of a system for assessment of the quality of long-term management of patients at a geriatric hospital","authors":"Erez Shalom ,&nbsp;Ayelet Goldstein ,&nbsp;Rony Weiss ,&nbsp;Maya Selivanova ,&nbsp;Nogah Melamed Cohen ,&nbsp;Yuval Shahar","doi":"10.1016/j.jbi.2024.104686","DOIUrl":"10.1016/j.jbi.2024.104686","url":null,"abstract":"<div><h3>Background</h3><p>The increasing aging population presents a significant challenge, accompanied by a shortage of professional caregivers, adding to the therapeutic burden. Clinical decision support systems, utilizing computerized clinical guidelines, can improve healthcare quality, reduce expenses, save time, and boost caregiver efficiency.</p></div><div><h3>Objectives</h3><p>1) Develop and evaluate an automated quality assessment (QA) system for retrospective longitudinal care quality analysis, focusing on clinical staff adherence to evidence-based guidelines (GLs). 2) Assess the system’s technical feasibility and functional capability for senior nurse use in geriatric pressure-ulcer management.</p></div><div><h3>Methods</h3><p>A computational QA system using our <em>Quality Assessment Temporal Patterns</em> (QATP) methodology was designed and implemented. Our methodology transforms the GL’s <em>procedural-knowledge</em> into <em>declarative-knowledge</em> temporal-abstraction patterns representing the expected execution trace in the patient’s data for correct therapy application. Fuzzy temporal logic allows for partial compliance, reflecting individual and grouped action performance considering their values and temporal aspects. The system was tested using a pressure ulcer treatment GL and data from 100 geriatric patients’ Electronic Medical Records (EMR). After technical evaluation for accuracy and feasibility, an extensive functional evaluation was conducted by an experienced nurse, comparing QA scores with and without system support, and versus automated system scores. Time efficiency was also measured.</p></div><div><h3>Results</h3><p>QA scores from the geriatric nurse, with and without system’s support, did not significantly differ from those provided by the automated system (p &lt; 0.05), demonstrating the effectiveness and reliability of both manual and automated methods. The system-supported manual QA process reduced scoring time by approximately two-thirds, from an average of 17.3 min per patient manually to about 5.9 min with the system’s assistance, highlighting the system’s efficiency potential in clinical practice.</p></div><div><h3>Conclusion</h3><p>The QA system based on QATP, produces scores consistent with an experienced nurse’s assessment for complex care over extended periods. It enables quick and accurate quality care evaluation for multiple patients after brief training. Such automated QA systems may empower nursing staff, enabling them to manage more patients, accurately and consistently, while reducing costs due to saved time and effort, and enhanced compliance with evidence-based guidelines.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104686"},"PeriodicalIF":4.0,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141558850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Provision and evaluation of explanations within an automated planning-based approach to solving the multimorbidity problem 在基于自动规划的方法中提供和评估解释,以解决多病问题。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-01 DOI: 10.1016/j.jbi.2024.104681
Martin Michalowski , Szymon Wilk , Wojtek Michalowski , Malvika Rao , Marc Carrier

The multimorbidity problem involves the identification and mitigation of adverse interactions that occur when multiple computer interpretable guidelines are applied concurrently to develop a treatment plan for a patient diagnosed with multiple diseases. Solving this problem requires decision support approaches which are difficult to comprehend for physicians. As such, the rationale for treatment plans generated by these approaches needs to be provided.

Objective:

To develop an explainability component for an automated planning-based approach to the multimorbidity problem, and to assess the fidelity and interpretability of generated explanations using a clinical case study.

Methods:

The explainability component leverages the task-network model for representing computer interpretable guidelines. It generates post-hoc explanations composed of three aspects that answer why specific clinical actions are in a treatment plan, why specific revisions were applied, and how factors like medication cost, patient’s adherence, etc. influence the selection of specific actions. The explainability component is implemented as part of MitPlan, where we revised our planning-based approach to support explainability. We developed an evaluation instrument based on the system causability scale and other vetted surveys to evaluate the fidelity and interpretability of its explanations using a two dimensional comparison study design.

Results:

The explainability component was implemented for MitPlan and tested in the context of a clinical case study. The fidelity and interpretability of the generated explanations were assessed using a physician-focused evaluation study involving 21 participants from two different specialties and two levels of experience. Results show that explanations provided by the explainability component in MitPlan are of acceptable fidelity and interpretability, and that the clinical justification of the actions in a treatment plan is important to physicians.

Conclusion:

We created an explainability component that enriches an automated planning-based approach to solving the multimorbidity problem with meaningful explanations for actions in a treatment plan. This component relies on the task-network model to represent computer interpretable guidelines and as such can be ported to other approaches that also use the task-network model representation. Our evaluation study demonstrated that explanations that support a physician’s understanding of the clinical reasons for the actions in a treatment plan are useful and important.

多病同治问题是指在为确诊患有多种疾病的患者制定治疗方案时,同时应用多种计算机可解释的指南,从而识别和减轻不良相互作用。要解决这一问题,需要采用医生难以理解的决策支持方法。因此,需要为这些方法生成的治疗方案提供理由:目的:为基于自动规划的方法开发一个可解释性组件,以解决多病症问题,并通过临床案例研究评估所生成解释的可信度和可解释性:可解释性组件利用任务网络模型表示计算机可解释指南。方法:可解释性组件利用任务网络模型来表示计算机可解释的指南,它能生成由三个方面组成的事后解释,回答治疗计划中为什么会有特定的临床行动、为什么要进行特定的修订,以及药物成本、患者的依从性等因素如何影响特定行动的选择。可解释性部分是作为 MitPlan 的一部分实施的,我们对基于计划的方法进行了修订,以支持可解释性。我们在系统因果关系量表和其他经过审核的调查的基础上开发了一种评估工具,采用二维比较研究设计来评估其解释的忠实性和可解释性:为 MitPlan 实施了可解释性组件,并在临床案例研究中进行了测试。通过一项以医生为中心的评估研究,对所生成的解释的忠实性和可解释性进行了评估,共有来自两个不同专业和两种经验水平的 21 名参与者参与。结果表明,MitPlan 中的可解释性组件所提供的解释具有可接受的保真度和可解释性,而且治疗计划中行动的临床理由对医生来说非常重要:我们创建了一个可解释性组件,通过对治疗计划中的行动进行有意义的解释,丰富了解决多病症问题的自动化计划方法。该组件依靠任务网络模型来表示计算机可解释的指南,因此可以移植到同样使用任务网络模型表示的其他方法中。我们的评估研究表明,能够帮助医生理解治疗计划中行动的临床原因的解释是有用和重要的。
{"title":"Provision and evaluation of explanations within an automated planning-based approach to solving the multimorbidity problem","authors":"Martin Michalowski ,&nbsp;Szymon Wilk ,&nbsp;Wojtek Michalowski ,&nbsp;Malvika Rao ,&nbsp;Marc Carrier","doi":"10.1016/j.jbi.2024.104681","DOIUrl":"10.1016/j.jbi.2024.104681","url":null,"abstract":"<div><p>The multimorbidity problem involves the identification and mitigation of adverse interactions that occur when multiple computer interpretable guidelines are applied concurrently to develop a treatment plan for a patient diagnosed with multiple diseases. Solving this problem requires decision support approaches which are difficult to comprehend for physicians. As such, the rationale for treatment plans generated by these approaches needs to be provided.</p></div><div><h3>Objective:</h3><p>To develop an explainability component for an automated planning-based approach to the multimorbidity problem, and to assess the fidelity and interpretability of generated explanations using a clinical case study.</p></div><div><h3>Methods:</h3><p>The explainability component leverages the task-network model for representing computer interpretable guidelines. It generates post-hoc explanations composed of three aspects that answer why specific clinical actions are in a treatment plan, why specific revisions were applied, and how factors like medication cost, patient’s adherence, etc. influence the selection of specific actions. The explainability component is implemented as part of MitPlan, where we revised our planning-based approach to support explainability. We developed an evaluation instrument based on the system causability scale and other vetted surveys to evaluate the fidelity and interpretability of its explanations using a two dimensional comparison study design.</p></div><div><h3>Results:</h3><p>The explainability component was implemented for MitPlan and tested in the context of a clinical case study. The fidelity and interpretability of the generated explanations were assessed using a physician-focused evaluation study involving 21 participants from two different specialties and two levels of experience. Results show that explanations provided by the explainability component in MitPlan are of acceptable fidelity and interpretability, and that the clinical justification of the actions in a treatment plan is important to physicians.</p></div><div><h3>Conclusion:</h3><p>We created an explainability component that enriches an automated planning-based approach to solving the multimorbidity problem with meaningful explanations for actions in a treatment plan. This component relies on the task-network model to represent computer interpretable guidelines and as such can be ported to other approaches that also use the task-network model representation. Our evaluation study demonstrated that explanations that support a physician’s understanding of the clinical reasons for the actions in a treatment plan are useful and important.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104681"},"PeriodicalIF":4.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141498155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing healthcare process analysis through object-centric process mining: Transforming OMOP common data models into object-centric event logs 通过以对象为中心的流程挖掘加强医疗流程分析:将 OMOP 通用数据模型转化为以对象为中心的事件日志。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-27 DOI: 10.1016/j.jbi.2024.104682
Gyunam Park , Yaejin Lee , Minsu Cho

Objectives:

This study aims to enhance the analysis of healthcare processes by introducing Object-Centric Process Mining (OCPM). By offering a holistic perspective that accounts for the interactions among various objects, OCPM transcends the constraints of conventional patient-centric process mining approaches, ensuring a more detailed and inclusive understanding of healthcare dynamics.

Methods:

We develop a novel method to transform the Observational Medical Outcomes Partnership Common Data Models (OMOP CDM) into Object-Centric Event Logs (OCELs). First, an OMOP CDM4PM is created from the standard OMOP CDM, focusing on data relevant to generating OCEL and addressing healthcare data’s heterogeneity and standardization challenges. Second, this subset is transformed into OCEL based on specified healthcare criteria, including identifying various object types, clinical activities, and their relationships. The methodology is tested on the MIMIC-IV database to evaluate its effectiveness and utility.

Results:

Our proposed method effectively produces OCELs when applied to the MIMIC-IV dataset, allowing for the implementation of OCPM in the healthcare industry. We rigorously evaluate the comprehensiveness and level of abstraction to validate our approach’s effectiveness. Additionally, we create diverse object-centric process models intricately designed to navigate the complexities inherent in healthcare processes.

Conclusion:

Our approach introduces a novel perspective by integrating multiple viewpoints simultaneously. To the best of our knowledge, this is the inaugural application of OCPM within the healthcare sector, marking a significant advancement in the field.

研究目的本研究旨在通过引入以对象为中心的流程挖掘(OCPM)来加强对医疗流程的分析。OCPM 从整体角度考虑各种对象之间的相互作用,超越了传统的以患者为中心的流程挖掘方法的限制,确保对医疗动态有更详细、更全面的了解:我们开发了一种新方法,将观察性医疗结果合作组织通用数据模型(OMOP CDM)转化为以对象为中心的事件日志(OCEL)。首先,从标准 OMOP CDM 创建 OMOP CDM4PM,重点关注与生成 OCEL 相关的数据,并解决医疗保健数据的异构性和标准化难题。其次,根据指定的医疗标准将该子集转换为 OCEL,包括识别各种对象类型、临床活动及其关系。该方法在 MIMIC-IV 数据库中进行了测试,以评估其有效性和实用性:结果:我们提出的方法在应用于 MIMIC-IV 数据集时有效地生成了 OCEL,使 OCPM 得以在医疗行业中实施。我们对方法的全面性和抽象程度进行了严格评估,以验证方法的有效性。此外,我们还创建了多种以对象为中心的流程模型,这些模型设计精巧,可应对医疗保健流程固有的复杂性。我们的方法通过同时整合多种观点引入了一种新的视角。据我们所知,这是 OCPM 在医疗保健领域的首次应用,标志着该领域的重大进展。
{"title":"Enhancing healthcare process analysis through object-centric process mining: Transforming OMOP common data models into object-centric event logs","authors":"Gyunam Park ,&nbsp;Yaejin Lee ,&nbsp;Minsu Cho","doi":"10.1016/j.jbi.2024.104682","DOIUrl":"10.1016/j.jbi.2024.104682","url":null,"abstract":"<div><h3>Objectives:</h3><p>This study aims to enhance the analysis of healthcare processes by introducing Object-Centric Process Mining (OCPM). By offering a holistic perspective that accounts for the interactions among various objects, OCPM transcends the constraints of conventional patient-centric process mining approaches, ensuring a more detailed and inclusive understanding of healthcare dynamics.</p></div><div><h3>Methods:</h3><p>We develop a novel method to transform the Observational Medical Outcomes Partnership Common Data Models (OMOP CDM) into Object-Centric Event Logs (OCELs). First, an OMOP CDM4PM is created from the standard OMOP CDM, focusing on data relevant to generating OCEL and addressing healthcare data’s heterogeneity and standardization challenges. Second, this subset is transformed into OCEL based on specified healthcare criteria, including identifying various object types, clinical activities, and their relationships. The methodology is tested on the MIMIC-IV database to evaluate its effectiveness and utility.</p></div><div><h3>Results:</h3><p>Our proposed method effectively produces OCELs when applied to the MIMIC-IV dataset, allowing for the implementation of OCPM in the healthcare industry. We rigorously evaluate the comprehensiveness and level of abstraction to validate our approach’s effectiveness. Additionally, we create diverse object-centric process models intricately designed to navigate the complexities inherent in healthcare processes.</p></div><div><h3>Conclusion:</h3><p>Our approach introduces a novel perspective by integrating multiple viewpoints simultaneously. To the best of our knowledge, this is the inaugural application of OCPM within the healthcare sector, marking a significant advancement in the field.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104682"},"PeriodicalIF":4.0,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141468240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-preserving model evaluation for logistic and linear regression using homomorphically encrypted genotype data 使用同态加密基因型数据对逻辑回归和线性回归进行隐私保护模型评估。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-25 DOI: 10.1016/j.jbi.2024.104678
Seungwan Hong , Yoolim A. Choi , Daniel S. Joo , Gamze Gürsoy

Objective:

Linear and logistic regression are widely used statistical techniques in population genetics for analyzing genetic data and uncovering patterns and associations in large genetic datasets, such as identifying genetic variations linked to specific diseases or traits. However, obtaining statistically significant results from these studies requires large amounts of sensitive genotype and phenotype information from thousands of patients, which raises privacy concerns. Although cryptographic techniques such as homomorphic encryption offers a potential solution to the privacy concerns as it allows computations on encrypted data, previous methods leveraging homomorphic encryption have not addressed the confidentiality of shared models, which can leak information about the training data.

Methods:

In this work, we present a secure model evaluation method for linear and logistic regression using homomorphic encryption for six prediction tasks, where input genotypes, output phenotypes, and model parameters are all encrypted.

Results:

Our method ensures no private information leakage during inference and achieves high accuracy (93% for all outcomes) with each inference taking less than ten seconds for 200 genomes.

Conclusion:

Our study demonstrates that it is possible to perform linear and logistic regression model evaluation while protecting patient confidentiality with theoretical security guarantees. Our implementation and test data are available at https://github.com/G2Lab/privateML/.

目的:线性回归和逻辑回归是群体遗传学中广泛使用的统计技术,用于分析遗传数据和揭示大型遗传数据集中的模式和关联,如确定与特定疾病或性状相关的遗传变异。然而,要从这些研究中获得具有统计学意义的模型,需要从成千上万的患者那里获得大量敏感的基因型和表型信息,这就引起了隐私问题。虽然同态加密等加密技术允许在加密数据上进行计算,为隐私问题提供了潜在的解决方案,但以往利用同态加密的方法并没有解决共享模型的保密性问题,这可能会泄露训练数据的信息:在这项工作中,我们针对六项预测任务提出了一种使用同态加密的线性和逻辑回归安全模型评估方法,其中输入基因型、输出结果和模型参数都是加密的:我们的方法确保了推理过程中不会泄露私人信息,并实现了很高的准确率(所有结果的准确率≥93%),对200个基因组的每次推理耗时不到10秒钟:我们的研究表明,在利用理论安全保证保护患者机密的同时,可以进行高质量的线性和逻辑回归模型评估。我们的实现方法和测试数据可在 https://github.com/G2Lab/privateML/ 上获得。
{"title":"Privacy-preserving model evaluation for logistic and linear regression using homomorphically encrypted genotype data","authors":"Seungwan Hong ,&nbsp;Yoolim A. Choi ,&nbsp;Daniel S. Joo ,&nbsp;Gamze Gürsoy","doi":"10.1016/j.jbi.2024.104678","DOIUrl":"10.1016/j.jbi.2024.104678","url":null,"abstract":"<div><h3>Objective:</h3><p>Linear and logistic regression are widely used statistical techniques in population genetics for analyzing genetic data and uncovering patterns and associations in large genetic datasets, such as identifying genetic variations linked to specific diseases or traits. However, obtaining statistically significant results from these studies requires large amounts of sensitive genotype and phenotype information from thousands of patients, which raises privacy concerns. Although cryptographic techniques such as homomorphic encryption offers a potential solution to the privacy concerns as it allows computations on encrypted data, previous methods leveraging homomorphic encryption have not addressed the confidentiality of shared models, which can leak information about the training data.</p></div><div><h3>Methods:</h3><p>In this work, we present a secure model evaluation method for linear and logistic regression using homomorphic encryption for six prediction tasks, where input genotypes, output phenotypes, and model parameters are all encrypted.</p></div><div><h3>Results:</h3><p>Our method ensures no private information leakage during inference and achieves high accuracy (<span><math><mrow><mo>≥</mo><mn>93</mn><mtext>%</mtext></mrow></math></span> for all outcomes) with each inference taking less than ten seconds for <span><math><mrow><mo>∼</mo><mn>200</mn></mrow></math></span> genomes.</p></div><div><h3>Conclusion:</h3><p>Our study demonstrates that it is possible to perform linear and logistic regression model evaluation while protecting patient confidentiality with theoretical security guarantees. Our implementation and test data are available at <span>https://github.com/G2Lab/privateML/</span><svg><path></path></svg>.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104678"},"PeriodicalIF":4.0,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424000960/pdfft?md5=34588fd687c78201223e9bd8ca85f8fe&pid=1-s2.0-S1532046424000960-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141468241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep self-reconstruction driven joint nonnegative matrix factorization model for identifying multiple genomic imaging associations in complex diseases 深度自我重构驱动的联合非负矩阵因式分解模型,用于识别复杂疾病的多基因组成像关联。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-25 DOI: 10.1016/j.jbi.2024.104684
Jin Deng , Kai Wei , Jiana Fang , Ying Li

Objective

Comprehensive analysis of histopathology images and transcriptomics data enables the identification of candidate biomarkers and multimodal association patterns. Most existing multimodal data association studies are derived from extensions of the joint nonnegative matrix factorization model for identifying complex data associations, which can make full use of clinical prior information. However, the raw data were usually taken as the input without considering the underlying complex multi-subspace structure, influencing the subsequent integration analysis results.

Methods

This study proposed a deep-self reconstructed joint nonnegative matrix factorization (DSRJNMF) model to use self-expressive properties to reconstruct the raw data to characterize the similarity structure associated with clinical labels. Then, the sparsity, orthogonality, and regularization constraints constructed from prior information are added to the DSRJNMF model to determine the sparse set of biologically relevant features across modalities.

Results

The algorithm has been applied to identify the imaging genetic association of triple negative breast cancer (TNBC). Multilevel experimental results demonstrate that the proposed algorithm better estimates potential associations between pathological image features and miRNA-gene and identifies consistent multimodal imaging genetic biomarkers to guide the interpretation of TNBC.

Conclusion

The propose method provides a novel idea of data association analysis oriented to complex diseases.

目的:通过对组织病理学图像和转录组学数据进行综合分析,可以确定候选生物标记物和多模态关联模式。现有的多模态数据关联研究大多源自用于识别复杂数据关联的联合非负矩阵因式分解模型的扩展,该模型可充分利用临床先验信息。然而,原始数据通常作为输入,没有考虑底层复杂的多子空间结构,影响了后续的整合分析结果:本研究提出了一种深度自我重构联合非负矩阵因式分解(DSRJNMF)模型,利用自我表达特性重构原始数据,以表征与临床标签相关的相似性结构。然后,根据先验信息构建的稀疏性、正交性和正则化约束条件被添加到 DSRJNMF 模型中,以确定跨模态的生物相关特征稀疏集:结果:该算法已被应用于识别三阴性乳腺癌(TNBC)的影像遗传关联。多层次实验结果表明,所提出的算法能更好地估计病理图像特征与 miRNA 基因之间的潜在关联,并识别出一致的多模态成像基因生物标志物,以指导 TNBC 的解读:结论:所提出的方法为复杂疾病的数据关联分析提供了一种新思路。
{"title":"Deep self-reconstruction driven joint nonnegative matrix factorization model for identifying multiple genomic imaging associations in complex diseases","authors":"Jin Deng ,&nbsp;Kai Wei ,&nbsp;Jiana Fang ,&nbsp;Ying Li","doi":"10.1016/j.jbi.2024.104684","DOIUrl":"10.1016/j.jbi.2024.104684","url":null,"abstract":"<div><h3>Objective</h3><p>Comprehensive analysis of histopathology images and transcriptomics data enables the identification of candidate biomarkers and multimodal association patterns. Most existing multimodal data association studies are derived from extensions of the joint nonnegative matrix factorization model for identifying complex data associations, which can make full use of clinical prior information. However, the raw data were usually taken as the input without considering the underlying complex multi-subspace structure, influencing the subsequent integration analysis results.</p></div><div><h3>Methods</h3><p>This study proposed a deep-self reconstructed joint nonnegative matrix factorization (DSRJNMF) model to use self-expressive properties to reconstruct the raw data to characterize the similarity structure associated with clinical labels. Then, the sparsity, orthogonality, and regularization constraints constructed from prior information are added to the DSRJNMF model to determine the sparse set of biologically relevant features across modalities.</p></div><div><h3>Results</h3><p>The algorithm has been applied to identify the imaging genetic association of triple negative breast cancer (TNBC). Multilevel experimental results demonstrate that the proposed algorithm better estimates potential associations between pathological image features and miRNA-gene and identifies consistent multimodal imaging genetic biomarkers to guide the interpretation of TNBC.</p></div><div><h3>Conclusion</h3><p>The propose method provides a novel idea of data association analysis oriented to complex diseases.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104684"},"PeriodicalIF":4.0,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141468242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing racial bias in healthcare predictive models: Practical lessons from an empirical evaluation of 30-day hospital readmission models 评估医疗保健预测模型中的种族偏见:从 30 天重新入院模型的实证评估中汲取实用经验。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-24 DOI: 10.1016/j.jbi.2024.104683
H. Echo Wang , Jonathan P. Weiner , Suchi Saria , Harold Lehmann , Hadi Kharrazi

Objective

Despite increased availability of methodologies to identify algorithmic bias, the operationalization of bias evaluation for healthcare predictive models is still limited. Therefore, this study proposes a process for bias evaluation through an empirical assessment of common hospital readmission models. The process includes selecting bias measures, interpretation, determining disparity impact and potential mitigations.

Methods

This retrospective analysis evaluated racial bias of four common models predicting 30-day unplanned readmission (i.e., LACE Index, HOSPITAL Score, and the CMS readmission measure applied as is and retrained). The models were assessed using 2.4 million adult inpatient discharges in Maryland from 2016 to 2019. Fairness metrics that are model-agnostic, easy to compute, and interpretable were implemented and apprised to select the most appropriate bias measures. The impact of changing model’s risk thresholds on these measures was further assessed to guide the selection of optimal thresholds to control and mitigate bias.

Results

Four bias measures were selected for the predictive task: zero-one-loss difference, false negative rate (FNR) parity, false positive rate (FPR) parity, and generalized entropy index. Based on these measures, the HOSPITAL score and the retrained CMS measure demonstrated the lowest racial bias. White patients showed a higher FNR while Black patients resulted in a higher FPR and zero-one-loss. As the models’ risk threshold changed, trade-offs between models’ fairness and overall performance were observed, and the assessment showed all models’ default thresholds were reasonable for balancing accuracy and bias.

Conclusions

This study proposes an Applied Framework to Assess Fairness of Predictive Models (AFAFPM) and demonstrates the process using 30-day hospital readmission model as the example. It suggests the feasibility of applying algorithmic bias assessment to determine optimized risk thresholds so that predictive models can be used more equitably and accurately. It is evident that a combination of qualitative and quantitative methods and a multidisciplinary team are necessary to identify, understand and respond to algorithm bias in real-world healthcare settings. Users should also apply multiple bias measures to ensure a more comprehensive, tailored, and balanced view. The results of bias measures, however, must be interpreted with caution and consider the larger operational, clinical, and policy context.

目的:尽管识别算法偏差的方法越来越多,但医疗预测模型偏差评估的可操作性仍然有限。因此,本研究通过对常见的医院再入院模型进行实证评估,提出了一套偏差评估流程。该流程包括选择偏差测量、解释、确定差异影响和潜在缓解措施:这项回顾性分析评估了预测 30 天非计划再入院的四种常见模型(即 LACE 指数、HOSPITAL 评分和 CMS 再入院衡量标准的原样应用和再训练)的种族偏差。这些模型使用 2016 年至 2019 年马里兰州 240 万成人住院出院病例进行评估。采用了与模型无关、易于计算和解释的公平性指标,并对其进行了评估,以选择最合适的偏差测量方法。进一步评估了改变模型风险阈值对这些指标的影响,以指导选择最佳阈值来控制和减轻偏差:为预测任务选择了四种偏差测量方法:零一损失差、假阴性率(FNR)奇偶性、假阳性率(FPR)奇偶性和广义熵指数。根据这些指标,HOSPITAL 评分和重新训练的 CMS 指标显示出最低的种族偏差。白人患者的误诊率较高,而黑人患者的误诊率和零损失率较高。随着模型风险阈值的变化,模型的公平性和整体性能之间的权衡被观察到,评估显示所有模型的默认阈值在平衡准确性和偏差方面都是合理的:本研究提出了评估预测模型公平性的应用框架(AFAFPM),并以 30 天再入院模型为例演示了这一过程。它提出了应用算法偏差评估来确定优化风险阈值的可行性,以便更公平、更准确地使用预测模型。显然,要识别、理解和应对现实世界医疗环境中的算法偏差,就必须结合定性和定量方法以及多学科团队。用户还应采用多种偏差测量方法,以确保获得更全面、更有针对性、更平衡的观点。不过,在解释偏差测量结果时必须谨慎,并考虑到更大的操作、临床和政策背景。
{"title":"Assessing racial bias in healthcare predictive models: Practical lessons from an empirical evaluation of 30-day hospital readmission models","authors":"H. Echo Wang ,&nbsp;Jonathan P. Weiner ,&nbsp;Suchi Saria ,&nbsp;Harold Lehmann ,&nbsp;Hadi Kharrazi","doi":"10.1016/j.jbi.2024.104683","DOIUrl":"10.1016/j.jbi.2024.104683","url":null,"abstract":"<div><h3>Objective</h3><p>Despite increased availability of methodologies to identify algorithmic bias, the operationalization of bias evaluation for healthcare predictive models is still limited. Therefore, this study proposes a process for bias evaluation through an empirical assessment of common hospital readmission models. The process includes selecting bias measures, interpretation, determining disparity impact and potential mitigations.</p></div><div><h3>Methods</h3><p>This retrospective analysis evaluated racial bias of four common models predicting 30-day unplanned readmission (i.e., LACE Index, HOSPITAL Score, and the CMS readmission measure applied as is and retrained). The models were assessed using 2.4 million adult inpatient discharges in Maryland from 2016 to 2019. Fairness metrics that are model-agnostic, easy to compute, and interpretable were implemented and apprised to select the most appropriate bias measures. The impact of changing model’s risk thresholds on these measures was further assessed to guide the selection of optimal thresholds to control and mitigate bias.</p></div><div><h3>Results</h3><p>Four bias measures were selected for the predictive task: zero-one-loss difference, false negative rate (FNR) parity, false positive rate (FPR) parity, and generalized entropy index. Based on these measures, the HOSPITAL score and the retrained CMS measure demonstrated the lowest racial bias. White patients showed a higher FNR while Black patients resulted in a higher FPR and zero-one-loss. As the models’ risk threshold changed, trade-offs between models’ fairness and overall performance were observed, and the assessment showed all models’ default thresholds were reasonable for balancing accuracy and bias.</p></div><div><h3>Conclusions</h3><p>This study proposes an Applied Framework to Assess Fairness of Predictive Models (AFAFPM) and demonstrates the process using 30-day hospital readmission model as the example. It suggests the feasibility of applying algorithmic bias assessment to determine optimized risk thresholds so that predictive models can be used more equitably and accurately. It is evident that a combination of qualitative and quantitative methods and a multidisciplinary team are necessary to identify, understand and respond to algorithm bias in real-world healthcare settings. Users should also apply multiple bias measures to ensure a more comprehensive, tailored, and balanced view. The results of bias measures, however, must be interpreted with caution and consider the larger operational, clinical, and policy context.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104683"},"PeriodicalIF":4.0,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141457163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-invasive prediction of massive transfusion during surgery using intraoperative hemodynamic monitoring data 利用术中血流动力学监测数据对手术中大量输血进行无创预测。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-22 DOI: 10.1016/j.jbi.2024.104680
Doyun Kwon , Young Mi Jung , Hyung-Chul Lee , Tae Kyong Kim , Kwangsoo Kim , Garam Lee , Dokyoon Kim , Seung-Bo Lee , Seung Mi Lee

Objective

Failure to receive prompt blood transfusion leads to severe complications if massive bleeding occurs during surgery. For the timely preparation of blood products, predicting the possibility of massive transfusion (MT) is essential to decrease morbidity and mortality. This study aimed to develop a model for predicting MT 10 min in advance using non-invasive bio-signal waveforms that change in real-time.

Methods

In this retrospective study, we developed a deep learning-based algorithm (DLA) to predict intraoperative MT within 10 min. MT was defined as the transfusion of 3 or more units of red blood cells within an hour. The datasets consisted of 18,135 patients who underwent surgery at Seoul National University Hospital (SNUH) for model development and internal validation and 621 patients who underwent surgery at the Boramae Medical Center (BMC) for external validation. We constructed the DLA by using features extracted from plethysmography (collected at 500 Hz) and hematocrit measured during surgery.

Results

Among 18,135 patients in SNUH and 621 patients in BMC, 265 patients (1.46%) and 14 patients (2.25%) received MT during surgery, respectively. The area under the receiver operating characteristic curve (AUROC) of DLA predicting intraoperative MT before 10 min was 0.962 (95% confidence interval [CI], 0.948–0.974) in internal validation and 0.922 (95% CI, 0.882–0.959) in external validation, respectively.

Conclusion

The DLA can successfully predict intraoperative MT using non-invasive bio-signal waveforms.

目的:如果在手术过程中发生大出血,不能及时输血将导致严重的并发症。为了及时准备血制品,预测大量输血(MT)的可能性对于降低发病率和死亡率至关重要。本研究旨在利用实时变化的无创生物信号波形,开发一种提前 10 分钟预测 MT 的模型:在这项回顾性研究中,我们开发了一种基于深度学习的算法(DLA),用于在 10 分钟内预测术中 MT。MT定义为一小时内输注3个或更多单位的红细胞。数据集包括在首尔大学医院(SNUH)接受手术的18135名患者,用于模型开发和内部验证;以及在Boramae医疗中心(BMC)接受手术的621名患者,用于外部验证。我们利用从胸透(以 500 Hz 频率采集)和手术中测量的血细胞比容中提取的特征构建了 DLA:在上海南方医科大学附属南方医院的18135名患者和北京协和医院的621名患者中,分别有265名患者(1.46%)和14名患者(2.25%)在手术中接受了MT。在内部验证和外部验证中,DLA 预测 10 分钟前术中 MT 的接收器操作特征曲线下面积(AUROC)分别为 0.962(95% 置信区间 [CI],0.948-0.974)和 0.922(95% CI,0.882-0.959):结论:DLA 可以利用无创生物信号波形成功预测术中 MT。
{"title":"Non-invasive prediction of massive transfusion during surgery using intraoperative hemodynamic monitoring data","authors":"Doyun Kwon ,&nbsp;Young Mi Jung ,&nbsp;Hyung-Chul Lee ,&nbsp;Tae Kyong Kim ,&nbsp;Kwangsoo Kim ,&nbsp;Garam Lee ,&nbsp;Dokyoon Kim ,&nbsp;Seung-Bo Lee ,&nbsp;Seung Mi Lee","doi":"10.1016/j.jbi.2024.104680","DOIUrl":"10.1016/j.jbi.2024.104680","url":null,"abstract":"<div><h3>Objective</h3><p>Failure to receive prompt blood transfusion leads to severe complications if massive bleeding occurs during surgery. For the timely preparation of blood products, predicting the possibility of massive transfusion (MT) is essential to decrease morbidity and mortality. This study aimed to develop a model for predicting MT 10 min in advance using non-invasive bio-signal waveforms that change in real-time.</p></div><div><h3>Methods</h3><p>In this retrospective study, we developed a deep learning-based algorithm (DLA) to predict intraoperative MT within 10 min. MT was defined as the transfusion of 3 or more units of red blood cells within an hour. The datasets consisted of 18,135 patients who underwent surgery at Seoul National University Hospital (SNUH) for model development and internal validation and 621 patients who underwent surgery at the Boramae Medical Center (BMC) for external validation. We constructed the DLA by using features extracted from plethysmography (collected at 500 Hz) and hematocrit measured during surgery.</p></div><div><h3>Results</h3><p>Among 18,135 patients in SNUH and 621 patients in BMC, 265 patients (1.46%) and 14 patients (2.25%) received MT during surgery, respectively. The area under the receiver operating characteristic curve (AUROC) of DLA predicting intraoperative MT before 10 min was 0.962 (95% confidence interval [CI], 0.948–0.974) in internal validation and 0.922 (95% CI, 0.882–0.959) in external validation, respectively.</p></div><div><h3>Conclusion</h3><p>The DLA can successfully predict intraoperative MT using non-invasive bio-signal waveforms.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"156 ","pages":"Article 104680"},"PeriodicalIF":4.0,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141446258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing inclusion and representativeness on digital platforms for health education: Evidence from YouTube 评估健康教育数字平台的包容性和代表性:来自 YouTube 的证据。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-15 DOI: 10.1016/j.jbi.2024.104669

Background

Studies confirm that significant biases exist in online recommendation platforms, exacerbating pre-existing disparities and leading to less-than-optimal outcomes for underrepresented demographics. We study issues of bias in inclusion and representativeness in the context of healthcare information disseminated via videos on the YouTube social media platform, a widely used online channel for multi-media rich information. With one in three US adults using the Internet to learn about a health concern, it is critical to assess inclusivity and representativeness regarding how health information is disseminated by digital platforms such as YouTube.

Methods

Leveraging methods from fair machine learning (ML), natural language processing and voice and facial recognition methods, we examine inclusivity and representativeness of video content presenters using a large corpus of videos and their metadata on a chronic condition (diabetes) extracted from the YouTube platform. Regression models are used to determine whether presenter demographics impact video popularity, measured by the video’s average daily view count. A video that generates a higher view count is considered to be more popular.

Results

The voice and facial recognition methods predicted the gender and race of the presenter with reasonable success. Gender is predicted through voice recognition (accuracy = 78%, AUC = 76%), while the gender and race predictions use facial recognition (accuracy = 93%, AUC = 92% and accuracy = 82%, AUC = 80%, respectively). The gender of the presenter is more significant for video views only when the face of the presenter is not visible while videos with male presenters with no face visibility have a positive relationship with view counts. Furthermore, videos with white and male presenters have a positive influence on view counts while videos with female and non − white group have high view counts.

Conclusion

Presenters’ demographics do have an influence on average daily view count of videos viewed on social media platforms as shown by advanced voice and facial recognition algorithms used for assessing inclusion and representativeness of the video content. Future research can explore short videos and those at the channel level because popularity of the channel name and the number of videos associated with that channel do have an influence on view counts.

背景:研究证实,在线推荐平台存在严重的偏差,加剧了原有的差异,并导致代表性不足的人群无法获得理想的结果。我们研究了在 YouTube 社交媒体平台上通过视频传播医疗保健信息时的包容性和代表性偏差问题,YouTube 是一个广泛使用的多媒体信息在线渠道。每三名美国成年人中就有一人使用互联网了解健康问题,因此评估 YouTube 等数字平台如何传播健康信息的包容性和代表性至关重要:利用公平机器学习 (ML)、自然语言处理以及语音和面部识别方法,我们使用从 YouTube 平台提取的有关慢性病(糖尿病)的大型视频语料库及其元数据,对视频内容主持人的包容性和代表性进行了研究。我们使用回归模型来确定主持人的人口统计学特征是否会影响视频的受欢迎程度(以视频的日平均观看次数来衡量)。观看次数越高的视频越受欢迎:结果:语音识别和面部识别方法成功预测了主持人的性别和种族。性别是通过语音识别预测的(准确率 = 78 %,AUC = 76 %),而性别和种族则是通过面部识别预测的(准确率 = 93 %,AUC = 92 %,准确率 = 82 %,AUC = 80 %)。只有当主持人的脸部不可见时,主持人的性别才会对视频观看次数产生较大影响,而主持人为男性且脸部不可见的视频则与观看次数呈正相关。此外,白人和男性主持人的视频对观看次数有积极影响,而女性和非白人主持人的视频观看次数较高:主持人的人口统计学特征确实对社交媒体平台上视频的日均观看次数有影响,用于评估视频内容包容性和代表性的先进语音和面部识别算法也证明了这一点。未来的研究可以探讨短视频和频道层面的视频,因为频道名称的流行程度和与该频道相关的视频数量确实会对观看次数产生影响。
{"title":"Assessing inclusion and representativeness on digital platforms for health education: Evidence from YouTube","authors":"","doi":"10.1016/j.jbi.2024.104669","DOIUrl":"10.1016/j.jbi.2024.104669","url":null,"abstract":"<div><h3>Background</h3><p>Studies confirm that significant biases exist in online recommendation platforms, exacerbating pre-existing disparities and leading to less-than-optimal outcomes for underrepresented demographics. We study issues of bias in inclusion and representativeness in the context of healthcare information disseminated via videos on the YouTube social media platform, a widely used online channel for multi-media rich information. With one in three US adults using the Internet to learn about a health concern, it is critical to assess inclusivity and representativeness regarding how health information is disseminated by digital platforms such as YouTube.</p></div><div><h3>Methods</h3><p>Leveraging methods from fair machine learning (ML), natural language processing and voice and facial recognition methods, we examine inclusivity and representativeness of video content presenters using a large corpus of videos and their metadata on a chronic condition (diabetes) extracted from the YouTube platform. Regression models are used to determine whether presenter demographics impact video popularity, measured by the video’s average daily view count. A video that generates a higher view count is considered to be more popular.</p></div><div><h3>Results</h3><p>The voice and facial recognition methods predicted the gender and race of the presenter with reasonable success. Gender is predicted through voice recognition (accuracy = 78%, AUC = 76%), while the gender and race predictions use facial recognition (accuracy = 93%, AUC = 92% and accuracy = 82%, AUC = 80%, respectively). The gender of the presenter is more significant for video views only when the face of the presenter is not visible while videos with male presenters with no face visibility have a positive relationship with view counts. Furthermore, videos with white and male presenters have a positive influence on view counts while videos with female and non − white group have high view counts.</p></div><div><h3>Conclusion</h3><p>Presenters’ demographics do have an influence on average daily view count of videos viewed on social media platforms as shown by advanced voice and facial recognition algorithms used for assessing inclusion and representativeness of the video content. Future research can explore short videos and those at the channel level because popularity of the channel name and the number of videos associated with that channel do have an influence on view counts.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104669"},"PeriodicalIF":4.0,"publicationDate":"2024-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S153204642400087X/pdfft?md5=68fb3de7672785c8592be295997c2ab8&pid=1-s2.0-S153204642400087X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141330975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Biomedical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1