首页 > 最新文献

Journal of Biomedical Informatics最新文献

英文 中文
How can we reward you? A compliance and reward ontology (CaRO) for eliciting quantitative reward rules for engagement in mHealth app and healthy behaviors 我们如何奖励您?合规与奖励本体论 (CaRO),用于为参与移动医疗应用和健康行为制定量化奖励规则。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-15 DOI: 10.1016/j.jbi.2024.104655
Mor Peleg , Nicole Veggiotti , Lucia Sacchi , Szymon Wilk

Objective

When developing mHealth apps with point reward systems, knowledge engineers and domain experts should define app requirements capturing quantitative reward patterns that reflect patient compliance with health behaviors. This is a difficult task, and they could be aided by an ontology that defines systematically quantitative behavior goals that address more than merely the recommended behavior but also rewards for partial compliance or practicing the behavior more than recommended. No ontology and algorithm exist for defining point rewards systematically.

Methods

We developed an OWL ontology for point rewards that leverages the Basic Formal Ontology, the Behaviour Change Intervention Ontology and the Gamification Domain Ontology. This Compliance and Reward Ontology (CaRO) allows defining temporal elementary reward patterns for single and multiple sessions of practicing a behavior. These could be assembled to define more complex temporal patterns for persistence behavior over longer time intervals as well as logical combinations of simpler reward patterns. We also developed an algorithm for calculating the points that should be rewarded to users, given data regarding their actual performance. A natural language generation algorithm generates from ontology instances app requirements in the form of user stories. To assess the usefulness of the ontology and algorithms, information system students who are trained to be system analysts/knowledge engineers evaluated whether the ontology and algorithms can improve the requirement elicitation of point rewards for compliance patterns more completely and correctly.

Results

For single-session rewards, the ontology improved formulation of two of the six requirements as well as the total time for specifying them. For multi-session rewards, the ontology improved formulation of five of the 11 requirements.

Conclusion

CaRO is a first attempt of its kind, and it covers all of the cases of compliance and reward pattern definitions that were needed for a full-scale system that was developed as part of a large European project. The ontology and algorithm are available at https://github.com/capable-project/rewards.

目的:在开发带有积分奖励系统的移动医疗应用程序时,知识工程师和领域专家应定义应用程序要求,捕捉反映患者健康行为依从性的量化奖励模式。这是一项艰巨的任务,本体论可以帮助他们系统地定义定量行为目标,这些目标不仅涉及推荐行为,还包括对部分遵从或超出推荐行为的奖励。目前还没有系统定义积分奖励的本体和算法:我们利用基本形式本体、行为改变干预本体和游戏化领域本体,为积分奖励开发了一个 OWL 本体。这种 "遵守与奖励本体论"(CaRO)可定义单次或多次行为练习的时间基本奖励模式。这些模式可以组合在一起,为更长的时间间隔内的坚持行为定义更复杂的时间模式,以及简单奖励模式的逻辑组合。我们还开发了一种算法,用于根据用户的实际表现数据计算应奖励给用户的积分。自然语言生成算法可从本体实例中生成用户故事形式的应用程序需求。为了评估本体论和算法的实用性,接受过系统分析员/知识工程师培训的信息系统专业学生评估了本体论和算法是否能更完整、更正确地改进针对合规模式的积分奖励需求激发:就单次奖励而言,本体改进了六项要求中两项要求的表述,并缩短了表述要求的总时间。对于多环节奖励,本体论改进了 11 项要求中 5 项要求的表述:CaRO 是同类产品中的首次尝试,它涵盖了作为一个大型欧洲项目的一部分而开发的全面系统所需的所有合规情况和奖励模式定义。本体和算法见 https://github.com/capable-project/rewards。
{"title":"How can we reward you? A compliance and reward ontology (CaRO) for eliciting quantitative reward rules for engagement in mHealth app and healthy behaviors","authors":"Mor Peleg ,&nbsp;Nicole Veggiotti ,&nbsp;Lucia Sacchi ,&nbsp;Szymon Wilk","doi":"10.1016/j.jbi.2024.104655","DOIUrl":"10.1016/j.jbi.2024.104655","url":null,"abstract":"<div><h3>Objective</h3><p>When developing mHealth apps with point reward systems, knowledge engineers and domain experts should define app requirements capturing quantitative reward patterns that reflect patient compliance with health behaviors. This is a difficult task, and they could be aided by an ontology that defines systematically quantitative behavior goals that address more than merely the recommended behavior but also rewards for partial compliance or practicing the behavior more than recommended. No ontology and algorithm exist for defining point rewards systematically.</p></div><div><h3>Methods</h3><p>We developed an OWL ontology for point rewards that leverages the Basic Formal Ontology, the Behaviour Change Intervention Ontology and the Gamification Domain Ontology. This Compliance and Reward Ontology (CaRO) allows defining temporal elementary reward patterns for single and multiple sessions of practicing a behavior. These could be assembled to define more complex temporal patterns for persistence behavior over longer time intervals as well as logical combinations of simpler reward patterns. We also developed an algorithm for calculating the points that should be rewarded to users, given data regarding their actual performance. A natural language generation algorithm generates from ontology instances app requirements in the form of user stories. To assess the usefulness of the ontology and algorithms, information system students who are trained to be system analysts/knowledge engineers evaluated whether the ontology and algorithms can improve the requirement elicitation of point rewards for compliance patterns more completely and correctly.</p></div><div><h3>Results</h3><p>For single-session rewards, the ontology improved formulation of two of the six requirements as well as the total time for specifying them. For multi-session rewards, the ontology improved formulation of five of the 11 requirements.</p></div><div><h3>Conclusion</h3><p>CaRO is a first attempt of its kind, and it covers all of the cases of compliance and reward pattern definitions that were needed for a full-scale system that was developed as part of a large European project. The ontology and algorithm are available at <span>https://github.com/capable-project/rewards.</span><svg><path></path></svg></p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104655"},"PeriodicalIF":4.5,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140956844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A roadmap to artificial intelligence (AI): Methods for designing and building AI ready data to promote fairness 人工智能(AI)路线图:设计和构建人工智能就绪数据的方法,以促进公平。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-11 DOI: 10.1016/j.jbi.2024.104654
Farah Kidwai-Khan , Rixin Wang , Melissa Skanderson , Cynthia A. Brandt , Samah Fodeh , Julie A. Womack

Objectives

We evaluated methods for preparing electronic health record data to reduce bias before applying artificial intelligence (AI).

Methods

We created methods for transforming raw data into a data framework for applying machine learning and natural language processing techniques for predicting falls and fractures. Strategies such as inclusion and reporting for multiple races, mixed data sources such as outpatient, inpatient, structured codes, and unstructured notes, and addressing missingness were applied to raw data to promote a reduction in bias. The raw data was carefully curated using validated definitions to create data variables such as age, race, gender, and healthcare utilization. For the formation of these variables, clinical, statistical, and data expertise were used. The research team included a variety of experts with diverse professional and demographic backgrounds to include diverse perspectives.

Results

For the prediction of falls, information extracted from radiology reports was converted to a matrix for applying machine learning. The processing of the data resulted in an input of 5,377,673 reports to the machine learning algorithm, out of which 45,304 were flagged as positive and 5,332,369 as negative for falls. Processed data resulted in lower missingness and a better representation of race and diagnosis codes. For fractures, specialized algorithms extracted snippets of text around keywork “femoral” from dual x-ray absorptiometry (DXA) scans to identify femoral neck T-scores that are important for predicting fracture risk. The natural language processing algorithms yielded 98% accuracy and 2% error rate The methods to prepare data for input to artificial intelligence processes are reproducible and can be applied to other studies.

Conclusion

The life cycle of data from raw to analytic form includes data governance, cleaning, management, and analysis. When applying artificial intelligence methods, input data must be prepared optimally to reduce algorithmic bias, as biased output is harmful. Building AI-ready data frameworks that improve efficiency can contribute to transparency and reproducibility. The roadmap for the application of AI involves applying specialized techniques to input data, some of which are suggested here. This study highlights data curation aspects to be considered when preparing data for the application of artificial intelligence to reduce bias.

目的:我们评估了在应用人工智能(AI)之前准备电子健康记录数据以减少偏差的方法:我们评估了在应用人工智能(AI)之前准备电子健康记录数据以减少偏差的方法:我们创建了将原始数据转化为数据框架的方法,以便应用机器学习和自然语言处理技术预测跌倒和骨折。为了减少偏差,我们对原始数据采用了多种策略,如纳入和报告多个种族、混合数据源(如门诊病人、住院病人、结构化代码和非结构化笔记)以及解决遗漏问题。原始数据经过仔细整理,使用有效的定义来创建年龄、种族、性别和医疗保健使用情况等数据变量。在形成这些变量时,使用了临床、统计和数据方面的专业知识。研究团队包括不同专业和人口背景的专家,以纳入不同的观点:为了预测跌倒,从放射报告中提取的信息被转换成矩阵,用于机器学习。数据处理后,机器学习算法输入了 5,377,673 份报告,其中 45,304 份报告被标记为跌倒阳性,5,332,369 份报告被标记为跌倒阴性。经过处理的数据遗漏率更低,种族和诊断代码的代表性更好。在骨折方面,专门的算法从双 X 射线吸收测量(DXA)扫描中提取了关键字 "股骨 "周围的文本片段,以确定对预测骨折风险非常重要的股骨颈 T 值。自然语言处理算法的准确率为 98%,错误率为 2%。为输入人工智能流程准备数据的方法具有可重复性,可应用于其他研究:从原始数据到分析数据的生命周期包括数据治理、清理、管理和分析。在应用人工智能方法时,输入数据必须经过最佳准备,以减少算法偏差,因为有偏差的输出是有害的。建立可提高效率的人工智能就绪数据框架有助于提高透明度和可重复性。应用人工智能的路线图涉及对输入数据应用专门技术,这里提出了其中一些技术。本研究强调了为人工智能应用准备数据时应考虑的数据整理方面,以减少偏差。
{"title":"A roadmap to artificial intelligence (AI): Methods for designing and building AI ready data to promote fairness","authors":"Farah Kidwai-Khan ,&nbsp;Rixin Wang ,&nbsp;Melissa Skanderson ,&nbsp;Cynthia A. Brandt ,&nbsp;Samah Fodeh ,&nbsp;Julie A. Womack","doi":"10.1016/j.jbi.2024.104654","DOIUrl":"10.1016/j.jbi.2024.104654","url":null,"abstract":"<div><h3>Objectives</h3><p>We evaluated methods for preparing electronic health record data to reduce bias before applying artificial intelligence (AI).</p></div><div><h3>Methods</h3><p>We created methods for transforming raw data into a data framework for applying machine learning and natural language processing techniques for predicting falls and fractures. Strategies such as inclusion and reporting for multiple races, mixed data sources such as outpatient, inpatient, structured codes, and unstructured notes, and addressing missingness were applied to raw data to promote a reduction in bias. The raw data was carefully curated using validated definitions to create data variables such as age, race, gender, and healthcare utilization. For the formation of these variables, clinical, statistical, and data expertise were used. The research team included a variety of experts with diverse professional and demographic backgrounds to include diverse perspectives.</p></div><div><h3>Results</h3><p>For the prediction of falls, information extracted from radiology reports was converted to a matrix for applying machine learning. The processing of the data resulted in an input of 5,377,673 reports to the machine learning algorithm, out of which 45,304 were flagged as positive and 5,332,369 as negative for falls. Processed data resulted in lower missingness and a better representation of race and diagnosis codes. For fractures, specialized algorithms extracted snippets of text around keywork “femoral” from dual x-ray absorptiometry (DXA) scans to identify femoral neck T-scores that are important for predicting fracture risk. The natural language processing algorithms yielded 98% accuracy and 2% error rate The methods to prepare data for input to artificial intelligence processes are reproducible and can be applied to other studies.</p></div><div><h3>Conclusion</h3><p>The life cycle of data from raw to analytic form includes data governance, cleaning, management, and analysis. When applying artificial intelligence methods, input data must be prepared optimally to reduce algorithmic bias, as biased output is harmful. Building AI-ready data frameworks that improve efficiency can contribute to transparency and reproducibility. The roadmap for the application of AI involves applying specialized techniques to input data, some of which are suggested here. This study highlights data curation aspects to be considered when preparing data for the application of artificial intelligence to reduce bias.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104654"},"PeriodicalIF":4.5,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140915721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Opportunities for incorporating intersectionality into biomedical informatics 将交叉性纳入生物医学信息学的机遇。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-10 DOI: 10.1016/j.jbi.2024.104653
Oliver J. Bear Don't Walk IV , Amandalynne Paullada , Avery Everhart , Reggie Casanova-Perez , Trevor Cohen , Tiffany Veinot

Many approaches in biomedical informatics (BMI) rely on the ability to define, gather, and manipulate biomedical data to support health through a cyclical research-practice lifecycle. Researchers within this field are often fortunate to work closely with healthcare and public health systems to influence data generation and capture and have access to a vast amount of biomedical data. Many informaticists also have the expertise to engage with stakeholders, develop new methods and applications, and influence policy. However, research and policy that explicitly seeks to address the systemic drivers of health would more effectively support health. Intersectionality is a theoretical framework that can facilitate such research. It holds that individual human experiences reflect larger socio-structural level systems of privilege and oppression, and cannot be truly understood if these systems are examined in isolation. Intersectionality explicitly accounts for the interrelated nature of systems of privilege and oppression, providing a lens through which to examine and challenge inequities. In this paper, we propose intersectionality as an intervention into how we conduct BMI research. We begin by discussing intersectionality’s history and core principles as they apply to BMI. We then elaborate on the potential for intersectionality to stimulate BMI research. Specifically, we posit that our efforts in BMI to improve health should address intersectionality’s five key considerations: (1) systems of privilege and oppression that shape health; (2) the interrelated nature of upstream health drivers; (3) the nuances of health outcomes within groups; (4) the problematic and power-laden nature of categories that we assign to people in research and in society; and (5) research to inform and support social change.

生物医学信息学(BMI)的许多方法都依赖于定义、收集和处理生物医学数据的能力,以便通过研究与实践的循环生命周期为健康提供支持。该领域的研究人员通常有幸与医疗保健和公共卫生系统密切合作,以影响数据的生成和捕获,并能获取大量生物医学数据。许多信息学家还拥有与利益相关者合作、开发新方法和应用以及影响政策的专业知识。然而,明确寻求解决健康的系统性驱动因素的研究和政策将更有效地支持健康。交叉性是一个可以促进此类研究的理论框架。它认为,人类的个人经历反映了特权和压迫等更大的社会结构层面的系统,如果孤立地研究这些系统,就无法真正理解这些经历。交叉性明确说明了特权和压迫体系的相互关联性,为研究和挑战不平等提供了一个视角。在本文中,我们建议将交叉性作为对如何开展 BMI 研究的一种干预。我们首先讨论交叉性的历史和核心原则,因为它们适用于 BMI。然后,我们阐述了交叉性在促进 BMI 研究方面的潜力。具体来说,我们认为,我们在 BMI 方面为改善健康所做的努力应考虑到交叉性的五个关键因素:1)影响健康的特权和压迫体系;2)上游健康驱动因素的相互关联性;3)群体内健康结果的细微差别;4)我们在研究和社会中对人的分类存在问题和权力色彩;5)研究为社会变革提供信息和支持。
{"title":"Opportunities for incorporating intersectionality into biomedical informatics","authors":"Oliver J. Bear Don't Walk IV ,&nbsp;Amandalynne Paullada ,&nbsp;Avery Everhart ,&nbsp;Reggie Casanova-Perez ,&nbsp;Trevor Cohen ,&nbsp;Tiffany Veinot","doi":"10.1016/j.jbi.2024.104653","DOIUrl":"10.1016/j.jbi.2024.104653","url":null,"abstract":"<div><p>Many approaches in biomedical informatics (BMI) rely on the ability to define, gather, and manipulate biomedical data to support health through a cyclical research-practice lifecycle. Researchers within this field are often fortunate to work closely with healthcare and public health systems to influence data generation and capture and have access to a vast amount of biomedical data. Many informaticists also have the expertise to engage with stakeholders, develop new methods and applications, and influence policy. However, research and policy that explicitly seeks to address the systemic drivers of health would more effectively support health. Intersectionality is a theoretical framework that can facilitate such research. It holds that individual human experiences reflect larger socio-structural level systems of privilege and oppression, and cannot be truly understood if these systems are examined in isolation. Intersectionality explicitly accounts for the interrelated nature of systems of privilege and oppression, providing a lens through which to examine and challenge inequities. In this paper, we propose intersectionality as an intervention into how we conduct BMI research. We begin by discussing intersectionality’s history and core principles as they apply to BMI. We then elaborate on the potential for intersectionality to stimulate BMI research. Specifically, we posit that our efforts in BMI to improve health should address intersectionality’s five key considerations: (1) systems of privilege and oppression that shape health; (2) the interrelated nature of upstream health drivers; (3) the nuances of health outcomes within groups; (4) the problematic and power-laden nature of categories that we assign to people in research and in society; and (5) research to inform and support social change.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104653"},"PeriodicalIF":4.5,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140908791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cost prediction for ischemic heart disease hospitalization: Interpretable feature extraction using network analysis 缺血性心脏病住院费用预测:使用网络分析提取可解释的特征。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-06 DOI: 10.1016/j.jbi.2024.104652
Kaidi Gong , Yajun Xue , Lingyun Kong , Xiaolei Xie

Objectives

: Ischemic heart disease (IHD) is a significant contributor to global mortality and disability, imposing a substantial social and economic burden on individuals and healthcare systems. To enhance the efficient allocation of medical resources and ultimately benefit a larger population, accurate prediction of healthcare costs is crucial.

Methods:

We developed an interpretable IHD hospitalization cost prediction model that integrates network analysis with machine learning. Specifically, our network-enhanced model extracts explainable features by leveraging a diagnosis-procedure concurrence network and advanced graph kernel techniques, facilitating the capture of intricate relationships between medical codes.

Results:

The proposed model achieved an R2 of 0.804 ± 0.008 and a root mean square error (RMSE) of 17,076 ± 420 CNY on the temporal validation dataset, demonstrating comparable performance to the model employing less interpretable code embedding features (R2: 0.800 ± 0.008; RMSE: 17,279 ± 437 CNY) and the hybrid graph isomorphism network (R2: 0.802 ± 0.007; RMSE: 17,249 ± 387 CNY). The interpretation of the network-enhanced model assisted in pinpointing specific diagnoses and procedures associated with higher hospitalization costs, including acute kidney injury, permanent atrial fibrillation, intra-aortic balloon bump, and temporary pacemaker placement, among others.

Conclusion:

Our analysis results demonstrate that the proposed model strikes a balance between predictive accuracy and interpretability. It aids in identifying specific diagnoses and procedures associated with higher hospitalization costs, underscoring its potential to support intelligent management of IHD.

目标:缺血性心脏病(IHD)是导致全球死亡和残疾的重要因素,给个人和医疗系统带来了巨大的社会和经济负担。为了提高医疗资源的分配效率并最终使更多的人受益,准确预测医疗成本至关重要:我们开发了一种可解释的 IHD 住院费用预测模型,该模型将网络分析与机器学习相结合。具体来说,我们的网络增强模型通过利用诊断-程序并发网络和先进的图核技术提取可解释的特征,便于捕捉医疗代码之间错综复杂的关系:在时间验证数据集上,所提模型的 R2 值为 0.804 ± 0.008,均方根误差(RMSE)为 17,076 ± 420 元人民币,与采用较少可解释代码嵌入特征的模型(R2:0.800 ± 0.008;RMSE:17,279 ± 437 元人民币)和混合图同构网络(R2:0.802 ± 0.007;RMSE:17,249 ± 387 元人民币)性能相当。对网络增强模型的解释有助于确定与较高住院费用相关的特定诊断和手术,包括急性肾损伤、永久性心房颤动、主动脉内球囊撞击和临时起搏器置入等:我们的分析结果表明,所提出的模型在预测准确性和可解释性之间取得了平衡。结论:我们的分析结果表明,所提出的模型在预测准确性和可解释性之间取得了平衡,有助于确定与较高住院费用相关的特定诊断和手术,突出了其支持智能化管理 IHD 的潜力。
{"title":"Cost prediction for ischemic heart disease hospitalization: Interpretable feature extraction using network analysis","authors":"Kaidi Gong ,&nbsp;Yajun Xue ,&nbsp;Lingyun Kong ,&nbsp;Xiaolei Xie","doi":"10.1016/j.jbi.2024.104652","DOIUrl":"10.1016/j.jbi.2024.104652","url":null,"abstract":"<div><h3>Objectives</h3><p>: Ischemic heart disease (IHD) is a significant contributor to global mortality and disability, imposing a substantial social and economic burden on individuals and healthcare systems. To enhance the efficient allocation of medical resources and ultimately benefit a larger population, accurate prediction of healthcare costs is crucial.</p></div><div><h3>Methods:</h3><p>We developed an interpretable IHD hospitalization cost prediction model that integrates network analysis with machine learning. Specifically, our network-enhanced model extracts explainable features by leveraging a diagnosis-procedure concurrence network and advanced graph kernel techniques, facilitating the capture of intricate relationships between medical codes.</p></div><div><h3>Results:</h3><p>The proposed model achieved an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.804 ± 0.008 and a root mean square error (RMSE) of 17,076 ± 420 CNY on the temporal validation dataset, demonstrating comparable performance to the model employing less interpretable code embedding features (<span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span>: 0.800 ± 0.008; RMSE: 17,279 ± 437 CNY) and the hybrid graph isomorphism network (<span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span>: 0.802 ± 0.007; RMSE: 17,249 ± 387 CNY). The interpretation of the network-enhanced model assisted in pinpointing specific diagnoses and procedures associated with higher hospitalization costs, including acute kidney injury, permanent atrial fibrillation, intra-aortic balloon bump, and temporary pacemaker placement, among others.</p></div><div><h3>Conclusion:</h3><p>Our analysis results demonstrate that the proposed model strikes a balance between predictive accuracy and interpretability. It aids in identifying specific diagnoses and procedures associated with higher hospitalization costs, underscoring its potential to support intelligent management of IHD.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104652"},"PeriodicalIF":4.5,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140891701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chat-ePRO: Development and pilot study of an electronic patient-reported outcomes system based on ChatGPT Chat-ePRO:基于 ChatGPT 的电子患者报告结果系统的开发和试点研究。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-03 DOI: 10.1016/j.jbi.2024.104651
Zikang Chen , Qinchuan Wang , Yaoqian Sun , Hailing Cai , Xudong Lu

Objective

Chatbots have the potential to improve user compliance in electronic Patient-Reported Outcome (ePRO) system. Compared to rule-based chatbots, Large Language Model (LLM) offers advantages such as simplifying the development process and increasing conversational flexibility. However, there is currently a lack of practical applications of LLMs in ePRO systems. Therefore, this study utilized ChatGPT to develop the Chat-ePRO system and designed a pilot study to explore the feasibility of building an ePRO system based on LLM.

Materials and Methods

This study employed prompt engineering and offline knowledge distillation to design a dialogue algorithm and built the Chat-ePRO system on the WeChat Mini Program platform. In order to compare Chat-ePRO with the form-based ePRO and rule-based chatbot ePRO used in previous studies, we conducted a pilot study applying the three ePRO systems sequentially at the Sir Run Run Shaw Hospital to collect patients’ PRO data.

Result

Chat-ePRO is capable of correctly generating conversation based on PRO forms (success rate: 95.7 %) and accurately extracting the PRO data instantaneously from conversation (Macro-F1: 0.95). The majority of subjective evaluations from doctors (>70 %) suggest that Chat-ePRO is able to comprehend questions and consistently generate responses. Pilot study shows that Chat-ePRO demonstrates higher response rate (9/10, 90 %) and longer interaction time (10.86 s/turn) compared to the other two methods.

Conclusion

Our study demonstrated the feasibility of utilizing algorithms such as prompt engineering to drive LLM in completing ePRO data collection tasks, and validated that the Chat-ePRO system can effectively enhance patient compliance.

目的聊天机器人有可能提高电子患者报告结果(ePRO)系统中用户的依从性。与基于规则的聊天机器人相比,大语言模型(LLM)具有简化开发流程和提高对话灵活性等优势。然而,目前在 ePRO 系统中缺乏对 LLM 的实际应用。因此,本研究利用 ChatGPT 开发了 Chat-ePRO 系统,并设计了一项试点研究来探索基于 LLM 构建 ePRO 系统的可行性:本研究采用提示工程和离线知识提炼设计了对话算法,并在微信小程序平台上构建了Chat-ePRO系统。为了将 Chat-ePRO 与之前研究中使用的基于表单的电子病历和基于规则的聊天机器人电子病历进行比较,我们在邵逸夫医院进行了一项试点研究,依次应用这三种电子病历系统收集患者的 PRO 数据:结果:Chat-ePRO 能够根据 PRO 表格正确生成对话(成功率:95.7%),并能准确地从对话中即时提取 PRO 数据(Macro-F1:0.95)。大多数医生的主观评价(大于 70%)表明,Chat-ePRO 能够理解问题并持续生成回复。试点研究表明,与其他两种方法相比,Chat-ePRO 的回复率更高(9/10,90%),互动时间更长(10.86 秒/转):我们的研究证明了利用提示工程等算法驱动 LLM 完成 ePRO 数据收集任务的可行性,并验证了 Chat-ePRO 系统能有效提高患者的依从性。
{"title":"Chat-ePRO: Development and pilot study of an electronic patient-reported outcomes system based on ChatGPT","authors":"Zikang Chen ,&nbsp;Qinchuan Wang ,&nbsp;Yaoqian Sun ,&nbsp;Hailing Cai ,&nbsp;Xudong Lu","doi":"10.1016/j.jbi.2024.104651","DOIUrl":"10.1016/j.jbi.2024.104651","url":null,"abstract":"<div><h3>Objective</h3><p>Chatbots have the potential to improve user compliance in electronic Patient-Reported Outcome (ePRO) system. Compared to rule-based chatbots, Large Language Model (LLM) offers advantages such as simplifying the development process and increasing conversational flexibility. However, there is currently a lack of practical applications of LLMs in ePRO systems. Therefore, this study utilized ChatGPT to develop the Chat-ePRO system and designed a pilot study to explore the feasibility of building an ePRO system based on LLM.</p></div><div><h3>Materials and Methods</h3><p>This study employed prompt engineering and offline knowledge distillation to design a dialogue algorithm and built the Chat-ePRO system on the WeChat Mini Program platform. In order to compare Chat-ePRO with the form-based ePRO and rule-based chatbot ePRO used in previous studies, we conducted a pilot study applying the three ePRO systems sequentially at the Sir Run Run Shaw Hospital to collect patients’ PRO data.</p></div><div><h3>Result</h3><p>Chat-ePRO is capable of correctly generating conversation based on PRO forms (success rate: 95.7 %) and accurately extracting the PRO data instantaneously from conversation (Macro-F1: 0.95). The majority of subjective evaluations from doctors (&gt;70 %) suggest that Chat-ePRO is able to comprehend questions and consistently generate responses. Pilot study shows that Chat-ePRO demonstrates higher response rate (9/10, 90 %) and longer interaction time (10.86 s/turn) compared to the other two methods.</p></div><div><h3>Conclusion</h3><p>Our study demonstrated the feasibility of utilizing algorithms such as prompt engineering to drive LLM in completing ePRO data collection tasks, and validated that the Chat-ePRO system can effectively enhance patient compliance.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104651"},"PeriodicalIF":4.5,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140855026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated annotation of disease subtypes 疾病亚型的自动注释
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-01 DOI: 10.1016/j.jbi.2024.104650
Dan Ofer, Michal Linial

Background

Distinguishing diseases into distinct subtypes is crucial for study and effective treatment strategies. The Open Targets Platform (OT) integrates biomedical, genetic, and biochemical datasets to empower disease ontologies, classifications, and potential gene targets. Nevertheless, many disease annotations are incomplete, requiring laborious expert medical input. This challenge is especially pronounced for rare and orphan diseases, where resources are scarce.

Methods

We present a machine learning approach to identifying diseases with potential subtypes, using the approximately 23,000 diseases documented in OT. We derive novel features for predicting diseases with subtypes using direct evidence. Machine learning models were applied to analyze feature importance and evaluate predictive performance for discovering both known and novel disease subtypes.

Results

Our model achieves a high (89.4%) ROC AUC (Area Under the Receiver Operating Characteristic Curve) in identifying known disease subtypes. We integrated pre-trained deep-learning language models and showed their benefits. Moreover, we identify 515 disease candidates predicted to possess previously unannotated subtypes.

Conclusions

Our models can partition diseases into distinct subtypes. This methodology enables a robust, scalable approach for improving knowledge-based annotations and a comprehensive assessment of disease ontology tiers. Our candidates are attractive targets for further study and personalized medicine, potentially aiding in the unveiling of new therapeutic indications for sought-after targets.

背景将疾病分为不同的亚型对研究和有效的治疗策略至关重要。开放靶点平台(OT)整合了生物医学、遗传学和生物化学数据集,以增强疾病本体、分类和潜在基因靶点的能力。然而,许多疾病注释并不完整,需要专家费力地输入医学信息。我们提出了一种机器学习方法,利用 OT 中记录的约 23,000 种疾病来识别具有潜在亚型的疾病。我们利用直接证据得出了预测疾病亚型的新特征。结果我们的模型在识别已知疾病亚型方面达到了很高(89.4%)的 ROC AUC(接收者工作特征曲线下面积)。我们整合了预先训练的深度学习语言模型,并展示了其优势。此外,我们还确定了 515 种候选疾病,预测它们具有以前未注明的亚型。我们的模型可以将疾病划分为不同的亚型,这种方法是改进基于知识的注释和全面评估疾病本体层级的一种稳健、可扩展的方法。我们的候选目标对进一步研究和个性化医疗很有吸引力,可能有助于揭示热门目标的新治疗适应症。
{"title":"Automated annotation of disease subtypes","authors":"Dan Ofer,&nbsp;Michal Linial","doi":"10.1016/j.jbi.2024.104650","DOIUrl":"https://doi.org/10.1016/j.jbi.2024.104650","url":null,"abstract":"<div><h3>Background</h3><p>Distinguishing diseases into distinct subtypes is crucial for study and effective treatment strategies. The Open Targets Platform (OT) integrates biomedical, genetic, and biochemical datasets to empower disease ontologies, classifications, and potential gene targets. Nevertheless, many disease annotations are incomplete, requiring laborious expert medical input. This challenge is especially pronounced for rare and orphan diseases, where resources are scarce.</p></div><div><h3>Methods</h3><p>We present a machine learning approach to identifying diseases with potential subtypes, using the approximately 23,000 diseases documented in OT. We derive novel features for predicting diseases with subtypes using direct evidence. Machine learning models were applied to analyze feature importance and evaluate predictive performance for discovering both known and novel disease subtypes.</p></div><div><h3>Results</h3><p>Our model achieves a high (89.4%) ROC AUC (Area Under the Receiver Operating Characteristic Curve) in identifying known disease subtypes. We integrated pre-trained deep-learning language models and showed their benefits. Moreover, we identify 515 disease candidates predicted to possess previously unannotated subtypes.</p></div><div><h3>Conclusions</h3><p>Our models can partition diseases into distinct subtypes. This methodology enables a robust, scalable approach for improving knowledge-based annotations and a comprehensive assessment of disease ontology tiers. Our candidates are attractive targets for further study and personalized medicine, potentially aiding in the unveiling of new therapeutic indications for sought-after targets.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104650"},"PeriodicalIF":4.5,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424000686/pdfft?md5=2a780c6c20882fa8a92e8fd6785f441b&pid=1-s2.0-S1532046424000686-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140822205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Criteria2Query 3.0: Leveraging generative large language models for clinical trial eligibility query generation Criteria2Query 3.0:利用生成式大语言模型生成临床试验资格查询
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-30 DOI: 10.1016/j.jbi.2024.104649
Jimyung Park , Yilu Fang , Casey Ta , Gongbo Zhang , Betina Idnay , Fangyi Chen , David Feng , Rebecca Shyu , Emily R. Gordon , Matthew Spotnitz , Chunhua Weng

Objective

Automated identification of eligible patients is a bottleneck of clinical research. We propose Criteria2Query (C2Q) 3.0, a system that leverages GPT-4 for the semi-automatic transformation of clinical trial eligibility criteria text into executable clinical database queries.

Materials and Methods

C2Q 3.0 integrated three GPT-4 prompts for concept extraction, SQL query generation, and reasoning. Each prompt was designed and evaluated separately. The concept extraction prompt was benchmarked against manual annotations from 20 clinical trials by two evaluators, who later also measured SQL generation accuracy and identified errors in GPT-generated SQL queries from 5 clinical trials. The reasoning prompt was assessed by three evaluators on four metrics: readability, correctness, coherence, and usefulness, using corrected SQL queries and an open-ended feedback questionnaire.

Results

Out of 518 concepts from 20 clinical trials, GPT-4 achieved an F1-score of 0.891 in concept extraction. For SQL generation, 29 errors spanning seven categories were detected, with logic errors being the most common (n = 10; 34.48 %). Reasoning evaluations yielded a high coherence rating, with the mean score being 4.70 but relatively lower readability, with a mean of 3.95. Mean scores of correctness and usefulness were identified as 3.97 and 4.37, respectively.

Conclusion

GPT-4 significantly improves the accuracy of extracting clinical trial eligibility criteria concepts in C2Q 3.0. Continued research is warranted to ensure the reliability of large language models.

目标自动识别符合条件的患者是临床研究的一个瓶颈。我们提出了 Criteria2Query(C2Q)3.0,这是一个利用 GPT-4 将临床试验资格标准文本半自动转换为可执行临床数据库查询的系统。每个提示都是单独设计和评估的。概念提取提示由两名评估人员根据 20 项临床试验的人工注释进行基准测试,他们随后还测量了 SQL 生成的准确性,并找出了 GPT 生成的 5 项临床试验 SQL 查询中的错误。推理提示由三位评估员根据四项指标进行评估:可读性、正确性、连贯性和实用性,并使用校正后的 SQL 查询和开放式反馈问卷。结果在 20 项临床试验的 518 个概念中,GPT-4 的概念提取 F1 分数达到 0.891。在 SQL 生成过程中,发现了 7 个类别的 29 个错误,其中逻辑错误最为常见(n = 10; 34.48 %)。推理评估的一致性评分较高,平均得分为 4.70,但可读性相对较低,平均得分为 3.95。结论 GPT-4 显著提高了 C2Q 3.0 中提取临床试验资格标准概念的准确性。为确保大型语言模型的可靠性,有必要继续开展研究。
{"title":"Criteria2Query 3.0: Leveraging generative large language models for clinical trial eligibility query generation","authors":"Jimyung Park ,&nbsp;Yilu Fang ,&nbsp;Casey Ta ,&nbsp;Gongbo Zhang ,&nbsp;Betina Idnay ,&nbsp;Fangyi Chen ,&nbsp;David Feng ,&nbsp;Rebecca Shyu ,&nbsp;Emily R. Gordon ,&nbsp;Matthew Spotnitz ,&nbsp;Chunhua Weng","doi":"10.1016/j.jbi.2024.104649","DOIUrl":"https://doi.org/10.1016/j.jbi.2024.104649","url":null,"abstract":"<div><h3>Objective</h3><p>Automated identification of eligible patients is a bottleneck of clinical research. We propose Criteria2Query (C2Q) 3.0, a system that leverages GPT-4 for the semi-automatic transformation of clinical trial eligibility criteria text into executable clinical database queries.</p></div><div><h3>Materials and Methods</h3><p>C2Q 3.0 integrated three GPT-4 prompts for concept extraction, SQL query generation, and reasoning. Each prompt was designed and evaluated separately. The concept extraction prompt was benchmarked against manual annotations from 20 clinical trials by two evaluators, who later also measured SQL generation accuracy and identified errors in GPT-generated SQL queries from 5 clinical trials. The reasoning prompt was assessed by three evaluators on four metrics: readability, correctness, coherence, and usefulness, using corrected SQL queries and an open-ended feedback questionnaire.</p></div><div><h3>Results</h3><p>Out of 518 concepts from 20 clinical trials, GPT-4 achieved an F1-score of 0.891 in concept extraction. For SQL generation, 29 errors spanning seven categories were detected, with logic errors being the most common (n = 10; 34.48 %). Reasoning evaluations yielded a high coherence rating, with the mean score being 4.70 but relatively lower readability, with a mean of 3.95. Mean scores of correctness and usefulness were identified as 3.97 and 4.37, respectively.</p></div><div><h3>Conclusion</h3><p>GPT-4 significantly improves the accuracy of extracting clinical trial eligibility criteria concepts in C2Q 3.0. Continued research is warranted to ensure the reliability of large language models.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104649"},"PeriodicalIF":4.5,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140816662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Machine-FAIR: Representing software and datasets to facilitate reuse and scientific discovery by machines 迈向 "机器-公平":表征软件和数据集,促进机器重用和科学发现
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-30 DOI: 10.1016/j.jbi.2024.104647
Michael M. Wagner , William R. Hogan , John D. Levander , Matthew Diller

Objective

To use software, datasets, and data formats in the domain of Infectious Disease Epidemiology as a test collection to evaluate a novel M1 use case, which we introduce in this paper. M1 is a machine that upon receipt of a new digital object of research exhaustively finds all valid compositions of it with existing objects.

Method

We implemented a data-format-matching-only M1 using exhaustive search, which we refer to as M1DFM. We then ran M1DFM on the test collection and used error analysis to identify needed semantic constraints.

Results

Precision of M1DFM search was 61.7%. Error analysis identified needed semantic constraints and needed changes in handling of data services. Most semantic constraints were simple, but one data format was sufficiently complex to be practically impossible to represent semantic constraints over, from which we conclude limitatively that software developers will have to meet the machines halfway by engineering software whose inputs are sufficiently simple that their semantic constraints can be represented, akin to the simple APIs of services. We summarize these insights as M1-FAIR guiding principles for composability and suggest a roadmap for progressively capable devices in the service of reuse and accelerated scientific discovery.

Conclusion

Algorithmic search of digital repositories for valid workflow compositions has potential to accelerate scientific discovery but requires a scalable solution to the problem of knowledge acquisition about semantic constraints on software inputs. Additionally, practical limitations on the logical complexity of semantic constraints must be respected, which has implications for the design of software.

目标使用传染病流行病学领域的软件、数据集和数据格式作为测试集合,评估我们在本文中介绍的新型 M1 用例。M1 是一种机器,在接收到新的数字研究对象时,它能穷举地找到该对象与现有对象的所有有效组合。方法我们使用穷举搜索实现了仅数据格式匹配的 M1,我们称之为 M1DFM。然后,我们在测试集合上运行了 M1DFM,并利用误差分析确定了所需的语义约束。结果M1DFM 搜索的精确度为 61.7%。错误分析确定了所需的语义约束和数据服务处理中需要的更改。大多数语义约束都很简单,但有一种数据格式非常复杂,实际上无法表示语义约束,由此我们得出一个有限的结论,即软件开发人员必须满足机器的一半要求,即工程软件的输入必须足够简单,以便能够表示其语义约束,类似于服务的简单应用程序接口。我们将这些见解总结为可组合性的M1-FAIR指导原则,并提出了逐步提高设备能力的路线图,以服务于重复使用和加速科学发现。 结论 通过算法搜索数字资源库中的有效工作流组合具有加速科学发现的潜力,但需要一个可扩展的解决方案来解决有关软件输入语义约束的知识获取问题。此外,必须尊重对语义约束逻辑复杂性的实际限制,这对软件设计也有影响。
{"title":"Towards Machine-FAIR: Representing software and datasets to facilitate reuse and scientific discovery by machines","authors":"Michael M. Wagner ,&nbsp;William R. Hogan ,&nbsp;John D. Levander ,&nbsp;Matthew Diller","doi":"10.1016/j.jbi.2024.104647","DOIUrl":"https://doi.org/10.1016/j.jbi.2024.104647","url":null,"abstract":"<div><h3>Objective</h3><p>To use software, datasets, and data formats in the domain of Infectious Disease Epidemiology as a test collection to evaluate a novel <em>M1 use case</em>, which we introduce in this paper. M1 is a machine that upon receipt of a new digital object of research exhaustively finds all valid compositions of it with existing objects.</p></div><div><h3>Method</h3><p>We implemented a data-format-matching-only M1 using exhaustive search, which we refer to as M1<sub>DFM</sub>. We then ran M1<sub>DFM</sub> on the test collection and used error analysis to identify needed semantic constraints.</p></div><div><h3>Results</h3><p>Precision of M1<sub>DFM</sub> search was 61.7%. Error analysis identified needed semantic constraints and needed changes in handling of data services. Most semantic constraints were simple, but one data format was sufficiently complex to be practically impossible to represent semantic constraints over, from which we conclude limitatively that software developers will have to meet the machines halfway by engineering software whose inputs are sufficiently simple that their semantic constraints can be represented, akin to the simple APIs of services. We summarize these insights as M1-FAIR guiding principles for composability and suggest a roadmap for progressively capable devices in the service of reuse and accelerated scientific discovery.</p></div><div><h3>Conclusion</h3><p>Algorithmic search of digital repositories for valid workflow compositions has potential to accelerate scientific discovery but requires a scalable solution to the problem of knowledge acquisition about semantic constraints on software inputs. Additionally, practical limitations on the logical complexity of semantic constraints must be respected, which has implications for the design of software.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104647"},"PeriodicalIF":4.5,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424000650/pdfft?md5=7430184232c45602f4b33a5d72e51310&pid=1-s2.0-S1532046424000650-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140844080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting acute kidney injury and resource utilization in ICU patients using longitudinal, multimodal models 利用纵向多模态模型预测重症监护室患者的急性肾损伤和资源利用情况
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-30 DOI: 10.1016/j.jbi.2024.104648
Yukun Tan , Merve Dede , Vakul Mohanty , Jinzhuang Dou , Holly Hill , Elmer Bernstam , Ken Chen

Background

Advances in artificial intelligence (AI) have realized the potential of revolutionizing healthcare, such as predicting disease progression via longitudinal inspection of Electronic Health Records (EHRs) and lab tests from patients admitted to Intensive Care Units (ICU). Although substantial literature exists addressing broad subjects, including the prediction of mortality, length-of-stay, and readmission, studies focusing on forecasting Acute Kidney Injury (AKI), specifically dialysis anticipation like Continuous Renal Replacement Therapy (CRRT) are scarce. The technicality of how to implement AI remains elusive.

Objective

This study aims to elucidate the important factors and methods that are required to develop effective predictive models of AKI and CRRT for patients admitted to ICU, using EHRs in the Medical Information Mart for Intensive Care (MIMIC) database.

Methods

We conducted a comprehensive comparative analysis of established predictive models, considering both time-series measurements and clinical notes from MIMIC-IV databases. Subsequently, we proposed a novel multi-modal model which integrates embeddings of top-performing unimodal models, including Long Short-Term Memory (LSTM) and BioMedBERT, and leverages both unstructured clinical notes and structured time series measurements derived from EHRs to enable the early prediction of AKI and CRRT.

Results

Our multimodal model achieved a lead time of at least 12 h ahead of clinical manifestation, with an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.888 for AKI and 0.997 for CRRT, as well as an Area Under the Precision Recall Curve (AUPRC) of 0.727 for AKI and 0.840 for CRRT, respectively, which significantly outperformed the baseline models. Additionally, we performed a SHapley Additive exPlanation (SHAP) analysis using the expected gradients algorithm, which highlighted important, previously underappreciated predictive features for AKI and CRRT.

Conclusion

Our study revealed the importance and the technicality of applying longitudinal, multimodal modeling to improve early prediction of AKI and CRRT, offering insights for timely interventions. The performance and interpretability of our model indicate its potential for further assessment towards clinical applications, to ultimately optimize AKI management and enhance patient outcomes.

背景人工智能(AI)的进步已经实现了彻底改变医疗保健的潜力,例如通过纵向检查电子健康记录(EHR)和重症监护病房(ICU)病人的实验室检查来预测疾病的进展。虽然已有大量文献涉及死亡率、住院时间和再入院率预测等广泛主题,但侧重于预测急性肾损伤(AKI),特别是连续肾脏替代疗法(CRRT)等透析预期的研究却很少。本研究旨在利用重症监护医学信息市场(MIMIC)数据库中的电子病历,阐明为重症监护室住院患者开发有效的 AKI 和 CRRT 预测模型所需的重要因素和方法。方法我们对已建立的预测模型进行了全面的比较分析,同时考虑了 MIMIC-IV 数据库中的时间序列测量和临床记录。随后,我们提出了一个新颖的多模态模型,该模型整合了包括长短期记忆(LSTM)和 BioMedBERT 在内的顶级单模态模型的嵌入,并利用非结构化临床笔记和来自电子病历的结构化时间序列测量结果,实现了对 AKI 和 CRRT 的早期预测。结果我们的多模态模型可在临床表现出现前至少提前 12 小时进行预测,AKI 和 CRRT 的接收者工作特征曲线下面积(AUROC)分别为 0.888 和 0.997,AKI 和 CRRT 的精确度召回曲线下面积(AUPRC)分别为 0.727 和 0.840,明显优于基线模型。结论我们的研究揭示了应用纵向多模式建模改善 AKI 和 CRRT 早期预测的重要性和技术性,为及时干预提供了真知灼见。我们模型的性能和可解释性表明,它具有进一步评估临床应用的潜力,最终可优化 AKI 管理并改善患者预后。
{"title":"Forecasting acute kidney injury and resource utilization in ICU patients using longitudinal, multimodal models","authors":"Yukun Tan ,&nbsp;Merve Dede ,&nbsp;Vakul Mohanty ,&nbsp;Jinzhuang Dou ,&nbsp;Holly Hill ,&nbsp;Elmer Bernstam ,&nbsp;Ken Chen","doi":"10.1016/j.jbi.2024.104648","DOIUrl":"https://doi.org/10.1016/j.jbi.2024.104648","url":null,"abstract":"<div><h3>Background</h3><p>Advances in artificial intelligence (AI) have realized the potential of revolutionizing healthcare, such as predicting disease progression via longitudinal inspection of Electronic Health Records (EHRs) and lab tests from patients admitted to Intensive Care Units (ICU). Although substantial literature exists addressing broad subjects, including the prediction of mortality, length-of-stay, and readmission, studies focusing on forecasting Acute Kidney Injury (AKI), specifically dialysis anticipation like Continuous Renal Replacement Therapy (CRRT) are scarce. The technicality of how to implement AI remains elusive.</p></div><div><h3>Objective</h3><p>This study aims to elucidate the important factors and methods that are required to develop effective predictive models of AKI and CRRT for patients admitted to ICU, using EHRs in the Medical Information Mart for Intensive Care (MIMIC) database.</p></div><div><h3>Methods</h3><p>We conducted a comprehensive comparative analysis of established predictive models, considering both time-series measurements and clinical notes from MIMIC-IV databases. Subsequently, we proposed a novel multi-modal model which integrates embeddings of top-performing unimodal models, including Long Short-Term Memory (LSTM) and BioMedBERT, and leverages both unstructured clinical notes and structured time series measurements derived from EHRs to enable the early prediction of AKI and CRRT.</p></div><div><h3>Results</h3><p>Our multimodal model achieved a lead time of at least 12 h ahead of clinical manifestation, with an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.888 for AKI and 0.997 for CRRT, as well as an Area Under the Precision Recall Curve (AUPRC) of 0.727 for AKI and 0.840 for CRRT, respectively, which significantly outperformed the baseline models. Additionally, we performed a SHapley Additive exPlanation (SHAP) analysis using the expected gradients algorithm, which highlighted important, previously underappreciated predictive features for AKI and CRRT.</p></div><div><h3>Conclusion</h3><p>Our study revealed the importance and the technicality of applying longitudinal, multimodal modeling to improve early prediction of AKI and CRRT, offering insights for timely interventions. The performance and interpretability of our model indicate its potential for further assessment towards clinical applications, to ultimately optimize AKI management and enhance patient outcomes.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104648"},"PeriodicalIF":4.5,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140822160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey of recent methods for addressing AI fairness and bias in biomedicine 解决生物医学中人工智能公平性和偏见的最新方法概览
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-25 DOI: 10.1016/j.jbi.2024.104646
Yifan Yang , Mingquan Lin , Han Zhao , Yifan Peng , Furong Huang , Zhiyong Lu

Objectives

Artificial intelligence (AI) systems have the potential to revolutionize clinical practices, including improving diagnostic accuracy and surgical decision-making, while also reducing costs and manpower. However, it is important to recognize that these systems may perpetuate social inequities or demonstrate biases, such as those based on race or gender. Such biases can occur before, during, or after the development of AI models, making it critical to understand and address potential biases to enable the accurate and reliable application of AI models in clinical settings. To mitigate bias concerns during model development, we surveyed recent publications on different debiasing methods in the fields of biomedical natural language processing (NLP) or computer vision (CV). Then we discussed the methods, such as data perturbation and adversarial learning, that have been applied in the biomedical domain to address bias.

Methods

We performed our literature search on PubMed, ACM digital library, and IEEE Xplore of relevant articles published between January 2018 and December 2023 using multiple combinations of keywords. We then filtered the result of 10,041 articles automatically with loose constraints, and manually inspected the abstracts of the remaining 890 articles to identify the 55 articles included in this review. Additional articles in the references are also included in this review. We discuss each method and compare its strengths and weaknesses. Finally, we review other potential methods from the general domain that could be applied to biomedicine to address bias and improve fairness.

Results

The bias of AIs in biomedicine can originate from multiple sources such as insufficient data, sampling bias and the use of health-irrelevant features or race-adjusted algorithms. Existing debiasing methods that focus on algorithms can be categorized into distributional or algorithmic. Distributional methods include data augmentation, data perturbation, data reweighting methods, and federated learning. Algorithmic approaches include unsupervised representation learning, adversarial learning, disentangled representation learning, loss-based methods and causality-based methods.

目标人工智能(AI)系统有可能彻底改变临床实践,包括提高诊断准确性和手术决策,同时降低成本和人力。然而,重要的是要认识到这些系统可能会延续社会不平等或表现出偏见,例如基于种族或性别的偏见。这些偏见可能发生在人工智能模型开发之前、期间或之后,因此了解和解决潜在的偏见至关重要,以便在临床环境中准确可靠地应用人工智能模型。为了减轻模型开发过程中的偏差问题,我们调查了最近在生物医学自然语言处理(NLP)或计算机视觉(CV)领域发表的有关不同去偏差方法的文章。然后,我们讨论了在生物医学领域应用于解决偏差问题的方法,如数据扰动和对抗学习。方法我们在 PubMed、ACM 数字图书馆和 IEEE Xplore 上使用多种关键词组合对 2018 年 1 月至 2023 年 12 月间发表的相关文章进行了文献检索。然后,我们以宽松的限制条件自动过滤了 10041 篇文章,并人工检查了剩余 890 篇文章的摘要,最终确定了纳入本综述的 55 篇文章。参考文献中的其他文章也包含在本综述中。我们讨论了每种方法,并比较了其优缺点。结果生物医学中人工智能的偏差可能来自多个方面,如数据不足、抽样偏差、使用与健康无关的特征或种族调整算法。现有的针对算法的去污方法可分为分布式和算法式两种。分布式方法包括数据增强、数据扰动、数据重权方法和联合学习。算法方法包括无监督表示学习、对抗学习、分离表示学习、基于损失的方法和基于因果关系的方法。
{"title":"A survey of recent methods for addressing AI fairness and bias in biomedicine","authors":"Yifan Yang ,&nbsp;Mingquan Lin ,&nbsp;Han Zhao ,&nbsp;Yifan Peng ,&nbsp;Furong Huang ,&nbsp;Zhiyong Lu","doi":"10.1016/j.jbi.2024.104646","DOIUrl":"https://doi.org/10.1016/j.jbi.2024.104646","url":null,"abstract":"<div><h3>Objectives</h3><p>Artificial intelligence (AI) systems have the potential to revolutionize clinical practices, including improving diagnostic accuracy and surgical decision-making, while also reducing costs and manpower. However, it is important to recognize that these systems may perpetuate social inequities or demonstrate biases, such as those based on race or gender. Such biases can occur before, during, or after the development of AI models, making it critical to understand and address potential biases to enable the accurate and reliable application of AI models in clinical settings. To mitigate bias concerns during model development, we surveyed recent publications on different debiasing methods in the fields of biomedical natural language processing (NLP) or computer vision (CV). Then we discussed the methods, such as data perturbation and adversarial learning, that have been applied in the biomedical domain to address bias.</p></div><div><h3>Methods</h3><p>We performed our literature search on PubMed, ACM digital library, and IEEE Xplore of relevant articles published between January 2018 and December 2023 using multiple combinations of keywords. We then filtered the result of 10,041 articles automatically with loose constraints, and manually inspected the abstracts of the remaining 890 articles to identify the 55 articles included in this review. Additional articles in the references are also included in this review. We discuss each method and compare its strengths and weaknesses. Finally, we review other potential methods from the general domain that could be applied to biomedicine to address bias and improve fairness.</p></div><div><h3>Results</h3><p>The bias of AIs in biomedicine can originate from multiple sources such as insufficient data, sampling bias and the use of health-irrelevant features or race-adjusted algorithms. Existing debiasing methods that focus on algorithms can be categorized into distributional or algorithmic. Distributional methods include data augmentation, data perturbation, data reweighting methods, and federated learning. Algorithmic approaches include unsupervised representation learning, adversarial learning, disentangled representation learning, loss-based methods and causality-based methods.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"154 ","pages":"Article 104646"},"PeriodicalIF":4.5,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424000649/pdfft?md5=463472b5f244a8f4a4f49707c8ee30a5&pid=1-s2.0-S1532046424000649-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140807086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Biomedical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1