首页 > 最新文献

Journal of the American Medical Informatics Association最新文献

英文 中文
Research to classrooms: a co-designed curriculum brings All of Us data to secondary schools. 将研究带入课堂:共同设计的课程将 "我们所有人 "的数据带入中学。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-09 DOI: 10.1093/jamia/ocae167
Louisa A Stark, Kristin E Fenker, Harini Krishnan, Molly Malone, Rebecca J Peterson, Regina Cowan, Jeremy Ensrud, Hector Gamboa, Crstina Gayed, Patricia Refino, Tia Tolk, Teresa Walters, Yong Crosby, Rubin Baskir

Objectives: We describe new curriculum materials for engaging secondary school students in exploring the "big data" in the NIH All of Us Research Program's Public Data Browser and the co-design processes used to collaboratively develop the materials. We also describe the methods used to develop and validate assessment items for studying the efficacy of the materials for student learning as well as preliminary findings from these studies.

Materials and methods: Secondary-level biology teachers from across the United States participated in a 2.5-day Co-design Summer Institute. After learning about the All of Us Research Program and its Data Browser, they collaboratively developed learning objectives and initial ideas for learning experiences related to exploring the Data Browser and big data. The Genetic Science Learning Center team at the University of Utah further developed the educators' ideas. Additional teachers and their students participated in classroom pilot studies to validate a 22-item instrument that assesses students' knowledge. Educators completed surveys about the materials and their experiences.

Results: The "Exploring Big Data with the All of Us Data Browser" curriculum module includes 3 data exploration guides that engage students in using the Data Browser, 3 related multimedia pieces, and teacher support materials. Pilot testing showed substantial growth in students' understanding of key big data concepts and research applications.

Discussion and conclusion: Our co-design process provides a model for educator engagement. The new curriculum module serves as a model for introducing secondary students to big data and precision medicine research by exploring diverse real-world datasets.

目的:我们介绍了让中学生参与探索美国国立卫生研究院全民研究计划公共数据浏览器中的 "大数据 "的新课程材料,以及合作开发这些材料所采用的共同设计过程。我们还介绍了用于开发和验证评估项目的方法,以研究教材对学生学习的有效性,以及这些研究的初步结果:来自美国各地的中学生物教师参加了为期 2.5 天的共同设计暑期学院。在了解了 "我们所有人 "研究计划及其数据浏览器之后,他们共同制定了学习目标,并初步构想了与探索数据浏览器和大数据有关的学习体验。犹他大学遗传科学学习中心团队进一步完善了教育工作者的想法。其他教师及其学生参与了课堂试点研究,以验证评估学生知识的 22 个项目的工具。教育工作者完成了有关教材及其经验的调查:使用我们所有人的数据浏览器探索大数据 "课程模块包括 3 个数据探索指南(让学生参与使用数据浏览器)、3 个相关的多媒体作品和教师支持材料。试点测试表明,学生对关键大数据概念和研究应用的理解有了很大提高:我们的共同设计过程为教育工作者的参与提供了一种模式。新课程模块是通过探索各种真实世界数据集向中学生介绍大数据和精准医学研究的典范。
{"title":"Research to classrooms: a co-designed curriculum brings All of Us data to secondary schools.","authors":"Louisa A Stark, Kristin E Fenker, Harini Krishnan, Molly Malone, Rebecca J Peterson, Regina Cowan, Jeremy Ensrud, Hector Gamboa, Crstina Gayed, Patricia Refino, Tia Tolk, Teresa Walters, Yong Crosby, Rubin Baskir","doi":"10.1093/jamia/ocae167","DOIUrl":"10.1093/jamia/ocae167","url":null,"abstract":"<p><strong>Objectives: </strong>We describe new curriculum materials for engaging secondary school students in exploring the \"big data\" in the NIH All of Us Research Program's Public Data Browser and the co-design processes used to collaboratively develop the materials. We also describe the methods used to develop and validate assessment items for studying the efficacy of the materials for student learning as well as preliminary findings from these studies.</p><p><strong>Materials and methods: </strong>Secondary-level biology teachers from across the United States participated in a 2.5-day Co-design Summer Institute. After learning about the All of Us Research Program and its Data Browser, they collaboratively developed learning objectives and initial ideas for learning experiences related to exploring the Data Browser and big data. The Genetic Science Learning Center team at the University of Utah further developed the educators' ideas. Additional teachers and their students participated in classroom pilot studies to validate a 22-item instrument that assesses students' knowledge. Educators completed surveys about the materials and their experiences.</p><p><strong>Results: </strong>The \"Exploring Big Data with the All of Us Data Browser\" curriculum module includes 3 data exploration guides that engage students in using the Data Browser, 3 related multimedia pieces, and teacher support materials. Pilot testing showed substantial growth in students' understanding of key big data concepts and research applications.</p><p><strong>Discussion and conclusion: </strong>Our co-design process provides a model for educator engagement. The new curriculum module serves as a model for introducing secondary students to big data and precision medicine research by exploring diverse real-world datasets.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141564952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of calibration to improve the precision of estimates obtained from All of Us data. 利用校准提高从 "我们所有人 "数据中获得的估计值的精确度。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-09 DOI: 10.1093/jamia/ocae181
Vivian Hsing-Chun Wang, Julie Holm, José A Pagán

Objectives: To highlight the use of calibration weighting to improve the precision of estimates obtained from All of Us data and increase the return of value to communities from the All of Us Research Program.

Materials and methods: We used All of Us (2017-2022) data and raking to obtain prevalence estimates in two examples: discrimination in medical settings (N = 41 875) and food insecurity (N = 82 266). Weights were constructed using known population proportions (age, sex, race/ethnicity, region of residence, annual household income, and home ownership) from the 2020 National Health Interview Survey.

Results: About 37% of adults experienced discrimination in a medical setting. About 20% of adults who had not seen a doctor reported being food insecure compared with 14% of adults who regularly saw a doctor.

Conclusions: Calibration using raking is cost-effective and may lead to more precise estimates when analyzing All of Us data.

目标:强调校准加权的使用,以提高从 "我们所有人 "数据中获得的估计值的精确度,并增加 "我们所有人 "研究计划对社区的价值回报:我们使用 All of Us(2017-2022 年)数据和耙法获得了两个实例的流行率估计值:医疗环境中的歧视(N = 41 875)和粮食不安全(N = 82 266)。利用 2020 年全国健康访谈调查的已知人口比例(年龄、性别、种族/民族、居住地区、家庭年收入和房屋所有权)构建权重:约 37% 的成年人在医疗环境中遭受过歧视。约 20% 没有看过医生的成年人表示食物无保障,而定期看医生的成年人中这一比例为 14%:在分析 "我们所有人 "数据时,使用耙法进行校准具有成本效益,并可获得更精确的估计值。
{"title":"Use of calibration to improve the precision of estimates obtained from All of Us data.","authors":"Vivian Hsing-Chun Wang, Julie Holm, José A Pagán","doi":"10.1093/jamia/ocae181","DOIUrl":"https://doi.org/10.1093/jamia/ocae181","url":null,"abstract":"<p><strong>Objectives: </strong>To highlight the use of calibration weighting to improve the precision of estimates obtained from All of Us data and increase the return of value to communities from the All of Us Research Program.</p><p><strong>Materials and methods: </strong>We used All of Us (2017-2022) data and raking to obtain prevalence estimates in two examples: discrimination in medical settings (N = 41 875) and food insecurity (N = 82 266). Weights were constructed using known population proportions (age, sex, race/ethnicity, region of residence, annual household income, and home ownership) from the 2020 National Health Interview Survey.</p><p><strong>Results: </strong>About 37% of adults experienced discrimination in a medical setting. About 20% of adults who had not seen a doctor reported being food insecure compared with 14% of adults who regularly saw a doctor.</p><p><strong>Conclusions: </strong>Calibration using raking is cost-effective and may lead to more precise estimates when analyzing All of Us data.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141564953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning classification of new firearm injury encounters in the St Louis region: 2010-2020. 2010-2020 年圣路易斯地区新发生的枪支伤害事件的机器学习分类。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-08 DOI: 10.1093/jamia/ocae173
Rachel M Ancona, Benjamin P Cooper, Randi Foraker, Taylor Kaser, Opeolu Adeoye, Kristen L Mueller

Objectives: To improve firearm injury encounter classification (new vs follow-up) using machine learning (ML) and compare our ML model to other common approaches.

Materials and methods: This retrospective study used data from the St Louis region-wide hospital-based violence intervention program data repository (2010-2020). We randomly selected 500 patients with a firearm injury diagnosis for inclusion, with 808 total firearm injury encounters split (70/30) for training and testing. We trained a least absolute shrinkage and selection operator (LASSO) regression model with the following predictors: admission type, time between firearm injury visits, number of prior firearm injury emergency department (ED) visits, encounter type (ED or other), and diagnostic codes. Our gold standard for new firearm injury encounter classification was manual chart review. We then used our test data to compare the performance of our ML model to other commonly used approaches (proxy measures of ED visits and time between firearm injury encounters, and diagnostic code encounter type designation [initial vs subsequent or sequela]). Performance metrics included area under the curve (AUC), sensitivity, and specificity with 95% confidence intervals (CIs).

Results: The ML model had excellent discrimination (0.92, 0.88-0.96) with high sensitivity (0.95, 0.90-0.98) and specificity (0.89, 0.81-0.95). AUC was significantly higher than time-based outcomes, sensitivity was slightly (but not significantly) lower than other approaches, and specificity was higher than all other methods.

Discussion: ML successfully delineated new firearm injury encounters, outperforming other approaches in ruling out encounters for follow-up.

Conclusion: ML can be used to identify new firearm injury encounters and may be particularly useful in studies assessing re-injuries.

目标: 使用机器学习(ML)改进枪支伤害事故分类(新伤与后续伤),并比较我们的 ML 模型:利用机器学习(ML)改进枪支伤害的分类(新伤与后续伤),并将我们的ML模型与其他常见方法进行比较:这项回顾性研究使用的数据来自圣路易斯地区的医院暴力干预计划数据存储库(2010-2020 年)。我们随机选取了 500 名被诊断为枪支伤害的患者作为研究对象,其中 808 名枪支伤害患者分别(70/30)接受了训练和测试。我们使用以下预测因子训练了最小绝对收缩和选择算子 (LASSO) 回归模型:入院类型、枪支伤害就诊间隔时间、之前枪支伤害急诊科 (ED) 就诊次数、就诊类型(ED 或其他)和诊断代码。我们对新发生的枪支伤害事件进行分类的金标准是人工病历审查。然后,我们使用测试数据来比较我们的 ML 模型与其他常用方法(急诊室就诊次数和枪支伤害就诊间隔时间的替代指标,以及诊断代码的就诊类型指定[初次与后续或续发])的性能。性能指标包括曲线下面积(AUC)、灵敏度和特异性,以及 95% 的置信区间(CI):结果:ML 模型具有极佳的区分度(0.92,0.88-0.96),灵敏度(0.95,0.90-0.98)和特异度(0.89,0.81-0.95)都很高。AUC明显高于基于时间的结果,灵敏度略低于(但不明显)其他方法,特异性高于所有其他方法:讨论:ML 成功地划分了新的枪支伤害事件,在排除需要随访的事件方面优于其他方法:结论:ML 可用来识别新的枪支伤害事件,在评估再次伤害的研究中可能特别有用。
{"title":"Machine learning classification of new firearm injury encounters in the St Louis region: 2010-2020.","authors":"Rachel M Ancona, Benjamin P Cooper, Randi Foraker, Taylor Kaser, Opeolu Adeoye, Kristen L Mueller","doi":"10.1093/jamia/ocae173","DOIUrl":"https://doi.org/10.1093/jamia/ocae173","url":null,"abstract":"<p><strong>Objectives: </strong>To improve firearm injury encounter classification (new vs follow-up) using machine learning (ML) and compare our ML model to other common approaches.</p><p><strong>Materials and methods: </strong>This retrospective study used data from the St Louis region-wide hospital-based violence intervention program data repository (2010-2020). We randomly selected 500 patients with a firearm injury diagnosis for inclusion, with 808 total firearm injury encounters split (70/30) for training and testing. We trained a least absolute shrinkage and selection operator (LASSO) regression model with the following predictors: admission type, time between firearm injury visits, number of prior firearm injury emergency department (ED) visits, encounter type (ED or other), and diagnostic codes. Our gold standard for new firearm injury encounter classification was manual chart review. We then used our test data to compare the performance of our ML model to other commonly used approaches (proxy measures of ED visits and time between firearm injury encounters, and diagnostic code encounter type designation [initial vs subsequent or sequela]). Performance metrics included area under the curve (AUC), sensitivity, and specificity with 95% confidence intervals (CIs).</p><p><strong>Results: </strong>The ML model had excellent discrimination (0.92, 0.88-0.96) with high sensitivity (0.95, 0.90-0.98) and specificity (0.89, 0.81-0.95). AUC was significantly higher than time-based outcomes, sensitivity was slightly (but not significantly) lower than other approaches, and specificity was higher than all other methods.</p><p><strong>Discussion: </strong>ML successfully delineated new firearm injury encounters, outperforming other approaches in ruling out encounters for follow-up.</p><p><strong>Conclusion: </strong>ML can be used to identify new firearm injury encounters and may be particularly useful in studies assessing re-injuries.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141560218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fair prediction of 2-year stroke risk in patients with atrial fibrillation. 对心房颤动患者 2 年中风风险的合理预测。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-03 DOI: 10.1093/jamia/ocae170
Jifan Gao, Philip Mar, Zheng-Zheng Tang, Guanhua Chen

Objective: This study aims to develop machine learning models that provide both accurate and equitable predictions of 2-year stroke risk for patients with atrial fibrillation across diverse racial groups.

Materials and methods: Our study utilized structured electronic health records (EHR) data from the All of Us Research Program. Machine learning models (LightGBM) were utilized to capture the relations between stroke risks and the predictors used by the widely recognized CHADS2 and CHA2DS2-VASc scores. We mitigated the racial disparity by creating a representative tuning set, customizing tuning criteria, and setting binary thresholds separately for subgroups. We constructed a hold-out test set that not only supports temporal validation but also includes a larger proportion of Black/African Americans for fairness validation.

Results: Compared to the original CHADS2 and CHA2DS2-VASc scores, significant improvements were achieved by modeling their predictors using machine learning models (Area Under the Receiver Operating Characteristic curve from near 0.70 to above 0.80). Furthermore, applying our disparity mitigation strategies can effectively enhance model fairness compared to the conventional cross-validation approach.

Discussion: Modeling CHADS2 and CHA2DS2-VASc risk factors with LightGBM and our disparity mitigation strategies achieved decent discriminative performance and excellent fairness performance. In addition, this approach can provide a complete interpretation of each predictor. These highlight its potential utility in clinical practice.

Conclusions: Our research presents a practical example of addressing clinical challenges through the All of Us Research Program data. The disparity mitigation framework we proposed is adaptable across various models and data modalities, demonstrating broad potential in clinical informatics.

目的: 本研究旨在开发机器学习模型,以准确、公平地预测不同种族群体心房颤动患者的 2 年中风风险:本研究旨在开发机器学习模型,为不同种族群体的心房颤动患者提供准确、公平的 2 年中风风险预测:我们的研究利用了 "我们所有人研究计划 "的结构化电子健康记录(EHR)数据。我们利用机器学习模型(LightGBM)来捕捉中风风险与被广泛认可的 CHADS2 和 CHA2DS2-VASc 评分所使用的预测因子之间的关系。我们通过创建具有代表性的调整集、定制调整标准以及为亚组分别设置二进制阈值来减少种族差异。我们构建了一个暂不测试集,它不仅支持时间验证,还包括更大比例的黑人/非裔美国人,用于公平性验证:结果:与最初的 CHADS2 和 CHA2DS2-VASc 评分相比,通过使用机器学习模型对其预测因子进行建模,结果有了显著改善(接收者工作特征曲线下面积从接近 0.70 提高到 0.80 以上)。此外,与传统的交叉验证方法相比,采用我们的差异缓解策略可以有效提高模型的公平性:讨论:利用 LightGBM 和我们的差异缓解策略对 CHADS2 和 CHA2DS2-VASc 危险因素建模,取得了良好的判别性能和出色的公平性。此外,这种方法还能提供对每个预测因子的完整解释。这些都凸显了它在临床实践中的潜在用途:我们的研究提供了一个通过 "全民研究计划 "数据应对临床挑战的实例。我们提出的差异缓解框架可适用于各种模型和数据模式,展示了临床信息学的广泛潜力。
{"title":"Fair prediction of 2-year stroke risk in patients with atrial fibrillation.","authors":"Jifan Gao, Philip Mar, Zheng-Zheng Tang, Guanhua Chen","doi":"10.1093/jamia/ocae170","DOIUrl":"https://doi.org/10.1093/jamia/ocae170","url":null,"abstract":"<p><strong>Objective: </strong>This study aims to develop machine learning models that provide both accurate and equitable predictions of 2-year stroke risk for patients with atrial fibrillation across diverse racial groups.</p><p><strong>Materials and methods: </strong>Our study utilized structured electronic health records (EHR) data from the All of Us Research Program. Machine learning models (LightGBM) were utilized to capture the relations between stroke risks and the predictors used by the widely recognized CHADS2 and CHA2DS2-VASc scores. We mitigated the racial disparity by creating a representative tuning set, customizing tuning criteria, and setting binary thresholds separately for subgroups. We constructed a hold-out test set that not only supports temporal validation but also includes a larger proportion of Black/African Americans for fairness validation.</p><p><strong>Results: </strong>Compared to the original CHADS2 and CHA2DS2-VASc scores, significant improvements were achieved by modeling their predictors using machine learning models (Area Under the Receiver Operating Characteristic curve from near 0.70 to above 0.80). Furthermore, applying our disparity mitigation strategies can effectively enhance model fairness compared to the conventional cross-validation approach.</p><p><strong>Discussion: </strong>Modeling CHADS2 and CHA2DS2-VASc risk factors with LightGBM and our disparity mitigation strategies achieved decent discriminative performance and excellent fairness performance. In addition, this approach can provide a complete interpretation of each predictor. These highlight its potential utility in clinical practice.</p><p><strong>Conclusions: </strong>Our research presents a practical example of addressing clinical challenges through the All of Us Research Program data. The disparity mitigation framework we proposed is adaptable across various models and data modalities, demonstrating broad potential in clinical informatics.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141499494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reducing diagnostic delays in acute hepatic porphyria using health records data and machine learning. 利用健康记录数据和机器学习减少急性肝性卟啉症的诊断延误。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-01 DOI: 10.1093/jamia/ocae141
Balu Bhasuran, Katharina Schmolly, Yuvraaj Kapoor, Nanditha Lakshmi Jayakumar, Raymond Doan, Jigar Amin, Stephen Meninger, Nathan Cheng, Robert Deering, Karl Anderson, Simon W Beaven, Bruce Wang, Vivek A Rudrapatna

Background: Acute hepatic porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of 15 years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, prediction models can be difficult to train given the limited case numbers, unstructured EHR data, and selection biases intrinsic to healthcare delivery. We sought to train and characterize models for identifying patients with AHP.

Methods: This diagnostic study used structured and notes-based EHR data from 2 centers at the University of California, UCSF (2012-2022) and UCLA (2019-2022). The data were split into 2 cohorts (referral and diagnosis) and used to develop models that predict (1) who will be referred for testing of acute porphyria, among those who presented with abdominal pain (a cardinal symptom of AHP), and (2) who will test positive, among those referred. The referral cohort consisted of 747 patients referred for testing and 99 849 contemporaneous patients who were not. The diagnosis cohort consisted of 72 confirmed AHP cases and 347 patients who tested negative. The case cohort was 81% female and 6-75 years old at the time of diagnosis. Candidate models used a range of architectures. Feature selection was semi-automated and incorporated publicly available data from knowledge graphs. Our primary outcome was the F-score on an outcome-stratified test set.

Results: The best center-specific referral models achieved an F-score of 86%-91%. The best diagnosis model achieved an F-score of 92%. To further test our model, we contacted 372 current patients who lack an AHP diagnosis but were predicted by our models as potentially having it (≥10% probability of referral, ≥50% of testing positive). However, we were only able to recruit 10 of these patients for biochemical testing, all of whom were negative. Nonetheless, post hoc evaluations suggested that these models could identify 71% of cases earlier than their diagnosis date, saving 1.2 years.

Conclusions: ML can reduce diagnostic delays in AHP and other rare diseases. Robust recruitment strategies and multicenter coordination will be needed to validate these models before they can be deployed.

背景:急性肝卟啉症(AHP)是一组罕见但可治疗的疾病,平均诊断延迟时间长达 15 年。电子健康记录(EHR)数据和机器学习(ML)的出现可能会改善对 AHP 等罕见疾病的及时识别。然而,由于病例数量有限、电子病历数据不结构化以及医疗服务固有的选择偏差,预测模型可能很难训练。我们试图训练和描述识别 AHP 患者的模型:这项诊断研究使用了加州大学旧金山分校(2012-2022 年)和加州大学洛杉矶分校(2019-2022 年)两个中心的结构化和基于笔记的电子病历数据。这些数据被分为两个队列(转诊和诊断),并用于建立模型,预测:(1) 在出现腹痛(AHP 的主要症状)的患者中,哪些人会被转诊接受急性卟啉症检测;(2) 在转诊患者中,哪些人会检测呈阳性。转诊队列由 747 名转诊患者和 99 849 名未转诊的同期患者组成。诊断队列包括 72 例确诊的 AHP 病例和 347 例检测呈阴性的患者。病例群中 81% 为女性,诊断时年龄为 6-75 岁。候选模型采用了一系列架构。特征选择是半自动化的,并结合了知识图谱中的公开数据。我们的主要结果是结果分层测试集上的 F 分数:结果:最佳中心特定转诊模型的 F 分数达到了 86%-91%。最佳诊断模型的 F 分数为 92%。为了进一步测试我们的模型,我们联系了 372 名目前没有 AHP 诊断但被我们的模型预测为可能有 AHP 诊断的患者(转诊概率≥10%,测试阳性概率≥50%)。然而,我们只能招募其中的 10 名患者进行生化检测,结果全部为阴性。尽管如此,事后评估表明,这些模型可以在诊断日期之前发现 71% 的病例,节省了 1.2 年的时间:结论:ML 可以减少 AHP 和其他罕见病的诊断延误。在部署这些模型之前,还需要强有力的招募策略和多中心协调来验证它们。
{"title":"Reducing diagnostic delays in acute hepatic porphyria using health records data and machine learning.","authors":"Balu Bhasuran, Katharina Schmolly, Yuvraaj Kapoor, Nanditha Lakshmi Jayakumar, Raymond Doan, Jigar Amin, Stephen Meninger, Nathan Cheng, Robert Deering, Karl Anderson, Simon W Beaven, Bruce Wang, Vivek A Rudrapatna","doi":"10.1093/jamia/ocae141","DOIUrl":"10.1093/jamia/ocae141","url":null,"abstract":"<p><strong>Background: </strong>Acute hepatic porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of 15 years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, prediction models can be difficult to train given the limited case numbers, unstructured EHR data, and selection biases intrinsic to healthcare delivery. We sought to train and characterize models for identifying patients with AHP.</p><p><strong>Methods: </strong>This diagnostic study used structured and notes-based EHR data from 2 centers at the University of California, UCSF (2012-2022) and UCLA (2019-2022). The data were split into 2 cohorts (referral and diagnosis) and used to develop models that predict (1) who will be referred for testing of acute porphyria, among those who presented with abdominal pain (a cardinal symptom of AHP), and (2) who will test positive, among those referred. The referral cohort consisted of 747 patients referred for testing and 99 849 contemporaneous patients who were not. The diagnosis cohort consisted of 72 confirmed AHP cases and 347 patients who tested negative. The case cohort was 81% female and 6-75 years old at the time of diagnosis. Candidate models used a range of architectures. Feature selection was semi-automated and incorporated publicly available data from knowledge graphs. Our primary outcome was the F-score on an outcome-stratified test set.</p><p><strong>Results: </strong>The best center-specific referral models achieved an F-score of 86%-91%. The best diagnosis model achieved an F-score of 92%. To further test our model, we contacted 372 current patients who lack an AHP diagnosis but were predicted by our models as potentially having it (≥10% probability of referral, ≥50% of testing positive). However, we were only able to recruit 10 of these patients for biochemical testing, all of whom were negative. Nonetheless, post hoc evaluations suggested that these models could identify 71% of cases earlier than their diagnosis date, saving 1.2 years.</p><p><strong>Conclusions: </strong>ML can reduce diagnostic delays in AHP and other rare diseases. Robust recruitment strategies and multicenter coordination will be needed to validate these models before they can be deployed.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141472084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards objective and systematic evaluation of bias in artificial intelligence for medical imaging. 对医学成像人工智能中的偏差进行客观、系统的评估。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-28 DOI: 10.1093/jamia/ocae165
Emma A M Stanley, Raissa Souza, Anthony J Winder, Vedant Gulve, Kimberly Amador, Matthias Wilms, Nils D Forkert

Objective: Artificial intelligence (AI) models trained using medical images for clinical tasks often exhibit bias in the form of subgroup performance disparities. However, since not all sources of bias in real-world medical imaging data are easily identifiable, it is challenging to comprehensively assess their impacts. In this article, we introduce an analysis framework for systematically and objectively investigating the impact of biases in medical images on AI models.

Materials and methods: Our framework utilizes synthetic neuroimages with known disease effects and sources of bias. We evaluated the impact of bias effects and the efficacy of 3 bias mitigation strategies in counterfactual data scenarios on a convolutional neural network (CNN) classifier.

Results: The analysis revealed that training a CNN model on the datasets containing bias effects resulted in expected subgroup performance disparities. Moreover, reweighing was the most successful bias mitigation strategy for this setup. Finally, we demonstrated that explainable AI methods can aid in investigating the manifestation of bias in the model using this framework.

Discussion: The value of this framework is showcased in our findings on the impact of bias scenarios and efficacy of bias mitigation in a deep learning model pipeline. This systematic analysis can be easily expanded to conduct further controlled in silico trials in other investigations of bias in medical imaging AI.

Conclusion: Our novel methodology for objectively studying bias in medical imaging AI can help support the development of clinical decision-support tools that are robust and responsible.

目的:使用医学影像进行临床任务训练的人工智能(AI)模型经常会以亚组性能差异的形式表现出偏差。然而,由于现实世界医学影像数据中并非所有偏差来源都能轻易识别,因此全面评估其影响具有挑战性。在本文中,我们介绍了一个分析框架,用于系统、客观地研究医学图像中的偏差对人工智能模型的影响:我们的框架利用已知疾病影响和偏差来源的合成神经图像。我们评估了偏差效应的影响以及 3 种偏差缓解策略在反事实数据场景下对卷积神经网络(CNN)分类器的功效:分析结果表明,在含有偏差效应的数据集上训练卷积神经网络模型会导致预期的亚组性能差异。此外,在这种设置下,重权重是最成功的偏差缓解策略。最后,我们证明了可解释的人工智能方法可以帮助使用该框架调查模型中的偏差表现:这个框架的价值体现在我们对深度学习模型流水线中偏差情景的影响和偏差缓解效果的研究结果上。这一系统性分析可以很容易地扩展开来,在其他医学影像人工智能偏差研究中进行进一步的受控硅学试验:我们客观研究医学影像人工智能偏差的新方法有助于支持开发稳健、负责任的临床决策支持工具。
{"title":"Towards objective and systematic evaluation of bias in artificial intelligence for medical imaging.","authors":"Emma A M Stanley, Raissa Souza, Anthony J Winder, Vedant Gulve, Kimberly Amador, Matthias Wilms, Nils D Forkert","doi":"10.1093/jamia/ocae165","DOIUrl":"https://doi.org/10.1093/jamia/ocae165","url":null,"abstract":"<p><strong>Objective: </strong>Artificial intelligence (AI) models trained using medical images for clinical tasks often exhibit bias in the form of subgroup performance disparities. However, since not all sources of bias in real-world medical imaging data are easily identifiable, it is challenging to comprehensively assess their impacts. In this article, we introduce an analysis framework for systematically and objectively investigating the impact of biases in medical images on AI models.</p><p><strong>Materials and methods: </strong>Our framework utilizes synthetic neuroimages with known disease effects and sources of bias. We evaluated the impact of bias effects and the efficacy of 3 bias mitigation strategies in counterfactual data scenarios on a convolutional neural network (CNN) classifier.</p><p><strong>Results: </strong>The analysis revealed that training a CNN model on the datasets containing bias effects resulted in expected subgroup performance disparities. Moreover, reweighing was the most successful bias mitigation strategy for this setup. Finally, we demonstrated that explainable AI methods can aid in investigating the manifestation of bias in the model using this framework.</p><p><strong>Discussion: </strong>The value of this framework is showcased in our findings on the impact of bias scenarios and efficacy of bias mitigation in a deep learning model pipeline. This systematic analysis can be easily expanded to conduct further controlled in silico trials in other investigations of bias in medical imaging AI.</p><p><strong>Conclusion: </strong>Our novel methodology for objectively studying bias in medical imaging AI can help support the development of clinical decision-support tools that are robust and responsible.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141472085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using patient portals for large-scale recruitment of individuals underrepresented in biomedical research: an evaluation of engagement patterns throughout the patient portal recruitment process at a single site within the All of Us Research Program. 利用患者门户网站大规模招募在生物医学研究中代表性不足的人员:对 "我们所有人 "研究计划中单一研究机构的患者门户网站招募过程中的参与模式进行评估。
IF 4.7 2区 医学 Q1 Medicine Pub Date : 2024-06-25 DOI: 10.1093/jamia/ocae135
Maura Beaton, Xinzhuo Jiang, Elise Minto, Chun Yee Lau, Lennon Turner, George Hripcsak, Kanchan Chaudhari, Karthik Natarajan

Objective: To evaluate the use of patient portal messaging to recruit individuals historically underrepresented in biomedical research (UBR) to the All of Us Research Program (AoURP) at a single recruitment site.

Materials and methods: Patient portal-based recruitment was implemented at Columbia University Irving Medical Center. Patient engagement was assessed using patient's electronic health record (EHR) at four recruitment stages: Consenting to be contacted, opening messages, responding to messages, and showing interest in participating. Demographic and socioeconomic data were also collected from patient's EHR and univariate logistic regression analyses were conducted to assess patient engagement.

Results: Between October 2022 and November 2023, a total of 59 592 patients received patient portal messages inviting them to join the AoURP. Among them, 24 445 (41.0%) opened the message, 8983 (15.1%) responded, and 3765 (6.3%) showed interest in joining the program. Though we were unable to link enrollment data with EHR data, we estimate about 2% of patients contacted ultimately enrolled in the AoURP. Patients from underrepresented race and ethnicity communities had lower odds of consenting to be contacted and opening messages, but higher odds of showing interest after responding.

Discussion: Patient portal messaging provided both patients and recruitment staff with a more efficient approach to outreach, but patterns of engagement varied across UBR groups.

Conclusion: Patient portal-based recruitment enables researchers to contact a substantial number of participants from diverse communities. However, more effort is needed to improve engagement from underrepresented racial and ethnic groups at the early stages of the recruitment process.

目的评估使用患者门户网站信息在单一招募地点招募历来在生物医学研究领域代表性不足的人员(UBR)参加 "我们所有人研究计划"(AoURP)的情况:哥伦比亚大学欧文医学中心实施了基于患者门户网站的招募。在四个招募阶段,使用患者的电子健康记录(EHR)对患者的参与度进行评估:同意联系、打开信息、回复信息以及表示有兴趣参与。此外,还从患者的电子病历中收集了人口统计学和社会经济学数据,并进行了单变量逻辑回归分析,以评估患者的参与度:2022年10月至2023年11月期间,共有59 592名患者收到了患者门户网站邀请他们加入AoURP的信息。其中,24 445 人(41.0%)打开了信息,8983 人(15.1%)做出了回复,3765 人(6.3%)表示有兴趣加入该计划。虽然我们无法将注册数据与电子病历数据联系起来,但我们估计约有 2% 联系过的患者最终加入了 AoURP。来自代表性不足的种族和民族社区的患者同意联系和打开信息的几率较低,但回复后表示有兴趣的几率较高:讨论:患者门户网站的信息发布为患者和招募人员提供了更有效的外联方法,但不同 UBR 群体的参与模式各不相同:结论:基于患者门户网站的招募使研究人员能够联系到来自不同社区的大量参与者。然而,在招募过程的早期阶段,还需要做出更多努力来提高代表性不足的种族和民族群体的参与度。
{"title":"Using patient portals for large-scale recruitment of individuals underrepresented in biomedical research: an evaluation of engagement patterns throughout the patient portal recruitment process at a single site within the All of Us Research Program.","authors":"Maura Beaton, Xinzhuo Jiang, Elise Minto, Chun Yee Lau, Lennon Turner, George Hripcsak, Kanchan Chaudhari, Karthik Natarajan","doi":"10.1093/jamia/ocae135","DOIUrl":"https://doi.org/10.1093/jamia/ocae135","url":null,"abstract":"<p><strong>Objective: </strong>To evaluate the use of patient portal messaging to recruit individuals historically underrepresented in biomedical research (UBR) to the All of Us Research Program (AoURP) at a single recruitment site.</p><p><strong>Materials and methods: </strong>Patient portal-based recruitment was implemented at Columbia University Irving Medical Center. Patient engagement was assessed using patient's electronic health record (EHR) at four recruitment stages: Consenting to be contacted, opening messages, responding to messages, and showing interest in participating. Demographic and socioeconomic data were also collected from patient's EHR and univariate logistic regression analyses were conducted to assess patient engagement.</p><p><strong>Results: </strong>Between October 2022 and November 2023, a total of 59 592 patients received patient portal messages inviting them to join the AoURP. Among them, 24 445 (41.0%) opened the message, 8983 (15.1%) responded, and 3765 (6.3%) showed interest in joining the program. Though we were unable to link enrollment data with EHR data, we estimate about 2% of patients contacted ultimately enrolled in the AoURP. Patients from underrepresented race and ethnicity communities had lower odds of consenting to be contacted and opening messages, but higher odds of showing interest after responding.</p><p><strong>Discussion: </strong>Patient portal messaging provided both patients and recruitment staff with a more efficient approach to outreach, but patterns of engagement varied across UBR groups.</p><p><strong>Conclusion: </strong>Patient portal-based recruitment enables researchers to contact a substantial number of participants from diverse communities. However, more effort is needed to improve engagement from underrepresented racial and ethnic groups at the early stages of the recruitment process.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
All of whom? Limitations encountered using All of Us Researcher Workbench in a Primary Care residents secondary data analysis research training block. 所有人?在初级保健住院医师二次数据分析研究培训模块中使用 "我们所有人 "研究员工作台遇到的限制。
IF 4.7 2区 医学 Q1 Medicine Pub Date : 2024-06-25 DOI: 10.1093/jamia/ocae162
Fred Willie Zametkin LaPolla, Marco Barber Grossi, Sharon Chen, Tai Wei Guo, Kathryn Havranek, Olivia Jebb, Minh Thu Nguyen, Sneha Panganamamula, Noah Smith, Shree Sundaresh, Jonathan Yu, Gabrielle Mayer

Objectives: The goal of this case report is to detail experiences and challenges experienced in the training of Primary Care residents in secondary analysis using All of Us Researcher Workbench. At our large, urban safety net hospital, Primary Care/Internal Medicine residents in their third year undergo a research intensive block, the Research Practicum, where they work as a team to conduct secondary data analysis on a dataset with faculty facilitation. In 2023, this research block focused on use of the All of Us Researcher Workbench for secondary data analysis.

Materials and methods: Two groups of 5 residents underwent training to access the All of Us Researcher Workbench, and each group explored available data with a faculty facilitator and generated original research questions. Two blocks of residents successfully completed their research blocks and created original presentations on "social isolation and A1C" levels and "medical discrimination and diabetes management."

Results: Departmental faculty were satisfied with the depth of learning and data exploration. In focus groups, some residents noted that for those without interest in performing research, the activity felt extraneous to their career goals, while others were glad for the opportunity to publish. In both blocks, residents highlighted dissatisfaction with the degree to which the All of Us Researcher Workbench was representative of patients they encounter in a large safety net hospital.

Discussion: Using the All of Us Researcher Workbench provided residents with an opportunity to explore novel questions in a massive data source. Many residents however noted that because the population described in the All of Us Researcher Workbench appeared to be more highly educated and less racially diverse than patients they encounter in their practice, research may be hard to generalize in a community health context. Additionally, given that the data required knowledge of 1 of 2 code-based data analysis languages (R or Python) and work within an idiosyncratic coding environment, residents were heavily reliant on a faculty facilitator to assist with analysis.

Conclusion: Using the All of Us Researcher Workbench for research training allowed residents to explore novel questions and gain first-hand exposure to opportunities and challenges in secondary data analysis.

目的:本病例报告旨在详细介绍使用 "我们所有人 "研究员工作台对初级保健住院医师进行二次分析培训的经验和挑战。在我们这家大型城市安全网医院,初级保健/内科住院医师在第三年要接受研究实习这一研究强化阶段的培训,在这一阶段,他们以团队的形式在教师的协助下对数据集进行二次数据分析。2023 年,该研究单元的重点是使用 "我们所有人 "研究员工作台进行二级数据分析:两组共 5 名住院医师接受了访问 All of Us Researcher Workbench 的培训,每组在教师的协助下探索可用数据,并提出原创研究问题。两组住院医师成功完成了他们的研究模块,并就 "社会隔离与 A1C "水平和 "医疗歧视与糖尿病管理 "发表了原创演讲:部门教师对学习和数据探索的深度表示满意。在焦点小组中,一些住院医师指出,对于那些没有兴趣从事研究的住院医师来说,这项活动感觉与他们的职业目标无关,而另一些住院医师则为有机会发表论文而感到高兴。在这两个讨论组中,住院医师们都强调了对 "我们所有人 "研究人员工作台在多大程度上代表了他们在大型安全网医院中遇到的病人的不满:讨论:使用 "我们所有人 "研究人员工作台为住院医师提供了一个在海量数据源中探索新问题的机会。然而,许多居民指出,由于 "我们所有人 "研究人员工作台中描述的人群与他们在实践中遇到的患者相比,受教育程度更高,种族多样性更少,因此研究可能难以在社区卫生环境中推广。此外,鉴于数据需要掌握 2 种基于代码的数据分析语言(R 或 Python)中的一种,并且需要在特殊的编码环境中工作,因此居民在很大程度上依赖于教师协助分析:使用 "我们所有人 "研究人员工作台进行研究培训,使住院医师能够探索新问题,并亲身体验二手数据分析的机遇和挑战。
{"title":"All of whom? Limitations encountered using All of Us Researcher Workbench in a Primary Care residents secondary data analysis research training block.","authors":"Fred Willie Zametkin LaPolla, Marco Barber Grossi, Sharon Chen, Tai Wei Guo, Kathryn Havranek, Olivia Jebb, Minh Thu Nguyen, Sneha Panganamamula, Noah Smith, Shree Sundaresh, Jonathan Yu, Gabrielle Mayer","doi":"10.1093/jamia/ocae162","DOIUrl":"https://doi.org/10.1093/jamia/ocae162","url":null,"abstract":"<p><strong>Objectives: </strong>The goal of this case report is to detail experiences and challenges experienced in the training of Primary Care residents in secondary analysis using All of Us Researcher Workbench. At our large, urban safety net hospital, Primary Care/Internal Medicine residents in their third year undergo a research intensive block, the Research Practicum, where they work as a team to conduct secondary data analysis on a dataset with faculty facilitation. In 2023, this research block focused on use of the All of Us Researcher Workbench for secondary data analysis.</p><p><strong>Materials and methods: </strong>Two groups of 5 residents underwent training to access the All of Us Researcher Workbench, and each group explored available data with a faculty facilitator and generated original research questions. Two blocks of residents successfully completed their research blocks and created original presentations on \"social isolation and A1C\" levels and \"medical discrimination and diabetes management.\"</p><p><strong>Results: </strong>Departmental faculty were satisfied with the depth of learning and data exploration. In focus groups, some residents noted that for those without interest in performing research, the activity felt extraneous to their career goals, while others were glad for the opportunity to publish. In both blocks, residents highlighted dissatisfaction with the degree to which the All of Us Researcher Workbench was representative of patients they encounter in a large safety net hospital.</p><p><strong>Discussion: </strong>Using the All of Us Researcher Workbench provided residents with an opportunity to explore novel questions in a massive data source. Many residents however noted that because the population described in the All of Us Researcher Workbench appeared to be more highly educated and less racially diverse than patients they encounter in their practice, research may be hard to generalize in a community health context. Additionally, given that the data required knowledge of 1 of 2 code-based data analysis languages (R or Python) and work within an idiosyncratic coding environment, residents were heavily reliant on a faculty facilitator to assist with analysis.</p><p><strong>Conclusion: </strong>Using the All of Us Researcher Workbench for research training allowed residents to explore novel questions and gain first-hand exposure to opportunities and challenges in secondary data analysis.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TrajVis: a visual clinical decision support system to translate artificial intelligence trajectory models in the precision management of chronic kidney disease. TrajVis:可视化临床决策支持系统,将人工智能轨迹模型应用于慢性肾病的精准管理。
IF 4.7 2区 医学 Q1 Medicine Pub Date : 2024-06-25 DOI: 10.1093/jamia/ocae158
Zuotian Li, Xiang Liu, Ziyang Tang, Nanxin Jin, Pengyue Zhang, Michael T Eadon, Qianqian Song, Yingjie V Chen, Jing Su

Objective: Our objective is to develop and validate TrajVis, an interactive tool that assists clinicians in using artificial intelligence (AI) models to leverage patients' longitudinal electronic medical records (EMRs) for personalized precision management of chronic disease progression.

Materials and methods: We first perform requirement analysis with clinicians and data scientists to determine the visual analytics tasks of the TrajVis system as well as its design and functionalities. A graph AI model for chronic kidney disease (CKD) trajectory inference named DisEase PrOgression Trajectory (DEPOT) is used for system development and demonstration. TrajVis is implemented as a full-stack web application with synthetic EMR data derived from the Atrium Health Wake Forest Baptist Translational Data Warehouse and the Indiana Network for Patient Care research database. A case study with a nephrologist and a user experience survey of clinicians and data scientists are conducted to evaluate the TrajVis system.

Results: The TrajVis clinical information system is composed of 4 panels: the Patient View for demographic and clinical information, the Trajectory View to visualize the DEPOT-derived CKD trajectories in latent space, the Clinical Indicator View to elucidate longitudinal patterns of clinical features and interpret DEPOT predictions, and the Analysis View to demonstrate personal CKD progression trajectories. System evaluations suggest that TrajVis supports clinicians in summarizing clinical data, identifying individualized risk predictors, and visualizing patient disease progression trajectories, overcoming the barriers of AI implementation in healthcare.

Discussion: The TrajVis system provides a novel visualization solution which is complimentary to other risk estimators such as the Kidney Failure Risk Equations.

Conclusion: TrajVis bridges the gap between the fast-growing AI/ML modeling and the clinical use of such models for personalized and precision management of chronic diseases.

目标:我们的目标是开发并验证 TrajVis,这是一种交互式工具,可协助临床医生使用人工智能(AI)模型,利用患者的纵向电子病历(EMR)对慢性疾病进展进行个性化精准管理:我们首先与临床医生和数据科学家一起进行了需求分析,以确定 TrajVis 系统的可视化分析任务及其设计和功能。用于慢性肾脏病(CKD)轨迹推断的图人工智能模型被命名为 "疾病进展轨迹"(DEPOT),用于系统开发和演示。TrajVis 是作为一个全栈网络应用程序实施的,其合成 EMR 数据来自 Atrium Health Wake Forest Baptist Translational Data Warehouse 和 Indiana Network for Patient Care 研究数据库。为了评估 TrajVis 系统,我们对一名肾病专家进行了案例研究,并对临床医生和数据科学家进行了用户体验调查:TrajVis 临床信息系统由 4 个面板组成:患者视图用于显示人口统计学和临床信息;轨迹视图用于显示潜空间中 DEPOT 衍生的 CKD 轨迹;临床指标视图用于阐明临床特征的纵向模式并解释 DEPOT 预测;分析视图用于展示个人 CKD 进展轨迹。系统评估表明,TrajVis 支持临床医生总结临床数据、识别个体化风险预测因素和可视化患者疾病进展轨迹,克服了在医疗保健领域实施人工智能的障碍:讨论:TrajVis 系统提供了一种新颖的可视化解决方案,与肾衰竭风险方程等其他风险评估工具相辅相成:TrajVis弥补了快速发展的人工智能/ML建模与临床使用此类模型进行个性化和精准慢性病管理之间的差距。
{"title":"TrajVis: a visual clinical decision support system to translate artificial intelligence trajectory models in the precision management of chronic kidney disease.","authors":"Zuotian Li, Xiang Liu, Ziyang Tang, Nanxin Jin, Pengyue Zhang, Michael T Eadon, Qianqian Song, Yingjie V Chen, Jing Su","doi":"10.1093/jamia/ocae158","DOIUrl":"https://doi.org/10.1093/jamia/ocae158","url":null,"abstract":"<p><strong>Objective: </strong>Our objective is to develop and validate TrajVis, an interactive tool that assists clinicians in using artificial intelligence (AI) models to leverage patients' longitudinal electronic medical records (EMRs) for personalized precision management of chronic disease progression.</p><p><strong>Materials and methods: </strong>We first perform requirement analysis with clinicians and data scientists to determine the visual analytics tasks of the TrajVis system as well as its design and functionalities. A graph AI model for chronic kidney disease (CKD) trajectory inference named DisEase PrOgression Trajectory (DEPOT) is used for system development and demonstration. TrajVis is implemented as a full-stack web application with synthetic EMR data derived from the Atrium Health Wake Forest Baptist Translational Data Warehouse and the Indiana Network for Patient Care research database. A case study with a nephrologist and a user experience survey of clinicians and data scientists are conducted to evaluate the TrajVis system.</p><p><strong>Results: </strong>The TrajVis clinical information system is composed of 4 panels: the Patient View for demographic and clinical information, the Trajectory View to visualize the DEPOT-derived CKD trajectories in latent space, the Clinical Indicator View to elucidate longitudinal patterns of clinical features and interpret DEPOT predictions, and the Analysis View to demonstrate personal CKD progression trajectories. System evaluations suggest that TrajVis supports clinicians in summarizing clinical data, identifying individualized risk predictors, and visualizing patient disease progression trajectories, overcoming the barriers of AI implementation in healthcare.</p><p><strong>Discussion: </strong>The TrajVis system provides a novel visualization solution which is complimentary to other risk estimators such as the Kidney Failure Risk Equations.</p><p><strong>Conclusion: </strong>TrajVis bridges the gap between the fast-growing AI/ML modeling and the clinical use of such models for personalized and precision management of chronic diseases.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141447484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disparities in ABO blood type determination across diverse ancestries: a systematic review and validation in the All of Us Research Program. 不同血统 ABO 血型测定的差异:"我们所有人 "研究计划的系统回顾和验证。
IF 4.7 2区 医学 Q1 Medicine Pub Date : 2024-06-25 DOI: 10.1093/jamia/ocae161
Kiana L Martinez, Andrew Klein, Jennifer R Martin, Chinwuwanuju U Sampson, Jason B Giles, Madison L Beck, Krupa Bhakta, Gino Quatraro, Juvie Farol, Jason H Karnes

Objectives: ABO blood types have widespread clinical use and robust associations with disease. The purpose of this study is to evaluate the portability and suitability of tag single-nucleotide polymorphisms (tSNPs) used to determine ABO alleles and blood types across diverse populations in published literature.

Materials and methods: Bibliographic databases were searched for studies using tSNPs to determine ABO alleles. We calculated linkage between tSNPs and functional variants across inferred continental ancestry groups from 1000 Genomes. We compared r2 across ancestry and assessed real-world consequences by comparing tSNP-derived blood types to serology in a diverse population from the All of Us Research Program.

Results: Linkage between functional variants and O allele tSNPs was significantly lower in African (median r2 = 0.443) compared to East Asian (r2 = 0.946, P = 1.1 × 10-5) and European (r2 = 0.869, P = .023) populations. In All of Us, discordance between tSNP-derived blood types and serology was high across all SNPs in African ancestry individuals and linkage was strongly correlated with discordance across all ancestries (ρ = -0.90, P = 3.08 × 10-23).

Discussion: Many studies determine ABO blood types using tSNPs. However, tSNPs with low linkage disequilibrium promote misinference of ABO blood types, particularly in diverse populations. We observe common use of inappropriate tSNPs to determine ABO blood type, particularly for O alleles and with some tSNPs mistyping up to 58% of individuals.

Conclusion: Our results highlight the lack of transferability of tSNPs across ancestries and potential exacerbation of disparities in genomic research for underrepresented populations. This is especially relevant as more diverse cohorts are made publicly available.

目的:ABO 血型在临床上广泛使用,并与疾病密切相关。本研究旨在评估已发表文献中用于确定不同人群 ABO 等位基因和血型的标记单核苷酸多态性(tSNPs)的可移植性和适用性:我们在文献数据库中搜索了使用 tSNPs 确定 ABO 等位基因的研究。我们计算了从 1000 个基因组中推断出的大陆祖先群体中 tSNPs 与功能变异之间的联系。我们比较了不同祖先的 r2,并通过比较 tSNP 导出的血型与 "我们所有人研究计划 "中不同人群的血清学来评估现实世界的后果:结果:与东亚人(r2 = 0.946,P = 1.1 × 10-5)和欧洲人(r2 = 0.869,P = .023)相比,非洲人(中位数 r2 = 0.443)的功能变异和 O 等位基因 tSNPs 之间的联系明显较低。在 "我们所有人 "中,在非洲血统个体的所有 SNPs 上,tSNP 导出的血型与血清学之间的不一致性都很高,而且连接与所有血统的不一致性密切相关(ρ = -0.90,P = 3.08 × 10-23):讨论:许多研究利用 tSNPs 确定 ABO 血型。然而,低连锁不平衡的 tSNPs 会导致 ABO 血型的错误推断,尤其是在不同的人群中。我们观察到使用不恰当的 tSNPs 来确定 ABO 血型的情况很普遍,尤其是 O 等位基因,有些 tSNPs 可误判高达 58% 的个体:我们的研究结果凸显了 tSNPs 缺乏跨血统的可转移性,可能会加剧代表性不足人群在基因组研究中的不平等。随着更多不同的队列被公开,这一点尤为重要。
{"title":"Disparities in ABO blood type determination across diverse ancestries: a systematic review and validation in the All of Us Research Program.","authors":"Kiana L Martinez, Andrew Klein, Jennifer R Martin, Chinwuwanuju U Sampson, Jason B Giles, Madison L Beck, Krupa Bhakta, Gino Quatraro, Juvie Farol, Jason H Karnes","doi":"10.1093/jamia/ocae161","DOIUrl":"https://doi.org/10.1093/jamia/ocae161","url":null,"abstract":"<p><strong>Objectives: </strong>ABO blood types have widespread clinical use and robust associations with disease. The purpose of this study is to evaluate the portability and suitability of tag single-nucleotide polymorphisms (tSNPs) used to determine ABO alleles and blood types across diverse populations in published literature.</p><p><strong>Materials and methods: </strong>Bibliographic databases were searched for studies using tSNPs to determine ABO alleles. We calculated linkage between tSNPs and functional variants across inferred continental ancestry groups from 1000 Genomes. We compared r2 across ancestry and assessed real-world consequences by comparing tSNP-derived blood types to serology in a diverse population from the All of Us Research Program.</p><p><strong>Results: </strong>Linkage between functional variants and O allele tSNPs was significantly lower in African (median r2 = 0.443) compared to East Asian (r2 = 0.946, P = 1.1 × 10-5) and European (r2 = 0.869, P = .023) populations. In All of Us, discordance between tSNP-derived blood types and serology was high across all SNPs in African ancestry individuals and linkage was strongly correlated with discordance across all ancestries (ρ = -0.90, P = 3.08 × 10-23).</p><p><strong>Discussion: </strong>Many studies determine ABO blood types using tSNPs. However, tSNPs with low linkage disequilibrium promote misinference of ABO blood types, particularly in diverse populations. We observe common use of inappropriate tSNPs to determine ABO blood type, particularly for O alleles and with some tSNPs mistyping up to 58% of individuals.</p><p><strong>Conclusion: </strong>Our results highlight the lack of transferability of tSNPs across ancestries and potential exacerbation of disparities in genomic research for underrepresented populations. This is especially relevant as more diverse cohorts are made publicly available.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the American Medical Informatics Association
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1