首页 > 最新文献

Journal of the American Medical Informatics Association最新文献

英文 中文
Efficacy of the mLab App: a randomized clinical trial for increasing HIV testing uptake using mobile technology. mLab 应用程序的功效:利用移动技术提高艾滋病病毒检测接受率的随机临床试验。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-19 DOI: 10.1093/jamia/ocae261
Rebecca Schnall, Thomas Foster Scherr, Lisa M Kuhns, Patrick Janulis, Haomiao Jia, Olivia R Wood, Michael Almodovar, Robert Garofalo

Objective: To determine the efficacy of the mLab App, a mobile-delivered HIV prevention intervention to increase HIV self-testing in MSM and TGW.

Materials and methods: This was a randomized (2:2:1) clinical trial of the efficacy the mLab App as compared to standard of care vs mailed home HIV test arm among 525 MSM and TGW aged 18-29 years to increase HIV testing.

Results: The mLab App arm participants demonstrated an increase from 35.1% reporting HIV testing in the prior 6 months compared to 88.5% at 6 months. In contrast, 28.8% of control participants reported an HIV test at baseline, which only increased to 65.1% at 6 months. In a generalized linear mixed model estimating this change and controlling for multiple observations of participants, this equated to control participants reporting a 61.2% smaller increase in HIV testing relative to mLab participants (P = .001) at 6 months. This difference was maintained at 12 months with control participants reporting an 82.6% smaller increase relative to mLab App participants (P < .001) from baseline to 12 months.

Discussion and conclusion: Findings suggest that the mLab App is well-supported, evidence-based, behavioral risk-reduction intervention for increasing HIV testing rates as compared to the standard of care, suggesting that this may be a useful behavioral risk-reduction intervention for increasing HIV testing among young MSM.

Trial registration: This trial was registered with Clinicaltrials.gov NCT03803683.

目的确定 mLab 应用程序的疗效,该应用程序是一种移动艾滋病毒预防干预措施,旨在提高 MSM 和 TGW 的艾滋病毒自我检测率:这是一项随机(2:2:1)临床试验,目的是在 525 名 18-29 岁的 MSM 和 TGW 中,比较 mLab App 与标准护理和邮寄家庭 HIV 检测工具的效果,以增加 HIV 检测:结果:mLab 应用程序组的参与者在过去 6 个月中报告接受 HIV 检测的比例从 35.1%上升到 6 个月时的 88.5%。相比之下,28.8% 的对照组参与者在基线时报告进行了 HIV 检测,而在 6 个月时仅增加到 65.1%。通过广义线性混合模型对这一变化进行估算,并对参与者的多重观察结果进行控制,结果显示,对照组参与者在 6 个月时报告的 HIV 检测率比 mLab 参与者低 61.2%(P = .001)。这一差异在 12 个月时得以保持,对照组参与者的艾滋病检测率比 mLab 应用程序参与者低 82.6%(P 讨论和结论:研究结果表明,与标准护理相比,mLab 应用程序是得到充分支持的循证行为风险降低干预措施,可提高 HIV 检测率,这表明它可能是提高年轻 MSM HIV 检测率的有效行为风险降低干预措施:该试验已在 Clinicaltrials.gov NCT03803683 上注册。
{"title":"Efficacy of the mLab App: a randomized clinical trial for increasing HIV testing uptake using mobile technology.","authors":"Rebecca Schnall, Thomas Foster Scherr, Lisa M Kuhns, Patrick Janulis, Haomiao Jia, Olivia R Wood, Michael Almodovar, Robert Garofalo","doi":"10.1093/jamia/ocae261","DOIUrl":"https://doi.org/10.1093/jamia/ocae261","url":null,"abstract":"<p><strong>Objective: </strong>To determine the efficacy of the mLab App, a mobile-delivered HIV prevention intervention to increase HIV self-testing in MSM and TGW.</p><p><strong>Materials and methods: </strong>This was a randomized (2:2:1) clinical trial of the efficacy the mLab App as compared to standard of care vs mailed home HIV test arm among 525 MSM and TGW aged 18-29 years to increase HIV testing.</p><p><strong>Results: </strong>The mLab App arm participants demonstrated an increase from 35.1% reporting HIV testing in the prior 6 months compared to 88.5% at 6 months. In contrast, 28.8% of control participants reported an HIV test at baseline, which only increased to 65.1% at 6 months. In a generalized linear mixed model estimating this change and controlling for multiple observations of participants, this equated to control participants reporting a 61.2% smaller increase in HIV testing relative to mLab participants (P = .001) at 6 months. This difference was maintained at 12 months with control participants reporting an 82.6% smaller increase relative to mLab App participants (P < .001) from baseline to 12 months.</p><p><strong>Discussion and conclusion: </strong>Findings suggest that the mLab App is well-supported, evidence-based, behavioral risk-reduction intervention for increasing HIV testing rates as compared to the standard of care, suggesting that this may be a useful behavioral risk-reduction intervention for increasing HIV testing among young MSM.</p><p><strong>Trial registration: </strong>This trial was registered with Clinicaltrials.gov NCT03803683.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142669892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning-based prediction models in medical decision-making in kidney disease: patient, caregiver, and clinician perspectives on trust and appropriate use. 肾病医疗决策中基于机器学习的预测模型:患者、护理人员和临床医生对信任和适当使用的看法。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-14 DOI: 10.1093/jamia/ocae255
Jessica Sperling, Whitney Welsh, Erin Haseley, Stella Quenstedt, Perusi B Muhigaba, Adrian Brown, Patti Ephraim, Tariq Shafi, Michael Waitzkin, David Casarett, Benjamin A Goldstein

Objectives: This study aims to improve the ethical use of machine learning (ML)-based clinical prediction models (CPMs) in shared decision-making for patients with kidney failure on dialysis. We explore factors that inform acceptability, interpretability, and implementation of ML-based CPMs among multiple constituent groups.

Materials and methods: We collected and analyzed qualitative data from focus groups with varied end users, including: dialysis support providers (clinical providers and additional dialysis support providers such as dialysis clinic staff and social workers); patients; patients' caregivers (n = 52).

Results: Participants were broadly accepting of ML-based CPMs, but with concerns on data sources, factors included in the model, and accuracy. Use was desired in conjunction with providers' views and explanations. Differences among respondent types were minimal overall but most prevalent in discussions of CPM presentation and model use.

Discussion and conclusion: Evidence of acceptability of ML-based CPM usage provides support for ethical use, but numerous specific considerations in acceptability, model construction, and model use for shared clinical decision-making must be considered. There are specific steps that could be taken by data scientists and health systems to engender use that is accepted by end users and facilitates trust, but there are also ongoing barriers or challenges in addressing desires for use. This study contributes to emerging literature on interpretability, mechanisms for sharing complexities, including uncertainty regarding the model results, and implications for decision-making. It examines numerous stakeholder groups including providers, patients, and caregivers to provide specific considerations that can influence health system use and provide a basis for future research.

研究目的本研究旨在改善基于机器学习(ML)的临床预测模型(CPM)在透析肾衰竭患者共同决策中的伦理使用。我们探讨了多个组成群体对基于机器学习的临床预测模型的可接受性、可解释性和实施情况的影响因素:我们收集并分析了来自焦点小组的定性数据,这些焦点小组由不同的终端用户组成,包括:透析支持服务提供者(临床服务提供者和其他透析支持服务提供者,如透析诊所工作人员和社会工作者);患者;患者的护理人员(n = 52):结果:参与者普遍接受基于 ML 的 CPM,但对数据来源、模型中包含的因素和准确性表示担忧。他们希望结合医疗服务提供者的观点和解释来使用。受访者类型之间的差异总体上很小,但在 CPM 演示和模型使用的讨论中最为普遍:基于 ML 的 CPM 使用的可接受性证据为道德使用提供了支持,但在可接受性、模型构建和临床决策共享模型使用方面必须考虑许多具体因素。数据科学家和医疗系统可以采取一些具体步骤来促进最终用户接受和信任的使用,但在满足使用愿望方面也存在持续的障碍或挑战。本研究为有关可解释性、复杂性共享机制(包括模型结果的不确定性)以及对决策的影响的新兴文献做出了贡献。它对包括医疗服务提供者、患者和护理人员在内的众多利益相关者群体进行了研究,以提供可影响医疗系统使用的具体考虑因素,并为未来的研究奠定基础。
{"title":"Machine learning-based prediction models in medical decision-making in kidney disease: patient, caregiver, and clinician perspectives on trust and appropriate use.","authors":"Jessica Sperling, Whitney Welsh, Erin Haseley, Stella Quenstedt, Perusi B Muhigaba, Adrian Brown, Patti Ephraim, Tariq Shafi, Michael Waitzkin, David Casarett, Benjamin A Goldstein","doi":"10.1093/jamia/ocae255","DOIUrl":"10.1093/jamia/ocae255","url":null,"abstract":"<p><strong>Objectives: </strong>This study aims to improve the ethical use of machine learning (ML)-based clinical prediction models (CPMs) in shared decision-making for patients with kidney failure on dialysis. We explore factors that inform acceptability, interpretability, and implementation of ML-based CPMs among multiple constituent groups.</p><p><strong>Materials and methods: </strong>We collected and analyzed qualitative data from focus groups with varied end users, including: dialysis support providers (clinical providers and additional dialysis support providers such as dialysis clinic staff and social workers); patients; patients' caregivers (n = 52).</p><p><strong>Results: </strong>Participants were broadly accepting of ML-based CPMs, but with concerns on data sources, factors included in the model, and accuracy. Use was desired in conjunction with providers' views and explanations. Differences among respondent types were minimal overall but most prevalent in discussions of CPM presentation and model use.</p><p><strong>Discussion and conclusion: </strong>Evidence of acceptability of ML-based CPM usage provides support for ethical use, but numerous specific considerations in acceptability, model construction, and model use for shared clinical decision-making must be considered. There are specific steps that could be taken by data scientists and health systems to engender use that is accepted by end users and facilitates trust, but there are also ongoing barriers or challenges in addressing desires for use. This study contributes to emerging literature on interpretability, mechanisms for sharing complexities, including uncertainty regarding the model results, and implications for decision-making. It examines numerous stakeholder groups including providers, patients, and caregivers to provide specific considerations that can influence health system use and provide a basis for future research.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research for all: building a diverse researcher community for the All of Us Research Program. 全民研究:为 "全民研究计划 "建立一个多元化的研究人员社区。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-14 DOI: 10.1093/jamia/ocae270
Rubin Baskir, Minnkyong Lee, Sydney J McMaster, Jessica Lee, Faith Blackburne-Proctor, Romuladus Azuine, Nakia Mack, Sheri D Schully, Martin Mendoza, Janeth Sanchez, Yong Crosby, Erica Zumba, Michael Hahn, Naomi Aspaas, Ahmed Elmi, Shanté Alerté, Elizabeth Stewart, Danielle Wilfong, Meag Doherty, Margaret M Farrell, Grace B Hébert, Sula Hood, Cheryl M Thomas, Debra D Murray, Brendan Lee, Louisa A Stark, Megan A Lewis, Jen D Uhrig, Laura R Bartlett, Edgar Gil Rico, Adolph Falcón, Elizabeth Cohn, Mitchell R Lunn, Juno Obedin-Maliver, Linda Cottler, Milton Eder, Fornessa T Randal, Jason Karnes, KiTani Lemieux, Nelson Lemieux, Nelson Lemieux, Lilanta Bradley, Ronnie Tepp, Meredith Wilson, Monica Rodriguez, Chris Lunt, Karriem Watson

Objectives: The NIH All of Us Research Program (All of Us) is engaging a diverse community of more than 10 000 registered researchers using a robust engagement ecosystem model. We describe strategies used to build an ecosystem that attracts and supports a diverse and inclusive researcher community to use the All of Us dataset and provide metrics on All of Us researcher usage growth.

Materials and methods: Researcher audiences and diversity categories were defined to guide a strategy. A researcher engagement strategy was codeveloped with program partners to support a researcher engagement ecosystem. An adapted ecological model guided the ecosystem to address multiple levels of influence to support All of Us data use. Statistics from the All of Us Researcher Workbench demographic survey describe trends in researchers' and institutional use of the Workbench and publication numbers.

Results: From 2022 to 2024, some 13 partner organizations and their subawardees conducted outreach, built capacity, or supported researchers and institutions in using the data. Trends indicate that Workbench registrations and use have increased over time, including among researchers underrepresented in the biomedical workforce. Data Use and Registration Agreements from minority-serving institutions also increased.

Discussion: All of Us built a diverse, inclusive, and growing research community via intentional engagement with researchers and via partnerships to address systemic data access issues. Future programs will provide additional support to researchers and institutions to ameliorate All of Us data use challenges.

Conclusion: The approach described helps address structural inequities in the biomedical research field to advance health equity.

目标:美国国立卫生研究院的 "我们所有人研究计划"(All of Us)正在利用一个强大的参与生态系统模式吸引一个由 10,000 多名注册研究人员组成的多元化社区。我们描述了为建立一个吸引和支持多元化、包容性研究人员社区使用 All of Us 数据集的生态系统所采用的策略,并提供了有关 All of Us 研究人员使用量增长的指标:定义研究人员受众和多样性类别,为战略提供指导。与项目合作伙伴共同制定了研究人员参与战略,以支持研究人员参与生态系统。一个经过调整的生态模型为生态系统提供指导,以解决多层次的影响问题,支持 "我们所有 "数据的使用。来自 "我们所有 "研究人员工作台人口调查的统计数据描述了研究人员和机构使用工作台的趋势以及发表论文的数量:从 2022 年到 2024 年,约有 13 个合作伙伴组织及其次级受款人开展了外联活动、能力建设或支持研究人员和机构使用数据。趋势表明,随着时间的推移,Workbench 的注册量和使用量都在增加,其中包括在生物医学队伍中代表性不足的研究人员。来自少数民族服务机构的数据使用和注册协议也有所增加:讨论:"我们所有人 "计划通过有意识地与研究人员接触,并通过合作伙伴关系来解决系统性数据访问问题,从而建立了一个多样化、包容性和不断发展的研究社区。未来的计划将为研究人员和机构提供更多支持,以改善 "我们所有 "数据使用方面的挑战:结论:所述方法有助于解决生物医学研究领域的结构性不平等问题,从而促进健康公平。
{"title":"Research for all: building a diverse researcher community for the All of Us Research Program.","authors":"Rubin Baskir, Minnkyong Lee, Sydney J McMaster, Jessica Lee, Faith Blackburne-Proctor, Romuladus Azuine, Nakia Mack, Sheri D Schully, Martin Mendoza, Janeth Sanchez, Yong Crosby, Erica Zumba, Michael Hahn, Naomi Aspaas, Ahmed Elmi, Shanté Alerté, Elizabeth Stewart, Danielle Wilfong, Meag Doherty, Margaret M Farrell, Grace B Hébert, Sula Hood, Cheryl M Thomas, Debra D Murray, Brendan Lee, Louisa A Stark, Megan A Lewis, Jen D Uhrig, Laura R Bartlett, Edgar Gil Rico, Adolph Falcón, Elizabeth Cohn, Mitchell R Lunn, Juno Obedin-Maliver, Linda Cottler, Milton Eder, Fornessa T Randal, Jason Karnes, KiTani Lemieux, Nelson Lemieux, Nelson Lemieux, Lilanta Bradley, Ronnie Tepp, Meredith Wilson, Monica Rodriguez, Chris Lunt, Karriem Watson","doi":"10.1093/jamia/ocae270","DOIUrl":"10.1093/jamia/ocae270","url":null,"abstract":"<p><strong>Objectives: </strong>The NIH All of Us Research Program (All of Us) is engaging a diverse community of more than 10 000 registered researchers using a robust engagement ecosystem model. We describe strategies used to build an ecosystem that attracts and supports a diverse and inclusive researcher community to use the All of Us dataset and provide metrics on All of Us researcher usage growth.</p><p><strong>Materials and methods: </strong>Researcher audiences and diversity categories were defined to guide a strategy. A researcher engagement strategy was codeveloped with program partners to support a researcher engagement ecosystem. An adapted ecological model guided the ecosystem to address multiple levels of influence to support All of Us data use. Statistics from the All of Us Researcher Workbench demographic survey describe trends in researchers' and institutional use of the Workbench and publication numbers.</p><p><strong>Results: </strong>From 2022 to 2024, some 13 partner organizations and their subawardees conducted outreach, built capacity, or supported researchers and institutions in using the data. Trends indicate that Workbench registrations and use have increased over time, including among researchers underrepresented in the biomedical workforce. Data Use and Registration Agreements from minority-serving institutions also increased.</p><p><strong>Discussion: </strong>All of Us built a diverse, inclusive, and growing research community via intentional engagement with researchers and via partnerships to address systemic data access issues. Future programs will provide additional support to researchers and institutions to ameliorate All of Us data use challenges.</p><p><strong>Conclusion: </strong>The approach described helps address structural inequities in the biomedical research field to advance health equity.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning health system linchpins: information exchange and a common data model. 学习卫生系统的关键:信息交换和通用数据模型。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-13 DOI: 10.1093/jamia/ocae277
Aaron S Eisman, Elizabeth S Chen, Wen-Chih Wu, Karen M Crowley, Dilum P Aluthge, Katherine Brown, Indra Neil Sarkar

Objective: To demonstrate the potential for a centrally managed health information exchange standardized to a common data model (HIE-CDM) to facilitate semantic data flow needed to support a learning health system (LHS).

Materials and methods: The Rhode Island Quality Institute operates the Rhode Island (RI) statewide HIE, which aggregates RI health data for more than half of the state's population from 47 data partners. We standardized HIE data to the Observational Medical Outcomes Partnership (OMOP) CDM. Atherosclerotic cardiovascular disease (ASCVD) risk and primary prevention practices were selected to demonstrate LHS semantic data flow from 2013 to 2023.

Results: We calculated longitudinal 10-year ASCVD risk on 62,999 individuals. Nearly two-thirds had ASCVD risk factors from more than one data partner. This enabled granular tracking of individual ASCVD risk, primary prevention (ie, statin therapy), and incident disease. The population was on statins for fewer than half of the guideline-recommended days. We also found that individuals receiving care at Federally Qualified Health Centers were more likely to have unfavorable ASCVD risk profiles and more likely to be on statins. CDM transformation reduced data heterogeneity through a unified health record that adheres to defined terminologies per OMOP domain.

Discussion: We demonstrated the potential for an HIE-CDM to enable observational population health research. We also showed how to leverage existing health information technology infrastructure and health data best practices to break down LHS barriers.

Conclusion: HIE-CDM facilitates knowledge curation and health system intervention development at the individual, health system, and population levels.

目的展示以通用数据模型(HIE-CDM)为标准的集中管理式医疗信息交换的潜力,以促进支持学习型医疗系统(LHS)所需的语义数据流:罗德岛质量研究所运营着罗德岛(RI)全州范围的 HIE,该 HIE 从 47 个数据合作伙伴处汇集了罗德岛半数以上人口的健康数据。我们将 HIE 数据标准化为观察性医疗结果合作组织 (OMOP) CDM。我们选择了动脉粥样硬化性心血管疾病(ASCVD)风险和一级预防实践,以展示从 2013 年到 2023 年的 LHS 语义数据流:我们计算了 62999 人的 10 年纵向 ASCVD 风险。近三分之二的人的 ASCVD 风险因素来自一个以上的数据合作伙伴。这样就可以对个人的 ASCVD 风险、一级预防(即他汀类药物治疗)和突发疾病进行细粒度跟踪。该人群使用他汀类药物的天数不到指南推荐天数的一半。我们还发现,在联邦合格医疗中心接受治疗的人更有可能具有不利的 ASCVD 风险特征,也更有可能服用他汀类药物。CDM 转换通过统一的健康记录减少了数据的异质性,该健康记录遵循每个 OMOP 领域的定义术语:我们展示了 HIE-CDM 在开展人口健康观察研究方面的潜力。我们还展示了如何利用现有的健康信息技术基础设施和健康数据最佳实践来打破 LHS 的障碍:HIE-CDM有助于在个人、卫生系统和人口层面进行知识整理和卫生系统干预开发。
{"title":"Learning health system linchpins: information exchange and a common data model.","authors":"Aaron S Eisman, Elizabeth S Chen, Wen-Chih Wu, Karen M Crowley, Dilum P Aluthge, Katherine Brown, Indra Neil Sarkar","doi":"10.1093/jamia/ocae277","DOIUrl":"https://doi.org/10.1093/jamia/ocae277","url":null,"abstract":"<p><strong>Objective: </strong>To demonstrate the potential for a centrally managed health information exchange standardized to a common data model (HIE-CDM) to facilitate semantic data flow needed to support a learning health system (LHS).</p><p><strong>Materials and methods: </strong>The Rhode Island Quality Institute operates the Rhode Island (RI) statewide HIE, which aggregates RI health data for more than half of the state's population from 47 data partners. We standardized HIE data to the Observational Medical Outcomes Partnership (OMOP) CDM. Atherosclerotic cardiovascular disease (ASCVD) risk and primary prevention practices were selected to demonstrate LHS semantic data flow from 2013 to 2023.</p><p><strong>Results: </strong>We calculated longitudinal 10-year ASCVD risk on 62,999 individuals. Nearly two-thirds had ASCVD risk factors from more than one data partner. This enabled granular tracking of individual ASCVD risk, primary prevention (ie, statin therapy), and incident disease. The population was on statins for fewer than half of the guideline-recommended days. We also found that individuals receiving care at Federally Qualified Health Centers were more likely to have unfavorable ASCVD risk profiles and more likely to be on statins. CDM transformation reduced data heterogeneity through a unified health record that adheres to defined terminologies per OMOP domain.</p><p><strong>Discussion: </strong>We demonstrated the potential for an HIE-CDM to enable observational population health research. We also showed how to leverage existing health information technology infrastructure and health data best practices to break down LHS barriers.</p><p><strong>Conclusion: </strong>HIE-CDM facilitates knowledge curation and health system intervention development at the individual, health system, and population levels.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Oncointerpreter.ai enables interactive, personalized summarization of cancer diagnostics data. Oncointerpreter.ai 可对癌症诊断数据进行交互式、个性化的汇总。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-13 DOI: 10.1093/jamia/ocae284
Arihant Tripathi, Brett Ecker, Patrick Boland, Saum Ghodoussipour, Gregory R Riedlinger, Subhajyoti De

Objectives: Cancer diagnosis comes as a shock to many patients, and many of them feel unprepared to handle the complexity of the life-changing event, understand technicalities of the diagnostic reports, and fully engage with the clinical team regarding the personalized clinical decision-making.

Materials and methods: We develop Oncointerpreter.ai an interactive resource to offer personalized summarization of clinical cancer genomic and pathological data, and frame questions or address queries about therapeutic opportunities in near-real time via a graphical interface. It is built on the Mistral-7B and Llama-2 7B large language models trained on a local database trained using a large, curated corpus.

Results: We showcase its utility with case studies, where Oncointerpreter.ai extracted key clinical and molecular attributes from deidentified pathology and clinical genomics reports, summarized their contextual significance and answered queries on pertinent treatment options. Oncointerpreter also provided personalized summary of currently active clinical trials that match the patients' disease status, their selection criteria, and geographic locations. Benchmarking and comparative assessment indicated that the model responses were generally consistent, and hallucination, ie, factually incorrect or nonsensical response was rare; treatment- and outcome related queries led to context-aware responses, and response time correlated with verbosity.

Discussion: The choice of model and domain-specific training also affected the response quality.

Conclusion: Oncointerpreter.ai can aid the existing clinical care with interactive, individualized summarization of diagnostics data to promote informed dialogs with the patients with new cancer diagnoses.

Availability: https://github.com/Siris2314/Oncointerpreter.

目的:癌症诊断对许多患者来说是一个打击,他们中的许多人感到没有准备好应对这一改变生命事件的复杂性、理解诊断报告的技术细节,以及就个性化临床决策与临床团队充分互动:我们开发的 Oncointerpreter.ai 是一种交互式资源,可对临床癌症基因组和病理数据进行个性化总结,并通过图形界面近乎实时地提出问题或解决有关治疗机会的疑问。它建立在 Mistral-7B 和 Llama-2 7B 大型语言模型的基础上,这些模型是在一个本地数据库中使用大型语料库训练而成的:我们通过案例研究展示了 Oncointerpreter.ai 的实用性。在案例研究中,Oncointerpreter.ai 从去标识化的病理和临床基因组学报告中提取关键的临床和分子属性,总结其背景意义,并回答相关治疗方案的查询。Oncointerpreter 还提供了与患者疾病状况、选择标准和地理位置相匹配的当前活跃临床试验的个性化摘要。基准测试和比较评估结果表明,模型的回答基本一致,很少出现幻觉,即回答与事实不符或无意义的情况;与治疗和结果相关的询问都能得到上下文感知的回答,而回答时间则与语言冗长度相关:讨论:模型的选择和特定领域的训练也会影响响应质量:Oncointerpreter.ai可以通过对诊断数据进行交互式、个性化的总结来帮助现有的临床治疗,从而促进与新诊断出癌症的患者进行知情对话。可用性:https://github.com/Siris2314/Oncointerpreter。
{"title":"Oncointerpreter.ai enables interactive, personalized summarization of cancer diagnostics data.","authors":"Arihant Tripathi, Brett Ecker, Patrick Boland, Saum Ghodoussipour, Gregory R Riedlinger, Subhajyoti De","doi":"10.1093/jamia/ocae284","DOIUrl":"10.1093/jamia/ocae284","url":null,"abstract":"<p><strong>Objectives: </strong>Cancer diagnosis comes as a shock to many patients, and many of them feel unprepared to handle the complexity of the life-changing event, understand technicalities of the diagnostic reports, and fully engage with the clinical team regarding the personalized clinical decision-making.</p><p><strong>Materials and methods: </strong>We develop Oncointerpreter.ai an interactive resource to offer personalized summarization of clinical cancer genomic and pathological data, and frame questions or address queries about therapeutic opportunities in near-real time via a graphical interface. It is built on the Mistral-7B and Llama-2 7B large language models trained on a local database trained using a large, curated corpus.</p><p><strong>Results: </strong>We showcase its utility with case studies, where Oncointerpreter.ai extracted key clinical and molecular attributes from deidentified pathology and clinical genomics reports, summarized their contextual significance and answered queries on pertinent treatment options. Oncointerpreter also provided personalized summary of currently active clinical trials that match the patients' disease status, their selection criteria, and geographic locations. Benchmarking and comparative assessment indicated that the model responses were generally consistent, and hallucination, ie, factually incorrect or nonsensical response was rare; treatment- and outcome related queries led to context-aware responses, and response time correlated with verbosity.</p><p><strong>Discussion: </strong>The choice of model and domain-specific training also affected the response quality.</p><p><strong>Conclusion: </strong>Oncointerpreter.ai can aid the existing clinical care with interactive, individualized summarization of diagnostics data to promote informed dialogs with the patients with new cancer diagnoses.</p><p><strong>Availability: </strong>https://github.com/Siris2314/Oncointerpreter.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of six natural language processing approaches to assessing firearm access in Veterans Health Administration electronic health records. 比较六种自然语言处理方法,以评估退伍军人健康管理局电子健康记录中的枪支使用情况。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-12 DOI: 10.1093/jamia/ocae169
Joshua Trujeque, R Adams Dudley, Nathan Mesfin, Nicholas E Ingraham, Isai Ortiz, Ann Bangerter, Anjan Chakraborty, Dalton Schutte, Jeremy Yeung, Ying Liu, Alicia Woodward-Abel, Emma Bromley, Rui Zhang, Lisa A Brenner, Joseph A Simonetti

Objective: Access to firearms is associated with increased suicide risk. Our aim was to develop a natural language processing approach to characterizing firearm access in clinical records.

Materials and methods: We used clinical notes from 36 685 Veterans Health Administration (VHA) patients between April 10, 2023 and April 10, 2024. We expanded preexisting firearm term sets using subject matter experts and generated 250-character snippets around each firearm term appearing in notes. Annotators labeled 3000 snippets into three classes. Using these annotated snippets, we compared four nonneural machine learning models (random forest, bagging, gradient boosting, logistic regression with ridge penalization) and two versions of Bidirectional Encoder Representations from Transformers, or BERT (specifically, BioBERT and Bio-ClinicalBERT) for classifying firearm access as "definite access", "definitely no access", or "other".

Results: Firearm terms were identified in 36 685 patient records (41.3%), 33.7% of snippets were categorized as definite access, 9.0% as definitely no access, and 57.2% as "other". Among models classifying firearm access, five of six had acceptable performance, with BioBERT and Bio-ClinicalBERT performing best, with F1s of 0.876 (95% confidence interval, 0.874-0.879) and 0.896 (95% confidence interval, 0.894-0.899), respectively.

Discussion and conclusion: Firearm-related terminology is common in the clinical records of VHA patients. The ability to use text to identify and characterize patients' firearm access could enhance suicide prevention efforts, and five of our six models could be used to identify patients for clinical interventions.

目的:接触枪支与自杀风险增加有关。我们的目的是开发一种自然语言处理方法来描述临床记录中的枪支使用情况:我们使用了 2023 年 4 月 10 日至 2024 年 4 月 10 日期间 36 685 名退伍军人健康管理局(VHA)患者的临床记录。我们利用主题专家扩充了已有的枪支术语集,并围绕笔记中出现的每个枪支术语生成了 250 个字符的片段。注释者将 3000 个片段分为三类。利用这些标注片段,我们比较了四种非神经机器学习模型(随机森林、bagging、梯度提升、带山脊惩罚的逻辑回归)和两个版本的双向编码器表征转换器(Bidirectional Encoder Representations from Transformers,简称 BERT)(特别是 BioBERT 和 Bio-ClinicalBERT),以将枪支接触分为 "肯定接触"、"肯定不接触 "或 "其他":在 36 685 份病历(41.3%)中识别出了枪支术语,33.7% 的片段被归类为明确接触枪支,9.0% 被归类为明确不接触枪支,57.2% 被归类为 "其他"。在对使用枪支进行分类的模型中,六个模型中有五个的性能可以接受,其中 BioBERT 和 Bio-ClinicalBERT 的性能最好,F1 分别为 0.876(95% 置信区间,0.874-0.879)和 0.896(95% 置信区间,0.894-0.899):在退伍军人事务部患者的临床记录中,与枪支有关的术语很常见。利用文本识别和描述患者使用枪支情况的能力可以加强自杀预防工作,我们的六个模型中有五个模型可用于识别患者以进行临床干预。
{"title":"Comparison of six natural language processing approaches to assessing firearm access in Veterans Health Administration electronic health records.","authors":"Joshua Trujeque, R Adams Dudley, Nathan Mesfin, Nicholas E Ingraham, Isai Ortiz, Ann Bangerter, Anjan Chakraborty, Dalton Schutte, Jeremy Yeung, Ying Liu, Alicia Woodward-Abel, Emma Bromley, Rui Zhang, Lisa A Brenner, Joseph A Simonetti","doi":"10.1093/jamia/ocae169","DOIUrl":"https://doi.org/10.1093/jamia/ocae169","url":null,"abstract":"<p><strong>Objective: </strong>Access to firearms is associated with increased suicide risk. Our aim was to develop a natural language processing approach to characterizing firearm access in clinical records.</p><p><strong>Materials and methods: </strong>We used clinical notes from 36 685 Veterans Health Administration (VHA) patients between April 10, 2023 and April 10, 2024. We expanded preexisting firearm term sets using subject matter experts and generated 250-character snippets around each firearm term appearing in notes. Annotators labeled 3000 snippets into three classes. Using these annotated snippets, we compared four nonneural machine learning models (random forest, bagging, gradient boosting, logistic regression with ridge penalization) and two versions of Bidirectional Encoder Representations from Transformers, or BERT (specifically, BioBERT and Bio-ClinicalBERT) for classifying firearm access as \"definite access\", \"definitely no access\", or \"other\".</p><p><strong>Results: </strong>Firearm terms were identified in 36 685 patient records (41.3%), 33.7% of snippets were categorized as definite access, 9.0% as definitely no access, and 57.2% as \"other\". Among models classifying firearm access, five of six had acceptable performance, with BioBERT and Bio-ClinicalBERT performing best, with F1s of 0.876 (95% confidence interval, 0.874-0.879) and 0.896 (95% confidence interval, 0.894-0.899), respectively.</p><p><strong>Discussion and conclusion: </strong>Firearm-related terminology is common in the clinical records of VHA patients. The ability to use text to identify and characterize patients' firearm access could enhance suicide prevention efforts, and five of our six models could be used to identify patients for clinical interventions.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning-based infection diagnostic and prognostic models in post-acute care settings: a systematic review. 基于机器学习的急性期后护理环境感染诊断和预后模型:系统综述。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-12 DOI: 10.1093/jamia/ocae278
Zidu Xu, Danielle Scharp, Mollie Hobensek, Jiancheng Ye, Jungang Zou, Sirui Ding, Jingjing Shang, Maxim Topaz

Objectives: This study aims to (1) review machine learning (ML)-based models for early infection diagnostic and prognosis prediction in post-acute care (PAC) settings, (2) identify key risk predictors influencing infection-related outcomes, and (3) examine the quality and limitations of these models.

Materials and methods: PubMed, Web of Science, Scopus, IEEE Xplore, CINAHL, and ACM digital library were searched in February 2024. Eligible studies leveraged PAC data to develop and evaluate ML models for infection-related risks. Data extraction followed the CHARMS checklist. Quality appraisal followed the PROBAST tool. Data synthesis was guided by the socio-ecological conceptual framework.

Results: Thirteen studies were included, mainly focusing on respiratory infections and nursing homes. Most used regression models with structured electronic health record data. Since 2020, there has been a shift toward advanced ML algorithms and multimodal data, biosensors, and clinical notes being significant sources of unstructured data. Despite these advances, there is insufficient evidence to support performance improvements over traditional models. Individual-level risk predictors, like impaired cognition, declined function, and tachycardia, were commonly used, while contextual-level predictors were barely utilized, consequently limiting model fairness. Major sources of bias included lack of external validation, inadequate model calibration, and insufficient consideration of data complexity.

Discussion and conclusion: Despite the growth of advanced modeling approaches in infection-related models in PAC settings, evidence supporting their superiority remains limited. Future research should leverage a socio-ecological lens for predictor selection and model construction, exploring optimal data modalities and ML model usage in PAC, while ensuring rigorous methodologies and fairness considerations.

研究目的本研究旨在:(1) 综述基于机器学习(ML)的急性期后护理(PAC)环境中早期感染诊断和预后预测模型;(2) 确定影响感染相关结果的关键风险预测因素;(3) 检验这些模型的质量和局限性:于 2024 年 2 月检索了 PubMed、Web of Science、Scopus、IEEE Xplore、CINAHL 和 ACM 数字图书馆。符合条件的研究利用 PAC 数据开发并评估了感染相关风险的 ML 模型。数据提取遵循 CHARMS 核对表。质量评估采用 PROBAST 工具。数据综合以社会生态概念框架为指导:共纳入 13 项研究,主要集中在呼吸道感染和疗养院。大多数研究使用了结构化电子健康记录数据回归模型。自 2020 年以来,先进的 ML 算法、多模态数据、生物传感器和临床笔记已成为非结构化数据的重要来源。尽管取得了这些进展,但仍没有足够的证据支持其性能比传统模型有所提高。个体层面的风险预测因素,如认知能力受损、功能下降和心动过速等,被普遍使用,而情境层面的预测因素几乎未被使用,从而限制了模型的公平性。偏差的主要来源包括缺乏外部验证、模型校准不足以及对数据复杂性考虑不足:尽管先进的建模方法在 PAC 环境中的感染相关模型中得到了发展,但支持其优越性的证据仍然有限。未来的研究应利用社会生态学的视角来选择预测因子和构建模型,探索 PAC 中的最佳数据模式和 ML 模型用法,同时确保采用严格的方法并考虑公平性。
{"title":"Machine learning-based infection diagnostic and prognostic models in post-acute care settings: a systematic review.","authors":"Zidu Xu, Danielle Scharp, Mollie Hobensek, Jiancheng Ye, Jungang Zou, Sirui Ding, Jingjing Shang, Maxim Topaz","doi":"10.1093/jamia/ocae278","DOIUrl":"https://doi.org/10.1093/jamia/ocae278","url":null,"abstract":"<p><strong>Objectives: </strong>This study aims to (1) review machine learning (ML)-based models for early infection diagnostic and prognosis prediction in post-acute care (PAC) settings, (2) identify key risk predictors influencing infection-related outcomes, and (3) examine the quality and limitations of these models.</p><p><strong>Materials and methods: </strong>PubMed, Web of Science, Scopus, IEEE Xplore, CINAHL, and ACM digital library were searched in February 2024. Eligible studies leveraged PAC data to develop and evaluate ML models for infection-related risks. Data extraction followed the CHARMS checklist. Quality appraisal followed the PROBAST tool. Data synthesis was guided by the socio-ecological conceptual framework.</p><p><strong>Results: </strong>Thirteen studies were included, mainly focusing on respiratory infections and nursing homes. Most used regression models with structured electronic health record data. Since 2020, there has been a shift toward advanced ML algorithms and multimodal data, biosensors, and clinical notes being significant sources of unstructured data. Despite these advances, there is insufficient evidence to support performance improvements over traditional models. Individual-level risk predictors, like impaired cognition, declined function, and tachycardia, were commonly used, while contextual-level predictors were barely utilized, consequently limiting model fairness. Major sources of bias included lack of external validation, inadequate model calibration, and insufficient consideration of data complexity.</p><p><strong>Discussion and conclusion: </strong>Despite the growth of advanced modeling approaches in infection-related models in PAC settings, evidence supporting their superiority remains limited. Future research should leverage a socio-ecological lens for predictor selection and model construction, exploring optimal data modalities and ML model usage in PAC, while ensuring rigorous methodologies and fairness considerations.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mini-mental status examination phenotyping for Alzheimer's disease patients using both structured and narrative electronic health record features. 利用结构化和叙事性电子健康记录特征对阿尔茨海默病患者进行迷你精神状态检查表型分析。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-09 DOI: 10.1093/jamia/ocae274
Betina Idnay, Gongbo Zhang, Fangyi Chen, Casey N Ta, Matthew W Schelke, Karen Marder, Chunhua Weng

Objective: This study aims to automate the prediction of Mini-Mental State Examination (MMSE) scores, a widely adopted standard for cognitive assessment in patients with Alzheimer's disease, using natural language processing (NLP) and machine learning (ML) on structured and unstructured EHR data.

Materials and methods: We extracted demographic data, diagnoses, medications, and unstructured clinical visit notes from the EHRs. We used Latent Dirichlet Allocation (LDA) for topic modeling and Term-Frequency Inverse Document Frequency (TF-IDF) for n-grams. In addition, we extracted meta-features such as age, ethnicity, and race. Model training and evaluation employed eXtreme Gradient Boosting (XGBoost), Stochastic Gradient Descent Regressor (SGDRegressor), and Multi-Layer Perceptron (MLP).

Results: We analyzed 1654 clinical visit notes collected between September 2019 and June 2023 for 1000 Alzheimer's disease patients. The average MMSE score was 20, with patients averaging 76.4 years old, 54.7% female, and 54.7% identifying as White. The best-performing model (ie, lowest root mean squared error (RMSE)) is MLP, which achieved an RMSE of 5.53 on the validation set using n-grams, indicating superior prediction performance over other models and feature sets. The RMSE on the test set was 5.85.

Discussion: This study developed a ML method to predict MMSE scores from unstructured clinical notes, demonstrating the feasibility of utilizing NLP to support cognitive assessment. Future work should focus on refining the model and evaluating its clinical relevance across diverse settings.

Conclusion: We contributed a model for automating MMSE estimation using EHR features, potentially transforming cognitive assessment for Alzheimer's patients and paving the way for more informed clinical decisions and cohort identification.

研究目的本研究旨在使用自然语言处理(NLP)和机器学习(ML)对结构化和非结构化电子病历数据自动预测小型精神状态检查(MMSE)评分,这是阿尔茨海默病患者认知评估中广泛采用的标准:我们从电子病历中提取了人口统计学数据、诊断、药物和非结构化临床就诊记录。我们使用 Latent Dirichlet Allocation (LDA) 进行主题建模,使用 Term-Frequency Inverse Document Frequency (TF-IDF) 进行 n-grams 建模。此外,我们还提取了年龄、民族和种族等元特征。模型的训练和评估采用了极梯度提升(XGBoost)、随机梯度下降回归器(SGDRegressor)和多层感知器(MLP):我们分析了 2019 年 9 月至 2023 年 6 月期间收集的 1654 份临床就诊记录,涉及 1000 名阿尔茨海默病患者。平均 MMSE 得分为 20 分,患者平均年龄为 76.4 岁,54.7% 为女性,54.7% 为白人。表现最好的模型(即均方根误差(RMSE)最小)是 MLP,该模型使用 n-grams,在验证集上的 RMSE 为 5.53,表明其预测性能优于其他模型和特征集。测试集上的 RMSE 为 5.85:本研究开发了一种从非结构化临床笔记中预测 MMSE 分数的 ML 方法,证明了利用 NLP 支持认知评估的可行性。今后的工作重点是完善模型,并评估其在不同环境下的临床相关性:我们利用电子病历特征建立了一个 MMSE 自动估算模型,有可能改变对阿尔茨海默病患者的认知评估,为更明智的临床决策和队列识别铺平道路。
{"title":"Mini-mental status examination phenotyping for Alzheimer's disease patients using both structured and narrative electronic health record features.","authors":"Betina Idnay, Gongbo Zhang, Fangyi Chen, Casey N Ta, Matthew W Schelke, Karen Marder, Chunhua Weng","doi":"10.1093/jamia/ocae274","DOIUrl":"https://doi.org/10.1093/jamia/ocae274","url":null,"abstract":"<p><strong>Objective: </strong>This study aims to automate the prediction of Mini-Mental State Examination (MMSE) scores, a widely adopted standard for cognitive assessment in patients with Alzheimer's disease, using natural language processing (NLP) and machine learning (ML) on structured and unstructured EHR data.</p><p><strong>Materials and methods: </strong>We extracted demographic data, diagnoses, medications, and unstructured clinical visit notes from the EHRs. We used Latent Dirichlet Allocation (LDA) for topic modeling and Term-Frequency Inverse Document Frequency (TF-IDF) for n-grams. In addition, we extracted meta-features such as age, ethnicity, and race. Model training and evaluation employed eXtreme Gradient Boosting (XGBoost), Stochastic Gradient Descent Regressor (SGDRegressor), and Multi-Layer Perceptron (MLP).</p><p><strong>Results: </strong>We analyzed 1654 clinical visit notes collected between September 2019 and June 2023 for 1000 Alzheimer's disease patients. The average MMSE score was 20, with patients averaging 76.4 years old, 54.7% female, and 54.7% identifying as White. The best-performing model (ie, lowest root mean squared error (RMSE)) is MLP, which achieved an RMSE of 5.53 on the validation set using n-grams, indicating superior prediction performance over other models and feature sets. The RMSE on the test set was 5.85.</p><p><strong>Discussion: </strong>This study developed a ML method to predict MMSE scores from unstructured clinical notes, demonstrating the feasibility of utilizing NLP to support cognitive assessment. Future work should focus on refining the model and evaluating its clinical relevance across diverse settings.</p><p><strong>Conclusion: </strong>We contributed a model for automating MMSE estimation using EHR features, potentially transforming cognitive assessment for Alzheimer's patients and paving the way for more informed clinical decisions and cohort identification.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantitatively assessing the impact of the quality of SNOMED CT subtype hierarchy on cohort queries. 定量评估 SNOMED CT 亚型分级质量对队列查询的影响。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-09 DOI: 10.1093/jamia/ocae272
Xubing Hao, Xiaojin Li, Yan Huang, Jay Shi, Rashmie Abeysinghe, Cui Tao, Kirk Roberts, Guo-Qiang Zhang, Licong Cui

Objective: SNOMED CT provides a standardized terminology for clinical concepts, allowing cohort queries over heterogeneous clinical data including Electronic Health Records (EHRs). While it is intuitive that missing and inaccurate subtype (or is-a) relations in SNOMED CT reduce the recall and precision of cohort queries, the extent of these impacts has not been formally assessed. This study fills this gap by developing quantitative metrics to measure these impacts and performing statistical analysis on their significance.

Material and methods: We used the Optum de-identified COVID-19 Electronic Health Record dataset. We defined micro-averaged and macro-averaged recall and precision metrics to assess the impact of missing and inaccurate is-a relations on cohort queries. Both practical and simulated analyses were performed. Practical analyses involved 407 missing and 48 inaccurate is-a relations confirmed by domain experts, with statistical testing using Wilcoxon signed-rank tests. Simulated analyses used two random sets of 400 is-a relations to simulate missing and inaccurate is-a relations.

Results: Wilcoxon signed-rank tests from both practical and simulated analyses (P-values < .001) showed that missing is-a relations significantly reduced the micro- and macro-averaged recall, and inaccurate is-a relations significantly reduced the micro- and macro-averaged precision.

Discussion: The introduced impact metrics can assist SNOMED CT maintainers in prioritizing critical hierarchical defects for quality enhancement. These metrics are generally applicable for assessing the quality impact of a terminology's subtype hierarchy on its cohort query applications.

Conclusion: Our results indicate a significant impact of missing and inaccurate is-a relations in SNOMED CT on the recall and precision of cohort queries. Our work highlights the importance of high-quality terminology hierarchy for cohort queries over EHR data and provides valuable insights for prioritizing quality improvements of SNOMED CT's hierarchy.

目的SNOMED CT 为临床概念提供了标准化术语,允许对包括电子健康记录 (EHR) 在内的异构临床数据进行队列查询。SNOMED CT 中缺失和不准确的子类型(或 is-a)关系会降低队列查询的召回率和精确度,这一点很直观,但这些影响的程度尚未得到正式评估。本研究通过制定量化指标来衡量这些影响并对其重要性进行统计分析,填补了这一空白:我们使用了 Optum 去标识化 COVID-19 电子健康记录数据集。我们定义了微观平均和宏观平均召回率和精确度指标,以评估缺失和不准确的 is-a 关系对队列查询的影响。我们进行了实际分析和模拟分析。实际分析包括经领域专家确认的 407 个缺失的 is-a 关系和 48 个不准确的 is-a 关系,并使用 Wilcoxon 符号秩检验进行统计检验。模拟分析使用了两组随机的 400 个 is-a 关系来模拟缺失和不准确的 is-a 关系:实际分析和模拟分析的 Wilcoxon 符号秩检验(P 值 < .001)表明,缺失的 is-a 关系显著降低了微观和宏观平均召回率,而不准确的 is-a 关系显著降低了微观和宏观平均精确率:所介绍的影响度量标准可以帮助 SNOMED CT 维护者优先处理关键的分层缺陷,以提高质量。这些指标通常适用于评估术语的子类型层次结构对其同类查询应用的质量影响:我们的研究结果表明,SNOMED CT 中缺失和不准确的 is-a 关系对队列查询的召回率和精确度有很大影响。我们的工作凸显了高质量术语层次结构对电子病历数据队列查询的重要性,并为优先提高 SNOMED CT 层次结构的质量提供了有价值的见解。
{"title":"Quantitatively assessing the impact of the quality of SNOMED CT subtype hierarchy on cohort queries.","authors":"Xubing Hao, Xiaojin Li, Yan Huang, Jay Shi, Rashmie Abeysinghe, Cui Tao, Kirk Roberts, Guo-Qiang Zhang, Licong Cui","doi":"10.1093/jamia/ocae272","DOIUrl":"https://doi.org/10.1093/jamia/ocae272","url":null,"abstract":"<p><strong>Objective: </strong>SNOMED CT provides a standardized terminology for clinical concepts, allowing cohort queries over heterogeneous clinical data including Electronic Health Records (EHRs). While it is intuitive that missing and inaccurate subtype (or is-a) relations in SNOMED CT reduce the recall and precision of cohort queries, the extent of these impacts has not been formally assessed. This study fills this gap by developing quantitative metrics to measure these impacts and performing statistical analysis on their significance.</p><p><strong>Material and methods: </strong>We used the Optum de-identified COVID-19 Electronic Health Record dataset. We defined micro-averaged and macro-averaged recall and precision metrics to assess the impact of missing and inaccurate is-a relations on cohort queries. Both practical and simulated analyses were performed. Practical analyses involved 407 missing and 48 inaccurate is-a relations confirmed by domain experts, with statistical testing using Wilcoxon signed-rank tests. Simulated analyses used two random sets of 400 is-a relations to simulate missing and inaccurate is-a relations.</p><p><strong>Results: </strong>Wilcoxon signed-rank tests from both practical and simulated analyses (P-values < .001) showed that missing is-a relations significantly reduced the micro- and macro-averaged recall, and inaccurate is-a relations significantly reduced the micro- and macro-averaged precision.</p><p><strong>Discussion: </strong>The introduced impact metrics can assist SNOMED CT maintainers in prioritizing critical hierarchical defects for quality enhancement. These metrics are generally applicable for assessing the quality impact of a terminology's subtype hierarchy on its cohort query applications.</p><p><strong>Conclusion: </strong>Our results indicate a significant impact of missing and inaccurate is-a relations in SNOMED CT on the recall and precision of cohort queries. Our work highlights the importance of high-quality terminology hierarchy for cohort queries over EHR data and provides valuable insights for prioritizing quality improvements of SNOMED CT's hierarchy.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142631474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Seeing things the same way: perspectives and lessons learned from research-design collaborations. 以同样的方式看待事物:研究与设计合作的观点和经验教训。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-01-18 DOI: 10.1093/jamia/ocad124
Sabrina Mangal, Lauren Berger, Jean-Marie Bruzzese, Alexandra de la Cruz, Maichou Lor, Imama A Naqvi, Eugenio Solis de Ovando, Nicole Spiegel-Gotsch, Samantha Stonbraker, Adriana Arcia

Information visualizations are increasingly being developed by informatics researchers to communicate health information to lay audiences. For high-quality results, it is advisable to collaborate with creative professionals such as graphic designers, illustrators, or user interface/user experience designers. However, such collaborations are often a novel experience for both parties, each of which may be unfamiliar with the needs and processes of the other. We have coalesced our experiences from both the research and design perspectives to offer practical guidance in hopes of promoting the success of future collaborations. We offer suggestions for determining design needs, communicating with design professionals, and carrying out the design process. We assert that successful collaborations are predicated on careful and intentional planning at the outset of a project, a thorough understanding of each party's scope expertise, clear communication, and ample time for the design process to unfold.

信息学研究人员越来越多地开发信息可视化技术,以便向非专业受众传达健康信息。为了获得高质量的成果,建议与专业创意人员合作,如平面设计师、插图画家或用户界面/用户体验设计师。然而,这种合作对双方来说往往都是一种全新的体验,因为双方都可能不熟悉对方的需求和流程。我们从研究和设计两个角度总结经验,提供实用指导,希望能促进未来合作的成功。我们为确定设计需求、与专业设计人员沟通以及实施设计流程提供了建议。我们认为,成功合作的前提是在项目伊始就进行精心和周密的规划,充分了解各方的专业范围,进行清晰的沟通,并为设计过程的展开留出充足的时间。
{"title":"Seeing things the same way: perspectives and lessons learned from research-design collaborations.","authors":"Sabrina Mangal, Lauren Berger, Jean-Marie Bruzzese, Alexandra de la Cruz, Maichou Lor, Imama A Naqvi, Eugenio Solis de Ovando, Nicole Spiegel-Gotsch, Samantha Stonbraker, Adriana Arcia","doi":"10.1093/jamia/ocad124","DOIUrl":"10.1093/jamia/ocad124","url":null,"abstract":"<p><p>Information visualizations are increasingly being developed by informatics researchers to communicate health information to lay audiences. For high-quality results, it is advisable to collaborate with creative professionals such as graphic designers, illustrators, or user interface/user experience designers. However, such collaborations are often a novel experience for both parties, each of which may be unfamiliar with the needs and processes of the other. We have coalesced our experiences from both the research and design perspectives to offer practical guidance in hopes of promoting the success of future collaborations. We offer suggestions for determining design needs, communicating with design professionals, and carrying out the design process. We assert that successful collaborations are predicated on careful and intentional planning at the outset of a project, a thorough understanding of each party's scope expertise, clear communication, and ample time for the design process to unfold.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"542-547"},"PeriodicalIF":4.7,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10797272/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9772907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the American Medical Informatics Association
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1