首页 > 最新文献

JAMIA Open最新文献

英文 中文
Clinical document corpora-real ones, translated and synthetic substitutes, and assorted domain proxies: a survey of diversity in corpus design, with focus on German text data. 临床文献语料库-真实的,翻译的和合成的替代品,以及分类的领域代理:语料库设计多样性的调查,重点是德语文本数据。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-14 eCollection Date: 2025-06-01 DOI: 10.1093/jamiaopen/ooaf024
Udo Hahn
<p><strong>Objective: </strong>We survey clinical document corpora, with a focus on German textual data. Due to rigid data privacy legislation in Germany, these resources, with only few exceptions, are stored in protected clinical data spaces and locked against clinic-external researchers. This situation stands in stark contrast with established workflows in the field of natural language processing, where easy accessibility and reuse of (textual) data collections are common practice. Hence, alternative corpus designs have been examined to escape from data poverty. Besides machine translation of English clinical datasets and the generation of synthetic corpora with fictitious clinical contents, several types of domain proxies have come up as substitutes for real clinical documents. Common instances of close proxies are medical journal publications, therapy guidelines, drug labels, etc., more distant proxies include medical contents from social media channels or online encyclopedic medical articles.</p><p><strong>Methods: </strong>We follow the PRISM (Preferred Reporting Items for Systematic reviews and Meta-analyses) guidelines for surveying the field of German-language clinical/medical corpora. Four bibliographic databases were searched: PubMed, ACL Anthology, Google Scholar, and the author's personal literature database.</p><p><strong>Results: </strong>After PRISM-conformant identification of 362 hits from the 4 bibliographic systems, the screening process yielded 78 relevant documents for inclusion in this review. They contained overall 92 different published versions of corpora, from which 71 were truly unique in terms of their underlying document sets. Out of these, the majority were clinical corpora-46 real ones from which 32 were unique, 5 translated ones (3 unique), and 6 synthetic ones (3 unique). As to domain proxies, we identified 18 close ones (16 unique) and 17 distant ones (all of them unique).</p><p><strong>Discussion: </strong>There is a clear divide between the large number of non-accessible real clinical German-language corpora and their publicly accessible substitutes: translated or synthetic datasets, close or more distant proxies. So, at first sight, the data bottleneck seems broken. Intuitively, yet, differences in genre-specific writing style, lexical or terminological diction, and required medical background expertise in this typological space are also obvious. This raises the question how valid alternative corpus designs really are. A systematic, empirically grounded yardstick for comparing real clinical corpora with those suggested substitutes and proxies is missing until now.</p><p><strong>Conclusion: </strong>The extreme sparsity of real clinical corpora in almost all non-Anglo-American countries worldwide, Germany in particular, has triggered an active search for alternative, publicly accessible data resources laid out in this survey. However, the utility of these substitutes compared with real clinical corpora and the
目的:调查临床文献语料库,重点是德语文本数据。由于德国严格的数据隐私立法,除了少数例外,这些资源都存储在受保护的临床数据空间中,并对临床外部研究人员锁定。这种情况与自然语言处理领域中已建立的工作流形成鲜明对比,在自然语言处理领域中,易于访问和重用(文本)数据集合是常见的做法。因此,已经研究了替代语料库设计以避免数据贫乏。除了英文临床数据集的机器翻译和虚构临床内容的合成语料库的生成之外,还出现了几种类型的领域代理作为真实临床文档的替代品。近距离代理的常见例子是医学期刊出版物、治疗指南、药物标签等,较远的代理包括来自社交媒体渠道或在线百科全书式医学文章的医疗内容。方法:我们遵循PRISM(系统评价和荟萃分析的首选报告项目)指南调查德语临床/医学语料库领域。检索了四个书目数据库:PubMed、ACL Anthology、谷歌Scholar和作者个人文献数据库。结果:从4个文献系统中筛选出362个符合prism标准的结果后,筛选过程中产生了78个相关文献纳入本综述。它们包含了总共92个不同版本的语料库,其中71个在其基础文档集方面是真正独特的。其中以临床语料库为主,46份真实语料库32份唯一,5份翻译语料库3份唯一,6份合成语料库3份唯一。至于域代理,我们确定了18个近代理(16个唯一的)和17个远代理(它们都是唯一的)。讨论:在大量无法访问的真实临床德语语料库和它们的可公开访问的替代品之间存在明显的鸿沟:翻译或合成数据集,近或更远的代理。因此,乍一看,数据瓶颈似乎被打破了。然而,从直观上看,在特定体裁的写作风格、词汇或术语的措辞以及在这一类型学领域所需的医学背景专业知识方面的差异也是显而易见的。这就提出了一个问题,替代语料库设计到底有多有效。到目前为止,还没有一个系统的、基于经验的标准来比较真实的临床语料库与那些建议的替代品和代理。结论:在世界上几乎所有非英美国家,尤其是德国,真正的临床语料库极度稀少,这引发了对本调查中列出的可公开访问的替代数据资源的积极搜索。然而,这些替代品与真实临床语料库的效用及其与真实临床语料库的语义和体裁特定距离仍有待研究,因此它们的价值仍有待适当评估。此外,语料库描述在相关描述属性方面往往是不完整的。本文将这些观察结果捆绑在一起,并提出了一个所谓的语料库卡片模板,以改进足够的语料库文档。
{"title":"Clinical document corpora-real ones, translated and synthetic substitutes, and assorted domain proxies: a survey of diversity in corpus design, with focus on German text data.","authors":"Udo Hahn","doi":"10.1093/jamiaopen/ooaf024","DOIUrl":"10.1093/jamiaopen/ooaf024","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We survey clinical document corpora, with a focus on German textual data. Due to rigid data privacy legislation in Germany, these resources, with only few exceptions, are stored in protected clinical data spaces and locked against clinic-external researchers. This situation stands in stark contrast with established workflows in the field of natural language processing, where easy accessibility and reuse of (textual) data collections are common practice. Hence, alternative corpus designs have been examined to escape from data poverty. Besides machine translation of English clinical datasets and the generation of synthetic corpora with fictitious clinical contents, several types of domain proxies have come up as substitutes for real clinical documents. Common instances of close proxies are medical journal publications, therapy guidelines, drug labels, etc., more distant proxies include medical contents from social media channels or online encyclopedic medical articles.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We follow the PRISM (Preferred Reporting Items for Systematic reviews and Meta-analyses) guidelines for surveying the field of German-language clinical/medical corpora. Four bibliographic databases were searched: PubMed, ACL Anthology, Google Scholar, and the author's personal literature database.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;After PRISM-conformant identification of 362 hits from the 4 bibliographic systems, the screening process yielded 78 relevant documents for inclusion in this review. They contained overall 92 different published versions of corpora, from which 71 were truly unique in terms of their underlying document sets. Out of these, the majority were clinical corpora-46 real ones from which 32 were unique, 5 translated ones (3 unique), and 6 synthetic ones (3 unique). As to domain proxies, we identified 18 close ones (16 unique) and 17 distant ones (all of them unique).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Discussion: &lt;/strong&gt;There is a clear divide between the large number of non-accessible real clinical German-language corpora and their publicly accessible substitutes: translated or synthetic datasets, close or more distant proxies. So, at first sight, the data bottleneck seems broken. Intuitively, yet, differences in genre-specific writing style, lexical or terminological diction, and required medical background expertise in this typological space are also obvious. This raises the question how valid alternative corpus designs really are. A systematic, empirically grounded yardstick for comparing real clinical corpora with those suggested substitutes and proxies is missing until now.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusion: &lt;/strong&gt;The extreme sparsity of real clinical corpora in almost all non-Anglo-American countries worldwide, Germany in particular, has triggered an active search for alternative, publicly accessible data resources laid out in this survey. However, the utility of these substitutes compared with real clinical corpora and the","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 3","pages":"ooaf024"},"PeriodicalIF":2.5,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12077144/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144080516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating algorithmic bias on biomarker classification of breast cancer pathology reports. 评估乳腺癌病理报告中生物标志物分类的算法偏差。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-09 eCollection Date: 2025-06-01 DOI: 10.1093/jamiaopen/ooaf033
Jordan Tschida, Mayanka Chandrashekar, Alina Peluso, Zachary Fox, Patrycja Krawczuk, Dakota Murdock, Xiao-Cheng Wu, John Gounley, Heidi A Hanson

Objectives: This work evaluated algorithmic bias in biomarkers classification using electronic pathology reports from female breast cancer cases. Bias was assessed across 5 subgroups: cancer registry, race, Hispanic ethnicity, age at diagnosis, and socioeconomic status.

Materials and methods: We utilized 594 875 electronic pathology reports from 178 121 tumors diagnosed in Kentucky, Louisiana, New Jersey, New Mexico, Seattle, and Utah to train 2 deep-learning algorithms to classify breast cancer patients using their biomarkers test results. We used balanced error rate (BER), demographic parity (DP), equalized odds (EOD), and equal opportunity (EOP) to assess bias.

Results: We found differences in predictive accuracy between registries, with the highest accuracy in the registry that contributed the most data (Seattle Registry, BER ratios for all registries >1.25). BER showed no significant algorithmic bias in extracting biomarkers (estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2) for race, Hispanic ethnicity, age at diagnosis, or socioeconomic subgroups (BER ratio <1.25). DP, EOD, and EOP all showed insignificant results.

Discussion: We observed significant differences in BER by registry, but no significant bias using the DP, EOD, and EOP metrics for socio-demographic or racial categories. This highlights the importance of employing a diverse set of metrics for a comprehensive evaluation of model fairness.

Conclusion: A thorough evaluation of algorithmic biases that may affect equality in clinical care is a critical step before deploying algorithms in the real world. We found little evidence of algorithmic bias in our biomarker classification tool. Artificial intelligence tools to expedite information extraction from clinical records could accelerate clinical trial matching and improve care.

目的:本研究利用女性乳腺癌病例的电子病理报告评估生物标志物分类的算法偏差。在5个亚组中评估偏倚:癌症登记、种族、西班牙裔、诊断年龄和社会经济地位。材料和方法:我们利用肯塔基州、路易斯安那州、新泽西州、新墨西哥州、西雅图和犹他州诊断的178 121例肿瘤的594 875份电子病理报告,训练两种深度学习算法,根据其生物标志物检测结果对乳腺癌患者进行分类。我们使用平衡错误率(BER)、人口均等(DP)、均等几率(EOD)和均等机会(EOP)来评估偏倚。结果:我们发现注册中心之间的预测准确性存在差异,在提供最多数据的注册中心中准确率最高(西雅图注册中心,所有注册中心的误码率为1.25)。在提取种族、西班牙裔、诊断年龄或社会经济亚组的生物标志物(雌激素受体、孕激素受体、人表皮生长因子受体2)时,BER显示没有显著的算法偏差。讨论:我们通过登记观察到BER的显著差异,但使用DP、EOD和EOP指标对社会人口统计学或种族分类没有显著的偏差。这突出了采用一套不同的指标来全面评估模型公平性的重要性。结论:在将算法应用于现实世界之前,对可能影响临床护理公平性的算法偏差进行彻底评估是至关重要的一步。我们发现在我们的生物标记物分类工具中几乎没有算法偏差的证据。加速从临床记录中提取信息的人工智能工具可以加速临床试验匹配并改善护理。
{"title":"Evaluating algorithmic bias on biomarker classification of breast cancer pathology reports.","authors":"Jordan Tschida, Mayanka Chandrashekar, Alina Peluso, Zachary Fox, Patrycja Krawczuk, Dakota Murdock, Xiao-Cheng Wu, John Gounley, Heidi A Hanson","doi":"10.1093/jamiaopen/ooaf033","DOIUrl":"10.1093/jamiaopen/ooaf033","url":null,"abstract":"<p><strong>Objectives: </strong>This work evaluated algorithmic bias in biomarkers classification using electronic pathology reports from female breast cancer cases. Bias was assessed across 5 subgroups: cancer registry, race, Hispanic ethnicity, age at diagnosis, and socioeconomic status.</p><p><strong>Materials and methods: </strong>We utilized 594 875 electronic pathology reports from 178 121 tumors diagnosed in Kentucky, Louisiana, New Jersey, New Mexico, Seattle, and Utah to train 2 deep-learning algorithms to classify breast cancer patients using their biomarkers test results. We used balanced error rate (BER), demographic parity (DP), equalized odds (EOD), and equal opportunity (EOP) to assess bias.</p><p><strong>Results: </strong>We found differences in predictive accuracy between registries, with the highest accuracy in the registry that contributed the most data (Seattle Registry, BER ratios for all registries >1.25). BER showed no significant algorithmic bias in extracting biomarkers (estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2) for race, Hispanic ethnicity, age at diagnosis, or socioeconomic subgroups (BER ratio <1.25). DP, EOD, and EOP all showed insignificant results.</p><p><strong>Discussion: </strong>We observed significant differences in BER by registry, but no significant bias using the DP, EOD, and EOP metrics for socio-demographic or racial categories. This highlights the importance of employing a diverse set of metrics for a comprehensive evaluation of model fairness.</p><p><strong>Conclusion: </strong>A thorough evaluation of algorithmic biases that may affect equality in clinical care is a critical step before deploying algorithms in the real world. We found little evidence of algorithmic bias in our biomarker classification tool. Artificial intelligence tools to expedite information extraction from clinical records could accelerate clinical trial matching and improve care.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 3","pages":"ooaf033"},"PeriodicalIF":2.5,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12063583/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144037004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of a score for identifying hospital stays that trigger a pharmacist intervention: integration into a clinical decision support system. 评估确定触发药剂师干预的住院时间的分数:整合到临床决策支持系统。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-05 eCollection Date: 2025-06-01 DOI: 10.1093/jamiaopen/ooaf030
Laurine Robert, Nathalie Vidoni, Erwin Gérard, Emmanuel Chazard, Pascal Odou, Chloé Rousselière, Bertrand Décaudin

Objectives: The objective of the study was to determine, after medication review, the patient risk score threshold that would distinguish between stays with prescriptions triggering pharmacist intervention (PI) and stays with prescriptions not triggering PI.

Materials and methods: The study was retrospective and observational, conducted in the clinical pharmacy team. The patient risk score was adapted from a Canadian score and was integrated in the clinical decision support system (CDSS). For each hospital stay, the score was calculated at the beginning of hospitalization and we retrospectively showed if a medication review and a PI were conducted. Then, the optimal patient risk score threshold was determined to help pharmacist in optimizing medication review.

Results: During the study, 973 (56.7%) medication reviews were performed and 248 (25.5%) led to a PI. After analyzing sensitivity, specificity, and positive predictive value of different thresholds, the threshold of 4 was deemed discriminating to identify hospital stays likely to lead to a PI following a medication review. At this threshold, 600 hospital stays would have been detected (33.3% of the latter led to a PI), and 5.0% of stays with a medication review would not have been detected even though they were hospital stays that had triggered a PI.

Discussion and conclusion: Integration of a patient risk score in a CDSS can help clinical pharmacist to target hospital stays likely to trigger a PI. However, an optimal threshold is difficult to determine. Constructing and using a score in practice should be organized with the local clinical pharmacy team, in order to understand the tool's limitations and maximize its use in detecting at-risk drug prescriptions.

目的:本研究的目的是在用药审查后,确定患者风险评分阈值,以区分处方触发药师干预(PI)和处方未触发PI的住院。材料与方法:本研究为回顾性观察性研究,在临床药学团队中进行。患者风险评分改编自加拿大评分,并整合到临床决策支持系统(CDSS)中。对于每次住院,在住院开始时计算得分,我们回顾性地显示是否进行了药物审查和PI。然后确定最佳患者风险评分阈值,以帮助药师优化用药评价。结果:在研究期间,进行了973例(56.7%)药物回顾,248例(25.5%)导致PI。在分析了不同阈值的敏感性、特异性和阳性预测值后,阈值4被认为是鉴别在药物审查后可能导致PI的住院时间。在这个阈值下,600次住院将被检测到(33.3%的后者导致PI), 5.0%的药物审查住院将不会被检测到,即使它们是触发PI的住院。讨论与结论:在CDSS中整合患者风险评分可以帮助临床药师针对可能引发PI的住院时间。然而,最佳阈值很难确定。在实践中,应与当地临床药学团队组织构建和使用评分,以了解该工具的局限性,并最大限度地利用其在检测高危药物处方方面的作用。
{"title":"Evaluation of a score for identifying hospital stays that trigger a pharmacist intervention: integration into a clinical decision support system.","authors":"Laurine Robert, Nathalie Vidoni, Erwin Gérard, Emmanuel Chazard, Pascal Odou, Chloé Rousselière, Bertrand Décaudin","doi":"10.1093/jamiaopen/ooaf030","DOIUrl":"10.1093/jamiaopen/ooaf030","url":null,"abstract":"<p><strong>Objectives: </strong>The objective of the study was to determine, after medication review, the patient risk score threshold that would distinguish between stays with prescriptions triggering pharmacist intervention (PI) and stays with prescriptions not triggering PI.</p><p><strong>Materials and methods: </strong>The study was retrospective and observational, conducted in the clinical pharmacy team. The patient risk score was adapted from a Canadian score and was integrated in the clinical decision support system (CDSS). For each hospital stay, the score was calculated at the beginning of hospitalization and we retrospectively showed if a medication review and a PI were conducted. Then, the optimal patient risk score threshold was determined to help pharmacist in optimizing medication review.</p><p><strong>Results: </strong>During the study, 973 (56.7%) medication reviews were performed and 248 (25.5%) led to a PI. After analyzing sensitivity, specificity, and positive predictive value of different thresholds, the threshold of 4 was deemed discriminating to identify hospital stays likely to lead to a PI following a medication review. At this threshold, 600 hospital stays would have been detected (33.3% of the latter led to a PI), and 5.0% of stays with a medication review would not have been detected even though they were hospital stays that had triggered a PI.</p><p><strong>Discussion and conclusion: </strong>Integration of a patient risk score in a CDSS can help clinical pharmacist to target hospital stays likely to trigger a PI. However, an optimal threshold is difficult to determine. Constructing and using a score in practice should be organized with the local clinical pharmacy team, in order to understand the tool's limitations and maximize its use in detecting at-risk drug prescriptions.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 3","pages":"ooaf030"},"PeriodicalIF":2.5,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12051848/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144001387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the acceptability of using patient portals to recruit pregnant women and new mothers for maternal-child health research. 评估使用患者门户网站招募孕妇和新妈妈进行妇幼保健研究的可接受性。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-02 eCollection Date: 2025-06-01 DOI: 10.1093/jamiaopen/ooaf027
Sean N Halpin, Rebecca Wright, Angela Gwaltney, Annabelle Frantz, Holly Peay, Emily Olsson, Melissa Raspa, Lisa Gehtland, Sara M Andrews

Objective: Electronic patient portals (PP) allow for targeted and efficient research recruitment. We assessed pre- and postnatal women's recruitment methods preferences, focusing on PP.

Materials and methods: We conducted 4 in-person focus groups with new and expecting mothers. Participants reported demographics, health status, and comfort with technology including PP. We used descriptive statistics to characterize quantitative data and a quasi-deductive approach to analyze qualitative data.

Results: Participants (n = 32) were an average age of 31.9 years, mostly White (65.6%), married (90.6%), and had a 4-year degree or higher (71.9%). Although they preferred PP for research recruitment over other methods (eg, in-person, physical mail), participants suggested potential barriers, including high message frequency, messages feeling like spam, and concerns about confidentiality. Participants suggested solutions, including enhancing autonomy through opt-in methods; integrating their healthcare provider's feedback; sending personal and relevant messages; and assuring their PP data are confidential.

Discussion: PPs are a promising recruitment method for pre- and postnatal women including for maternal-child health studies. To ensure engagement with the method, researchers must respond to known patient concerns and incorporate their feedback into future efforts.

Conclusion: Although PP were generally viewed as an acceptable recruitment method, researchers should be mindful of barriers that may limit its reach and effectiveness.

目的:电子患者门户(PP)允许有针对性和高效的研究招募。我们评估了产前和产后妇女的招聘方式偏好,重点是pp。材料和方法:我们对新妈妈和准妈妈进行了4个面对面的焦点小组。参与者报告了人口统计、健康状况和对包括PP在内的技术的舒适度。我们使用描述性统计来描述定量数据,并使用准演绎方法来分析定性数据。结果:参与者(n = 32)平均年龄31.9岁,以白人(65.6%)为主,已婚(90.6%),具有4年制及以上学历(71.9%)。尽管他们更倾向于使用PP进行研究招聘,而不是其他方法(例如,面对面的、实体的邮件),但参与者提出了潜在的障碍,包括信息频率高,信息感觉像垃圾邮件,以及对机密性的担忧。与会者提出了解决方案,包括通过选择加入的方式增强自主权;整合医疗保健提供者的反馈;发送个人和相关信息;并确保他们的PP数据是保密的。讨论:PPs是一种很有前途的产前和产后妇女招募方法,包括用于母婴健康研究。为了确保对该方法的参与,研究人员必须对已知患者的担忧做出回应,并将他们的反馈纳入未来的工作中。结论:尽管PP通常被认为是一种可接受的招聘方法,但研究人员应该注意可能限制其范围和有效性的障碍。
{"title":"Assessing the acceptability of using patient portals to recruit pregnant women and new mothers for maternal-child health research.","authors":"Sean N Halpin, Rebecca Wright, Angela Gwaltney, Annabelle Frantz, Holly Peay, Emily Olsson, Melissa Raspa, Lisa Gehtland, Sara M Andrews","doi":"10.1093/jamiaopen/ooaf027","DOIUrl":"10.1093/jamiaopen/ooaf027","url":null,"abstract":"<p><strong>Objective: </strong>Electronic patient portals (PP) allow for targeted and efficient research recruitment. We assessed pre- and postnatal women's recruitment methods preferences, focusing on PP.</p><p><strong>Materials and methods: </strong>We conducted 4 in-person focus groups with new and expecting mothers. Participants reported demographics, health status, and comfort with technology including PP. We used descriptive statistics to characterize quantitative data and a quasi-deductive approach to analyze qualitative data.</p><p><strong>Results: </strong>Participants (<i>n</i> = 32) were an average age of 31.9 years, mostly White (65.6%), married (90.6%), and had a 4-year degree or higher (71.9%). Although they preferred PP for research recruitment over other methods (eg, in-person, physical mail), participants suggested potential barriers, including high message frequency, messages feeling like spam, and concerns about confidentiality. Participants suggested solutions, including enhancing autonomy through opt-in methods; integrating their healthcare provider's feedback; sending personal and relevant messages; and assuring their PP data are confidential.</p><p><strong>Discussion: </strong>PPs are a promising recruitment method for pre- and postnatal women including for maternal-child health studies. To ensure engagement with the method, researchers must respond to known patient concerns and incorporate their feedback into future efforts.</p><p><strong>Conclusion: </strong>Although PP were generally viewed as an acceptable recruitment method, researchers should be mindful of barriers that may limit its reach and effectiveness.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 3","pages":"ooaf027"},"PeriodicalIF":2.5,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12047077/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144051812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating public preferences to overcome racial disparities in research: findings from a US survey on enhancing trust in research data-sharing practices. 整合公众偏好以克服研究中的种族差异:美国一项关于增强对研究数据共享实践的信任的调查结果。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-05-02 eCollection Date: 2025-06-01 DOI: 10.1093/jamiaopen/ooaf031
Stephanie Niño de Rivera, Yihong Zhao, Shalom Omollo, Sarah Eslami, Natalie Benda, Yashika Sharma, Meghan Reading Turchioe, Marianne Sharko, Lydia S Dugdale, Ruth Masterson Creber

Objectives: Data-sharing policies are rapidly evolving toward increased data sharing. However, participants' perspectives are not well understood and could have an adverse impact on participation in research. We evaluated participants' preferences for sharing specific types of data with specific groups, and strategies to enhance trust in data-sharing practices.

Materials and methods: In March 2023, we conducted a nationally representative online survey with 610 US adults and used logistic regression models to assess sociodemographic differences in their willingness to share different types of data.

Results: Our findings highlight notable racial disparities in willingness to share research data with external entities, especially health policy and public health organizations. Black participants were significantly less likely to share most health data with public health organizations, including mental health (odds ratio [OR]: 0.543, 95% CI, 0.323-0.895) and sexual health/fertility information (OR: 0.404, 95% CI, 0.228-0.691), compared to White participants. Moreover, 63% of participants expressed that their trust in researchers would improve if given control over the data recipients.

Discussion: Participants exhibit reluctance to share specific types of personal research data, emphasizing strong preferences regarding external data access. This highlights the need for a critical reassessment of current data-sharing policies to align with participant concerns.

Conclusion: It is imperative for data-sharing policies to integrate diverse patient viewpoints to mitigate risk of distrust and a potential unintended consequence of lower participation among racial and ethnic minority participants in research.

目标:数据共享政策正朝着增加数据共享的方向迅速发展。然而,参与者的观点没有得到很好的理解,可能对参与研究产生不利影响。我们评估了参与者与特定群体共享特定类型数据的偏好,以及在数据共享实践中增强信任的策略。材料和方法:2023年3月,我们对610名美国成年人进行了一项具有全国代表性的在线调查,并使用逻辑回归模型来评估他们分享不同类型数据的意愿的社会人口统计学差异。结果:我们的研究结果突出了在与外部实体,特别是卫生政策和公共卫生组织共享研究数据的意愿方面存在显著的种族差异。与白人参与者相比,黑人参与者与公共卫生组织分享大多数健康数据的可能性显着降低,包括心理健康(优势比[OR]: 0.543, 95% CI, 0.323-0.895)和性健康/生育信息(OR: 0.404, 95% CI, 0.228-0.691)。此外,63%的参与者表示,如果对数据接收者有控制权,他们对研究人员的信任将会提高。讨论:参与者表现出不愿分享特定类型的个人研究数据,强调对外部数据访问的强烈偏好。这突出表明,有必要对当前的数据共享政策进行重大重新评估,以符合参与者的关切。结论:数据共享政策必须整合不同患者的观点,以减轻不信任的风险,以及少数种族和少数民族参与者参与研究的潜在意外后果。
{"title":"Integrating public preferences to overcome racial disparities in research: findings from a US survey on enhancing trust in research data-sharing practices.","authors":"Stephanie Niño de Rivera, Yihong Zhao, Shalom Omollo, Sarah Eslami, Natalie Benda, Yashika Sharma, Meghan Reading Turchioe, Marianne Sharko, Lydia S Dugdale, Ruth Masterson Creber","doi":"10.1093/jamiaopen/ooaf031","DOIUrl":"10.1093/jamiaopen/ooaf031","url":null,"abstract":"<p><strong>Objectives: </strong>Data-sharing policies are rapidly evolving toward increased data sharing. However, participants' perspectives are not well understood and could have an adverse impact on participation in research. We evaluated participants' preferences for sharing specific types of data with specific groups, and strategies to enhance trust in data-sharing practices.</p><p><strong>Materials and methods: </strong>In March 2023, we conducted a nationally representative online survey with 610 US adults and used logistic regression models to assess sociodemographic differences in their willingness to share different types of data.</p><p><strong>Results: </strong>Our findings highlight notable racial disparities in willingness to share research data with external entities, especially health policy and public health organizations. Black participants were significantly less likely to share most health data with public health organizations, including mental health (odds ratio [OR]: 0.543, 95% CI, 0.323-0.895) and sexual health/fertility information (OR: 0.404, 95% CI, 0.228-0.691), compared to White participants. Moreover, 63% of participants expressed that their trust in researchers would improve if given control over the data recipients.</p><p><strong>Discussion: </strong>Participants exhibit reluctance to share specific types of personal research data, emphasizing strong preferences regarding external data access. This highlights the need for a critical reassessment of current data-sharing policies to align with participant concerns.</p><p><strong>Conclusion: </strong>It is imperative for data-sharing policies to integrate diverse patient viewpoints to mitigate risk of distrust and a potential unintended consequence of lower participation among racial and ethnic minority participants in research.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 3","pages":"ooaf031"},"PeriodicalIF":2.5,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12047078/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144056912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accurate treatment effect estimation using inverse probability of treatment weighting with deep learning. 基于深度学习的治疗加权逆概率的准确治疗效果估计。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-04-26 eCollection Date: 2025-04-01 DOI: 10.1093/jamiaopen/ooaf032
Junghwan Lee, Simin Ma, Nicoleta Serban, Shihao Yang

Objectives: Observational data have been actively used to estimate treatment effect, driven by the growing availability of electronic health records (EHRs). However, EHRs typically consist of longitudinal records, often introducing time-dependent confounding that hinder the unbiased estimation of treatment effect. Inverse probability of treatment weighting (IPTW) is a widely used propensity score method since it provides unbiased treatment effect estimation and its derivation is straightforward. In this study, we aim to utilize IPTW to estimate treatment effect in the presence of time-dependent confounding using claims records.

Materials and methods: Previous studies have utilized propensity score methods with features derived from claims records through feature processing, which generally requires domain knowledge and additional resources to extract information to accurately estimate propensity scores. Deep learning, particularly using deep sequence models such as recurrent neural networks and Transformer, has demonstrated good performance in modeling EHRs for various downstream tasks. We propose that these deep sequence models can provide accurate IPTW estimation of treatment effect by directly estimating the propensity scores from claims records without the need for feature processing.

Results: Comprehensive evaluations on synthetic and semi-synthetic datasets demonstrate that IPTW treatment effect estimation using deep sequence models consistently outperforms baseline approaches, including logistic regression and multilayer perceptrons, combined with feature processing.

Discussion: Our findings demonstrate that deep sequence models consistently outperform traditional approaches in estimating treatment effects, particularly under time-dependent confounding. Moreover, Transformer-based models offer interpretability by assigning higher attention weights to relevant confounders, even when prior domain knowledge is limited.

Conclusion: Deep sequence models enable accurate treatment effect estimation through IPTW without the need for feature processing.

目的:在电子健康记录(EHRs)日益普及的推动下,观察性数据已被积极用于评估治疗效果。然而,电子病历通常由纵向记录组成,经常引入时间相关的混淆,妨碍对治疗效果的无偏估计。治疗加权逆概率法(Inverse probability of treatment weighting, IPTW)是一种被广泛使用的倾向评分方法,因为它能提供无偏的治疗效果估计,而且推导简单。在这项研究中,我们的目的是利用IPTW来估计治疗效果的存在时间依赖的混淆使用索赔记录。材料和方法:以往的研究使用倾向得分方法,通过特征处理从索赔记录中提取特征,通常需要领域知识和额外的资源来提取信息,以准确估计倾向得分。深度学习,特别是使用深度序列模型,如循环神经网络和Transformer,在为各种下游任务建模电子病历方面表现良好。我们提出这些深度序列模型可以通过直接估计索赔记录的倾向得分而不需要特征处理来提供准确的治疗效果IPTW估计。结果:对合成和半合成数据集的综合评估表明,使用深度序列模型估计IPTW治疗效果始终优于基线方法,包括逻辑回归和多层感知器,并结合特征处理。讨论:我们的研究结果表明,深度序列模型在估计治疗效果方面始终优于传统方法,特别是在时间相关的混杂情况下。此外,基于transformer的模型通过为相关的混杂因素分配更高的关注权重来提供可解释性,即使在先前的领域知识有限的情况下也是如此。结论:深度序列模型可以在不需要特征处理的情况下,通过IPTW准确估计治疗效果。
{"title":"Accurate treatment effect estimation using inverse probability of treatment weighting with deep learning.","authors":"Junghwan Lee, Simin Ma, Nicoleta Serban, Shihao Yang","doi":"10.1093/jamiaopen/ooaf032","DOIUrl":"10.1093/jamiaopen/ooaf032","url":null,"abstract":"<p><strong>Objectives: </strong>Observational data have been actively used to estimate treatment effect, driven by the growing availability of electronic health records (EHRs). However, EHRs typically consist of longitudinal records, often introducing time-dependent confounding that hinder the unbiased estimation of treatment effect. Inverse probability of treatment weighting (IPTW) is a widely used propensity score method since it provides unbiased treatment effect estimation and its derivation is straightforward. In this study, we aim to utilize IPTW to estimate treatment effect in the presence of time-dependent confounding using claims records.</p><p><strong>Materials and methods: </strong>Previous studies have utilized propensity score methods with features derived from claims records through feature processing, which generally requires domain knowledge and additional resources to extract information to accurately estimate propensity scores. Deep learning, particularly using deep sequence models such as recurrent neural networks and Transformer, has demonstrated good performance in modeling EHRs for various downstream tasks. We propose that these deep sequence models can provide accurate IPTW estimation of treatment effect by directly estimating the propensity scores from claims records without the need for feature processing.</p><p><strong>Results: </strong>Comprehensive evaluations on synthetic and semi-synthetic datasets demonstrate that IPTW treatment effect estimation using deep sequence models consistently outperforms baseline approaches, including logistic regression and multilayer perceptrons, combined with feature processing.</p><p><strong>Discussion: </strong>Our findings demonstrate that deep sequence models consistently outperform traditional approaches in estimating treatment effects, particularly under time-dependent confounding. Moreover, Transformer-based models offer interpretability by assigning higher attention weights to relevant confounders, even when prior domain knowledge is limited.</p><p><strong>Conclusion: </strong>Deep sequence models enable accurate treatment effect estimation through IPTW without the need for feature processing.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf032"},"PeriodicalIF":2.5,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033031/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144042421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
"Everything is electronic health record-driven": the role of the electronic health record in the emergency department diagnostic process. “一切都是电子病历驱动”:电子病历在急诊科诊断过程中的作用。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-04-23 eCollection Date: 2025-04-01 DOI: 10.1093/jamiaopen/ooaf029
Tyler G James, Courtney W Mangus, Sarah J Parker, P Paul Chandanabhumma, C M Cassady, Fernanda Bellolio, Kalyan Pasupathy, Milisa Manojlovich, Hardeep Singh, Prashant Mahajan

Objectives: There is limited knowledge on how providers and patients in the emergency department (ED) use electronic health records (EHRs) to facilitate the diagnostic process. While EHRs can support diagnostic decision-making, EHR features that are not user-centered may increase the likelihood of diagnostic error. We aimed to identify how EHRs facilitate or impede the diagnostic process in the ED and to identify opportunities to reduce diagnostic errors and improve care quality.

Materials and methods: We conducted semistructured interviews with 10 physicians, 15 nurses, and 8 patients across 4 EDs. Data were analyzed using a hybrid thematic analysis approach, which blends deductive (ie, using multiple conceptual frameworks) and inductive coding strategies. A team of 4 coders performed coding.

Results: We identified 4 themes, 3 at the care team level and 1 at the patient level. At the care team level, the benefits of the EHR in the diagnostic process included (1) customizing features to facilitate diagnostic workup and (2) aiding in communication. However, (3) EHR-driven protocols were found to potentially burden the care process and reliance on asynchronous communication could impede team dynamics. At the patient-level, we found that (4) patient portals facilitated meaningful patient engagement through timely delivery of results.

Discussion: While EHRs can improve the diagnostic process, they can also impair communication and increase workload. Electronic health record design should leverage provider-created tools to improve usability and enhance diagnostic safety.

Conclusions: Our findings have important implications for health information technology design and policy. Further work should assess optimal ways to release patient results via the EHR portal.

目的:关于急诊科(ED)的提供者和患者如何使用电子健康记录(EHRs)来促进诊断过程的知识有限。虽然EHR可以支持诊断决策,但不以用户为中心的EHR功能可能会增加诊断错误的可能性。我们的目的是确定电子病历如何促进或阻碍急诊科的诊断过程,并确定减少诊断错误和提高护理质量的机会。材料和方法:我们对4个急诊科的10名医生、15名护士和8名患者进行了半结构化访谈。数据分析使用混合主题分析方法,该方法混合了演绎(即使用多个概念框架)和归纳编码策略。一个由4名编码员组成的团队进行编码。结果:我们确定了4个主题,3个在护理团队层面,1个在患者层面。在护理团队层面,电子病历在诊断过程中的好处包括:(1)定制特征以促进诊断检查;(2)帮助沟通。然而,(3)ehr驱动的协议可能会增加护理过程的负担,对异步通信的依赖可能会阻碍团队动态。在患者层面,我们发现(4)患者门户网站通过及时提供结果促进了有意义的患者参与。讨论:虽然电子病历可以改进诊断过程,但它们也会损害沟通并增加工作量。电子健康记录设计应利用提供商创建的工具来改进可用性并增强诊断安全性。结论:我们的研究结果对卫生信息技术设计和政策具有重要意义。进一步的工作应评估通过电子病历门户网站发布患者结果的最佳方式。
{"title":"\"Everything is electronic health record-driven\": the role of the electronic health record in the emergency department diagnostic process.","authors":"Tyler G James, Courtney W Mangus, Sarah J Parker, P Paul Chandanabhumma, C M Cassady, Fernanda Bellolio, Kalyan Pasupathy, Milisa Manojlovich, Hardeep Singh, Prashant Mahajan","doi":"10.1093/jamiaopen/ooaf029","DOIUrl":"https://doi.org/10.1093/jamiaopen/ooaf029","url":null,"abstract":"<p><strong>Objectives: </strong>There is limited knowledge on how providers and patients in the emergency department (ED) use electronic health records (EHRs) to facilitate the diagnostic process. While EHRs can support diagnostic decision-making, EHR features that are not user-centered may increase the likelihood of diagnostic error. We aimed to identify how EHRs facilitate or impede the diagnostic process in the ED and to identify opportunities to reduce diagnostic errors and improve care quality.</p><p><strong>Materials and methods: </strong>We conducted semistructured interviews with 10 physicians, 15 nurses, and 8 patients across 4 EDs. Data were analyzed using a hybrid thematic analysis approach, which blends deductive (ie, using multiple conceptual frameworks) and inductive coding strategies. A team of 4 coders performed coding.</p><p><strong>Results: </strong>We identified 4 themes, 3 at the care team level and 1 at the patient level. At the care team level, the benefits of the EHR in the diagnostic process included (1) customizing features to facilitate diagnostic workup and (2) aiding in communication. However, (3) EHR-driven protocols were found to potentially burden the care process and reliance on asynchronous communication could impede team dynamics. At the patient-level, we found that (4) patient portals facilitated meaningful patient engagement through timely delivery of results.</p><p><strong>Discussion: </strong>While EHRs can improve the diagnostic process, they can also impair communication and increase workload. Electronic health record design should leverage provider-created tools to improve usability and enhance diagnostic safety.</p><p><strong>Conclusions: </strong>Our findings have important implications for health information technology design and policy. Further work should assess optimal ways to release patient results via the EHR portal.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf029"},"PeriodicalIF":2.5,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12015938/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144000007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Leveraging deep learning to detect stance in Spanish tweets on COVID-19 vaccination. 更正:利用深度学习来检测西班牙语关于COVID-19疫苗接种的推文立场。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-04-15 eCollection Date: 2025-04-01 DOI: 10.1093/jamiaopen/ooaf028

[This corrects the article DOI: 10.1093/jamiaopen/ooaf007.].

[这更正了文章DOI: 10.1093/jamiaopen/ooaf007.]。
{"title":"Correction to: Leveraging deep learning to detect stance in Spanish tweets on COVID-19 vaccination.","authors":"","doi":"10.1093/jamiaopen/ooaf028","DOIUrl":"https://doi.org/10.1093/jamiaopen/ooaf028","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1093/jamiaopen/ooaf007.].</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf028"},"PeriodicalIF":2.5,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11999061/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144052043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep learning model for clinical outcome prediction using longitudinal inpatient electronic health records. 使用纵向住院患者电子健康记录进行临床结果预测的深度学习模型。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-04-10 eCollection Date: 2025-04-01 DOI: 10.1093/jamiaopen/ooaf026
Ruichen Rong, Zifan Gu, Hongyin Lai, Tanna L Nelson, Tony Keller, Clark Walker, Kevin W Jin, Catherine Chen, Ann Marie Navar, Ferdinand Velasco, Eric D Peterson, Guanghua Xiao, Donghan M Yang, Yang Xie

Objectives: Recent advances in deep learning show significant potential in analyzing continuous monitoring electronic health records (EHR) data for clinical outcome prediction. We aim to develop a Transformer-based, Encounter-level Clinical Outcome (TECO) model to predict mortality in the intensive care unit (ICU) using inpatient EHR data.

Materials and methods: The TECO model was developed using multiple baseline and time-dependent clinical variables from 2579 hospitalized COVID-19 patients to predict ICU mortality and was validated externally in an acute respiratory distress syndrome cohort (n = 2799) and a sepsis cohort (n = 6622) from the Medical Information Mart for Intensive Care IV (MIMIC-IV). Model performance was evaluated based on the area under the receiver operating characteristic (AUC) and compared with Epic Deterioration Index (EDI), random forest (RF), and extreme gradient boosting (XGBoost).

Results: In the COVID-19 development dataset, TECO achieved higher AUC (0.89-0.97) across various time intervals compared to EDI (0.86-0.95), RF (0.87-0.96), and XGBoost (0.88-0.96). In the 2 MIMIC testing datasets (EDI not available), TECO yielded higher AUC (0.65-0.77) than RF (0.59-0.75) and XGBoost (0.59-0.74). In addition, TECO was able to identify clinically interpretable features that were correlated with the outcome.

Discussion: The TECO model outperformed proprietary metrics and conventional machine learning models in predicting ICU mortality among patients with COVID-19, widespread inflammation, respiratory illness, and other organ failures.

Conclusion: The TECO model demonstrates a strong capability for predicting ICU mortality using continuous monitoring data. While further validation is needed, TECO has the potential to serve as a powerful early warning tool across various diseases in inpatient settings.

目的:深度学习的最新进展显示了在分析连续监测电子健康记录(EHR)数据以预测临床结果方面的巨大潜力。我们的目标是开发一种基于transformer的遭遇级临床结果(TECO)模型,利用住院患者EHR数据预测重症监护病房(ICU)的死亡率。材料和方法:采用来自2579名住院COVID-19患者的多个基线和时间相关临床变量建立TECO模型,以预测ICU死亡率,并在重症监护医学信息市场(MIMIC-IV)的急性呼吸窘迫综合征队列(n = 2799)和脓毒症队列(n = 6622)中进行外部验证。基于接收器工作特征(AUC)下的面积评估模型性能,并与Epic劣化指数(EDI)、随机森林(RF)和极端梯度提升(XGBoost)进行比较。结果:在COVID-19发展数据集中,与EDI(0.86-0.95)、RF(0.87-0.96)和XGBoost(0.88-0.96)相比,TECO在不同时间间隔内的AUC(0.89-0.97)更高。在2个MIMIC测试数据集(EDI不可用)中,TECO产生的AUC(0.65-0.77)高于RF(0.59-0.75)和XGBoost(0.59-0.74)。此外,TECO能够识别与结果相关的临床可解释特征。讨论:TECO模型在预测COVID-19、广泛炎症、呼吸系统疾病和其他器官衰竭患者的ICU死亡率方面优于专有指标和传统机器学习模型。结论:TECO模型具有较强的利用连续监测数据预测ICU死亡率的能力。虽然需要进一步验证,但TECO有潜力作为住院环境中各种疾病的强大早期预警工具。
{"title":"A deep learning model for clinical outcome prediction using longitudinal inpatient electronic health records.","authors":"Ruichen Rong, Zifan Gu, Hongyin Lai, Tanna L Nelson, Tony Keller, Clark Walker, Kevin W Jin, Catherine Chen, Ann Marie Navar, Ferdinand Velasco, Eric D Peterson, Guanghua Xiao, Donghan M Yang, Yang Xie","doi":"10.1093/jamiaopen/ooaf026","DOIUrl":"10.1093/jamiaopen/ooaf026","url":null,"abstract":"<p><strong>Objectives: </strong>Recent advances in deep learning show significant potential in analyzing continuous monitoring electronic health records (EHR) data for clinical outcome prediction. We aim to develop a Transformer-based, Encounter-level Clinical Outcome (TECO) model to predict mortality in the intensive care unit (ICU) using inpatient EHR data.</p><p><strong>Materials and methods: </strong>The TECO model was developed using multiple baseline and time-dependent clinical variables from 2579 hospitalized COVID-19 patients to predict ICU mortality and was validated externally in an acute respiratory distress syndrome cohort (<i>n</i> = 2799) and a sepsis cohort (<i>n</i> = 6622) from the Medical Information Mart for Intensive Care IV (MIMIC-IV). Model performance was evaluated based on the area under the receiver operating characteristic (AUC) and compared with Epic Deterioration Index (EDI), random forest (RF), and extreme gradient boosting (XGBoost).</p><p><strong>Results: </strong>In the COVID-19 development dataset, TECO achieved higher AUC (0.89-0.97) across various time intervals compared to EDI (0.86-0.95), RF (0.87-0.96), and XGBoost (0.88-0.96). In the 2 MIMIC testing datasets (EDI not available), TECO yielded higher AUC (0.65-0.77) than RF (0.59-0.75) and XGBoost (0.59-0.74). In addition, TECO was able to identify clinically interpretable features that were correlated with the outcome.</p><p><strong>Discussion: </strong>The TECO model outperformed proprietary metrics and conventional machine learning models in predicting ICU mortality among patients with COVID-19, widespread inflammation, respiratory illness, and other organ failures.</p><p><strong>Conclusion: </strong>The TECO model demonstrates a strong capability for predicting ICU mortality using continuous monitoring data. While further validation is needed, TECO has the potential to serve as a powerful early warning tool across various diseases in inpatient settings.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf026"},"PeriodicalIF":2.5,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11984207/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144048372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A proof-of-concept study for patient use of open notes with large language models. 一项关于病人使用大型语言模型的开放笔记的概念验证研究。
IF 2.5 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-04-09 eCollection Date: 2025-04-01 DOI: 10.1093/jamiaopen/ooaf021
Liz Salmi, Dana M Lewis, Jennifer L Clarke, Zhiyong Dong, Rudy Fischmann, Emily I McIntosh, Chethan R Sarabu, Catherine M DesRoches

Objectives: The use of large language models (LLMs) is growing for both clinicians and patients. While researchers and clinicians have explored LLMs to manage patient portal messages and reduce burnout, there is less documentation about how patients use these tools to understand clinical notes and inform decision-making. This proof-of-concept study examined the reliability and accuracy of LLMs in responding to patient queries based on an open visit note.

Materials and methods: In a cross-sectional proof-of-concept study, 3 commercially available LLMs (ChatGPT 4o, Claude 3 Opus, Gemini 1.5) were evaluated using 4 distinct prompt series-Standard, Randomized, Persona, and Randomized Persona-with multiple questions, designed by patients, in response to a single neuro-oncology progress note. LLM responses were scored by the note author (neuro-oncologist) and a patient who receives care from the note author, using an 8-criterion rubric that assessed Accuracy, Relevance, Clarity, Actionability, Empathy/Tone, Completeness, Evidence, and Consistency. Descriptive statistics were used to summarize the performance of each LLM across all prompts.

Results: Overall, the Standard and Persona-based prompt series yielded the best results across all criterion regardless of LLM. Chat-GPT 4o using Persona-based prompts scored highest in all categories. All LLMs scored low in the use of Evidence.

Discussion: This proof-of-concept study highlighted the potential for LLMs to assist patients in interpreting open notes. The most effective LLM responses were achieved by applying Persona-style prompts to a patient's question.

Conclusion: Optimizing LLMs for patient-driven queries, and patient education and counseling around the use of LLMs, have potential to enhance patient use and understanding of their health information.

目的:临床医生和患者越来越多地使用大型语言模型(llm)。虽然研究人员和临床医生已经探索了法学硕士来管理患者门户信息和减少倦怠,但关于患者如何使用这些工具来理解临床记录并为决策提供信息的文献较少。这项概念验证研究检查了法学硕士在回应基于公开访问记录的患者查询时的可靠性和准确性。材料和方法:在一项横断面概念验证研究中,使用4个不同的提示系列(标准、随机、角色和随机角色)对3个商业llm (ChatGPT 40、Claude 3 Opus、Gemini 1.5)进行评估,并对患者设计的多个问题进行评估,以回应单一的神经肿瘤学进展记录。LLM的回答由笔记作者(神经肿瘤学家)和接受笔记作者治疗的患者评分,使用8个标准来评估准确性、相关性、清晰度、可操作性、移情/语气、完整性、证据性和一致性。描述性统计用于总结所有提示中每个LLM的性能。结果:总体而言,标准和基于角色的提示系列在所有标准中都产生了最好的结果,而不考虑LLM。使用基于角色提示的Chat-GPT 40在所有类别中得分最高。所有法学硕士在证据使用方面得分都很低。讨论:这项概念验证研究强调了llm帮助患者解读开放笔记的潜力。最有效的LLM回应是通过对患者的问题应用Persona-style提示来实现的。结论:为患者驱动的查询优化法学硕士,并围绕法学硕士的使用进行患者教育和咨询,有可能提高患者对其健康信息的使用和理解。
{"title":"A proof-of-concept study for patient use of open notes with large language models.","authors":"Liz Salmi, Dana M Lewis, Jennifer L Clarke, Zhiyong Dong, Rudy Fischmann, Emily I McIntosh, Chethan R Sarabu, Catherine M DesRoches","doi":"10.1093/jamiaopen/ooaf021","DOIUrl":"https://doi.org/10.1093/jamiaopen/ooaf021","url":null,"abstract":"<p><strong>Objectives: </strong>The use of large language models (LLMs) is growing for both clinicians and patients. While researchers and clinicians have explored LLMs to manage patient portal messages and reduce burnout, there is less documentation about how patients use these tools to understand clinical notes and inform decision-making. This proof-of-concept study examined the reliability and accuracy of LLMs in responding to patient queries based on an open visit note.</p><p><strong>Materials and methods: </strong>In a cross-sectional proof-of-concept study, 3 commercially available LLMs (ChatGPT 4o, Claude 3 Opus, Gemini 1.5) were evaluated using 4 distinct prompt series-<i>Standard</i>, <i>Randomized</i>, <i>Persona</i>, and <i>Randomized Persona</i>-with multiple questions, designed by patients, in response to a single neuro-oncology progress note. LLM responses were scored by the note author (neuro-oncologist) and a patient who receives care from the note author, using an 8-criterion rubric that assessed <i>Accuracy</i>, <i>Relevance</i>, <i>Clarity</i>, <i>Actionability</i>, <i>Empathy/Tone</i>, <i>Completeness</i>, <i>Evidence</i>, and <i>Consistency</i>. Descriptive statistics were used to summarize the performance of each LLM across all prompts.</p><p><strong>Results: </strong>Overall, the Standard and Persona-based prompt series yielded the best results across all criterion regardless of LLM. Chat-GPT 4o using Persona-based prompts scored highest in all categories. All LLMs scored low in the use of <i>Evidence</i>.</p><p><strong>Discussion: </strong>This proof-of-concept study highlighted the potential for LLMs to assist patients in interpreting open notes. The most effective LLM responses were achieved by applying <i>Persona</i>-style prompts to a patient's question.</p><p><strong>Conclusion: </strong>Optimizing LLMs for patient-driven queries, and patient education and counseling around the use of LLMs, have potential to enhance patient use and understanding of their health information.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf021"},"PeriodicalIF":2.5,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11980777/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144031741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JAMIA Open
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1