首页 > 最新文献

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science最新文献

英文 中文
Automated Family Histories Significantly Improve Risk Prediction in an EHR. 电子病历中的自动家族病史可显著改善风险预测。
Xiayuan Huang, Ross Kleiman, David Page, Scott Hebbring

We recently demonstrated that electronically constructed family pedigrees (e-pedigrees) have great value in epidemiologic research using electronic health record (EHR) data. Prior to this work, it has been well accepted that family health history is a major predictor for a wide spectrum of diseases, reflecting shared effects of genetics, environment, and lifestyle. With the widespread digitalization of patient data via EHRs, there is an unprecedented opportunity to use machine learning algorithms to better predict disease risk. Although predictive models have previously been constructed for a few important diseases, we currently know very little about how accurately the risk for most diseases can be predicted. It is further unknown if the incorporation of e-pedigrees in machine learning can improve the value of these models. In this study, we devised a family pedigree-driven high-throughput machine learning pipeline to simultaneously predict risks for thousands of diagnosis codes using thousands of input features. Models were built to predict future disease risk for three time windows using both Logistic Regression and XGBoost. For example, we achieved average areas under the receiver operating characteristic curves (AUCs) of 0.82, 0.77 and 0.71 for 1, 6, and 24 months, respectively using XGBoost and without e-pedigrees. When adding e-pedigree features to the XGBoost pipeline, AUCs increased to 0.83, 0.79 and 0.74 for the same three time periods, respectively. E-pedigrees similarly improved the predictions when using Logistic Regression. These results emphasize the potential value of incorporating family health history via e-pedigrees into machine learning with no further human time.

最近,我们利用电子健康记录(EHR)数据证明了电子构建家系(e-pedigrees)在流行病学研究中的巨大价值。在这项工作之前,家族健康史是多种疾病的主要预测因素,反映了遗传、环境和生活方式的共同影响,这一点已被广泛接受。随着电子病历(EHR)对患者数据的广泛数字化,为使用机器学习算法更好地预测疾病风险提供了前所未有的机会。虽然以前已经针对一些重要疾病建立了预测模型,但我们目前对如何准确预测大多数疾病的风险知之甚少。此外,我们还不知道在机器学习中加入电子病历是否能提高这些模型的价值。在这项研究中,我们设计了一个家系驱动的高通量机器学习管道,利用数千个输入特征同时预测数千个诊断代码的风险。我们利用 Logistic 回归和 XGBoost 建立了预测三个时间窗未来疾病风险的模型。例如,在使用 XGBoost 和不使用电子病历的情况下,我们在 1 个月、6 个月和 24 个月的接收者工作特征曲线下的平均面积(AUC)分别为 0.82、0.77 和 0.71。在 XGBoost 管道中添加电子家谱特征后,相同三个时间段的 AUC 分别增加到 0.83、0.79 和 0.74。在使用逻辑回归时,电子家谱同样提高了预测结果。这些结果凸显了通过电子pedigrees将家族健康史纳入机器学习的潜在价值,而无需花费更多的人力时间。
{"title":"Automated Family Histories Significantly Improve Risk Prediction in an EHR.","authors":"Xiayuan Huang, Ross Kleiman, David Page, Scott Hebbring","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>We recently demonstrated that electronically constructed family pedigrees (e-pedigrees) have great value in epidemiologic research using electronic health record (EHR) data. Prior to this work, it has been well accepted that family health history is a major predictor for a wide spectrum of diseases, reflecting shared effects of genetics, environment, and lifestyle. With the widespread digitalization of patient data via EHRs, there is an unprecedented opportunity to use machine learning algorithms to better predict disease risk. Although predictive models have previously been constructed for a few important diseases, we currently know very little about how accurately the risk for most diseases can be predicted. It is further unknown if the incorporation of e-pedigrees in machine learning can improve the value of these models. In this study, we devised a family pedigree-driven high-throughput machine learning pipeline to simultaneously predict risks for thousands of diagnosis codes using thousands of input features. Models were built to predict future disease risk for three time windows using both Logistic Regression and XGBoost. For example, we achieved average areas under the receiver operating characteristic curves (AUCs) of 0.82, 0.77 and 0.71 for 1, 6, and 24 months, respectively using XGBoost and without e-pedigrees. When adding e-pedigree features to the XGBoost pipeline, AUCs increased to 0.83, 0.79 and 0.74 for the same three time periods, respectively. E-pedigrees similarly improved the predictions when using Logistic Regression. These results emphasize the potential value of incorporating family health history via e-pedigrees into machine learning with no further human time.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"221-229"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141855/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of Added Emphasis and Pause in Audio Delivery of Health Information. 在健康信息音频传播中增加强调和暂停的效果。
Arif Ahmed, Gondy Leroy, Stephen A Rains, Philip Harber, David Kauchak, Prosanta Barai

Health literacy is crucial to supporting good health and is a major national goal. Audio delivery of information is becoming more popular for informing oneself. In this study, we evaluate the effect of audio enhancements in the form of information emphasis and pauses with health texts of varying difficulty and we measure health information comprehension and retention. We produced audio snippets from difficult and easy text and conducted the study on Amazon Mechanical Turk (AMT). Our findings suggest that emphasis matters for both information comprehension and retention. When there is no added pause, emphasizing significant information can lower the perceived difficulty for difficult and easy texts. Comprehension is higher (54%) with correctly placed emphasis for the difficult texts compared to not adding emphasis (50%). Adding a pause lowers perceived difficulty and can improve retention but adversely affects information comprehension.

健康知识普及对支持良好的健康至关重要,也是一项重要的国家目标。通过音频传递信息越来越受到人们的欢迎。在本研究中,我们评估了以信息强调和停顿的形式对不同难度的健康文本进行音频增强的效果,并测量了健康信息的理解和保留情况。我们制作了难易文本的音频片段,并在亚马逊机械手(Amazon Mechanical Turk,AMT)上进行了研究。我们的研究结果表明,强调对于信息的理解和保留都很重要。在没有额外停顿的情况下,强调重要信息可以降低难懂和简单文本的感知难度。与不加停顿(50%)相比,正确强调难懂文章的理解率更高(54%)。添加停顿可降低感知难度,并能提高信息的保留率,但会对信息的理解产生不利影响。
{"title":"Effects of Added Emphasis and Pause in Audio Delivery of Health Information.","authors":"Arif Ahmed, Gondy Leroy, Stephen A Rains, Philip Harber, David Kauchak, Prosanta Barai","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Health literacy is crucial to supporting good health and is a major national goal. Audio delivery of information is becoming more popular for informing oneself. In this study, we evaluate the effect of audio enhancements in the form of information emphasis and pauses with health texts of varying difficulty and we measure health information comprehension and retention. We produced audio snippets from difficult and easy text and conducted the study on Amazon Mechanical Turk (AMT). Our findings suggest that emphasis matters for both information comprehension and retention. When there is no added pause, emphasizing significant information can lower the perceived difficulty for difficult and easy texts. Comprehension is higher (54%) with correctly placed emphasis for the difficult texts compared to not adding emphasis (50%). Adding a pause lowers perceived difficulty and can improve retention but adversely affects information comprehension.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"54-62"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141844/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141200730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating Cross-Domain Binary Relation Classification in Biomedical Natural Language Processing. 研究生物医学自然语言处理中的跨域二元关系分类。
Alberto Purpura, Natasha Mulligan, Uri Kartoun, Eileen Koski, Vibha Anand, Joao Bettencourt-Silva

This paper addresses the challenge of binary relation classification in biomedical Natural Language Processing (NLP), focusing on diverse domains including gene-disease associations, compound protein interactions, and social determinants of health (SDOH). We evaluate different approaches, including fine-tuning Bidirectional Encoder Representations from Transformers (BERT) models and generative Large Language Models (LLMs), and examine their performance in zero and few-shot settings. We also introduce a novel dataset of biomedical text annotated with social and clinical entities to facilitate research into relation classification. Our results underscore the continued complexity of this task for both humans and models. BERT-based models trained on domain-specific data excelled in certain domains and achieved comparable performance and generalization power to generative LLMs in others. Despite these encouraging results, these models are still far from achieving human-level performance. We also highlight the significance of high-quality training data and domain-specific fine-tuning on the performance of all the considered models.

本文探讨了生物医学自然语言处理(NLP)中二元关系分类所面临的挑战,重点关注基因-疾病关联、复合蛋白质相互作用和健康的社会决定因素(SDOH)等不同领域。我们评估了不同的方法,包括微调变换器双向编码器表征(BERT)模型和生成式大型语言模型(LLM),并检验了它们在零点和少点设置下的性能。我们还引入了一个标注了社会和临床实体的生物医学文本新数据集,以促进关系分类研究。我们的研究结果凸显了这项任务对于人类和模型的持续复杂性。基于特定领域数据训练的 BERT 模型在某些领域表现出色,而在其他领域则取得了与生成式 LLM 相媲美的性能和泛化能力。尽管取得了这些令人鼓舞的结果,但这些模型仍远未达到人类水平。我们还强调了高质量的训练数据和特定领域的微调对所有模型性能的重要性。
{"title":"Investigating Cross-Domain Binary Relation Classification in Biomedical Natural Language Processing.","authors":"Alberto Purpura, Natasha Mulligan, Uri Kartoun, Eileen Koski, Vibha Anand, Joao Bettencourt-Silva","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This paper addresses the challenge of binary relation classification in biomedical Natural Language Processing (NLP), focusing on diverse domains including gene-disease associations, compound protein interactions, and social determinants of health (SDOH). We evaluate different approaches, including fine-tuning Bidirectional Encoder Representations from Transformers (BERT) models and generative Large Language Models (LLMs), and examine their performance in zero and few-shot settings. We also introduce a novel dataset of biomedical text annotated with social and clinical entities to facilitate research into relation classification. Our results underscore the continued complexity of this task for both humans and models. BERT-based models trained on domain-specific data excelled in certain domains and achieved comparable performance and generalization power to generative LLMs in others. Despite these encouraging results, these models are still far from achieving human-level performance. We also highlight the significance of high-quality training data and domain-specific fine-tuning on the performance of all the considered models.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"384-390"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141837/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selection and Implementation of Virtual Scribe Solutions to Reduce Documentation Burden: A Mixed Methods Pilot. 选择和实施虚拟抄写员解决方案以减轻文档记录负担:混合方法试验。
Carly Hudelson, Melissa A Gunderson, Debbie Pestka, Tori Christiaansen, Bret Stotka, Lynn Kissock, Rebecca Markowitz, Sameer Badlani, Genevieve B Melton

Electronic health record (EHR) documentation is a leading reason for clinician burnout. While technology-enabled solutions like virtual and digital scribes aim to improve this, there is limited evidence of their effectiveness and minimal guidance for healthcare systems around solution selection and implementation. A transdisciplinary approach, informed by clinician interviews and other considerations, was used to evaluate and select a virtual scribe solution to pilot in a rapid iterative sprint over 12 weeks. Surveys, interviews, and EHR metadata were analyzed over a staggered 30 day implementation with live and asynchronous virtual scribe solutions. Among 16 pilot clinicians, documentation burden metrics decreased for some but not all. Some clinicians had highly positive comments, and others had concerns regarding scribe training and quality. Our findings demonstrate that virtual scribes may reduce documentation burden for some clinicians and describe a method for a collaborative and iterative technology selection process for digital tools in practice.

电子健康记录(EHR)文档是造成临床医生职业倦怠的一个主要原因。虽然虚拟和数字抄写员等技术辅助解决方案旨在改善这一问题,但有关其有效性的证据有限,对医疗保健系统选择和实施解决方案的指导也少之又少。在临床医生访谈和其他考虑因素的基础上,我们采用了一种跨学科的方法来评估和选择虚拟抄写员解决方案,并在 12 周的快速迭代冲刺阶段进行试点。在 30 天的交错实施过程中,对实时和异步虚拟抄写员解决方案的调查、访谈和电子病历元数据进行了分析。在 16 名试点临床医生中,部分人的文档负担指标有所减轻,但并非全部。一些临床医生给予了高度评价,另一些则对抄写员的培训和质量表示担忧。我们的研究结果表明,虚拟抄写员可以减轻部分临床医生的文档记录负担,并描述了一种在实践中对数字工具进行协作和迭代技术选择的方法。
{"title":"Selection and Implementation of Virtual Scribe Solutions to Reduce Documentation Burden: A Mixed Methods Pilot.","authors":"Carly Hudelson, Melissa A Gunderson, Debbie Pestka, Tori Christiaansen, Bret Stotka, Lynn Kissock, Rebecca Markowitz, Sameer Badlani, Genevieve B Melton","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Electronic health record (EHR) documentation is a leading reason for clinician burnout. While technology-enabled solutions like virtual and digital scribes aim to improve this, there is limited evidence of their effectiveness and minimal guidance for healthcare systems around solution selection and implementation. A transdisciplinary approach, informed by clinician interviews and other considerations, was used to evaluate and select a virtual scribe solution to pilot in a rapid iterative sprint over 12 weeks. Surveys, interviews, and EHR metadata were analyzed over a staggered 30 day implementation with live and asynchronous virtual scribe solutions. Among 16 pilot clinicians, documentation burden metrics decreased for some but not all. Some clinicians had highly positive comments, and others had concerns regarding scribe training and quality. Our findings demonstrate that virtual scribes may reduce documentation burden for some clinicians and describe a method for a collaborative and iterative technology selection process for digital tools in practice.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"230-238"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141854/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Predicting Smoking Events for Just-in-time Interventions. 预测吸烟事件,进行及时干预。
Hang Yu, Michael Kotlyar, Paul Thuras, Sheena Dufresne, Serguei Vs Pakhomov

Consumer-grade heart rate (HR) sensors are widely used for tracking physical and mental health status. We explore the feasibility of using Polar H10 electrocardiogram (ECG) sensor to detect and predict cigarette smoking events in naturalistic settings with several machine learning approaches. We have collected and analyzed data for 28 participants observed over a two-week period. We found that using bidirectional long short-term memory (BiLSTM) with ECG-derived and GPS location input features yielded the highest mean accuracy of 69% for smoking event detection. For predicting smoking events, the highest accuracy of 67% was achieved using the fine-tuned LSTM approach. We also found a significant correlation between accuracy and the number of smoking events available from each participant. Our findings indicate that both detection and prediction of smoking events are feasible but require an individualized approach to training the models, particularly for prediction.

消费级心率(HR)传感器被广泛用于跟踪身体和精神健康状况。我们利用几种机器学习方法探索了使用 Polar H10 心电图(ECG)传感器在自然环境中检测和预测吸烟事件的可行性。我们收集并分析了 28 名参与者两周内的观察数据。我们发现,使用双向长短期记忆(BiLSTM)以及心电图衍生和 GPS 位置输入特征检测吸烟事件的平均准确率最高,达到 69%。在预测吸烟事件方面,微调 LSTM 方法的准确率最高,达到 67%。我们还发现,准确率与每位参与者的吸烟事件数量之间存在明显的相关性。我们的研究结果表明,吸烟事件的检测和预测都是可行的,但需要采用个性化的方法来训练模型,尤其是预测模型。
{"title":"Towards Predicting Smoking Events for Just-in-time Interventions.","authors":"Hang Yu, Michael Kotlyar, Paul Thuras, Sheena Dufresne, Serguei Vs Pakhomov","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Consumer-grade heart rate (HR) sensors are widely used for tracking physical and mental health status. We explore the feasibility of using Polar H10 electrocardiogram (ECG) sensor to detect and predict cigarette smoking events in naturalistic settings with several machine learning approaches. We have collected and analyzed data for 28 participants observed over a two-week period. We found that using bidirectional long short-term memory (BiLSTM) with ECG-derived and GPS location input features yielded the highest mean accuracy of 69% for smoking event detection. For predicting smoking events, the highest accuracy of 67% was achieved using the fine-tuned LSTM approach. We also found a significant correlation between accuracy and the number of smoking events available from each participant. Our findings indicate that both detection and prediction of smoking events are feasible but require an individualized approach to training the models, particularly for prediction.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"468-477"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141818/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Utilizing Large Language Models to Generate Synthetic Data to Increase the Performance of BERT-Based Neural Networks. 利用大型语言模型生成合成数据,提高基于 BERT 的神经网络的性能。
Chancellor R Woolsey, Prakash Bisht, Joshua Rothman, Gondy Leroy

An important problem impacting healthcare is the lack of available experts. Machine learning (ML) models may help resolve this by aiding in screening and diagnosing patients. However, creating large, representative datasets to train models is expensive. We evaluated large language models (LLMs) for data creation. Using Autism Spectrum Disorders (ASD), we prompted GPT-3.5 and GPT-4 to generate 4,200 synthetic examples of behaviors to augment existing medical observations. Our goal is to label behaviors corresponding to autism criteria and improve model accuracy with synthetic training data. We used a BERT classifier pretrained on biomedical literature to assess differences in performance between models. A random sample (N=140) from the LLM-generated data was also evaluated by a clinician and found to contain 83% correct behavioral example-label pairs. Augmenting the dataset increased recall by 13% but decreased precision by 16%. Future work will investigate how different synthetic data characteristics affect ML outcomes.

影响医疗保健的一个重要问题是缺乏可用的专家。机器学习 (ML) 模型可以帮助筛查和诊断病人,从而解决这一问题。然而,创建大型、有代表性的数据集来训练模型的成本很高。我们评估了用于创建数据的大型语言模型(LLM)。利用自闭症谱系障碍(ASD),我们促使 GPT-3.5 和 GPT-4 生成了 4,200 个合成行为示例,以增强现有的医学观察结果。我们的目标是标注与自闭症标准相对应的行为,并通过合成训练数据提高模型的准确性。我们使用生物医学文献预训练的 BERT 分类器来评估不同模型之间的性能差异。临床医生也对 LLM 生成数据中的随机样本(N=140)进行了评估,发现其中包含 83% 正确的行为示例-标签对。扩充数据集后,召回率提高了 13%,但精确度降低了 16%。未来的工作将研究不同的合成数据特征如何影响 ML 结果。
{"title":"Utilizing Large Language Models to Generate Synthetic Data to Increase the Performance of BERT-Based Neural Networks.","authors":"Chancellor R Woolsey, Prakash Bisht, Joshua Rothman, Gondy Leroy","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>An important problem impacting healthcare is the lack of available experts. Machine learning (ML) models may help resolve this by aiding in screening and diagnosing patients. However, creating large, representative datasets to train models is expensive. We evaluated large language models (LLMs) for data creation. Using Autism Spectrum Disorders (ASD), we prompted GPT-3.5 and GPT-4 to generate 4,200 synthetic examples of behaviors to augment existing medical observations. Our goal is to label behaviors corresponding to autism criteria and improve model accuracy with synthetic training data. We used a BERT classifier pretrained on biomedical literature to assess differences in performance between models. A random sample (N=140) from the LLM-generated data was also evaluated by a clinician and found to contain 83% correct behavioral example-label pairs. Augmenting the dataset increased recall by 13% but decreased precision by 16%. Future work will investigate how different synthetic data characteristics affect ML outcomes.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"429-438"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141799/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Electronic Phenotyping of Urinary Tract Infections as a Silver Standard Label for Machine Learning. 将尿路感染电子表型作为机器学习的银标准标签
Stephen P Ma, Ebru Hosgur, Conor K Corbin, Ivan Lopez, Amy Chang, Jonathan H Chen

This study explored the efficacy of electronic phenotyping in data labeling for machine learning with a focus on urinary tract infections (UTIs). We contrasted labels from electronic phenotyping against previously published labels such as urine culture positivity. In comparison, electronic phenotyping showed the potential to enhance specificity in UTI labeling while maintaining similar sensitivity and was easily scaled for application to a large dataset suitable for machine learning, which we used to train and validate a machine learning model. Electronic phenotyping offers a valuable method for machine learning label generation in healthcare, with potential benefits for patient care and antimicrobial stewardship. Further research will expand its application and optimize techniques for increased performance.

本研究探讨了电子表型在机器学习数据标注中的功效,重点关注尿路感染(UTI)。我们将电子表型的标签与之前公布的标签(如尿培养阳性)进行了对比。相比之下,电子表型技术显示出了提高UTI标签特异性的潜力,同时保持了相似的灵敏度,而且很容易扩展应用到适合机器学习的大型数据集,我们用它来训练和验证机器学习模型。电子表型为医疗保健领域的机器学习标签生成提供了一种有价值的方法,可为患者护理和抗菌药物管理带来潜在益处。进一步的研究将扩大其应用范围并优化技术以提高性能。
{"title":"Electronic Phenotyping of Urinary Tract Infections as a Silver Standard Label for Machine Learning.","authors":"Stephen P Ma, Ebru Hosgur, Conor K Corbin, Ivan Lopez, Amy Chang, Jonathan H Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study explored the efficacy of electronic phenotyping in data labeling for machine learning with a focus on urinary tract infections (UTIs). We contrasted labels from electronic phenotyping against previously published labels such as urine culture positivity. In comparison, electronic phenotyping showed the potential to enhance specificity in UTI labeling while maintaining similar sensitivity and was easily scaled for application to a large dataset suitable for machine learning, which we used to train and validate a machine learning model. Electronic phenotyping offers a valuable method for machine learning label generation in healthcare, with potential benefits for patient care and antimicrobial stewardship. Further research will expand its application and optimize techniques for increased performance.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"182-189"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141812/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141200681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pathophysiological Features in Electronic Medical Records Sustain Model Performance under Temporal Dataset Shift. 电子病历中的病理生理学特征可在数据集时空转移的情况下维持模型性能。
Raphael Brosula, Conor K Corbin, Jonathan H Chen

Access to real-world data streams like electronic medical records (EMRs) has accelerated the development of supervised machine learning (ML) models for clinical applications. However, few studies investigate the differential impact of particular features in the EMR on model performance under temporal dataset shift. To explain how features in the EMR impact models over time, this study aggregates features into feature groups by their source (e.g. medication orders, diagnosis codes and lab results) and feature categories based on their reflection of patient pathophysiology or healthcare processes. We adapt Shapley values to explain feature groups' and feature categories' marginal contribution to initial and sustained model performance. We investigate three standard clinical prediction tasks and find that while feature contributions to initial performance differ across tasks, pathophysiological features help mitigate temporal discrimination deterioration. These results provide interpretable insights on how specific feature groups contribute to model performance and robustness to temporal dataset shift.

电子病历(EMR)等真实世界数据流的获取加速了临床应用中监督机器学习(ML)模型的开发。然而,很少有研究调查 EMR 中的特定特征对模型性能在时间数据集转移下的不同影响。为了解释 EMR 中的特征如何随着时间的推移对模型产生影响,本研究将特征按其来源(如医嘱、诊断代码和化验结果)聚合成特征组,并根据其对患者病理生理学或医疗流程的反映将特征分类。我们采用夏普利值来解释特征组和特征类别对初始和持续模型性能的边际贡献。我们对三项标准临床预测任务进行了研究,发现虽然不同任务的特征对初始性能的贡献不同,但病理生理特征有助于缓解时间辨别能力的退化。这些结果提供了可解释的见解,说明特定特征组如何对模型性能和对时间数据集转移的稳健性做出贡献。
{"title":"Pathophysiological Features in Electronic Medical Records Sustain Model Performance under Temporal Dataset Shift.","authors":"Raphael Brosula, Conor K Corbin, Jonathan H Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Access to real-world data streams like electronic medical records (EMRs) has accelerated the development of supervised machine learning (ML) models for clinical applications. However, few studies investigate the differential impact of particular features in the EMR on model performance under temporal dataset shift. To explain how features in the EMR impact models over time, this study aggregates features into <i>feature groups</i> by their source (e.g. medication orders, diagnosis codes and lab results) and <i>feature categories</i> based on their reflection of patient pathophysiology or healthcare processes. We adapt Shapley values to explain feature groups' and feature categories' marginal contribution to initial and sustained model performance. We investigate three standard clinical prediction tasks and find that while feature contributions to initial performance differ across tasks, pathophysiological features help mitigate temporal discrimination deterioration. These results provide interpretable insights on how specific feature groups contribute to model performance and robustness to temporal dataset shift.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"95-104"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141811/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic Population of the Case Report Forms for an International Multifactorial Adaptive Platform Trial Amid the COVID-19 Pandemic. 在 COVID-19 大流行中自动生成国际多因素自适应平台试验的病例报告表。
Andrew J King, Lisa Higgins, Carly Au, Salim Malakouti, Edvin Music, Kyle Kalchthaler, Gilles Clermont, William Garrard, David T Huang, Bryan J McVerry, Christopher W Seymour, Kelsey Linstrum, Amanda McNamara, Cameron Green, India Loar, Tracey Roberts, Oscar Marroquin, Derek C Angus, Christopher M Horvat

Objectives: To automatically populate the case report forms (CRFs) for an international, pragmatic, multifactorial, response-adaptive, Bayesian COVID-19 platform trial.

Methods: The locations of focus included 27 hospitals and 2 large electronic health record (EHR) instances (1 Cerner Millennium and 1 Epic) that are part of the same health system in the United States. This paper describes our efforts to use EHR data to automatically populate four of the trial's forms: baseline, daily, discharge, and response-adaptive randomization.

Results: Between April 2020 and May 2022, 417 patients from the UPMC health system were enrolled in the trial. A MySQL-based extract, transform, and load pipeline automatically populated 499 of 526 CRF variables. The populated forms were statistically and manually reviewed and then reported to the trial's international data coordinating center.

Conclusions: We accomplished automatic population of CRFs in a large platform trial and made recommendations for improving this process for future trials.

目的为一项国际性、务实、多因素、反应自适应、贝叶斯 COVID-19 平台试验自动填充病例报告表 (CRF):重点研究地点包括隶属于美国同一医疗系统的 27 家医院和 2 个大型电子病历 (EHR) 实例(1 个 Cerner Millennium 和 1 个 Epic)。本文介绍了我们在使用电子病历数据自动填充试验的四种表格方面所做的努力:基线、日常、出院和反应自适应随机化:2020年4月至2022年5月期间,UPMC医疗系统的417名患者加入了试验。基于 MySQL 的提取、转换和加载管道自动填充了 526 个 CRF 变量中的 499 个。填充后的表格经过统计和人工审核,然后报告给试验的国际数据协调中心:我们在一项大型平台试验中实现了 CRF 的自动填充,并为今后的试验提出了改进这一流程的建议。
{"title":"Automatic Population of the Case Report Forms for an International Multifactorial Adaptive Platform Trial Amid the COVID-19 Pandemic.","authors":"Andrew J King, Lisa Higgins, Carly Au, Salim Malakouti, Edvin Music, Kyle Kalchthaler, Gilles Clermont, William Garrard, David T Huang, Bryan J McVerry, Christopher W Seymour, Kelsey Linstrum, Amanda McNamara, Cameron Green, India Loar, Tracey Roberts, Oscar Marroquin, Derek C Angus, Christopher M Horvat","doi":"","DOIUrl":"","url":null,"abstract":"<p><strong>Objectives: </strong>To automatically populate the case report forms (CRFs) for an international, pragmatic, multifactorial, response-adaptive, Bayesian COVID-19 platform trial.</p><p><strong>Methods: </strong>The locations of focus included 27 hospitals and 2 large electronic health record (EHR) instances (1 Cerner Millennium and 1 Epic) that are part of the same health system in the United States. This paper describes our efforts to use EHR data to automatically populate four of the trial's forms: baseline, daily, discharge, and response-adaptive randomization.</p><p><strong>Results: </strong>Between April 2020 and May 2022, 417 patients from the UPMC health system were enrolled in the trial. A MySQL-based extract, transform, and load pipeline automatically populated 499 of 526 CRF variables. The populated forms were statistically and manually reviewed and then reported to the trial's international data coordinating center.</p><p><strong>Conclusions: </strong>We accomplished automatic population of CRFs in a large platform trial and made recommendations for improving this process for future trials.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"276-284"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141839/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FERI: A Multitask-based Fairness Achieving Algorithm with Applications to Fair Organ Transplantation. FERI:基于多任务的公平实现算法,应用于公平器官移植。
Can Li, Dejian Lai, Xiaoqian Jiang, Kai Zhang

Liver transplantation often faces fairness challenges across subgroups defined by sensitive attributes such as age group, gender, and race/ethnicity. Machine learning models for outcome prediction can introduce additional biases. Therefore, we introduce Fairness through the Equitable Rate of Improvement in Multitask Learning (FERI) algorithm for fair predictions of graft failure risk in liver transplant patients. FERI constrains subgroup loss by balancing learning rates and preventing subgroup dominance in the training process. Our results show that FERI maintained high predictive accuracy with AUROC and AUPRC comparable to baseline models. More importantly, FERI demonstrated an ability to improve fairness without sacrificing accuracy. Specifically, for the gender, FERI reduced the demographic parity disparity by 71.74%, and for the age group, it decreased the equalized odds disparity by 40.46%. Therefore, the FERI algorithm advanced fairness-aware predictive modeling in healthcare and provides an invaluable tool for equitable healthcare systems.

肝脏移植往往面临着由年龄组、性别和种族/民族等敏感属性所定义的亚组的公平性挑战。用于结果预测的机器学习模型可能会引入额外的偏差。因此,我们引入了多任务学习中的公平改进率(FERI)算法,用于公平预测肝移植患者的移植物失败风险。FERI 通过平衡学习率和防止训练过程中的亚组优势来限制亚组损失。我们的研究结果表明,FERI 保持了较高的预测准确率,其 AUROC 和 AUPRC 与基线模型相当。更重要的是,FERI 能够在不牺牲准确性的情况下提高公平性。具体来说,在性别方面,FERI 将人口均等差距缩小了 71.74%,在年龄组方面,将均等几率差距缩小了 40.46%。因此,FERI 算法推进了医疗保健领域的公平感知预测建模,为公平医疗保健系统提供了宝贵的工具。
{"title":"FERI: A Multitask-based Fairness Achieving Algorithm with Applications to Fair Organ Transplantation.","authors":"Can Li, Dejian Lai, Xiaoqian Jiang, Kai Zhang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Liver transplantation often faces fairness challenges across subgroups defined by sensitive attributes such as age group, gender, and race/ethnicity. Machine learning models for outcome prediction can introduce additional biases. Therefore, we introduce <b>F</b>airness through the <b>E</b>quitable <b>R</b>ate of <b>I</b>mprovement in Multitask Learning (FERI) algorithm for fair predictions of graft failure risk in liver transplant patients. FERI constrains subgroup loss by balancing learning rates and preventing subgroup dominance in the training process. Our results show that FERI maintained high predictive accuracy with AUROC and AUPRC comparable to baseline models. More importantly, FERI demonstrated an ability to improve fairness without sacrificing accuracy. Specifically, for the gender, FERI reduced the demographic parity disparity by 71.74%, and for the age group, it decreased the equalized odds disparity by 40.46%. Therefore, the FERI algorithm advanced fairness-aware predictive modeling in healthcare and provides an invaluable tool for equitable healthcare systems.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"593-602"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141863/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141200786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1