Pub Date: 2025-10-23 | eCollection Date: 2025-10-01 | DOI: 10.1093/jamiaopen/ooaf113
Cong Liu, Katherine D Crew, Jennifer Morse, Jodell E Linder, Antonis C Antoniou, Tim Carver, Josh Cortopassi, Josh F Peterson, Casey N Ta, Christin Hoell, Cynthia Prows, Eimear E Kenny, Emily Miller, Emma Perez, Gail P Jarvik, Harris T Bland, Jacqueline A Odgis, Kathleen F Mittendorf, Katherine E Bonini, Kyle McGuffin, Leah C Kottyan, Mary Maradik, Nita Limdi, Noura S Abul-Husn, Priya N Marathe, Sabrina A Suckiel, Sienna Aguilar, Toni J Lewis, Wei-Qi Wei, Yuan Luo, Robert R Freimuth, Hakon Hakonarson, Chunhua Weng, Wendy K Chung, Georgia L Wiesner
Objectives: To implement an automated multi-institutional pipeline that delivers breast-cancer risk integrated with polygenic risk scores, monogenic variants, family history, and clinical factors, emphasizing operational challenges and their solutions.
Materials and methods: A five-stage process was executed at ten sites. Data streams from REDCap surveys, PRS and monogenic reports, and MeTree pedigrees were normalized and forwarded through a REDCap plug-in to the CanRisk API.
Results: Integrated risk was returned to >10 000 women; 3.6% had ≥25% lifetime risk and 0.9% carried pathogenic variants. Pipeline-generated scores aligned well with manually generated ones. Major barriers, including heterogeneous pedigree formats, missing data, edge-case handling, and evolving model versions, were identified and resolved through mapping rules, imputation, and iterative testing.
Discussion: Cross-platform data harmonization and stakeholder alignment were decisive for success. Borderline-risk communication and model-version drift remain open issues.
Conclusion: Large-scale PRS-integrated breast-cancer risk reporting is feasible but requires robust interoperability standards and iterative governance.
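A minimal, hypothetical sketch of the payload assembly such a pipeline performs before calling the CanRisk API: merging REDCap survey fields, the PRS report, and the monogenic report, while flagging missing clinical fields for imputation. All field names here are illustrative assumptions, not the study's code; the real CanRisk API consumes CanRisk-format pedigree files rather than this dict.

```python
def build_risk_payload(survey, prs, monogenic, pedigree_txt):
    """Merge normalized inputs into one request body (illustrative only).

    survey:       dict of clinical factors from a REDCap export
    prs:          dict with the polygenic risk score report
    monogenic:    dict with the monogenic variant report
    pedigree_txt: family-history pedigree exported from MeTree
    """
    # Fields the survey failed to capture get flagged rather than dropped,
    # mirroring the mapping-and-imputation rules described above.
    missing = [k for k in ("age", "menarche_age") if survey.get(k) is None]
    return {
        "pedigree": pedigree_txt,
        "prs_zscore": prs.get("zscore"),
        "pathogenic_variant": monogenic.get("gene"),  # eg "BRCA1", or None
        "clinical": {k: v for k, v in survey.items() if v is not None},
        "imputed_fields": missing,
    }
```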
Title: Implementing integrated genomic risk assessments for breast cancer: lessons learned from the Electronic Medical Records and Genomics study. JAMIA Open. 2025;8(5):ooaf113. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12552095/pdf/
Pub Date: 2025-10-21 | eCollection Date: 2025-10-01 | DOI: 10.1093/jamiaopen/ooaf119
Alexandre Niset, Ines Melot, Margaux Pireau, Alexandre Englebert, Nathan Scius, Julien Flament, Salim El Hadwe, Mejdeddine Al Barajraji, Henri Thonon, Sami Barrit
Objective: To evaluate predictive diagnostic performance of open- and closed-source large language models (LLMs) in emergency medicine, addressing the urgent need for innovative clinical decision support tools amid rising patient volumes and staffing shortages.
Materials and methods: We generated 2370 AI-driven diagnostic predictions (Top-5 diagnoses from each of 6 model pipelines per patient), using data from 79 real-world emergency department cases collected consecutively during a 24-hour peak influx period at a tertiary care center. Pipelines combined open- and closed-source embedding models (text-embedding-ada-002, MXBAI) with foundational models (GPT-4, Llama3, and Qwen2) grounded via retrieval-augmented generation using emergency medicine textbooks. Models' predictions were assessed against reference diagnoses established by expert consensus.
Results: All pipelines achieved comparable diagnostic match rates (62.03%-72.15%). Diagnostic performance was significantly influenced by case characteristics: match rates were notably higher for specific versus unspecific diagnoses (85.53% vs 31.41%, P < .001) and surgical versus medical cases (79.49% vs 56.25%, P < .001). Open-source models demonstrated markedly superior sourcing capabilities compared to GPT-4-based combinations (P < 1.4e-12), with the MXBAI/Qwen2 pipeline achieving perfect citation verification.
Discussion: Diagnostic accuracy primarily depended on case characteristics rather than the choice of model pipeline, highlighting fundamental AI alignment challenges in clinical reasoning. Low performance in unspecific diagnoses underscores inherent complexities in clinical definitions rather than technological shortcomings alone.
Conclusion: Open-source LLM pipelines provide enhanced sourcing capabilities, crucial for transparent clinical decision-making and interpretability. Further research should expand knowledge bases to include hospital guidelines and regional epidemiology, while exploring on-premises solutions to better align with privacy regulations and clinical integration.
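The headline metric above is a Top-5 match rate: a case counts as a hit when the expert reference diagnosis appears anywhere among a pipeline's five ranked predictions. A small sketch (the function name is ours, not the study's):

```python
def top5_match_rate(predictions, references):
    """predictions: one Top-5 diagnosis list per case; references: gold labels."""
    hits = sum(ref in top5 for top5, ref in zip(predictions, references))
    return hits / len(references)

# The study's 2370 predictions: 79 cases x 6 pipelines x 5 ranked diagnoses.
assert 79 * 6 * 5 == 2370
```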
Title: Grounded large language models for diagnostic prediction in real-world emergency department settings. JAMIA Open. 2025;8(5):ooaf119. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12539180/pdf/
Pub Date: 2025-10-21 | eCollection Date: 2025-10-01 | DOI: 10.1093/jamiaopen/ooaf101
Camille Nebeker, Jean Christophe Bélisle-Pipon, Benjamin X Collins, Ashley Cordes, Kadija Ferryman, Brian J McInnis, Shannon K McWeeney, Laurie L Novak, Susannah Rose, Joseph M Yracheta, Ishan C Williams, Xiaoqian Jiang, Ellen W Clayton, Bradley A Malin
Objective: The Bridge2AI program is establishing rules of practice for creating ethically sourced health data repositories to support the effective use of ML/AI in biomedical and behavioral research. Given the initially undefined nature of ethically sourced data, this work concurrently developed definitions and guidelines alongside repository creation, grounded in a practical, operational framework.
Materials and methods: A Value Sensitive Design (VSD) approach was used to explore ethical tensions across stages of health data repository development. The conceptual investigation drew from supply chain management (SCM) processes to (1) identify actors who would interact with or be affected by the data repository use and outcomes; (2) determine what values to consider (ie, traceability, accountability, security); and (3) analyze and document value trade-offs (ie, balancing risks of harm against improvements in healthcare). This SCM framework provides operational guidance for managing complex, multi-source data flows with embedded bias mitigation strategies.
Results: This conceptual investigation identified the actors, values, and tensions that influence ethical sourcing when creating a health data repository. The SCM steps provide a scaffolding to support ethical sourcing across the pre-model stages of health data repository development. Ethical sourcing includes documenting data provenance, articulating expectations for experts, and practices for ensuring data privacy, equity, and public benefit. Challenges include risks of ethics washing, highlighting the need for transparent, value-driven practices.
Discussion: Integrating VSD with SCM frameworks enables operationalization of ethical values, improving data integrity, mitigating biases, and enhancing trust. This approach highlights how foundational decisions influence repository quality and AI/ML system usability, addressing provenance, traceability, redundancy, and risk management central to ethical data sourcing.
Conclusion: To create authentic, impactful health data repositories that serve public health goals, organizations must prioritize transparency, accountability, and operational frameworks like SCM that comprehensively address the complexities and risks inherent in data stewardship.
Title: Ethical sourcing in the context of health data supply chain management: a value sensitive design approach. JAMIA Open. 2025;8(5):ooaf101. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12539179/pdf/
Pub Date: 2025-10-16 | eCollection Date: 2025-10-01 | DOI: 10.1093/jamiaopen/ooaf118
Larry Ma, Alan V Rincon, Joshua Ide, Katrina O'Hara, Rachel Weinstein, Sebastien Hannay, Lucie Keunen, Vincent Keunen, Ivelina Popova, Sherry Yan
Objective: To develop and evaluate a patient-centric medication module within a personal health record (PHR) app for capturing medication use, focusing on accuracy, usability, and concordance.
Materials and methods: The medication module offered 4 entry methods: picklist, National Drug Code (NDC), free-text, and portal import, with the first 2 leveraging RxNorm and openFDA APIs. Patients from an integrated delivery network (IDN) created medication lists and recorded daily use in the app's diary. Pharmacists evaluated medication accuracy by reviewing patient-uploaded medication images. Usability was measured using the System Usability Scale (SUS). Concordance was assessed by comparing Electronic Health Records (EHR) with diary entries.
Results: Over a 14-day period, 85 patients entered 617 medications, with 533 logged in the diary representing current use. Picklist was the most used entry method. Overall medication entry accuracy was 92% (picklist 97%; NDC 87%; free-text 84%; and portal import 100%). The mean system usability score was 56.5 for the study app (patients) and 80.8 for the medication module (pharmacists). EHR concordance with diary entries was low (25% using the 14-day window; 53% using a 1-year window); most unmatched entries were over-the-counter (OTC) medications.
Discussion: Accurate and complete medication records are essential for the safe and effective use of medications. This patient-centric medication module supported accurate capture of prescription and OTC medications. Gaps in EHR data highlight the need to improve medication record accuracy and reconciliation.
Conclusion: Patient-generated health data can have a central role in creating the "Best Possible Medication History" envisioned by the World Health Organization.
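One concrete detail behind NDC-based entry: 10-digit hyphenated NDCs come in 4-4-2, 5-3-2, and 5-4-1 formats, and are conventionally zero-padded to an 11-digit 5-4-2 form before cross-database matching. A minimal sketch (the helper is ours, not the module's code):

```python
def normalize_ndc(ndc):
    """Pad a 10-digit hyphenated NDC to the 11-digit 5-4-2 form."""
    labeler, product, package = ndc.split("-")
    return labeler.zfill(5) + product.zfill(4) + package.zfill(2)
```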
Title: Development and evaluation of a patient-centric approach for accurate medication capture. JAMIA Open. 2025;8(5):ooaf118. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530325/pdf/
Pub Date: 2025-10-16 | eCollection Date: 2025-10-01 | DOI: 10.1093/jamiaopen/ooaf117
Alison M Stuebe, Randall Blanco, Michael Horvath, Mohammad Golam Kibria, Lauren Kucirka, Karl Shieh, David Page, Metin N Gurcan, William Ed Hammond
Objective: Severe maternal morbidity and mortality are higher in the United States than in other high-income countries, and unacceptable disparities persist. To facilitate research on these outcomes, we developed a standardized approach for extracting perinatal data from electronic health records (EHRs).
Materials and methods: To support data model building and validation, we harmonized perinatal EHR data in a common data model, building on lessons learned from multiple prior projects.
Results: We developed an Entity Relationship Diagram (ERD) that aggregates perinatal EHR data at appropriate granularities (ie, mothers, infants, encounters) with indexing of observations to gestational age and time from delivery. We then developed a standard approach to extract, transform, and load pregnancy-related observations from EHRs for inclusion in PCORnet® Common Data Model tables.
Discussion: Our ERD can facilitate cross-institutional research to identify populations at risk and prompt interventions to improve perinatal outcomes.
Conclusion: A structured approach can accelerate the use of EHR data for perinatal research.
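Indexing observations to gestational age, as the ERD does, reduces to date arithmetic once gestational age at delivery is known; a hedged sketch (ours, not the project's ETL code):

```python
from datetime import date

def ga_days_at(observation_date, delivery_date, ga_at_delivery_days):
    """Gestational age (in days) at the time of a prenatal observation."""
    return ga_at_delivery_days - (delivery_date - observation_date).days
```

Observations after delivery would instead be indexed as time from delivery, the ERD's other axis.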
Title: Development and implementation of an entity relationship diagram for perinatal data. JAMIA Open. 2025;8(5):ooaf117. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530323/pdf/
Pub Date: 2025-10-16 | eCollection Date: 2025-10-01 | DOI: 10.1093/jamiaopen/ooaf122
Kaden McKeen, Sameer Masood, Augustin Toma, Barry Rubin, Bo Wang
Objectives: To develop ECG-FM, an open-weight foundation model for electrocardiogram (ECG) analysis, rigorously evaluate its performance on clinically salient tasks, and openly release it alongside a public benchmark.
Materials and methods: In a study using 1.5 million 12-lead ECGs, we present ECG-FM, a transformer-based foundation model pretrained with hybrid self-supervision that combines masked reconstruction and contrastive learning with ECG-specific augmentation. Downstream, we evaluate multi-label ECG interpretation and prediction of reduced left ventricular ejection fraction (LVEF), introducing an openly available benchmark on the MIMIC-IV-ECG dataset. We assess ECG-FM's capabilities through data scaling experiments, latent-space structure analysis, and attention-based saliency.
Results: Finetuned ECG-FM models outperform task-specific baselines in the small-to-medium-scale data regime, exhibit strong label efficiency and cross-dataset generalizability, and achieve high AUROC on salient labels, including atrial fibrillation (0.996) and LVEF ≤40% (0.929). The pretrained encoder showcases competitive linear probing performance, with functionally discriminative embeddings.
Discussion: Findings indicate that ECG-FM is generalizable, label-efficient, and discriminative for screening, risk stratification, and monitoring. Its representations capture low-level morphology and high-order cardiac semantics, and the pretrained encoder serves as a robust feature-set generator. This work mitigates reliance on large labeled datasets, reduces compute and data requirements, and lowers barriers to reproducibility and cross-study comparison.
Conclusion: ECG-FM is an open, rigorously validated ECG foundation model intended to accelerate transparent, comparable research in the ECG analysis subfield. It is designed for rapid integration and evaluation, especially for delivering practical gains in low-label settings. We release our code, model weights, tutorials, and benchmark at https://github.com/bowang-lab/ECG-FM/.
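For reference, the AUROC figures quoted above are rank statistics (the probability that a positive case outscores a negative one); a compact, dependency-free sketch with ties counted as half-wins:

```python
def auroc(labels, scores):
    """Pairwise AUROC over binary labels (1 = positive, 0 = negative)."""
    pos = [s for lab, s in zip(labels, scores) if lab == 1]
    neg = [s for lab, s in zip(labels, scores) if lab == 0]
    # Count positive-vs-negative comparisons the positive wins (ties = 0.5).
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```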
Title: ECG-FM: an open electrocardiogram foundation model. JAMIA Open. 2025;8(5):ooaf122. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530324/pdf/
Pub Date: 2025-10-16 | DOI: 10.1093/jamiaopen/ooaf120
Cristian Estupiñán-Ojeda, Raúl J Sandomingo-Freire, Lluís Padró, Jordi Turmo
Objectives: Joint recognition and ICD-10 linking of diagnoses in bilingual, non-standard Spanish and Catalan primary care notes is challenging. We evaluate parameter-efficient fine-tuning (PEFT) techniques as a resource-conscious alternative to full fine-tuning (FFT) for multi-label clinical text classification.
Materials and methods: On a corpus of 21 812 Catalan and Spanish clinical notes from Catalonia, we compared the PEFT techniques LoRA, DoRA, LoHA, LoKR, and QLoRA applied to multilingual transformers (BERT, RoBERTa, DistilBERT, and mDeBERTa).
Results: FFT delivered the best strict Micro-F1 (63.0), but BERT-QLoRA scored 62.2, only 0.8 points lower, while reducing trainable parameters by 67.5% and memory by 33.7%. Training on combined bilingual data consistently improved generalization across individual languages.
Discussion: The small FFT margin was confined to rare labels, indicating limited benefit from updating all parameters. Among PEFT techniques, QLoRA offered the strongest accuracy-efficiency balance; LoRA and DoRA were competitive, whereas LoHA and LoKR incurred larger losses. Adapter rank mattered: ranks below 128 sharply degraded Micro-F1. The substantial memory savings enable deployment on commodity GPUs while delivering performance very close to FFT.
Conclusion: PEFT, particularly QLoRA, supports accurate and memory-efficient joint entity recognition and ICD-10 linking in multilingual, low-resource clinical settings.
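The savings behind these results are easy to reason about: a rank-r LoRA adapter trains r·(d_in + d_out) parameters per weight matrix instead of the full d_in·d_out. A sketch of the per-matrix arithmetic (note the abstract's 67.5% figure is model-wide, where embeddings and untouched layers dilute the saving):

```python
def lora_savings(d_in, d_out, r):
    """Adapter parameter count and fractional reduction vs full fine-tuning
    of a single d_in x d_out weight matrix."""
    full = d_in * d_out
    adapter = r * (d_in + d_out)  # A: d_in x r, plus B: r x d_out
    return adapter, 1 - adapter / full
```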
Title: High-fidelity parameter-efficient fine-tuning for joint recognition and linking of diagnoses to ICD-10 in non-standard primary care notes. JAMIA Open. 2025;8(5):ooaf120. Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12530322/pdf/
Pub Date : 2025-10-08eCollection Date: 2025-10-01DOI: 10.1093/jamiaopen/ooaf104
Asiful Arefeen, Simar Singh, Crystal Razavi, Hassan Ghasemzadeh, Sandesh Dev
Objectives: Despite the rapid development of AI in clinical medicine, reproducibility and methodological limitations hinder its clinical utility. In response, MINimum Information for Medical AI Reporting (MINIMAR) standards were introduced to enhance publication standards and reduce bias, but their application remains unexplored. In this review, we sought to assess the quality of reporting in AI/ML studies of cardiac amyloidosis (CA), an increasingly important cause of heart failure.
Materials and methods: Using PRISMA-ScR guidelines, we performed a scoping review of English-language articles published through May 2023 that applied AI/ML techniques to diagnose or predict CA. Non-CA studies and those with selective feature sets were excluded. Two researchers independently screened and extracted data. In all, 20 studies met criteria and were assessed for adherence to MINIMAR standards.
Results: The studies showed variable compliance with MINIMAR. Most reported participant age (90%) and gender (85%), but only 25% included ethnic or racial data, and none provided socioeconomic details. The majority (95%) developed diagnostic models, yet only 85% clearly described training features, and 20% addressed missing data. Model evaluation revealed gaps; 80% reported internal validation, but only 20% conducted external validation.
Discussion and conclusion: This study, one of the first to apply MINIMAR criteria to ML research in CA, reveals significant variability and deficiencies in reporting, particularly in patient demographics, model architecture, and evaluation. These findings underscore the need for stricter adherence to standardized reporting guidelines to enhance the reliability, generalizability, and clinical applicability of ML/AI models in CA.
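The per-item adherence percentages above come down to scoring each study against the checklist and tallying. A minimal sketch with hypothetical extraction records (not the review's actual data):

```python
from collections import Counter

# Hypothetical extraction records: the set of satisfied MINIMAR items per study.
studies = [
    {"age", "gender", "training_features"},
    {"age", "gender", "race_ethnicity", "external_validation"},
    {"age", "training_features", "missing_data"},
    {"age", "gender"},
]

counts = Counter(item for items in studies for item in items)
n = len(studies)
adherence = {item: 100 * c / n for item, c in counts.items()}
print(adherence["age"])  # all four hypothetical studies report age -> 100.0
```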
{"title":"Assessing the quality of reporting in artificial intelligence/machine learning research for cardiac amyloidosis.","authors":"Asiful Arefeen, Simar Singh, Crystal Razavi, Hassan Ghasemzadeh, Sandesh Dev","doi":"10.1093/jamiaopen/ooaf104","DOIUrl":"10.1093/jamiaopen/ooaf104","url":null,"abstract":"<p><strong>Objectives: </strong>Despite the rapid development of AI in clinical medicine, reproducibility and methodological limitations hinder its clinical utility. In response, MINimum Information for Medical AI Reporting (MINIMAR) standards were introduced to enhance publication standards and reduce bias, but their application remains unexplored. In this review, we sought to assesses the quality of reporting in AI/ML studies of cardiac amyloidosis (CA) an increasingly important cause of heart failure.</p><p><strong>Materials and methods: </strong>Using PRISMA-ScR guidelines, we performed a scoping review of English-language articles published through May 2023 which applied AI/ML techniques to diagnose or predict CA. Non-CA studies and those with selective feature sets were excluded. Two researchers independently screened and extracted data. In all, 20 studies met criteria and were assessed for adherence to MINIMAR standards.</p><p><strong>Results: </strong>The studies showed variable compliance with MINIMAR. Most reported participant age (90%) and gender (85%), but only 25% included ethnic or racial data, and none provided socioeconomic details. The majority (95%) developed diagnostic models, yet only 85% clearly described training features, and 20% addressed missing data. Model evaluation revealed gaps; 80% reported internal validation, but only 20% conducted external validation.</p><p><strong>Discussion and conclusion: </strong>This study, one of the first to apply MINIMAR criteria to ML research in CA, reveals significant variability and deficiencies in reporting, particularly in patient demographics, model architecture, and evaluation. 
These findings underscore the need for stricter adherence to standardized reporting guidelines to enhance the reliability, generalizability, and clinical applicability of ML/AI models in CA.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf104"},"PeriodicalIF":3.4,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12507469/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145259548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-03eCollection Date: 2025-10-01DOI: 10.1093/jamiaopen/ooaf116
Peter J Hoover, Terri L Blumke, Anna D Ware, Malvika Pillai, Zachary P Veigulis, Catherine M Curtin, Thomas F Osborne
Objective: To develop a more accurate fall prediction model within the Veterans Health Administration.
Materials and methods: The cohort included Veterans admitted to a Veterans Health Administration acute care setting from July 1, 2020, to June 30, 2022, with a length of stay between 1 and 7 days. Demographic and clinical data were obtained through electronic health records. Veterans were identified as having a documented fall through clinical progress notes. A transformer model was used to derive features from these data, which were then used to train a Light Gradient-Boosting Machine for classification and prediction. Area under the precision-recall curve assisted in model tuning, with geometric mean used to define an optimal classification threshold.
Results: Among 242,844 Veterans assessed, 5965 (2.5%) were documented as having a fall during their clinical stay. Employing a transformer model with a Light Gradient-Boosting Machine resulted in an area under the curve of .851 and an area under the precision-recall curve of .285. With an accuracy of 76.3%, the model resulted in a specificity of 76.2% and a sensitivity of 77.3%.
Discussion: Prior evaluations have highlighted limitations of the Morse Fall Scale (MFS) in accurately assessing fall risk. Our time series classification model, developed using existing electronic health record data, outperformed traditional MFS-based evaluations and other fall-risk models. Future work is necessary to address limitations, including class imbalance and the need for prospective validation.
Conclusion: An improvement over the MFS, this model, automatically calculated from existing data, can provide a more efficient and accurate means for identifying patients at risk of fall.
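The threshold-selection step described in the methods — choosing the operating point that maximizes the geometric mean of sensitivity and specificity along the ROC curve — can be sketched as follows. This uses scikit-learn's gradient boosting as a stand-in for the Light Gradient-Boosting Machine and synthetic imbalanced data, not the VHA cohort:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_curve
from sklearn.model_selection import train_test_split

# Synthetic imbalanced cohort (~2.5% positives, mirroring the fall rate).
X, y = make_classification(n_samples=4000, weights=[0.975], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

clf = GradientBoostingClassifier(random_state=0).fit(X_tr, y_tr)
probs = clf.predict_proba(X_te)[:, 1]

# Geometric mean of sensitivity (TPR) and specificity (1 - FPR) at each
# candidate threshold; the maximizer becomes the classification cutoff.
fpr, tpr, thresholds = roc_curve(y_te, probs)
gmeans = np.sqrt(tpr * (1 - fpr))
best = thresholds[np.argmax(gmeans)]
preds = (probs >= best).astype(int)
```

With heavy class imbalance, the geometric-mean criterion avoids the degenerate majority-class threshold that raw accuracy would favor.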
{"title":"Predicting falls using electronic health records: a time series approach.","authors":"Peter J Hoover, Terri L Blumke, Anna D Ware, Malvika Pillai, Zachary P Veigulis, Catherine M Curtin, Thomas F Osborne","doi":"10.1093/jamiaopen/ooaf116","DOIUrl":"10.1093/jamiaopen/ooaf116","url":null,"abstract":"<p><strong>Objective: </strong>To develop a more accurate fall prediction model within the Veterans Health Administration.</p><p><strong>Materials and methods: </strong>The cohort included Veterans admitted to a Veterans Health Administration acute care setting from July 1, 2020, to June 30, 2022, with a length of stay between 1 and 7 days. Demographic and clinical data were obtained through electronic health records. Veterans were identified as having a documented fall through clinical progress notes. A transformer model was used to obtain features of this data, which was then used to train a Light Gradient-Boosting Machine for classification and prediction. Area under the precision-recall curve assisted in model tuning, with geometric mean used to define an optimal classification threshold.</p><p><strong>Results: </strong>Among 242,844 Veterans assessed, 5965 (2.5%) were documented as having a fall during their clinical stay. Employing a transformer model with a Light Gradient-Boosting Machine resulted in an area under the curve of .851 and an area under the precision-recall curve of .285. With an accuracy of 76.3%, the model resulted in a specificity of 76.2% and a sensitivity of 77.3%.</p><p><strong>Discussion: </strong>Prior evaluations have highlighted limitations of the Morse Fall Scale (MFS) in accurately assessing fall risk. Developing a time series classification model using existing electronic health record data, our model outperformed traditional MFS-based evaluations and other fall-risk models. 
Future work is necessary to address limitations, including class imbalance and the need for prospective validation.</p><p><strong>Conclusion: </strong>An improvement over the MFS, this model, automatically calculated from existing data, can provide a more efficient and accurate means for identifying patients at risk of fall.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf116"},"PeriodicalIF":3.4,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12492486/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145233301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-03eCollection Date: 2025-10-01DOI: 10.1093/jamiaopen/ooaf112
Amanda Massmann, Natasha J Petry, Max Weaver, Halle Brady, Roxana A Lupu
Objectives: This study evaluates response rates for pharmacogenomics (PGx)-guided nonsteroidal anti-inflammatory drug (NSAID) clinical decision support (CDS) alerts at Sanford Health from May 2020 to December 2024.
Materials and methods: A retrospective analysis was conducted on PGx NSAID interruptive alerts. Response options were classified into five categories: (1) continuation of the triggering NSAID order, (2) dose modification, (3) alternative NSAID ordered without PGx implications, (4) alternative analgesic (ie, opioid) ordered, and (5) discontinuation of the NSAID without alternative therapy.
Results: The study analyzed 2361 alert instances from 978 patients. The most common response was discontinuing NSAID without alternative therapy (43%). Dose modifications and orders for alternative analgesics comprised 2.57% and 14.67% of responses, respectively. The initial acceptance rate was 62.6%. Prior NSAID use significantly impacted override rates (60% vs 40%, P < .001). A 409-day breaking point was observed to affect alert acceptance rates, with the highest acceptance in NSAID naïve patients (96.1%).
Discussion: PGx NSAIDs CDS alert acceptance rates were higher compared to general CDS acceptance rates. This study highlights opportunities for continuous improvement including optimizing alert modality, modifying alert criteria to include look-back periods, and implementing genetically adapted ordersets.
Conclusion: The initial acceptance rate of PGx NSAID CDS alerts was 62.6%, with significantly higher acceptance among NSAID-naïve patients (62.6% vs 96.1%, P < .001). Integration of CDS is vital to the successful implementation of PGx in clinical practice.
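The five response categories map directly onto an acceptance computation, assuming here (our reading, not stated explicitly in the abstract) that only continuation of the triggering order counts as an override and the other four categories count as acceptance. A minimal sketch with a hypothetical alert-response log, not Sanford's data:

```python
from collections import Counter

CATEGORIES = {
    1: "continue triggering NSAID order",          # treated as override here
    2: "dose modification",
    3: "alternative NSAID without PGx implications",
    4: "alternative analgesic (eg, opioid)",
    5: "discontinue NSAID, no alternative therapy",
}

# Hypothetical alert-response log: one category code per alert instance.
responses = [1, 5, 5, 2, 4, 1, 5, 3, 5, 4]

counts = Counter(responses)
n = len(responses)
acceptance_rate = 100 * (n - counts[1]) / n
print(f"acceptance: {acceptance_rate:.1f}%")  # 80.0% for this toy log
```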
{"title":"Analyzing the impact of pharmacogenomics-guided nonsteroidal anti-inflammatory drug alerts in clinical practice.","authors":"Amanda Massmann, Natasha J Petry, Max Weaver, Halle Brady, Roxana A Lupu","doi":"10.1093/jamiaopen/ooaf112","DOIUrl":"10.1093/jamiaopen/ooaf112","url":null,"abstract":"<p><strong>Objectives: </strong>This study evaluates response rates of pharmacogenomics (PGx) nonsteroidal anti-inflammatory drugs (NSAIDs) clinical decision support (CDS) alerts at Sanford Health from May 2020 to December 2024.</p><p><strong>Materials and methods: </strong>A retrospective analysis was conducted on PGx NSAIDs interruptive alerts. Response options were classified into five categories (1) continuation of triggering NSAID order, (2) dose modification, (3) alternative NSAID ordered without PGx implications, (4) alternative analgesic (ie, opioid) ordered, and (5) discontinuation of NSAID without alternative therapy.</p><p><strong>Results: </strong>The study analyzed 2361 alert instances from 978 patients. The most common response was discontinuing NSAID without alternative therapy (43%). Dose modifications and orders for alternative analgesics comprised 2.57% and 14.67% of responses, respectively. The initial acceptance rate was 62.6%. Prior NSAID use significantly impacted override rates (60% vs 40%, <i>P</i> < .001). A 409-day breaking point was observed to affect alert acceptance rates, with the highest acceptance in NSAID naïve patients (96.1%).</p><p><strong>Discussion: </strong>PGx NSAIDs CDS alert acceptance rates were higher compared to general CDS acceptance rates. 
This study highlights opportunities for continuous improvement including optimizing alert modality, modifying alert criteria to include look-back periods, and implementing genetically adapted ordersets.</p><p><strong>Conclusion: </strong>The initial acceptance rate of PGx NSAIDs CDS alerts was 62.6%, however, with significantly higher acceptance rates in NSAID naïve patients (62.6% vs 96.1%, <i>P</i> < .001). Integration of CDS is vital to the successful implementation of PGx in clinical practice.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf112"},"PeriodicalIF":3.4,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12492481/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145233287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}