首页 > 最新文献

PLOS digital health最新文献

英文 中文
Differential behaviour of a risk score for emergency hospital admission by demographics in Scotland-A retrospective study. 苏格兰人口统计学对急诊住院风险评分的差异行为——回顾性研究
Pub Date : 2024-12-17 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000675
Ioanna Thoma, Simon Rogers, Jillian Ireland, Rachel Porteous, Katie Borland, Catalina A Vallejos, Louis J M Aslett, James Liley

The Scottish Patients at Risk of Re-Admission and Admission (SPARRA) score predicts individual risk of emergency hospital admission for approximately 80% of the Scottish population. It was developed using routinely collected electronic health records, and is used by primary care practitioners to inform anticipatory care, particularly for individuals with high healthcare needs. We comprehensively assess the SPARRA score across population subgroups defined by age, sex, ethnicity, socioeconomic deprivation, and geographic location. For these subgroups, we consider differences in overall performance, score distribution, and false positive and negative rates, using causal methods to identify effects mediated through age, sex, and deprivation. We show that the score is well-calibrated across subgroups, but that rates of false positives and negatives vary widely, mediated by various causes including variability in demographic characteristics, admission reasons, and potentially differential data availability. Our work assists practitioners in the application and interpretation of the SPARRA score in population subgroups.

苏格兰患者再次入院和入院风险(SPARRA)评分预测了大约80%的苏格兰人口急诊入院的个人风险。它是利用常规收集的电子健康记录开发的,初级保健从业人员使用它为预期保健提供信息,特别是为有高保健需求的个人提供信息。我们综合评估了按年龄、性别、种族、社会经济剥夺和地理位置定义的人群亚组的SPARRA评分。对于这些亚组,我们考虑了总体表现、得分分布、假阳性和阴性率的差异,并使用因果方法来确定由年龄、性别和剥夺介导的影响。我们的研究表明,亚组间的评分得到了很好的校准,但假阳性和阴性的比率差异很大,这是由各种原因造成的,包括人口统计学特征的变化、入院原因和潜在的数据可用性差异。我们的工作有助于从业者在人口亚组中应用和解释SPARRA评分。
{"title":"Differential behaviour of a risk score for emergency hospital admission by demographics in Scotland-A retrospective study.","authors":"Ioanna Thoma, Simon Rogers, Jillian Ireland, Rachel Porteous, Katie Borland, Catalina A Vallejos, Louis J M Aslett, James Liley","doi":"10.1371/journal.pdig.0000675","DOIUrl":"10.1371/journal.pdig.0000675","url":null,"abstract":"<p><p>The Scottish Patients at Risk of Re-Admission and Admission (SPARRA) score predicts individual risk of emergency hospital admission for approximately 80% of the Scottish population. It was developed using routinely collected electronic health records, and is used by primary care practitioners to inform anticipatory care, particularly for individuals with high healthcare needs. We comprehensively assess the SPARRA score across population subgroups defined by age, sex, ethnicity, socioeconomic deprivation, and geographic location. For these subgroups, we consider differences in overall performance, score distribution, and false positive and negative rates, using causal methods to identify effects mediated through age, sex, and deprivation. We show that the score is well-calibrated across subgroups, but that rates of false positives and negatives vary widely, mediated by various causes including variability in demographic characteristics, admission reasons, and potentially differential data availability. Our work assists practitioners in the application and interpretation of the SPARRA score in population subgroups.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000675"},"PeriodicalIF":0.0,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11651550/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142848384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neonatal apnea and hypopnea prediction in infants with Robin sequence with neural additive models for time series. 利用时间序列神经加法模型预测罗宾序列婴儿的新生儿呼吸暂停和呼吸减弱。
Pub Date : 2024-12-13 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000678
Julius Vetter, Kathleen Lim, Tjeerd M H Dijkstra, Peter A Dargaville, Oliver Kohlbacher, Jakob H Macke, Christian F Poets

Neonatal apneas and hypopneas present a serious risk for healthy infant development. Treating these adverse events requires frequent manual stimulation by skilled personnel, which can lead to alarm fatigue. This study aims to develop and validate an interpretable model that can predict apneas and hypopneas. Automatically predicting these adverse events before they occur would enable the use of methods for automatic intervention. We propose a neural additive model to predict individual occurrences of neonatal apnea and hypopnea and apply it to a physiological dataset from infants with Robin sequence at risk of upper airway obstruction. The dataset will be made publicly available together with this study. Our proposed model allows the prediction of individual apneas and hypopneas, achieving an average AuROC of 0.80 when discriminating segments of polysomnography recordings starting 15 seconds before the onset of apneas and hypopneas from control segments. Its additive nature makes the model inherently interpretable, which allowed insights into how important a given signal modality is for prediction and which patterns in the signal are discriminative. For our problem of predicting apneas and hypopneas in infants with Robin sequence, prior irregularities in breathing-related modalities as well as decreases in SpO2 levels were especially discriminative. Our prediction model presents a step towards an automatic prediction of neonatal apneas and hypopneas in infants at risk for upper airway obstruction. Together with the publicly released dataset, it has the potential to facilitate the development and application of methods for automatic intervention in clinical practice.

新生儿呼吸暂停和呼吸不足是影响婴儿健康发育的重要因素。治疗这些不良事件需要熟练人员频繁的手动刺激,这可能导致报警疲劳。本研究旨在建立和验证一个可解释的模型,可以预测呼吸暂停和呼吸不足。在这些不良事件发生之前自动预测它们将使使用自动干预的方法成为可能。我们提出了一个神经相加模型来预测新生儿呼吸暂停和低通气的个体发生率,并将其应用于有上气道阻塞风险的Robin序列婴儿的生理数据集。该数据集将与本研究一起公开。我们提出的模型可以预测个体呼吸暂停和呼吸不足,在区分呼吸暂停和呼吸不足发作前15秒开始的多导睡眠图记录片段与对照片段时,平均AuROC为0.80。它的可加性使模型具有固有的可解释性,从而可以深入了解给定信号模态对预测的重要性以及信号中的哪些模式是判别性的。对于我们预测罗宾序列婴儿呼吸暂停和呼吸不足的问题,先前呼吸相关模式的不规则性以及SpO2水平的降低尤其具有歧视性。我们的预测模型向自动预测新生儿呼吸暂停和呼吸不足的婴儿上气道阻塞的风险迈出了一步。与公开发布的数据集一起,它有可能促进临床实践中自动干预方法的开发和应用。
{"title":"Neonatal apnea and hypopnea prediction in infants with Robin sequence with neural additive models for time series.","authors":"Julius Vetter, Kathleen Lim, Tjeerd M H Dijkstra, Peter A Dargaville, Oliver Kohlbacher, Jakob H Macke, Christian F Poets","doi":"10.1371/journal.pdig.0000678","DOIUrl":"10.1371/journal.pdig.0000678","url":null,"abstract":"<p><p>Neonatal apneas and hypopneas present a serious risk for healthy infant development. Treating these adverse events requires frequent manual stimulation by skilled personnel, which can lead to alarm fatigue. This study aims to develop and validate an interpretable model that can predict apneas and hypopneas. Automatically predicting these adverse events before they occur would enable the use of methods for automatic intervention. We propose a neural additive model to predict individual occurrences of neonatal apnea and hypopnea and apply it to a physiological dataset from infants with Robin sequence at risk of upper airway obstruction. The dataset will be made publicly available together with this study. Our proposed model allows the prediction of individual apneas and hypopneas, achieving an average AuROC of 0.80 when discriminating segments of polysomnography recordings starting 15 seconds before the onset of apneas and hypopneas from control segments. Its additive nature makes the model inherently interpretable, which allowed insights into how important a given signal modality is for prediction and which patterns in the signal are discriminative. For our problem of predicting apneas and hypopneas in infants with Robin sequence, prior irregularities in breathing-related modalities as well as decreases in SpO2 levels were especially discriminative. Our prediction model presents a step towards an automatic prediction of neonatal apneas and hypopneas in infants at risk for upper airway obstruction. Together with the publicly released dataset, it has the potential to facilitate the development and application of methods for automatic intervention in clinical practice.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000678"},"PeriodicalIF":0.0,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11642933/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142822814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of periodontitis stage and grade using natural language processing techniques. 利用自然语言处理技术对牙周炎阶段和等级进行分类。
Pub Date : 2024-12-13 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000692
Nazila Ameli, Tahereh Firoozi, Monica Gibson, Hollis Lai

Periodontitis is a complex and microbiome-related inflammatory condition impacting dental supporting tissues. Emphasizing the potential of Clinical Decision Support Systems (CDSS), this study aims to facilitate early diagnosis of periodontitis by extracting patients' information collected as dental charts and notes. We developed a CDSS to predict the stage and grade of periodontitis using natural language processing (NLP) techniques including bidirectional encoder representation for transformers (BERT). We compared the performance of BERT with that of a baseline feature-engineered model. A secondary data analysis was conducted using 309 anonymized patient periodontal charts and corresponding clinician's notes obtained from the university periodontal clinic. After data preprocessing, we added a classification layer on top of the pre-trained BERT model to classify the clinical notes into their corresponding stage and grades. Then, we fine-tuned the pre-trained BERT model on 70% of our data. The performance of the model was evaluated on 32 unseen new patients' clinical notes. The results were compared with the output of a baseline feature-engineered algorithm coupled with MLP techniques to classify the stage and grade of periodontitis. Our proposed BERT model predicted the patients' stage and grade with 77% and 75% accuracy, respectively. MLP model showed that the accuracy of correct classification of stage and grade of the periodontitis on a set of 32 new unseen data was 59.4% and 62.5%, respectively. The BERT model could predict the periodontitis stage and grade on the same new dataset with higher accuracy (66% and 72%, respectively). The utilization of BERT in this context represents a groundbreaking application in dentistry, particularly in CDSS. Our BERT model outperformed baseline models, even with reduced information, promising efficient review of patient notes. This integration of advanced NLP techniques with CDSS frameworks holds potential for timely interventions, preventing complications and reducing healthcare costs.

牙周炎是一种影响牙支撑组织的复杂的微生物相关炎症。强调临床决策支持系统(CDSS)的潜力,本研究旨在通过提取牙科图表和笔记收集的患者信息来促进牙周炎的早期诊断。我们开发了一个CDSS来预测牙周炎的阶段和等级,使用自然语言处理(NLP)技术,包括双向编码器表示转换器(BERT)。我们比较了BERT与基线特征工程模型的性能。对309名匿名患者牙周病表和相应的临床医生记录进行二次数据分析。数据预处理后,我们在预训练的BERT模型上增加一个分类层,将临床笔记分类到相应的阶段和等级。然后,我们在70%的数据上微调预训练的BERT模型。该模型的性能评估了32个看不见的新患者的临床记录。将结果与基线特征工程算法结合MLP技术的输出进行比较,以分类牙周炎的阶段和等级。我们提出的BERT模型预测患者的分期和分级的准确率分别为77%和75%。MLP模型显示,对32组新的未见数据进行牙周炎分期和分级的正确分类准确率分别为59.4%和62.5%。BERT模型可以在相同的新数据集上预测牙周炎的分期和分级,准确率更高(分别为66%和72%)。在这种情况下,BERT的使用代表了牙科,特别是CDSS的开创性应用。我们的BERT模型优于基线模型,即使在减少信息的情况下,也有望有效地审查患者的记录。这种先进的NLP技术与CDSS框架的整合具有及时干预、预防并发症和降低医疗成本的潜力。
{"title":"Classification of periodontitis stage and grade using natural language processing techniques.","authors":"Nazila Ameli, Tahereh Firoozi, Monica Gibson, Hollis Lai","doi":"10.1371/journal.pdig.0000692","DOIUrl":"10.1371/journal.pdig.0000692","url":null,"abstract":"<p><p>Periodontitis is a complex and microbiome-related inflammatory condition impacting dental supporting tissues. Emphasizing the potential of Clinical Decision Support Systems (CDSS), this study aims to facilitate early diagnosis of periodontitis by extracting patients' information collected as dental charts and notes. We developed a CDSS to predict the stage and grade of periodontitis using natural language processing (NLP) techniques including bidirectional encoder representation for transformers (BERT). We compared the performance of BERT with that of a baseline feature-engineered model. A secondary data analysis was conducted using 309 anonymized patient periodontal charts and corresponding clinician's notes obtained from the university periodontal clinic. After data preprocessing, we added a classification layer on top of the pre-trained BERT model to classify the clinical notes into their corresponding stage and grades. Then, we fine-tuned the pre-trained BERT model on 70% of our data. The performance of the model was evaluated on 32 unseen new patients' clinical notes. The results were compared with the output of a baseline feature-engineered algorithm coupled with MLP techniques to classify the stage and grade of periodontitis. Our proposed BERT model predicted the patients' stage and grade with 77% and 75% accuracy, respectively. MLP model showed that the accuracy of correct classification of stage and grade of the periodontitis on a set of 32 new unseen data was 59.4% and 62.5%, respectively. The BERT model could predict the periodontitis stage and grade on the same new dataset with higher accuracy (66% and 72%, respectively). The utilization of BERT in this context represents a groundbreaking application in dentistry, particularly in CDSS. Our BERT model outperformed baseline models, even with reduced information, promising efficient review of patient notes. This integration of advanced NLP techniques with CDSS frameworks holds potential for timely interventions, preventing complications and reducing healthcare costs.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000692"},"PeriodicalIF":0.0,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11642968/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142822807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adoption of telepharmacy among pharmacists, physicians, and nurses at Hawassa City Public Hospitals, Ethiopia. 埃塞俄比亚阿瓦萨市公立医院的药剂师、医生和护士采用远程药房。
Pub Date : 2024-12-11 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000693
Jenberu Mekurianew Kelkay, Henok Dessie Wubneh, Henok Molla Beri, Abel Melaku Tefera, Rediet Abebe Molla, Addisu Alem Negatu

Pharmaceutical care in the majority of developing countries is hindered by a lack of techniques, limitations in mobility, and a shortage of staff to provide patient care. However, there is no evidence that professionals intend to use telepharmacy in patient care. To fill this gap, this study was designed to examine whether pharmacists, physicians, and nursing professionals intend to use telepharamcy in their care practice.A cross-sectional investigation was carried out from November 29 to December 30, 2023. A study was conducted at all Hawassa public hospitals. A total of 592 Pharmacists, Physicians, and nurses participated. Simple random sampling and proportional allocation were utilized. A structured self-administered questionnaire was used, and a 5% pretest was administered. The data were entered into Epi Data 4.6 and exported to SPSS 26. The AMOS 23 SEM was also used to describe and assess the degree and significance of the relationships between variables.51.4% (304/592) (95% CI, 47.2-55.4) of the participants intended to use telepharmacy. Performance expectancy (β = 0.23, p-value <0.05), social influence (β = 0.295, p-value <0.05), and digital literacy (β = 0.309, p-value <0.001) had positive relationships with the intention to use telepharmacy. Age and gender were also moderators of performance expectancy in telepharmacy.Overall, Pharmacists', Physicians', and nurses' intentions to use telepharamcy were found to be promising for the future. Performance expectancy, social influence, and digital literacy had a significantly positive influence on the intention to use telepharamcy. Digital literacy had a more significant prediction power than others. The results could be useful in terms of designing emerging systems and understanding users' computer skills.

在大多数发展中国家,由于缺乏技术、行动受限和缺乏提供病人护理的工作人员,药物保健受到阻碍。然而,没有证据表明专业人员打算在患者护理中使用远程药房。为了填补这一空白,本研究旨在调查药剂师、医生和护理专业人员是否打算在他们的护理实践中使用远程药学。横断面调查于2023年11月29日至12月30日进行。在所有哈瓦萨公立医院进行了一项研究。共有592名药剂师、医生和护士参与了调查。采用简单随机抽样和比例分配。采用结构化的自我管理问卷,并进行5%的预测。数据输入Epi data 4.6,导出到SPSS 26。AMOS 23 SEM也用于描述和评估变量之间关系的程度和显著性。51.4% (304/592)(95% CI, 47.2-55.4)的参与者打算使用远程药房。业绩预期(β = 0.23, p值
{"title":"Adoption of telepharmacy among pharmacists, physicians, and nurses at Hawassa City Public Hospitals, Ethiopia.","authors":"Jenberu Mekurianew Kelkay, Henok Dessie Wubneh, Henok Molla Beri, Abel Melaku Tefera, Rediet Abebe Molla, Addisu Alem Negatu","doi":"10.1371/journal.pdig.0000693","DOIUrl":"10.1371/journal.pdig.0000693","url":null,"abstract":"<p><p>Pharmaceutical care in the majority of developing countries is hindered by a lack of techniques, limitations in mobility, and a shortage of staff to provide patient care. However, there is no evidence that professionals intend to use telepharmacy in patient care. To fill this gap, this study was designed to examine whether pharmacists, physicians, and nursing professionals intend to use telepharamcy in their care practice.A cross-sectional investigation was carried out from November 29 to December 30, 2023. A study was conducted at all Hawassa public hospitals. A total of 592 Pharmacists, Physicians, and nurses participated. Simple random sampling and proportional allocation were utilized. A structured self-administered questionnaire was used, and a 5% pretest was administered. The data were entered into Epi Data 4.6 and exported to SPSS 26. The AMOS 23 SEM was also used to describe and assess the degree and significance of the relationships between variables.51.4% (304/592) (95% CI, 47.2-55.4) of the participants intended to use telepharmacy. Performance expectancy (β = 0.23, p-value <0.05), social influence (β = 0.295, p-value <0.05), and digital literacy (β = 0.309, p-value <0.001) had positive relationships with the intention to use telepharmacy. Age and gender were also moderators of performance expectancy in telepharmacy.Overall, Pharmacists', Physicians', and nurses' intentions to use telepharamcy were found to be promising for the future. Performance expectancy, social influence, and digital literacy had a significantly positive influence on the intention to use telepharamcy. Digital literacy had a more significant prediction power than others. The results could be useful in terms of designing emerging systems and understanding users' computer skills.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000693"},"PeriodicalIF":0.0,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11633999/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Large Language Models in extracting cognitive exam dates and scores. 评估大型语言模型在提取认知考试日期和分数中的应用。
Pub Date : 2024-12-11 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000685
Hao Zhang, Neil Jethani, Simon Jones, Nicholas Genes, Vincent J Major, Ian S Jaffe, Anthony B Cardillo, Noah Heilenbach, Nadia Fazal Ali, Luke J Bonanni, Andrew J Clayburn, Zain Khera, Erica C Sadler, Jaideep Prasad, Jamie Schlacter, Kevin Liu, Benjamin Silva, Sophie Montgomery, Eric J Kim, Jacob Lester, Theodore M Hill, Alba Avoricani, Ethan Chervonski, James Davydov, William Small, Eesha Chakravartty, Himanshu Grover, John A Dodson, Abraham A Brody, Yindalon Aphinyanaphongs, Arjun Masurkar, Narges Razavian

Ensuring reliability of Large Language Models (LLMs) in clinical tasks is crucial. Our study assesses two state-of-the-art LLMs (ChatGPT and LlaMA-2) for extracting clinical information, focusing on cognitive tests like MMSE and CDR. Our data consisted of 135,307 clinical notes (Jan 12th, 2010 to May 24th, 2023) mentioning MMSE, CDR, or MoCA. After applying inclusion criteria 34,465 notes remained, of which 765 underwent ChatGPT (GPT-4) and LlaMA-2, and 22 experts reviewed the responses. ChatGPT successfully extracted MMSE and CDR instances with dates from 742 notes. We used 20 notes for fine-tuning and training the reviewers. The remaining 722 were assigned to reviewers, with 309 each assigned to two reviewers simultaneously. Inter-rater-agreement (Fleiss' Kappa), precision, recall, true/false negative rates, and accuracy were calculated. Our study follows TRIPOD reporting guidelines for model validation. For MMSE information extraction, ChatGPT (vs. LlaMA-2) achieved accuracy of 83% (vs. 66.4%), sensitivity of 89.7% (vs. 69.9%), true-negative rates of 96% (vs 60.0%), and precision of 82.7% (vs 62.2%). For CDR the results were lower overall, with accuracy of 87.1% (vs. 74.5%), sensitivity of 84.3% (vs. 39.7%), true-negative rates of 99.8% (98.4%), and precision of 48.3% (vs. 16.1%). We qualitatively evaluated the MMSE errors of ChatGPT and LlaMA-2 on double-reviewed notes. LlaMA-2 errors included 27 cases of total hallucination, 19 cases of reporting other scores instead of MMSE, 25 missed scores, and 23 cases of reporting only the wrong date. In comparison, ChatGPT's errors included only 3 cases of total hallucination, 17 cases of wrong test reported instead of MMSE, and 19 cases of reporting a wrong date. In this diagnostic/prognostic study of ChatGPT and LlaMA-2 for extracting cognitive exam dates and scores from clinical notes, ChatGPT exhibited high accuracy, with better performance compared to LlaMA-2. The use of LLMs could benefit dementia research and clinical care, by identifying eligible patients for treatments initialization or clinical trial enrollments. Rigorous evaluation of LLMs is crucial to understanding their capabilities and limitations.

确保大型语言模型(llm)在临床任务中的可靠性至关重要。我们的研究评估了两个最先进的llm (ChatGPT和LlaMA-2)用于提取临床信息,重点是认知测试,如MMSE和CDR。我们的数据包括135,307份涉及MMSE、CDR或MoCA的临床记录(2010年1月12日至2023年5月24日)。在应用纳入标准后,剩下34,465条注释,其中765条进行了ChatGPT (GPT-4)和LlaMA-2, 22位专家对回复进行了审查。ChatGPT成功地从742条注释中提取了带有日期的MMSE和CDR实例。我们使用了20个音符进行微调和培训评审人员。其余722份被分配给审稿人,其中309份同时分配给两名审稿人。评估间一致性(Fleiss’Kappa)、准确率、召回率、真/假阴性率和准确率进行计算。我们的研究遵循TRIPOD报告准则进行模型验证。对于MMSE信息提取,ChatGPT (vs. LlaMA-2)的准确率为83% (vs. 66.4%),灵敏度为89.7% (vs. 69.9%),真阴性率为96% (vs. 60.0%),精密度为82.7% (vs. 62.2%)。CDR结果总体上较低,准确率为87.1% (vs. 74.5%),灵敏度为84.3% (vs. 39.7%),真阴性率为99.8% (vs. 98.4%),精密度为48.3% (vs. 16.1%)。我们对ChatGPT和LlaMA-2在复核笔记上的MMSE误差进行了定性评价。LlaMA-2错误包括27例总幻觉,19例报告其他分数而不是MMSE, 25例漏报分数,23例仅报告错误日期。相比之下,ChatGPT的错误仅包括3例总幻觉,17例报告错误的测试而不是MMSE, 19例报告错误的日期。在这项将ChatGPT和LlaMA-2用于从临床记录中提取认知考试日期和分数的诊断/预后研究中,ChatGPT表现出较高的准确性,与LlaMA-2相比表现更好。llm的使用可以通过确定治疗初始化或临床试验登记的合格患者,使痴呆症研究和临床护理受益。对法学硕士进行严格的评估对于了解其能力和局限性至关重要。
{"title":"Evaluating Large Language Models in extracting cognitive exam dates and scores.","authors":"Hao Zhang, Neil Jethani, Simon Jones, Nicholas Genes, Vincent J Major, Ian S Jaffe, Anthony B Cardillo, Noah Heilenbach, Nadia Fazal Ali, Luke J Bonanni, Andrew J Clayburn, Zain Khera, Erica C Sadler, Jaideep Prasad, Jamie Schlacter, Kevin Liu, Benjamin Silva, Sophie Montgomery, Eric J Kim, Jacob Lester, Theodore M Hill, Alba Avoricani, Ethan Chervonski, James Davydov, William Small, Eesha Chakravartty, Himanshu Grover, John A Dodson, Abraham A Brody, Yindalon Aphinyanaphongs, Arjun Masurkar, Narges Razavian","doi":"10.1371/journal.pdig.0000685","DOIUrl":"10.1371/journal.pdig.0000685","url":null,"abstract":"<p><p>Ensuring reliability of Large Language Models (LLMs) in clinical tasks is crucial. Our study assesses two state-of-the-art LLMs (ChatGPT and LlaMA-2) for extracting clinical information, focusing on cognitive tests like MMSE and CDR. Our data consisted of 135,307 clinical notes (Jan 12th, 2010 to May 24th, 2023) mentioning MMSE, CDR, or MoCA. After applying inclusion criteria 34,465 notes remained, of which 765 underwent ChatGPT (GPT-4) and LlaMA-2, and 22 experts reviewed the responses. ChatGPT successfully extracted MMSE and CDR instances with dates from 742 notes. We used 20 notes for fine-tuning and training the reviewers. The remaining 722 were assigned to reviewers, with 309 each assigned to two reviewers simultaneously. Inter-rater-agreement (Fleiss' Kappa), precision, recall, true/false negative rates, and accuracy were calculated. Our study follows TRIPOD reporting guidelines for model validation. For MMSE information extraction, ChatGPT (vs. LlaMA-2) achieved accuracy of 83% (vs. 66.4%), sensitivity of 89.7% (vs. 69.9%), true-negative rates of 96% (vs 60.0%), and precision of 82.7% (vs 62.2%). For CDR the results were lower overall, with accuracy of 87.1% (vs. 74.5%), sensitivity of 84.3% (vs. 39.7%), true-negative rates of 99.8% (98.4%), and precision of 48.3% (vs. 16.1%). We qualitatively evaluated the MMSE errors of ChatGPT and LlaMA-2 on double-reviewed notes. LlaMA-2 errors included 27 cases of total hallucination, 19 cases of reporting other scores instead of MMSE, 25 missed scores, and 23 cases of reporting only the wrong date. In comparison, ChatGPT's errors included only 3 cases of total hallucination, 17 cases of wrong test reported instead of MMSE, and 19 cases of reporting a wrong date. In this diagnostic/prognostic study of ChatGPT and LlaMA-2 for extracting cognitive exam dates and scores from clinical notes, ChatGPT exhibited high accuracy, with better performance compared to LlaMA-2. The use of LLMs could benefit dementia research and clinical care, by identifying eligible patients for treatments initialization or clinical trial enrollments. Rigorous evaluation of LLMs is crucial to understanding their capabilities and limitations.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000685"},"PeriodicalIF":0.0,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11634005/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Afraid of the dentist? There's an app for that: Development and usability testing of a cognitive behavior therapy-based mobile app. 害怕看牙医?有一个应用程序:开发和可用性测试基于认知行为治疗的移动应用程序。
Pub Date : 2024-12-11 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000690
Kelly A Daly, Kiara A Diaz-Gutierrez, Armon Beheshtian, Richard E Heyman, Amy M Smith Slep, Mark S Wolff

Objectives: Although several brief cognitive behavior therapy (CBT)-based treatments for dental fear have proven efficacious, these interventions remain largely unavailable outside of the specialty clinics in which they were developed. Leveraging technology, we sought to increase access to treatment for individuals with dental fear through the development of a mobile application (Dental FearLess).

Materials and methods: To assess the resonance of our app as an avenue for dental fear treatment, we conducted a study assessing the usability, feasibility, and acceptability of the beta app. Participants with moderate to severe dental fear (N = 80) completed the app and reported on the perceived usability of the mobile interface (Systems Usability Scale, SUS; α = .82) and credibility of the intervention (Credibility and Expectancy Questionnaire, CEQ; α = .88). A sub-sample of participants naïve to the app (n = 10) completed the app during a think-aloud procedure, sharing their candid thoughts and reactions while using the app, prior to reporting on usability and credibility metrics.

Results: Overall usability (M = 78.5, SD = 17.7) and credibility (M = 21.7, SD = 5.5) of the beta version of the app were good. The think-aloud data further corroborated the app's acceptability, while highlighting several areas for user improvement (i.e., aesthetics, navigation, engagement).

Conclusions: Usability and acceptability results are promising for the viability of an accessible, feasible, self-administered intervention for dental fear. Refinements made based on user feedback have produced a clinical-trial-ready mobile application. App refinement decisions, informed by user feedback, are representative of the larger literature-that is, of the ubiquitous negotiations m-health developers must make across treatment fidelity, usability, and engagement. Implications for future research are discussed.

目的:尽管一些基于认知行为疗法(CBT)的治疗牙科恐惧的方法已被证明是有效的,但这些干预措施在其开发的专业诊所之外仍然很大程度上不可用。利用技术,我们试图通过开发一个移动应用程序(dental FearLess)来增加患有牙科恐惧症的个人获得治疗的机会。材料和方法:为了评估我们的应用程序作为牙科恐惧治疗途径的共鸣,我们进行了一项研究,评估了测试应用程序的可用性、可行性和可接受性。患有中度至重度牙科恐惧的参与者(N = 80)完成了应用程序,并报告了移动界面的感知可用性(系统可用性量表,SUS;α = 0.82)和干预的可信度(可信度和期望问卷,CEQ;α = 0.88)。应用程序参与者的子样本naïve (n = 10)在思考过程中完成了应用程序,在使用应用程序时分享了他们坦率的想法和反应,然后报告可用性和可信度指标。结果:应用程序测试版的总体可用性(M = 78.5, SD = 17.7)和可信度(M = 21.7, SD = 5.5)较好。有声思考数据进一步证实了应用程序的可接受性,同时强调了用户改进的几个领域(即美学,导航,参与度)。结论:可用性和可接受性的结果是有希望的可行性,可访问的,可行的,自我管理的牙科恐惧干预。基于用户反馈的改进已经产生了一个临床试验就绪的移动应用程序。基于用户反馈的应用程序优化决策代表了更广泛的文献——也就是说,移动医疗开发者必须在治疗保真度、可用性和参与度方面进行无处不在的谈判。讨论了对未来研究的启示。
{"title":"Afraid of the dentist? There's an app for that: Development and usability testing of a cognitive behavior therapy-based mobile app.","authors":"Kelly A Daly, Kiara A Diaz-Gutierrez, Armon Beheshtian, Richard E Heyman, Amy M Smith Slep, Mark S Wolff","doi":"10.1371/journal.pdig.0000690","DOIUrl":"10.1371/journal.pdig.0000690","url":null,"abstract":"<p><strong>Objectives: </strong>Although several brief cognitive behavior therapy (CBT)-based treatments for dental fear have proven efficacious, these interventions remain largely unavailable outside of the specialty clinics in which they were developed. Leveraging technology, we sought to increase access to treatment for individuals with dental fear through the development of a mobile application (Dental FearLess).</p><p><strong>Materials and methods: </strong>To assess the resonance of our app as an avenue for dental fear treatment, we conducted a study assessing the usability, feasibility, and acceptability of the beta app. Participants with moderate to severe dental fear (N = 80) completed the app and reported on the perceived usability of the mobile interface (Systems Usability Scale, SUS; α = .82) and credibility of the intervention (Credibility and Expectancy Questionnaire, CEQ; α = .88). A sub-sample of participants naïve to the app (n = 10) completed the app during a think-aloud procedure, sharing their candid thoughts and reactions while using the app, prior to reporting on usability and credibility metrics.</p><p><strong>Results: </strong>Overall usability (M = 78.5, SD = 17.7) and credibility (M = 21.7, SD = 5.5) of the beta version of the app were good. The think-aloud data further corroborated the app's acceptability, while highlighting several areas for user improvement (i.e., aesthetics, navigation, engagement).</p><p><strong>Conclusions: </strong>Usability and acceptability results are promising for the viability of an accessible, feasible, self-administered intervention for dental fear. Refinements made based on user feedback have produced a clinical-trial-ready mobile application. App refinement decisions, informed by user feedback, are representative of the larger literature-that is, of the ubiquitous negotiations m-health developers must make across treatment fidelity, usability, and engagement. Implications for future research are discussed.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000690"},"PeriodicalIF":0.0,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11633963/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A statistical modelling approach for determining the cause of reported respiratory syndromes from internet-based participatory surveillance when influenza virus and SARS-CoV-2 are co-circulating. 在流感病毒和SARS-CoV-2共同传播时,用于确定基于互联网的参与式监测中报告的呼吸道综合征原因的统计建模方法。
Pub Date : 2024-12-09 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000655
Scott A McDonald, Albert Jan van Hoek, Daniela Paolotti, Mariette Hooiveld, Adam Meijer, Marit de Lange, Arianne van Gageldonk-Lafeber, Jacco Wallinga

Symptom-only case definitions are insufficient to discriminate COVID-like illness from acute respiratory infection (ARI) or influenza-like illness (ILI), due to the overlap in case definitions. Our objective was to develop a statistical method that does not rely on case definitions to determine the contribution of influenza virus and SARS-CoV-2 to the ARI burden during periods when both viruses are circulating. Data sources used for testing the approach were weekly ARI syndrome reports from the Infectieradar participatory syndromic surveillance system during the analysis period (the first 25 weeks of 2022, in which SARS-CoV-2 and influenza virus co-circulated in the Netherlands) and data from virologically tested ARI (including ILI) patients who consulted a general practitioner in the same period. Estimation of the proportions of ARI attributable to influenza virus, SARS-CoV-2, or another cause was framed as an inference problem, through which all data sources are combined within a Bayesian framework to infer the weekly numbers of ARI reports attributable to each cause. Posterior distributions for the attribution proportions were obtained using Markov Chain Monte-Carlo methods. Application of the approach to the example data sources indicated that, of the total ARI reports (total of 11,312; weekly mean of 452) during the analysis period, the model attributed 35.4% (95% CrI: 29.2-40.0%) and 27.0% (95% CrI: 19.3-35.2%) to influenza virus and SARS-CoV-2, respectively. The proposed statistical model allows the attribution of respiratory syndrome reports from participatory surveillance to either influenza virus or SARS-CoV-2 infection in periods when both viruses are circulating, but comparability of the participatory surveillance and virologically tested populations is important. Portability for use by other countries with established participatory respiratory surveillance systems is an asset.

由于病例定义的重叠,仅凭症状的病例定义不足以区分covid - 19样疾病与急性呼吸道感染或流感样疾病。我们的目标是开发一种不依赖病例定义的统计方法,以确定流感病毒和SARS-CoV-2在两种病毒流行期间对ARI负担的贡献。用于测试该方法的数据来源是在分析期间(2022年的前25周,SARS-CoV-2和流感病毒在荷兰共同传播)感染参与性综合征监测系统每周ARI综合征报告,以及同期咨询全科医生的经病毒学检测的ARI(包括ILI)患者的数据。对流感病毒、SARS-CoV-2或其他原因导致的ARI比例的估计被视为一个推断问题,通过该问题,将所有数据源合并到贝叶斯框架内,以推断每周归因于每种原因的ARI报告数量。利用马尔可夫链蒙特卡罗方法得到归因比例的后验分布。将该方法应用于示例数据来源表明,在ARI报告总数中(总数为11,312;每周平均452例),该模型将流感病毒和SARS-CoV-2分别归因于35.4% (95% CrI: 29.2-40.0%)和27.0% (95% CrI: 19.3-35.2%)。提出的统计模型允许在流感病毒或SARS-CoV-2病毒流行期间,将参与式监测的呼吸综合征报告归因为流感病毒或SARS-CoV-2感染,但参与式监测和病毒学检测人群的可比性很重要。便于其他已建立参与式呼吸监测系统的国家使用是一项优势。
{"title":"A statistical modelling approach for determining the cause of reported respiratory syndromes from internet-based participatory surveillance when influenza virus and SARS-CoV-2 are co-circulating.","authors":"Scott A McDonald, Albert Jan van Hoek, Daniela Paolotti, Mariette Hooiveld, Adam Meijer, Marit de Lange, Arianne van Gageldonk-Lafeber, Jacco Wallinga","doi":"10.1371/journal.pdig.0000655","DOIUrl":"10.1371/journal.pdig.0000655","url":null,"abstract":"<p><p>Symptom-only case definitions are insufficient to discriminate COVID-like illness from acute respiratory infection (ARI) or influenza-like illness (ILI), due to the overlap in case definitions. Our objective was to develop a statistical method that does not rely on case definitions to determine the contribution of influenza virus and SARS-CoV-2 to the ARI burden during periods when both viruses are circulating. Data sources used for testing the approach were weekly ARI syndrome reports from the Infectieradar participatory syndromic surveillance system during the analysis period (the first 25 weeks of 2022, in which SARS-CoV-2 and influenza virus co-circulated in the Netherlands) and data from virologically tested ARI (including ILI) patients who consulted a general practitioner in the same period. Estimation of the proportions of ARI attributable to influenza virus, SARS-CoV-2, or another cause was framed as an inference problem, through which all data sources are combined within a Bayesian framework to infer the weekly numbers of ARI reports attributable to each cause. Posterior distributions for the attribution proportions were obtained using Markov Chain Monte-Carlo methods. Application of the approach to the example data sources indicated that, of the total ARI reports (total of 11,312; weekly mean of 452) during the analysis period, the model attributed 35.4% (95% CrI: 29.2-40.0%) and 27.0% (95% CrI: 19.3-35.2%) to influenza virus and SARS-CoV-2, respectively. The proposed statistical model allows the attribution of respiratory syndrome reports from participatory surveillance to either influenza virus or SARS-CoV-2 infection in periods when both viruses are circulating, but comparability of the participatory surveillance and virologically tested populations is important. Portability for use by other countries with established participatory respiratory surveillance systems is an asset.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000655"},"PeriodicalIF":0.0,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11627408/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Barriers and facilitators for the use of telehealth by healthcare providers in India-A systematic review. 印度卫生保健提供者使用远程保健的障碍和促进因素:系统审查。
Pub Date : 2024-12-06 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000398
Parth Sharma, Shirish Rao, Padmavathy Krishna Kumar, Aiswarya R Nair, Disha Agrawal, Siddhesh Zadey, Gayathri Surendran, Rachna George Joseph, Girish Dayma, Liya Rafeekh, Shubhashis Saha, Sitanshi Sharma, S S Prakash, Venkatesan Sankarapandian, Preethi John, Vikram Patel

It is widely assumed that telehealth tools like mHealth (mobile health), telemedicine, and tele-education can supplement the efficiency of Healthcare Providers (HCPs). We conducted a systematic review of evidence on the barriers and facilitators associated with the use of telehealth by HCPs in India. A systematic literature search following a pre-registered protocol (https://doi.org/10.17605/OSF.IO/KQ3U9 [PROTOCOL DOI]) was conducted on PubMed. The search strategy, inclusion, and exclusion criteria were based on the World Health Organization's action framework on Human Resources for Health (HRH) and Universal Health Coverage (UHC) in India with a specific focus on telehealth tools. Eligible articles published in English from 1st January 2001 to 17th February 2022 were included. One hundred and six studies were included in the review. Of these, 53 studies (50%) involved mHealth interventions, 25 (23.6%) involved telemedicine interventions whereas the remaining 28 (26.4%) involved the use of tele-education interventions by HCPs in India. In each category, most of the studies followed a quantitative study design and were mostly published in the last 5 years. The study sites were more commonly present in states in south India. The facilitators and barriers related to each type of intervention were analyzed under the following sub-headings- 1) Human resource related, 2) Application related 3) Technical, and 4) Others. The interventions were most commonly used for improving the management of mental health, non-communicable diseases, and maternal and child health. The use of telehealth has not been uniformly studied in India. The facilitators and barriers to telehealth use need to be kept in mind while designing the intervention. Future studies should focus on looking at region-specific, intervention-specific, and health cadre-specific barriers and facilitators for the use of telehealth.

人们普遍认为,远程医疗工具,如移动医疗(移动医疗)、远程医疗和远程教育可以补充医疗保健提供者(HCPs)的效率。我们对印度卫生服务提供者使用远程医疗的相关障碍和促进因素的证据进行了系统审查。按照预先注册的协议(https://doi.org/10.17605/OSF.IO/KQ3U9 [protocol DOI])在PubMed上进行了系统的文献检索。搜索战略、纳入和排除标准以世界卫生组织关于印度卫生人力资源和全民健康覆盖的行动框架为基础,特别注重远程医疗工具。包括2001年1月1日至2022年2月17日期间发表的符合条件的英文文章。该综述纳入了106项研究。其中,53项研究(50%)涉及移动健康干预,25项研究(23.6%)涉及远程医疗干预,其余28项研究(26.4%)涉及印度hcp使用远程教育干预。在每个类别中,大多数研究都遵循定量研究设计,并且大多发表于最近5年。研究地点在印度南部各州更为普遍。每种干预措施的促进因素和障碍按以下小标题进行分析:1)人力资源相关,2)应用相关,3)技术相关,4)其他相关。这些干预措施最常用于改善对精神健康、非传染性疾病和妇幼保健的管理。在印度,对远程保健的使用情况没有进行统一的研究。在设计干预措施时,需要考虑到远程保健使用的促进因素和障碍。未来的研究应侧重于研究具体区域、具体干预措施和具体保健干部使用远程保健的障碍和促进因素。
{"title":"Barriers and facilitators for the use of telehealth by healthcare providers in India-A systematic review.","authors":"Parth Sharma, Shirish Rao, Padmavathy Krishna Kumar, Aiswarya R Nair, Disha Agrawal, Siddhesh Zadey, Gayathri Surendran, Rachna George Joseph, Girish Dayma, Liya Rafeekh, Shubhashis Saha, Sitanshi Sharma, S S Prakash, Venkatesan Sankarapandian, Preethi John, Vikram Patel","doi":"10.1371/journal.pdig.0000398","DOIUrl":"10.1371/journal.pdig.0000398","url":null,"abstract":"<p><p>It is widely assumed that telehealth tools like mHealth (mobile health), telemedicine, and tele-education can supplement the efficiency of Healthcare Providers (HCPs). We conducted a systematic review of evidence on the barriers and facilitators associated with the use of telehealth by HCPs in India. A systematic literature search following a pre-registered protocol (https://doi.org/10.17605/OSF.IO/KQ3U9 [PROTOCOL DOI]) was conducted on PubMed. The search strategy, inclusion, and exclusion criteria were based on the World Health Organization's action framework on Human Resources for Health (HRH) and Universal Health Coverage (UHC) in India with a specific focus on telehealth tools. Eligible articles published in English from 1st January 2001 to 17th February 2022 were included. One hundred and six studies were included in the review. Of these, 53 studies (50%) involved mHealth interventions, 25 (23.6%) involved telemedicine interventions whereas the remaining 28 (26.4%) involved the use of tele-education interventions by HCPs in India. In each category, most of the studies followed a quantitative study design and were mostly published in the last 5 years. The study sites were more commonly present in states in south India. The facilitators and barriers related to each type of intervention were analyzed under the following sub-headings- 1) Human resource related, 2) Application related 3) Technical, and 4) Others. The interventions were most commonly used for improving the management of mental health, non-communicable diseases, and maternal and child health. The use of telehealth has not been uniformly studied in India. The facilitators and barriers to telehealth use need to be kept in mind while designing the intervention. Future studies should focus on looking at region-specific, intervention-specific, and health cadre-specific barriers and facilitators for the use of telehealth.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000398"},"PeriodicalIF":0.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623477/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142789693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The usefulness of an application-supported nutritional intervention on non-high-density lipoprotein cholesterol in people with a risk of lifestyle-related diseases. 应用支持的营养干预对生活方式相关疾病风险人群非高密度脂蛋白胆固醇的有用性
Pub Date : 2024-12-06 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000648
Yuko Noda, Mitsuhiro Kometani, Akihiro Nomura, Masao Noda, Rie Oka, Mayuko Kadono, Takashi Yoneda

Lifestyle-related diseases, such as diabetes, are mostly caused by poor lifestyle habits; therefore, modifying these habits is important. In Japan, a system of specific health checkups (SHC) and specific health guidance (SHG) was introduced in 2008. The challenges faced include low retention rates and difficulty in maintaining results. Digital technologies can support self-management and increase patient convenience, although evidence of the usefulness of this technology for SHG is limited. This study evaluated the usefulness of nutritional guidance using a smartphone application (app) added to conventional SHG. We recruited eligible participants for SHG in Japan from November 2018 to March 2020. We assigned them to "Intervention Group: Application-Supported Nutrition Therapy" or "Control Group: Human Nutrition Therapy" based on their desire to use the app. The primary outcome was a change in non-high-density lipoprotein cholesterol (non-HDL-C) levels post-intervention. The secondary outcomes were a change in lipid profile, metabolic indices, and frequency of logins to the app. We assessed 109 participants in two cohorts: 3-month (short-term) and 6-month (long-term). The short-term cohort had 23 intervention and 29 control participants, while the long-term cohort had 35 and 22, respectively. There was a significant improvement in non-HDL-C levels in the short-term intervention group compared to the control group. There was no significant difference in non-HDL-C levels in the long-term groups or at 1 year. There were significant improvements in body weight (BW) in the short-term cohort until 1 year compared within the groups. The retention rate remained high in the short-term cohort (92%) but decreased to 57.8% at 6 months in the long-term cohort. Using an app system to facilitate dietary recordings and guidance for patients at risk of lifestyle-related diseases led to improved lipid levels and BW. These benefits persisted to some extent after 1 year. This app may partially supplement conventional SHG.

与生活方式有关的疾病,如糖尿病,大多是由不良的生活习惯引起的;因此,改变这些习惯很重要。在日本,2008年引入了特定健康检查(SHC)和特定健康指导(SHG)系统。面临的挑战包括低保留率和难以维持成果。数字技术可以支持自我管理并增加患者的便利性,尽管该技术对SHG有用的证据有限。本研究评估了使用智能手机应用程序(app)添加到传统SHG的营养指导的有效性。我们从2018年11月到2020年3月在日本招募了符合条件的SHG参与者。根据他们使用应用程序的意愿,我们将他们分为“干预组:应用支持的营养治疗”或“对照组:人类营养治疗”。主要结果是干预后非高密度脂蛋白胆固醇(non-HDL-C)水平的变化。次要结果是血脂、代谢指标和登录应用程序频率的变化。我们评估了109名参与者,分为两个队列:3个月(短期)和6个月(长期)。短期队列有23名干预参与者和29名对照参与者,而长期队列分别有35名和22名参与者。与对照组相比,短期干预组的非hdl - c水平有显著改善。非hdl - c水平在长期组和1年后无显著差异。与组内比较,短期队列的体重(BW)在1年内有显著改善。短期队列的保留率仍然很高(92%),但在6个月的长期队列中下降到57.8%。使用一个应用程序系统来促进饮食记录和指导有生活方式相关疾病风险的患者,从而改善了血脂水平和体重。这些益处在1年后仍有一定程度的持续。该应用程序可以部分补充传统SHG。
{"title":"The usefulness of an application-supported nutritional intervention on non-high-density lipoprotein cholesterol in people with a risk of lifestyle-related diseases.","authors":"Yuko Noda, Mitsuhiro Kometani, Akihiro Nomura, Masao Noda, Rie Oka, Mayuko Kadono, Takashi Yoneda","doi":"10.1371/journal.pdig.0000648","DOIUrl":"10.1371/journal.pdig.0000648","url":null,"abstract":"<p><p>Lifestyle-related diseases, such as diabetes, are mostly caused by poor lifestyle habits; therefore, modifying these habits is important. In Japan, a system of specific health checkups (SHC) and specific health guidance (SHG) was introduced in 2008. The challenges faced include low retention rates and difficulty in maintaining results. Digital technologies can support self-management and increase patient convenience, although evidence of the usefulness of this technology for SHG is limited. This study evaluated the usefulness of nutritional guidance using a smartphone application (app) added to conventional SHG. We recruited eligible participants for SHG in Japan from November 2018 to March 2020. We assigned them to \"Intervention Group: Application-Supported Nutrition Therapy\" or \"Control Group: Human Nutrition Therapy\" based on their desire to use the app. The primary outcome was a change in non-high-density lipoprotein cholesterol (non-HDL-C) levels post-intervention. The secondary outcomes were a change in lipid profile, metabolic indices, and frequency of logins to the app. We assessed 109 participants in two cohorts: 3-month (short-term) and 6-month (long-term). The short-term cohort had 23 intervention and 29 control participants, while the long-term cohort had 35 and 22, respectively. There was a significant improvement in non-HDL-C levels in the short-term intervention group compared to the control group. There was no significant difference in non-HDL-C levels in the long-term groups or at 1 year. There were significant improvements in body weight (BW) in the short-term cohort until 1 year compared within the groups. The retention rate remained high in the short-term cohort (92%) but decreased to 57.8% at 6 months in the long-term cohort. Using an app system to facilitate dietary recordings and guidance for patients at risk of lifestyle-related diseases led to improved lipid levels and BW. These benefits persisted to some extent after 1 year. This app may partially supplement conventional SHG.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000648"},"PeriodicalIF":0.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623450/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142789702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CPLLM: Clinical prediction with large language models. CPLLM:大型语言模型的临床预测。
Pub Date : 2024-12-06 eCollection Date: 2024-12-01 DOI: 10.1371/journal.pdig.0000680
Ofir Ben Shoham, Nadav Rappoport

We present Clinical Prediction with Large Language Models (CPLLM), a method that involves fine-tuning a pre-trained Large Language Model (LLM) for predicting clinical disease and readmission. We utilized quantization and fine-tuned the LLM using prompts. For diagnostic predictions, we predicted whether patients would be diagnosed with a target disease during their next visit or in the subsequent diagnosis, leveraging their historical medical records. We compared our results to various baselines, including Retain and Med-BERT, the latter of which is the current state-of-the-art model for disease prediction using temporal structured EHR data. In addition, we also evaluated CPLLM's utility in predicting hospital readmission and compared our method's performance with benchmark baselines. Our experiments ultimately revealed that our proposed method, CPLLM, surpasses all the tested models in terms of PR-AUC and ROC-AUC metrics, providing state-of-the-art performance as a tool for predicting disease diagnosis and patient hospital readmission without requiring pre-training on medical data. Such a method can be easily implemented and integrated into the clinical workflow to help care providers plan next steps for their patients.

我们提出了使用大语言模型进行临床预测(CPLLM),这是一种涉及微调预训练的大语言模型(LLM)的方法,用于预测临床疾病和再入院。我们使用了量化并使用提示对LLM进行了微调。对于诊断预测,我们利用患者的历史医疗记录,预测患者是否会在下次就诊或随后的诊断中被诊断出患有目标疾病。我们将我们的结果与各种基线进行了比较,包括Retain和Med-BERT,后者是目前使用时间结构化电子病历数据进行疾病预测的最先进模型。此外,我们还评估了CPLLM在预测医院再入院方面的效用,并将我们的方法的性能与基准基线进行了比较。我们的实验最终表明,我们提出的方法CPLLM在PR-AUC和ROC-AUC指标方面超过了所有已测试的模型,提供了最先进的性能,作为预测疾病诊断和患者再入院的工具,而无需对医疗数据进行预训练。这种方法可以很容易地实施并集成到临床工作流程中,以帮助护理提供者为患者计划下一步。
{"title":"CPLLM: Clinical prediction with large language models.","authors":"Ofir Ben Shoham, Nadav Rappoport","doi":"10.1371/journal.pdig.0000680","DOIUrl":"10.1371/journal.pdig.0000680","url":null,"abstract":"<p><p>We present Clinical Prediction with Large Language Models (CPLLM), a method that involves fine-tuning a pre-trained Large Language Model (LLM) for predicting clinical disease and readmission. We utilized quantization and fine-tuned the LLM using prompts. For diagnostic predictions, we predicted whether patients would be diagnosed with a target disease during their next visit or in the subsequent diagnosis, leveraging their historical medical records. We compared our results to various baselines, including Retain and Med-BERT, the latter of which is the current state-of-the-art model for disease prediction using temporal structured EHR data. In addition, we also evaluated CPLLM's utility in predicting hospital readmission and compared our method's performance with benchmark baselines. Our experiments ultimately revealed that our proposed method, CPLLM, surpasses all the tested models in terms of PR-AUC and ROC-AUC metrics, providing state-of-the-art performance as a tool for predicting disease diagnosis and patient hospital readmission without requiring pre-training on medical data. Such a method can be easily implemented and integrated into the clinical workflow to help care providers plan next steps for their patients.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 12","pages":"e0000680"},"PeriodicalIF":0.0,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623460/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142789694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PLOS digital health
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1