首页 > 最新文献

PLOS digital health最新文献

英文 中文
Beyond diagnostic connectivity: Leveraging digital health technology for the real-time collection and provision of high-quality actionable data on infectious diseases in Uganda. 超越诊断连接:利用数字医疗技术实时收集和提供乌干达传染病的高质量可操作数据。
Pub Date : 2024-08-23 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000566
Dennis Mujuni, Julius Tumwine, Kenneth Musisi, Edward Otim, Maha Reda Farhat, Dorothy Nabulobi, Nyombi Abdunoor, Arnold Kennedy Tumuhairwe, Marvin Derrick Mugisa, Denis Oola, Fred Semitala, Raymond Byaruhanga, Stavia Turyahabwe, Moses Joloba

Automated data transmission from diagnostic instrument networks to a central database at the Ministries of Health has the potential of providing real-time quality data not only on diagnostic instrument performance, but also continuous disease surveillance and patient care. We aimed at sharing how a locally developed novel diagnostic connectivity solution channels actionable data from diagnostic instruments to the national dashboards for disease control in Uganda between May 2022 and May 2023. The diagnostic connectivity solution was successfully configured on a selected network of multiplexing diagnostic instruments at 260 sites in Uganda, providing a layered access of data. Of these, 909,674 test results were automatically collected from 269 "GeneXpert" machines, 5597 test results from 28 "Truenat" and >12,000 were from 3 digital x-ray devices to different stakeholder levels to ensure optimal use of data for their intended purpose. The government and relevant stakeholders are empowered with usable and actionable data from the diagnostic instruments. The successful implementation of the diagnostic connectivity solution depended on some key operational strategies namely; sustained internet connectivity and short message services, stakeholder engagement, a strong in-country laboratory coordination network, human resource capacity building, establishing a network for the diagnostic instruments, and integration with existing health data collection tools. Poor bandwidth at some locations was a major hindrance for the successful implementation of the connectivity solution. Maintaining stakeholder engagement at the clinical level is key for sustaining diagnostic data connectivity. The locally developed diagnostic connectivity solution as a digital health technology offers the chance to collect high-quality data on a number of parameters for disease control, including error analysis, thereby strengthening the quality of data from the networked diagnostic sites to relevant stakeholders.

从诊断仪器网络到卫生部中央数据库的自动数据传输不仅可以提供诊断仪器性能的实时优质数据,还可以提供持续的疾病监测和患者护理数据。我们旨在分享当地开发的新型诊断连接解决方案如何在 2022 年 5 月至 2023 年 5 月期间将诊断仪器的可操作数据传输到乌干达的国家疾病控制仪表板。诊断连接解决方案已成功配置在乌干达 260 个地点的多路复用诊断仪器网络上,提供分层数据访问。其中,从 269 台 "GeneXpert "机器上自动收集了 909674 项检测结果,从 28 台 "Truenat "机器上自动收集了 5597 项检测结果,从 3 台数字 X 光设备上自动收集了超过 12000 项检测结果,并将这些数据提供给不同级别的利益攸关方,以确保数据的最佳使用达到预期目的。政府和相关利益攸关方可以从诊断仪器中获得可用和可操作的数据。诊断连接解决方案的成功实施取决于一些关键的业务战略,即:持续的互联网连接和短信服务、利益攸关方的参与、强大的国内实验室协调网络、人力资源能力建设、建立诊断仪器网络以及与现有卫生数据收集工具的整合。某些地点带宽不足是成功实施连接解决方案的主要障碍。在临床层面保持利益相关者的参与是保持诊断数据连通性的关键。本地开发的诊断连接解决方案作为一种数字医疗技术,为收集疾病控制方面的一些参数的高质量数据(包括误差分析)提供了机会,从而提高了从联网诊断站点向相关利益攸关方提供的数据的质量。
{"title":"Beyond diagnostic connectivity: Leveraging digital health technology for the real-time collection and provision of high-quality actionable data on infectious diseases in Uganda.","authors":"Dennis Mujuni, Julius Tumwine, Kenneth Musisi, Edward Otim, Maha Reda Farhat, Dorothy Nabulobi, Nyombi Abdunoor, Arnold Kennedy Tumuhairwe, Marvin Derrick Mugisa, Denis Oola, Fred Semitala, Raymond Byaruhanga, Stavia Turyahabwe, Moses Joloba","doi":"10.1371/journal.pdig.0000566","DOIUrl":"10.1371/journal.pdig.0000566","url":null,"abstract":"<p><p>Automated data transmission from diagnostic instrument networks to a central database at the Ministries of Health has the potential of providing real-time quality data not only on diagnostic instrument performance, but also continuous disease surveillance and patient care. We aimed at sharing how a locally developed novel diagnostic connectivity solution channels actionable data from diagnostic instruments to the national dashboards for disease control in Uganda between May 2022 and May 2023. The diagnostic connectivity solution was successfully configured on a selected network of multiplexing diagnostic instruments at 260 sites in Uganda, providing a layered access of data. Of these, 909,674 test results were automatically collected from 269 \"GeneXpert\" machines, 5597 test results from 28 \"Truenat\" and >12,000 were from 3 digital x-ray devices to different stakeholder levels to ensure optimal use of data for their intended purpose. The government and relevant stakeholders are empowered with usable and actionable data from the diagnostic instruments. The successful implementation of the diagnostic connectivity solution depended on some key operational strategies namely; sustained internet connectivity and short message services, stakeholder engagement, a strong in-country laboratory coordination network, human resource capacity building, establishing a network for the diagnostic instruments, and integration with existing health data collection tools. Poor bandwidth at some locations was a major hindrance for the successful implementation of the connectivity solution. Maintaining stakeholder engagement at the clinical level is key for sustaining diagnostic data connectivity. The locally developed diagnostic connectivity solution as a digital health technology offers the chance to collect high-quality data on a number of parameters for disease control, including error analysis, thereby strengthening the quality of data from the networked diagnostic sites to relevant stakeholders.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000566"},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11343378/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142044218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Supervised machine learning to predict smoking lapses from Ecological Momentary Assessments and sensor data: Implications for just-in-time adaptive intervention development. 通过监督机器学习从生态瞬间评估和传感器数据中预测吸烟间隔:对及时适应性干预开发的影响。
Pub Date : 2024-08-23 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000594
Olga Perski, Dimitra Kale, Corinna Leppin, Tosan Okpako, David Simons, Stephanie P Goldstein, Eric Hekler, Jamie Brown

Specific moments of lapse among smokers attempting to quit often lead to full relapse, which highlights a need for interventions that target lapses before they might occur, such as just-in-time adaptive interventions (JITAIs). To inform the decision points and tailoring variables of a lapse prevention JITAI, we trained and tested supervised machine learning algorithms that use Ecological Momentary Assessments (EMAs) and wearable sensor data of potential lapse triggers and lapse incidence. We aimed to identify a best-performing and feasible algorithm to take forwards in a JITAI. For 10 days, adult smokers attempting to quit were asked to complete 16 hourly EMAs/day assessing cravings, mood, activity, social context, physical context, and lapse incidence, and to wear a Fitbit Charge 4 during waking hours to passively collect data on steps and heart rate. A series of group-level supervised machine learning algorithms (e.g., Random Forest, XGBoost) were trained and tested, without and with the sensor data. Their ability to predict lapses for out-of-sample (i) observations and (ii) individuals were evaluated. Next, a series of individual-level and hybrid (i.e., group- and individual-level) algorithms were trained and tested. Participants (N = 38) responded to 6,124 EMAs (with 6.9% of responses reporting a lapse). Without sensor data, the best-performing group-level algorithm had an area under the receiver operating characteristic curve (AUC) of 0.899 (95% CI = 0.871-0.928). Its ability to classify lapses for out-of-sample individuals ranged from poor to excellent (AUCper person = 0.524-0.994; median AUC = 0.639). 15/38 participants had adequate data for individual-level algorithms to be constructed, with a median AUC of 0.855 (range: 0.451-1.000). Hybrid algorithms could be constructed for 25/38 participants, with a median AUC of 0.692 (range: 0.523 to 0.998). With sensor data, the best-performing group-level algorithm had an AUC of 0.952 (95% CI = 0.933-0.970). Its ability to classify lapses for out-of-sample individuals ranged from poor to excellent (AUCper person = 0.494-0.979; median AUC = 0.745). 11/30 participants had adequate data for individual-level algorithms to be constructed, with a median AUC of 0.983 (range: 0.549-1.000). Hybrid algorithms could be constructed for 20/30 participants, with a median AUC of 0.772 (range: 0.444 to 0.968). In conclusion, high-performing group-level lapse prediction algorithms without and with sensor data had variable performance when applied to out-of-sample individuals. Individual-level and hybrid algorithms could be constructed for a limited number of individuals but had improved performance, particularly when incorporating sensor data for participants with sufficient wear time. Feasibility constraints and the need to balance multiple success criteria in the JITAI development and implementation process are discussed.

试图戒烟的吸烟者在特定的戒烟失误时刻往往会导致完全复吸,这凸显了在失误可能发生之前针对失误进行干预的必要性,例如及时适应性干预(JITAIs)。为了为预防失效的 JITAI 的决策点和定制变量提供信息,我们训练并测试了有监督的机器学习算法,该算法使用了潜在失效触发因素和失效发生率的生态瞬间评估(EMA)和可穿戴传感器数据。我们的目标是确定一种性能最佳且可行的算法,以便在 JITAI 中推广。在为期 10 天的时间里,我们要求试图戒烟的成年吸烟者每天每小时完成 16 次 EMA,评估渴望、情绪、活动、社会环境、身体环境和失效发生率,并在清醒时佩戴 Fitbit Charge 4,被动收集步数和心率数据。在没有传感器数据和有传感器数据的情况下,对一系列组级监督机器学习算法(如随机森林、XGBoost)进行了训练和测试。评估了这些算法预测样本外 (i) 观察结果和 (ii) 个人失误的能力。接下来,对一系列个体级和混合(即群体级和个体级)算法进行了训练和测试。参与者(N = 38)回答了 6124 次 EMA(6.9% 的回答报告了失误)。在没有传感器数据的情况下,表现最好的群体级算法的接收器工作特征曲线下面积 (AUC) 为 0.899(95% CI = 0.871-0.928)。该算法对样本外个体的失误分类能力从较差到优秀不等(每人的 AUC = 0.524-0.994;中位数 AUC = 0.639)。有 15/38 名参与者拥有足够的数据来构建个人层面的算法,AUC 中位数为 0.855(范围:0.451-1.000)。25/38 名参与者可以构建混合算法,AUC 中位数为 0.692(范围:0.523 至 0.998)。通过传感器数据,表现最好的组级算法的 AUC 为 0.952(95% CI = 0.933-0.970)。该算法对样本外个人的失误分类能力从较差到优秀不等(每人的 AUC = 0.494-0.979;中位数 AUC = 0.745)。11/30的参与者有足够的数据来构建个人层面的算法,中位AUC为0.983(范围:0.549-1.000)。有 20/30 名参与者可以构建混合算法,AUC 中位数为 0.772(范围:0.444 至 0.968)。总之,无传感器数据和有传感器数据的高性能群体级失效预测算法在应用于样本外个体时性能各异。个人级算法和混合算法可用于数量有限的个体,但性能有所提高,特别是在为有足够佩戴时间的参与者纳入传感器数据时。本文讨论了可行性限制以及在 JITAI 开发和实施过程中平衡多种成功标准的必要性。
{"title":"Supervised machine learning to predict smoking lapses from Ecological Momentary Assessments and sensor data: Implications for just-in-time adaptive intervention development.","authors":"Olga Perski, Dimitra Kale, Corinna Leppin, Tosan Okpako, David Simons, Stephanie P Goldstein, Eric Hekler, Jamie Brown","doi":"10.1371/journal.pdig.0000594","DOIUrl":"10.1371/journal.pdig.0000594","url":null,"abstract":"<p><p>Specific moments of lapse among smokers attempting to quit often lead to full relapse, which highlights a need for interventions that target lapses before they might occur, such as just-in-time adaptive interventions (JITAIs). To inform the decision points and tailoring variables of a lapse prevention JITAI, we trained and tested supervised machine learning algorithms that use Ecological Momentary Assessments (EMAs) and wearable sensor data of potential lapse triggers and lapse incidence. We aimed to identify a best-performing and feasible algorithm to take forwards in a JITAI. For 10 days, adult smokers attempting to quit were asked to complete 16 hourly EMAs/day assessing cravings, mood, activity, social context, physical context, and lapse incidence, and to wear a Fitbit Charge 4 during waking hours to passively collect data on steps and heart rate. A series of group-level supervised machine learning algorithms (e.g., Random Forest, XGBoost) were trained and tested, without and with the sensor data. Their ability to predict lapses for out-of-sample (i) observations and (ii) individuals were evaluated. Next, a series of individual-level and hybrid (i.e., group- and individual-level) algorithms were trained and tested. Participants (N = 38) responded to 6,124 EMAs (with 6.9% of responses reporting a lapse). Without sensor data, the best-performing group-level algorithm had an area under the receiver operating characteristic curve (AUC) of 0.899 (95% CI = 0.871-0.928). Its ability to classify lapses for out-of-sample individuals ranged from poor to excellent (AUCper person = 0.524-0.994; median AUC = 0.639). 15/38 participants had adequate data for individual-level algorithms to be constructed, with a median AUC of 0.855 (range: 0.451-1.000). Hybrid algorithms could be constructed for 25/38 participants, with a median AUC of 0.692 (range: 0.523 to 0.998). With sensor data, the best-performing group-level algorithm had an AUC of 0.952 (95% CI = 0.933-0.970). Its ability to classify lapses for out-of-sample individuals ranged from poor to excellent (AUCper person = 0.494-0.979; median AUC = 0.745). 11/30 participants had adequate data for individual-level algorithms to be constructed, with a median AUC of 0.983 (range: 0.549-1.000). Hybrid algorithms could be constructed for 20/30 participants, with a median AUC of 0.772 (range: 0.444 to 0.968). In conclusion, high-performing group-level lapse prediction algorithms without and with sensor data had variable performance when applied to out-of-sample individuals. Individual-level and hybrid algorithms could be constructed for a limited number of individuals but had improved performance, particularly when incorporating sensor data for participants with sufficient wear time. Feasibility constraints and the need to balance multiple success criteria in the JITAI development and implementation process are discussed.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000594"},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11343380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142044220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feasibility characteristics of wrist-worn fitness trackers in health status monitoring for post-COVID patients in remote and rural areas. 腕戴式健身追踪器在监测偏远和农村地区 COVID 后患者健康状况中的可行性特征。
Pub Date : 2024-08-22 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000571
Madeleine Wiebe, Marnie Mackay, Ragur Krishnan, Julie Tian, Jakob Larsson, Setayesh Modanloo, Christiane Job McIntosh, Melissa Sztym, Gail Elton-Smith, Alyssa Rose, Chester Ho, Andrew Greenshaw, Bo Cao, Andrew Chan, Jake Hayward

Introduction: Common, consumer-grade biosensors mounted on fitness trackers and smartwatches can measure an array of biometrics that have potential utility in post-discharge medical monitoring, especially in remote/rural communities. The feasibility characteristics for wrist-worn biosensors are poorly described for post-COVID conditions and rural populations.

Methods: We prospectively recruited patients in rural communities who were enrolled in an at-home rehabilitation program for post-COVID conditions. They were asked to wear a FitBit Charge 2 device and biosensor parameters were analyzed [e.g. heart rate, sleep, and activity]. Electronic patient reported outcome measures [E-PROMS] for mental [bi-weekly] and physical [daily] symptoms were collected using SMS text or email [per patient preference]. Exit surveys and interviews evaluated the patient experience.

Results: Ten patients were observed for an average of 58 days and half [N = 5] were monitored for 8 weeks or more. Five patients [50%] had been hospitalized with COVID [mean stay = 41 days] and 4 [36%] had required mechanical ventilation. As baseline, patients had moderate to severe levels of anxiety, depression, and stress; fatigue and shortness of breath were the most prevalent physical symptoms. Four patients [40%] already owned a smartwatch. In total, 575 patient days of patient monitoring occurred across 10 patients. Biosensor data was usable for 91.3% of study hours and surveys were completed 82.1% and 78.7% of the time for physical and mental symptoms, respectively. Positive correlations were observed between stress and resting heart rate [r = 0.360, p<0.01], stress and daily steps [r = 0.335, p<0.01], and anxiety and daily steps [r = 0.289, p<0.01]. There was a trend toward negative correlation between sleep time and physical symptom burden [r = -0.211, p = 0.05]. Patients reported an overall positive experience and identified the potential for wearable devices to improve medical safety and access to care. Concerns around data privacy/security were infrequent.

Conclusions: We report excellent feasibility characteristics for wrist-worn biosensors and e-PROMS as a possible substrate for multi-modal disease tracking in post-COVID conditions. Adapting consumer-grade wearables for medical use and scalable remote patient monitoring holds great potential.

导言:安装在健身追踪器和智能手表上的普通消费级生物传感器可以测量一系列生物特征,在出院后医疗监测中具有潜在用途,尤其是在偏远/农村社区。腕戴式生物传感器在 COVID 后情况和农村人口中的可行性特征描述较少:我们前瞻性地招募了农村社区的患者,他们都参加了一项针对 COVID 后遗症的居家康复计划。我们要求他们佩戴 FitBit Charge 2 设备,并对生物传感器参数(如心率、睡眠和活动)进行分析。根据患者的偏好,通过短信或电子邮件收集患者对精神症状(每两周一次)和身体症状(每天一次)的电子报告结果[E-PROMS]。退出调查和访谈评估了患者的体验:10 名患者接受了平均 58 天的观察,半数患者 [N = 5] 接受了 8 周或更长时间的观察。五名患者[50%]曾因 COVID 住院[平均住院时间 = 41 天],四名患者[36%]需要机械通气。作为基线,患者的焦虑、抑郁和压力程度为中度到重度;疲劳和气短是最常见的身体症状。四名患者(40%)已拥有智能手表。10 名患者共接受了 575 天的患者监测。91.3%的研究时间内生物传感器数据可用,82.1%和78.7%的时间内完成了身体和精神症状调查。压力和静息心率之间呈正相关[r = 0.360, p结论:我们报告了腕戴式生物传感器和 e-PROMS 作为后 COVID 条件下多模式疾病跟踪的可能基质的出色可行性特征。将消费级可穿戴设备用于医疗用途和可扩展的远程病人监测具有巨大的潜力。
{"title":"Feasibility characteristics of wrist-worn fitness trackers in health status monitoring for post-COVID patients in remote and rural areas.","authors":"Madeleine Wiebe, Marnie Mackay, Ragur Krishnan, Julie Tian, Jakob Larsson, Setayesh Modanloo, Christiane Job McIntosh, Melissa Sztym, Gail Elton-Smith, Alyssa Rose, Chester Ho, Andrew Greenshaw, Bo Cao, Andrew Chan, Jake Hayward","doi":"10.1371/journal.pdig.0000571","DOIUrl":"10.1371/journal.pdig.0000571","url":null,"abstract":"<p><strong>Introduction: </strong>Common, consumer-grade biosensors mounted on fitness trackers and smartwatches can measure an array of biometrics that have potential utility in post-discharge medical monitoring, especially in remote/rural communities. The feasibility characteristics for wrist-worn biosensors are poorly described for post-COVID conditions and rural populations.</p><p><strong>Methods: </strong>We prospectively recruited patients in rural communities who were enrolled in an at-home rehabilitation program for post-COVID conditions. They were asked to wear a FitBit Charge 2 device and biosensor parameters were analyzed [e.g. heart rate, sleep, and activity]. Electronic patient reported outcome measures [E-PROMS] for mental [bi-weekly] and physical [daily] symptoms were collected using SMS text or email [per patient preference]. Exit surveys and interviews evaluated the patient experience.</p><p><strong>Results: </strong>Ten patients were observed for an average of 58 days and half [N = 5] were monitored for 8 weeks or more. Five patients [50%] had been hospitalized with COVID [mean stay = 41 days] and 4 [36%] had required mechanical ventilation. As baseline, patients had moderate to severe levels of anxiety, depression, and stress; fatigue and shortness of breath were the most prevalent physical symptoms. Four patients [40%] already owned a smartwatch. In total, 575 patient days of patient monitoring occurred across 10 patients. Biosensor data was usable for 91.3% of study hours and surveys were completed 82.1% and 78.7% of the time for physical and mental symptoms, respectively. Positive correlations were observed between stress and resting heart rate [r = 0.360, p<0.01], stress and daily steps [r = 0.335, p<0.01], and anxiety and daily steps [r = 0.289, p<0.01]. There was a trend toward negative correlation between sleep time and physical symptom burden [r = -0.211, p = 0.05]. Patients reported an overall positive experience and identified the potential for wearable devices to improve medical safety and access to care. Concerns around data privacy/security were infrequent.</p><p><strong>Conclusions: </strong>We report excellent feasibility characteristics for wrist-worn biosensors and e-PROMS as a possible substrate for multi-modal disease tracking in post-COVID conditions. Adapting consumer-grade wearables for medical use and scalable remote patient monitoring holds great potential.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000571"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11340956/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142037890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synergistic patient factors are driving recent increased pediatric urgent care demand. 患者的协同因素推动了近期儿科紧急护理需求的增长。
Pub Date : 2024-08-22 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000572
Emily Lehan, Peyton Briand, Eileen O'Brien, Aleena Amjad Hafeez, Daniel J Mulder

Objectives: We aimed to use the high fidelity urgent care patient data to model the factors that have led to the increased demand at our local pediatric urgent care centre.

Methods: The dataset for this retrospective cohort study was obtained from our local healthcare centre's national reporting data for pediatric urgent care visits from 2006 to 2022. Variables analyzed included: basic patient demographics, chief complaint, triage urgency, date and time of registration/discharge, discharge diagnosis, and discharge destination. Statistical analysis of non-linear trends was summarized by locally estimated scatterplot smoothing splines. For machine learning, we used the tidymodels R package. Models were validated in training using k-fold cross validation with k = 5. We used univariate linear regression as a baseline model. After the data was standardized, correlation and homoscedasticity were evaluated between all parameter permutations.

Results: This dataset consisted of 164,660 unique visits to our academic centre's pediatric urgent care. Over the study period, there was an overall substantial increase in the number of urgent care visits per day, with a rapid increase beyond previous levels in 2021 and further in 2022. The increased length of stay trend was consistent across presenting complaint categories. The proportion of patients without primary care in 2022 was 2.5 times higher than in 2013. A random forest machine learning model revealed the relative importance of features to predicting a visit in 2022 were: longer stay, later registration in the day, diagnosis of an infectious illness, and younger age.

Conclusions: This study identified a combination of declining primary care access, circulating viral infections, and shifting chief complaints as factors driving the recent increase in frequency and duration of visits to our urgent care service.

目的我们旨在利用高保真紧急护理患者数据来模拟导致当地儿科紧急护理中心需求增加的因素:这项回顾性队列研究的数据集来自当地医疗保健中心 2006 年至 2022 年的全国儿科急诊就诊报告数据。分析的变量包括:患者基本人口统计学特征、主诉、分诊紧急程度、登记/出院日期和时间、出院诊断和出院目的地。对非线性趋势的统计分析通过局部估计的散点图平滑样条进行总结。在机器学习方面,我们使用了 tidymodels R 软件包。使用 k = 5 的 k 倍交叉验证对模型进行训练验证。我们使用单变量线性回归作为基线模型。数据标准化后,对所有参数排列之间的相关性和同方差性进行了评估:该数据集包括 164,660 人次到我们学术中心的儿科急诊就诊。在研究期间,每天的急诊就诊人数总体上大幅增加,在 2021 年迅速超过以前的水平,在 2022 年进一步增加。住院时间延长的趋势在各种主诉类别中都是一致的。2022 年没有初级医疗服务的患者比例是 2013 年的 2.5 倍。随机森林机器学习模型显示,预测2022年就诊的相对重要特征是:住院时间较长、当天挂号时间较晚、诊断为传染病以及年龄较小:这项研究发现,基层医疗机构就诊率的下降、病毒感染的流行以及主诉的变化是导致近期急诊就诊频率和就诊时间增加的综合因素。
{"title":"Synergistic patient factors are driving recent increased pediatric urgent care demand.","authors":"Emily Lehan, Peyton Briand, Eileen O'Brien, Aleena Amjad Hafeez, Daniel J Mulder","doi":"10.1371/journal.pdig.0000572","DOIUrl":"10.1371/journal.pdig.0000572","url":null,"abstract":"<p><strong>Objectives: </strong>We aimed to use the high fidelity urgent care patient data to model the factors that have led to the increased demand at our local pediatric urgent care centre.</p><p><strong>Methods: </strong>The dataset for this retrospective cohort study was obtained from our local healthcare centre's national reporting data for pediatric urgent care visits from 2006 to 2022. Variables analyzed included: basic patient demographics, chief complaint, triage urgency, date and time of registration/discharge, discharge diagnosis, and discharge destination. Statistical analysis of non-linear trends was summarized by locally estimated scatterplot smoothing splines. For machine learning, we used the tidymodels R package. Models were validated in training using k-fold cross validation with k = 5. We used univariate linear regression as a baseline model. After the data was standardized, correlation and homoscedasticity were evaluated between all parameter permutations.</p><p><strong>Results: </strong>This dataset consisted of 164,660 unique visits to our academic centre's pediatric urgent care. Over the study period, there was an overall substantial increase in the number of urgent care visits per day, with a rapid increase beyond previous levels in 2021 and further in 2022. The increased length of stay trend was consistent across presenting complaint categories. The proportion of patients without primary care in 2022 was 2.5 times higher than in 2013. A random forest machine learning model revealed the relative importance of features to predicting a visit in 2022 were: longer stay, later registration in the day, diagnosis of an infectious illness, and younger age.</p><p><strong>Conclusions: </strong>This study identified a combination of declining primary care access, circulating viral infections, and shifting chief complaints as factors driving the recent increase in frequency and duration of visits to our urgent care service.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000572"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11340883/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142037892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applied artificial intelligence for global child health: Addressing biases and barriers. 应用人工智能促进全球儿童健康:消除偏见和障碍。
Pub Date : 2024-08-22 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000583
Vijaytha Muralidharan, Joel Schamroth, Alaa Youssef, Leo A Celi, Roxana Daneshjou

Given the potential benefits of artificial intelligence and machine learning (AI/ML) within healthcare, it is critical to consider how these technologies can be deployed in pediatric research and practice. Currently, healthcare AI/ML has not yet adapted to the specific technical considerations related to pediatric data nor adequately addressed the specific vulnerabilities of children and young people (CYP) in relation to AI. While the greatest burden of disease in CYP is firmly concentrated in lower and middle-income countries (LMICs), existing applied pediatric AI/ML efforts are concentrated in a small number of high-income countries (HICs). In LMICs, use-cases remain primarily in the proof-of-concept stage. This narrative review identifies a number of intersecting challenges that pose barriers to effective AI/ML for CYP globally and explores the shifts needed to make progress across multiple domains. Child-specific technical considerations throughout the AI/ML lifecycle have been largely overlooked thus far, yet these can be critical to model effectiveness. Governance concerns are paramount, with suitable national and international frameworks and guidance required to enable the safe and responsible deployment of advanced technologies impacting the care of CYP and using their data. An ambitious vision for child health demands that the potential benefits of AI/Ml are realized universally through greater international collaboration, capacity building, strong oversight, and ultimately diffusing the AI/ML locus of power to empower researchers and clinicians globally. In order that AI/ML systems that do not exacerbate inequalities in pediatric care, teams researching and developing these technologies in LMICs must ensure that AI/ML research is inclusive of the needs and concerns of CYP and their caregivers. A broad, interdisciplinary, and human-centered approach to AI/ML is essential for developing tools for healthcare workers delivering care, such that the creation and deployment of ML is grounded in local systems, cultures, and clinical practice. Decisions to invest in developing and testing pediatric AI/ML in resource-constrained settings must always be part of a broader evaluation of the overall needs of a healthcare system, considering the critical building blocks underpinning effective, sustainable, and cost-efficient healthcare delivery for CYP.

鉴于人工智能和机器学习(AI/ML)在医疗保健领域的潜在益处,考虑如何将这些技术应用于儿科研究和实践至关重要。目前,医疗保健领域的人工智能/机器学习尚未适应与儿科数据相关的特定技术考虑因素,也未充分解决儿童和青少年(CYP)在人工智能方面的特定脆弱性。虽然儿童和青少年最大的疾病负担主要集中在中低收入国家(LMICs),但现有的儿科人工智能/移动医疗应用却集中在少数高收入国家(HICs)。在中低收入国家,使用案例仍主要处于概念验证阶段。本综述指出了一些相互交织的挑战,这些挑战阻碍了在全球范围内对儿童青少年进行有效的人工智能/移动医疗,并探讨了在多个领域取得进展所需的转变。迄今为止,在整个人工智能/移动媒体生命周期中,针对儿童的技术考虑因素在很大程度上被忽视了,但这些因素对模型的有效性至关重要。管理问题至关重要,需要有适当的国家和国际框架与指导,以便安全、负责任地部署影响儿童保健的先进技术,并使用他们的数据。儿童健康的宏伟愿景要求通过加强国际合作、能力建设、强有力的监督以及最终在全球范围内分散人工智能/移动医疗的权力以增强研究人员和临床医生的能力,从而普遍实现人工智能/移动医疗的潜在益处。为了使人工智能/移动医疗系统不会加剧儿科护理中的不平等,在低收入和中等收入国家研究和开发这些技术的团队必须确保人工智能/移动医疗研究能够兼顾儿童青少年及其护理人员的需求和关切。对人工智能/移动医疗采取广泛、跨学科和以人为本的方法,对于为提供医疗服务的医护人员开发工具至关重要,这样才能使移动医疗的创建和部署立足于当地的系统、文化和临床实践。在资源有限的环境中,投资开发和测试儿科人工智能/移动语言的决策必须始终是对医疗保健系统整体需求进行更广泛评估的一部分,同时考虑到为儿童青少年提供有效、可持续和具有成本效益的医疗保健服务的关键组成部分。
{"title":"Applied artificial intelligence for global child health: Addressing biases and barriers.","authors":"Vijaytha Muralidharan, Joel Schamroth, Alaa Youssef, Leo A Celi, Roxana Daneshjou","doi":"10.1371/journal.pdig.0000583","DOIUrl":"10.1371/journal.pdig.0000583","url":null,"abstract":"<p><p>Given the potential benefits of artificial intelligence and machine learning (AI/ML) within healthcare, it is critical to consider how these technologies can be deployed in pediatric research and practice. Currently, healthcare AI/ML has not yet adapted to the specific technical considerations related to pediatric data nor adequately addressed the specific vulnerabilities of children and young people (CYP) in relation to AI. While the greatest burden of disease in CYP is firmly concentrated in lower and middle-income countries (LMICs), existing applied pediatric AI/ML efforts are concentrated in a small number of high-income countries (HICs). In LMICs, use-cases remain primarily in the proof-of-concept stage. This narrative review identifies a number of intersecting challenges that pose barriers to effective AI/ML for CYP globally and explores the shifts needed to make progress across multiple domains. Child-specific technical considerations throughout the AI/ML lifecycle have been largely overlooked thus far, yet these can be critical to model effectiveness. Governance concerns are paramount, with suitable national and international frameworks and guidance required to enable the safe and responsible deployment of advanced technologies impacting the care of CYP and using their data. An ambitious vision for child health demands that the potential benefits of AI/Ml are realized universally through greater international collaboration, capacity building, strong oversight, and ultimately diffusing the AI/ML locus of power to empower researchers and clinicians globally. In order that AI/ML systems that do not exacerbate inequalities in pediatric care, teams researching and developing these technologies in LMICs must ensure that AI/ML research is inclusive of the needs and concerns of CYP and their caregivers. A broad, interdisciplinary, and human-centered approach to AI/ML is essential for developing tools for healthcare workers delivering care, such that the creation and deployment of ML is grounded in local systems, cultures, and clinical practice. Decisions to invest in developing and testing pediatric AI/ML in resource-constrained settings must always be part of a broader evaluation of the overall needs of a healthcare system, considering the critical building blocks underpinning effective, sustainable, and cost-efficient healthcare delivery for CYP.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000583"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11340888/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142037889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How to design equitable digital health tools: A narrative review of design tactics, case studies, and opportunities. 如何设计公平的数字健康工具:对设计策略、案例研究和机遇的叙述性回顾。
Pub Date : 2024-08-22 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000591
Amy Bucher, Beenish M Chaudhry, Jean W Davis, Katharine Lawrence, Emily Panza, Manal Baqer, Rebecca T Feinstein, Sherecce A Fields, Jennifer Huberty, Deanna M Kaplan, Isabelle S Kusters, Frank T Materia, Susanna Y Park, Maura Kepper

With a renewed focus on health equity in the United States driven by national crises and legislation to improve digital healthcare innovation, there is a need for the designers of digital health tools to take deliberate steps to design for equity in their work. A concrete toolkit of methods to design for health equity is needed to support digital health practitioners in this aim. This narrative review summarizes several health equity frameworks to help digital health practitioners conceptualize the equity dimensions of importance for their work, and then provides design approaches that accommodate an equity focus. Specifically, the Double Diamond Model, the IDEAS framework and toolkit, and community collaboration techniques such as participatory design are explored as mechanisms for practitioners to solicit input from members of underserved groups and better design digital health tools that serve their needs. Each of these design methods requires a deliberate effort by practitioners to infuse health equity into the approach. A series of case studies that use different methods to build in equity considerations are offered to provide examples of how this can be accomplished and demonstrate the range of applications available depending on resources, budget, product maturity, and other factors. We conclude with a call for shared rigor around designing digital health tools that deliver equitable outcomes for members of underserved populations.

在国家危机和改善数字医疗创新立法的推动下,美国重新开始关注健康公平问题,因此数字医疗工具的设计者有必要采取审慎措施,在其工作中进行公平设计。我们需要一套具体的健康公平设计方法工具包,以支持数字医疗从业者实现这一目标。本综述总结了几个健康公平框架,以帮助数字健康从业者从概念上理解对其工作具有重要意义的公平维度,然后提供了适应公平重点的设计方法。具体来说,本文探讨了双钻模型、IDEAS 框架和工具包,以及参与式设计等社区合作技术,作为从业人员征求服务不足群体成员意见的机制,并更好地设计出满足他们需求的数字医疗工具。每一种设计方法都需要从业人员深思熟虑,将健康公平融入到设计方法中。我们提供了一系列使用不同方法纳入公平考虑因素的案例研究,以举例说明如何做到这一点,并展示了根据资源、预算、产品成熟度和其他因素的不同,可采用的应用范围。最后,我们呼吁大家在设计数字医疗工具时要共同严格把关,为得不到充分服务的人群提供公平的结果。
{"title":"How to design equitable digital health tools: A narrative review of design tactics, case studies, and opportunities.","authors":"Amy Bucher, Beenish M Chaudhry, Jean W Davis, Katharine Lawrence, Emily Panza, Manal Baqer, Rebecca T Feinstein, Sherecce A Fields, Jennifer Huberty, Deanna M Kaplan, Isabelle S Kusters, Frank T Materia, Susanna Y Park, Maura Kepper","doi":"10.1371/journal.pdig.0000591","DOIUrl":"10.1371/journal.pdig.0000591","url":null,"abstract":"<p><p>With a renewed focus on health equity in the United States driven by national crises and legislation to improve digital healthcare innovation, there is a need for the designers of digital health tools to take deliberate steps to design for equity in their work. A concrete toolkit of methods to design for health equity is needed to support digital health practitioners in this aim. This narrative review summarizes several health equity frameworks to help digital health practitioners conceptualize the equity dimensions of importance for their work, and then provides design approaches that accommodate an equity focus. Specifically, the Double Diamond Model, the IDEAS framework and toolkit, and community collaboration techniques such as participatory design are explored as mechanisms for practitioners to solicit input from members of underserved groups and better design digital health tools that serve their needs. Each of these design methods requires a deliberate effort by practitioners to infuse health equity into the approach. A series of case studies that use different methods to build in equity considerations are offered to provide examples of how this can be accomplished and demonstrate the range of applications available depending on resources, budget, product maturity, and other factors. We conclude with a call for shared rigor around designing digital health tools that deliver equitable outcomes for members of underserved populations.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000591"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11340894/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142037891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cutting consumption without diluting the experience: Preferences for different tactics for reducing alcohol consumption among increasing-and-higher-risk drinkers based on drinking context. 减少消费而不冲淡体验:根据饮酒环境,饮酒风险增加和增加的饮酒者对减少饮酒量的不同策略的偏好。
Pub Date : 2024-08-21 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000523
Melissa Oldham, Tosan Okpako, Corinna Leppin, Claire Garnett, Larisa-Maria Dina, Abigail Stevely, Andrew Jones, John Holmes

Contexts in which people drink vary. Certain drinking contexts may be more amenable to change than others and the effectiveness of alcohol reduction tactics may differ across contexts. This study aimed to explore how helpful context-specific tactics for alcohol reduction were perceived as being amongst increasing-and-higher-risk drinkers. Using the Behaviour Change Technique Taxonomy, context-specific tactics to reduce alcohol consumption were developed by the research team and revised following consultation with experts in behaviour change. In four focus groups (two online, two in-person), N = 20 adult increasing-and-higher-risk drinkers in the UK discussed how helpful tactics developed for four drinking contexts would be: drinking at home alone (19 tactics), drinking at home with partner or family (21 tactics), in the pub with friends (23 tactics), and a meal out of the home (20 tactics). Transcripts were analysed using constant comparison methods. Participants endorsed four broad approaches to reducing alcohol consumption which encompassed all the individual tactics developed by the research team: Diluting and substituting drinks for those containing less alcohol (e.g. switching to soft drinks or no- or low-alcohol drinks); Reducing external pressure to drink (e.g. setting expectations in advance); Creating barriers to drinking (e.g. not buying alcohol to keep at home or storing it in less visible places), and Setting new habits (e.g. breaking old patterns and taking up new hobbies). Three cross-cutting themes influenced how applicable these approaches were to different drinking contexts. These were: Situational pressure, Drinking motives, and Financial motivation. Diluting and substituting drinks which enabled covert reduction and Reducing external pressure to drink were favoured in social drinking contexts. Diluting and substituting drinks which enabled participants to feel that they were having 'a treat' or which facilitated relaxation and Creating barriers to drinking were preferred at home. Interventions to reduce alcohol consumption should offer tactics tailored to individuals' drinking contexts and which account for context-specific individual and situational pressure to drink.

人们饮酒的环境各不相同。某些饮酒环境可能比其他环境更容易改变,不同环境下的减酒策略的效果也可能不同。本研究旨在探讨特定环境下的减少饮酒策略在日益增加的高风险饮酒者心目中的帮助程度。研究小组利用行为改变技术分类法,制定了针对具体情况的减少饮酒策略,并在咨询行为改变专家后进行了修订。在四个焦点小组(两个在线小组,两个现场小组)中,N = 20 名英国成年高危饮酒者讨论了针对以下四种饮酒环境制定的策略会有多大帮助:独自在家饮酒(19 个策略)、与伴侣或家人在家饮酒(21 个策略)、与朋友在酒吧饮酒(23 个策略)以及外出就餐(20 个策略)。采用恒定比较法对记录誊本进行了分析。参与者赞同四种广泛的减少饮酒方法,其中包括研究小组制定的所有策略:稀释饮料或用含酒精较少的饮料代替(如改喝软饮料或无酒精或低酒精饮料);减少饮酒的外部压力(如提前设定期望值);为饮酒设置障碍(如不买酒放在家里或将酒存放在不显眼的地方),以及养成新习惯(如打破旧模式和培养新爱好)。三个交叉主题影响了这些方法在不同饮酒环境中的适用性。它们是情境压力、饮酒动机和经济动机。在社交饮酒环境中,稀释和替代饮品可以隐蔽地减少饮酒量,减少外部饮酒压力。在家中,稀释和替代饮品能让参与者感觉自己在 "享受",或有助于放松和创造饮酒障碍的饮品更受青睐。减少饮酒的干预措施应针对个人的饮酒环境,并考虑到特定环境下个人和情景的饮酒压力。
{"title":"Cutting consumption without diluting the experience: Preferences for different tactics for reducing alcohol consumption among increasing-and-higher-risk drinkers based on drinking context.","authors":"Melissa Oldham, Tosan Okpako, Corinna Leppin, Claire Garnett, Larisa-Maria Dina, Abigail Stevely, Andrew Jones, John Holmes","doi":"10.1371/journal.pdig.0000523","DOIUrl":"10.1371/journal.pdig.0000523","url":null,"abstract":"<p><p>Contexts in which people drink vary. Certain drinking contexts may be more amenable to change than others and the effectiveness of alcohol reduction tactics may differ across contexts. This study aimed to explore how helpful context-specific tactics for alcohol reduction were perceived as being amongst increasing-and-higher-risk drinkers. Using the Behaviour Change Technique Taxonomy, context-specific tactics to reduce alcohol consumption were developed by the research team and revised following consultation with experts in behaviour change. In four focus groups (two online, two in-person), N = 20 adult increasing-and-higher-risk drinkers in the UK discussed how helpful tactics developed for four drinking contexts would be: drinking at home alone (19 tactics), drinking at home with partner or family (21 tactics), in the pub with friends (23 tactics), and a meal out of the home (20 tactics). Transcripts were analysed using constant comparison methods. Participants endorsed four broad approaches to reducing alcohol consumption which encompassed all the individual tactics developed by the research team: Diluting and substituting drinks for those containing less alcohol (e.g. switching to soft drinks or no- or low-alcohol drinks); Reducing external pressure to drink (e.g. setting expectations in advance); Creating barriers to drinking (e.g. not buying alcohol to keep at home or storing it in less visible places), and Setting new habits (e.g. breaking old patterns and taking up new hobbies). Three cross-cutting themes influenced how applicable these approaches were to different drinking contexts. These were: Situational pressure, Drinking motives, and Financial motivation. Diluting and substituting drinks which enabled covert reduction and Reducing external pressure to drink were favoured in social drinking contexts. Diluting and substituting drinks which enabled participants to feel that they were having 'a treat' or which facilitated relaxation and Creating barriers to drinking were preferred at home. Interventions to reduce alcohol consumption should offer tactics tailored to individuals' drinking contexts and which account for context-specific individual and situational pressure to drink.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000523"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338454/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142019808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model. 使用检索增强语言模型提高生物医学数据 GPT-3/4 结果的准确性。
Pub Date : 2024-08-21 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000568
David Soong, Sriram Sridhar, Han Si, Jan-Samuel Wagner, Ana Caroline Costa Sá, Christina Y Yu, Kubra Karagoz, Meijian Guan, Sanyam Kumar, Hisham Hamadeh, Brandon W Higgs

Large language models (LLMs) have made a significant impact on the fields of general artificial intelligence. General purpose LLMs exhibit strong logic and reasoning skills and general world knowledge but can sometimes generate misleading results when prompted on specific subject areas. LLMs trained with domain-specific knowledge can reduce the generation of misleading information (i.e. hallucinations) and enhance the precision of LLMs in specialized contexts. Training new LLMs on specific corpora however can be resource intensive. Here we explored the use of a retrieval-augmented generation (RAG) model which we tested on literature specific to a biomedical research area. OpenAI's GPT-3.5, GPT-4, Microsoft's Prometheus, and a custom RAG model were used to answer 19 questions pertaining to diffuse large B-cell lymphoma (DLBCL) disease biology and treatment. Eight independent reviewers assessed LLM responses based on accuracy, relevance, and readability, rating responses on a 3-point scale for each category. These scores were then used to compare LLM performance. The performance of the LLMs varied across scoring categories. On accuracy and relevance, the RAG model outperformed other models with higher scores on average and the most top scores across questions. GPT-4 was more comparable to the RAG model on relevance versus accuracy. By the same measures, GPT-4 and GPT-3.5 had the highest scores for readability of answers when compared to the other LLMs. GPT-4 and 3.5 also had more answers with hallucinations than the other LLMs, due to non-existent references and inaccurate responses to clinical questions. Our findings suggest that an oncology research-focused RAG model may outperform general-purpose LLMs in accuracy and relevance when answering subject-related questions. This framework can be tailored to Q&A in other subject areas. Further research will help understand the impact of LLM architectures, RAG methodologies, and prompting techniques in answering questions across different subject areas.

大型语言模型(LLM)对通用人工智能领域产生了重大影响。通用 LLM 具备强大的逻辑推理能力和广博的世界知识,但在特定主题领域进行提示时,有时会产生误导性结果。经过特定领域知识训练的 LLMs 可以减少误导信息(即幻觉)的产生,并提高 LLMs 在特定情况下的精确度。然而,在特定语料库中训练新的 LLM 可能会耗费大量资源。在此,我们探索了使用检索增强生成(RAG)模型,并在生物医学研究领域的特定文献中进行了测试。我们使用 OpenAI 的 GPT-3.5、GPT-4、微软的 Prometheus 和自定义 RAG 模型回答了与弥漫大 B 细胞淋巴瘤(DLBCL)疾病生物学和治疗有关的 19 个问题。八位独立审稿人根据准确性、相关性和可读性对 LLM 的回答进行了评估,每类回答按 3 分制评分。然后用这些分数来比较 LLM 的性能。在不同的评分类别中,法律硕士的表现各不相同。在准确性和相关性方面,RAG 模型的表现优于其他模型,平均得分更高,而且在所有问题中得分最高。在相关性和准确性方面,GPT-4 与 RAG 模型更具有可比性。根据相同的衡量标准,与其他 LLM 相比,GPT-4 和 GPT-3.5 在答案的可读性方面得分最高。此外,GPT-4 和 3.5 中出现幻觉的答案也多于其他 LLM,原因是参考文献不存在以及对临床问题的回答不准确。我们的研究结果表明,在回答与主题相关的问题时,以肿瘤研究为重点的 RAG 模型在准确性和相关性方面可能优于通用 LLM。这一框架可根据其他学科领域的问答情况进行调整。进一步的研究将有助于了解 LLM 架构、RAG 方法和提示技术对回答不同学科领域问题的影响。
{"title":"Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model.","authors":"David Soong, Sriram Sridhar, Han Si, Jan-Samuel Wagner, Ana Caroline Costa Sá, Christina Y Yu, Kubra Karagoz, Meijian Guan, Sanyam Kumar, Hisham Hamadeh, Brandon W Higgs","doi":"10.1371/journal.pdig.0000568","DOIUrl":"10.1371/journal.pdig.0000568","url":null,"abstract":"<p><p>Large language models (LLMs) have made a significant impact on the fields of general artificial intelligence. General purpose LLMs exhibit strong logic and reasoning skills and general world knowledge but can sometimes generate misleading results when prompted on specific subject areas. LLMs trained with domain-specific knowledge can reduce the generation of misleading information (i.e. hallucinations) and enhance the precision of LLMs in specialized contexts. Training new LLMs on specific corpora however can be resource intensive. Here we explored the use of a retrieval-augmented generation (RAG) model which we tested on literature specific to a biomedical research area. OpenAI's GPT-3.5, GPT-4, Microsoft's Prometheus, and a custom RAG model were used to answer 19 questions pertaining to diffuse large B-cell lymphoma (DLBCL) disease biology and treatment. Eight independent reviewers assessed LLM responses based on accuracy, relevance, and readability, rating responses on a 3-point scale for each category. These scores were then used to compare LLM performance. The performance of the LLMs varied across scoring categories. On accuracy and relevance, the RAG model outperformed other models with higher scores on average and the most top scores across questions. GPT-4 was more comparable to the RAG model on relevance versus accuracy. By the same measures, GPT-4 and GPT-3.5 had the highest scores for readability of answers when compared to the other LLMs. GPT-4 and 3.5 also had more answers with hallucinations than the other LLMs, due to non-existent references and inaccurate responses to clinical questions. Our findings suggest that an oncology research-focused RAG model may outperform general-purpose LLMs in accuracy and relevance when answering subject-related questions. This framework can be tailored to Q&A in other subject areas. Further research will help understand the impact of LLM architectures, RAG methodologies, and prompting techniques in answering questions across different subject areas.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000568"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338460/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142019809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Laboratory reference intervals influence referral patterns for hemoglobin abnormalities in the Ontario virtual care system. 实验室参考区间对安大略省虚拟医疗系统中血红蛋白异常转诊模式的影响。
Pub Date : 2024-08-21 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000580
Maud Ahmad, Benjamin Chin-Yee, Ian H Chin-Yee, Ben Hedley, Cyrus C Hsia

This retrospective cross-sectional study investigates the impact of laboratory-specific hemoglobin reference intervals on electronic consultation (eConsult) referral patterns for suspected anemia and elevated hemoglobin at a tertiary care center in London, Ontario that serves Southwestern Ontario. The study analyzed referrals through the Ontario Telemedicine Network's eConsult platform for hemoglobin abnormalities, excluding patients under 18 years old, between July 1, 2019, and June 30, 2023.The main outcome measures were influence of hemoglobin reference intervals on the referral patterns for suspected anemia and elevated hemoglobin, as well as the extent of pre-referral laboratory testing. Of the 619 eConsults reviewed, 251 referrals for suspected anemia and 93 for elevated hemoglobin were analyzed. Referral patterns showed significant variance in hemoglobin levels based on different laboratory thresholds. Referrals for suspected anemia in females from laboratories whose lower limit was 120 g/L or greater had a hemoglobin concentration 7.5 g/L greater than referrals that used laboratories with a threshold lower than 120 g/L. The study also identified potential areas for improvement in pre-referral investigations; 44% of eConsults did not provide a ferritin level, 53% were missing a B12 level, and 81% were missing a reticulocyte count. In conclusion, laboratory reference intervals for hemoglobin significantly influence referral patterns for suspected hemoglobin abnormalities in Ontario's eConsult system. There is a need for standardized reference intervals and comprehensive pre-referral testing to avoid unnecessary medicalization and referrals. We propose an anemia management algorithm to guide primary care providers in the pre-referral investigation process.

这项回顾性横断面研究调查了实验室特异性血红蛋白参考区间对安大略省伦敦市一家服务于安大略省西南部的三级医疗中心疑似贫血和血红蛋白升高的电子会诊(eConsult)转诊模式的影响。研究分析了2019年7月1日至2023年6月30日期间通过安大略省远程医疗网络的电子会诊平台转诊的血红蛋白异常患者,不包括18岁以下患者。主要结果指标是血红蛋白参考区间对疑似贫血和血红蛋白升高转诊模式的影响,以及转诊前实验室检测的程度。在审查的 619 份电子会诊中,对 251 份疑似贫血转诊和 93 份血红蛋白升高转诊进行了分析。转诊模式显示,根据不同的实验室阈值,血红蛋白水平存在很大差异。从下限为 120 克/升或更高的实验室转介的女性疑似贫血患者的血红蛋白浓度比使用阈值低于 120 克/升的实验室转介的患者高出 7.5 克/升。该研究还发现了转诊前检查中需要改进的潜在领域;44% 的电子会诊没有提供铁蛋白水平,53% 的电子会诊缺少 B12 水平,81% 的电子会诊缺少网织红细胞计数。总之,血红蛋白的实验室参考区间极大地影响了安大略省电子会诊系统中疑似血红蛋白异常的转诊模式。我们需要标准化的参考区间和全面的转诊前检测,以避免不必要的医疗和转诊。我们提出了一种贫血管理算法,以指导初级保健提供者进行转诊前调查。
{"title":"Laboratory reference intervals influence referral patterns for hemoglobin abnormalities in the Ontario virtual care system.","authors":"Maud Ahmad, Benjamin Chin-Yee, Ian H Chin-Yee, Ben Hedley, Cyrus C Hsia","doi":"10.1371/journal.pdig.0000580","DOIUrl":"10.1371/journal.pdig.0000580","url":null,"abstract":"<p><p>This retrospective cross-sectional study investigates the impact of laboratory-specific hemoglobin reference intervals on electronic consultation (eConsult) referral patterns for suspected anemia and elevated hemoglobin at a tertiary care center in London, Ontario that serves Southwestern Ontario. The study analyzed referrals through the Ontario Telemedicine Network's eConsult platform for hemoglobin abnormalities, excluding patients under 18 years old, between July 1, 2019, and June 30, 2023.The main outcome measures were influence of hemoglobin reference intervals on the referral patterns for suspected anemia and elevated hemoglobin, as well as the extent of pre-referral laboratory testing. Of the 619 eConsults reviewed, 251 referrals for suspected anemia and 93 for elevated hemoglobin were analyzed. Referral patterns showed significant variance in hemoglobin levels based on different laboratory thresholds. Referrals for suspected anemia in females from laboratories whose lower limit was 120 g/L or greater had a hemoglobin concentration 7.5 g/L greater than referrals that used laboratories with a threshold lower than 120 g/L. The study also identified potential areas for improvement in pre-referral investigations; 44% of eConsults did not provide a ferritin level, 53% were missing a B12 level, and 81% were missing a reticulocyte count. In conclusion, laboratory reference intervals for hemoglobin significantly influence referral patterns for suspected hemoglobin abnormalities in Ontario's eConsult system. There is a need for standardized reference intervals and comprehensive pre-referral testing to avoid unnecessary medicalization and referrals. We propose an anemia management algorithm to guide primary care providers in the pre-referral investigation process.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000580"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11338453/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142019810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of machine-learning and logistic regression models for prediction of 30-day unplanned readmission in electronic health records: A development and validation study. 比较机器学习和逻辑回归模型对电子健康记录中 30 天非计划再入院的预测:开发与验证研究。
Pub Date : 2024-08-20 eCollection Date: 2024-08-01 DOI: 10.1371/journal.pdig.0000578
Masao Iwagami, Ryota Inokuchi, Eiryo Kawakami, Tomohide Yamada, Atsushi Goto, Toshiki Kuno, Yohei Hashimoto, Nobuaki Michihata, Tadahiro Goto, Tomohiro Shinozaki, Yu Sun, Yuta Taniguchi, Jun Komiyama, Kazuaki Uda, Toshikazu Abe, Nanako Tamiya

It is expected but unknown whether machine-learning models can outperform regression models, such as a logistic regression (LR) model, especially when the number and types of predictor variables increase in electronic health records (EHRs). We aimed to compare the predictive performance of gradient-boosted decision tree (GBDT), random forest (RF), deep neural network (DNN), and LR with the least absolute shrinkage and selection operator (LR-LASSO) for unplanned readmission. We used EHRs of patients discharged alive from 38 hospitals in 2015-2017 for derivation and in 2018 for validation, including basic characteristics, diagnosis, surgery, procedure, and drug codes, and blood-test results. The outcome was 30-day unplanned readmission. We created six patterns of data tables having different numbers of binary variables (that ≥5% or ≥1% of patients or ≥10 patients had) with and without blood-test results. For each pattern of data tables, we used the derivation data to establish the machine-learning and LR models, and used the validation data to evaluate the performance of each model. The incidence of outcome was 6.8% (23,108/339,513 discharges) and 6.4% (7,507/118,074 discharges) in the derivation and validation datasets, respectively. For the first data table with the smallest number of variables (102 variables that ≥5% of patients had, without blood-test results), the c-statistic was highest for GBDT (0.740), followed by RF (0.734), LR-LASSO (0.720), and DNN (0.664). For the last data table with the largest number of variables (1543 variables that ≥10 patients had, including blood-test results), the c-statistic was highest for GBDT (0.764), followed by LR-LASSO (0.755), RF (0.751), and DNN (0.720), suggesting that the difference between GBDT and LR-LASSO was small and their 95% confidence intervals overlapped. In conclusion, GBDT generally outperformed LR-LASSO to predict unplanned readmission, but the difference of c-statistic became smaller as the number of variables was increased and blood-test results were used.

机器学习模型是否能优于回归模型(如逻辑回归模型),尤其是当电子健康记录(EHR)中预测变量的数量和类型增加时,这一点虽在意料之中,但却不得而知。我们旨在比较梯度提升决策树(GBDT)、随机森林(RF)、深度神经网络(DNN)和带有最小绝对收缩和选择算子的逻辑回归(LR-LASSO)对计划外再入院的预测性能。我们使用了 38 家医院 2015-2017 年出院的存活患者的电子病历作为推导,2018 年出院的存活患者的电子病历作为验证,包括基本特征、诊断、手术、程序和药物代码以及血液检测结果。结果为 30 天非计划再入院。我们创建了六种模式的数据表,这些数据表具有不同数量的二进制变量(≥5% 或≥1% 的患者或≥10 名患者具有),有血液检测结果和无血液检测结果。对于每种数据表模式,我们使用推导数据建立机器学习模型和 LR 模型,并使用验证数据评估每个模型的性能。在推导数据集和验证数据集中,结果发生率分别为 6.8%(23108/339,513 例出院者)和 6.4%(7507/118,074 例出院者)。对于变量数量最少的第一个数据表(≥5% 的患者拥有的 102 个变量,无血液检测结果),GBDT 的 c 统计量最高(0.740),其次是 RF(0.734)、LR-LASSO(0.720)和 DNN(0.664)。在变量数最多的最后一张数据表中(包括血液检测结果在内的 1543 个≥10 名患者拥有的变量),GBDT 的 c 统计量最高(0.764),其次是 LR-LASSO(0.755)、RF(0.751)和 DNN(0.720),这表明 GBDT 和 LR-LASSO 之间的差异很小,且它们的 95% 置信区间重叠。总之,在预测非计划再入院方面,GBDT 总体上优于 LR-LASSO,但随着变量数量的增加和血液检测结果的使用,c 统计量的差异越来越小。
{"title":"Comparison of machine-learning and logistic regression models for prediction of 30-day unplanned readmission in electronic health records: A development and validation study.","authors":"Masao Iwagami, Ryota Inokuchi, Eiryo Kawakami, Tomohide Yamada, Atsushi Goto, Toshiki Kuno, Yohei Hashimoto, Nobuaki Michihata, Tadahiro Goto, Tomohiro Shinozaki, Yu Sun, Yuta Taniguchi, Jun Komiyama, Kazuaki Uda, Toshikazu Abe, Nanako Tamiya","doi":"10.1371/journal.pdig.0000578","DOIUrl":"10.1371/journal.pdig.0000578","url":null,"abstract":"<p><p>It is expected but unknown whether machine-learning models can outperform regression models, such as a logistic regression (LR) model, especially when the number and types of predictor variables increase in electronic health records (EHRs). We aimed to compare the predictive performance of gradient-boosted decision tree (GBDT), random forest (RF), deep neural network (DNN), and LR with the least absolute shrinkage and selection operator (LR-LASSO) for unplanned readmission. We used EHRs of patients discharged alive from 38 hospitals in 2015-2017 for derivation and in 2018 for validation, including basic characteristics, diagnosis, surgery, procedure, and drug codes, and blood-test results. The outcome was 30-day unplanned readmission. We created six patterns of data tables having different numbers of binary variables (that ≥5% or ≥1% of patients or ≥10 patients had) with and without blood-test results. For each pattern of data tables, we used the derivation data to establish the machine-learning and LR models, and used the validation data to evaluate the performance of each model. The incidence of outcome was 6.8% (23,108/339,513 discharges) and 6.4% (7,507/118,074 discharges) in the derivation and validation datasets, respectively. For the first data table with the smallest number of variables (102 variables that ≥5% of patients had, without blood-test results), the c-statistic was highest for GBDT (0.740), followed by RF (0.734), LR-LASSO (0.720), and DNN (0.664). For the last data table with the largest number of variables (1543 variables that ≥10 patients had, including blood-test results), the c-statistic was highest for GBDT (0.764), followed by LR-LASSO (0.755), RF (0.751), and DNN (0.720), suggesting that the difference between GBDT and LR-LASSO was small and their 95% confidence intervals overlapped. In conclusion, GBDT generally outperformed LR-LASSO to predict unplanned readmission, but the difference of c-statistic became smaller as the number of variables was increased and blood-test results were used.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000578"},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11335098/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142010067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PLOS digital health
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1