首页 > 最新文献

BMJ Health & Care Informatics最新文献

英文 中文
Measuring provider-level differences in perioperative workflow using computer vision-based artificial intelligence. 使用基于计算机视觉的人工智能测量围手术期工作流程中提供者级别的差异。
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-21 DOI: 10.1136/bmjhci-2025-101591
Theoren Loo, Brandon Mcglennen, Stephen Incavo, Nate Hilger

Objectives: To evaluate provider-level variability across the full perioperative workflow using a computer vision-based artificial intelligence (AI) system that automatically detects and timestamps operating room events.

Methods: A cross-sectional study of total knee arthroplasty cases performed between September 2022 and March 2025 at a regional health system was conducted. An ambient surgical platform equipped with wall-mounted cameras continuously captured perioperative activity. A YOLO-based model identified patients, staff and equipment, and a transformer-based event detector predicted key perioperative events in real time. Detected events were used to segment cases into eight workflow phases: anaesthesia induction, patient preparation, final preparation, active procedure, postoperation, patient exit, room cleanup and room setup. Provider-level variability in segment durations was evaluated after adjusting for case characteristics, daily surgical volume and team composition.

Results: The computer vision event detection system achieved high agreement with ground truth annotations. Across 2502 surgical cases, significant provider-level variability was observed in all workflow segments except for room exit. Active procedure showed the greatest variation among surgeons (F=28.4, p<0.001; β IQR=-20.9 to 8.8 min) followed by room setup among circulating nurses (F=1.3, p<0.001; β IQR=-5.2 to 4.4 min) and room setup among scrub nurses (F=1.4, p<0.001; β IQR=-3.7 to 3.2 min).

Conclusions: Automated workflow segmentation using computer vision provides a scalable method to evaluate perioperative efficiency with greater granularity. Broader case segmentation may support more targeted and effective surgical quality improvement initiatives.

目的:使用基于计算机视觉的人工智能(AI)系统评估整个围手术期工作流程中提供者级别的可变性,该系统可以自动检测手术室事件并为其添加时间戳。方法:对2022年9月至2025年3月在某地区卫生系统进行的全膝关节置换术病例进行横断面研究。配有壁挂式摄像机的环境手术平台连续捕捉围手术期活动。基于yolo的模型可以识别患者、工作人员和设备,基于变压器的事件检测器可以实时预测关键的围手术期事件。使用检测到的事件将病例划分为八个工作流程阶段:麻醉诱导、患者准备、最终准备、主动程序、术后、患者退出、房间清理和房间设置。在调整了病例特征、每日手术量和团队组成后,评估了提供者水平的分段持续时间的可变性。结果:计算机视觉事件检测系统与地面真值标注具有较高的一致性。在2502例手术病例中,除房间出口外,在所有工作流程部分都观察到显著的提供者水平差异。结论:使用计算机视觉的自动化工作流程分割提供了一种可扩展的方法,以更大的粒度评估围手术期效率。更广泛的病例分割可能支持更有针对性和更有效的手术质量改进举措。
{"title":"Measuring provider-level differences in perioperative workflow using computer vision-based artificial intelligence.","authors":"Theoren Loo, Brandon Mcglennen, Stephen Incavo, Nate Hilger","doi":"10.1136/bmjhci-2025-101591","DOIUrl":"10.1136/bmjhci-2025-101591","url":null,"abstract":"<p><strong>Objectives: </strong>To evaluate provider-level variability across the full perioperative workflow using a computer vision-based artificial intelligence (AI) system that automatically detects and timestamps operating room events.</p><p><strong>Methods: </strong>A cross-sectional study of total knee arthroplasty cases performed between September 2022 and March 2025 at a regional health system was conducted. An ambient surgical platform equipped with wall-mounted cameras continuously captured perioperative activity. A YOLO-based model identified patients, staff and equipment, and a transformer-based event detector predicted key perioperative events in real time. Detected events were used to segment cases into eight workflow phases: anaesthesia induction, patient preparation, final preparation, active procedure, postoperation, patient exit, room cleanup and room setup. Provider-level variability in segment durations was evaluated after adjusting for case characteristics, daily surgical volume and team composition.</p><p><strong>Results: </strong>The computer vision event detection system achieved high agreement with ground truth annotations. Across 2502 surgical cases, significant provider-level variability was observed in all workflow segments except for room exit. Active procedure showed the greatest variation among surgeons (F=28.4, p<0.001; β IQR=-20.9 to 8.8 min) followed by room setup among circulating nurses (F=1.3, p<0.001; β IQR=-5.2 to 4.4 min) and room setup among scrub nurses (F=1.4, p<0.001; β IQR=-3.7 to 3.2 min).</p><p><strong>Conclusions: </strong>Automated workflow segmentation using computer vision provides a scalable method to evaluate perioperative efficiency with greater granularity. Broader case segmentation may support more targeted and effective surgical quality improvement initiatives.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12718557/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145803191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing a non-invasive algorithm for the diagnosis of steatotic liver disease in primary healthcare: a retrospective cohort study. 在初级保健中发展一种非侵入性的诊断脂肪变性肝病的算法:一项回顾性队列研究
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-12 DOI: 10.1136/bmjhci-2025-101620
Maria Spencer-Sandino, Franco Godoy, Danilo Alvares, Felipe Elorrieta, Ilona Argirion, Jill Koshiol, Claudio Vargas, Claudia Marco, Macarena Garrido, Daniel Cabrera, Juan Pablo Arab, Marco Arrese, Laura Huidobro, Francisco Barrera, Catterina Ferreccio

Objective: This study aims to develop an algorithm to detect steatotic liver disease (SLD) risk in low-resource settings without requiring imaging.

Methods: This retrospective cohort study included 826 measurements from 444 participants aged 45-60 years who participated in the MAUCO+ study. Data included ultrasound, vibration-controlled transient elastography (VCTE), anthropometrics and biomarkers. Logistic multivariable regression was used to develop two predictive models for SLD risk, with and without ultrasound, using VCTE as gold standard. Missing data were minimal and retained in the analysis, as their proportion was not statistically relevant. Predictive performance (sensitivity, specificity, positive predictive value and negative predictive value) was compared with the clinically used Fatty Liver Index (FLI).

Results: The algorithm without ultrasound achieved a sensitivity of 81.1% (95% CI 71.7% to 88.4%) and specificity of 71.4% (95% CI 57.9% to 80.4%). The model with ultrasound demonstrated a sensitivity of 91.5% (95% CI 84.1% to 95.6%) and specificity of 70% (95% CI 59.9% to 80.7%). FLI showed an area under the curve (AUC) of 0.762, while our models achieved higher AUCs: 0.878 (with ultrasound) and 0.794 (without ultrasound).

Discussion: Our models offer screening tools for SLD in low-resource primary care. The model without ultrasound outperformed FLI, making it a feasible alternative where imaging is unavailable. The ultrasound-based model demonstrated higher performance, underscoring the value of ultrasound when it is accessible. Integrating these algorithms into preventive programmes could improve early diagnosis, especially in populations with a high burden of obesity and diabetes.

Conclusions: We developed two predictive models for SLD screening in a Chilean cohort. Both showed strong performance and potential for implementation in primary care to support early detection and better disease management.

目的:本研究旨在开发一种在低资源环境下无需影像学检查即可检测脂肪变性肝病(SLD)风险的算法。方法:这项回顾性队列研究包括444名年龄在45-60岁的MAUCO+研究参与者的826项测量。数据包括超声、振动控制瞬态弹性成像(VCTE)、人体测量学和生物标志物。以VCTE为金标准,采用Logistic多变量回归建立有超声和无超声两种SLD风险预测模型。缺失数据极少,并保留在分析中,因为它们的比例在统计上不相关。预测性能(敏感性、特异性、阳性预测值和阴性预测值)与临床使用的脂肪肝指数(FLI)进行比较。结果:在无超声的情况下,该算法的灵敏度为81.1% (95% CI 71.7% ~ 88.4%),特异性为71.4% (95% CI 57.9% ~ 80.4%)。超声模型的灵敏度为91.5% (95% CI为84.1% ~ 95.6%),特异性为70% (95% CI为59.9% ~ 80.7%)。FLI显示曲线下面积(AUC)为0.762,而我们的模型获得了更高的AUC: 0.878(超声)和0.794(无超声)。讨论:我们的模型为低资源初级保健的特殊生活障碍提供了筛查工具。没有超声的模型优于FLI,使其成为不可用成像的可行替代方案。基于超声的模型表现出更高的性能,强调了超声在可访问时的价值。将这些算法纳入预防规划可以改善早期诊断,特别是在肥胖和糖尿病高负担人群中。结论:我们在智利的一个队列中建立了两种SLD筛查的预测模型。两者都显示出在初级保健中实施的强大性能和潜力,以支持早期发现和更好的疾病管理。
{"title":"Developing a non-invasive algorithm for the diagnosis of steatotic liver disease in primary healthcare: a retrospective cohort study.","authors":"Maria Spencer-Sandino, Franco Godoy, Danilo Alvares, Felipe Elorrieta, Ilona Argirion, Jill Koshiol, Claudio Vargas, Claudia Marco, Macarena Garrido, Daniel Cabrera, Juan Pablo Arab, Marco Arrese, Laura Huidobro, Francisco Barrera, Catterina Ferreccio","doi":"10.1136/bmjhci-2025-101620","DOIUrl":"10.1136/bmjhci-2025-101620","url":null,"abstract":"<p><strong>Objective: </strong>This study aims to develop an algorithm to detect steatotic liver disease (SLD) risk in low-resource settings without requiring imaging.</p><p><strong>Methods: </strong>This retrospective cohort study included 826 measurements from 444 participants aged 45-60 years who participated in the MAUCO+ study. Data included ultrasound, vibration-controlled transient elastography (VCTE), anthropometrics and biomarkers. Logistic multivariable regression was used to develop two predictive models for SLD risk, with and without ultrasound, using VCTE as gold standard. Missing data were minimal and retained in the analysis, as their proportion was not statistically relevant. Predictive performance (sensitivity, specificity, positive predictive value and negative predictive value) was compared with the clinically used Fatty Liver Index (FLI).</p><p><strong>Results: </strong>The algorithm without ultrasound achieved a sensitivity of 81.1% (95% CI 71.7% to 88.4%) and specificity of 71.4% (95% CI 57.9% to 80.4%). The model with ultrasound demonstrated a sensitivity of 91.5% (95% CI 84.1% to 95.6%) and specificity of 70% (95% CI 59.9% to 80.7%). FLI showed an area under the curve (AUC) of 0.762, while our models achieved higher AUCs: 0.878 (with ultrasound) and 0.794 (without ultrasound).</p><p><strong>Discussion: </strong>Our models offer screening tools for SLD in low-resource primary care. The model without ultrasound outperformed FLI, making it a feasible alternative where imaging is unavailable. The ultrasound-based model demonstrated higher performance, underscoring the value of ultrasound when it is accessible. Integrating these algorithms into preventive programmes could improve early diagnosis, especially in populations with a high burden of obesity and diabetes.</p><p><strong>Conclusions: </strong>We developed two predictive models for SLD screening in a Chilean cohort. Both showed strong performance and potential for implementation in primary care to support early detection and better disease management.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12699699/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145741033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Core Mental Health Data Set (CMHDS) methods feasibility paper. 核心心理健康数据集(CMHDS)方法可行性论文。
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-12 DOI: 10.1136/bmjhci-2025-101446
Kathryn Mary Abel, Auden Edwardes, Heidi Tranter, Paul Dark, Robert D Sandler, Philip A Kalra, Ann John, Martin Wildman, Philip Bell, Nawar Diar Bakerly, Pauline Whelan

Objectives: Little research focuses on mechanisms underlying the well-recognised relationship between mental and physical health, or its potential to influence adherence and response to treatments. This short report summarises results of the National Institute for Health and Care Research-funded 'Core Mental Health Data Set (CMHDS)' study to embed a digital tool for routine collection of mental health data in physical health studies.

Methods: Four chief investigators of physical health trials were approached to embed the CMHDS into their study. Two trials, one for people receiving specialist cystic fibrosis (CF) care, and the established Salford Kidney Study (SKS) successfully managed to embed CMHDS.

Results: A combined 478 participants from both studies were invited to complete the CMHDS. Of those approached, 88% agreed to complete CMHDS; 44% completed it. In the SKS, people who completed CMHDS were significantly younger and had higher estimated glomerular filtration rates and were from least deprived areas. In the CF study, there was no significant difference in characteristics of participants who did or did not complete the tool.

Discussion: It was feasible, and researchers and participants considered it acceptable, to embed the CMHDS in physical health studies as part of routine data collection.

Conclusion: Future studies should embed the CMHDS routinely and encourage completion to minimise bias and optimise the added value of having mental health covariates or predictor variables in physical health studies.

目的:很少有研究关注心理和身体健康之间公认关系的潜在机制,或其影响治疗依从性和反应的潜力。这份简短的报告总结了国家卫生与保健研究所资助的“核心心理健康数据集(CMHDS)”研究的结果,该研究将一个用于常规收集心理健康数据的数字工具嵌入到身体健康研究中。方法:与4位身体健康试验的首席研究员接洽,将CMHDS纳入他们的研究。两项试验,一项是针对接受囊肿性纤维化(CF)治疗的患者,另一项是已建立的索尔福德肾脏研究(SKS),成功地嵌入了CMHDS。结果:两项研究共邀请478名参与者完成CMHDS。在接受治疗的患者中,88%的人同意完成cmds;44%的人完成了。在SKS中,完成CMHDS的患者明显更年轻,估计肾小球滤过率更高,并且来自最贫困地区。在CF研究中,完成或未完成该工具的参与者在特征上没有显著差异。讨论:将CMHDS作为常规数据收集的一部分纳入身体健康研究是可行的,研究者和参与者都认为这是可以接受的。结论:未来的研究应常规纳入CMHDS,并鼓励完成,以尽量减少偏倚,优化心理健康协变量或预测变量在身体健康研究中的附加价值。
{"title":"Core Mental Health Data Set (CMHDS) methods feasibility paper.","authors":"Kathryn Mary Abel, Auden Edwardes, Heidi Tranter, Paul Dark, Robert D Sandler, Philip A Kalra, Ann John, Martin Wildman, Philip Bell, Nawar Diar Bakerly, Pauline Whelan","doi":"10.1136/bmjhci-2025-101446","DOIUrl":"10.1136/bmjhci-2025-101446","url":null,"abstract":"<p><strong>Objectives: </strong>Little research focuses on mechanisms underlying the well-recognised relationship between mental and physical health, or its potential to influence adherence and response to treatments. This short report summarises results of the National Institute for Health and Care Research-funded 'Core Mental Health Data Set (CMHDS)' study to embed a digital tool for routine collection of mental health data in physical health studies.</p><p><strong>Methods: </strong>Four chief investigators of physical health trials were approached to embed the CMHDS into their study. Two trials, one for people receiving specialist cystic fibrosis (CF) care, and the established Salford Kidney Study (SKS) successfully managed to embed CMHDS.</p><p><strong>Results: </strong>A combined 478 participants from both studies were invited to complete the CMHDS. Of those approached, 88% agreed to complete CMHDS; 44% completed it. In the SKS, people who completed CMHDS were significantly younger and had higher estimated glomerular filtration rates and were from least deprived areas. In the CF study, there was no significant difference in characteristics of participants who did or did not complete the tool.</p><p><strong>Discussion: </strong>It was feasible, and researchers and participants considered it acceptable, to embed the CMHDS in physical health studies as part of routine data collection.</p><p><strong>Conclusion: </strong>Future studies should embed the CMHDS routinely and encourage completion to minimise bias and optimise the added value of having mental health covariates or predictor variables in physical health studies.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12699609/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145740983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive model for managing the clinical risk of emergency department patients: protocol for a systematic review. 管理急诊科患者临床风险的预测模型:系统评价方案。
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-11 DOI: 10.1136/bmjhci-2025-101584
Maria João Baptista Rente, Ana Lúcia da Silva João, David José Murteira Mendes, Liliana Andreia Neves da Mota

Introduction: Emergency departments are facing increasing strain due to overcrowding and resource shortages, leading to the suspension of some services. Stratifying the clinical risk-defined as the severity and likelihood of harm-is crucial for anticipating care needs and supporting decision-making. Implementing predictive models for clinical risk management offers a technological solution to this challenge. This systematic review will evaluate the performance and usefulness of a predictive model for managing the clinical risk of people who visit the emergency department.

Methods and analysis: Eight electronic databases will be searched (CINAHL Plus, Health Technology Assessment Database, MedicLatina, MEDLINE, PubMed, Scopus, Cochrane Plus Collection, Web of Science). Risk of bias will be assessed using the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies and Prediction Model Risk of Bias Assessment Tool.

Ethics and dissemination: Ethical approval is not required. Results will be disseminated through peer-reviewed publications.

Prospero registration number: CRD42024556926.

导言:由于过度拥挤和资源短缺,急诊科面临越来越大的压力,导致一些服务暂停。对临床风险进行分层——定义为危害的严重程度和可能性——对于预测护理需求和支持决策至关重要。实施临床风险管理的预测模型为这一挑战提供了一种技术解决方案。这个系统的回顾将评估的性能和有用的预测模型管理的临床风险的人谁访问急诊科。方法与分析:将检索8个电子数据库(CINAHL Plus、卫生技术评估数据库、MedicLatina、MEDLINE、PubMed、Scopus、Cochrane Plus Collection、Web of Science)。偏倚风险将使用预测建模研究系统评价关键评估和数据提取清单和预测模型偏倚风险评估工具进行评估。伦理和传播:不需要伦理批准。结果将通过同行评议的出版物传播。普洛斯彼罗注册号:CRD42024556926。
{"title":"Predictive model for managing the clinical risk of emergency department patients: protocol for a systematic review.","authors":"Maria João Baptista Rente, Ana Lúcia da Silva João, David José Murteira Mendes, Liliana Andreia Neves da Mota","doi":"10.1136/bmjhci-2025-101584","DOIUrl":"10.1136/bmjhci-2025-101584","url":null,"abstract":"<p><strong>Introduction: </strong>Emergency departments are facing increasing strain due to overcrowding and resource shortages, leading to the suspension of some services. Stratifying the clinical risk-defined as the severity and likelihood of harm-is crucial for anticipating care needs and supporting decision-making. Implementing predictive models for clinical risk management offers a technological solution to this challenge. This systematic review will evaluate the performance and usefulness of a predictive model for managing the clinical risk of people who visit the emergency department.</p><p><strong>Methods and analysis: </strong>Eight electronic databases will be searched (CINAHL Plus, Health Technology Assessment Database, MedicLatina, MEDLINE, PubMed, Scopus, Cochrane Plus Collection, Web of Science). Risk of bias will be assessed using the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies and Prediction Model Risk of Bias Assessment Tool.</p><p><strong>Ethics and dissemination: </strong>Ethical approval is not required. Results will be disseminated through peer-reviewed publications.</p><p><strong>Prospero registration number: </strong>CRD42024556926.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12699717/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145741001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence-driven anthropometric assessment for young children: evaluating the accuracy and practicality of a digital image-based length and weight prediction tool. 人工智能驱动的幼儿人体测量评估:评估基于数字图像的长度和体重预测工具的准确性和实用性。
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-09 DOI: 10.1136/bmjhci-2025-101540
Daniel Chan, Mei Chien Chua, Matthew Hadimaja, Sankha Mukherjee, Jill Wong, Fabian Yap

Background: Monitoring early childhood growth is vital, as growth faltering could indicate nutritional or health issues requiring prompt intervention. Our study's aim was to assess the performance of a length-weight artificial intelligence (LWAI) tool for predicting children's length and weight from smartphone images.

Methods: This observational, single-centre study recruited children aged 0-18 months. Investigators measured length and weight in clinic using WHO standard recommendations and captured six images per child in a supine position, while parents took six similar images at home. Within each image, LWAI identifies specific body landmarks and a reference object, then extracts and uses image features to predict the child's length and weight. The LWAI's performance was assessed by comparing length/weight prediction versus actual measurements. User experience was collected through questionnaires.

Results: A total of 215 participants (mean age 6.1 months) were included, and length/weight predictions were generated for 98% (2184/2224) of the images. The mean absolute error (MAE) and mean absolute percentage error (MAPE) for length were 2.47 cm (4.04%) for individual images and 1.89 cm (3.18%) for grouped images (participants with ≥9 images). The corresponding MAE/MAPE for weight were 0.69 kg (11.68%) and 0.56 kg (9.02%), respectively. Regarding usability, 97% of parents who reported not routinely measuring their child's growth indicated that they would start doing so regularly if a digital tool was available to them.

Conclusions: The LWAI tool can predict length and weight in children ≤18 months, offering a practical, convenient, artificial intelligence-powered alternative for growth monitoring in home and clinical settings.

Trial registration number: NCT05079776.

背景:监测儿童早期生长是至关重要的,因为生长迟缓可能表明需要及时干预的营养或健康问题。我们的研究目的是评估长度-重量人工智能(LWAI)工具的性能,该工具可以从智能手机图像中预测儿童的长度和体重。方法:这项观察性的单中心研究招募了0-18个月的儿童。调查人员在诊所使用世卫组织的标准建议测量了身高和体重,并为每个儿童拍摄了6张仰卧位的照片,而父母在家中拍摄了6张类似的照片。在每张图像中,LWAI识别特定的身体标志和参考对象,然后提取并使用图像特征来预测儿童的身高和体重。通过比较长度/重量预测值与实际测量值来评估LWAI的性能。通过问卷收集用户体验。结果:共纳入215名参与者(平均年龄6.1个月),对98%(2184/2224)的图像进行了长度/体重预测。个体图像长度的平均绝对误差(MAE)和平均绝对百分比误差(MAPE)为2.47 cm(4.04%),分组图像长度的平均绝对误差(MAE)为1.89 cm(3.18%)。体重对应的MAE/MAPE分别为0.69 kg(11.68%)和0.56 kg(9.02%)。在可用性方面,97%没有定期测量孩子成长的父母表示,如果有数字工具,他们会开始定期这样做。结论:LWAI工具可以预测≤18个月儿童的身高和体重,为家庭和临床环境中的生长监测提供了一种实用、方便、人工智能驱动的替代方案。试验注册号:NCT05079776。
{"title":"Artificial intelligence-driven anthropometric assessment for young children: evaluating the accuracy and practicality of a digital image-based length and weight prediction tool.","authors":"Daniel Chan, Mei Chien Chua, Matthew Hadimaja, Sankha Mukherjee, Jill Wong, Fabian Yap","doi":"10.1136/bmjhci-2025-101540","DOIUrl":"10.1136/bmjhci-2025-101540","url":null,"abstract":"<p><strong>Background: </strong>Monitoring early childhood growth is vital, as growth faltering could indicate nutritional or health issues requiring prompt intervention. Our study's aim was to assess the performance of a length-weight artificial intelligence (LWAI) tool for predicting children's length and weight from smartphone images.</p><p><strong>Methods: </strong>This observational, single-centre study recruited children aged 0-18 months. Investigators measured length and weight in clinic using WHO standard recommendations and captured six images per child in a supine position, while parents took six similar images at home. Within each image, LWAI identifies specific body landmarks and a reference object, then extracts and uses image features to predict the child's length and weight. The LWAI's performance was assessed by comparing length/weight prediction versus actual measurements. User experience was collected through questionnaires.</p><p><strong>Results: </strong>A total of 215 participants (mean age 6.1 months) were included, and length/weight predictions were generated for 98% (2184/2224) of the images. The mean absolute error (MAE) and mean absolute percentage error (MAPE) for length were 2.47 cm (4.04%) for individual images and 1.89 cm (3.18%) for grouped images (participants with ≥9 images). The corresponding MAE/MAPE for weight were 0.69 kg (11.68%) and 0.56 kg (9.02%), respectively. Regarding usability, 97% of parents who reported not routinely measuring their child's growth indicated that they would start doing so regularly if a digital tool was available to them.</p><p><strong>Conclusions: </strong>The LWAI tool can predict length and weight in children ≤18 months, offering a practical, convenient, artificial intelligence-powered alternative for growth monitoring in home and clinical settings.</p><p><strong>Trial registration number: </strong>NCT05079776.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12699723/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145713200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of EHR direct-to-patient outreach on ambulatory advance directives completion among older adults. EHR直接对患者外展对老年人门诊预先指示完成的影响。
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-09 DOI: 10.1136/bmjhci-2025-101524
Nancy Kim, Andrew Pugliese, Abby Dancause, Rita Amendola, Karen Brown

Background: Advance directives (AD) are crucial for aligning healthcare with end-of-life preferences, yet documentation rates remain low, often only 5-15%. Leveraging electronic health records (EHRs) for automated outreach may offer a promising strategy to enhance AD completion without placing additional burdens on already busy clinicians.

Methods: We evaluated the feasibility and effectiveness of EHR-based AD outreach within the Yale New Haven Health System (YNHHS). In April 2024, a targeted message was sent via Epic's MyChart over seven business days, coinciding with National Healthcare Decisions Day. Patients aged ≥65 years with an active MyChart, no existing AD documentation and at least one primary care visit within 2 years were eligible; those hospitalised or in hospice were excluded. The message provided education about advance care planning, encouraged completion of a Healthcare Representative Form and/or Living Will Form and offered instructions for uploading documents directly to the EHR or returning them to a primary care provider's office. A reminder was sent 90 days later.

Results: Outreach reached 25 571 patients, with 61% viewing the MyChart message. Six months after intervention, AD completion across YNHHS rose from 39.9% (28 324/70 911) to 42.8% (30 230/70 583), translating to a 7.5% conversion rate in the targeted cohort. There was no observed increase in patient messaging or clinical staff workload.

Conclusion: These findings suggest that EHR-integrated campaigns can effectively increase AD documentation among older adults without straining providers. By prompting patients to complete forms at their convenience, this scalable and sustainable intervention may be adapted for wider populations and other preventive or chronic care needs.

背景:预先指示(AD)对于使医疗保健与临终偏好保持一致至关重要,但记录率仍然很低,通常只有5-15%。利用电子健康记录(EHRs)进行自动外展可能是一种很有前途的策略,可以提高AD的完成程度,而不会给已经忙碌的临床医生带来额外的负担。方法:我们评估了耶鲁大学纽黑文卫生系统(YNHHS)基于电子病历的AD外展的可行性和有效性。2024年4月,Epic在7个工作日内通过MyChart发送了一条有针对性的信息,恰逢国家医疗保健决策日。年龄≥65岁、MyChart活跃、无阿尔茨海默病记录、2年内至少一次初级保健就诊的患者符合条件;那些住院或临终关怀的人被排除在外。这条信息提供了关于预先护理计划的教育,鼓励填写医疗代表表格和/或生前遗嘱表格,并提供了将文件直接上传至电子病历或将其归还给初级保健提供者办公室的说明。90天后发出了一个提醒。结果:外展覆盖了25571名患者,61%的患者查看了MyChart信息。干预6个月后,YNHHS的AD完成率从39.9%(28 324/70 911)上升到42.8%(30 230/70 583),目标人群的转化率为7.5%。没有观察到患者信息或临床工作人员工作量的增加。结论:这些发现表明,ehr整合运动可以有效地增加老年人的AD记录,而不会给提供者带来压力。通过促使患者在方便时填写表格,这种可扩展和可持续的干预措施可能适用于更广泛的人群和其他预防或慢性护理需求。
{"title":"Impact of EHR direct-to-patient outreach on ambulatory advance directives completion among older adults.","authors":"Nancy Kim, Andrew Pugliese, Abby Dancause, Rita Amendola, Karen Brown","doi":"10.1136/bmjhci-2025-101524","DOIUrl":"10.1136/bmjhci-2025-101524","url":null,"abstract":"<p><strong>Background: </strong>Advance directives (AD) are crucial for aligning healthcare with end-of-life preferences, yet documentation rates remain low, often only 5-15%. Leveraging electronic health records (EHRs) for automated outreach may offer a promising strategy to enhance AD completion without placing additional burdens on already busy clinicians.</p><p><strong>Methods: </strong>We evaluated the feasibility and effectiveness of EHR-based AD outreach within the Yale New Haven Health System (YNHHS). In April 2024, a targeted message was sent via Epic's MyChart over seven business days, coinciding with National Healthcare Decisions Day. Patients aged ≥65 years with an active MyChart, no existing AD documentation and at least one primary care visit within 2 years were eligible; those hospitalised or in hospice were excluded. The message provided education about advance care planning, encouraged completion of a Healthcare Representative Form and/or Living Will Form and offered instructions for uploading documents directly to the EHR or returning them to a primary care provider's office. A reminder was sent 90 days later.</p><p><strong>Results: </strong>Outreach reached 25 571 patients, with 61% viewing the MyChart message. Six months after intervention, AD completion across YNHHS rose from 39.9% (28 324/70 911) to 42.8% (30 230/70 583), translating to a 7.5% conversion rate in the targeted cohort. There was no observed increase in patient messaging or clinical staff workload.</p><p><strong>Conclusion: </strong>These findings suggest that EHR-integrated campaigns can effectively increase AD documentation among older adults without straining providers. By prompting patients to complete forms at their convenience, this scalable and sustainable intervention may be adapted for wider populations and other preventive or chronic care needs.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12699565/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145713241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What emotions reveal about patient safety: GPT-4-based sentiment and emotion analysis of 11056 German CIRS medical reports (2005-2024). 情绪对患者安全的影响:基于gpt -4的11056份德国CIRS医疗报告(2005-2024)的情绪和情绪分析
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-04 DOI: 10.1136/bmjhci-2025-101741
Carlos Ramon Hölzing, Patrick Meybohm, Peter Kranke, Oliver Happel, Charlotte Meynhardt

Objectives: Critical incident reporting systems (CIRS) collect narrative reports on medical errors, but emotional signals within these reports, potential indicators of perceived risk and systemic weakness, are rarely examined. This cross-sectional study applied large language model-based sentiment analysis to explore how emotional expression in CIRS data may support artificial intelligence-enhanced patient safety monitoring.

Methods: We analysed 11 056 anonymised German incident reports submitted between 2005 and 2024 using GPT-4 (Generative Pre-trained Transformer 4, gpt-4-turbo-2024-04-09, zero shot) to assign sentiment labels and quantify five emotions (fear, frustration, anger, sadness, guilt; scale 0-1). Emotional profiles were clustered (k-means) and thematic patterns extracted via Latent Dirichlet allocation. Associations were examined using non-parametric tests.

Results: Negative sentiment dominated (95.6%, 95% CI 94.9% to 96.2%). Fear (mean=0.63, SD=0.21) and frustration (mean=0.59, SD=0.19) were most prevalent. Emergency care settings showed higher fear (p<0.05) and guilt (p<0.001). Reports with strong emotional expression, especially fear, guilt and sadness, were less likely to receive formal feedback (43.1% (95% CI 41.7% to 44.5%) vs 48.1% (95% CI 46.5% to 49.7%); absolute difference=5.0 percentage points (95% CI 2.7 to 7.3); p=0.001).

Discussion: Emotion intensity did not consistently correlate with harm severity but was linked to care context and systemic complexity. Emotion clusters reflected distinct clinical and organisational patterns, from acute emergencies to procedural failures.

Conclusion: Emotion-based analysis of incident reports provides insight into perceived burden and care context. Sentiment profiling may improve system interpretability and support emotion-sensitive safety culture and feedback. Leveraging large language models can reduce reviewer workload and enable more targeted triage of emotionally complex reports.

目的:关键事件报告系统(CIRS)收集医疗事故的叙述性报告,但这些报告中的情绪信号,感知风险和系统弱点的潜在指标,很少被检查。本横断面研究应用基于大型语言模型的情感分析来探索CIRS数据中的情感表达如何支持人工智能增强的患者安全监测。方法:我们使用GPT-4(生成式预训练变压器4,GPT-4 -turbo-2024-04-09,零射击)分析了2005年至2024年间提交的11 056份匿名德国事件报告,分配情绪标签并量化五种情绪(恐惧、沮丧、愤怒、悲伤、内疚;量表0-1)。情绪特征聚类(k-means)和主题模式提取通过潜狄利克雷分配。使用非参数检验检验相关性。结果:负面情绪占主导地位(95.6%,95% CI 94.9% ~ 96.2%)。恐惧(平均=0.63,SD=0.21)和沮丧(平均=0.59,SD=0.19)最为普遍。紧急护理环境表现出更高的恐惧(p讨论:情绪强度与伤害严重程度并不一致相关,但与护理环境和系统复杂性有关。情感集群反映了不同的临床和组织模式,从急性紧急情况到程序失败。结论:基于情绪的事件报告分析提供了对感知负担和护理环境的洞察。情绪分析可以提高系统的可解释性,并支持情绪敏感的安全文化和反馈。利用大型语言模型可以减少审稿人的工作量,并对情感复杂的报告进行更有针对性的分类。
{"title":"What emotions reveal about patient safety: GPT-4-based sentiment and emotion analysis of 11056 German CIRS medical reports (2005-2024).","authors":"Carlos Ramon Hölzing, Patrick Meybohm, Peter Kranke, Oliver Happel, Charlotte Meynhardt","doi":"10.1136/bmjhci-2025-101741","DOIUrl":"10.1136/bmjhci-2025-101741","url":null,"abstract":"<p><strong>Objectives: </strong>Critical incident reporting systems (CIRS) collect narrative reports on medical errors, but emotional signals within these reports, potential indicators of perceived risk and systemic weakness, are rarely examined. This cross-sectional study applied large language model-based sentiment analysis to explore how emotional expression in CIRS data may support artificial intelligence-enhanced patient safety monitoring.</p><p><strong>Methods: </strong>We analysed 11 056 anonymised German incident reports submitted between 2005 and 2024 using GPT-4 (Generative Pre-trained Transformer 4, <i>gpt-4-turbo-2024-04-09,</i> zero shot) to assign sentiment labels and quantify five emotions (fear, frustration, anger, sadness, guilt; scale 0-1). Emotional profiles were clustered (k-means) and thematic patterns extracted via Latent Dirichlet allocation. Associations were examined using non-parametric tests.</p><p><strong>Results: </strong>Negative sentiment dominated (95.6%, 95% CI 94.9% to 96.2%). Fear (mean=0.63, SD=0.21) and frustration (mean=0.59, SD=0.19) were most prevalent. Emergency care settings showed higher fear (p<0.05) and guilt (p<0.001). Reports with strong emotional expression, especially fear, guilt and sadness, were less likely to receive formal feedback (43.1% (95% CI 41.7% to 44.5%) vs 48.1% (95% CI 46.5% to 49.7%); absolute difference=5.0 percentage points (95% CI 2.7 to 7.3); p=0.001).</p><p><strong>Discussion: </strong>Emotion intensity did not consistently correlate with harm severity but was linked to care context and systemic complexity. Emotion clusters reflected distinct clinical and organisational patterns, from acute emergencies to procedural failures.</p><p><strong>Conclusion: </strong>Emotion-based analysis of incident reports provides insight into perceived burden and care context. Sentiment profiling may improve system interpretability and support emotion-sensitive safety culture and feedback. Leveraging large language models can reduce reviewer workload and enable more targeted triage of emotionally complex reports.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12684117/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145676240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence in clinical risk prediction: promise, performance and the path forward? 人工智能在临床风险预测中的应用:前景、表现和发展方向?
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-12-03 DOI: 10.1136/bmjhci-2025-101707
Padmanesan Narasimhan, Usman Iqbal, Yu-Chuan Li
{"title":"Artificial intelligence in clinical risk prediction: promise, performance and the path forward?","authors":"Padmanesan Narasimhan, Usman Iqbal, Yu-Chuan Li","doi":"10.1136/bmjhci-2025-101707","DOIUrl":"10.1136/bmjhci-2025-101707","url":null,"abstract":"","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12682157/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145666985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital relapse prevention plan for substance use disorders: study protocol for a multicentre randomised controlled trial. 物质使用障碍的数字复发预防计划:多中心随机对照试验的研究方案。
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-24 DOI: 10.1136/bmjhci-2025-101808
Rafael Salom, Álvaro Pico Rada, Juan Jesús Muñoz García, Helena García-Mieres, Antonio Artés-Rodríguez

IntroductionRelapse remains a major challenge in the treatment of substance use disorders (SUDs), particularly during follow-up. Digital tools are emerging as supportive resources, but few deliver real-time interventions. This study will examine the effectiveness of a digital relapse prevention plan (DRPP) integrated into a certified mobile application to detect early risk signs and provide immediate, personalised responses.Methods and analysisA multicentre randomised controlled trial will recruit adults with SUD. Participants will be randomised to standard treatment plus a restricted app version (control) or the same treatment with the full app, including automated alerts and DRPP access (experimental). The plan can be activated manually or automatically through smartphone sensors detecting risk patterns. The primary outcome will be time to first clinical relapse, while secondary outcomes will include patient satisfaction with the DRPP, adherence and perceived emotional self-regulation. Findings are expected to provide robust evidence on the feasibility, acceptability and clinical utility of digital relapse prevention strategies.Ethics and disseminationThis study obtained ethical approval (code 25/327) from Committee of Hospital Universitario 12 de Octubre.Trial registration number:NCT07052175.

复发仍然是物质使用障碍(sud)治疗中的一个主要挑战,特别是在随访期间。数字工具正在成为支持性资源,但很少有工具能够提供实时干预。本研究将检验将数字复发预防计划(DRPP)集成到经过认证的移动应用程序中的有效性,以检测早期风险迹象并提供即时的个性化响应。方法与分析一项多中心随机对照试验将招募患有SUD的成人患者。参与者将被随机分配到标准治疗加限制应用程序版本(对照)或与完整应用程序相同的治疗,包括自动警报和DRPP访问(实验)。该计划可以手动启动,也可以通过智能手机传感器检测风险模式自动启动。主要结果将是首次临床复发的时间,而次要结果将包括患者对DRPP的满意度,依从性和感知的情绪自我调节。研究结果有望为数字复发预防策略的可行性、可接受性和临床应用提供强有力的证据。伦理与传播本研究于10月12日获得医院大学委员会的伦理批准(代码25/327)。试验注册号:NCT07052175。
{"title":"Digital relapse prevention plan for substance use disorders: study protocol for a multicentre randomised controlled trial.","authors":"Rafael Salom, Álvaro Pico Rada, Juan Jesús Muñoz García, Helena García-Mieres, Antonio Artés-Rodríguez","doi":"10.1136/bmjhci-2025-101808","DOIUrl":"10.1136/bmjhci-2025-101808","url":null,"abstract":"<p><p><b>Introduction</b>Relapse remains a major challenge in the treatment of substance use disorders (SUDs), particularly during follow-up. Digital tools are emerging as supportive resources, but few deliver real-time interventions. This study will examine the effectiveness of a digital relapse prevention plan (DRPP) integrated into a certified mobile application to detect early risk signs and provide immediate, personalised responses.<b>Methods and analysis</b>A multicentre randomised controlled trial will recruit adults with SUD. Participants will be randomised to standard treatment plus a restricted app version (control) or the same treatment with the full app, including automated alerts and DRPP access (experimental). The plan can be activated manually or automatically through smartphone sensors detecting risk patterns. The primary outcome will be time to first clinical relapse, while secondary outcomes will include patient satisfaction with the DRPP, adherence and perceived emotional self-regulation. Findings are expected to provide robust evidence on the feasibility, acceptability and clinical utility of digital relapse prevention strategies.<b>Ethics and dissemination</b>This study obtained ethical approval (code 25/327) from Committee of Hospital Universitario 12 de Octubre.<b>Trial registration number:</b>NCT07052175.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12645601/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145602157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards an AI-driven registry for postoperative complications: a proof-of-concept study evaluating the opportunities and challenges of AI models. 迈向人工智能驱动的术后并发症注册:一项评估人工智能模型机遇和挑战的概念验证研究。
IF 4.4 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-21 DOI: 10.1136/bmjhci-2025-101566
Emilie Even Dencker, Andreas Skov Millarch, Alexander Bonde, Anders Troelsen, Jens Winther Jensen, Martin Sillesen

Objectives: Postoperative complications (PCs) require substantial resources to manage and are cumbersome to monitor. Artificial intelligence (AI), particularly natural language processing (NLP), offers a potential solution by automating and streamlining these processes, but perceived PC rates may differ depending on model optimisation strategies. This study aimed to develop a mock-up AI-driven automated registry for PCs. We hypothesised that using NLP to obtain longitudinal overviews of key quality metrics is feasible, but that optimisation strategies impacted the observed rate of PCs.

Methods: We analysed 100 505 surgical cases from 12 Danish hospitals between 2017 and 2021. Previously validated NLP models were applied to detect seven types of PCs, using two different threshold settings: a set of thresholds optimised for positive predictive value (precision), referred to as F-score of 0.5, and a set of thresholds optimised for sensitivity, referred to as F-score of 2. Trends in PC rates over time were assessed, and hospital-level variations were examined using logistic regression models.

Results: The NLP models detected 8512 or 15 892 PCs, depending on threshold selection, corresponding to total PC rates of 9.14% and 17.1%, respectively. Most PCs showed stable or increasing trends over time, regardless of threshold setting. Regression analyses demonstrated that threshold selection significantly influenced findings, impacting hospital comparisons.

Conclusion: We demonstrate that NLP can be used for automated PC detection. However, threshold selection and additional performance metrics must be carefully considered.

目的:术后并发症(pc)需要大量的资源来管理和繁琐的监测。人工智能(AI),特别是自然语言处理(NLP),通过自动化和简化这些过程提供了一个潜在的解决方案,但感知PC率可能因模型优化策略而异。本研究旨在为个人电脑开发一个人工智能驱动的自动注册表模型。我们假设使用NLP获得关键质量指标的纵向概述是可行的,但优化策略会影响观察到的pc率。方法:我们分析了2017年至2021年丹麦12家医院的100505例手术病例。先前验证的NLP模型被应用于检测七种类型的pc,使用两种不同的阈值设置:一组为正预测值(精度)优化的阈值,称为f分数为0.5,一组为灵敏度优化的阈值,称为f分数为2。评估了PC率随时间的趋势,并使用逻辑回归模型检查了医院水平的变化。结果:根据阈值的选择,NLP模型检测到8512或15 892个PC,对应的PC总率分别为9.14%和17.1%。无论阈值设置如何,大多数个人电脑都表现出稳定或增长的趋势。回归分析表明,阈值选择显著影响结果,影响医院比较。结论:我们证明了NLP可以用于PC的自动检测。但是,必须仔细考虑阈值选择和其他性能指标。
{"title":"Towards an AI-driven registry for postoperative complications: a proof-of-concept study evaluating the opportunities and challenges of AI models.","authors":"Emilie Even Dencker, Andreas Skov Millarch, Alexander Bonde, Anders Troelsen, Jens Winther Jensen, Martin Sillesen","doi":"10.1136/bmjhci-2025-101566","DOIUrl":"10.1136/bmjhci-2025-101566","url":null,"abstract":"<p><strong>Objectives: </strong>Postoperative complications (PCs) require substantial resources to manage and are cumbersome to monitor. Artificial intelligence (AI), particularly natural language processing (NLP), offers a potential solution by automating and streamlining these processes, but perceived PC rates may differ depending on model optimisation strategies. This study aimed to develop a mock-up AI-driven automated registry for PCs. We hypothesised that using NLP to obtain longitudinal overviews of key quality metrics is feasible, but that optimisation strategies impacted the observed rate of PCs.</p><p><strong>Methods: </strong>We analysed 100 505 surgical cases from 12 Danish hospitals between 2017 and 2021. Previously validated NLP models were applied to detect seven types of PCs, using two different threshold settings: a set of thresholds optimised for positive predictive value (precision), referred to as F-score of 0.5, and a set of thresholds optimised for sensitivity, referred to as F-score of 2. Trends in PC rates over time were assessed, and hospital-level variations were examined using logistic regression models.</p><p><strong>Results: </strong>The NLP models detected 8512 or 15 892 PCs, depending on threshold selection, corresponding to total PC rates of 9.14% and 17.1%, respectively. Most PCs showed stable or increasing trends over time, regardless of threshold setting. Regression analyses demonstrated that threshold selection significantly influenced findings, impacting hospital comparisons.</p><p><strong>Conclusion: </strong>We demonstrate that NLP can be used for automated PC detection. However, threshold selection and additional performance metrics must be carefully considered.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12658484/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145581799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BMJ Health & Care Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1