首页 > 最新文献

Lancet Digital Health最新文献

英文 中文
Efficacy of standalone smartphone apps for mental health: an updated systematic review and meta-analysis 独立智能手机应用程序对心理健康的功效:一项最新的系统综述和荟萃分析。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100923
Jennifer K Kulke MSc , Lukas M Fuhrmann MSc , Prof Matthias Berking PhD , Prof David D Ebert PhD , Prof Harald Baumeister PhD , Ariqa Derfiora MSc , Avery Veldhouse MSc , Kiona K Weisel PhD
<div><h3>Background</h3><div>To map out the potential benefits of widely available smartphone apps for mental health, especially in contexts where face-to-face services are limited or unavailable, it is crucial to examine their efficacy compared with inactive controls. Standalone smartphone apps might offer an accessible option for individuals waiting for treatment or living in under-resourced settings. Given the currently inconclusive evidence regarding these apps, this systematic review and meta-analysis aimed to assess the efficacy and study quality of randomised controlled trials (RCTs) evaluating standalone smartphone apps for mental health.</div></div><div><h3>Methods</h3><div>In this systematic review and meta-analysis, based on a previously published study, we conducted an updated systematic search of PubMed, PsycINFO, Web of Science, Cochrane Clinical Trial, and Scopus for RCTs published from database inception to Nov 10, 2023. We included RCTs that examined the efficacy of standalone smartphone apps for mental health in adults (age ≥18 years) with heightened symptom severity compared with an inactive control group (eg, waitlist, informational material, and control apps). We excluded control groups that received active treatment. Two independent researchers (AV and AD) extracted summary data, which were verified by a third researcher (JKK). The effect size Hedges’ <em>g</em>, 95% CI, and p value were calculated for each target outcome. We applied a random-effects model to all analyses due to the expected heterogeneity between RCTs. We assessed quality using the Risk of Bias 2 tool (dated Aug 22, 2019) and assessed publication bias via the Egger's test, and the Duval and Tweedie trim-and-fill analysis. The study was registered with PROSPERO, CRD42022310762.</div></div><div><h3>Findings</h3><div>We retrieved 12 705 records from electronic databases and 74 records from other sources (ie, reviews and meta-analyses on digital interventions for mental health identified through database searches and their reference lists, reference lists of other studies, trial registrations in PROSPERO, and websites of researchers in the field). Of these, we included 72 RCTs (70 reports) with 21 702 participants (of the 21 048 participants with sex or gender data, 14 208 [67%] were female, 6744 [32%] were male, and 96 [<1%] were other). At post assessment (assessment after completion of intervention), we found significant effects of apps targeting depression (33 comparisons; Hedges’ <em>g</em> 0·45 [95% CI 0·30 to 0·60], p≤0·0001, <em>I</em><sup>2</sup>=81·30%), anxiety (23 comparisons; 0·35 [0·22 to 0·48], p≤0·0001, <em>I</em><sup>2</sup>=74·91%), sleep problems (14 comparisons; 0·71 [0·51 to 0·92], p≤0·0001, <em>I</em><sup>2</sup>=76·17%), post-traumatic stress disorder (nine comparisons; 0·15 [0·02 to 0·28], p=0·029, <em>I</em><sup>2</sup>=28·65%), eating disorders (four comparisons; 0·50 [0·29 to 0·71], p≤0·0001, <em>I</em><sup>2</sup>=50·49%), and body
背景:为了确定广泛使用的智能手机应用程序对心理健康的潜在益处,特别是在面对面服务有限或无法获得的情况下,将其与不活跃的对照进行比较是至关重要的。独立的智能手机应用程序可能为等待治疗或生活在资源不足环境中的个人提供一个可访问的选择。鉴于目前关于这些应用程序的证据尚无定论,本系统综述和荟萃分析旨在评估评估独立智能手机应用程序对心理健康的随机对照试验(rct)的有效性和研究质量。方法:在本系统综述和荟萃分析中,基于先前发表的一项研究,我们对PubMed、PsycINFO、Web of Science、Cochrane Clinical Trial和Scopus进行了更新的系统检索,检索从数据库建立到2023年11月10日发表的rct。我们纳入了rct,这些rct检查了独立智能手机应用程序对症状严重程度较高的成年人(年龄≥18岁)心理健康的功效,并与不活跃的对照组(例如,等候名单、信息材料和对照应用程序)进行了比较。我们排除了接受积极治疗的对照组。两名独立研究人员(AV和AD)提取了汇总数据,由第三名研究人员(JKK)验证。计算每个目标结果的效应大小Hedges' g、95% CI和p值。由于随机对照试验之间存在预期的异质性,我们对所有分析采用随机效应模型。我们使用风险偏倚2工具(日期为2019年8月22日)评估了质量,并通过Egger检验和Duval和Tweedie修剪填充分析评估了发表偏倚。该研究已注册为PROSPERO, CRD42022310762。研究结果:我们从电子数据库中检索了12 705条记录,从其他来源检索了74条记录(即通过数据库检索及其参考文献列表、其他研究参考文献列表、PROSPERO的试验注册和该领域研究人员的网站确定的关于心理健康数字干预的综述和荟萃分析)。其中,我们纳入了72项随机对照试验(70份报告),共21 702名参与者(在21 048名有性别或性别数据的参与者中,14 208名[67%]为女性,6744名[32%]为男性,96名[2= 830%]),焦虑(23名比较;0.35名[0.22 ~ 0.48],p≤0.0001,I2= 74.91%),睡眠问题(14名比较;0.71名[0.51 ~ 0.92],p≤0.0001,I2= 76.17%),创伤后应激障碍(9名比较;0.15名[0.02 ~ 0.28],p= 0.029, I2= 28.65%),饮食失调(4名比较;0.50 [0.29 ~ 0.71], p≤0.0001,I2= 50.49%),身体畸形障碍(3组比较;0.86 [0.30 ~ 1.41],p= 0.0025, I2= 74.90%)。吸烟(6个比较)、自残(6个比较)、自杀意念(5个比较)和酒精滥用(5个比较)没有发现显著的综合效应。强迫症(2个比较)和精神分裂症(1个比较)的效应量范围为0.10 ~ 0.96(- 0.12 ~ 1.51)。偏倚风险为中高。针对抑郁和焦虑的随机对照试验存在发表偏倚,但针对睡眠问题的随机对照试验不存在发表偏倚;调整后,抑郁的效应量从0.45降至0.18(0.02降至0.34),焦虑的效应量从0.35降至0.18(0.03降至0.32),而睡眠问题的效应量没有变化。解释:虽然一些结果显示出小到中等的效应量,但考虑到不确定性因素的存在,包括相当大的异质性和中等的研究质量,这些结果必须谨慎解释。异质性可能来自样本特征、评估方法和周期、辍学率、干预和应用程序组件以及控制条件,从而限制了研究结果的普遍性。如果没有基于证据的一线干预措施,可能会为抑郁、焦虑和睡眠问题的症状提供独立的智能手机应用程序。资金:没有。
{"title":"Efficacy of standalone smartphone apps for mental health: an updated systematic review and meta-analysis","authors":"Jennifer K Kulke MSc ,&nbsp;Lukas M Fuhrmann MSc ,&nbsp;Prof Matthias Berking PhD ,&nbsp;Prof David D Ebert PhD ,&nbsp;Prof Harald Baumeister PhD ,&nbsp;Ariqa Derfiora MSc ,&nbsp;Avery Veldhouse MSc ,&nbsp;Kiona K Weisel PhD","doi":"10.1016/j.landig.2025.100923","DOIUrl":"10.1016/j.landig.2025.100923","url":null,"abstract":"&lt;div&gt;&lt;h3&gt;Background&lt;/h3&gt;&lt;div&gt;To map out the potential benefits of widely available smartphone apps for mental health, especially in contexts where face-to-face services are limited or unavailable, it is crucial to examine their efficacy compared with inactive controls. Standalone smartphone apps might offer an accessible option for individuals waiting for treatment or living in under-resourced settings. Given the currently inconclusive evidence regarding these apps, this systematic review and meta-analysis aimed to assess the efficacy and study quality of randomised controlled trials (RCTs) evaluating standalone smartphone apps for mental health.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Methods&lt;/h3&gt;&lt;div&gt;In this systematic review and meta-analysis, based on a previously published study, we conducted an updated systematic search of PubMed, PsycINFO, Web of Science, Cochrane Clinical Trial, and Scopus for RCTs published from database inception to Nov 10, 2023. We included RCTs that examined the efficacy of standalone smartphone apps for mental health in adults (age ≥18 years) with heightened symptom severity compared with an inactive control group (eg, waitlist, informational material, and control apps). We excluded control groups that received active treatment. Two independent researchers (AV and AD) extracted summary data, which were verified by a third researcher (JKK). The effect size Hedges’ &lt;em&gt;g&lt;/em&gt;, 95% CI, and p value were calculated for each target outcome. We applied a random-effects model to all analyses due to the expected heterogeneity between RCTs. We assessed quality using the Risk of Bias 2 tool (dated Aug 22, 2019) and assessed publication bias via the Egger's test, and the Duval and Tweedie trim-and-fill analysis. The study was registered with PROSPERO, CRD42022310762.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Findings&lt;/h3&gt;&lt;div&gt;We retrieved 12 705 records from electronic databases and 74 records from other sources (ie, reviews and meta-analyses on digital interventions for mental health identified through database searches and their reference lists, reference lists of other studies, trial registrations in PROSPERO, and websites of researchers in the field). Of these, we included 72 RCTs (70 reports) with 21 702 participants (of the 21 048 participants with sex or gender data, 14 208 [67%] were female, 6744 [32%] were male, and 96 [&lt;1%] were other). At post assessment (assessment after completion of intervention), we found significant effects of apps targeting depression (33 comparisons; Hedges’ &lt;em&gt;g&lt;/em&gt; 0·45 [95% CI 0·30 to 0·60], p≤0·0001, &lt;em&gt;I&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;=81·30%), anxiety (23 comparisons; 0·35 [0·22 to 0·48], p≤0·0001, &lt;em&gt;I&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;=74·91%), sleep problems (14 comparisons; 0·71 [0·51 to 0·92], p≤0·0001, &lt;em&gt;I&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;=76·17%), post-traumatic stress disorder (nine comparisons; 0·15 [0·02 to 0·28], p=0·029, &lt;em&gt;I&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;=28·65%), eating disorders (four comparisons; 0·50 [0·29 to 0·71], p≤0·0001, &lt;em&gt;I&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;=50·49%), and body","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100923"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145606895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cloud computing for equitable, data-driven dementia medicine 云计算促进公平、数据驱动的痴呆症医学。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100902
Marcella Montagnese PhD , Bojidar Rangelov PhD , Tom Doel PhD , Prof David Llewellyn PhD , Prof Zuzana Walker MD PhD , Timothy Rittman MD PhD , Neil P Oxtoby PhD
Dementia poses an increasing global health challenge, and the introduction of new drugs with diverse activity profiles underscores the need for the rapid development and deployment of tailored predictive models. Machine learning has shown promise in dementia research, but it remains largely untested in routine dementia health care—particularly for image-based decision support—owing to data unavailability. Thus, data drift remains a key barrier for equitable real-world translation. We propose and pilot a scalable, cloud-based infrastructure as code solution incorporating privacy-preserving federated learning. This architecture preserves patient privacy by keeping data localised and secure, while enabling the development of robust, adaptable artificial intelligence models. Although technology giants have successfully implemented such technologies in consumer applications, their potential in health-care applications remains largely underutilised. This Viewpoint outlines the key challenges and solutions in implementing cloud-based federated learning for dementia medicine and provides a well-documented codebase to support further research.
痴呆症对全球健康构成了日益严峻的挑战,具有多种活动特征的新药的引入强调了快速开发和部署量身定制的预测模型的必要性。机器学习在痴呆症研究中显示出了希望,但由于数据不可用,它在常规痴呆症医疗保健中仍未得到测试,特别是在基于图像的决策支持方面。因此,数据漂移仍然是现实世界公平翻译的主要障碍。我们提出并试点了一个可扩展的、基于云的基础设施作为代码解决方案,其中包含了保护隐私的联邦学习。这种架构通过保持数据本地化和安全来保护患者隐私,同时使开发强大、适应性强的人工智能模型成为可能。虽然技术巨头已成功地在消费者应用中实施了这些技术,但它们在保健应用中的潜力仍未得到充分利用。本观点概述了实施基于云的痴呆症医学联合学习的主要挑战和解决方案,并提供了一个文档完备的代码库,以支持进一步的研究。
{"title":"Cloud computing for equitable, data-driven dementia medicine","authors":"Marcella Montagnese PhD ,&nbsp;Bojidar Rangelov PhD ,&nbsp;Tom Doel PhD ,&nbsp;Prof David Llewellyn PhD ,&nbsp;Prof Zuzana Walker MD PhD ,&nbsp;Timothy Rittman MD PhD ,&nbsp;Neil P Oxtoby PhD","doi":"10.1016/j.landig.2025.100902","DOIUrl":"10.1016/j.landig.2025.100902","url":null,"abstract":"<div><div>Dementia poses an increasing global health challenge, and the introduction of new drugs with diverse activity profiles underscores the need for the rapid development and deployment of tailored predictive models. Machine learning has shown promise in dementia research, but it remains largely untested in routine dementia health care—particularly for image-based decision support—owing to data unavailability. Thus, data drift remains a key barrier for equitable real-world translation. We propose and pilot a scalable, cloud-based infrastructure as code solution incorporating privacy-preserving federated learning. This architecture preserves patient privacy by keeping data localised and secure, while enabling the development of robust, adaptable artificial intelligence models. Although technology giants have successfully implemented such technologies in consumer applications, their potential in health-care applications remains largely underutilised. This Viewpoint outlines the key challenges and solutions in implementing cloud-based federated learning for dementia medicine and provides a well-documented codebase to support further research.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100902"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated retinal image analysis systems to triage for grading of diabetic retinopathy: a large-scale, open-label, national screening programme in England 用于糖尿病视网膜病变分级的自动视网膜图像分析系统:英国大规模、开放标签、全国性筛查项目。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100914
Prof Alicja R Rudnicka PhD , Royce Shakespeare MSc , Ryan Chambers BEng , Louis Bolter MSc , John Anderson MD , Jiri Fajtl PhD , Roshan A Welikala PhD , Prof Sarah A Barman PhD , Abraham Olvera-Barrios MD , Laura Webster , Samantha Mann MD , Aaron Lee MD , Prof Paolo Remagnino PhD , Catherine Egan MD , Prof Christopher G Owen PhD , Prof Adnan Tufail MD

Background

The global prevalence of diabetes is rising, alongside costs and workload associated with screening for diabetic eye disease (diabetic retinopathy). Automated retinal image analysis systems (ARIAS) could replace primary human grading of images for diabetic retinopathy. We evaluated multiple ARIAS in a real-life screening programme.

Methods

Eight of 25 invited and potentially eligible CE-marked systems for diabetic retinopathy detection from retinal images agreed to participate. From 202 886 screening encounters at the North East London Diabetic Eye Screening Programme (between Jan 1, 2021, and Dec 31, 2022) we curated a database of 1·2 million images and sociodemographic and grading data. Images were manually graded by up to three graders according to a standard national protocol. ARIAS performance overall and by subgroups of age, sex, ethnicity, and index of multiple deprivation (IMD) were assessed against the reference standard, defined as the final human grade in the worst eye for referable diabetic retinopathy (primary outcome). Vendor algorithms did not have access to human grading data.

Findings

Sensitivity across vendors ranged from 83·7% to 98·7% for referable diabetic retinopathy, from 96·7% to 99·8% for moderate-to-severe non-proliferative diabetic retinopathy, and from 95·8% to 99·5% for proliferative diabetic retinopathy. Sensitivity was largely consistent for moderate-to-severe non-proliferative and proliferative diabetic retinopathy by subgroups of age, sex, ethnicity, and IMD for all ARIAS. For mild-to-moderate non-proliferative diabetic retinopathy with referable maculopathy, sensitivity across vendors ranged from 79·5% to 98·3%, with greater variability across population subgroups. False positive rates for no observable diabetic retinopathy ranged from 4·3% to 61·4% and within vendors varied by 0·5 to 44 percentage points across population subgroups.

Interpretation

ARIAS showed high sensitivity for medium-risk and high-risk diabetic retinopathy in a real-world screening service, with equitable performance across population subgroups. ARIAS could provide a cost-effective solution to deal with the rising burden of screening for diabetic retinopathy by safely triaging for human grading, substantially increasing grading capacity and rapid diabetic retinopathy detection.

Funding

NHS Transformation Directorate, The Health Foundation, and The Wellcome Trust.
背景:全球糖尿病患病率正在上升,同时与糖尿病眼病(糖尿病视网膜病变)筛查相关的成本和工作量也在上升。自动视网膜图像分析系统(ARIAS)可以取代糖尿病视网膜病变的主要人类图像分级。我们在现实生活筛选项目中评估了多种ARIAS。方法:25个受邀和潜在合格的ce标记系统中的8个同意参与从视网膜图像检测糖尿病视网膜病变。从伦敦东北部糖尿病眼筛查项目(2021年1月1日至2022年12月31日)的2020886次筛查中,我们整理了一个包含120万张图像和社会人口统计学和分级数据的数据库。根据标准的国家协议,图像由多达三个分级者手动分级。ARIAS的总体表现以及年龄、性别、种族和多重剥夺指数(IMD)亚组的表现根据参考标准进行评估,参考标准定义为可参考糖尿病视网膜病变最差眼的最终人类等级(主要结局)。供应商的算法无法访问人工评分数据。研究结果:供应商对可转诊糖尿病视网膜病变的敏感性为83.7%至98.7%,对中重度非增生性糖尿病视网膜病变的敏感性为96.7%至99.8%,对增生性糖尿病视网膜病变的敏感性为95.8%至99.5%。在所有ARIAS中,对中度至重度非增殖性和增殖性糖尿病视网膜病变的敏感性在年龄、性别、种族和IMD亚组中基本一致。对于轻度至中度非增生性糖尿病视网膜病变合并可参考黄斑病变,不同供应商的敏感性范围为79.5%至98.3%,不同人群亚组的差异更大。未观察到糖尿病视网膜病变的假阳性率从4.3%到64.1%不等,在供应商内部,不同人群亚组的假阳性率差异为0.5到44个百分点。解释:在真实世界的筛查服务中,ARIAS显示出对中危和高危糖尿病视网膜病变的高敏感性,在人群亚组中表现公平。ARIAS可以提供一种具有成本效益的解决方案,通过安全分诊进行人类分级,大大提高分级能力和快速检测糖尿病视网膜病变,来应对日益增加的糖尿病视网膜病变筛查负担。资助:NHS转型理事会、健康基金会和威康信托基金。
{"title":"Automated retinal image analysis systems to triage for grading of diabetic retinopathy: a large-scale, open-label, national screening programme in England","authors":"Prof Alicja R Rudnicka PhD ,&nbsp;Royce Shakespeare MSc ,&nbsp;Ryan Chambers BEng ,&nbsp;Louis Bolter MSc ,&nbsp;John Anderson MD ,&nbsp;Jiri Fajtl PhD ,&nbsp;Roshan A Welikala PhD ,&nbsp;Prof Sarah A Barman PhD ,&nbsp;Abraham Olvera-Barrios MD ,&nbsp;Laura Webster ,&nbsp;Samantha Mann MD ,&nbsp;Aaron Lee MD ,&nbsp;Prof Paolo Remagnino PhD ,&nbsp;Catherine Egan MD ,&nbsp;Prof Christopher G Owen PhD ,&nbsp;Prof Adnan Tufail MD","doi":"10.1016/j.landig.2025.100914","DOIUrl":"10.1016/j.landig.2025.100914","url":null,"abstract":"<div><h3>Background</h3><div>The global prevalence of diabetes is rising, alongside costs and workload associated with screening for diabetic eye disease (diabetic retinopathy). Automated retinal image analysis systems (ARIAS) could replace primary human grading of images for diabetic retinopathy. We evaluated multiple ARIAS in a real-life screening programme.</div></div><div><h3>Methods</h3><div>Eight of 25 invited and potentially eligible CE-marked systems for diabetic retinopathy detection from retinal images agreed to participate. From 202 886 screening encounters at the North East London Diabetic Eye Screening Programme (between Jan 1, 2021, and Dec 31, 2022) we curated a database of 1·2 million images and sociodemographic and grading data. Images were manually graded by up to three graders according to a standard national protocol. ARIAS performance overall and by subgroups of age, sex, ethnicity, and index of multiple deprivation (IMD) were assessed against the reference standard, defined as the final human grade in the worst eye for referable diabetic retinopathy (primary outcome). Vendor algorithms did not have access to human grading data.</div></div><div><h3>Findings</h3><div>Sensitivity across vendors ranged from 83·7% to 98·7% for referable diabetic retinopathy, from 96·7% to 99·8% for moderate-to-severe non-proliferative diabetic retinopathy, and from 95·8% to 99·5% for proliferative diabetic retinopathy. Sensitivity was largely consistent for moderate-to-severe non-proliferative and proliferative diabetic retinopathy by subgroups of age, sex, ethnicity, and IMD for all ARIAS. For mild-to-moderate non-proliferative diabetic retinopathy with referable maculopathy, sensitivity across vendors ranged from 79·5% to 98·3%, with greater variability across population subgroups. False positive rates for no observable diabetic retinopathy ranged from 4·3% to 61·4% and within vendors varied by 0·5 to 44 percentage points across population subgroups.</div></div><div><h3>Interpretation</h3><div>ARIAS showed high sensitivity for medium-risk and high-risk diabetic retinopathy in a real-world screening service, with equitable performance across population subgroups. ARIAS could provide a cost-effective solution to deal with the rising burden of screening for diabetic retinopathy by safely triaging for human grading, substantially increasing grading capacity and rapid diabetic retinopathy detection.</div></div><div><h3>Funding</h3><div>NHS Transformation Directorate, The Health Foundation, and The Wellcome Trust.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100914"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145606945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evidence and responsibility of artificial intelligence use in mental health care 人工智能在精神卫生保健中的应用的证据和责任。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100959
The Lancet Digital Health
{"title":"Evidence and responsibility of artificial intelligence use in mental health care","authors":"The Lancet Digital Health","doi":"10.1016/j.landig.2025.100959","DOIUrl":"10.1016/j.landig.2025.100959","url":null,"abstract":"","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100959"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effective sample size for individual risk predictions: quantifying uncertainty in machine learning models 个体风险预测的有效样本量:量化机器学习模型中的不确定性。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100911
Doranne Thomassen PhD , Toby Hackmann MSc , Prof Jelle Goeman PhD , Prof Ewout Steyerberg PhD , Prof Saskia le Cessie PhD
Individual prediction uncertainty is a key aspect of clinical prediction model performance; however, standard performance metrics do not capture it. Consequently, a model might offer sufficient certainty for some patients but not for others, raising concerns about fairness. To address this limitation, the effective sample size has been proposed as a measure of sampling uncertainty. We developed a computational method to estimate effective sample sizes for a wide range of prediction models, including machine learning approaches. In this Viewpoint, we illustrated the approach using a clinical dataset (N=23 034) across five model types: logistic regression, elastic net, XGBoost, neural network, and random forest. During simulations, our approach generated accurate estimates of effective sample sizes for logistic regression and elastic net models, with minor deviations noted for the other three models. Although model performance metrics were similar across models, substantial differences in effective sample sizes and risk predictions were observed among patients in the clinical dataset. In conclusion, prediction uncertainty at the individual prediction level can be substantial even when models are developed using large samples. Effective sample size is thus a promising measure to communicate the uncertainty of predicted risk to individual users of machine learning-based prediction models.
个体预测的不确定性是影响临床预测模型性能的关键因素;然而,标准的性能指标并没有捕捉到它。因此,一个模型可能为一些病人提供了足够的确定性,但对另一些病人却没有,这引起了人们对公平性的担忧。为了解决这一限制,有效样本量被提议作为抽样不确定性的度量。我们开发了一种计算方法来估计各种预测模型的有效样本量,包括机器学习方法。在本观点中,我们使用五种模型类型的临床数据集(N=23 034)说明了该方法:逻辑回归、弹性网络、XGBoost、神经网络和随机森林。在模拟过程中,我们的方法为逻辑回归和弹性网络模型生成了有效样本量的准确估计,其他三个模型的偏差较小。尽管各模型的模型性能指标相似,但在临床数据集中的患者中观察到有效样本量和风险预测的实质性差异。总之,即使使用大样本开发模型,个体预测水平上的预测不确定性也可能很大。因此,有效样本量是一种很有前途的措施,可以将预测风险的不确定性传达给基于机器学习的预测模型的个人用户。
{"title":"Effective sample size for individual risk predictions: quantifying uncertainty in machine learning models","authors":"Doranne Thomassen PhD ,&nbsp;Toby Hackmann MSc ,&nbsp;Prof Jelle Goeman PhD ,&nbsp;Prof Ewout Steyerberg PhD ,&nbsp;Prof Saskia le Cessie PhD","doi":"10.1016/j.landig.2025.100911","DOIUrl":"10.1016/j.landig.2025.100911","url":null,"abstract":"<div><div>Individual prediction uncertainty is a key aspect of clinical prediction model performance; however, standard performance metrics do not capture it. Consequently, a model might offer sufficient certainty for some patients but not for others, raising concerns about fairness. To address this limitation, the effective sample size has been proposed as a measure of sampling uncertainty. We developed a computational method to estimate effective sample sizes for a wide range of prediction models, including machine learning approaches. In this Viewpoint, we illustrated the approach using a clinical dataset (N=23 034) across five model types: logistic regression, elastic net, XGBoost, neural network, and random forest. During simulations, our approach generated accurate estimates of effective sample sizes for logistic regression and elastic net models, with minor deviations noted for the other three models. Although model performance metrics were similar across models, substantial differences in effective sample sizes and risk predictions were observed among patients in the clinical dataset. In conclusion, prediction uncertainty at the individual prediction level can be substantial even when models are developed using large samples. Effective sample size is thus a promising measure to communicate the uncertainty of predicted risk to individual users of machine learning-based prediction models.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100911"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145641240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence and tumour-infiltrating lymphocytes in breast cancer: bridging innovation and feasibility towards clinical utility 人工智能和肿瘤浸润淋巴细胞在乳腺癌中的应用:连接创新和临床应用的可行性。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100944
Federica Miglietta , Maria Vittoria Dieci
{"title":"Artificial intelligence and tumour-infiltrating lymphocytes in breast cancer: bridging innovation and feasibility towards clinical utility","authors":"Federica Miglietta ,&nbsp;Maria Vittoria Dieci","doi":"10.1016/j.landig.2025.100944","DOIUrl":"10.1016/j.landig.2025.100944","url":null,"abstract":"","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100944"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Objective cough counting in clinical practice and public health: a scoping review 目的:咳嗽计数在临床实践和公共卫生中的应用综述。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100908
Alexandra J Zimmer PhD , Rishav Das MSc , Patricia Espinoza Lopez MD , Vaidehi Nafade MSc , Genevieve Gore MLIS , César Ugarte-Gil PhD , Prof Kian Fan Chung MD , Woo-Jung Song PhD , Prof Madhukar Pai PhD , Simon Grandjean Lapierre MD
Quantifying cough can offer value for respiratory disease assessment and monitoring. Traditionally, patient-reported outcomes have provided subjective insights into symptoms. Novel digital cough counting tools now enable objective assessments; however, their integration into clinical practice is limited. The aim of this scoping review was to address this gap in the literature by examining the use of automated and semiautomated cough counting tools in patient care and public health. A systematic search of six databases and preprint servers identified studies published up to Feb 12, 2025. From 6968 records found, 618 full-text articles were assessed for eligibility, and 77 were included. Five clinical use cases were identified—disease diagnosis, severity assessment, treatment monitoring, health outcome prediction, and syndromic surveillance—with scarce available evidence supporting each use case. Moderate correlations were found between objective cough frequency and patient-reported cough severity (median correlation coefficient of 0.42, IQR 0·38 to 0·59) and quality of life (median correlation coefficient of −0·49, −0·63 to −0·44), indicating a complex relationship between quantifiable measures and perceived symptoms. Feasibility challenges include device obtrusiveness, monitoring adherence, and addressing patient privacy concerns. Comprehensive studies are needed to validate these technologies in real-world settings and show their clinical value. Early feasibility and acceptability assessments are essential for successful integration.
量化咳嗽可为呼吸道疾病的评估和监测提供价值。传统上,患者报告的结果提供了对症状的主观见解。新型数字咳嗽计数工具现在可以进行客观评估;然而,它们与临床实践的结合是有限的。本综述的目的是通过检查自动和半自动咳嗽计数工具在患者护理和公共卫生中的使用来解决文献中的这一空白。通过对六个数据库和预印本服务器的系统搜索,确定了截至2025年2月12日发表的研究。从发现的6968条记录中,618篇全文文章被评估为合格,其中77篇被纳入。确定了5个临床用例——疾病诊断、严重程度评估、治疗监测、健康结果预测和综合征监测——支持每个用例的可用证据很少。客观咳嗽频率与患者报告的咳嗽严重程度(相关系数中位数为0.42,IQR为0.38 ~ 0.59)和生活质量(相关系数中位数为- 0.49,- 0.63 ~ - 0.44)之间存在中度相关性,表明可量化测量指标与感知症状之间存在复杂关系。可行性挑战包括设备突兀性、监测依从性和解决患者隐私问题。需要全面的研究来验证这些技术在现实世界的设置和显示其临床价值。早期的可行性和可接受性评估是成功集成的必要条件。
{"title":"Objective cough counting in clinical practice and public health: a scoping review","authors":"Alexandra J Zimmer PhD ,&nbsp;Rishav Das MSc ,&nbsp;Patricia Espinoza Lopez MD ,&nbsp;Vaidehi Nafade MSc ,&nbsp;Genevieve Gore MLIS ,&nbsp;César Ugarte-Gil PhD ,&nbsp;Prof Kian Fan Chung MD ,&nbsp;Woo-Jung Song PhD ,&nbsp;Prof Madhukar Pai PhD ,&nbsp;Simon Grandjean Lapierre MD","doi":"10.1016/j.landig.2025.100908","DOIUrl":"10.1016/j.landig.2025.100908","url":null,"abstract":"<div><div>Quantifying cough can offer value for respiratory disease assessment and monitoring. Traditionally, patient-reported outcomes have provided subjective insights into symptoms. Novel digital cough counting tools now enable objective assessments; however, their integration into clinical practice is limited. The aim of this scoping review was to address this gap in the literature by examining the use of automated and semiautomated cough counting tools in patient care and public health. A systematic search of six databases and preprint servers identified studies published up to Feb 12, 2025. From 6968 records found, 618 full-text articles were assessed for eligibility, and 77 were included. Five clinical use cases were identified—disease diagnosis, severity assessment, treatment monitoring, health outcome prediction, and syndromic surveillance—with scarce available evidence supporting each use case. Moderate correlations were found between objective cough frequency and patient-reported cough severity (median correlation coefficient of 0.42, IQR 0·38 to 0·59) and quality of life (median correlation coefficient of −0·49, −0·63 to −0·44), indicating a complex relationship between quantifiable measures and perceived symptoms. Feasibility challenges include device obtrusiveness, monitoring adherence, and addressing patient privacy concerns. Comprehensive studies are needed to validate these technologies in real-world settings and show their clinical value. Early feasibility and acceptability assessments are essential for successful integration.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100908"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145582684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physician input improves generative artificial intelligence models’ diagnostic performance in solving complex clinical cases 医生的输入提高了生成式人工智能模型在解决复杂临床病例时的诊断性能。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100922
Kyle Lam , Julia Calvo Latorre , Andrew Yiu , Grace Navin , Alexander Tan , Jianing Qiu
{"title":"Physician input improves generative artificial intelligence models’ diagnostic performance in solving complex clinical cases","authors":"Kyle Lam ,&nbsp;Julia Calvo Latorre ,&nbsp;Andrew Yiu ,&nbsp;Grace Navin ,&nbsp;Alexander Tan ,&nbsp;Jianing Qiu","doi":"10.1016/j.landig.2025.100922","DOIUrl":"10.1016/j.landig.2025.100922","url":null,"abstract":"","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100922"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145589382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthetic data, synthetic trust: navigating data challenges in the digital revolution 合成数据,合成信任:驾驭数字革命中的数据挑战。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100924
Arman Koul MS , Deborah Duran PhD , Tina Hernandez-Boussard PhD
In the evolving landscape of artificial intelligence (AI), the assumption that more data lead to better models has driven unchecked reliance on synthetic data to augment training datasets. Although synthetic data address crucial shortages of real-world training data, their overuse might propagate biases, accelerate model degradation, and compromise generalisability across populations. A concerning consequence of the rapid adoption of synthetic data in medical AI is the emergence of synthetic trust—an unwarranted confidence in models trained on artificially generated datasets that fail to preserve clinical validity or demographic realities. In this Viewpoint, we advocate for caution in using synthetic data to train clinical algorithms. We propose actionable safeguards for synthetic medical AI, including standards for training data, fragility testing during development, and deployment disclosures for synthetic origins to ensure end-to-end accountability. These safeguards uphold data integrity and fairness in clinical applications using synthetic data, offering new standards for responsible and equitable use of synthetic data in health care.
在不断发展的人工智能领域,更多的数据会带来更好的模型,这一假设推动了对合成数据的无限制依赖,以增强训练数据集。尽管合成数据解决了现实世界训练数据的严重短缺,但它们的过度使用可能会传播偏见,加速模型退化,并损害整个人群的通用性。在医疗人工智能中快速采用合成数据的一个令人担忧的后果是合成信任的出现——对人工生成的数据集训练的模型的毫无根据的信心,这些数据集未能保持临床有效性或人口统计学现实。在这个观点中,我们提倡谨慎使用合成数据来训练临床算法。我们为合成医疗人工智能提出了可操作的保障措施,包括培训数据标准、开发期间的脆弱性测试和合成来源的部署披露,以确保端到端问责制。这些保障措施维护了使用合成数据的临床应用中的数据完整性和公平性,为在卫生保健中负责任和公平地使用合成数据提供了新的标准。
{"title":"Synthetic data, synthetic trust: navigating data challenges in the digital revolution","authors":"Arman Koul MS ,&nbsp;Deborah Duran PhD ,&nbsp;Tina Hernandez-Boussard PhD","doi":"10.1016/j.landig.2025.100924","DOIUrl":"10.1016/j.landig.2025.100924","url":null,"abstract":"<div><div>In the evolving landscape of artificial intelligence (AI), the assumption that more data lead to better models has driven unchecked reliance on synthetic data to augment training datasets. Although synthetic data address crucial shortages of real-world training data, their overuse might propagate biases, accelerate model degradation, and compromise generalisability across populations. A concerning consequence of the rapid adoption of synthetic data in medical AI is the emergence of synthetic trust—an unwarranted confidence in models trained on artificially generated datasets that fail to preserve clinical validity or demographic realities. In this Viewpoint, we advocate for caution in using synthetic data to train clinical algorithms. We propose actionable safeguards for synthetic medical AI, including standards for training data, fragility testing during development, and deployment disclosures for synthetic origins to ensure end-to-end accountability. These safeguards uphold data integrity and fairness in clinical applications using synthetic data, offering new standards for responsible and equitable use of synthetic data in health care.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100924"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The impact of artificial intelligence-driven decision support on uncertain antimicrobial prescribing: a randomised, multimethod study 人工智能驱动的决策支持对不确定抗菌素处方的影响:一项随机、多方法研究。
IF 24.1 1区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-11-01 DOI: 10.1016/j.landig.2025.100912
William J Bolton PhD , Richard Wilson MPharm , Prof Mark Gilchrist MSc , Prof Pantelis Georgiou PhD , Prof Alison Holmes MD , Timothy M Rawson PhD
<div><h3>Background</h3><div>Challenges exist when translating artificial intelligence (AI)-driven clinical decision support systems (CDSSs) from research into health-care settings, particularly in infectious diseases, an area in which behaviour, culture, uncertainty, and frequent absence of a ground truth enhance the complexity of medical decision making. We aimed to evaluate clinicians’ perceptions of an AI CDSS for intravenous-to-oral antibiotic switching and how the system influences their decision making.</div></div><div><h3>Methods</h3><div>This randomised, multimethod study enrolled health-care professionals in the UK who were regularly involved in antibiotic prescribing. Participants were recruited through personal networks and the general email list of the British Infection Association. The first part of the study involved a semistructured interview about participants’ experience of antibiotic prescribing and their perception of AI. The second part used a custom web app to run a clinical vignette experiment: each of the 12 case vignettes consisted of a patient currently receiving intravenous antibiotics, and participants were asked to decide whether or not the patient was suitable for switching to oral antibiotics. Participants were assigned to receive either standard of care (SOC) information, or SOC alongside our previously developed AI-driven CDSS and its explanations, for each vignette across two groups. We assessed differences in participant choices according to the intervention they were assigned, both for each vignette and overall; evaluated the aggregate effect of the CDSS across all switching decisions; and characterised the decision diversity across participants. In the third part of the study, participants completed the system usability scale (SUS) and technology acceptance model (TAM) questionnaires to enable their opinions of the AI CDSS to be assessed.</div></div><div><h3>Findings</h3><div>59 clinicians were directly contacted or responded to recruitment emails, 42 of whom from 23 hospitals in the UK completed the study between April 23, 2024, and Aug 16, 2024. The median age of participants was 39 years (IQR 37–47), 19 (45%) were female and 23 (55%) were male, 26 (62%) were consultants and 16 (38%) were training-grade doctors, and 14 (33%) specialised in infectious diseases. Interviews revealed mixed individualisation of prescribing and uneven use of technology, alongside enthusiasm for AI, which was conditional on evidence and usability but constrained by behavioural inertia and infrastructure limitations. Case vignette completion times and many decisions were equivalent between SOC and CDSS interventions, with clinicians able to identify and ignore incorrect advice. When a statistical difference was observed, the CDSS influenced participants towards not switching (χ<sup>2</sup> 7·73, p=0·0054; logistic regression odds ratio 0·13 [95% CI 0·03–0·50]; p=0·0031). AI explanations were used only 9% of the time when available.
背景:在将人工智能(AI)驱动的临床决策支持系统(cdss)从研究转化为医疗保健环境时存在挑战,特别是在传染病领域,在这个领域,行为、文化、不确定性和经常缺乏基本事实增加了医疗决策的复杂性。我们的目的是评估临床医生对静脉注射到口服抗生素切换的人工智能CDSS的看法,以及该系统如何影响他们的决策。方法:这项随机、多方法研究招募了英国经常参与抗生素处方的卫生保健专业人员。参与者是通过个人网络和英国感染协会的一般电子邮件列表招募的。研究的第一部分包括对参与者的抗生素处方经历和他们对人工智能的看法进行半结构化采访。第二部分使用一个定制的web应用程序来运行一个临床小故事实验:12个病例小故事中的每一个都包括一个目前正在接受静脉注射抗生素的患者,参与者被要求决定该患者是否适合改用口服抗生素。参与者被分配接受标准护理(SOC)信息,或SOC与我们之前开发的人工智能驱动的CDSS及其解释一起接受两组的每个小插曲。我们根据参与者被分配的干预措施评估了他们选择的差异,包括每个小插曲和总体;评估了cds在所有切换决策中的总体效应;并描述了参与者的决策多样性。在研究的第三部分,参与者完成了系统可用性量表(SUS)和技术接受模型(TAM)问卷,以便评估他们对人工智能CDSS的意见。研究结果:直接联系了59名临床医生或回复了招聘电子邮件,其中42名来自英国23家医院,在2024年4月23日至2024年8月16日期间完成了研究。参与者的年龄中位数为39岁(IQR 37-47),女性19人(45%),男性23人(55%),顾问26人(62%),培训级医生16人(38%),传染病专科14人(33%)。采访显示,处方的个性化和技术使用的不均衡,以及对人工智能的热情,这取决于证据和可用性,但受到行为惯性和基础设施限制的限制。在SOC干预和CDSS干预之间,病例小品完成时间和许多决策是相同的,临床医生能够识别和忽略不正确的建议。当观察到有统计学差异时,CDSS影响参与者不切换(χ 2.73, p= 0.0054; logistic回归优势比0.13 [95% CI 0.03 - 0.50]; p= 0.0031)。在可用的情况下,人工智能解释的使用率只有9%。我们的软件和AI CDSS获得了良好的SUS得分,为77.3分(SD为8.79分),TAM问卷的感知有用性得分为3.6分(0.31分),感知易用性得分为3.8分(0.20分),自我效能感得分为4.1分(0.05分)。解释:该AI CDSS得到了积极的接受,并有可能支持抗菌药物处方,当它建议不从静脉注射转为口服治疗时,对临床医生的影响最大。需要进一步的前瞻性研究来收集安全性和获益数据,并了解人工智能cdss进入临床实践后的行为变化。我们的研究表明,人工智能解释可能在护理点上发挥次要作用,人工智能CDSS的采用和利用取决于系统是否易于使用和信任,主要是通过临床证据。资助:英国医疗保健人工智能博士培训研究与创新中心,以及伦敦帝国理工学院国家卫生与护理研究所医疗保健相关感染和抗菌素耐药性卫生保护研究单位。
{"title":"The impact of artificial intelligence-driven decision support on uncertain antimicrobial prescribing: a randomised, multimethod study","authors":"William J Bolton PhD ,&nbsp;Richard Wilson MPharm ,&nbsp;Prof Mark Gilchrist MSc ,&nbsp;Prof Pantelis Georgiou PhD ,&nbsp;Prof Alison Holmes MD ,&nbsp;Timothy M Rawson PhD","doi":"10.1016/j.landig.2025.100912","DOIUrl":"10.1016/j.landig.2025.100912","url":null,"abstract":"&lt;div&gt;&lt;h3&gt;Background&lt;/h3&gt;&lt;div&gt;Challenges exist when translating artificial intelligence (AI)-driven clinical decision support systems (CDSSs) from research into health-care settings, particularly in infectious diseases, an area in which behaviour, culture, uncertainty, and frequent absence of a ground truth enhance the complexity of medical decision making. We aimed to evaluate clinicians’ perceptions of an AI CDSS for intravenous-to-oral antibiotic switching and how the system influences their decision making.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Methods&lt;/h3&gt;&lt;div&gt;This randomised, multimethod study enrolled health-care professionals in the UK who were regularly involved in antibiotic prescribing. Participants were recruited through personal networks and the general email list of the British Infection Association. The first part of the study involved a semistructured interview about participants’ experience of antibiotic prescribing and their perception of AI. The second part used a custom web app to run a clinical vignette experiment: each of the 12 case vignettes consisted of a patient currently receiving intravenous antibiotics, and participants were asked to decide whether or not the patient was suitable for switching to oral antibiotics. Participants were assigned to receive either standard of care (SOC) information, or SOC alongside our previously developed AI-driven CDSS and its explanations, for each vignette across two groups. We assessed differences in participant choices according to the intervention they were assigned, both for each vignette and overall; evaluated the aggregate effect of the CDSS across all switching decisions; and characterised the decision diversity across participants. In the third part of the study, participants completed the system usability scale (SUS) and technology acceptance model (TAM) questionnaires to enable their opinions of the AI CDSS to be assessed.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Findings&lt;/h3&gt;&lt;div&gt;59 clinicians were directly contacted or responded to recruitment emails, 42 of whom from 23 hospitals in the UK completed the study between April 23, 2024, and Aug 16, 2024. The median age of participants was 39 years (IQR 37–47), 19 (45%) were female and 23 (55%) were male, 26 (62%) were consultants and 16 (38%) were training-grade doctors, and 14 (33%) specialised in infectious diseases. Interviews revealed mixed individualisation of prescribing and uneven use of technology, alongside enthusiasm for AI, which was conditional on evidence and usability but constrained by behavioural inertia and infrastructure limitations. Case vignette completion times and many decisions were equivalent between SOC and CDSS interventions, with clinicians able to identify and ignore incorrect advice. When a statistical difference was observed, the CDSS influenced participants towards not switching (χ&lt;sup&gt;2&lt;/sup&gt; 7·73, p=0·0054; logistic regression odds ratio 0·13 [95% CI 0·03–0·50]; p=0·0031). AI explanations were used only 9% of the time when available. ","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 11","pages":"Article 100912"},"PeriodicalIF":24.1,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145726136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Lancet Digital Health
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1