Lancet Digital Health最新文献_第9页

Early detection of sexually transmitted infections from skin lesions with deep learning: a systematic review and meta-analysis 深度学习在皮肤损伤性传播感染早期检测中的应用：一项系统综述和荟萃分析。

IF 24.1 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-07-01 DOI: 10.1016/j.landig.2025.100894

Ming Liu MSc , Xin-Yao Yi MSc , Yun-Zhe Chen MPH , Mei-Nuo Li MPH , Yuan-Yuan Zhang MPH , Casper J P Zhang PhD , Jian Huang PhD , Prof Wai-Kit Ming MD

Background

Sexually transmitted infections (STIs) are a substantial public health concern. We aimed to evaluate the accuracy and applicability of deep learning algorithms in the early detection of STIs from skin lesions.

Methods

In this systematic review and meta-analysis, we searched PubMed, Institute of Electrical and Electronics Engineers Xplore, Web of Science, Scopus for studies employing deep learning for classifying clinical skin lesion images of STIs published between Jan 1, 2010, and Dec 31, 2023. Studies that did not include clinical images were excluded. The primary outcome was diagnostic performance, assessed by pooled sensitivity and specificity. We conducted a meta-analysis of the studies providing contingency tables using a unified hierarchical model. We additionally assessed the quality of the studies using modified QUADAS-2 and CheckList for Evaluation of image-based AI Reports in Dermatology (CLEAR Derm) criteria. This study was registered with PROSPERO, CRD42024496966.

Findings

Among the 1946 studies identified, we included 101 in our review. The majority of the included studies focused on mpox (91 [88%] of 101 studies), followed by scabies (eight [8%] studies), herpes (four [4%] studies), syphilis (one [1%] study), and molluscum (one [1%] study). A meta-analysis of 55 studies showed that deep learning algorithms had a pooled sensitivity of 0·97 (95% CI 0·95–0·98) and a specificity of 0·99 (0·98–0·99) for mpox, and a sensitivity of 0·95 (0·90–0·98) and specificity of 0·97 (0·86–0·99) for scabies. The majority of studies (86 [85%] of 101 studies) utilised public datasets; traditional convolutional neural networks with backbone architectures such as ResNet and VGGNet were used in all studies. However, notable quality issues related to the data, technical descriptions of labelling methods and diagnostic label references, technical assessment for public evaluation of algorithms, benchmarking and bias assessments, application descriptions of use cases, and target conditions and potential impacts were identified in CLEAR Derm. Potential biases in performance evaluation metrics and applicability concerns in the data, deep learning algorithms, and performance evaluation metrics might impede the generalisability of these models to real-world clinical practice and STI screening across diverse populations.

Interpretation

Although deep learning shows potential for early detection of STIs, there are challenges to ensuring the generalisability of such algorithms due to limited heterogeneous data. Standardised, diverse skin lesion image datasets are crucial to ensure fair comparisons and reliable performance.

Funding

City University of Hong Kong.

背景：性传播感染（STIs）是一个重大的公共卫生问题。我们的目的是评估深度学习算法在皮肤病变性传播感染早期检测中的准确性和适用性。方法：在本系统综述和荟萃分析中，我们检索了PubMed， Institute of Electrical and Electronics Engineers Xplore, Web of Science, Scopus，检索了2010年1月1日至2023年12月31日期间发表的使用深度学习对性传播感染临床皮肤病变图像进行分类的研究。不包括临床影像的研究被排除在外。主要结局是诊断表现，通过综合敏感性和特异性进行评估。我们对这些研究进行了荟萃分析，使用统一的层次模型提供列联表。我们还使用改进的QUADAS-2和基于图像的皮肤病学人工智能报告评估清单（CLEAR Derm）标准评估了研究的质量。本研究注册号为PROSPERO， CRD42024496966。研究结果：在已确定的1946项研究中，我们纳入了101项。纳入的大多数研究集中于mpox(101项研究中的91项[88%])，其次是疥疮（8项[8%]研究）、疱疹（4项[4%]研究）、梅毒（1项[1%]研究）和软疣（1项[1%]研究）。55项研究的荟萃分析显示，深度学习算法对m痘的总灵敏度为0.97 (95% CI为0.95 ~ 0.98)，特异性为0.99(0.98 ~ 0.99)；对疥疮的总灵敏度为0.95(0.90 ~ 0.98)，特异性为0.97（0.86 ~ 0.99）。大多数研究（101项研究中的86项[85%]）使用了公共数据集；所有研究均使用具有骨干结构的传统卷积神经网络，如ResNet和VGGNet。然而，在CLEAR Derm中发现了与数据、标签方法和诊断标签参考的技术描述、算法公共评估的技术评估、基准和偏差评估、用例的应用描述以及目标条件和潜在影响相关的显著质量问题。性能评估指标的潜在偏差以及数据、深度学习算法和性能评估指标的适用性问题可能会阻碍这些模型在现实世界的临床实践和不同人群的STI筛查中的推广。解释：尽管深度学习显示出早期发现性传播感染的潜力，但由于异构数据有限，在确保此类算法的通用性方面存在挑战。标准化、多样化的皮肤病变图像数据集对于确保公平比较和可靠的性能至关重要。资助：香港城市大学。

{"title":"Early detection of sexually transmitted infections from skin lesions with deep learning: a systematic review and meta-analysis","authors":"Ming Liu MSc , Xin-Yao Yi MSc , Yun-Zhe Chen MPH , Mei-Nuo Li MPH , Yuan-Yuan Zhang MPH , Casper J P Zhang PhD , Jian Huang PhD , Prof Wai-Kit Ming MD","doi":"10.1016/j.landig.2025.100894","DOIUrl":"10.1016/j.landig.2025.100894","url":null,"abstract":"<div><h3>Background</h3><div>Sexually transmitted infections (STIs) are a substantial public health concern. We aimed to evaluate the accuracy and applicability of deep learning algorithms in the early detection of STIs from skin lesions.</div></div><div><h3>Methods</h3><div>In this systematic review and meta-analysis, we searched PubMed, Institute of Electrical and Electronics Engineers Xplore, Web of Science, Scopus for studies employing deep learning for classifying clinical skin lesion images of STIs published between Jan 1, 2010, and Dec 31, 2023. Studies that did not include clinical images were excluded. The primary outcome was diagnostic performance, assessed by pooled sensitivity and specificity. We conducted a meta-analysis of the studies providing contingency tables using a unified hierarchical model. We additionally assessed the quality of the studies using modified QUADAS-2 and CheckList for Evaluation of image-based AI Reports in Dermatology (CLEAR Derm) criteria. This study was registered with PROSPERO, CRD42024496966.</div></div><div><h3>Findings</h3><div>Among the 1946 studies identified, we included 101 in our review. The majority of the included studies focused on mpox (91 [88%] of 101 studies), followed by scabies (eight [8%] studies), herpes (four [4%] studies), syphilis (one [1%] study), and molluscum (one [1%] study). A meta-analysis of 55 studies showed that deep learning algorithms had a pooled sensitivity of 0·97 (95% CI 0·95–0·98) and a specificity of 0·99 (0·98–0·99) for mpox, and a sensitivity of 0·95 (0·90–0·98) and specificity of 0·97 (0·86–0·99) for scabies. The majority of studies (86 [85%] of 101 studies) utilised public datasets; traditional convolutional neural networks with backbone architectures such as ResNet and VGGNet were used in all studies. However, notable quality issues related to the data, technical descriptions of labelling methods and diagnostic label references, technical assessment for public evaluation of algorithms, benchmarking and bias assessments, application descriptions of use cases, and target conditions and potential impacts were identified in CLEAR Derm. Potential biases in performance evaluation metrics and applicability concerns in the data, deep learning algorithms, and performance evaluation metrics might impede the generalisability of these models to real-world clinical practice and STI screening across diverse populations.</div></div><div><h3>Interpretation</h3><div>Although deep learning shows potential for early detection of STIs, there are challenges to ensuring the generalisability of such algorithms due to limited heterogeneous data. Standardised, diverse skin lesion image datasets are crucial to ensure fair comparisons and reliable performance.</div></div><div><h3>Funding</h3><div>City University of Hong Kong.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 7","pages":"Article 100894"},"PeriodicalIF":24.1,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144795880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

FaceAge, a deep learning system to estimate biological age from face photographs to improve prognostication: a model development and validation study FaceAge，一个深度学习系统，从面部照片估计生物年龄，以提高预测：一项模型开发和验证研究。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.03.002

Dennis Bontempi PhD , Osbert Zalay PhD , Danielle S Bitterman MD , Nicolai Birkbak PhD , Derek Shyr PhD , Fridolin Haugg MSc , Jack M Qian MD , Hannah Roberts MD , Subha Perni MD , Vasco Prudente MSc , Suraj Pai MSc , Andre Dekker PhD , Benjamin Haibe-Kains PhD , Christian Guthier PhD , Tracy Balboni MD , Laura Warren MD , Monica Krishan MD , Benjamin H Kann MD , Prof Charles Swanton MD , Prof Dirk De Ruysscher MD , Prof Hugo J W L Aerts PhD

<div><h3>Background</h3><div>As humans age at different rates, physical appearance can yield insights into biological age and physiological health more reliably than chronological age. In medicine, however, appearance is incorporated into medical judgements in a subjective and non-standardised way. In this study, we aimed to develop and validate FaceAge, a deep learning system to estimate biological age from easily obtainable and low-cost face photographs.</div></div><div><h3>Methods</h3><div>FaceAge was trained on data from 58 851 presumed healthy individuals aged 60 years or older: 56 304 individuals from the IMDb–Wiki dataset (training) and 2547 from the UTKFace dataset (initial validation). Clinical utility was evaluated on data from 6196 patients with cancer diagnoses from two institutions in the Netherlands and the USA: the MAASTRO, Harvard Thoracic, and Harvard Palliative cohorts FaceAge estimates in these cancer cohorts were compared with a non-cancerous reference cohort of 535 individuals. To assess the prognostic relevance of FaceAge, we performed Kaplan–Meier survival analysis and Cox modelling, adjusting for several clinical covariates. We also assessed the performance of FaceAge in patients with metastatic cancer receiving palliative treatment at the end of life by incorporating FaceAge into clinical prediction models. To evaluate whether FaceAge has the potential to be a biomarker for molecular ageing, we performed a gene-based analysis to assess its association with senescence genes.</div></div><div><h3>Findings</h3><div>FaceAge showed significant independent prognostic performance in various cancer types and stages. Looking older was correlated with worse overall survival (after adjusting for covariates per-decade hazard ratio [HR] 1·151, p=0·013 in a pan-cancer cohort of n=4906; 1·148, p=0·011 in a thoracic cohort of n=573; and 1·117, p=0·021 in a palliative cohort of n=717). We found that, on average, patients with cancer looked older than their chronological age (mean increase of 4·79 years with respect to non-cancerous reference cohort, p<0·0001). We found that FaceAge can improve physicians’ survival predictions in patients with incurable cancer receiving palliative treatments (from area under the curve 0·74 [95% CI 0·70–0·78] to 0·8 [0·76–0·83]; p<0·0001), highlighting the clinical use of the algorithm to support end-of-life decision making. FaceAge was also significantly associated with molecular mechanisms of senescence through gene analysis, whereas age was not.</div></div><div><h3>Interpretation</h3><div>Our results suggest that a deep learning model can estimate biological age from face photographs and thereby enhance survival prediction in patients with cancer. Further research, including validation in larger cohorts, is needed to verify these findings in patients with cancer and to establish whether the findings extend to patients with other diseases. Subject to further testing and validation, approaches such as

背景：随着人类以不同的速度衰老，外表可以比实足年龄更可靠地反映生物年龄和生理健康状况。然而，在医学上，外表以一种主观和非标准化的方式被纳入医学判断。在这项研究中，我们旨在开发和验证FaceAge，这是一个深度学习系统，可以从容易获得的低成本面部照片中估计生物年龄。方法：FaceAge使用来自58 851名60岁或以上的假定健康个体的数据进行训练，其中56 304人来自IMDb-Wiki数据集（训练），2547人来自UTKFace数据集（初始验证）。临床应用评估了来自荷兰和美国两个机构的6196名癌症诊断患者的数据：MAASTRO、哈佛胸廓和哈佛姑息治疗队列。将这些癌症队列的FaceAge估计与535名非癌症参考队列进行比较。为了评估FaceAge的预后相关性，我们进行了Kaplan-Meier生存分析和Cox建模，调整了几个临床协变量。我们还通过将FaceAge纳入临床预测模型，评估了FaceAge在晚期接受姑息治疗的转移性癌症患者中的表现。为了评估FaceAge是否有潜力成为分子衰老的生物标志物，我们进行了一项基于基因的分析，以评估其与衰老基因的关联。结果：FaceAge在不同癌症类型和分期中显示出显著的独立预后表现。年龄越大与总生存率越差相关(在n=4906的泛癌症队列中，校正协变量后，每十年的风险比[HR]为1.151,p= 0.013；1·148,p=0·011在胸部队列n=573；在姑息治疗队列（n=717）中为1·117，p=0·021。我们发现，平均而言，癌症患者看起来比他们的实际年龄要老（与非癌症参考队列相比，平均增加4.79岁）。解释：我们的研究结果表明，深度学习模型可以从面部照片中估计生物年龄，从而提高癌症患者的生存预测。需要进一步的研究，包括在更大的队列中进行验证，以验证癌症患者的这些发现，并确定这些发现是否适用于其他疾病患者。经过进一步的测试和验证，FaceAge等方法可用于将患者的视觉外观转化为客观、定量和有临床价值的测量。资助：美国国立卫生研究院和欧盟欧洲研究理事会。

{"title":"FaceAge, a deep learning system to estimate biological age from face photographs to improve prognostication: a model development and validation study","authors":"Dennis Bontempi PhD , Osbert Zalay PhD , Danielle S Bitterman MD , Nicolai Birkbak PhD , Derek Shyr PhD , Fridolin Haugg MSc , Jack M Qian MD , Hannah Roberts MD , Subha Perni MD , Vasco Prudente MSc , Suraj Pai MSc , Andre Dekker PhD , Benjamin Haibe-Kains PhD , Christian Guthier PhD , Tracy Balboni MD , Laura Warren MD , Monica Krishan MD , Benjamin H Kann MD , Prof Charles Swanton MD , Prof Dirk De Ruysscher MD , Prof Hugo J W L Aerts PhD","doi":"10.1016/j.landig.2025.03.002","DOIUrl":"10.1016/j.landig.2025.03.002","url":null,"abstract":"<div><h3>Background</h3><div>As humans age at different rates, physical appearance can yield insights into biological age and physiological health more reliably than chronological age. In medicine, however, appearance is incorporated into medical judgements in a subjective and non-standardised way. In this study, we aimed to develop and validate FaceAge, a deep learning system to estimate biological age from easily obtainable and low-cost face photographs.</div></div><div><h3>Methods</h3><div>FaceAge was trained on data from 58 851 presumed healthy individuals aged 60 years or older: 56 304 individuals from the IMDb–Wiki dataset (training) and 2547 from the UTKFace dataset (initial validation). Clinical utility was evaluated on data from 6196 patients with cancer diagnoses from two institutions in the Netherlands and the USA: the MAASTRO, Harvard Thoracic, and Harvard Palliative cohorts FaceAge estimates in these cancer cohorts were compared with a non-cancerous reference cohort of 535 individuals. To assess the prognostic relevance of FaceAge, we performed Kaplan–Meier survival analysis and Cox modelling, adjusting for several clinical covariates. We also assessed the performance of FaceAge in patients with metastatic cancer receiving palliative treatment at the end of life by incorporating FaceAge into clinical prediction models. To evaluate whether FaceAge has the potential to be a biomarker for molecular ageing, we performed a gene-based analysis to assess its association with senescence genes.</div></div><div><h3>Findings</h3><div>FaceAge showed significant independent prognostic performance in various cancer types and stages. Looking older was correlated with worse overall survival (after adjusting for covariates per-decade hazard ratio [HR] 1·151, p=0·013 in a pan-cancer cohort of n=4906; 1·148, p=0·011 in a thoracic cohort of n=573; and 1·117, p=0·021 in a palliative cohort of n=717). We found that, on average, patients with cancer looked older than their chronological age (mean increase of 4·79 years with respect to non-cancerous reference cohort, p<0·0001). We found that FaceAge can improve physicians’ survival predictions in patients with incurable cancer receiving palliative treatments (from area under the curve 0·74 [95% CI 0·70–0·78] to 0·8 [0·76–0·83]; p<0·0001), highlighting the clinical use of the algorithm to support end-of-life decision making. FaceAge was also significantly associated with molecular mechanisms of senescence through gene analysis, whereas age was not.</div></div><div><h3>Interpretation</h3><div>Our results suggest that a deep learning model can estimate biological age from face photographs and thereby enhance survival prediction in patients with cancer. Further research, including validation in larger cohorts, is needed to verify these findings in patients with cancer and to establish whether the findings extend to patients with other diseases. Subject to further testing and validation, approaches such as","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 6","pages":"Article 100870"},"PeriodicalIF":23.8,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144035203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Refined selection of individuals for preventive cardiovascular disease treatment with a transformer-based risk model 以变压器为基础的风险模型对个体进行预防性心血管疾病治疗的精细选择。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.03.005

Shishir Rao DPhil , Yikuan Li DPhil , Mohammad Mamouei PhD , Gholamreza Salimi-Khorshidi DPhil , Malgorzata Wamil PhD , Milad Nazarzadeh DPhil , Christopher Yau DPhil , Gary S Collins PhD , Rod Jackson PhD , Andrew Vickers DPhil , Goodarz Danaei MD ScD , Kazem Rahimi DM FESC

Background

Although statistical models have been commonly used to identify patients at risk of cardiovascular disease for preventive therapy, these models tend to over-recommend therapy. Moreover, in populations with pre-existing diseases, the current approach is to indiscriminately treat all, as modelling in this context is currently inadequate. This study aimed to develop and validate the Transformer-based Risk assessment survival (TRisk) model, a novel deep learning model, for predicting 10-year risk of cardiovascular disease in both the primary prevention population and individuals with diabetes.

Methods

An open cohort of 3 million adults aged 25–84 years was identified using linked electronic health records from 291 general practices, for model development, and 98 general practices, for validation, across England from 1998 to 2015. Comparison against the QRISK3 score and a deep learning derivation of it was done. Additional analyses compared discriminatory performance in other age groups, by sex, and across categories of socioeconomic status.

Findings

TRisk showed superior discrimination (C index in the primary prevention population 0·910; 95% CI 0·906–0·913). TRisk’s performance was found to be less sensitive to population age range than the benchmark models and outperformed other models also in analyses stratified by age, sex, or socioeconomic status. All models were overall well calibrated. In decision curve analyses, TRisk showed a greater net benefit than benchmark models across the range of relevant thresholds. At the widely recommended 10% risk threshold and the higher 15% threshold, TRisk reduced both the total number of patients classified at high risk (by 20·6% and 34·6%, respectively) and the number of false negatives as compared with recommended strategies. TRisk similarly outperformed other models in patients with diabetes. Compared with the widely recommended treat-all policy approach for patients with diabetes, TRisk at a 10% risk threshold would lead to deselection of 24·3% of individuals, with a small fraction of false negatives (0·2% of the cohort).

Interpretation

TRisk enabled a more targeted selection of individuals at risk of cardiovascular disease in both the primary prevention population and cohorts with diabetes, compared with benchmark approaches. Incorporation of TRisk into routine care could potentially reduce the number of treatment-eligible patients by approximately one-third while preventing at least as many events as with currently adopted approaches.

Funding

None.

背景：虽然统计模型通常用于识别心血管疾病风险患者进行预防治疗，但这些模型倾向于过度推荐治疗。此外，在已有疾病的人群中，目前的做法是不分青红皂白地治疗所有人，因为在这方面的建模目前是不充分的。本研究旨在开发和验证基于transformer的风险评估生存（TRisk）模型，这是一种新的深度学习模型，用于预测初级预防人群和糖尿病患者10年心血管疾病风险。方法：从1998年至2015年，使用英格兰291个全科诊所的相关电子健康记录确定了300万名25-84岁成年人的开放队列，用于模型开发，98个全科诊所进行验证。与QRISK3分数进行了比较，并对其进行了深度学习推导。其他分析比较了其他年龄组、性别和不同社会经济地位类别的歧视性表现。结果：一级预防人群的风险指数（C指数）为0·910；95% ci 0.906 - 0.913)。研究发现，与基准模型相比，风险模型对人口年龄范围的敏感性较低，在按年龄、性别或社会经济地位分层的分析中，风险模型的表现也优于其他模型。所有模型总体上都得到了很好的校准。在决策曲线分析中，在相关阈值范围内，TRisk显示出比基准模型更大的净收益。在广泛推荐的10%风险阈值和更高的15%阈值下，与推荐的策略相比，TRisk降低了高风险患者的总数（分别减少20.6%和34.6%）和假阴性的数量。在糖尿病患者中，TRisk同样优于其他模型。与广泛推荐的针对糖尿病患者的全面治疗政策方法相比，风险阈值为10%的风险将导致24.3%的个体取消选择，并有一小部分假阴性（0.2%的队列）。解释：与基准方法相比，在初级预防人群和糖尿病人群中，风险使得更有针对性地选择有心血管疾病风险的个体。将风险纳入常规护理可能会使符合治疗条件的患者数量减少约三分之一，同时预防的事件至少与目前采用的方法一样多。资金:没有。

{"title":"Refined selection of individuals for preventive cardiovascular disease treatment with a transformer-based risk model","authors":"Shishir Rao DPhil , Yikuan Li DPhil , Mohammad Mamouei PhD , Gholamreza Salimi-Khorshidi DPhil , Malgorzata Wamil PhD , Milad Nazarzadeh DPhil , Christopher Yau DPhil , Gary S Collins PhD , Rod Jackson PhD , Andrew Vickers DPhil , Goodarz Danaei MD ScD , Kazem Rahimi DM FESC","doi":"10.1016/j.landig.2025.03.005","DOIUrl":"10.1016/j.landig.2025.03.005","url":null,"abstract":"<div><h3>Background</h3><div>Although statistical models have been commonly used to identify patients at risk of cardiovascular disease for preventive therapy, these models tend to over-recommend therapy. Moreover, in populations with pre-existing diseases, the current approach is to indiscriminately treat all, as modelling in this context is currently inadequate. This study aimed to develop and validate the Transformer-based Risk assessment survival (TRisk) model, a novel deep learning model, for predicting 10-year risk of cardiovascular disease in both the primary prevention population and individuals with diabetes.</div></div><div><h3>Methods</h3><div>An open cohort of 3 million adults aged 25–84 years was identified using linked electronic health records from 291 general practices, for model development, and 98 general practices, for validation, across England from 1998 to 2015. Comparison against the QRISK3 score and a deep learning derivation of it was done. Additional analyses compared discriminatory performance in other age groups, by sex, and across categories of socioeconomic status.</div></div><div><h3>Findings</h3><div>TRisk showed superior discrimination (C index in the primary prevention population 0·910; 95% CI 0·906–0·913). TRisk’s performance was found to be less sensitive to population age range than the benchmark models and outperformed other models also in analyses stratified by age, sex, or socioeconomic status. All models were overall well calibrated. In decision curve analyses, TRisk showed a greater net benefit than benchmark models across the range of relevant thresholds. At the widely recommended 10% risk threshold and the higher 15% threshold, TRisk reduced both the total number of patients classified at high risk (by 20·6% and 34·6%, respectively) and the number of false negatives as compared with recommended strategies. TRisk similarly outperformed other models in patients with diabetes. Compared with the widely recommended treat-all policy approach for patients with diabetes, TRisk at a 10% risk threshold would lead to deselection of 24·3% of individuals, with a small fraction of false negatives (0·2% of the cohort).</div></div><div><h3>Interpretation</h3><div>TRisk enabled a more targeted selection of individuals at risk of cardiovascular disease in both the primary prevention population and cohorts with diabetes, compared with benchmark approaches. Incorporation of TRisk into routine care could potentially reduce the number of treatment-eligible patients by approximately one-third while preventing at least as many events as with currently adopted approaches.</div></div><div><h3>Funding</h3><div>None.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 6","pages":"Article 100873"},"PeriodicalIF":23.8,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144217272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Transforming women's health, empowerment, and gender equality with digital health: evidence-based policy and practice 以数字健康改变妇女健康、赋权和性别平等：基于证据的政策和实践。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.01.014

Prof Israel Júnior Borges do Nascimento MD ClinPath , Hebatullah Mohamed Abdulazeem MD MSc , Ishanka Weerasekara PhD , Prof Jodie Marquez PhD , Lenny T Vasanthan PhD , Genevieve Deeken MSc , Prof Rosemary Morgan PhD , Heang-Lee Tan MPH , Isabel Yordi Aguirre PhD , Lasse Østeengaard MSc , Indunil Kularathne BSc , Natasha Azzopardi-Muscat PhD , Prof Robin van Kessel PhD , Edson Zangiacomi Martinez PhD , Govin Permanand PhD , David Novillo-Ortiz PhD MLIS

We evaluated the effects of digital health technologies (DHTs) on women's health, empowerment, and gender equality, using the scoping review method. Following a search across five databases and grey literature, we analysed 80 studies published up to Aug 18, 2023. The thematic appraisal and quantitative analysis found that DHTs positively affect women's access to health-care services, self-care, and tailored self-monitoring enabling the acquisition of health-related interventions. Use of these technologies is beneficial across various medical fields, including gynaecology, endocrinology, and psychiatry. DHTs also improve women's empowerment and gender equality by facilitating skills acquisition, health education, and social interaction, while allowing cost-effective health services. Overall, DHTs contribute to better health outcomes for women and support the UN Sustainable Development Goals by improving access to health care and financial literacy.

我们使用范围审查方法评估了数字卫生技术（dht）对妇女健康、赋权和性别平等的影响。通过对五个数据库和灰色文献的搜索，我们分析了截至2023年8月18日发表的80项研究。专题评价和定量分析发现，dht对妇女获得保健服务、自我保健和量身定制的自我监测产生了积极影响，从而能够获得与健康有关的干预措施。这些技术的使用在各个医学领域都是有益的，包括妇科、内分泌学和精神病学。卫生保健部门还通过促进技能获取、卫生教育和社会互动，改善妇女赋权和性别平等，同时提供具有成本效益的卫生服务。总体而言，卫生保健技术有助于改善妇女的健康结果，并通过改善获得卫生保健和金融知识的机会来支持联合国可持续发展目标。

{"title":"Transforming women's health, empowerment, and gender equality with digital health: evidence-based policy and practice","authors":"Prof Israel Júnior Borges do Nascimento MD ClinPath , Hebatullah Mohamed Abdulazeem MD MSc , Ishanka Weerasekara PhD , Prof Jodie Marquez PhD , Lenny T Vasanthan PhD , Genevieve Deeken MSc , Prof Rosemary Morgan PhD , Heang-Lee Tan MPH , Isabel Yordi Aguirre PhD , Lasse Østeengaard MSc , Indunil Kularathne BSc , Natasha Azzopardi-Muscat PhD , Prof Robin van Kessel PhD , Edson Zangiacomi Martinez PhD , Govin Permanand PhD , David Novillo-Ortiz PhD MLIS","doi":"10.1016/j.landig.2025.01.014","DOIUrl":"10.1016/j.landig.2025.01.014","url":null,"abstract":"<div><div>We evaluated the effects of digital health technologies (DHTs) on women's health, empowerment, and gender equality, using the scoping review method. Following a search across five databases and grey literature, we analysed 80 studies published up to Aug 18, 2023. The thematic appraisal and quantitative analysis found that DHTs positively affect women's access to health-care services, self-care, and tailored self-monitoring enabling the acquisition of health-related interventions. Use of these technologies is beneficial across various medical fields, including gynaecology, endocrinology, and psychiatry. DHTs also improve women's empowerment and gender equality by facilitating skills acquisition, health education, and social interaction, while allowing cost-effective health services. Overall, DHTs contribute to better health outcomes for women and support the UN Sustainable Development Goals by improving access to health care and financial literacy.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 6","pages":"Article 100858"},"PeriodicalIF":23.8,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144081422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Artificial intelligence-assisted detection of nasopharyngeal carcinoma on endoscopic images: a national, multicentre, model development and validation study 人工智能辅助鼻咽癌内镜图像检测：一项全国性、多中心、模型开发和验证研究。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.03.001

Yuxuan Shi PhD , Zhen Li PhD , Li Wang PhD , Hong Wang PhD , Prof Xiaofeng Liu PhD , Dantong Gu MS , Xiao Chen MS , Xueli Liu PhD , Wentao Gong MS , Xiaowen Jiang MD , Wenquan Li MD , Yongdong Lin BS , Ke Liu MD , Deyan Luo MD , Tao Peng PhD , Xuemei Peng BS , Meimei Tong BS , Huizhen Zheng MD , Xuanchen Zhou MD , Jianrong Wu PhD , Prof Hongmeng Yu PhD

<div><h3>Background</h3><div>Nasopharyngeal carcinoma is highly curable when diagnosed early. However, the nasopharynx’s obscure anatomical position and the similarity of local imaging manifestations with those of other nasopharyngeal diseases often lead to diagnostic challenges, resulting in delayed or missed diagnoses. Our aim was to develop a deep learning algorithm to enhance an otolaryngologist’s diagnostic capabilities by differentiating between nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx during endoscopic examination.</div></div><div><h3>Methods</h3><div>In this national, multicentre, model development and validation study, we developed a Swin Transformer-based Nasopharyngeal Diagnostic (STND) system to identify nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx. STND was developed with 27 362 nasopharyngeal endoscopic images (10 693 biopsy-proven nasopharyngeal carcinoma, 7073 biopsy-proven benign hyperplasia, and 9596 normal nasopharynx) sourced from eight prominent nasopharyngeal carcinoma centres (stage 1), and externally validated with 1885 prospectively acquired images from ten comprehensive hospitals with a high incidence of nasopharyngeal carcinoma (stage 2). Furthermore, we did a fully crossed, multireader, multicase study involving four expert otolaryngologists from four regional leading nasopharyngeal carcinoma centres, and 24 general otolaryngologists from 24 geographically diverse primary hospitals. This study included 400 images to evaluate the diagnostic capabilities of the experts and general otolaryngologists both with and without the aid of the STND system in a real-world environment.</div></div><div><h3>Findings</h3><div>Endoscopic images used in the internal study (Jan 1, 2017, to Jan 31, 2023) were from 15 521 individuals (9033 [58·2%] men and 6488 [41·8%] women; mean age 47·6 years [IQR 38·4–56·8]). Images from 945 participants (538 [56·9%] men and 407 [43·1%] women; mean age 45·2 years [IQR 35·2– 55·2]) were used in the external validation. STND in the internal dataset discriminated normal nasopharynx images from abnormalities (benign hyperplasia and nasopharyngeal carcinoma) with an area under the curve (AUC) of 0·99 (95% CI 0·99–0·99) and malignant images (ie, nasopharyngeal carcinoma) from non-malignant images (ie, benign hyperplasia and normal nasopharynx) with an AUC of 0·99 (95% CI 0·98–0·99). In the external validation, the system had an AUC for the detection of nasopharyngeal carcinoma of 0·95 (95% CI 0·94–0·96), a sensitivity of 91·6% (95% CI 89·3–93·5), and a specificity of 86·1% (95% CI 84·1–87·9). In the multireader, multicase study, the artificial intelligence (AI)-assisted strategy enhanced otolaryngologists’ diagnostic accuracy by 7·9%, increasing from 83·4% (95% CI 80·1–86·7, without AI assistance) to 91·2% (95% CI 88·6–93·9, with AI assistance; p<0·0001) for primary care otolaryngologists. Reading time per image decreased with the aid of the AI model (mea

背景：鼻咽癌早期诊断治愈率高。然而，鼻咽部解剖位置模糊，局部影像学表现与其他鼻咽部疾病相似，往往导致诊断困难，导致延误或漏诊。我们的目标是开发一种深度学习算法，通过在内窥镜检查中区分鼻咽癌、良性增生和正常鼻咽来提高耳鼻喉科医生的诊断能力。方法：在这项全国性、多中心的模型开发和验证研究中，我们开发了一种基于Swin变压器的鼻咽癌诊断（STND）系统，用于识别鼻咽癌、良性增生和正常鼻咽癌。STND采用来自8个著名鼻咽癌中心（1期）的27 362张鼻咽内镜图像（10 693张活检证实的鼻咽癌，7073张活检证实的良性增生，9596张正常鼻咽癌），并通过来自10家鼻咽癌高发综合医院（2期）的1885张前瞻性图像进行外部验证。此外，我们进行了一项完全交叉、多读者、多病例的研究，涉及来自四个地区领先的鼻咽癌中心的四名专家耳鼻喉科医生，以及来自24个地理位置不同的初级医院的24名普通耳鼻喉科医生。该研究包括400张图像，以评估专家和普通耳鼻喉科医生在使用和不使用STND系统的情况下在现实环境中的诊断能力。结果：内部研究（2017年1月1日至2023年1月31日）使用的内镜图像来自15 521人(男性9033人[58.2%]，女性6488人[41.8%]；平均年龄47.6岁[IQR 38.4 ~ 56.8])。来自945名参与者的图像(男性538人[56.9%]，女性407人[43.1%]；平均年龄45·2岁[IQR 35.2 - 55.2])进行外部验证。内部数据集中的STND以曲线下面积（AUC）为0.99 （95% CI为0.99 ~ 0.99）区分正常鼻咽图像与异常（良性增生和鼻咽癌），以AUC为0.99 （95% CI为0.98 ~ 0.99）区分恶性图像（即鼻咽癌）与非恶性图像（即良性增生和正常鼻咽）。在外部验证中，该系统检测鼻咽癌的AUC为0.95 (95% CI为0.94 ~ 0.96)，灵敏度为91.6% (95% CI为89.3 ~ 93.5)，特异性为86.1% （95% CI为84.1 ~ 89.7）。在多读者、多病例研究中，人工智能（AI）辅助策略使耳鼻喉科医生的诊断准确性提高了7.9%，从83.4% （95% CI 801 - 86.7，无AI辅助）增加到91.2% （95% CI 88.6 - 99.3，有AI辅助）；解释：我们的深度学习系统在鼻咽癌的内镜图像诊断方面显示出了巨大的临床应用潜力。该系统为基层医院的采用提供了实质性的好处，旨在提高特异性，避免额外的活检，并减少漏诊。资助项目：颅底肿瘤内窥镜手术新技术：CAMS医学科学创新基金；上海市科学技术委员会基金；上海市临床重点专科；国家临床重点专科；以及青年精英科学家资助计划。

{"title":"Artificial intelligence-assisted detection of nasopharyngeal carcinoma on endoscopic images: a national, multicentre, model development and validation study","authors":"Yuxuan Shi PhD , Zhen Li PhD , Li Wang PhD , Hong Wang PhD , Prof Xiaofeng Liu PhD , Dantong Gu MS , Xiao Chen MS , Xueli Liu PhD , Wentao Gong MS , Xiaowen Jiang MD , Wenquan Li MD , Yongdong Lin BS , Ke Liu MD , Deyan Luo MD , Tao Peng PhD , Xuemei Peng BS , Meimei Tong BS , Huizhen Zheng MD , Xuanchen Zhou MD , Jianrong Wu PhD , Prof Hongmeng Yu PhD","doi":"10.1016/j.landig.2025.03.001","DOIUrl":"10.1016/j.landig.2025.03.001","url":null,"abstract":"<div><h3>Background</h3><div>Nasopharyngeal carcinoma is highly curable when diagnosed early. However, the nasopharynx’s obscure anatomical position and the similarity of local imaging manifestations with those of other nasopharyngeal diseases often lead to diagnostic challenges, resulting in delayed or missed diagnoses. Our aim was to develop a deep learning algorithm to enhance an otolaryngologist’s diagnostic capabilities by differentiating between nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx during endoscopic examination.</div></div><div><h3>Methods</h3><div>In this national, multicentre, model development and validation study, we developed a Swin Transformer-based Nasopharyngeal Diagnostic (STND) system to identify nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx. STND was developed with 27 362 nasopharyngeal endoscopic images (10 693 biopsy-proven nasopharyngeal carcinoma, 7073 biopsy-proven benign hyperplasia, and 9596 normal nasopharynx) sourced from eight prominent nasopharyngeal carcinoma centres (stage 1), and externally validated with 1885 prospectively acquired images from ten comprehensive hospitals with a high incidence of nasopharyngeal carcinoma (stage 2). Furthermore, we did a fully crossed, multireader, multicase study involving four expert otolaryngologists from four regional leading nasopharyngeal carcinoma centres, and 24 general otolaryngologists from 24 geographically diverse primary hospitals. This study included 400 images to evaluate the diagnostic capabilities of the experts and general otolaryngologists both with and without the aid of the STND system in a real-world environment.</div></div><div><h3>Findings</h3><div>Endoscopic images used in the internal study (Jan 1, 2017, to Jan 31, 2023) were from 15 521 individuals (9033 [58·2%] men and 6488 [41·8%] women; mean age 47·6 years [IQR 38·4–56·8]). Images from 945 participants (538 [56·9%] men and 407 [43·1%] women; mean age 45·2 years [IQR 35·2– 55·2]) were used in the external validation. STND in the internal dataset discriminated normal nasopharynx images from abnormalities (benign hyperplasia and nasopharyngeal carcinoma) with an area under the curve (AUC) of 0·99 (95% CI 0·99–0·99) and malignant images (ie, nasopharyngeal carcinoma) from non-malignant images (ie, benign hyperplasia and normal nasopharynx) with an AUC of 0·99 (95% CI 0·98–0·99). In the external validation, the system had an AUC for the detection of nasopharyngeal carcinoma of 0·95 (95% CI 0·94–0·96), a sensitivity of 91·6% (95% CI 89·3–93·5), and a specificity of 86·1% (95% CI 84·1–87·9). In the multireader, multicase study, the artificial intelligence (AI)-assisted strategy enhanced otolaryngologists’ diagnostic accuracy by 7·9%, increasing from 83·4% (95% CI 80·1–86·7, without AI assistance) to 91·2% (95% CI 88·6–93·9, with AI assistance; p<0·0001) for primary care otolaryngologists. Reading time per image decreased with the aid of the AI model (mea","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 6","pages":"Article 100869"},"PeriodicalIF":23.8,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144340497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Health insights from face photographs 从面部照片看健康。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.100889

The Lancet Digital Health

引用次数: 0

Online video versus face-to-face preoperative consultation for major abdominal surgery (VIDEOGO): a multicentre, open-label, randomised, controlled, non-inferiority trial 在线视频与腹部大手术术前面对面咨询（视频）：一项多中心、开放标签、随机、对照、非劣效性试验。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.02.007

Britte H E A ten Haaft MD , Boris V Janssen BSc , Esther Z Barsom MD PhD , Prof Wouter J K Hehenkamp MD PhD , Prof Mark I van Berge Henegouwen MD PhD , Prof Olivier R Busch MD PhD , Susan van Dieren PhD , Joris I Erdmann MD PhD , Wietse J Eshuis MD PhD , Suzanne S Gisbertz MD PhD , Prof Misha D P Luyer MD PhD , Olga C Damman PhD , Prof Martine C de Bruijne MD PhD , Prof Geert Kazemier MD PhD , Prof Marlies P Schijven MD PhD , Prof Marc G Besselink MD PhD

Background

Online video consultation between patients and health-care providers rapidly gained popularity during the COVID-19 pandemic. However, to our knowledge, there is no high-quality comparative evidence regarding patient satisfaction and quality of information recall with online video consultation and traditional face-to-face consultation. This lack of evidence is especially concerning in the most demanding consultations. We aimed to assess whether online video consultation between patients and surgeons before major abdominal surgery was non-inferior to face-to-face consultation in terms of patient satisfaction, and to assess effects on patient information recall.

Methods

This open-label, randomised, controlled, non-inferiority trial (VIDEOGO) was conducted at two hospitals (one academic and one regional) in the Netherlands. Adult patients (aged ≥18 years) who required consultation with a surgeon to discuss major abdominal surgery and were able and willing to interact via both online video and face-to-face consultation were eligible for inclusion; patients were excluded if they were unable or unwilling to start or maintain online video consultation. Eligible patients were randomly allocated (1:1) to online video or face-to-face consultation by the study coordinator, using a computer-generated, concealed, permuted-block randomisation method with varying block sizes (two, four, and six patients), stratified by study site. Masking of patients and health-care providers was not possible owing to the nature of the study. The primary outcomes were patient satisfaction (score 0–100; assessed for non-inferiority with a predefined margin of −10%) and information recall (score 0–11), both of which were assessed with online questionnaires and analysed in the intention-to-treat population for whom outcome data were available. Technical adverse events were assessed directly after the consultation as part of the satisfaction questionnaire. This trial is registered with the International Clinical Trial Registry Platform and the Central Committee on Research Involving Human Subjects registry, NL-OMON20092, and is complete.

Findings

Between Feb 13, 2021, and Oct 2, 2023, 120 patients were randomly assigned: 60 to online video consultation and 60 to face-to-face consultation. Outcome data were available for 57 patients in the online video consultation group (20 [35%] female and 37 [65%] male; median age 64·0 [54·5–72·5] years) and 55 patients in the face-to-face group (22 [40%] female and 33 [60%] male; median age 62·0 [56·0–70·0] years). The mean patient satisfaction score was 85·4 out of 100 (SD 12·3) in the online video consultation group and 85·2 (14·2) in the face-to-face group (mean difference 0·2, 95% CI −4·8 to 5·1), which was within the non-inferiority margin of −10% (p_{non-inferiority}<0·0001). The mean information recall score was 7·30 out of 11 (SD 1·60) in the

背景：在2019冠状病毒病大流行期间，患者和卫生保健提供者之间的在线视频会诊迅速普及。然而，据我们所知，关于在线视频咨询和传统面对面咨询的患者满意度和信息回忆质量，没有高质量的比较证据。在要求最高的磋商中，这种证据的缺乏尤其令人关切。我们的目的是评估就患者满意度而言，腹部大手术前患者与外科医生之间的在线视频咨询是否不逊于面对面咨询，并评估对患者信息回忆的影响。方法：这项开放标签、随机、对照、非劣效性试验（video o）在荷兰的两家医院（一家是学术医院，一家是地区医院）进行。成年患者（年龄≥18岁）需要咨询外科医生讨论腹部大手术，并且能够并愿意通过在线视频和面对面咨询进行互动，符合纳入条件；如果患者不能或不愿意开始或维持在线视频咨询，则被排除在外。符合条件的患者被随机分配（1:1）进行在线视频或面对面咨询，由研究协调员使用计算机生成的，隐藏的，排列块随机化方法，不同块大小（2名，4名和6名患者），按研究地点分层。由于研究的性质，不可能掩盖患者和保健提供者。主要结局为患者满意度(0-100分；评估非劣效性（预先设定的差值为-10%）和信息召回（评分0-11），这两项均通过在线问卷进行评估，并在有结果数据的意向治疗人群中进行分析。技术不良事件在咨询后直接作为满意度问卷的一部分进行评估。该试验已在国际临床试验注册平台和涉及人类受试者的研究中央委员会注册，编号为NL-OMON20092，并已完成。研究结果：在2021年2月13日至2023年10月2日期间，120名患者被随机分配：60名患者进行在线视频咨询，60名患者进行面对面咨询。在线视频咨询组57例患者的结局数据可用(20例[35%]女性，37例[65%]男性；中位年龄64·0[55.4 ~ 72.5]岁)，面对面组55例，其中女性22例[40%]，男性33例[60%]；中位年龄62岁[56岁~ 70岁]。在线视频咨询组的平均患者满意度得分为85.4分（SD为12.3），面对面咨询组的平均满意度得分为85.2分（14.2分）（平均差异为0.2,95% CI为- 4.8 ~ 5.1），均在-10%的非劣效范围内（pnon-劣效）。在腹部大手术的外科会诊中，使用在线视频会诊在患者满意度方面不低于面对面会诊，并且对信息回忆没有实质性影响。这些研究结果表明，在线视频会诊可以在外科门诊放心地实施。资助：荷兰卫生研究与发展组织。

{"title":"Online video versus face-to-face preoperative consultation for major abdominal surgery (VIDEOGO): a multicentre, open-label, randomised, controlled, non-inferiority trial","authors":"Britte H E A ten Haaft MD , Boris V Janssen BSc , Esther Z Barsom MD PhD , Prof Wouter J K Hehenkamp MD PhD , Prof Mark I van Berge Henegouwen MD PhD , Prof Olivier R Busch MD PhD , Susan van Dieren PhD , Joris I Erdmann MD PhD , Wietse J Eshuis MD PhD , Suzanne S Gisbertz MD PhD , Prof Misha D P Luyer MD PhD , Olga C Damman PhD , Prof Martine C de Bruijne MD PhD , Prof Geert Kazemier MD PhD , Prof Marlies P Schijven MD PhD , Prof Marc G Besselink MD PhD","doi":"10.1016/j.landig.2025.02.007","DOIUrl":"10.1016/j.landig.2025.02.007","url":null,"abstract":"<div><h3>Background</h3><div>Online video consultation between patients and health-care providers rapidly gained popularity during the COVID-19 pandemic. However, to our knowledge, there is no high-quality comparative evidence regarding patient satisfaction and quality of information recall with online video consultation and traditional face-to-face consultation. This lack of evidence is especially concerning in the most demanding consultations. We aimed to assess whether online video consultation between patients and surgeons before major abdominal surgery was non-inferior to face-to-face consultation in terms of patient satisfaction, and to assess effects on patient information recall.</div></div><div><h3>Methods</h3><div>This open-label, randomised, controlled, non-inferiority trial (VIDEOGO) was conducted at two hospitals (one academic and one regional) in the Netherlands. Adult patients (aged ≥18 years) who required consultation with a surgeon to discuss major abdominal surgery and were able and willing to interact via both online video and face-to-face consultation were eligible for inclusion; patients were excluded if they were unable or unwilling to start or maintain online video consultation. Eligible patients were randomly allocated (1:1) to online video or face-to-face consultation by the study coordinator, using a computer-generated, concealed, permuted-block randomisation method with varying block sizes (two, four, and six patients), stratified by study site. Masking of patients and health-care providers was not possible owing to the nature of the study. The primary outcomes were patient satisfaction (score 0–100; assessed for non-inferiority with a predefined margin of −10%) and information recall (score 0–11), both of which were assessed with online questionnaires and analysed in the intention-to-treat population for whom outcome data were available. Technical adverse events were assessed directly after the consultation as part of the satisfaction questionnaire. This trial is registered with the International Clinical Trial Registry Platform and the Central Committee on Research Involving Human Subjects registry, NL-OMON20092, and is complete.</div></div><div><h3>Findings</h3><div>Between Feb 13, 2021, and Oct 2, 2023, 120 patients were randomly assigned: 60 to online video consultation and 60 to face-to-face consultation. Outcome data were available for 57 patients in the online video consultation group (20 [35%] female and 37 [65%] male; median age 64·0 [54·5–72·5] years) and 55 patients in the face-to-face group (22 [40%] female and 33 [60%] male; median age 62·0 [56·0–70·0] years). The mean patient satisfaction score was 85·4 out of 100 (SD 12·3) in the online video consultation group and 85·2 (14·2) in the face-to-face group (mean difference 0·2, 95% CI −4·8 to 5·1), which was within the non-inferiority margin of −10% (p<sub>non-inferiority</sub><0·0001). The mean information recall score was 7·30 out of 11 (SD 1·60) in the","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 6","pages":"Article 100867"},"PeriodicalIF":23.8,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144310552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Video in the clinic: advancing care for patients, professionals, and the planet 视频在诊所：推进护理病人，专业人员和地球。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.100875

Lars Henrik Jensen

引用次数: 0

Importance of sample size on the quality and utility of AI-based prediction models for healthcare 样本大小对基于人工智能的医疗保健预测模型的质量和效用的重要性。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-06-01 DOI: 10.1016/j.landig.2025.01.013

Prof Richard D Riley PhD , Joie Ensor PhD , Kym I E Snell PhD , Lucinda Archer PhD , Rebecca Whittle PhD , Paula Dhiman PhD , Joseph Alderman MBChB , Xiaoxuan Liu PhD , Laura Kirton MSc , Jay Manson-Whitton , Maarten van Smeden PhD , Prof Karel G Moons PhD , Prof Krishnarajah Nirantharakumar MD , Prof Jean-Baptiste Cazier PhD , Prof Alastair K Denniston PhD , Prof Ben Van Calster PhD , Prof Gary S Collins PhD

Rigorous study design and analytical standards are required to generate reliable findings in healthcare from artificial intelligence (AI) research. One crucial but often overlooked aspect is the determination of appropriate sample sizes for studies developing AI-based prediction models for individual diagnosis or prognosis. Specifically, the number of participants and outcome events required in datasets for model training and evaluation remains inadequately addressed. Most AI studies do not provide a rationale for their chosen sample sizes and frequently rely on datasets that are inadequate for training or evaluating a clinical prediction model. Among the ten principles of Good Machine Learning Practice established by the US Food and Drug Administration, the UK Medicines and Healthcare products Regulatory Agency, and Health Canada, guidance on sample size is directly relevant to at least three principles. To reinforce this recommendation, we outline seven reasons why inadequate sample size negatively affects model training, evaluation, and performance. Using a range of examples, we illustrate these issues and discuss the potentially harmful consequences for patient care and clinical adoption. Additionally, we address challenges associated with increasing sample sizes in AI research and highlight existing approaches and software for calculating the minimum sample sizes required for model training and evaluation.

为了从人工智能（AI）研究中获得可靠的医疗保健结果，需要严格的研究设计和分析标准。一个关键但经常被忽视的方面是确定适当的样本量，用于开发基于人工智能的个体诊断或预后预测模型的研究。具体来说，模型训练和评估所需的数据集中的参与者和结果事件的数量仍然没有得到充分的解决。大多数人工智能研究没有为其选择的样本量提供基本原理，并且经常依赖于不足以训练或评估临床预测模型的数据集。在由美国食品和药物管理局、英国药品和保健产品监管局和加拿大卫生部建立的良好机器学习实践的十大原则中，关于样本量的指导至少与三项原则直接相关。为了加强这一建议，我们列出了七个原因，为什么样本量不足会对模型训练、评估和性能产生负面影响。通过一系列的例子，我们说明了这些问题，并讨论了对患者护理和临床采用的潜在有害后果。此外，我们还解决了与人工智能研究中样本量增加相关的挑战，并强调了用于计算模型训练和评估所需的最小样本量的现有方法和软件。

{"title":"Importance of sample size on the quality and utility of AI-based prediction models for healthcare","authors":"Prof Richard D Riley PhD , Joie Ensor PhD , Kym I E Snell PhD , Lucinda Archer PhD , Rebecca Whittle PhD , Paula Dhiman PhD , Joseph Alderman MBChB , Xiaoxuan Liu PhD , Laura Kirton MSc , Jay Manson-Whitton , Maarten van Smeden PhD , Prof Karel G Moons PhD , Prof Krishnarajah Nirantharakumar MD , Prof Jean-Baptiste Cazier PhD , Prof Alastair K Denniston PhD , Prof Ben Van Calster PhD , Prof Gary S Collins PhD","doi":"10.1016/j.landig.2025.01.013","DOIUrl":"10.1016/j.landig.2025.01.013","url":null,"abstract":"<div><div>Rigorous study design and analytical standards are required to generate reliable findings in healthcare from artificial intelligence (AI) research. One crucial but often overlooked aspect is the determination of appropriate sample sizes for studies developing AI-based prediction models for individual diagnosis or prognosis. Specifically, the number of participants and outcome events required in datasets for model training and evaluation remains inadequately addressed. Most AI studies do not provide a rationale for their chosen sample sizes and frequently rely on datasets that are inadequate for training or evaluating a clinical prediction model. Among the ten principles of Good Machine Learning Practice established by the US Food and Drug Administration, the UK Medicines and Healthcare products Regulatory Agency, and Health Canada, guidance on sample size is directly relevant to at least three principles. To reinforce this recommendation, we outline seven reasons why inadequate sample size negatively affects model training, evaluation, and performance. Using a range of examples, we illustrate these issues and discuss the potentially harmful consequences for patient care and clinical adoption. Additionally, we address challenges associated with increasing sample sizes in AI research and highlight existing approaches and software for calculating the minimum sample sizes required for model training and evaluation.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 6","pages":"Article 100857"},"PeriodicalIF":23.8,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144217271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Correction to Lancet Digit Health 2024; 6: e386–95 《柳叶刀数字健康2024》修正；6: e386 - 95。

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health

Pub Date : 2025-05-01 DOI: 10.1016/j.landig.2025.100877

引用次数: 0

首页上一页

3
4
5
6
7
8
9
10

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Lancet Digital Health

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀