Aims: Stress echocardiography (SE) is widely used for assessing coronary artery disease, but volumetric chamber analysis during SE is limited by time-consuming manual tracings and operator-dependent variability. Automated evaluation may overcome these barriers and enhance efficiency.
Methods and results: This multi-centre study included 240 participants undergoing pharmacological SE for ischaemic heart disease evaluation from five sites in four countries. SE imaging data from apical four-chamber and two-chamber views were acquired during rest and stress phases. Expert cardiologists manually traced endocardial borders for left ventricular (LV), left atrial (LA) and right ventricular (RV), right atrial (RA) areas, which were compared to machine learning (ML) derived measurements. Image quality was categorized as optimal, good, fair, or poor, and its influence on ML performance was analysed. Statistical methods included Intraclass Correlation Coefficients (ICCs), Bland-Altman testing, and within-patient coefficient of variation. The yield of the ML algorithm demonstrated consistency across rest and stress phases. It demonstrated strong agreement with cardiologists for LV and LA volumes, with ICCs ranging from 0.84 to 0.93 across rest and stress conditions. RA and RV areas measurements showed moderate correlations, with better agreement at rest than during stress phases. Image quality significantly influenced ML performance, as poor-quality images reduced diagnostic yield.
Conclusion: AI-driven volumetric analysis is a reliable method for quantifying left-sided heart chambers during pharmacological SE, with results closely matching expert measurements. Moderate reliability for right-sided chambers highlights the need for high-quality imaging and standardized protocols. AI integration may streamline SE workflows and support improved clinical decision-making.
{"title":"Artificial intelligence implementation in automated heart chambers quantification during pharmacological stress echocardiography.","authors":"Arnas Karuzas, Quirino Ciampi, Ieva Kazukauskiene, Laurynas Miscikas, Karolis Sablauskas, Antanas Kiziela, Dovydas Verikas, Jurgita Plisiene, Vaiva Lesauskaite, Lauro Cortigiani, Karina Wierzbowska-Drabik, Jaroslaw D Kasprzak, Jorge Lowenstein, Costantina Prota, Nicola Gaibazzi, Domenico Tuttolomondo, Attilio Lepone, Sofia Marconi, Rosina Arbucci, Eugenio Picano","doi":"10.1093/ehjdh/ztaf121","DOIUrl":"10.1093/ehjdh/ztaf121","url":null,"abstract":"<p><strong>Aims: </strong>Stress echocardiography (SE) is widely used for assessing coronary artery disease, but volumetric chamber analysis during SE is limited by time-consuming manual tracings and operator-dependent variability. Automated evaluation may overcome these barriers and enhance efficiency.</p><p><strong>Methods and results: </strong>This multi-centre study included 240 participants undergoing pharmacological SE for ischaemic heart disease evaluation from five sites in four countries. SE imaging data from apical four-chamber and two-chamber views were acquired during rest and stress phases. Expert cardiologists manually traced endocardial borders for left ventricular (LV), left atrial (LA) and right ventricular (RV), right atrial (RA) areas, which were compared to machine learning (ML) derived measurements. Image quality was categorized as optimal, good, fair, or poor, and its influence on ML performance was analysed. Statistical methods included Intraclass Correlation Coefficients (ICCs), Bland-Altman testing, and within-patient coefficient of variation. The yield of the ML algorithm demonstrated consistency across rest and stress phases. It demonstrated strong agreement with cardiologists for LV and LA volumes, with ICCs ranging from 0.84 to 0.93 across rest and stress conditions. RA and RV areas measurements showed moderate correlations, with better agreement at rest than during stress phases. Image quality significantly influenced ML performance, as poor-quality images reduced diagnostic yield.</p><p><strong>Conclusion: </strong>AI-driven volumetric analysis is a reliable method for quantifying left-sided heart chambers during pharmacological SE, with results closely matching expert measurements. Moderate reliability for right-sided chambers highlights the need for high-quality imaging and standardized protocols. AI integration may streamline SE workflows and support improved clinical decision-making.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"7 1","pages":"ztaf121"},"PeriodicalIF":4.4,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12821065/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-23eCollection Date: 2026-01-01DOI: 10.1093/ehjdh/ztaf123
Asgher Champsi, Karin T Slater, Simrat Gill, Tomasz Dyszynski, Megan Schröder, Kiliana Suzart-Woischnik, Benoit Tyl, Guillaume Allée, Alfonso Sartorius, R Thomas Lumbers, Folkert W Asselbergs, Diederick E Grobbee, Georgios Gkoutos, Dipak Kotecha
Aims: Coded healthcare data are now commonly used in clinical research. This study aimed to assess the transparency of reporting within heart failure studies and employ machine learning to facilitate larger-scale evaluation.
Methods & results: A systematic search of EMBASE and MEDLINE (2015-2020) identified 4279 heart failure studies with accessible Extensible Markup Language published in the top 25 journals by impact factor. Manual extraction in a random sample of 170 studies by independent human reviewers characterized 40 studies (23.5%) that used coded healthcare data, with 34 of these (85%) reporting doing so and only 19 (47.5%) providing clear descriptions of dataset construction and linkage. Another 420 studies underwent manual annotation to further train a Natural Language Processing (NLP) model designed for this study to automate and upscale review. The NLP model processed 3689 studies with a high level of internal accuracy (area under the receiver operating characteristic curve 0.97 and F1 score 0.96). Overall, the NLP approach identified 782 studies (21.2%) that reported coded healthcare data usage (95% CI 19.8-20.9%). No correlation was found between the reporting of coded healthcare data use and the publication year (r = -0.05; P = 0.21) or citation count (r = -0.13; P = 0.12).
Conclusion: One-fifth of contemporary heart failure research articles are already reporting the use of coded healthcare data, with at-scale evaluation facilitated by a machine-learning model. The limited transparency on how coded healthcare data were used in studies highlights the need for quality standards such as the CODE-EHR framework for the use of healthcare data in research.
目的:编码医疗数据现在普遍用于临床研究。本研究旨在评估心力衰竭研究报告的透明度,并利用机器学习促进更大规模的评估。方法与结果:系统检索EMBASE和MEDLINE(2015-2020),确定了4279篇使用可访问的可扩展标记语言发表在影响因子排名前25的期刊上的心力衰竭研究。由独立的人类审稿人在170项研究的随机样本中进行人工提取,发现40项研究(23.5%)使用了编码的医疗保健数据,其中34项(85%)报告了这样做,只有19项(47.5%)提供了数据集构建和链接的清晰描述。另外420项研究进行了手动注释,以进一步训练为本研究设计的自然语言处理(NLP)模型,以实现自动化和高级审查。NLP模型以较高的内部精度(接收者工作特征曲线下面积0.97,F1得分0.96)处理了3689项研究。总体而言,NLP方法确定了782项研究(21.2%)报告了编码的医疗保健数据使用情况(95% CI 19.8-20.9%)。编码医疗保健数据使用报告与发表年份(r = -0.05; P = 0.21)或引用次数(r = -0.13; P = 0.12)之间没有相关性。结论:五分之一的当代心力衰竭研究文章已经报告了编码医疗数据的使用,并通过机器学习模型促进了大规模评估。关于如何在研究中使用编码的医疗保健数据的透明度有限,这突出表明需要制定质量标准,例如在研究中使用医疗保健数据的CODE-EHR框架。
{"title":"Machine learning-enabled systematic review on coded healthcare data in heart failure research.","authors":"Asgher Champsi, Karin T Slater, Simrat Gill, Tomasz Dyszynski, Megan Schröder, Kiliana Suzart-Woischnik, Benoit Tyl, Guillaume Allée, Alfonso Sartorius, R Thomas Lumbers, Folkert W Asselbergs, Diederick E Grobbee, Georgios Gkoutos, Dipak Kotecha","doi":"10.1093/ehjdh/ztaf123","DOIUrl":"10.1093/ehjdh/ztaf123","url":null,"abstract":"<p><strong>Aims: </strong>Coded healthcare data are now commonly used in clinical research. This study aimed to assess the transparency of reporting within heart failure studies and employ machine learning to facilitate larger-scale evaluation.</p><p><strong>Methods & results: </strong>A systematic search of EMBASE and MEDLINE (2015-2020) identified 4279 heart failure studies with accessible Extensible Markup Language published in the top 25 journals by impact factor. Manual extraction in a random sample of 170 studies by independent human reviewers characterized 40 studies (23.5%) that used coded healthcare data, with 34 of these (85%) reporting doing so and only 19 (47.5%) providing clear descriptions of dataset construction and linkage. Another 420 studies underwent manual annotation to further train a Natural Language Processing (NLP) model designed for this study to automate and upscale review. The NLP model processed 3689 studies with a high level of internal accuracy (area under the receiver operating characteristic curve 0.97 and F1 score 0.96). Overall, the NLP approach identified 782 studies (21.2%) that reported coded healthcare data usage (95% CI 19.8-20.9%). No correlation was found between the reporting of coded healthcare data use and the publication year (r = <sup>-</sup>0.05; <i>P</i> = 0.21) or citation count (r = <sup>-</sup>0.13; <i>P</i> = 0.12).</p><p><strong>Conclusion: </strong>One-fifth of contemporary heart failure research articles are already reporting the use of coded healthcare data, with at-scale evaluation facilitated by a machine-learning model. The limited transparency on how coded healthcare data were used in studies highlights the need for quality standards such as the CODE-EHR framework for the use of healthcare data in research.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"7 1","pages":"ztaf123"},"PeriodicalIF":4.4,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12821059/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-22eCollection Date: 2026-01-01DOI: 10.1093/ehjdh/ztaf122
Heng-Yu Pan, Benny Wei-Yun Hsu, Chun-Ti Chou, Yuan-Yuan Hsu, Chih-Kuo Lee, Wen-Jeng Lee, Tai-Ming Ko, Vincent S Tseng, Tzung-Dau Wang
Aims: To propose a novel deep learning-based method, the eLVMass-Net, for the estimation of left ventricular mass (LVM) based on 12-lead electrocardiograms (ECGs).
Methods and results: We developed a deep learning model for LVM estimation using raw ECG signals, demographic data, and ECG parameters as input by using TW-CVAI dataset (n = 1459). Synchronized single-heartbeat waveforms were processed using a temporal convolutional network (TCN). Ground-truth LVM values were obtained from coronary computed tomography angiography. We performed external validation on an independent NTUH dataset (n = 2579). To account for sex-specific differences in left ventricular remodelling and body habitus, we further developed separate models for males and females. We compared the performance of the eLVMass-Net, with two state-of-the-art (SOTA) models.Non-sex-specific eLVMass-Net achieved a mean absolute error (MAE) of 14.3 ± 0.7 g and a mean absolute percentage error (MAPE) of 12.9 ± 1.1% between predicted and ground-truth LVM values under five-fold cross-validation. The eLVMass-Net outperformed two SOTA models in terms of both LVM estimation and left ventricular hypertrophy (LVH) classification. Sex-specific design was superior in LVH classification based on estimated LVM (c-statistic: 0.77 ± 0.05 for male model; 0.75 ± 0.05 for female model; 0.70 ± 0.02 for non-sex-specific model; P< 0.01 between both sex-specific models vs. non-sex-specific model). The saliency maps revealed gender-specific differences in how the model weighted ST-T segment features for LVM prediction.
Conclusion: The proposed eLVMass-Net outperformed previously published approaches by ECG pre-processing with synchronized single heartbeat extraction and TCN as ECG encoder. Additionally, the development of sex-specific models proved to be a rational approach.
{"title":"Automated estimation of computed tomography-derived left ventricular mass using sex-specific 12-lead ECG-based temporal convolutional network.","authors":"Heng-Yu Pan, Benny Wei-Yun Hsu, Chun-Ti Chou, Yuan-Yuan Hsu, Chih-Kuo Lee, Wen-Jeng Lee, Tai-Ming Ko, Vincent S Tseng, Tzung-Dau Wang","doi":"10.1093/ehjdh/ztaf122","DOIUrl":"10.1093/ehjdh/ztaf122","url":null,"abstract":"<p><strong>Aims: </strong>To propose a novel deep learning-based method, the eLVMass-Net, for the estimation of left ventricular mass (LVM) based on 12-lead electrocardiograms (ECGs).</p><p><strong>Methods and results: </strong>We developed a deep learning model for LVM estimation using raw ECG signals, demographic data, and ECG parameters as input by using TW-CVAI dataset (<i>n</i> = 1459). Synchronized single-heartbeat waveforms were processed using a temporal convolutional network (TCN). Ground-truth LVM values were obtained from coronary computed tomography angiography. We performed external validation on an independent NTUH dataset (<i>n</i> = 2579). To account for sex-specific differences in left ventricular remodelling and body habitus, we further developed separate models for males and females. We compared the performance of the eLVMass-Net, with two state-of-the-art (SOTA) models.Non-sex-specific eLVMass-Net achieved a mean absolute error (MAE) of 14.3 ± 0.7 g and a mean absolute percentage error (MAPE) of 12.9 ± 1.1% between predicted and ground-truth LVM values under five-fold cross-validation. The eLVMass-Net outperformed two SOTA models in terms of both LVM estimation and left ventricular hypertrophy (LVH) classification. Sex-specific design was superior in LVH classification based on estimated LVM (<i>c</i>-statistic: 0.77 ± 0.05 for male model; 0.75 ± 0.05 for female model; 0.70 ± 0.02 for non-sex-specific model; <i>P</i> <i><</i> 0.01 between both sex-specific models vs. non-sex-specific model). The saliency maps revealed gender-specific differences in how the model weighted ST-T segment features for LVM prediction.</p><p><strong>Conclusion: </strong>The proposed eLVMass-Net outperformed previously published approaches by ECG pre-processing with synchronized single heartbeat extraction and TCN as ECG encoder. Additionally, the development of sex-specific models proved to be a rational approach.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"7 1","pages":"ztaf122"},"PeriodicalIF":4.4,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12821057/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-15eCollection Date: 2025-11-01DOI: 10.1093/ehjdh/ztaf120
Tony Hauptmann, Sven-Oliver Tröbs, Andreas Schulz, Aida Romano Martinez, Philipp Lurz, Jürgen Prochaska, Philipp Sebastian Wild, Stefan Kramer
Aims: Automatic echocardiographic measurements using artificial intelligence have shown promising results; however, they have not been compared with manual measurements regarding heart failure (HF) progression and algorithm runtime.
Methods and results: Data came from the prospective HF study MyoVasc (NCT04064450), which involved a highly standardized 5-h examination, including comprehensive echocardiography, at a dedicated study centre between January 2013 and April 2018. Worsening of HF was a primary composite endpoint, recorded by structured follow-up, death certificates, and medical records. The automated assessment was performed using EchoDL, eight 3D convolutional neural networks (CNNs) trained to predict clinical parameters. Manual and automatic left ventricular ejection fraction (LVEF), E/E'-ratio and left ventricular mass (LVM) demonstrated a good intraclass correlation coefficient {LVEF: 0.75 [95% confidence interval (CI) 0.75-0.77], E/E'-ratio: 0.59 [CI 0.56-0.61], LVM: 0.64 [CI 0.62-0.66]}. After a median follow-up of 3.8 years (IQR 2.1-5.0), 470 patients experienced worsening of HF. In multivariable Cox analysis, comparison of manually and automatically assessed LVEF, E/E'-ratio and LVM demonstrated risk estimates slightly in favour of the CNNs. Direct comparison of C-indices showed significantly better model performance for automatically determined LVEF (0.71 vs. 0.73, P = 0.038) and E/E'-ratio (0.64 vs. 0.66, P = 0.013) and a trend for LVM (0.66 vs. 0.68, P = 0.063). Echo-DL required an average of 1053.4 ms (95% CI 1050.7-1056.0) to analyse a four-second-long echocardiogram.
Conclusion: Automated analysis of echocardiograms using 3D CNNs was comparable to manual measurements in predicting HF-specific outcomes. Echo-DL offers potential time savings and improved risk prediction in clinical settings, allowing integration into echocardiographic hardware.
目的:人工智能自动超声心动图测量显示出良好的结果;然而,还没有将它们与人工测量的心力衰竭(HF)进展和算法运行时间进行比较。方法和结果:数据来自前瞻性心衰研究MyoVasc (NCT04064450),该研究于2013年1月至2018年4月在一个专门的研究中心进行了高度标准化的5小时检查,包括全面的超声心动图。心衰恶化是主要的复合终点,通过结构化随访、死亡证明和医疗记录进行记录。使用EchoDL进行自动评估,8个3D卷积神经网络(cnn)经过训练来预测临床参数。手动和自动左室射血分数(LVEF)、E/E′-比和左室质量(LVM)表现出良好的类内相关系数{LVEF: 0.75[95%可信区间(CI) 0.75 ~ 0.77], E/E′-比:0.59 [CI 0.56 ~ 0.61], LVM: 0.64 [CI 0.62 ~ 0.66]}。中位随访3.8年(IQR 2.1-5.0)后,470例患者心衰恶化。在多变量Cox分析中,人工和自动评估的LVEF、E/E’-ratio和LVM的比较显示,风险估计略微偏向cnn。c指数的直接比较表明,自动确定的LVEF (0.71 vs. 0.73, P = 0.038)和E/E'-ratio (0.64 vs. 0.66, P = 0.013)的模型性能明显更好,LVM (0.66 vs. 0.68, P = 0.063)有趋势。Echo-DL平均需要1053.4 ms (95% CI 1050.7-1056.0)来分析4秒长的超声心动图。结论:使用3D cnn自动分析超声心动图在预测hf特异性结果方面与人工测量相当。Echo-DL在临床环境中提供了潜在的时间节省和改进的风险预测,允许集成到超声心动图硬件。
{"title":"Echocardiographic measures read by artificial intelligence enable accurate and rapid prediction of the worsening of heart failure.","authors":"Tony Hauptmann, Sven-Oliver Tröbs, Andreas Schulz, Aida Romano Martinez, Philipp Lurz, Jürgen Prochaska, Philipp Sebastian Wild, Stefan Kramer","doi":"10.1093/ehjdh/ztaf120","DOIUrl":"10.1093/ehjdh/ztaf120","url":null,"abstract":"<p><strong>Aims: </strong>Automatic echocardiographic measurements using artificial intelligence have shown promising results; however, they have not been compared with manual measurements regarding heart failure (HF) progression and algorithm runtime.</p><p><strong>Methods and results: </strong>Data came from the prospective HF study MyoVasc (NCT04064450), which involved a highly standardized 5-h examination, including comprehensive echocardiography, at a dedicated study centre between January 2013 and April 2018. Worsening of HF was a primary composite endpoint, recorded by structured follow-up, death certificates, and medical records. The automated assessment was performed using EchoDL, eight 3D convolutional neural networks (CNNs) trained to predict clinical parameters. Manual and automatic left ventricular ejection fraction (LVEF), <i>E</i>/<i>E</i>'-ratio and left ventricular mass (LVM) demonstrated a good intraclass correlation coefficient {LVEF: 0.75 [95% confidence interval (CI) 0.75-0.77], <i>E</i>/<i>E</i>'-ratio: 0.59 [CI 0.56-0.61], LVM: 0.64 [CI 0.62-0.66]}. After a median follow-up of 3.8 years (IQR 2.1-5.0), 470 patients experienced worsening of HF. In multivariable Cox analysis, comparison of manually and automatically assessed LVEF, <i>E</i>/<i>E</i>'-ratio and LVM demonstrated risk estimates slightly in favour of the CNNs. Direct comparison of <i>C</i>-indices showed significantly better model performance for automatically determined LVEF (0.71 vs. 0.73, <i>P</i> = 0.038) and <i>E</i>/<i>E</i>'-ratio (0.64 vs. 0.66, <i>P</i> = 0.013) and a trend for LVM (0.66 vs. 0.68, <i>P</i> = 0.063). Echo-DL required an average of 1053.4 ms (95% CI 1050.7-1056.0) to analyse a four-second-long echocardiogram.</p><p><strong>Conclusion: </strong>Automated analysis of echocardiograms using 3D CNNs was comparable to manual measurements in predicting HF-specific outcomes. Echo-DL offers potential time savings and improved risk prediction in clinical settings, allowing integration into echocardiographic hardware.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"6 6","pages":"1246-1256"},"PeriodicalIF":4.4,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12629647/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-13eCollection Date: 2025-11-01DOI: 10.1093/ehjdh/ztaf116
Shuang Leng, Nicholas Cheng, Eddy Tan, Lohendran Baskaran, Lynette Teo, Min Sen Yew, Kee Yuan Ngiam, Weimin Huang, Ping Chai, Ching Ching Ong, Ching Hui Sia, Malay Singh, Yan Ting Loong, Nur A S Raffiee, Xiaomeng Wang, John Allen, Swee Yaw Tan, Mark Chan, Hwee Kuan Lee, Liang Zhong
Aims: Epicardial adipose tissue (EAT), located within the pericardial sac, has emerged as a biomarker for coronary artery disease (CAD) progression. This study aimed to develop and validate a deep learning-based system for automated EAT volume quantification using non-contrast computed tomography (NCCT) scans from a large, multi-centre, pan-Asian cohort.
Methods and results: A total of 1243 NCCT patient scans from three centres were used to train and internally validate a deep learning model based on 3D UNet++ architecture for pericardium segmentation, followed by intensity thresholding to derive EAT volume. Epicardial adipose tissue quantification required ∼30 s per scan. The final model was evaluated on an external testing cohort of 160 patients, including 90 non-Asian individuals. In this cohort, AI-predicted EAT volumes showed excellent agreement with expert annotations (r = 0.975; P < 0.0001). The Bland-Altman analysis demonstrated a mean bias of -5.2 cm3with 95% limits of agreement from -25.1 to 14.7 cm3. Among the non-Asian subgroup, model performance remained strong (r = 0.970; bias, -3.2 cm3; limits of agreement, -25.1-18.7 cm3). AI-derived EAT volume was independently associated with obstructive CAD (odds ratio 1.11; 95% confidence interval, 1.04-1.19; P = 0.004), after adjusting for confounders. The global χ2 statistic increased from 81.7 with coronary calcium score alone to 93.3 when EAT volume was added (P = 0.001), indicating improved risk prediction.
Conclusion: We developed and validated a deep learning system for automated EAT volume quantification from NCCT scans. The model demonstrated high accuracy and generalizability across ethnically diverse populations, supporting its potential for routine EAT assessment and CAD risk stratification.
{"title":"Deep learning-based quantification of epicardial adipose tissue volume from non-contrast computed tomography images: a multi-centre study.","authors":"Shuang Leng, Nicholas Cheng, Eddy Tan, Lohendran Baskaran, Lynette Teo, Min Sen Yew, Kee Yuan Ngiam, Weimin Huang, Ping Chai, Ching Ching Ong, Ching Hui Sia, Malay Singh, Yan Ting Loong, Nur A S Raffiee, Xiaomeng Wang, John Allen, Swee Yaw Tan, Mark Chan, Hwee Kuan Lee, Liang Zhong","doi":"10.1093/ehjdh/ztaf116","DOIUrl":"10.1093/ehjdh/ztaf116","url":null,"abstract":"<p><strong>Aims: </strong>Epicardial adipose tissue (EAT), located within the pericardial sac, has emerged as a biomarker for coronary artery disease (CAD) progression. This study aimed to develop and validate a deep learning-based system for automated EAT volume quantification using non-contrast computed tomography (NCCT) scans from a large, multi-centre, pan-Asian cohort.</p><p><strong>Methods and results: </strong>A total of 1243 NCCT patient scans from three centres were used to train and internally validate a deep learning model based on 3D UNet++ architecture for pericardium segmentation, followed by intensity thresholding to derive EAT volume. Epicardial adipose tissue quantification required ∼30 s per scan. The final model was evaluated on an external testing cohort of 160 patients, including 90 non-Asian individuals. In this cohort, AI-predicted EAT volumes showed excellent agreement with expert annotations (<i>r</i> = 0.975; <i>P</i> < 0.0001). The Bland-Altman analysis demonstrated a mean bias of -5.2 cm<sup>3</sup>with 95% limits of agreement from -25.1 to 14.7 cm<sup>3</sup>. Among the non-Asian subgroup, model performance remained strong (<i>r</i> = 0.970; bias, -3.2 cm<sup>3</sup>; limits of agreement, -25.1-18.7 cm<sup>3</sup>). AI-derived EAT volume was independently associated with obstructive CAD (odds ratio 1.11; 95% confidence interval, 1.04-1.19; <i>P</i> = 0.004), after adjusting for confounders. The global χ<sup>2</sup> statistic increased from 81.7 with coronary calcium score alone to 93.3 when EAT volume was added (<i>P</i> = 0.001), indicating improved risk prediction.</p><p><strong>Conclusion: </strong>We developed and validated a deep learning system for automated EAT volume quantification from NCCT scans. The model demonstrated high accuracy and generalizability across ethnically diverse populations, supporting its potential for routine EAT assessment and CAD risk stratification.</p><p><strong>Trial registration: </strong>ClinicalTrials.gov Identifier: NCT05509010.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"6 6","pages":"1223-1233"},"PeriodicalIF":4.4,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12629654/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-09eCollection Date: 2026-01-01DOI: 10.1093/ehjdh/ztaf115
Marie-Ange Fleury, Louis Ohl, Lionel Tastet, Mickaël Leclercq, Frédéric Precioso, Pierre-Alexandre Mattei, Romain Capoulade, Kathia Abdoun, Élisabeth Bédard, Marie Arsenault, Jonathan Beaudoin, Mathieu Bernier, Erwan Salaun, Jérémy Bernard, Mylène Shen, Sébastien Hecht, Nancy Côté, Arnaud Droit, Philippe Pibarot
Aims: There is a lack of studies investigating the pathophysiologic and phenotypic distinctiveness of aortic stenosis (AS). This heterogeneity has important implications for identifying optimal intervention timing and potential medical management. This study seeks to identify phenogroups of AS using unsupervised machine learning to improve risk stratification.
Methods and results: A total of 349 patients with asymptomatic AS from the PROGRESSA study were included in this analysis. Echocardiographic, clinical and blood sample data were used in the unsupervised clustering process. Longitudinal echocardiographic data were used to evaluate AS progression. Five clusters of patients were revealed using 18 variables selected by an unsupervised machine learning algorithm. Amongst them, aortic valvular phenotype, mean gradient, peak jet velocity (Vpeak), and left ventricle stroke volume were selected as discriminatory variables. Following the clustering process, characteristics differed between clusters, including age, body mass index, and sex ratio (all P < 0.001). Of note, cluster 1 showed higher AS severity at baseline with significantly higher initial Vpeak (344 [314; 376] cm/s) and calcium score (1257 [806; 1837] UA) (P < 0.001). Patients from cluster 1 had a faster AS progression (progression of Vpeak = 22 [9; 39] cm/s/year), and calcium score (213 [111; 307] UA/year) (P < 0.001). Cluster 1 was also associated with a higher composite risk of mortality and aortic valve replacement when adjusted for age, sex, and baseline AS severity (P < 0.001).
Conclusion: Artificial intelligence-guided phenotypic classification revealed 5 distinct groups and enhanced risk stratification of patients with AS. This approach may be useful to optimize and individualize medical and interventional management of AS.
{"title":"Unsupervised machine learning analysis to enhance risk stratification in patients with asymptomatic aortic stenosis.","authors":"Marie-Ange Fleury, Louis Ohl, Lionel Tastet, Mickaël Leclercq, Frédéric Precioso, Pierre-Alexandre Mattei, Romain Capoulade, Kathia Abdoun, Élisabeth Bédard, Marie Arsenault, Jonathan Beaudoin, Mathieu Bernier, Erwan Salaun, Jérémy Bernard, Mylène Shen, Sébastien Hecht, Nancy Côté, Arnaud Droit, Philippe Pibarot","doi":"10.1093/ehjdh/ztaf115","DOIUrl":"10.1093/ehjdh/ztaf115","url":null,"abstract":"<p><strong>Aims: </strong>There is a lack of studies investigating the pathophysiologic and phenotypic distinctiveness of aortic stenosis (AS). This heterogeneity has important implications for identifying optimal intervention timing and potential medical management. This study seeks to identify phenogroups of AS using unsupervised machine learning to improve risk stratification.</p><p><strong>Methods and results: </strong>A total of 349 patients with asymptomatic AS from the PROGRESSA study were included in this analysis. Echocardiographic, clinical and blood sample data were used in the unsupervised clustering process. Longitudinal echocardiographic data were used to evaluate AS progression. Five clusters of patients were revealed using 18 variables selected by an unsupervised machine learning algorithm. Amongst them, aortic valvular phenotype, mean gradient, peak jet velocity (V<sub>peak</sub>), and left ventricle stroke volume were selected as discriminatory variables. Following the clustering process, characteristics differed between clusters, including age, body mass index, and sex ratio (all <i>P</i> < 0.001). Of note, cluster 1 showed higher AS severity at baseline with significantly higher initial V<sub>peak</sub> (344 [314; 376] cm/s) and calcium score (1257 [806; 1837] UA) (<i>P</i> < 0.001). Patients from cluster 1 had a faster AS progression (progression of V<sub>peak</sub> = 22 [9; 39] cm/s/year), and calcium score (213 [111; 307] UA/year) (<i>P</i> < 0.001). Cluster 1 was also associated with a higher composite risk of mortality and aortic valve replacement when adjusted for age, sex, and baseline AS severity (<i>P</i> < 0.001).</p><p><strong>Conclusion: </strong>Artificial intelligence-guided phenotypic classification revealed 5 distinct groups and enhanced risk stratification of patients with AS. This approach may be useful to optimize and individualize medical and interventional management of AS.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"7 1","pages":"ztaf115"},"PeriodicalIF":4.4,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12821062/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-07eCollection Date: 2026-01-01DOI: 10.1093/ehjdh/ztaf113
Hao Jia, Yifan Wang, Zhimin Lv, Yiqi Zhao, Ningning Zhang, Xiulin Zhang, Wentao Wang, Yihang Feng, Weiteng Wang, Hao Cui, Yuyang Liu, Zheng Gao, Han Mo, Han Han, Yuhong Hu, Xijia Shao, Xiao Chen, Daniel Reichart, Jiangping Song
Aims: Non-ischaemic dilated cardiomyopathy (NIDCM) is a major cause of heart failure (HF) and heart transplantation (HTx), characterized by heterogeneity in aetiology, clinical phenotype, and disease progression. Nevertheless, precision medicine-based diagnostics and treatment strategies for NIDCM remain lacking. This proof-of-concept study aimed to stratify NIDCM patients by pathological features and identify those at high-risk for malignant arrhythmia (MA) and rapid progression to end-stage HF.
Methods and results: 293 NIDCM-HTx patients were included in this study. A total of 3516 heart tissue slides from six representative sites of each patient were analyzed using deep learning-based computational pathology (DL-CPath) and unsupervised clustering to identify pathological subgroups (PGs): PGA, PGB, and PGC. PGA was characterized by interstitial fibrosis, cardiomyocyte vacuolization, microvascular intimal hyperplasia, and myocyte disarray, and had the highest rates of MA (P = 0.03) and the shortest interval from diagnosis to HTx (P = 0.03). PGB showed focal fibrosis, whereas PGC demonstrated the mildest histopathological alterations. For clinical features, PGA showed elevated levels of blood biomarkers indicative of myocardial and secondary organ injury. PGB was associated with extensive fibrosis and significant impairment of ejection fraction. PGC presented with the mildest clinical abnormalities. Although LMNA mutation was a significant non-DL-CPath high-risk factor for MA and rapid NIDCM progression, its distribution did not differ significantly across PGs (P = 0.786).
Conclusion: DL-based pathological classification effectively extracted clinically-meaningful imaging features and enabled the identification of high-risk NIDCM subgroup. Each PG exhibited unique histopathological and clinical characteristics, highlighting distinct phenotypes and risk profiles.
{"title":"Pathological classification of non-ischaemic dilated cardiomyopathy based on deep learning.","authors":"Hao Jia, Yifan Wang, Zhimin Lv, Yiqi Zhao, Ningning Zhang, Xiulin Zhang, Wentao Wang, Yihang Feng, Weiteng Wang, Hao Cui, Yuyang Liu, Zheng Gao, Han Mo, Han Han, Yuhong Hu, Xijia Shao, Xiao Chen, Daniel Reichart, Jiangping Song","doi":"10.1093/ehjdh/ztaf113","DOIUrl":"10.1093/ehjdh/ztaf113","url":null,"abstract":"<p><strong>Aims: </strong>Non-ischaemic dilated cardiomyopathy (NIDCM) is a major cause of heart failure (HF) and heart transplantation (HTx), characterized by heterogeneity in aetiology, clinical phenotype, and disease progression. Nevertheless, precision medicine-based diagnostics and treatment strategies for NIDCM remain lacking. This proof-of-concept study aimed to stratify NIDCM patients by pathological features and identify those at high-risk for malignant arrhythmia (MA) and rapid progression to end-stage HF.</p><p><strong>Methods and results: </strong>293 NIDCM-HTx patients were included in this study. A total of 3516 heart tissue slides from six representative sites of each patient were analyzed using deep learning-based computational pathology (DL-CPath) and unsupervised clustering to identify pathological subgroups (PGs): PGA, PGB, and PGC. PGA was characterized by interstitial fibrosis, cardiomyocyte vacuolization, microvascular intimal hyperplasia, and myocyte disarray, and had the highest rates of MA (<i>P</i> = 0.03) and the shortest interval from diagnosis to HTx (<i>P</i> = 0.03). PGB showed focal fibrosis, whereas PGC demonstrated the mildest histopathological alterations. For clinical features, PGA showed elevated levels of blood biomarkers indicative of myocardial and secondary organ injury. PGB was associated with extensive fibrosis and significant impairment of ejection fraction. PGC presented with the mildest clinical abnormalities. Although <i>LMNA</i> mutation was a significant non-DL-CPath high-risk factor for MA and rapid NIDCM progression, its distribution did not differ significantly across PGs (<i>P</i> = 0.786).</p><p><strong>Conclusion: </strong>DL-based pathological classification effectively extracted clinically-meaningful imaging features and enabled the identification of high-risk NIDCM subgroup. Each PG exhibited unique histopathological and clinical characteristics, highlighting distinct phenotypes and risk profiles.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"7 1","pages":"ztaf113"},"PeriodicalIF":4.4,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12821066/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-06eCollection Date: 2025-11-01DOI: 10.1093/ehjdh/ztaf118
Thomas F Kok, Navin Suthahar, Jesse H Krijthe, Rudolf A de Boer, Eric Boersma, Isabella Kardys
Aims: We aimed to compare performances of conventional survival models with machine learning (ML) survival models for incident heart failure (HF) in men and women without prevalent HF, cardiomyopathy (CM) or ischaemic heart disease (IHD), and to identify potential high-risk precursors overlooked by conventional survival models.
Methods and results: We predicted 10-year risk of incident HF in 266 306 women (2894 events) and 212 061 men (4213 events). We constructed multivariable Cox models, first using ∼ 400 baseline characteristics, and subsequently only those remaining after LASSO stability selection. We also used Random Survival Forest (RSF) and eXtreme Gradient Survival Boosting (XGBoost). Performances were assessed using internal cross validation and hold-out sets, with C-indices, calibration curves and net-benefit analyses. Model performances were comparable during internal validation: XGBoost (C-index ± SE) (men: 0.79 ± 0.0040, women: 0.83 ± 0.0023) showed similar performance to the multivariable Cox model (men: 0.80 ± 0.0031, women: 0.83 ± 0.0022) and Cox models after LASSO stability selection, while RSF showed numerically slightly lower performance (men: 0.78 ± 0.0025, women: 0.81 ± 0.0015). Findings were similar in the hold-out sets. Age, cystatin-C, lifetime treatments/medications, other heart disease, systolic blood pressure, and spirometry measures were identified as high-risk factors in both model types for both sexes. Additionally, sex-specific and model-specific risk factors were identified.
Conclusion: Machine learning models and Cox proportional hazard models performed well and similarly for 10-year incident HF risk prediction in the general population. However, sex-specific and model-specific risk predictors were found. Spirometry measures, rarely included in existing models, were identified as important risk factors. Our results suggest that ML models for HF prediction in the general population reveal insights that would otherwise remain unnoticed.
{"title":"High-dimensional machine learning models for prediction of heart failure in more than 400 000 men and women from the UK Biobank.","authors":"Thomas F Kok, Navin Suthahar, Jesse H Krijthe, Rudolf A de Boer, Eric Boersma, Isabella Kardys","doi":"10.1093/ehjdh/ztaf118","DOIUrl":"10.1093/ehjdh/ztaf118","url":null,"abstract":"<p><strong>Aims: </strong>We aimed to compare performances of conventional survival models with machine learning (ML) survival models for incident heart failure (HF) in men and women without prevalent HF, cardiomyopathy (CM) or ischaemic heart disease (IHD), and to identify potential high-risk precursors overlooked by conventional survival models.</p><p><strong>Methods and results: </strong>We predicted 10-year risk of incident HF in 266 306 women (2894 events) and 212 061 men (4213 events). We constructed multivariable Cox models, first using ∼ 400 baseline characteristics, and subsequently only those remaining after LASSO stability selection. We also used Random Survival Forest (RSF) and eXtreme Gradient Survival Boosting (XGBoost). Performances were assessed using internal cross validation and hold-out sets, with C-indices, calibration curves and net-benefit analyses. Model performances were comparable during internal validation: XGBoost (<i>C</i>-index ± SE) (men: 0.79 ± 0.0040, women: 0.83 ± 0.0023) showed similar performance to the multivariable Cox model (men: 0.80 ± 0.0031, women: 0.83 ± 0.0022) and Cox models after LASSO stability selection, while RSF showed numerically slightly lower performance (men: 0.78 ± 0.0025, women: 0.81 ± 0.0015). Findings were similar in the hold-out sets. Age, cystatin-C, lifetime treatments/medications, other heart disease, systolic blood pressure, and spirometry measures were identified as high-risk factors in both model types for both sexes. Additionally, sex-specific and model-specific risk factors were identified.</p><p><strong>Conclusion: </strong>Machine learning models and Cox proportional hazard models performed well and similarly for 10-year incident HF risk prediction in the general population. However, sex-specific and model-specific risk predictors were found. Spirometry measures, rarely included in existing models, were identified as important risk factors. Our results suggest that ML models for HF prediction in the general population reveal insights that would otherwise remain unnoticed.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"6 6","pages":"1234-1245"},"PeriodicalIF":4.4,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12629653/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-06eCollection Date: 2025-11-01DOI: 10.1093/ehjdh/ztaf117
Ivor B Asztalos, Amy Li, Victoria L Vetter, John K Triedman, Joshua Mayourian
Aims: To assess the potential for artificial intelligence-enabled electrocardiogram (AI-ECG) to serve as a long-term cardiac surveillance tool and predict left ventricular systolic dysfunction in childhood cancer patients.
Methods and results: We assessed performance of our previously established AI-ECG model to predict left ventricular ejection fraction (LVEF) ≤50% and ≤40% in patients with childhood cancer during internal testing (Boston Children's Hospital) and external validation (Children's Hospital of Philadelphia). The internal test cohort comprised 447 patients [57% male; age at cancer diagnosis 11.2 (5.4-15.7) years; 1553 ECG-echo pairs at median age 13.5 (IQR 7.7-17.9) years; 6.4% with LVEF ≤50%; 1.3% with LVEF ≤40%], 28% with leukaemia, 16% with lymphoma, 8% with neuroblastoma, 8% with sarcoma, 2% with gastrointestinal cancers, 3% with genitourinary cancers, 6% with central nervous system cancers, 11% with other/unspecified cancers, and 18% with missing/unknown cancer labels. Treatment strategies included anthracyclines (35%), bone marrow transplant (7%), and radiation (1%). The external test cohort comprised 2964 patients [55.4% male; 7054 ECG-echo pairs at median age 11.6 (IQR 6.8-15.1) years; 2.5% with LVEF ≤50%; 0.9% with LVEF ≤40%]. Similar AUROCs (0.80-0.85), sensitivities (0.75-0.82), NPVs (0.986-0.996), and percent predicted negative (51-65%) were obtained across institutions to predict LVEF ≤50%, outperforming a biomarker-based model benchmark. Patients with high AI-ECG risk scores for LVEF ≤50% had higher rates of mortality [hazard ratio 3.1 (95% CI 1.8-5.3), P < 0.001] compared to patients with low AI-ECG risk scores.
Conclusion: AI-ECG shows promise as a digital biomarker for cardiac surveillance in the vulnerable childhood cancer survivor population.
目的:评估人工智能心电图(AI-ECG)作为长期心脏监测工具和预测儿童癌症患者左心室收缩功能障碍的潜力。方法和结果:我们在内部测试(波士顿儿童医院)和外部验证(费城儿童医院)期间评估了我们之前建立的AI-ECG模型的性能,以预测儿童癌症患者的左室射血分数(LVEF)≤50%和≤40%。内测队列包括447例患者[57%为男性;癌症诊断年龄11.2(5.4-15.7)岁;1553对,中位年龄13.5 (IQR 7.7 ~ 17.9)岁;LVEF≤50%的占6.4%;1.3%为LVEF≤40%],28%为白血病,16%为淋巴瘤,8%为神经母细胞瘤,8%为肉瘤,2%为胃肠道癌,3%为泌尿生殖系统癌,6%为中枢神经系统癌,11%为其他/不明癌症,18%为缺失/未知癌症标签。治疗策略包括蒽环类药物(35%)、骨髓移植(7%)和放疗(1%)。外部测试队列包括2964例患者,其中男性55.4%;7054对,中位年龄11.6 (IQR 6.8-15.1)岁;2.5%, LVEF≤50%;0.9%, LVEF≤40%]。在预测LVEF≤50%时,各机构的auroc(0.80-0.85)、灵敏度(0.75-0.82)、npv(0.986-0.996)和预测阴性百分比(51-65%)相似,优于基于生物标志物的模型基准。LVEF≤50%的高AI-ECG风险评分患者的死亡率高于低AI-ECG风险评分患者[风险比3.1 (95% CI 1.8-5.3), P < 0.001]。结论:AI-ECG有望作为易受伤害的儿童癌症幸存者人群心脏监测的数字生物标志物。
{"title":"Cardiac surveillance of childhood cancer using artificial intelligence-enabled electrocardiograms.","authors":"Ivor B Asztalos, Amy Li, Victoria L Vetter, John K Triedman, Joshua Mayourian","doi":"10.1093/ehjdh/ztaf117","DOIUrl":"10.1093/ehjdh/ztaf117","url":null,"abstract":"<p><strong>Aims: </strong>To assess the potential for artificial intelligence-enabled electrocardiogram (AI-ECG) to serve as a long-term cardiac surveillance tool and predict left ventricular systolic dysfunction in childhood cancer patients.</p><p><strong>Methods and results: </strong>We assessed performance of our previously established AI-ECG model to predict left ventricular ejection fraction (LVEF) ≤50% and ≤40% in patients with childhood cancer during internal testing (Boston Children's Hospital) and external validation (Children's Hospital of Philadelphia). The internal test cohort comprised 447 patients [57% male; age at cancer diagnosis 11.2 (5.4-15.7) years; 1553 ECG-echo pairs at median age 13.5 (IQR 7.7-17.9) years; 6.4% with LVEF ≤50%; 1.3% with LVEF ≤40%], 28% with leukaemia, 16% with lymphoma, 8% with neuroblastoma, 8% with sarcoma, 2% with gastrointestinal cancers, 3% with genitourinary cancers, 6% with central nervous system cancers, 11% with other/unspecified cancers, and 18% with missing/unknown cancer labels. Treatment strategies included anthracyclines (35%), bone marrow transplant (7%), and radiation (1%). The external test cohort comprised 2964 patients [55.4% male; 7054 ECG-echo pairs at median age 11.6 (IQR 6.8-15.1) years; 2.5% with LVEF ≤50%; 0.9% with LVEF ≤40%]. Similar AUROCs (0.80-0.85), sensitivities (0.75-0.82), NPVs (0.986-0.996), and percent predicted negative (51-65%) were obtained across institutions to predict LVEF ≤50%, outperforming a biomarker-based model benchmark. Patients with high AI-ECG risk scores for LVEF ≤50% had higher rates of mortality [hazard ratio 3.1 (95% CI 1.8-5.3), <i>P</i> < 0.001] compared to patients with low AI-ECG risk scores.</p><p><strong>Conclusion: </strong>AI-ECG shows promise as a digital biomarker for cardiac surveillance in the vulnerable childhood cancer survivor population.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"6 6","pages":"1293-1296"},"PeriodicalIF":4.4,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12629641/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-06eCollection Date: 2025-11-01DOI: 10.1093/ehjdh/ztaf106
Manuel Sigle, Diana Heurich, Wenke Faller, Meinrad Gawaz, Karin Anne Lydia Mueller, Andreas Goldschmied
Aims: Overdiagnosis in patients suspected of acute coronary syndrome (ACS) leads to unnecessary coronary angiographies, particularly in cases with non-specifically elevated troponin (Trop) levels. We established machine learning (ML) models integrating sequentially available prehospital and in-hospital variables to improve early prediction of the need for coronary re while minimizing overdiagnosis.
Methods and results: Retrospective cohort study analysing patients with suspected ACS from 2016 to 2020. Machine learning models were trained using data available at different diagnostic time points, including prehospital assessment, arterial blood gas analysis, full laboratory results, and sequential Trop measurements. A total of 2756 patients were included, identified through emergency physician protocols for ACS-related complaints. Patients with incomplete data or prehospital mortality were excluded.Model performance improved with additional diagnostic data. Model 1 (prehospital data only) achieved an area under the receiver operating characteristic (AUROC) of 0.76 (95% confidence interval [CI] 0.72-0.79), while Model 4 (including sequential Trop testing) reached 0.87 (95% CI 0.83-0.91). Adding early hospital diagnostics (Model 2) significantly improved accuracy compared with Model 1 (0.65 vs. 0.78). Sequential Trop testing in Model 4 did not substantially enhance performance compared with single Trop testing in Model 3 (AUROC 0.87, 95% CI 0.83-0.91 vs. 0.86, 95% CI 0.82-0.91). Misclassification analysis revealed that underdiagnosed patients were typically older females with dyspnoea and known coronary artery disease but no ST-elevations. Overdiagnosed patients had higher body mass index, ST-elevations, regional wall motion abnormalities, and impaired left ventricular ejection fraction but lacked significant sequential Trop elevation.
Conclusion: Prehospital assessments combined with early in-hospital diagnostics provide reliable stratification of coronary intervention need, potentially optimizing clinical decision-making and resource utilization.
目的:疑似急性冠脉综合征(ACS)患者的过度诊断导致不必要的冠状动脉造影,特别是在肌钙蛋白(Trop)水平非特异性升高的病例中。我们建立了机器学习(ML)模型,整合依次可用的院前和院内变量,以提高对冠状动脉再灌注需求的早期预测,同时最大限度地减少过度诊断。方法与结果:回顾性队列研究,分析2016 - 2020年疑似ACS患者。机器学习模型使用不同诊断时间点的可用数据进行训练,包括院前评估、动脉血气分析、完整的实验室结果和连续的Trop测量。共纳入2756例患者,通过acs相关投诉的急诊医师协议确定。排除资料不完整或院前死亡率的患者。使用额外的诊断数据可以提高模型性能。模型1(仅院前数据)的受试者工作特征(AUROC)下面积为0.76(95%可信区间[CI] 0.72-0.79),而模型4(包括序贯Trop检验)达到0.87 (95% CI 0.83-0.91)。与模型1相比,加入早期医院诊断(模型2)显著提高了准确性(0.65 vs. 0.78)。与模型3中的单一Trop检验相比,模型4中的顺序Trop检验并没有显著提高性能(AUROC为0.87,95% CI 0.83-0.91比0.86,95% CI 0.82-0.91)。错误分类分析显示,未被诊断的患者通常是有呼吸困难和已知冠状动脉疾病但无st段抬高的老年女性。过度诊断的患者有较高的身体质量指数、st段抬高、局部壁运动异常和左室射血分数受损,但缺乏显著的连续Trop升高。结论:院前评估结合早期住院诊断可提供可靠的冠状动脉介入治疗需求分层,有可能优化临床决策和资源利用。
{"title":"AI-guided refinement of coronary revascularization need in patients suspected of acute coronary syndrome.","authors":"Manuel Sigle, Diana Heurich, Wenke Faller, Meinrad Gawaz, Karin Anne Lydia Mueller, Andreas Goldschmied","doi":"10.1093/ehjdh/ztaf106","DOIUrl":"10.1093/ehjdh/ztaf106","url":null,"abstract":"<p><strong>Aims: </strong>Overdiagnosis in patients suspected of acute coronary syndrome (ACS) leads to unnecessary coronary angiographies, particularly in cases with non-specifically elevated troponin (Trop) levels. We established machine learning (ML) models integrating sequentially available prehospital and in-hospital variables to improve early prediction of the need for coronary re while minimizing overdiagnosis.</p><p><strong>Methods and results: </strong>Retrospective cohort study analysing patients with suspected ACS from 2016 to 2020. Machine learning models were trained using data available at different diagnostic time points, including prehospital assessment, arterial blood gas analysis, full laboratory results, and sequential Trop measurements. A total of 2756 patients were included, identified through emergency physician protocols for ACS-related complaints. Patients with incomplete data or prehospital mortality were excluded.Model performance improved with additional diagnostic data. Model 1 (prehospital data only) achieved an area under the receiver operating characteristic (AUROC) of 0.76 (95% confidence interval [CI] 0.72-0.79), while Model 4 (including sequential Trop testing) reached 0.87 (95% CI 0.83-0.91). Adding early hospital diagnostics (Model 2) significantly improved accuracy compared with Model 1 (0.65 vs. 0.78). Sequential Trop testing in Model 4 did not substantially enhance performance compared with single Trop testing in Model 3 (AUROC 0.87, 95% CI 0.83-0.91 vs. 0.86, 95% CI 0.82-0.91). Misclassification analysis revealed that underdiagnosed patients were typically older females with dyspnoea and known coronary artery disease but no ST-elevations. Overdiagnosed patients had higher body mass index, ST-elevations, regional wall motion abnormalities, and impaired left ventricular ejection fraction but lacked significant sequential Trop elevation.</p><p><strong>Conclusion: </strong>Prehospital assessments combined with early in-hospital diagnostics provide reliable stratification of coronary intervention need, potentially optimizing clinical decision-making and resource utilization.</p>","PeriodicalId":72965,"journal":{"name":"European heart journal. Digital health","volume":"6 6","pages":"1169-1180"},"PeriodicalIF":4.4,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12629660/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}