Extended Technical and Clinical Validation of Deep Learning-Based Brainstem Segmentation for Application in Neurodegenerative Diseases

IF 3.3 2区医学 Q1 NEUROIMAGING Human Brain Mapping Pub Date : 2025-02-12 DOI:10.1002/hbm.70141

Benno Gesierich, Laura Sander, Lukas Pirpamer, Dominik S. Meier, Esther Ruberte, Michael Amann, Tim Sinnecker, Antal Huck, Frank-Erik de Leeuw, Pauline Maillard, Sue Moy, Karl G. Helmer, MarkVCID Consortium, Johannes Levin, Günter U. Höglinger, PROMESA Study Group, Michael Kühne, Leo H. Bonati, Jens Kuhle, Philippe Cattin, Cristina Granziera, Regina Schlaeger, Marco Duering

{"title":"Extended Technical and Clinical Validation of Deep Learning-Based Brainstem Segmentation for Application in Neurodegenerative Diseases","authors":"Benno Gesierich, Laura Sander, Lukas Pirpamer, Dominik S. Meier, Esther Ruberte, Michael Amann, Tim Sinnecker, Antal Huck, Frank-Erik de Leeuw, Pauline Maillard, Sue Moy, Karl G. Helmer, MarkVCID Consortium, Johannes Levin, Günter U. Höglinger, PROMESA Study Group, Michael Kühne, Leo H. Bonati, Jens Kuhle, Philippe Cattin, Cristina Granziera, Regina Schlaeger, Marco Duering","doi":"10.1002/hbm.70141","DOIUrl":null,"url":null,"abstract":"Disorders of the central nervous system, including neurodegenerative diseases, frequently affect the brainstem and can present with focal atrophy. This study aimed to (1) optimize deep learning-based brainstem segmentation for a wide range of pathologies and T1-weighted image acquisition parameters, (2) conduct a systematic technical and clinical validation, (3) improve segmentation quality in the presence of brainstem lesions, and (4) make an optimized brainstem segmentation tool available for public use. An intentionally heterogeneous ground truth dataset (n = 257) was employed in the training of deep learning models based on multi-dimensional gated recurrent units (MD-GRU) or the nnU-Net method. Segmentation performance was evaluated against ground truth labels. FreeSurfer was used for benchmarking in subsequent validation. Technical validation, including scan-rescan repeatability (n = 46) and inter-scanner reproducibility (n = 20, 3 different scanners) in unseen data, was conducted in patients with cerebral small vessel disease. Clinical validation in unseen data was performed in 1-year follow-up data of 16 patients with multiple system atrophy, evaluating the annual percentage volume change. Two lesion filling algorithms were investigated to improve segmentation performance in 23 patients with multiple sclerosis. The MD-GRU and nnU-Net models demonstrated very good segmentation performance (median Dice coefficients ≥ 0.95 each) and outperformed a previously published model trained on a narrower dataset. Scan–rescan repeatability and inter-scanner reproducibility yielded similar Bland–Altman derived limits of agreement for longitudinal FreeSurfer (total brainstem volume repeatability/reproducibility 0.68/1.85), MD-GRU (0.72/1.46), and nnU-Net (0.48/1.52). All methods showed comparable performance in the detection of atrophy in the total brainstem (atrophy detected in 100% of patients) and its substructures. In patients with multiple sclerosis, lesion filling further improved the accuracy of brainstem segmentation. We enhanced and systematically validated two fully automated deep learning brainstem segmentation methods and released them publicly. This enables a broader evaluation of brainstem volume as a candidate biomarker for neurodegeneration.","PeriodicalId":13019,"journal":{"name":"Human Brain Mapping","volume":"46 3","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/hbm.70141","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Brain Mapping","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/hbm.70141","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROIMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Disorders of the central nervous system, including neurodegenerative diseases, frequently affect the brainstem and can present with focal atrophy. This study aimed to (1) optimize deep learning-based brainstem segmentation for a wide range of pathologies and T1-weighted image acquisition parameters, (2) conduct a systematic technical and clinical validation, (3) improve segmentation quality in the presence of brainstem lesions, and (4) make an optimized brainstem segmentation tool available for public use. An intentionally heterogeneous ground truth dataset (n = 257) was employed in the training of deep learning models based on multi-dimensional gated recurrent units (MD-GRU) or the nnU-Net method. Segmentation performance was evaluated against ground truth labels. FreeSurfer was used for benchmarking in subsequent validation. Technical validation, including scan-rescan repeatability (n = 46) and inter-scanner reproducibility (n = 20, 3 different scanners) in unseen data, was conducted in patients with cerebral small vessel disease. Clinical validation in unseen data was performed in 1-year follow-up data of 16 patients with multiple system atrophy, evaluating the annual percentage volume change. Two lesion filling algorithms were investigated to improve segmentation performance in 23 patients with multiple sclerosis. The MD-GRU and nnU-Net models demonstrated very good segmentation performance (median Dice coefficients ≥ 0.95 each) and outperformed a previously published model trained on a narrower dataset. Scan–rescan repeatability and inter-scanner reproducibility yielded similar Bland–Altman derived limits of agreement for longitudinal FreeSurfer (total brainstem volume repeatability/reproducibility 0.68/1.85), MD-GRU (0.72/1.46), and nnU-Net (0.48/1.52). All methods showed comparable performance in the detection of atrophy in the total brainstem (atrophy detected in 100% of patients) and its substructures. In patients with multiple sclerosis, lesion filling further improved the accuracy of brainstem segmentation. We enhanced and systematically validated two fully automated deep learning brainstem segmentation methods and released them publicly. This enables a broader evaluation of brainstem volume as a candidate biomarker for neurodegeneration.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度学习的脑干分割在神经退行性疾病应用的扩展技术和临床验证

中枢神经系统疾病，包括神经退行性疾病，经常影响脑干，并可表现为局灶性萎缩。本研究旨在(1)优化基于深度学习的脑干分割，以适应广泛的病理和t1加权图像采集参数，(2)进行系统的技术和临床验证，(3)提高脑干病变存在时的分割质量，(4)使优化后的脑干分割工具可供公众使用。在基于多维门控循环单元（MD-GRU）或nnU-Net方法的深度学习模型训练中，使用了一个有意异构的地面真值数据集（n = 257）。根据真值标签对分割性能进行评估。在随后的验证中使用FreeSurfer进行基准测试。技术验证，包括未见数据的扫描-扫描重复性（n = 46）和扫描仪间重复性（n = 20, 3台不同的扫描仪），在脑血管疾病患者中进行。对16例多系统萎缩患者1年随访数据进行未见数据的临床验证，评估年体积变化百分比。为了提高23例多发性硬化症患者的分割效果，研究了两种病灶填充算法。MD-GRU和nnU-Net模型显示出非常好的分割性能（中位数Dice系数均≥0.95），并且优于先前发表的在较窄数据集上训练的模型。纵向FreeSurfer（总脑干体积重复性/重复性0.68/1.85）、MD-GRU（0.72/1.46）和nnU-Net（0.48/1.52）的扫描-扫描重复性和扫描间重复性得出了相似的Bland-Altman导出的一致极限。所有方法在检测全脑干萎缩（100%的患者检测到萎缩）及其亚结构方面表现出相当的性能。在多发性硬化症患者中，病变填充进一步提高了脑干分割的准确性。我们对两种全自动深度学习脑干分割方法进行了增强和系统验证，并公开发布。这使得脑干体积作为神经变性的候选生物标志物的评估更加广泛。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Human Brain Mapping 医学-核医学

CiteScore

8.30

自引率

6.20%

发文量

401

审稿时长

3-6 weeks

期刊介绍： Human Brain Mapping publishes peer-reviewed basic, clinical, technical, and theoretical research in the interdisciplinary and rapidly expanding field of human brain mapping. The journal features research derived from non-invasive brain imaging modalities used to explore the spatial and temporal organization of the neural systems supporting human behavior. Imaging modalities of interest include positron emission tomography, event-related potentials, electro-and magnetoencephalography, magnetic resonance imaging, and single-photon emission tomography. Brain mapping research in both normal and clinical populations is encouraged. Article formats include Research Articles, Review Articles, Clinical Case Studies, and Technique, as well as Technological Developments, Theoretical Articles, and Synthetic Reviews. Technical advances, such as novel brain imaging methods, analyses for detecting or localizing neural activity, synergistic uses of multiple imaging modalities, and strategies for the design of behavioral paradigms and neural-systems modeling are of particular interest. The journal endorses the propagation of methodological standards and encourages database development in the field of human brain mapping.