Gradient Boosted Trees for Corrective Learning.

Machine learning in medical imaging. MLMI (Workshop) Pub Date : 2017-09-01 Epub Date: 2017-09-07 DOI:10.1007/978-3-319-67389-9_24

Baris U Oguz, Russell T Shinohara, Paul A Yushkevich, Ipek Oguz

{"title":"Gradient Boosted Trees for Corrective Learning.","authors":"Baris U Oguz, Russell T Shinohara, Paul A Yushkevich, Ipek Oguz","doi":"10.1007/978-3-319-67389-9_24","DOIUrl":null,"url":null,"abstract":"Random forests (RF) have long been a widely popular method in medical image analysis. Meanwhile, the closely related gradient boosted trees (GBT) have not become a mainstream tool in medical imaging despite their attractive performance, perhaps due to their computational cost. In this paper, we leverage the recent availability of an efficient open-source GBT implementation to illustrate the GBT method in a corrective learning framework, in application to the segmentation of the caudate nucleus, putamen and hippocampus. The size and shape of these structures are used to derive important biomarkers in many neurological and psychiatric conditions. However, the large variability in deep gray matter appearance makes their automated segmentation from MRI scans a challenging task. We propose using GBT to improve existing segmentation methods. We begin with an existing 'host' segmentation method to create an estimate surface. Based on this estimate, a surface-based sampling scheme is used to construct a set of candidate locations. GBT models are trained on features derived from the candidate locations, including spatial coordinates, image intensity, texture, and gradient magnitude. The classification probabilities from the GBT models are used to calculate a final surface estimate. The method is evaluated on a public dataset, with a 2-fold cross-validation. We use a multi-atlas approach and FreeSurfer as host segmentation methods. The mean reduction in surface distance error metric for FreeSurfer was 0.2 - 0.3 mm, whereas for multi-atlas segmentation, it was 0.1mm for each of caudate, putamen and hippocampus. Importantly, our approach outperformed an RF model trained on the same features (p < 0.05 on all measures). Our method is readily generalizable and can be applied to a wide range of medical image segmentation problems and allows any segmentation method to be used as input.","PeriodicalId":74092,"journal":{"name":"Machine learning in medical imaging. MLMI (Workshop)","volume":"10541 ","pages":"203-211"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/978-3-319-67389-9_24","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning in medical imaging. MLMI (Workshop)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-319-67389-9_24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2017/9/7 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Random forests (RF) have long been a widely popular method in medical image analysis. Meanwhile, the closely related gradient boosted trees (GBT) have not become a mainstream tool in medical imaging despite their attractive performance, perhaps due to their computational cost. In this paper, we leverage the recent availability of an efficient open-source GBT implementation to illustrate the GBT method in a corrective learning framework, in application to the segmentation of the caudate nucleus, putamen and hippocampus. The size and shape of these structures are used to derive important biomarkers in many neurological and psychiatric conditions. However, the large variability in deep gray matter appearance makes their automated segmentation from MRI scans a challenging task. We propose using GBT to improve existing segmentation methods. We begin with an existing 'host' segmentation method to create an estimate surface. Based on this estimate, a surface-based sampling scheme is used to construct a set of candidate locations. GBT models are trained on features derived from the candidate locations, including spatial coordinates, image intensity, texture, and gradient magnitude. The classification probabilities from the GBT models are used to calculate a final surface estimate. The method is evaluated on a public dataset, with a 2-fold cross-validation. We use a multi-atlas approach and FreeSurfer as host segmentation methods. The mean reduction in surface distance error metric for FreeSurfer was 0.2 - 0.3 mm, whereas for multi-atlas segmentation, it was 0.1mm for each of caudate, putamen and hippocampus. Importantly, our approach outperformed an RF model trained on the same features (p < 0.05 on all measures). Our method is readily generalizable and can be applied to a wide range of medical image segmentation problems and allows any segmentation method to be used as input.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于纠正学习的梯度提升树。

随机森林（RF）长期以来一直是医学图像分析中广泛流行的方法。同时，密切相关的梯度增强树（GBT）尽管具有诱人的性能，但可能由于其计算成本，尚未成为医学成像的主流工具。在本文中，我们利用最近有效的开源GBT实现来说明纠正学习框架中的GBT方法，该方法应用于尾状核、壳核和海马的分割。这些结构的大小和形状被用来推导许多神经和精神疾病中的重要生物标志物。然而，深灰质外观的巨大可变性使其从MRI扫描中自动分割成为一项具有挑战性的任务。我们建议使用GBT来改进现有的分割方法。我们从现有的“宿主”分割方法开始创建估计曲面。基于该估计，使用基于表面的采样方案来构建一组候选位置。GBT模型基于从候选位置导出的特征进行训练，包括空间坐标、图像强度、纹理和梯度大小。来自GBT模型的分类概率用于计算最终表面估计。该方法在公共数据集上进行了评估，并进行了2次交叉验证。我们使用多图谱方法和FreeSurfer作为主机分割方法。FreeSurfer的表面距离误差度量的平均减少量为0.2-0.3mm，而对于多图谱分割，尾状核、壳核和海马体的平均减少度为0.1mm。重要的是，我们的方法优于在相同特征上训练的RF模型（所有测量值均<0.05）。我们的方法很容易推广，可以应用于广泛的医学图像分割问题，并允许使用任何分割方法作为输入。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Machine learning in medical imaging. MLMI (Workshop)

自引率

0.00%

发文量