Soft-Split Random Forest for Anatomy Labeling.

Machine learning in medical imaging. MLMI (Workshop) Pub Date : 2015-10-01 Epub Date: 2015-10-02 DOI:10.1007/978-3-319-24888-2_3

Guangkai Ma, Yaozong Gao, Li Wang, Ligang Wu, Dinggang Shen

{"title":"Soft-Split Random Forest for Anatomy Labeling.","authors":"Guangkai Ma, Yaozong Gao, Li Wang, Ligang Wu, Dinggang Shen","doi":"10.1007/978-3-319-24888-2_3","DOIUrl":null,"url":null,"abstract":"Random Forest (RF) has been widely used in the learning-based labeling. In RF, each sample is directed from the root to each leaf based on the decisions made in the interior nodes, also called splitting nodes. The splitting nodes assign a testing sample to either left or right child based on the learned splitting function. The final prediction is determined as the average of label probability distributions stored in all arrived leaf nodes. For ambiguous testing samples, which often lie near the splitting boundaries, the conventional splitting function, also referred to as hard split function, tends to make wrong assignments, hence leading to wrong predictions. To overcome this limitation, we propose a novel soft-split random forest (SSRF) framework to improve the reliability of node splitting and finally the accuracy of classification. Specifically, a soft split function is employed to assign a testing sample into both left and right child nodes with their certain probabilities, which can effectively reduce influence of the wrong node assignment on the prediction accuracy. As a result, each testing sample can arrive at multiple leaf nodes, and their respective results can be fused to obtain the final prediction according to the weights accumulated along the path from the root node to each leaf node. Besides, considering the importance of context information, we also adopt a Haar-features based context model to iteratively refine the classification map. We have comprehensively evaluated our method on two public datasets, respectively, for labeling hippocampus in MR images and also labeling three organs in Head & Neck CT images. Compared with the hard-split RF (HSRF), our method achieved a notable improvement in labeling accuracy.","PeriodicalId":74092,"journal":{"name":"Machine learning in medical imaging. MLMI (Workshop)","volume":"9352 ","pages":"17-25"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6261352/pdf/nihms963645.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning in medical imaging. MLMI (Workshop)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-319-24888-2_3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2015/10/2 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Random Forest (RF) has been widely used in the learning-based labeling. In RF, each sample is directed from the root to each leaf based on the decisions made in the interior nodes, also called splitting nodes. The splitting nodes assign a testing sample to either left or right child based on the learned splitting function. The final prediction is determined as the average of label probability distributions stored in all arrived leaf nodes. For ambiguous testing samples, which often lie near the splitting boundaries, the conventional splitting function, also referred to as hard split function, tends to make wrong assignments, hence leading to wrong predictions. To overcome this limitation, we propose a novel soft-split random forest (SSRF) framework to improve the reliability of node splitting and finally the accuracy of classification. Specifically, a soft split function is employed to assign a testing sample into both left and right child nodes with their certain probabilities, which can effectively reduce influence of the wrong node assignment on the prediction accuracy. As a result, each testing sample can arrive at multiple leaf nodes, and their respective results can be fused to obtain the final prediction according to the weights accumulated along the path from the root node to each leaf node. Besides, considering the importance of context information, we also adopt a Haar-features based context model to iteratively refine the classification map. We have comprehensively evaluated our method on two public datasets, respectively, for labeling hippocampus in MR images and also labeling three organs in Head & Neck CT images. Compared with the hard-split RF (HSRF), our method achieved a notable improvement in labeling accuracy.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于解剖标记的软分裂随机森林

随机森林（RF）已广泛应用于基于学习的标注。在 RF 中，每个样本都是根据内部节点（也称为分割节点）的决定从根指向每片叶子的。分割节点根据学习到的分割函数将测试样本分配给左侧或右侧子节点。最终的预测结果是存储在所有到达的叶节点中的标签概率分布的平均值。对于模棱两可的测试样本（通常位于拆分边界附近），传统的拆分函数（也称为硬拆分函数）往往会做出错误的分配，从而导致错误的预测。为了克服这一局限，我们提出了一种新颖的软拆分随机森林（SSRF）框架，以提高节点拆分的可靠性和分类的准确性。具体来说，采用软拆分函数将测试样本以一定的概率分配到左右两个子节点中，从而有效降低错误节点分配对预测准确性的影响。因此，每个测试样本可以到达多个叶子节点，并根据从根节点到每个叶子节点的路径所积累的权重，将它们各自的结果进行融合，从而得到最终的预测结果。此外，考虑到上下文信息的重要性，我们还采用了基于 Haar 特征的上下文模型来迭代完善分类图。我们在两个公开数据集上对我们的方法进行了全面评估，这两个数据集分别用于标记 MR 图像中的海马和头颈部 CT 图像中的三个器官。与硬分割射频（HSRF）相比，我们的方法显著提高了标注准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Machine learning in medical imaging. MLMI (Workshop)

自引率

0.00%

发文量