Automatic Size And Pose Homogenization With Spatial Transformer Network To Improve And Accelerate Pediatric Segmentation

2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) Pub Date : 2021-04-13 DOI:10.1109/ISBI48211.2021.9434090

Giammarco La Barbera, P. Gori, Haithem Boussaid, Bruno Belucci, A. Delmonte, Jeanne Goulin, S. Sarnacki, L. Rouet, I. Bloch

{"title":"Automatic Size And Pose Homogenization With Spatial Transformer Network To Improve And Accelerate Pediatric Segmentation","authors":"Giammarco La Barbera, P. Gori, Haithem Boussaid, Bruno Belucci, A. Delmonte, Jeanne Goulin, S. Sarnacki, L. Rouet, I. Bloch","doi":"10.1109/ISBI48211.2021.9434090","DOIUrl":null,"url":null,"abstract":"Due to a high heterogeneity in pose and size and to a limited number of available data, segmentation of pediatric images is challenging for deep learning methods. In this work, we propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN). Our architecture is composed of three sequential modules that are estimated together during training: (i) a regression module to estimate a similarity matrix to normalize the input image to a reference one; (ii) a differentiable module to find the region of interest to segment; (iii) a segmentation module, based on the popular UNet architecture, to delineate the object. Unlike the original UNet, which strives to learn a complex mapping, including pose and scale variations, from a finite training dataset, our segmentation module learns a simpler mapping focusing on images with normalized pose and size. Furthermore, the use of an automatic bounding box detection through STN allows saving time and especially memory, while keeping similar performance. We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners. Results indicate that the estimated STN homogenization of size and pose accelerates the segmentation (25h), compared to standard data-augmentation (33h), while obtaining a similar quality for the kidney (88.01% of Dice score) and improving the renal tumor delineation (from 85.52% to 87.12%).","PeriodicalId":372939,"journal":{"name":"2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISBI48211.2021.9434090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Due to a high heterogeneity in pose and size and to a limited number of available data, segmentation of pediatric images is challenging for deep learning methods. In this work, we propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN). Our architecture is composed of three sequential modules that are estimated together during training: (i) a regression module to estimate a similarity matrix to normalize the input image to a reference one; (ii) a differentiable module to find the region of interest to segment; (iii) a segmentation module, based on the popular UNet architecture, to delineate the object. Unlike the original UNet, which strives to learn a complex mapping, including pose and scale variations, from a finite training dataset, our segmentation module learns a simpler mapping focusing on images with normalized pose and size. Furthermore, the use of an automatic bounding box detection through STN allows saving time and especially memory, while keeping similar performance. We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners. Results indicate that the estimated STN homogenization of size and pose accelerates the segmentation (25h), compared to standard data-augmentation (33h), while obtaining a similar quality for the kidney (88.01% of Dice score) and improving the renal tumor delineation (from 85.52% to 87.12%).

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于空间变形网络的自动尺寸和位姿均匀化改进和加速儿童分割

由于姿态和大小的高度异质性以及可用数据数量有限，儿科图像的分割对于深度学习方法来说是具有挑战性的。在这项工作中，我们提出了一种新的CNN架构，由于使用了空间变压器网络(STN)，它是位姿和尺度不变的。我们的架构由三个顺序模块组成，它们在训练过程中一起进行估计:(i)一个回归模块，用于估计相似矩阵，将输入图像归一化为参考图像;(ii)一个可微模块，用于寻找要分割的感兴趣区域;(iii)基于流行的UNet架构的分割模块，用于描绘对象。与最初的UNet不同，UNet努力从有限的训练数据集中学习复杂的映射，包括姿势和比例变化，我们的分割模块学习更简单的映射，专注于具有标准化姿势和大小的图像。此外，通过STN使用自动边界框检测可以节省时间，特别是内存，同时保持类似的性能。我们在腹部儿童CT扫描仪上测试了该方法在肾脏和肾脏肿瘤分割中的应用。结果表明，与标准数据增强(33小时)相比，估计的大小和姿态的STN均匀化加速了分割(25小时)，同时对肾脏获得了相似的质量(Dice评分的88.01%)，并改善了肾脏肿瘤的描绘(从85.52%提高到87.12%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)

自引率

0.00%

发文量