Automatic Size And Pose Homogenization With Spatial Transformer Network To Improve And Accelerate Pediatric Segmentation

Giammarco La Barbera, P. Gori, Haithem Boussaid, Bruno Belucci, A. Delmonte, Jeanne Goulin, S. Sarnacki, L. Rouet, I. Bloch
{"title":"Automatic Size And Pose Homogenization With Spatial Transformer Network To Improve And Accelerate Pediatric Segmentation","authors":"Giammarco La Barbera, P. Gori, Haithem Boussaid, Bruno Belucci, A. Delmonte, Jeanne Goulin, S. Sarnacki, L. Rouet, I. Bloch","doi":"10.1109/ISBI48211.2021.9434090","DOIUrl":null,"url":null,"abstract":"Due to a high heterogeneity in pose and size and to a limited number of available data, segmentation of pediatric images is challenging for deep learning methods. In this work, we propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN). Our architecture is composed of three sequential modules that are estimated together during training: (i) a regression module to estimate a similarity matrix to normalize the input image to a reference one; (ii) a differentiable module to find the region of interest to segment; (iii) a segmentation module, based on the popular UNet architecture, to delineate the object. Unlike the original UNet, which strives to learn a complex mapping, including pose and scale variations, from a finite training dataset, our segmentation module learns a simpler mapping focusing on images with normalized pose and size. Furthermore, the use of an automatic bounding box detection through STN allows saving time and especially memory, while keeping similar performance. We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners. Results indicate that the estimated STN homogenization of size and pose accelerates the segmentation (25h), compared to standard data-augmentation (33h), while obtaining a similar quality for the kidney (88.01% of Dice score) and improving the renal tumor delineation (from 85.52% to 87.12%).","PeriodicalId":372939,"journal":{"name":"2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISBI48211.2021.9434090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Due to a high heterogeneity in pose and size and to a limited number of available data, segmentation of pediatric images is challenging for deep learning methods. In this work, we propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN). Our architecture is composed of three sequential modules that are estimated together during training: (i) a regression module to estimate a similarity matrix to normalize the input image to a reference one; (ii) a differentiable module to find the region of interest to segment; (iii) a segmentation module, based on the popular UNet architecture, to delineate the object. Unlike the original UNet, which strives to learn a complex mapping, including pose and scale variations, from a finite training dataset, our segmentation module learns a simpler mapping focusing on images with normalized pose and size. Furthermore, the use of an automatic bounding box detection through STN allows saving time and especially memory, while keeping similar performance. We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners. Results indicate that the estimated STN homogenization of size and pose accelerates the segmentation (25h), compared to standard data-augmentation (33h), while obtaining a similar quality for the kidney (88.01% of Dice score) and improving the renal tumor delineation (from 85.52% to 87.12%).
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于空间变形网络的自动尺寸和位姿均匀化改进和加速儿童分割
由于姿态和大小的高度异质性以及可用数据数量有限,儿科图像的分割对于深度学习方法来说是具有挑战性的。在这项工作中,我们提出了一种新的CNN架构,由于使用了空间变压器网络(STN),它是位姿和尺度不变的。我们的架构由三个顺序模块组成,它们在训练过程中一起进行估计:(i)一个回归模块,用于估计相似矩阵,将输入图像归一化为参考图像;(ii)一个可微模块,用于寻找要分割的感兴趣区域;(iii)基于流行的UNet架构的分割模块,用于描绘对象。与最初的UNet不同,UNet努力从有限的训练数据集中学习复杂的映射,包括姿势和比例变化,我们的分割模块学习更简单的映射,专注于具有标准化姿势和大小的图像。此外,通过STN使用自动边界框检测可以节省时间,特别是内存,同时保持类似的性能。我们在腹部儿童CT扫描仪上测试了该方法在肾脏和肾脏肿瘤分割中的应用。结果表明,与标准数据增强(33小时)相比,估计的大小和姿态的STN均匀化加速了分割(25小时),同时对肾脏获得了相似的质量(Dice评分的88.01%),并改善了肾脏肿瘤的描绘(从85.52%提高到87.12%)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Enhanced-Quality Gan (EQ-GAN) on Lung CT Scans: Toward Truth and Potential Hallucinations Ghost-Light-3dnet: Efficient Network For Heart Segmentation Landmark Constellation Models For Central Venous Catheter Malposition Detection Biventricular Surface Reconstruction From Cine Mri Contours Using Point Completion Networks Multi-channel Sparse Graph Transformer Network for Early Alzheimer’s Disease Identification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1