OCT Image Segmentation Using Neural Architecture Search and SRGAN

2020 25th International Conference on Pattern Recognition (ICPR) Pub Date : 2021-01-10 DOI:10.1109/ICPR48806.2021.9412818

O. Dehzangi, Saba Heidari Gheshlaghi, Annahita Amireskandari, N. Nasrabadi, A. Rezai

{"title":"OCT Image Segmentation Using Neural Architecture Search and SRGAN","authors":"O. Dehzangi, Saba Heidari Gheshlaghi, Annahita Amireskandari, N. Nasrabadi, A. Rezai","doi":"10.1109/ICPR48806.2021.9412818","DOIUrl":null,"url":null,"abstract":"Medical image segmentation is a critical field in the domain of computer vision and with the growing acclaim of deep learning based models, research in this field is constantly expanding. Optical coherence tomography (OCT) is a non-invasive method that scans the human's retina with depth. It has been hypothesized that the thickness of the retinal layers extracted from OCTs could be an efficient and effective biomarker for early diagnosis of AD. In this work, we aim to design a self-training model architecture for the task of segmenting the retinal layers in OCT scans. Neural architecture search (NAS) is a subfield of AutoML domain, which has a significant impact on improving the accuracy of machine vision tasks. We integrate the NAS algorithm with a Unet auto-encoder architecture as its backbone. Then, we employ our proposed model to segment the retinal nerve fiber layer in our preprocessed OCT images with the aim of AD diagnosis. In this work, we trained a super-resolution generative adversarial network on the raw OCT scans to improve the quality of the images before the modeling stage. In our architecture search strategy, different primitive operations suggested to find down- & up-sampling Unet cell blocks and the binary gate method has been applied to make the search strategy more practical. Our architecture search method is empirically evaluated by training on the Unet and NAS-Unet from scratch. Specifically, the proposed NAS-Unet training significantly outperforms the baseline human-designed architecture by achieving 95.1% in the mean Intersection over Union metric and 79.1% in the Dice similarity coefficient.","PeriodicalId":6783,"journal":{"name":"2020 25th International Conference on Pattern Recognition (ICPR)","volume":"7 1","pages":"6425-6430"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 25th International Conference on Pattern Recognition (ICPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPR48806.2021.9412818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Medical image segmentation is a critical field in the domain of computer vision and with the growing acclaim of deep learning based models, research in this field is constantly expanding. Optical coherence tomography (OCT) is a non-invasive method that scans the human's retina with depth. It has been hypothesized that the thickness of the retinal layers extracted from OCTs could be an efficient and effective biomarker for early diagnosis of AD. In this work, we aim to design a self-training model architecture for the task of segmenting the retinal layers in OCT scans. Neural architecture search (NAS) is a subfield of AutoML domain, which has a significant impact on improving the accuracy of machine vision tasks. We integrate the NAS algorithm with a Unet auto-encoder architecture as its backbone. Then, we employ our proposed model to segment the retinal nerve fiber layer in our preprocessed OCT images with the aim of AD diagnosis. In this work, we trained a super-resolution generative adversarial network on the raw OCT scans to improve the quality of the images before the modeling stage. In our architecture search strategy, different primitive operations suggested to find down- & up-sampling Unet cell blocks and the binary gate method has been applied to make the search strategy more practical. Our architecture search method is empirically evaluated by training on the Unet and NAS-Unet from scratch. Specifically, the proposed NAS-Unet training significantly outperforms the baseline human-designed architecture by achieving 95.1% in the mean Intersection over Union metric and 79.1% in the Dice similarity coefficient.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于神经结构搜索和SRGAN的OCT图像分割

医学图像分割是计算机视觉领域的一个关键领域，随着基于深度学习模型的日益普及，该领域的研究也在不断扩大。光学相干断层扫描(OCT)是一种对人视网膜进行深度扫描的非侵入性方法。研究人员推测，从OCTs中提取的视网膜层厚度可能是早期诊断AD的有效生物标志物。在这项工作中，我们的目标是设计一个自我训练模型架构，用于分割OCT扫描中的视网膜层。神经结构搜索(NAS)是AutoML领域的一个子领域，对提高机器视觉任务的准确率有着重要的影响。我们将NAS算法与Unet自动编码器架构集成为其主干。然后，我们利用所提出的模型对预处理OCT图像中的视网膜神经纤维层进行分割，目的是诊断AD。在这项工作中，我们在原始OCT扫描上训练了一个超分辨率生成对抗网络，以在建模阶段之前提高图像质量。在我们的架构搜索策略中，提出了不同的原语操作来查找上下采样的Unet单元块，并采用了二值门方法使搜索策略更加实用。我们的架构搜索方法通过在Unet和NAS-Unet上从零开始的训练进行了经验评估。具体来说，所提出的NAS-Unet训练显著优于基线人类设计的架构，达到95.1%的平均交集与联合度量和79.1%的骰子相似系数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助