{"title":"Automatic Segmentation of Maxillary Sinus with U-Net Model with Pre-trained Encoder","authors":"Ahmet Said Dedeoğlu, Serkan Özbay, Orhan Tunç","doi":"10.1109/HORA58378.2023.10156706","DOIUrl":null,"url":null,"abstract":"An accurate segmentation of the maxillary sinus (MS) is crucial for the preoperative planning of MS-related surgeries and for preventing postoperative complications. Manual segmentation is challenging, time-consuming, and highly dependent on the practitioner's experience. Therefore, it is not applicable for clinical practice, and accurate, efficient automatic segmentation of MS is required. Convolutional neural networks (CNNs) have recently become the most preferred method for automatic medical image segmentation. In this study, an automatic MS segmentation model based on a convolutional neural network model, U-Net, is proposed. Instead of using the original U-Net encoder, the VGG16 network pre-trained with the ImageNet dataset, apart from the fully connected layers, was used as the encoder of the U-Net architecture to improve the segmentation accuracy. Furthermore, during the training period, models were also trained with focal dice loss (FDL), an equally weighted combination of dice loss (DL) and focal loss, to overcome the imbalanced dataset. The segmentation model based on U-Net with a VGG16 encoder trained with FDL obtained the best results with a dice similarity coefficient (DSC) of 0.93253 and an Intersection over Union (IoU) of 0.88775 on the test dataset.","PeriodicalId":247679,"journal":{"name":"2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HORA58378.2023.10156706","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
An accurate segmentation of the maxillary sinus (MS) is crucial for the preoperative planning of MS-related surgeries and for preventing postoperative complications. Manual segmentation is challenging, time-consuming, and highly dependent on the practitioner's experience. Therefore, it is not applicable for clinical practice, and accurate, efficient automatic segmentation of MS is required. Convolutional neural networks (CNNs) have recently become the most preferred method for automatic medical image segmentation. In this study, an automatic MS segmentation model based on a convolutional neural network model, U-Net, is proposed. Instead of using the original U-Net encoder, the VGG16 network pre-trained with the ImageNet dataset, apart from the fully connected layers, was used as the encoder of the U-Net architecture to improve the segmentation accuracy. Furthermore, during the training period, models were also trained with focal dice loss (FDL), an equally weighted combination of dice loss (DL) and focal loss, to overcome the imbalanced dataset. The segmentation model based on U-Net with a VGG16 encoder trained with FDL obtained the best results with a dice similarity coefficient (DSC) of 0.93253 and an Intersection over Union (IoU) of 0.88775 on the test dataset.
上颌窦(MS)的准确分割对于MS相关手术的术前规划和预防术后并发症至关重要。手动分割具有挑战性,耗时,并且高度依赖于从业者的经验。因此并不适用于临床实践,需要对质谱进行准确、高效的自动分割。近年来,卷积神经网络(cnn)已成为医学图像自动分割的首选方法。本文提出了一种基于卷积神经网络模型U-Net的MS自动分割模型。利用ImageNet数据集预训练的VGG16网络作为U-Net架构的编码器,而不是使用原始的U-Net编码器,以提高分割精度。此外,在训练期间,模型还使用focal dice loss (FDL)进行训练,FDL是骰子损失(DL)和焦点损失的等加权组合,以克服数据集的不平衡。使用FDL训练的VGG16编码器的U-Net分割模型在测试数据集上获得了最佳分割效果,其骰子相似系数(DSC)为0.93253,交集/联合(IoU)为0.88775。