使用深度机器学习方法对光学相干断层扫描图像进行分类

Digital Diagnostics Pub Date : 2024-03-11 DOI:10.17816/dd623801

Alexander Arzamastsev, O. Fabrikantov, Elena Valerievna Kulagina, N. Zenkova

{"title":"使用深度机器学习方法对光学相干断层扫描图像进行分类","authors":"Alexander Arzamastsev, O. Fabrikantov, Elena Valerievna Kulagina, N. Zenkova","doi":"10.17816/dd623801","DOIUrl":null,"url":null,"abstract":"Backgraund. Optical coherence tomography (OCT) is a modern high-tech and informative method for detecting pathology of the retina and preretinal layers of the vitreous body. However, the description and interpretation of the research results require high qualifications and special training of an ophthalmologist, and significant time expenditure for the doctor and the patient. At the same time, the use of mathematical models based on artificial neural networks (ANN- models) currently makes it possible to automate many processes associated with image processing. Therefore, solving problems associated with automating the process of classifying OCT images based on ANN models is actual. \nAims. To develop architectures of mathematical (computer) models based on deep learning of convolutional neural networks (CNN) for classification of OCT images of the retina. To compare the results of computational experiments conducted using Python tools in the Google Colaboratory with single-model and multi-model approaches and evaluate classification accuracy. To make conclusions about the optimal architecture of ANN models and the values of the hyperparameters used. \nMaterials and methods. The original dataset, which was anonymized OCT images of real patients, included more than 2000 images obtained directly from the device in a resolution of 1920 × 969 × 24 BPP. The number of image classes is 12. To create the training and validation data sets, a subject area of 1100 × 550 × 24 BPP was “cut out.” Various approaches were studied: the possibility of using pretrained CNNs with transfer learning, techniques for resizing and augmenting images, as well as various combinations of hyperparameters of ANN-models. When compiling the model, the following parameters were used: Adam optimizer, categorical_crossentropy loss function, accuracy metric. All technological processes with images and ANN-models were carried out using Python language tools in Google Colaboratory. \nResults. Single-model and multi-model principles for classifying OCT images of the retina are proposed. Computational experiments on automated classification of such images obtained from a DRI OCT Triton 3D tomograph using various ANN model architectures showed an accuracy of 98-100% during training and validation and 85% during an additional test, which is a satisfactory result. The optimal architecture of the ANN model - a six-layer convolutional network - was selected and the values of its hyperparameters were determined. \nConclusions. The results of deep training of convolutional neural network models with various architectures, their validation and testing showed satisfactory classification accuracy of retinal OCT images. These developments can be used in decision support systems in the field of ophthalmology.","PeriodicalId":34831,"journal":{"name":"Digital Diagnostics","volume":"15 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification of optical coherence tomography images using deep machine learning methods\",\"authors\":\"Alexander Arzamastsev, O. Fabrikantov, Elena Valerievna Kulagina, N. Zenkova\",\"doi\":\"10.17816/dd623801\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Backgraund. Optical coherence tomography (OCT) is a modern high-tech and informative method for detecting pathology of the retina and preretinal layers of the vitreous body. However, the description and interpretation of the research results require high qualifications and special training of an ophthalmologist, and significant time expenditure for the doctor and the patient. At the same time, the use of mathematical models based on artificial neural networks (ANN- models) currently makes it possible to automate many processes associated with image processing. Therefore, solving problems associated with automating the process of classifying OCT images based on ANN models is actual. \\nAims. To develop architectures of mathematical (computer) models based on deep learning of convolutional neural networks (CNN) for classification of OCT images of the retina. To compare the results of computational experiments conducted using Python tools in the Google Colaboratory with single-model and multi-model approaches and evaluate classification accuracy. To make conclusions about the optimal architecture of ANN models and the values of the hyperparameters used. \\nMaterials and methods. The original dataset, which was anonymized OCT images of real patients, included more than 2000 images obtained directly from the device in a resolution of 1920 × 969 × 24 BPP. The number of image classes is 12. To create the training and validation data sets, a subject area of 1100 × 550 × 24 BPP was “cut out.” Various approaches were studied: the possibility of using pretrained CNNs with transfer learning, techniques for resizing and augmenting images, as well as various combinations of hyperparameters of ANN-models. When compiling the model, the following parameters were used: Adam optimizer, categorical_crossentropy loss function, accuracy metric. All technological processes with images and ANN-models were carried out using Python language tools in Google Colaboratory. \\nResults. Single-model and multi-model principles for classifying OCT images of the retina are proposed. Computational experiments on automated classification of such images obtained from a DRI OCT Triton 3D tomograph using various ANN model architectures showed an accuracy of 98-100% during training and validation and 85% during an additional test, which is a satisfactory result. The optimal architecture of the ANN model - a six-layer convolutional network - was selected and the values of its hyperparameters were determined. \\nConclusions. The results of deep training of convolutional neural network models with various architectures, their validation and testing showed satisfactory classification accuracy of retinal OCT images. These developments can be used in decision support systems in the field of ophthalmology.\",\"PeriodicalId\":34831,\"journal\":{\"name\":\"Digital Diagnostics\",\"volume\":\"15 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Diagnostics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17816/dd623801\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Diagnostics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17816/dd623801","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

后视镜光学相干断层扫描（OCT）是一种检测视网膜和玻璃体视网膜前层病变的现代高科技信息方法。然而，对研究结果的描述和解释需要眼科医生的高素质和特殊培训，医生和病人也需要花费大量时间。与此同时，目前基于人工神经网络（ANN 模型）的数学模型的使用使得许多与图像处理相关的过程实现了自动化。因此，解决与基于人工神经网络模型的 OCT 图像分类过程自动化相关的问题是切实可行的。目标开发基于卷积神经网络（CNN）深度学习的数学（计算机）模型架构，用于视网膜 OCT 图像分类。使用谷歌实验室中的 Python 工具，比较单一模型和多模型方法的计算实验结果，并评估分类准确性。对 ANN 模型的最佳架构和所使用的超参数值做出结论。材料和方法原始数据集是真实患者的匿名 OCT 图像，包括直接从设备获取的 2000 多张图像，分辨率为 1920 × 969 × 24 BPP。图像类别数量为 12 个。为了创建训练和验证数据集，"切出 "了一个 1100 × 550 × 24 BPP 的主题区域。对各种方法进行了研究：使用带迁移学习的预训练 CNN 的可能性、调整图像大小和增强图像的技术，以及 ANN 模型超参数的各种组合。在编制模型时，使用了以下参数：亚当优化器、分类交叉熵损失函数、准确度指标。所有图像和 ANN 模型的技术处理均使用 Google Colaboratory 中的 Python 语言工具进行。结果提出了视网膜 OCT 图像分类的单模型和多模型原则。使用各种 ANN 模型架构对从 DRI OCT Triton 3D 层析成像仪获得的此类图像进行自动分类的计算实验表明，在训练和验证过程中的准确率为 98%-100%，在附加测试中的准确率为 85%，这是一个令人满意的结果。我们选择了最佳结构的方差网络模型（六层卷积网络），并确定了其超参数值。结论采用不同架构的卷积神经网络模型的深度训练、验证和测试结果表明，视网膜 OCT 图像的分类准确率令人满意。这些进展可用于眼科领域的决策支持系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Classification of optical coherence tomography images using deep machine learning methods

Backgraund. Optical coherence tomography (OCT) is a modern high-tech and informative method for detecting pathology of the retina and preretinal layers of the vitreous body. However, the description and interpretation of the research results require high qualifications and special training of an ophthalmologist, and significant time expenditure for the doctor and the patient. At the same time, the use of mathematical models based on artificial neural networks (ANN- models) currently makes it possible to automate many processes associated with image processing. Therefore, solving problems associated with automating the process of classifying OCT images based on ANN models is actual. Aims. To develop architectures of mathematical (computer) models based on deep learning of convolutional neural networks (CNN) for classification of OCT images of the retina. To compare the results of computational experiments conducted using Python tools in the Google Colaboratory with single-model and multi-model approaches and evaluate classification accuracy. To make conclusions about the optimal architecture of ANN models and the values of the hyperparameters used. Materials and methods. The original dataset, which was anonymized OCT images of real patients, included more than 2000 images obtained directly from the device in a resolution of 1920 × 969 × 24 BPP. The number of image classes is 12. To create the training and validation data sets, a subject area of 1100 × 550 × 24 BPP was “cut out.” Various approaches were studied: the possibility of using pretrained CNNs with transfer learning, techniques for resizing and augmenting images, as well as various combinations of hyperparameters of ANN-models. When compiling the model, the following parameters were used: Adam optimizer, categorical_crossentropy loss function, accuracy metric. All technological processes with images and ANN-models were carried out using Python language tools in Google Colaboratory. Results. Single-model and multi-model principles for classifying OCT images of the retina are proposed. Computational experiments on automated classification of such images obtained from a DRI OCT Triton 3D tomograph using various ANN model architectures showed an accuracy of 98-100% during training and validation and 85% during an additional test, which is a satisfactory result. The optimal architecture of the ANN model - a six-layer convolutional network - was selected and the values of its hyperparameters were determined. Conclusions. The results of deep training of convolutional neural network models with various architectures, their validation and testing showed satisfactory classification accuracy of retinal OCT images. These developments can be used in decision support systems in the field of ophthalmology.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Digital Diagnostics

CiteScore

1.30

自引率

0.00%

发文量

审稿时长

5 weeks