Deep Learning Models for Anatomical Location Classification in Esophagogastroduodenoscopy Images and Videos: A Quantitative Evaluation with Clinical Data.

IF 3 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL Diagnostics Pub Date : 2024-10-23 DOI:10.3390/diagnostics14212360
Seong Min Kang, Gi Pyo Lee, Young Jae Kim, Kyoung Oh Kim, Kwang Gi Kim
{"title":"Deep Learning Models for Anatomical Location Classification in Esophagogastroduodenoscopy Images and Videos: A Quantitative Evaluation with Clinical Data.","authors":"Seong Min Kang, Gi Pyo Lee, Young Jae Kim, Kyoung Oh Kim, Kwang Gi Kim","doi":"10.3390/diagnostics14212360","DOIUrl":null,"url":null,"abstract":"<p><strong>Background/objectives: </strong>During gastroscopy, accurately identifying the anatomical locations of the gastrointestinal tract is crucial for developing diagnostic aids, such as lesion localization and blind spot alerts.</p><p><strong>Methods: </strong>This study utilized a dataset of 31,403 still images from 1000 patients with normal findings to annotate the anatomical locations within the images and develop a classification model. The model was then applied to videos of 20 esophagogastroduodenoscopy procedures, where it was validated for real-time location prediction. To address instability of predictions caused by independent frame-by-frame assessment, we implemented a hard-voting-based post-processing algorithm that aggregates results from seven consecutive frames, improving the overall accuracy.</p><p><strong>Results: </strong>Among the tested models, InceptionV3 demonstrated superior performance for still images, achieving an F1 score of 79.79%, precision of 80.57%, and recall of 80.08%. For video data, the InceptionResNetV2 model performed best, achieving an F1 score of 61.37%, precision of 73.08%, and recall of 57.21%. These results indicate that the deep learning models not only achieved high accuracy in position recognition for still images but also performed well on video data. Additionally, the post-processing algorithm effectively stabilized the predictions, highlighting its potential for real-time endoscopic applications.</p><p><strong>Conclusions: </strong>This study demonstrates the feasibility of predicting the gastrointestinal tract locations during gastroscopy and suggests a promising path for the development of advanced diagnostic aids to assist clinicians. Furthermore, the location information generated by this model can be leveraged in future technologies, such as automated report generation and supporting follow-up examinations for patients.</p>","PeriodicalId":11225,"journal":{"name":"Diagnostics","volume":"14 21","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11545494/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/diagnostics14212360","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background/objectives: During gastroscopy, accurately identifying the anatomical locations of the gastrointestinal tract is crucial for developing diagnostic aids, such as lesion localization and blind spot alerts.

Methods: This study utilized a dataset of 31,403 still images from 1000 patients with normal findings to annotate the anatomical locations within the images and develop a classification model. The model was then applied to videos of 20 esophagogastroduodenoscopy procedures, where it was validated for real-time location prediction. To address instability of predictions caused by independent frame-by-frame assessment, we implemented a hard-voting-based post-processing algorithm that aggregates results from seven consecutive frames, improving the overall accuracy.

Results: Among the tested models, InceptionV3 demonstrated superior performance for still images, achieving an F1 score of 79.79%, precision of 80.57%, and recall of 80.08%. For video data, the InceptionResNetV2 model performed best, achieving an F1 score of 61.37%, precision of 73.08%, and recall of 57.21%. These results indicate that the deep learning models not only achieved high accuracy in position recognition for still images but also performed well on video data. Additionally, the post-processing algorithm effectively stabilized the predictions, highlighting its potential for real-time endoscopic applications.

Conclusions: This study demonstrates the feasibility of predicting the gastrointestinal tract locations during gastroscopy and suggests a promising path for the development of advanced diagnostic aids to assist clinicians. Furthermore, the location information generated by this model can be leveraged in future technologies, such as automated report generation and supporting follow-up examinations for patients.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于食管胃十二指肠镜图像和视频解剖位置分类的深度学习模型:利用临床数据进行定量评估。
背景/目的:在胃镜检查过程中,准确识别胃肠道的解剖位置对于开发病变定位和盲点警报等诊断辅助工具至关重要:本研究利用来自 1000 名检查结果正常的患者的 31,403 张静态图像数据集,对图像中的解剖位置进行标注,并开发了一个分类模型。然后将该模型应用于 20 个食管胃十二指肠镜检查过程的视频,并对其进行了实时位置预测验证。为了解决逐帧独立评估造成的预测不稳定问题,我们采用了基于硬投票的后处理算法,将七个连续帧的结果汇总,从而提高了整体准确性:在测试的模型中,InceptionV3 在静态图像方面表现出色,F1 得分为 79.79%,精确度为 80.57%,召回率为 80.08%。在视频数据方面,InceptionResNetV2 模型表现最佳,F1 得分为 61.37%,精确度为 73.08%,召回率为 57.21%。这些结果表明,深度学习模型不仅在静态图像的位置识别方面取得了很高的准确率,而且在视频数据方面也表现出色。此外,后处理算法有效地稳定了预测结果,凸显了其在实时内窥镜应用中的潜力:这项研究证明了在胃镜检查过程中预测胃肠道位置的可行性,并为开发先进的诊断辅助工具以协助临床医生提供了一条前景广阔的道路。此外,该模型生成的位置信息还可用于未来的技术中,如自动生成报告和支持患者的后续检查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Diagnostics
Diagnostics Biochemistry, Genetics and Molecular Biology-Clinical Biochemistry
CiteScore
4.70
自引率
8.30%
发文量
2699
审稿时长
19.64 days
期刊介绍: Diagnostics (ISSN 2075-4418) is an international scholarly open access journal on medical diagnostics. It publishes original research articles, reviews, communications and short notes on the research and development of medical diagnostics. There is no restriction on the length of the papers. Our aim is to encourage scientists to publish their experimental and theoretical research in as much detail as possible. Full experimental and/or methodological details must be provided for research articles.
期刊最新文献
RETRACTED: Monda et al. Left Ventricular Non-Compaction in Children: Aetiology and Diagnostic Criteria. Diagnostics 2024, 14, 115. Ultrashort Cell-Free DNA Fragments and Vimentin-Positive Circulating Tumor Cells for Predicting Early Recurrence in Patients with Biliary Tract Cancer. Monitoring and Management of Cytomegalovirus Reactivations After Allogeneic Hematopoietic Stem Cell Transplantation in Children: Experience from a Single Pediatric Center. Molecular Imaging Biomarkers for Early Cancer Detection: A Systematic Review of Emerging Technologies and Clinical Applications. AI-Assisted Detection and Localization of Spinal Metastatic Lesions.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1