卷积- mtd：一种基于CNN的多标签医用试管检测和分类模型，以促进资源受限的护理点设备。

IF 6.8 2区医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Journal of Biomedical and Health Informatics Pub Date : 2026-04-01 DOI:10.1109/JBHI.2025.3543245

Moneeb Abbas, Wen-Chung Kuo, Khalid Mahmood, Waseem Akram, Sajid Mehmood, Ali Kashif Bashir

{"title":"卷积- mtd：一种基于CNN的多标签医用试管检测和分类模型，以促进资源受限的护理点设备。","authors":"Moneeb Abbas, Wen-Chung Kuo, Khalid Mahmood, Waseem Akram, Sajid Mehmood, Ali Kashif Bashir","doi":"10.1109/JBHI.2025.3543245","DOIUrl":null,"url":null,"abstract":"Computer-aided detection through deep learning is becoming a prevalent approach across various fields, including the detection of anomalies in medical procedures. One such medical procedure involves the placement of medical tubes to provide nutrition or other medical interventions in critically ill patients. Medical tube placement can be highly complex and prone to subjective errors. Malposition of medical tubes is often observed and associated with significant morbidity and mortality. In addition, continuous verification using manual procedures such as capnography, pH testing, auscultation, and visual inspection through chest X-ray (CXR) imaging is required. In this paper, we propose a Conv-MTD, a medical tube detection (MTD) model that detects the placement of medical tubes using CXR images, assisting radiologists with precise identification and categorizing the tubes into normal, abnormal, and borderline placement. Conv-MTD leverages the EfficientNet-B7 architecture as its backbone, enhanced with an auxiliary head in the intermediate layers to mitigate vanishing gradient issues common in deep neural networks. The Conv-MTD is further optimized using post-training 16-bit floating-point (FP16) quantization, which significantly reduces memory consumption by 50% and 2x improvement in inference speed without compromising accuracy. This optimization allows Conv-MTD to achieve efficient performance without requiring high-end computational resources, making it suitable for deployment on point-of-care devices. Conv-MTD provided the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.95 using the open-source RANZCR CLiP dataset. The proposed Conv-MTD has the potential to operate on resource-constrained point-of-care devices due to its use of FP16 computation, enabling low-cost and automated assessments in various healthcare settings.","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2927-2938"},"PeriodicalIF":6.8000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Conv-MTD: A CNN Based Multi-Label Medical Tubes Detection and Classification Model to Facilitate Resource-Constrained Point-of-Care Devices.\",\"authors\":\"Moneeb Abbas, Wen-Chung Kuo, Khalid Mahmood, Waseem Akram, Sajid Mehmood, Ali Kashif Bashir\",\"doi\":\"10.1109/JBHI.2025.3543245\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer-aided detection through deep learning is becoming a prevalent approach across various fields, including the detection of anomalies in medical procedures. One such medical procedure involves the placement of medical tubes to provide nutrition or other medical interventions in critically ill patients. Medical tube placement can be highly complex and prone to subjective errors. Malposition of medical tubes is often observed and associated with significant morbidity and mortality. In addition, continuous verification using manual procedures such as capnography, pH testing, auscultation, and visual inspection through chest X-ray (CXR) imaging is required. In this paper, we propose a Conv-MTD, a medical tube detection (MTD) model that detects the placement of medical tubes using CXR images, assisting radiologists with precise identification and categorizing the tubes into normal, abnormal, and borderline placement. Conv-MTD leverages the EfficientNet-B7 architecture as its backbone, enhanced with an auxiliary head in the intermediate layers to mitigate vanishing gradient issues common in deep neural networks. The Conv-MTD is further optimized using post-training 16-bit floating-point (FP16) quantization, which significantly reduces memory consumption by 50% and 2x improvement in inference speed without compromising accuracy. This optimization allows Conv-MTD to achieve efficient performance without requiring high-end computational resources, making it suitable for deployment on point-of-care devices. Conv-MTD provided the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.95 using the open-source RANZCR CLiP dataset. The proposed Conv-MTD has the potential to operate on resource-constrained point-of-care devices due to its use of FP16 computation, enabling low-cost and automated assessments in various healthcare settings.\",\"PeriodicalId\":13073,\"journal\":{\"name\":\"IEEE Journal of Biomedical and Health Informatics\",\"volume\":\"PP \",\"pages\":\"2927-2938\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2026-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Biomedical and Health Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1109/JBHI.2025.3543245\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3543245","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

通过深度学习的计算机辅助检测正在成为各个领域的普遍方法，包括检测医疗程序中的异常情况。其中一种医疗程序包括放置医疗管，为危重病人提供营养或其他医疗程序。医用管的放置非常复杂，容易出现主观错误。医用导管的错位是经常观察到的，并与显著的发病率和死亡率相关。此外，还需要使用人工程序进行连续验证，如血管造影、pH值测试、听诊和胸部x射线（CXR）成像的目视检查。在本文中，我们提出了一种医疗管检测（MTD）模型convn -MTD，该模型使用CXR图像检测医疗管的放置，协助放射科医生精确识别并将管分为正常，异常和边缘放置。convt - mtd利用最先进的EfficientNet-B7架构作为其主干，并在中间层增强了辅助头，以减轻深度神经网络中常见的梯度消失问题。利用训练后的16位浮点（FP16）量化进一步优化了convm - mtd，有效降低了资源受限设备上的内存消耗和推理延迟。convv - mtd表现最好，接受者-操作者曲线下的平均面积AUC-ROC为0.95。拟议的convm - mtd有可能在资源受限的护理点设备上运行，从而在各种医疗保健环境中实现低成本和自动化评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Conv-MTD: A CNN Based Multi-Label Medical Tubes Detection and Classification Model to Facilitate Resource-Constrained Point-of-Care Devices.

Computer-aided detection through deep learning is becoming a prevalent approach across various fields, including the detection of anomalies in medical procedures. One such medical procedure involves the placement of medical tubes to provide nutrition or other medical interventions in critically ill patients. Medical tube placement can be highly complex and prone to subjective errors. Malposition of medical tubes is often observed and associated with significant morbidity and mortality. In addition, continuous verification using manual procedures such as capnography, pH testing, auscultation, and visual inspection through chest X-ray (CXR) imaging is required. In this paper, we propose a Conv-MTD, a medical tube detection (MTD) model that detects the placement of medical tubes using CXR images, assisting radiologists with precise identification and categorizing the tubes into normal, abnormal, and borderline placement. Conv-MTD leverages the EfficientNet-B7 architecture as its backbone, enhanced with an auxiliary head in the intermediate layers to mitigate vanishing gradient issues common in deep neural networks. The Conv-MTD is further optimized using post-training 16-bit floating-point (FP16) quantization, which significantly reduces memory consumption by 50% and 2x improvement in inference speed without compromising accuracy. This optimization allows Conv-MTD to achieve efficient performance without requiring high-end computational resources, making it suitable for deployment on point-of-care devices. Conv-MTD provided the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.95 using the open-source RANZCR CLiP dataset. The proposed Conv-MTD has the potential to operate on resource-constrained point-of-care devices due to its use of FP16 computation, enabling low-cost and automated assessments in various healthcare settings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Journal of Biomedical and Health Informatics COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

CiteScore

13.60

自引率

6.50%

发文量

1151

期刊介绍： IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.