{"title":"卷积- mtd:一种基于CNN的多标签医用试管检测和分类模型,以促进资源受限的护理点设备。","authors":"Moneeb Abbas, Wen-Chung Kuo, Khalid Mahmood, Waseem Akram, Sajid Mehmood, Ali Kashif Bashir","doi":"10.1109/JBHI.2025.3543245","DOIUrl":null,"url":null,"abstract":"<p><p>Computer-aided detection through deep learning is becoming a prevalent approach across various fields, including the detection of anomalies in medical procedures. One such medical procedure involves the placement of medical tubes to provide nutrition or other medical interventions in critically ill patients. Medical tube placement can be highly complex and prone to subjective errors. Malposition of medical tubes is often observed and associated with significant morbidity and mortality. In addition, continuous verification using manual procedures such as capnography, pH testing, auscultation, and visual inspection through chest X-ray (CXR) imaging is required. In this paper, we propose a Conv-MTD, a medical tube detection (MTD) model that detects the placement of medical tubes using CXR images, assisting radiologists with precise identification and categorizing the tubes into normal, abnormal, and borderline placement. Conv-MTD leverages the EfficientNet-B7 architecture as its backbone, enhanced with an auxiliary head in the intermediate layers to mitigate vanishing gradient issues common in deep neural networks. The Conv-MTD is further optimized using post-training 16-bit floating-point (FP16) quantization, which significantly reduces memory consumption by 50% and 2x improvement in inference speed without compromising accuracy. This optimization allows Conv-MTD to achieve efficient performance without requiring high-end computational resources, making it suitable for deployment on point-of-care devices. Conv-MTD provided the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.95 using the open-source RANZCR CLiP dataset. The proposed Conv-MTD has the potential to operate on resource-constrained point-of-care devices due to its use of FP16 computation, enabling low-cost and automated assessments in various healthcare settings.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2927-2938"},"PeriodicalIF":6.8000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Conv-MTD: A CNN Based Multi-Label Medical Tubes Detection and Classification Model to Facilitate Resource-Constrained Point-of-Care Devices.\",\"authors\":\"Moneeb Abbas, Wen-Chung Kuo, Khalid Mahmood, Waseem Akram, Sajid Mehmood, Ali Kashif Bashir\",\"doi\":\"10.1109/JBHI.2025.3543245\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Computer-aided detection through deep learning is becoming a prevalent approach across various fields, including the detection of anomalies in medical procedures. One such medical procedure involves the placement of medical tubes to provide nutrition or other medical interventions in critically ill patients. Medical tube placement can be highly complex and prone to subjective errors. Malposition of medical tubes is often observed and associated with significant morbidity and mortality. In addition, continuous verification using manual procedures such as capnography, pH testing, auscultation, and visual inspection through chest X-ray (CXR) imaging is required. In this paper, we propose a Conv-MTD, a medical tube detection (MTD) model that detects the placement of medical tubes using CXR images, assisting radiologists with precise identification and categorizing the tubes into normal, abnormal, and borderline placement. Conv-MTD leverages the EfficientNet-B7 architecture as its backbone, enhanced with an auxiliary head in the intermediate layers to mitigate vanishing gradient issues common in deep neural networks. The Conv-MTD is further optimized using post-training 16-bit floating-point (FP16) quantization, which significantly reduces memory consumption by 50% and 2x improvement in inference speed without compromising accuracy. This optimization allows Conv-MTD to achieve efficient performance without requiring high-end computational resources, making it suitable for deployment on point-of-care devices. Conv-MTD provided the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.95 using the open-source RANZCR CLiP dataset. The proposed Conv-MTD has the potential to operate on resource-constrained point-of-care devices due to its use of FP16 computation, enabling low-cost and automated assessments in various healthcare settings.</p>\",\"PeriodicalId\":13073,\"journal\":{\"name\":\"IEEE Journal of Biomedical and Health Informatics\",\"volume\":\"PP \",\"pages\":\"2927-2938\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2026-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Biomedical and Health Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1109/JBHI.2025.3543245\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3543245","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Conv-MTD: A CNN Based Multi-Label Medical Tubes Detection and Classification Model to Facilitate Resource-Constrained Point-of-Care Devices.
Computer-aided detection through deep learning is becoming a prevalent approach across various fields, including the detection of anomalies in medical procedures. One such medical procedure involves the placement of medical tubes to provide nutrition or other medical interventions in critically ill patients. Medical tube placement can be highly complex and prone to subjective errors. Malposition of medical tubes is often observed and associated with significant morbidity and mortality. In addition, continuous verification using manual procedures such as capnography, pH testing, auscultation, and visual inspection through chest X-ray (CXR) imaging is required. In this paper, we propose a Conv-MTD, a medical tube detection (MTD) model that detects the placement of medical tubes using CXR images, assisting radiologists with precise identification and categorizing the tubes into normal, abnormal, and borderline placement. Conv-MTD leverages the EfficientNet-B7 architecture as its backbone, enhanced with an auxiliary head in the intermediate layers to mitigate vanishing gradient issues common in deep neural networks. The Conv-MTD is further optimized using post-training 16-bit floating-point (FP16) quantization, which significantly reduces memory consumption by 50% and 2x improvement in inference speed without compromising accuracy. This optimization allows Conv-MTD to achieve efficient performance without requiring high-end computational resources, making it suitable for deployment on point-of-care devices. Conv-MTD provided the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.95 using the open-source RANZCR CLiP dataset. The proposed Conv-MTD has the potential to operate on resource-constrained point-of-care devices due to its use of FP16 computation, enabling low-cost and automated assessments in various healthcare settings.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.