TinyCheXReport：用于生成胸部 X 光报告的压缩深度神经网络

IF 1.8 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE ACM Transactions on Asian and Low-Resource Language Information Processing Pub Date : 2024-07-03 DOI:10.1145/3676166

F. Alotaibi, K. Alyoubi, Ajay Mittal, Vishal Gupta, Navdeep Kaur

{"title":"TinyCheXReport：用于生成胸部 X 光报告的压缩深度神经网络","authors":"F. Alotaibi, K. Alyoubi, Ajay Mittal, Vishal Gupta, Navdeep Kaur","doi":"10.1145/3676166","DOIUrl":null,"url":null,"abstract":"Increase in chest x-ray (CXR) imaging tests has burdened radiologists, thereby posing significant challenges in writing radiological reports on time. Although several deep learning-based automatic report generation methods have been developed till date, most are over-parameterized. For deployment on edge devices with constrained processing power or limited resources, over-parameterized models are often too large. This paper presents a compressed deep learning-based model that is 30% space efficient as compared to the non-compressed base model while both have comparable performance. The model comprising of VGG19 and hierarchical long short-term memory (LSTM) equipped with contextual word embedding layer is used as the base model. The redundant weight parameters are removed from the base model using unstructured one-shot pruning. To overcome the performance degradation, the lightweight pruned model is fine-tuned over publically available OpenI dataset. The quantitative evaluation metric scores demonstrate that proposed model surpasses the performance of state-of-the-art models. Additionally, the proposed model, being 30% space efficient, is easily deployable in resource-limited settings. Thus, this study serves as baseline for development of compressed models to generate radiologicalreports from CXR images.","PeriodicalId":54312,"journal":{"name":"ACM Transactions on Asian and Low-Resource Language Information Processing","volume":null,"pages":null},"PeriodicalIF":1.8000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TinyCheXReport: Compressed deep neural network for Chest X-ray report generation\",\"authors\":\"F. Alotaibi, K. Alyoubi, Ajay Mittal, Vishal Gupta, Navdeep Kaur\",\"doi\":\"10.1145/3676166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Increase in chest x-ray (CXR) imaging tests has burdened radiologists, thereby posing significant challenges in writing radiological reports on time. Although several deep learning-based automatic report generation methods have been developed till date, most are over-parameterized. For deployment on edge devices with constrained processing power or limited resources, over-parameterized models are often too large. This paper presents a compressed deep learning-based model that is 30% space efficient as compared to the non-compressed base model while both have comparable performance. The model comprising of VGG19 and hierarchical long short-term memory (LSTM) equipped with contextual word embedding layer is used as the base model. The redundant weight parameters are removed from the base model using unstructured one-shot pruning. To overcome the performance degradation, the lightweight pruned model is fine-tuned over publically available OpenI dataset. The quantitative evaluation metric scores demonstrate that proposed model surpasses the performance of state-of-the-art models. Additionally, the proposed model, being 30% space efficient, is easily deployable in resource-limited settings. Thus, this study serves as baseline for development of compressed models to generate radiologicalreports from CXR images.\",\"PeriodicalId\":54312,\"journal\":{\"name\":\"ACM Transactions on Asian and Low-Resource Language Information Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Transactions on Asian and Low-Resource Language Information Processing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1145/3676166\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Asian and Low-Resource Language Information Processing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3676166","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

胸部 X 光（CXR）成像测试的增加加重了放射科医生的负担，从而给按时撰写放射报告带来了巨大挑战。尽管迄今为止已开发出几种基于深度学习的自动报告生成方法，但大多数方法都参数过高。对于部署在处理能力受限或资源有限的边缘设备上，过度参数化的模型往往过于庞大。本文提出了一种基于深度学习的压缩模型，与非压缩基础模型相比，空间效率提高了 30%，而两者的性能相当。基础模型由 VGG19 和配备上下文词嵌入层的分层长短期记忆（LSTM）组成。使用非结构化的单次剪枝，从基础模型中去除冗余权重参数。为了克服性能下降问题，在公开的 OpenI 数据集上对轻量级剪枝模型进行了微调。定量评估指标得分表明，所提出的模型超越了最先进模型的性能。此外，所提出的模型节省了 30% 的空间，很容易部署到资源有限的环境中。因此，本研究可作为开发压缩模型的基线，以便从 CXR 图像生成放射报告。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

TinyCheXReport: Compressed deep neural network for Chest X-ray report generation

Increase in chest x-ray (CXR) imaging tests has burdened radiologists, thereby posing significant challenges in writing radiological reports on time. Although several deep learning-based automatic report generation methods have been developed till date, most are over-parameterized. For deployment on edge devices with constrained processing power or limited resources, over-parameterized models are often too large. This paper presents a compressed deep learning-based model that is 30% space efficient as compared to the non-compressed base model while both have comparable performance. The model comprising of VGG19 and hierarchical long short-term memory (LSTM) equipped with contextual word embedding layer is used as the base model. The redundant weight parameters are removed from the base model using unstructured one-shot pruning. To overcome the performance degradation, the lightweight pruned model is fine-tuned over publically available OpenI dataset. The quantitative evaluation metric scores demonstrate that proposed model surpasses the performance of state-of-the-art models. Additionally, the proposed model, being 30% space efficient, is easily deployable in resource-limited settings. Thus, this study serves as baseline for development of compressed models to generate radiologicalreports from CXR images.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ACM Transactions on Asian and Low-Resource Language Information Processing Computer Science-General Computer Science

CiteScore

3.60

自引率

15.00%

发文量

241

期刊介绍： The ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) publishes high quality original archival papers and technical notes in the areas of computation and processing of information in Asian languages, low-resource languages of Africa, Australasia, Oceania and the Americas, as well as related disciplines. The subject areas covered by TALLIP include, but are not limited to: -Computational Linguistics: including computational phonology, computational morphology, computational syntax (e.g. parsing), computational semantics, computational pragmatics, etc. -Linguistic Resources: including computational lexicography, terminology, electronic dictionaries, cross-lingual dictionaries, electronic thesauri, etc. -Hardware and software algorithms and tools for Asian or low-resource language processing, e.g., handwritten character recognition. -Information Understanding: including text understanding, speech understanding, character recognition, discourse processing, dialogue systems, etc. -Machine Translation involving Asian or low-resource languages. -Information Retrieval: including natural language processing (NLP) for concept-based indexing, natural language query interfaces, semantic relevance judgments, etc. -Information Extraction and Filtering: including automatic abstraction, user profiling, etc. -Speech processing: including text-to-speech synthesis and automatic speech recognition. -Multimedia Asian Information Processing: including speech, image, video, image/text translation, etc. -Cross-lingual information processing involving Asian or low-resource languages. -Papers that deal in theory, systems design, evaluation and applications in the aforesaid subjects are appropriate for TALLIP. Emphasis will be placed on the originality and the practical significance of the reported research.