HET-RL:利用多模态数据,通过基于混合高效变压器的表征学习模型,诊断多种肺部疾病

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2024-11-11 DOI:10.1016/j.bspc.2024.107157
A.P. Narmadha, N. Gobalakrishnan
{"title":"HET-RL:利用多模态数据,通过基于混合高效变压器的表征学习模型,诊断多种肺部疾病","authors":"A.P. Narmadha,&nbsp;N. Gobalakrishnan","doi":"10.1016/j.bspc.2024.107157","DOIUrl":null,"url":null,"abstract":"<div><div>Pulmonary diseases, encompassing conditions such as chronic bronchitis, emphysema, asthma, and pulmonary fibrosis, involve intricate pathophysiological mechanisms affecting the respiratory system, necessitating precise diagnosis and tailored therapeutic approaches. Timely and accurate diagnosis of pulmonary diseases is crucial as it enables early intervention, optimal management, and prevention of complications, thereby improving patient outcomes and quality of life. The scarcity of multi-modality datasets and challenges in accurate diagnosis underscore the complexities faced by deep learning models in achieving comprehensive pulmonary diagnoses, emphasizing the need for enhanced data diversity and algorithmic robustness in addressing diagnostic issues. To overcome these challenges, we have proposed a hybrid efficient transformer based on representation learning named as HET-RL model for accurate various pulmonary disease using multi-modality. Primarily in our work, we utilized multiple data including radiograph, chief complaints and clinical parameters for enhancing the efficiency of model performance. We have performed dual-level pre-processing such as denoising and normalization for amplifying data quality using Steered Filter (SF) and Min-Max normalization, respectively. Then, we proposed HET-RL model which encompasses of Inter-Attention Transformer (IAT) and Text Analyzer Transformer (TAT) for data analyzing. In which, appropriate features are analyzed and extracted from radiograph (CT scan) by IAT and TAT with representation learning (RL) encode the text and extract significant information from both chief complaint clinical parameters. Finally, the extracted features from hybrid transformers are fused by adapting Fusion Network and then pulmonary disease are classified into multiple classes. Incorporating diverse data sources, including results of laboratory test and patient demographic characteristics, our model demonstrated superior performance compared to non-unified multimodal and an image-only model diagnosis model. It exhibited a 12% and 9% improvement in identifying pulmonary disease. Multimodal hybrid transformer-based models hold promise for streamlining patient triaging and enhancing the clinical decision-making process.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107157"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HET-RL: Multiple pulmonary disease diagnosis via hybrid efficient transformers based representation learning model using multi-modality data\",\"authors\":\"A.P. Narmadha,&nbsp;N. Gobalakrishnan\",\"doi\":\"10.1016/j.bspc.2024.107157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Pulmonary diseases, encompassing conditions such as chronic bronchitis, emphysema, asthma, and pulmonary fibrosis, involve intricate pathophysiological mechanisms affecting the respiratory system, necessitating precise diagnosis and tailored therapeutic approaches. Timely and accurate diagnosis of pulmonary diseases is crucial as it enables early intervention, optimal management, and prevention of complications, thereby improving patient outcomes and quality of life. The scarcity of multi-modality datasets and challenges in accurate diagnosis underscore the complexities faced by deep learning models in achieving comprehensive pulmonary diagnoses, emphasizing the need for enhanced data diversity and algorithmic robustness in addressing diagnostic issues. To overcome these challenges, we have proposed a hybrid efficient transformer based on representation learning named as HET-RL model for accurate various pulmonary disease using multi-modality. Primarily in our work, we utilized multiple data including radiograph, chief complaints and clinical parameters for enhancing the efficiency of model performance. We have performed dual-level pre-processing such as denoising and normalization for amplifying data quality using Steered Filter (SF) and Min-Max normalization, respectively. Then, we proposed HET-RL model which encompasses of Inter-Attention Transformer (IAT) and Text Analyzer Transformer (TAT) for data analyzing. In which, appropriate features are analyzed and extracted from radiograph (CT scan) by IAT and TAT with representation learning (RL) encode the text and extract significant information from both chief complaint clinical parameters. Finally, the extracted features from hybrid transformers are fused by adapting Fusion Network and then pulmonary disease are classified into multiple classes. Incorporating diverse data sources, including results of laboratory test and patient demographic characteristics, our model demonstrated superior performance compared to non-unified multimodal and an image-only model diagnosis model. It exhibited a 12% and 9% improvement in identifying pulmonary disease. Multimodal hybrid transformer-based models hold promise for streamlining patient triaging and enhancing the clinical decision-making process.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"100 \",\"pages\":\"Article 107157\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809424012151\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424012151","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

肺部疾病包括慢性支气管炎、肺气肿、哮喘和肺纤维化等病症,涉及影响呼吸系统的复杂病理生理机制,需要精确诊断和有针对性的治疗方法。及时、准确地诊断肺部疾病至关重要,因为这有助于早期干预、优化管理和预防并发症,从而改善患者的预后和生活质量。多模态数据集的稀缺和准确诊断的挑战凸显了深度学习模型在实现全面肺部诊断时所面临的复杂性,强调了在解决诊断问题时增强数据多样性和算法鲁棒性的必要性。为了克服这些挑战,我们提出了一种基于表征学习的混合高效变换器,命名为 HET-RL 模型,用于利用多模态准确诊断各种肺部疾病。在我们的工作中,我们主要利用了多种数据,包括 X 光片、主诉和临床参数,以提高模型性能的效率。我们进行了双重预处理,如去噪和归一化,分别使用 Steered Filter(SF)和 Min-Max 归一化来提高数据质量。然后,我们提出了 HET-RL 模型,其中包括用于数据分析的 Inter-Attention Transformer(IAT)和 Text Analyzer Transformer(TAT)。其中,IAT 和 TAT 通过表征学习(RL)对文本进行编码,从放射照片(CT 扫描)中分析并提取适当的特征,并从主诉和临床参数中提取重要信息。最后,通过自适应融合网络将从混合变换器中提取的特征进行融合,然后将肺部疾病分为多个类别。与非统一多模态诊断模型和纯图像诊断模型相比,我们的模型结合了包括实验室检测结果和患者人口特征在内的多种数据源,表现出更优越的性能。它在识别肺部疾病方面分别提高了 12% 和 9%。基于多模态混合变压器的模型有望简化病人分流和加强临床决策过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
HET-RL: Multiple pulmonary disease diagnosis via hybrid efficient transformers based representation learning model using multi-modality data
Pulmonary diseases, encompassing conditions such as chronic bronchitis, emphysema, asthma, and pulmonary fibrosis, involve intricate pathophysiological mechanisms affecting the respiratory system, necessitating precise diagnosis and tailored therapeutic approaches. Timely and accurate diagnosis of pulmonary diseases is crucial as it enables early intervention, optimal management, and prevention of complications, thereby improving patient outcomes and quality of life. The scarcity of multi-modality datasets and challenges in accurate diagnosis underscore the complexities faced by deep learning models in achieving comprehensive pulmonary diagnoses, emphasizing the need for enhanced data diversity and algorithmic robustness in addressing diagnostic issues. To overcome these challenges, we have proposed a hybrid efficient transformer based on representation learning named as HET-RL model for accurate various pulmonary disease using multi-modality. Primarily in our work, we utilized multiple data including radiograph, chief complaints and clinical parameters for enhancing the efficiency of model performance. We have performed dual-level pre-processing such as denoising and normalization for amplifying data quality using Steered Filter (SF) and Min-Max normalization, respectively. Then, we proposed HET-RL model which encompasses of Inter-Attention Transformer (IAT) and Text Analyzer Transformer (TAT) for data analyzing. In which, appropriate features are analyzed and extracted from radiograph (CT scan) by IAT and TAT with representation learning (RL) encode the text and extract significant information from both chief complaint clinical parameters. Finally, the extracted features from hybrid transformers are fused by adapting Fusion Network and then pulmonary disease are classified into multiple classes. Incorporating diverse data sources, including results of laboratory test and patient demographic characteristics, our model demonstrated superior performance compared to non-unified multimodal and an image-only model diagnosis model. It exhibited a 12% and 9% improvement in identifying pulmonary disease. Multimodal hybrid transformer-based models hold promise for streamlining patient triaging and enhancing the clinical decision-making process.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
期刊最新文献
Innovative brain tumor detection: Stacked random support vector-based hybrid gazelle coati algorithm A novel optimized machine learning approach with texture rectified cross-attention based transformer for COVID-19 detection A lightweight model for the retinal disease classification using optical coherence tomography An improved ECG data compression scheme based on ensemble empirical mode decomposition Performance evaluation of optimal ensemble learning approaches with PCA and LDA-based feature extraction for heart disease prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1