{"title":"HET-RL:利用多模态数据,通过基于混合高效变压器的表征学习模型,诊断多种肺部疾病","authors":"A.P. Narmadha, N. Gobalakrishnan","doi":"10.1016/j.bspc.2024.107157","DOIUrl":null,"url":null,"abstract":"<div><div>Pulmonary diseases, encompassing conditions such as chronic bronchitis, emphysema, asthma, and pulmonary fibrosis, involve intricate pathophysiological mechanisms affecting the respiratory system, necessitating precise diagnosis and tailored therapeutic approaches. Timely and accurate diagnosis of pulmonary diseases is crucial as it enables early intervention, optimal management, and prevention of complications, thereby improving patient outcomes and quality of life. The scarcity of multi-modality datasets and challenges in accurate diagnosis underscore the complexities faced by deep learning models in achieving comprehensive pulmonary diagnoses, emphasizing the need for enhanced data diversity and algorithmic robustness in addressing diagnostic issues. To overcome these challenges, we have proposed a hybrid efficient transformer based on representation learning named as HET-RL model for accurate various pulmonary disease using multi-modality. Primarily in our work, we utilized multiple data including radiograph, chief complaints and clinical parameters for enhancing the efficiency of model performance. We have performed dual-level pre-processing such as denoising and normalization for amplifying data quality using Steered Filter (SF) and Min-Max normalization, respectively. Then, we proposed HET-RL model which encompasses of Inter-Attention Transformer (IAT) and Text Analyzer Transformer (TAT) for data analyzing. In which, appropriate features are analyzed and extracted from radiograph (CT scan) by IAT and TAT with representation learning (RL) encode the text and extract significant information from both chief complaint clinical parameters. Finally, the extracted features from hybrid transformers are fused by adapting Fusion Network and then pulmonary disease are classified into multiple classes. Incorporating diverse data sources, including results of laboratory test and patient demographic characteristics, our model demonstrated superior performance compared to non-unified multimodal and an image-only model diagnosis model. It exhibited a 12% and 9% improvement in identifying pulmonary disease. Multimodal hybrid transformer-based models hold promise for streamlining patient triaging and enhancing the clinical decision-making process.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107157"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HET-RL: Multiple pulmonary disease diagnosis via hybrid efficient transformers based representation learning model using multi-modality data\",\"authors\":\"A.P. Narmadha, N. Gobalakrishnan\",\"doi\":\"10.1016/j.bspc.2024.107157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Pulmonary diseases, encompassing conditions such as chronic bronchitis, emphysema, asthma, and pulmonary fibrosis, involve intricate pathophysiological mechanisms affecting the respiratory system, necessitating precise diagnosis and tailored therapeutic approaches. Timely and accurate diagnosis of pulmonary diseases is crucial as it enables early intervention, optimal management, and prevention of complications, thereby improving patient outcomes and quality of life. The scarcity of multi-modality datasets and challenges in accurate diagnosis underscore the complexities faced by deep learning models in achieving comprehensive pulmonary diagnoses, emphasizing the need for enhanced data diversity and algorithmic robustness in addressing diagnostic issues. To overcome these challenges, we have proposed a hybrid efficient transformer based on representation learning named as HET-RL model for accurate various pulmonary disease using multi-modality. Primarily in our work, we utilized multiple data including radiograph, chief complaints and clinical parameters for enhancing the efficiency of model performance. We have performed dual-level pre-processing such as denoising and normalization for amplifying data quality using Steered Filter (SF) and Min-Max normalization, respectively. Then, we proposed HET-RL model which encompasses of Inter-Attention Transformer (IAT) and Text Analyzer Transformer (TAT) for data analyzing. In which, appropriate features are analyzed and extracted from radiograph (CT scan) by IAT and TAT with representation learning (RL) encode the text and extract significant information from both chief complaint clinical parameters. Finally, the extracted features from hybrid transformers are fused by adapting Fusion Network and then pulmonary disease are classified into multiple classes. Incorporating diverse data sources, including results of laboratory test and patient demographic characteristics, our model demonstrated superior performance compared to non-unified multimodal and an image-only model diagnosis model. It exhibited a 12% and 9% improvement in identifying pulmonary disease. Multimodal hybrid transformer-based models hold promise for streamlining patient triaging and enhancing the clinical decision-making process.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"100 \",\"pages\":\"Article 107157\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809424012151\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424012151","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
摘要
肺部疾病包括慢性支气管炎、肺气肿、哮喘和肺纤维化等病症,涉及影响呼吸系统的复杂病理生理机制,需要精确诊断和有针对性的治疗方法。及时、准确地诊断肺部疾病至关重要,因为这有助于早期干预、优化管理和预防并发症,从而改善患者的预后和生活质量。多模态数据集的稀缺和准确诊断的挑战凸显了深度学习模型在实现全面肺部诊断时所面临的复杂性,强调了在解决诊断问题时增强数据多样性和算法鲁棒性的必要性。为了克服这些挑战,我们提出了一种基于表征学习的混合高效变换器,命名为 HET-RL 模型,用于利用多模态准确诊断各种肺部疾病。在我们的工作中,我们主要利用了多种数据,包括 X 光片、主诉和临床参数,以提高模型性能的效率。我们进行了双重预处理,如去噪和归一化,分别使用 Steered Filter(SF)和 Min-Max 归一化来提高数据质量。然后,我们提出了 HET-RL 模型,其中包括用于数据分析的 Inter-Attention Transformer(IAT)和 Text Analyzer Transformer(TAT)。其中,IAT 和 TAT 通过表征学习(RL)对文本进行编码,从放射照片(CT 扫描)中分析并提取适当的特征,并从主诉和临床参数中提取重要信息。最后,通过自适应融合网络将从混合变换器中提取的特征进行融合,然后将肺部疾病分为多个类别。与非统一多模态诊断模型和纯图像诊断模型相比,我们的模型结合了包括实验室检测结果和患者人口特征在内的多种数据源,表现出更优越的性能。它在识别肺部疾病方面分别提高了 12% 和 9%。基于多模态混合变压器的模型有望简化病人分流和加强临床决策过程。
HET-RL: Multiple pulmonary disease diagnosis via hybrid efficient transformers based representation learning model using multi-modality data
Pulmonary diseases, encompassing conditions such as chronic bronchitis, emphysema, asthma, and pulmonary fibrosis, involve intricate pathophysiological mechanisms affecting the respiratory system, necessitating precise diagnosis and tailored therapeutic approaches. Timely and accurate diagnosis of pulmonary diseases is crucial as it enables early intervention, optimal management, and prevention of complications, thereby improving patient outcomes and quality of life. The scarcity of multi-modality datasets and challenges in accurate diagnosis underscore the complexities faced by deep learning models in achieving comprehensive pulmonary diagnoses, emphasizing the need for enhanced data diversity and algorithmic robustness in addressing diagnostic issues. To overcome these challenges, we have proposed a hybrid efficient transformer based on representation learning named as HET-RL model for accurate various pulmonary disease using multi-modality. Primarily in our work, we utilized multiple data including radiograph, chief complaints and clinical parameters for enhancing the efficiency of model performance. We have performed dual-level pre-processing such as denoising and normalization for amplifying data quality using Steered Filter (SF) and Min-Max normalization, respectively. Then, we proposed HET-RL model which encompasses of Inter-Attention Transformer (IAT) and Text Analyzer Transformer (TAT) for data analyzing. In which, appropriate features are analyzed and extracted from radiograph (CT scan) by IAT and TAT with representation learning (RL) encode the text and extract significant information from both chief complaint clinical parameters. Finally, the extracted features from hybrid transformers are fused by adapting Fusion Network and then pulmonary disease are classified into multiple classes. Incorporating diverse data sources, including results of laboratory test and patient demographic characteristics, our model demonstrated superior performance compared to non-unified multimodal and an image-only model diagnosis model. It exhibited a 12% and 9% improvement in identifying pulmonary disease. Multimodal hybrid transformer-based models hold promise for streamlining patient triaging and enhancing the clinical decision-making process.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.