Multimodal modeling with low-dose CT and clinical information for diagnostic artificial intelligence on mediastinal tumors: a preliminary study

IF 3.6 3区医学 Q1 RESPIRATORY SYSTEM BMJ Open Respiratory Research Pub Date : 2024-04-01 DOI:10.1136/bmjresp-2023-002249

Daisuke Yamada, Fumitsugu Kojima, Yujiro Otsuka, Kouhei Kawakami, Naoki Koishi, Ken Oba, Toru Bando, Masaki Matsusako, Yasuyuki Kurihara

{"title":"Multimodal modeling with low-dose CT and clinical information for diagnostic artificial intelligence on mediastinal tumors: a preliminary study","authors":"Daisuke Yamada, Fumitsugu Kojima, Yujiro Otsuka, Kouhei Kawakami, Naoki Koishi, Ken Oba, Toru Bando, Masaki Matsusako, Yasuyuki Kurihara","doi":"10.1136/bmjresp-2023-002249","DOIUrl":null,"url":null,"abstract":"Background Diagnosing mediastinal tumours, including incidental lesions, using low-dose CT (LDCT) performed for lung cancer screening, is challenging. It often requires additional invasive and costly tests for proper characterisation and surgical planning. This indicates the need for a more efficient and patient-centred approach, suggesting a gap in the existing diagnostic methods and the potential for artificial intelligence technologies to address this gap. This study aimed to create a multimodal hybrid transformer model using the Vision Transformer that leverages LDCT features and clinical data to improve surgical decision-making for patients with incidentally detected mediastinal tumours. Methods This retrospective study analysed patients with mediastinal tumours between 2010 and 2021. Patients eligible for surgery (n=30) were considered ‘positive,’ whereas those without tumour enlargement (n=32) were considered ‘negative.’ We developed a hybrid model combining a convolutional neural network with a transformer to integrate imaging and clinical data. The dataset was split in a 5:3:2 ratio for training, validation and testing. The model’s efficacy was evaluated using a receiver operating characteristic (ROC) analysis across 25 iterations of random assignments and compared against conventional radiomics models and models excluding clinical data. Results The multimodal hybrid model demonstrated a mean area under the curve (AUC) of 0.90, significantly outperforming the non-clinical data model (AUC=0.86, p=0.04) and radiomics models (random forest AUC=0.81, p=0.008; logistic regression AUC=0.77, p=0.004). Conclusion Integrating clinical and LDCT data using a hybrid transformer model can improve surgical decision-making for mediastinal tumours, showing superiority over models lacking clinical data integration. Data are available upon reasonable request.","PeriodicalId":9048,"journal":{"name":"BMJ Open Respiratory Research","volume":"186 1","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Open Respiratory Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/bmjresp-2023-002249","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}

引用次数: 0

Abstract

Background Diagnosing mediastinal tumours, including incidental lesions, using low-dose CT (LDCT) performed for lung cancer screening, is challenging. It often requires additional invasive and costly tests for proper characterisation and surgical planning. This indicates the need for a more efficient and patient-centred approach, suggesting a gap in the existing diagnostic methods and the potential for artificial intelligence technologies to address this gap. This study aimed to create a multimodal hybrid transformer model using the Vision Transformer that leverages LDCT features and clinical data to improve surgical decision-making for patients with incidentally detected mediastinal tumours. Methods This retrospective study analysed patients with mediastinal tumours between 2010 and 2021. Patients eligible for surgery (n=30) were considered ‘positive,’ whereas those without tumour enlargement (n=32) were considered ‘negative.’ We developed a hybrid model combining a convolutional neural network with a transformer to integrate imaging and clinical data. The dataset was split in a 5:3:2 ratio for training, validation and testing. The model’s efficacy was evaluated using a receiver operating characteristic (ROC) analysis across 25 iterations of random assignments and compared against conventional radiomics models and models excluding clinical data. Results The multimodal hybrid model demonstrated a mean area under the curve (AUC) of 0.90, significantly outperforming the non-clinical data model (AUC=0.86, p=0.04) and radiomics models (random forest AUC=0.81, p=0.008; logistic regression AUC=0.77, p=0.004). Conclusion Integrating clinical and LDCT data using a hybrid transformer model can improve surgical decision-making for mediastinal tumours, showing superiority over models lacking clinical data integration. Data are available upon reasonable request.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用低剂量 CT 和临床信息建立多模态模型，用于纵隔肿瘤的人工智能诊断：初步研究

背景利用低剂量 CT（LDCT）进行肺癌筛查，诊断纵隔肿瘤（包括偶发病变）是一项挑战。它通常需要额外的侵入性和昂贵的检查来进行正确的定性和手术规划。这表明需要一种更高效、更以患者为中心的方法，同时也表明现有诊断方法存在差距，而人工智能技术则有可能弥补这一差距。本研究旨在利用视觉转换器（Vision Transformer）创建一个多模态混合转换器模型，利用 LDCT 特征和临床数据改善偶然发现纵隔肿瘤患者的手术决策。方法这项回顾性研究分析了 2010 年至 2021 年间的纵隔肿瘤患者。符合手术条件的患者（30 人）被视为 "阳性"，而没有肿瘤增大的患者（32 人）被视为 "阴性"。我们开发了一个混合模型，将卷积神经网络与变压器相结合，以整合成像和临床数据。数据集以 5:3:2 的比例进行训练、验证和测试。使用接收器操作特征（ROC）分析法评估了该模型在 25 次随机分配迭代中的有效性，并与传统的放射组学模型和不包括临床数据的模型进行了比较。结果多模态混合模型的平均曲线下面积（AUC）为 0.90，明显优于非临床数据模型（AUC=0.86，p=0.04）和放射组学模型（随机森林 AUC=0.81，p=0.008；逻辑回归 AUC=0.77，p=0.004）。结论使用混合变换器模型整合临床和 LDCT 数据可改善纵隔肿瘤的手术决策，比缺乏临床数据整合的模型更具优势。如有合理要求，可提供相关数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

BMJ Open Respiratory Research RESPIRATORY SYSTEM-

CiteScore

6.60

自引率

2.40%

发文量

审稿时长

12 weeks

期刊介绍： BMJ Open Respiratory Research is a peer-reviewed, open access journal publishing respiratory and critical care medicine. It is the sister journal to Thorax and co-owned by the British Thoracic Society and BMJ. The journal focuses on robustness of methodology and scientific rigour with less emphasis on novelty or perceived impact. BMJ Open Respiratory Research operates a rapid review process, with continuous publication online, ensuring timely, up-to-date research is available worldwide. The journal publishes review articles and all research study types: Basic science including laboratory based experiments and animal models, Pilot studies or proof of concept, Observational studies, Study protocols, Registries, Clinical trials from phase I to multicentre randomised clinical trials, Systematic reviews and meta-analyses.