MOSAIC:基于人工智能的罕见癌症多模式分析、分类和个性化预后评估框架。

IF 3.3 Q2 ONCOLOGY JCO Clinical Cancer Informatics Pub Date : 2024-06-01 DOI:10.1200/CCI.24.00008
Saverio D'Amico, Lorenzo Dall'Olio, Cesare Rollo, Patricia Alonso, Iñigo Prada-Luengo, Daniele Dall'Olio, Claudia Sala, Elisabetta Sauta, Gianluca Asti, Luca Lanino, Giulia Maggioni, Alessia Campagna, Elena Zazzetti, Mattia Delleani, Maria Elena Bicchieri, Pierandrea Morandini, Victor Savevski, Borja Arroyo, Juan Parras, Lin Pierre Zhao, Uwe Platzbecker, Maria Diez-Campelo, Valeria Santini, Pierre Fenaux, Torsten Haferlach, Anders Krogh, Santiago Zazo, Piero Fariselli, Tiziana Sanavia, Matteo Giovanni Della Porta, Gastone Castellani
{"title":"MOSAIC:基于人工智能的罕见癌症多模式分析、分类和个性化预后评估框架。","authors":"Saverio D'Amico, Lorenzo Dall'Olio, Cesare Rollo, Patricia Alonso, Iñigo Prada-Luengo, Daniele Dall'Olio, Claudia Sala, Elisabetta Sauta, Gianluca Asti, Luca Lanino, Giulia Maggioni, Alessia Campagna, Elena Zazzetti, Mattia Delleani, Maria Elena Bicchieri, Pierandrea Morandini, Victor Savevski, Borja Arroyo, Juan Parras, Lin Pierre Zhao, Uwe Platzbecker, Maria Diez-Campelo, Valeria Santini, Pierre Fenaux, Torsten Haferlach, Anders Krogh, Santiago Zazo, Piero Fariselli, Tiziana Sanavia, Matteo Giovanni Della Porta, Gastone Castellani","doi":"10.1200/CCI.24.00008","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Rare cancers constitute over 20% of human neoplasms, often affecting patients with unmet medical needs. The development of effective classification and prognostication systems is crucial to improve the decision-making process and drive innovative treatment strategies. We have created and implemented MOSAIC, an artificial intelligence (AI)-based framework designed for multimodal analysis, classification, and personalized prognostic assessment in rare cancers. Clinical validation was performed on myelodysplastic syndrome (MDS), a rare hematologic cancer with clinical and genomic heterogeneities.</p><p><strong>Methods: </strong>We analyzed 4,427 patients with MDS divided into training and validation cohorts. Deep learning methods were applied to integrate and impute clinical/genomic features. Clustering was performed by combining Uniform Manifold Approximation and Projection for Dimension Reduction + Hierarchical Density-Based Spatial Clustering of Applications with Noise (UMAP + HDBSCAN) methods, compared with the conventional Hierarchical Dirichlet Process (HDP). Linear and AI-based nonlinear approaches were compared for survival prediction. Explainable AI (Shapley Additive Explanations approach [SHAP]) and federated learning were used to improve the interpretation and the performance of the clinical models, integrating them into distributed infrastructure.</p><p><strong>Results: </strong>UMAP + HDBSCAN clustering obtained a more granular patient stratification, achieving a higher average silhouette coefficient (0.16) with respect to HDP (0.01) and higher balanced accuracy in cluster classification by Random Forest (92.7% ± 1.3% and 85.8% ± 0.8%). AI methods for survival prediction outperform conventional statistical techniques and the reference prognostic tool for MDS. Nonlinear Gradient Boosting Survival stands in the internal (Concordance-Index [C-Index], 0.77; SD, 0.01) and external validation (C-Index, 0.74; SD, 0.02). SHAP analysis revealed that similar features drove patients' subgroups and outcomes in both training and validation cohorts. Federated implementation improved the accuracy of developed models.</p><p><strong>Conclusion: </strong>MOSAIC provides an explainable and robust framework to optimize classification and prognostic assessment of rare cancers. AI-based approaches demonstrated superior accuracy in capturing genomic similarities and providing individual prognostic information compared with conventional statistical methods. Its federated implementation ensures broad clinical application, guaranteeing high performance and data protection.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400008"},"PeriodicalIF":3.3000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371092/pdf/","citationCount":"0","resultStr":"{\"title\":\"MOSAIC: An Artificial Intelligence-Based Framework for Multimodal Analysis, Classification, and Personalized Prognostic Assessment in Rare Cancers.\",\"authors\":\"Saverio D'Amico, Lorenzo Dall'Olio, Cesare Rollo, Patricia Alonso, Iñigo Prada-Luengo, Daniele Dall'Olio, Claudia Sala, Elisabetta Sauta, Gianluca Asti, Luca Lanino, Giulia Maggioni, Alessia Campagna, Elena Zazzetti, Mattia Delleani, Maria Elena Bicchieri, Pierandrea Morandini, Victor Savevski, Borja Arroyo, Juan Parras, Lin Pierre Zhao, Uwe Platzbecker, Maria Diez-Campelo, Valeria Santini, Pierre Fenaux, Torsten Haferlach, Anders Krogh, Santiago Zazo, Piero Fariselli, Tiziana Sanavia, Matteo Giovanni Della Porta, Gastone Castellani\",\"doi\":\"10.1200/CCI.24.00008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Rare cancers constitute over 20% of human neoplasms, often affecting patients with unmet medical needs. The development of effective classification and prognostication systems is crucial to improve the decision-making process and drive innovative treatment strategies. We have created and implemented MOSAIC, an artificial intelligence (AI)-based framework designed for multimodal analysis, classification, and personalized prognostic assessment in rare cancers. Clinical validation was performed on myelodysplastic syndrome (MDS), a rare hematologic cancer with clinical and genomic heterogeneities.</p><p><strong>Methods: </strong>We analyzed 4,427 patients with MDS divided into training and validation cohorts. Deep learning methods were applied to integrate and impute clinical/genomic features. Clustering was performed by combining Uniform Manifold Approximation and Projection for Dimension Reduction + Hierarchical Density-Based Spatial Clustering of Applications with Noise (UMAP + HDBSCAN) methods, compared with the conventional Hierarchical Dirichlet Process (HDP). Linear and AI-based nonlinear approaches were compared for survival prediction. Explainable AI (Shapley Additive Explanations approach [SHAP]) and federated learning were used to improve the interpretation and the performance of the clinical models, integrating them into distributed infrastructure.</p><p><strong>Results: </strong>UMAP + HDBSCAN clustering obtained a more granular patient stratification, achieving a higher average silhouette coefficient (0.16) with respect to HDP (0.01) and higher balanced accuracy in cluster classification by Random Forest (92.7% ± 1.3% and 85.8% ± 0.8%). AI methods for survival prediction outperform conventional statistical techniques and the reference prognostic tool for MDS. Nonlinear Gradient Boosting Survival stands in the internal (Concordance-Index [C-Index], 0.77; SD, 0.01) and external validation (C-Index, 0.74; SD, 0.02). SHAP analysis revealed that similar features drove patients' subgroups and outcomes in both training and validation cohorts. Federated implementation improved the accuracy of developed models.</p><p><strong>Conclusion: </strong>MOSAIC provides an explainable and robust framework to optimize classification and prognostic assessment of rare cancers. AI-based approaches demonstrated superior accuracy in capturing genomic similarities and providing individual prognostic information compared with conventional statistical methods. Its federated implementation ensures broad clinical application, guaranteeing high performance and data protection.</p>\",\"PeriodicalId\":51626,\"journal\":{\"name\":\"JCO Clinical Cancer Informatics\",\"volume\":\"8 \",\"pages\":\"e2400008\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371092/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JCO Clinical Cancer Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1200/CCI.24.00008\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.24.00008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:罕见癌症占人类肿瘤的 20% 以上,通常会影响到尚未满足医疗需求的患者。开发有效的分类和预后系统对于改善决策过程和推动创新治疗策略至关重要。我们创建并实施了 MOSAIC,这是一个基于人工智能(AI)的框架,旨在对罕见癌症进行多模态分析、分类和个性化预后评估。骨髓增生异常综合征(MDS)是一种罕见的血液肿瘤,具有临床和基因组异质性,我们对其进行了临床验证:我们对 4427 名 MDS 患者进行了分析,将其分为训练组和验证组。应用深度学习方法整合和估算临床/基因组特征。与传统的分层迪里希勒过程(HDP)相比,聚类是通过结合统一表层逼近和投影降维+基于密度的分层空间聚类(UMAP + HDBSCAN)方法进行的。在生存预测方面,对线性方法和基于人工智能的非线性方法进行了比较。可解释人工智能(Shapley Additive Explanations approach [SHAP])和联合学习被用来改进临床模型的解释和性能,并将它们集成到分布式基础设施中:UMAP+HDBSCAN聚类方法获得了更精细的患者分层,与HDP(0.01)相比,平均剪影系数(0.16)更高,随机森林(92.7%±1.3%)和85.8%±0.8%)聚类分类的均衡准确率更高。人工智能生存预测方法优于传统统计技术和 MDS 的参考预后工具。非线性梯度提升生存率在内部验证(Concordance-Index [C-Index],0.77;SD,0.01)和外部验证(C-Index,0.74;SD,0.02)中均名列前茅。SHAP分析表明,在训练组和验证组中,类似的特征驱动着患者的亚组和结果。联合实施提高了所开发模型的准确性:MOSAIC为优化罕见癌症的分类和预后评估提供了一个可解释且稳健的框架。与传统统计方法相比,基于人工智能的方法在捕捉基因组相似性和提供个体预后信息方面表现出更高的准确性。其联合实施确保了广泛的临床应用,保证了高性能和数据保护。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MOSAIC: An Artificial Intelligence-Based Framework for Multimodal Analysis, Classification, and Personalized Prognostic Assessment in Rare Cancers.

Purpose: Rare cancers constitute over 20% of human neoplasms, often affecting patients with unmet medical needs. The development of effective classification and prognostication systems is crucial to improve the decision-making process and drive innovative treatment strategies. We have created and implemented MOSAIC, an artificial intelligence (AI)-based framework designed for multimodal analysis, classification, and personalized prognostic assessment in rare cancers. Clinical validation was performed on myelodysplastic syndrome (MDS), a rare hematologic cancer with clinical and genomic heterogeneities.

Methods: We analyzed 4,427 patients with MDS divided into training and validation cohorts. Deep learning methods were applied to integrate and impute clinical/genomic features. Clustering was performed by combining Uniform Manifold Approximation and Projection for Dimension Reduction + Hierarchical Density-Based Spatial Clustering of Applications with Noise (UMAP + HDBSCAN) methods, compared with the conventional Hierarchical Dirichlet Process (HDP). Linear and AI-based nonlinear approaches were compared for survival prediction. Explainable AI (Shapley Additive Explanations approach [SHAP]) and federated learning were used to improve the interpretation and the performance of the clinical models, integrating them into distributed infrastructure.

Results: UMAP + HDBSCAN clustering obtained a more granular patient stratification, achieving a higher average silhouette coefficient (0.16) with respect to HDP (0.01) and higher balanced accuracy in cluster classification by Random Forest (92.7% ± 1.3% and 85.8% ± 0.8%). AI methods for survival prediction outperform conventional statistical techniques and the reference prognostic tool for MDS. Nonlinear Gradient Boosting Survival stands in the internal (Concordance-Index [C-Index], 0.77; SD, 0.01) and external validation (C-Index, 0.74; SD, 0.02). SHAP analysis revealed that similar features drove patients' subgroups and outcomes in both training and validation cohorts. Federated implementation improved the accuracy of developed models.

Conclusion: MOSAIC provides an explainable and robust framework to optimize classification and prognostic assessment of rare cancers. AI-based approaches demonstrated superior accuracy in capturing genomic similarities and providing individual prognostic information compared with conventional statistical methods. Its federated implementation ensures broad clinical application, guaranteeing high performance and data protection.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.20
自引率
4.80%
发文量
190
期刊最新文献
Development, Validation, and Clinical Utility of Electronic Patient-Reported Outcome Measure-Enhanced Prediction Models for Overall Survival in Patients With Advanced Non-Small Cell Lung Cancer Receiving Immunotherapy. Metastatic Versus Localized Disease as Inclusion Criteria That Can Be Automatically Extracted From Randomized Controlled Trials Using Natural Language Processing. Identifying Oncology Patients at High Risk for Potentially Preventable Emergency Department Visits Using a Novel Definition. Use of Patient-Reported Outcomes in Risk Prediction Model Development to Support Cancer Care Delivery: A Scoping Review. Optimizing End Points for Phase III Cancer Trials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1