Classification of non-TCGA cancer samples to TCGA molecular subtypes using compact feature sets

IF 48.8 1区 医学 Q1 CELL BIOLOGY Cancer Cell Pub Date : 2025-01-02 DOI:10.1016/j.ccell.2024.12.002
Kyle Ellrott, Christopher K. Wong, Christina Yau, Mauro A.A. Castro, Jordan A. Lee, Brian J. Karlberg, Jasleen K. Grewal, Vincenzo Lagani, Bahar Tercan, Verena Friedl, Toshinori Hinoue, Vladislav Uzunangelov, Lindsay Westlake, Xavier Loinaz, Ina Felau, Peggy I. Wang, Anab Kemal, Samantha J. Caesar-Johnson, Ilya Shmulevich, Alexander J. Lazar, Peter W. Laird
{"title":"Classification of non-TCGA cancer samples to TCGA molecular subtypes using compact feature sets","authors":"Kyle Ellrott, Christopher K. Wong, Christina Yau, Mauro A.A. Castro, Jordan A. Lee, Brian J. Karlberg, Jasleen K. Grewal, Vincenzo Lagani, Bahar Tercan, Verena Friedl, Toshinori Hinoue, Vladislav Uzunangelov, Lindsay Westlake, Xavier Loinaz, Ina Felau, Peggy I. Wang, Anab Kemal, Samantha J. Caesar-Johnson, Ilya Shmulevich, Alexander J. Lazar, Peter W. Laird","doi":"10.1016/j.ccell.2024.12.002","DOIUrl":null,"url":null,"abstract":"Molecular subtypes, such as defined by The Cancer Genome Atlas (TCGA), delineate a cancer’s underlying biology, bringing hope to inform a patient’s prognosis and treatment plan. However, most approaches used in the discovery of subtypes are not suitable for assigning subtype labels to new cancer specimens from other studies or clinical trials. Here, we address this barrier by applying five different machine learning approaches to multi-omic data from 8,791 TCGA tumor samples comprising 106 subtypes from 26 different cancer cohorts to build models based upon small numbers of features that can classify new samples into previously defined TCGA molecular subtypes—a step toward molecular subtype application in the clinic. We validate select classifiers using external datasets. Predictive performance and classifier-selected features yield insight into the different machine-learning approaches and genomic data platforms. For each cancer and data type we provide containerized versions of the top-performing models as a public resource.","PeriodicalId":9670,"journal":{"name":"Cancer Cell","volume":"92 1","pages":""},"PeriodicalIF":48.8000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Cell","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ccell.2024.12.002","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Molecular subtypes, such as defined by The Cancer Genome Atlas (TCGA), delineate a cancer’s underlying biology, bringing hope to inform a patient’s prognosis and treatment plan. However, most approaches used in the discovery of subtypes are not suitable for assigning subtype labels to new cancer specimens from other studies or clinical trials. Here, we address this barrier by applying five different machine learning approaches to multi-omic data from 8,791 TCGA tumor samples comprising 106 subtypes from 26 different cancer cohorts to build models based upon small numbers of features that can classify new samples into previously defined TCGA molecular subtypes—a step toward molecular subtype application in the clinic. We validate select classifiers using external datasets. Predictive performance and classifier-selected features yield insight into the different machine-learning approaches and genomic data platforms. For each cancer and data type we provide containerized versions of the top-performing models as a public resource.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Cancer Cell
Cancer Cell 医学-肿瘤学
CiteScore
55.20
自引率
1.20%
发文量
179
审稿时长
4-8 weeks
期刊介绍: Cancer Cell is a journal that focuses on promoting major advances in cancer research and oncology. The primary criteria for considering manuscripts are as follows: Major advances: Manuscripts should provide significant advancements in answering important questions related to naturally occurring cancers. Translational research: The journal welcomes translational research, which involves the application of basic scientific findings to human health and clinical practice. Clinical investigations: Cancer Cell is interested in publishing clinical investigations that contribute to establishing new paradigms in the treatment, diagnosis, or prevention of cancers. Insights into cancer biology: The journal values clinical investigations that provide important insights into cancer biology beyond what has been revealed by preclinical studies. Mechanism-based proof-of-principle studies: Cancer Cell encourages the publication of mechanism-based proof-of-principle clinical studies, which demonstrate the feasibility of a specific therapeutic approach or diagnostic test.
期刊最新文献
Infiltrating plasma cells maintain glioblastoma stem cells through IgG-Tumor binding Beta papillomaviruses: From foe to friend in skin cancer immunity Classification of non-TCGA cancer samples to TCGA molecular subtypes using compact feature sets Neutrophil extracellular traps promote pre-metastatic niche formation in the omentum by expanding innate-like B cells that express IL-10 Plasma cell and cancer stem cell crosstalk in glioblastoma
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1