Assigning Transcriptomic Subtypes to Chronic Lymphocytic Leukemia Samples Using Nanopore RNA-Sequencing and Self-Organizing Maps.

IF 4.4 2区 医学 Q1 ONCOLOGY Cancers Pub Date : 2025-03-13 DOI:10.3390/cancers17060964
Arsen Arakelyan, Tamara Sirunyan, Gisane Khachatryan, Siras Hakobyan, Arpine Minasyan, Maria Nikoghosyan, Meline Hakobyan, Andranik Chavushyan, Gevorg Martirosyan, Yervand Hakobyan, Hans Binder
{"title":"Assigning Transcriptomic Subtypes to Chronic Lymphocytic Leukemia Samples Using Nanopore RNA-Sequencing and Self-Organizing Maps.","authors":"Arsen Arakelyan, Tamara Sirunyan, Gisane Khachatryan, Siras Hakobyan, Arpine Minasyan, Maria Nikoghosyan, Meline Hakobyan, Andranik Chavushyan, Gevorg Martirosyan, Yervand Hakobyan, Hans Binder","doi":"10.3390/cancers17060964","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background/Objectives:</b> Massively parallel sequencing technologies have advanced chronic lymphocytic leukemia (CLL) diagnostics and precision oncology. Illumina platforms, while offering robust performance, require substantial infrastructure investment and a large number of samples for cost-efficiency. Conversely, third-generation long-read nanopore sequencing from Oxford Nanopore Technologies (ONT) can significantly reduce sequencing costs, making it a valuable tool in resource-limited settings. However, nanopore sequencing faces challenges with lower accuracy and throughput than Illumina platforms, necessitating additional computational strategies. In this paper, we demonstrate that integrating publicly available short-read data with in-house generated ONT data, along with the application of machine learning approaches, enables the characterization of the CLL transcriptome landscape, the identification of clinically relevant molecular subtypes, and the assignment of these subtypes to nanopore-sequenced samples. <b>Methods:</b> Public Illumina RNA sequencing data for 608 CLL samples were obtained from the CLL-Map Portal. CLL transcriptome analysis, gene module identification, and transcriptomic subtype classification were performed using the oposSOM R package for high-dimensional data visualization with self-organizing maps. Eight CLL patients were recruited from the Hematology Center After Prof. R. Yeolyan (Yerevan, Armenia). Sequencing libraries were prepared from blood total RNA using the PCR-cDNA sequencing-barcoding kit (SQK-PCB109) following the manufacturer's protocol and sequenced on an R9.4.1 flow cell for 24-48 h. Raw reads were converted to TPM values. These data were projected into the SOMs space using the supervised SOMs portrayal (supSOM) approach to predict the SOMs portrait of new samples using support vector machine regression. <b>Results:</b> The CLL transcriptomic landscape reveals disruptions in gene modules (spots) associated with T cell cytotoxicity, B and T cell activation, inflammation, cell cycle, DNA repair, proliferation, and splicing. A specific gene module contained genes associated with poor prognosis in CLL. Accordingly, CLL samples were classified into T-cell cytotoxic, immune, proliferative, splicing, and three mixed types: proliferative-immune, proliferative-splicing, and proliferative-immune-splicing. These transcriptomic subtypes were associated with survival orthogonal to gender and mutation status. Using supervised machine learning approaches, transcriptomic subtypes were assigned to patient samples sequenced with nanopore sequencing. <b>Conclusions:</b> This study demonstrates that the CLL transcriptome landscape can be parsed into functional modules, revealing distinct molecular subtypes based on proliferative and immune activity, with important implications for prognosis and treatment that are orthogonal to other molecular classifications. Additionally, the integration of nanopore sequencing with public datasets and machine learning offers a cost-effective approach to molecular subtyping and prognostic prediction, facilitating more accessible and personalized CLL care.</p>","PeriodicalId":9681,"journal":{"name":"Cancers","volume":"17 6","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11940626/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancers","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/cancers17060964","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background/Objectives: Massively parallel sequencing technologies have advanced chronic lymphocytic leukemia (CLL) diagnostics and precision oncology. Illumina platforms, while offering robust performance, require substantial infrastructure investment and a large number of samples for cost-efficiency. Conversely, third-generation long-read nanopore sequencing from Oxford Nanopore Technologies (ONT) can significantly reduce sequencing costs, making it a valuable tool in resource-limited settings. However, nanopore sequencing faces challenges with lower accuracy and throughput than Illumina platforms, necessitating additional computational strategies. In this paper, we demonstrate that integrating publicly available short-read data with in-house generated ONT data, along with the application of machine learning approaches, enables the characterization of the CLL transcriptome landscape, the identification of clinically relevant molecular subtypes, and the assignment of these subtypes to nanopore-sequenced samples. Methods: Public Illumina RNA sequencing data for 608 CLL samples were obtained from the CLL-Map Portal. CLL transcriptome analysis, gene module identification, and transcriptomic subtype classification were performed using the oposSOM R package for high-dimensional data visualization with self-organizing maps. Eight CLL patients were recruited from the Hematology Center After Prof. R. Yeolyan (Yerevan, Armenia). Sequencing libraries were prepared from blood total RNA using the PCR-cDNA sequencing-barcoding kit (SQK-PCB109) following the manufacturer's protocol and sequenced on an R9.4.1 flow cell for 24-48 h. Raw reads were converted to TPM values. These data were projected into the SOMs space using the supervised SOMs portrayal (supSOM) approach to predict the SOMs portrait of new samples using support vector machine regression. Results: The CLL transcriptomic landscape reveals disruptions in gene modules (spots) associated with T cell cytotoxicity, B and T cell activation, inflammation, cell cycle, DNA repair, proliferation, and splicing. A specific gene module contained genes associated with poor prognosis in CLL. Accordingly, CLL samples were classified into T-cell cytotoxic, immune, proliferative, splicing, and three mixed types: proliferative-immune, proliferative-splicing, and proliferative-immune-splicing. These transcriptomic subtypes were associated with survival orthogonal to gender and mutation status. Using supervised machine learning approaches, transcriptomic subtypes were assigned to patient samples sequenced with nanopore sequencing. Conclusions: This study demonstrates that the CLL transcriptome landscape can be parsed into functional modules, revealing distinct molecular subtypes based on proliferative and immune activity, with important implications for prognosis and treatment that are orthogonal to other molecular classifications. Additionally, the integration of nanopore sequencing with public datasets and machine learning offers a cost-effective approach to molecular subtyping and prognostic prediction, facilitating more accessible and personalized CLL care.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用纳米孔 RNA 测序和自组织图为慢性淋巴细胞白血病样本分配转录组亚型
背景/目的:大规模平行测序技术在推进慢性淋巴细胞白血病(CLL)诊断和精确肿瘤学方面具有重要意义。Illumina平台虽然提供强大的性能,但需要大量的基础设施投资和大量的样品以提高成本效益。相反,来自Oxford nanopore Technologies (ONT)的第三代长读纳米孔测序可以显著降低测序成本,使其成为资源有限环境下的宝贵工具。然而,纳米孔测序面临着精度和通量低于Illumina平台的挑战,需要额外的计算策略。在本文中,我们证明了将公开可用的短读数据与内部生成的ONT数据结合起来,以及机器学习方法的应用,可以表征CLL转录组景观,鉴定临床相关的分子亚型,并将这些亚型分配到纳米孔测序样品中。方法:从CLL- map Portal获取608份CLL样本的公开Illumina RNA测序数据。CLL转录组分析、基因模块鉴定和转录组亚型分类使用oposSOM R包进行高维数据可视化和自组织图。在R. Yeolyan教授(埃里温,亚美尼亚)的指导下,从血液学中心招募了8名CLL患者。使用PCR-cDNA测序条形码试剂盒(SQK-PCB109)按照制造商的方案从血液总RNA中制备测序文库,并在R9.4.1流式细胞池上测序24-48小时。将原始读数转换为TPM值。这些数据使用有监督的SOMs描述(supSOM)方法投影到SOMs空间中,使用支持向量机回归预测新样本的SOMs描述。结果:CLL转录组图谱揭示了与T细胞毒性、B细胞和T细胞活化、炎症、细胞周期、DNA修复、增殖和剪接相关的基因模块(点)的破坏。一个特定的基因模块包含与CLL预后不良相关的基因。因此,CLL样本被分为t细胞毒性、免疫、增殖性、剪接和三种混合类型:增殖性免疫、增殖性剪接和增殖性免疫剪接。这些转录组亚型与生存相关,与性别和突变状态正交。使用有监督的机器学习方法,转录组亚型被分配到用纳米孔测序的患者样本中。结论:本研究表明,CLL转录组图谱可以被解析为功能模块,揭示基于增殖和免疫活性的不同分子亚型,与其他分子分类正交,对预后和治疗具有重要意义。此外,纳米孔测序与公共数据集和机器学习的整合为分子分型和预后预测提供了一种经济有效的方法,促进了更容易获得和个性化的CLL护理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Cancers
Cancers Medicine-Oncology
CiteScore
8.00
自引率
9.60%
发文量
5371
审稿时长
18.07 days
期刊介绍: Cancers (ISSN 2072-6694) is an international, peer-reviewed open access journal on oncology. It publishes reviews, regular research papers and short communications. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced.
期刊最新文献
Outcomes of pPCL Diagnosed Using the IMWG 2021 Consensus Definition: A Retrospective Multicenter Analysis. Predictors of Survival in Patients Aged ≥70 with Glioblastoma: A Time-Dependent Multivariable Analysis. Effects of Tobacco Use on Oral Cancer Screening Algorithm Performance. Differences in Executive Functioning Between Patients with IDH1-Mutant Oligodendroglioma and Astrocytoma Before and After Surgery. Complications of Port-a-Cath Systems: An Institutional Study on Romanian Oncological Patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1