Evaluation of agreement between common clustering strategies for DNA methylation-based subtyping of breast tumours.

IF 3 4区 医学 Q2 GENETICS & HEREDITY Epigenomics Pub Date : 2025-02-01 Epub Date: 2024-12-23 DOI:10.1080/17501911.2024.2441653
Elaheh Zarean, Shuai Li, Ee Ming Wong, Enes Makalic, Roger L Milne, Graham G Giles, Catriona McLean, Melissa C Southey, Pierre-Antoine Dugué
{"title":"Evaluation of agreement between common clustering strategies for DNA methylation-based subtyping of breast tumours.","authors":"Elaheh Zarean, Shuai Li, Ee Ming Wong, Enes Makalic, Roger L Milne, Graham G Giles, Catriona McLean, Melissa C Southey, Pierre-Antoine Dugué","doi":"10.1080/17501911.2024.2441653","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Clustering algorithms have been widely applied to tumor DNA methylation datasets to define methylation-based cancer subtypes. This study aimed to evaluate the agreement between subtypes obtained from common clustering strategies.</p><p><strong>Materials & methods: </strong>We used tumor DNA methylation data from 409 women with breast cancer from the Melbourne Collaborative Cohort Study (MCCS) and 781 breast tumors from The Cancer Genome Atlas (TCGA). Agreement was assessed using the adjusted Rand index for various combinations of number of CpGs, number of clusters and clustering algorithms (hierarchical, K-means, partitioning around medoids, and recursively partitioned mixture models).</p><p><strong>Results: </strong>Inconsistent agreement patterns were observed for between-algorithm and within-algorithm comparisons, with generally poor to moderate agreement (ARI <0.7). Results were qualitatively similar in the MCCS and TCGA, showing better agreement for moderate number of CpGs and fewer clusters (K = 2). Restricting the analysis to CpGs that were differentially-methylated between tumor and normal tissue did not result in higher agreement.</p><p><strong>Conclusion: </strong>Our study highlights that common clustering strategies involving an arbitrary choice of algorithm, number of clusters and number of methylation sites are likely to identify different DNA methylation-based breast tumor subtypes.</p>","PeriodicalId":11959,"journal":{"name":"Epigenomics","volume":" ","pages":"105-114"},"PeriodicalIF":3.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epigenomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/17501911.2024.2441653","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/23 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: Clustering algorithms have been widely applied to tumor DNA methylation datasets to define methylation-based cancer subtypes. This study aimed to evaluate the agreement between subtypes obtained from common clustering strategies.

Materials & methods: We used tumor DNA methylation data from 409 women with breast cancer from the Melbourne Collaborative Cohort Study (MCCS) and 781 breast tumors from The Cancer Genome Atlas (TCGA). Agreement was assessed using the adjusted Rand index for various combinations of number of CpGs, number of clusters and clustering algorithms (hierarchical, K-means, partitioning around medoids, and recursively partitioned mixture models).

Results: Inconsistent agreement patterns were observed for between-algorithm and within-algorithm comparisons, with generally poor to moderate agreement (ARI <0.7). Results were qualitatively similar in the MCCS and TCGA, showing better agreement for moderate number of CpGs and fewer clusters (K = 2). Restricting the analysis to CpGs that were differentially-methylated between tumor and normal tissue did not result in higher agreement.

Conclusion: Our study highlights that common clustering strategies involving an arbitrary choice of algorithm, number of clusters and number of methylation sites are likely to identify different DNA methylation-based breast tumor subtypes.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估基于DNA甲基化的乳腺肿瘤亚型的常见聚类策略之间的一致性。
目的:聚类算法已广泛应用于肿瘤DNA甲基化数据集,以定义基于甲基化的癌症亚型。本研究旨在评估从常见聚类策略中获得的亚型之间的一致性。材料和方法:我们使用来自墨尔本合作队列研究(MCCS)的409名乳腺癌女性和来自癌症基因组图谱(TCGA)的781例乳腺癌肿瘤的肿瘤DNA甲基化数据。使用调整后的Rand指数对cpg数量、聚类数量和聚类算法(分层、K-means、围绕介质划分和递归划分混合模型)的各种组合进行一致性评估。结果:在算法之间和算法内部的比较中观察到不一致的一致性模式,通常一致性较差或中等(ARI结论:我们的研究强调,涉及任意选择算法、聚类数量和甲基化位点数量的常见聚类策略可能识别不同的基于DNA甲基化的乳腺肿瘤亚型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Epigenomics
Epigenomics GENETICS & HEREDITY-
CiteScore
5.80
自引率
2.60%
发文量
95
审稿时长
>12 weeks
期刊介绍: Epigenomics provides the forum to address the rapidly progressing research developments in this ever-expanding field; to report on the major challenges ahead and critical advances that are propelling the science forward. The journal delivers this information in concise, at-a-glance article formats – invaluable to a time constrained community. Substantial developments in our current knowledge and understanding of genomics and epigenetics are constantly being made, yet this field is still in its infancy. Epigenomics provides a critical overview of the latest and most significant advances as they unfold and explores their potential application in the clinical setting.
期刊最新文献
Detection of an intestinal cell DNA methylation signature in blood samples from neonates with necrotizing enterocolitis. Predictive power of epigenetic age - opportunities and cautions. The triple code model for advancing research in rare and undiagnosed diseases beyond the base pairs. Impact of excess sugar on the whole genome DNA methylation pattern in human sperm. Evaluation of agreement between common clustering strategies for DNA methylation-based subtyping of breast tumours.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1