Feature Selection and Classification Technique for Predicting Lymph Node Metastasis of Papillary Thyroid Carcinoma

Journal of Computational Biophysics and Chemistry (IF 2.0; Zone 4, Chemistry; JCR Q3, Chemistry, Multidisciplinary) · Pub Date: 2024-07-03 · DOI: 10.1142/s2737416524400064
Dan Wu, Zhuang Yan, Guoliang Liao, Lin Han, Ke Chen, Cheng Li, Zhan Hua, Jiangli Lin

Abstract

Papillary thyroid carcinoma (PTC) is typically an indolent cancer, yet a minority of cases develop lymph node metastasis. Because the mechanisms of lymph node metastasis remain unclear, a considerable number of patients undergo unnecessary surgery. Identifying key genetic biomarkers in high-dimensional data remains a significant challenge and limits research progress in this area. Here, we propose a hybrid filter-wrapper feature selection strategy for core factor detection and develop MethyAE, a DNA methylation-based metastasis prediction model built on an end-to-end learning auto-encoder. Forty-six methylated CpG sites were identified as crucial biomarkers for lymph node metastasis. Using 447 PTC samples from The Cancer Genome Atlas (221 with metastasis, 226 without), the MethyAE model achieves 88.9% accuracy and 88.6% recall in predicting lymph node metastasis, outperforming commonly used machine learning methods such as logistic regression and random forest. Furthermore, the MethyAE model exhibits favorable performance on DNA methylation data from colon cancer, bladder cancer, and breast cancer. To the best of our knowledge, this is the first attempt to predict PTC lymph node metastasis through DNA methylation, offering pivotal decision-making criteria for avoiding unnecessary surgeries and selecting appropriate treatment plans for a substantial cohort of PTC patients.
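The abstract does not spell out how the hybrid filter-wrapper strategy is implemented, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes a Welch t-statistic as the filter criterion and greedy forward selection with a nearest-centroid classifier as the wrapper; both are stand-ins chosen for brevity. The data are synthetic, and all function names (`filter_step`, `wrapper_step`, `centroid_accuracy`) and parameter values are hypothetical.

```python
import numpy as np

def filter_step(X, y, k):
    """Filter: rank features by absolute Welch t-statistic and keep the top k."""
    a, b = X[y == 1], X[y == 0]
    se = np.sqrt(a.var(axis=0, ddof=1) / len(a) + b.var(axis=0, ddof=1) / len(b))
    t = np.abs(a.mean(axis=0) - b.mean(axis=0)) / (se + 1e-12)
    return [int(j) for j in np.argsort(t)[::-1][:k]]

def centroid_accuracy(X_tr, y_tr, X_va, y_va, cols):
    """Score a feature subset with a nearest-centroid classifier on held-out data."""
    c1 = X_tr[y_tr == 1][:, cols].mean(axis=0)
    c0 = X_tr[y_tr == 0][:, cols].mean(axis=0)
    d1 = np.linalg.norm(X_va[:, cols] - c1, axis=1)
    d0 = np.linalg.norm(X_va[:, cols] - c0, axis=1)
    pred = (d1 < d0).astype(int)
    return float((pred == y_va).mean())

def wrapper_step(X_tr, y_tr, X_va, y_va, candidates, max_features):
    """Wrapper: greedy forward selection driven by validation accuracy."""
    selected, best = [], 0.0
    while len(selected) < max_features:
        scores = {j: centroid_accuracy(X_tr, y_tr, X_va, y_va, selected + [j])
                  for j in candidates if j not in selected}
        if not scores:
            break
        j, s = max(scores.items(), key=lambda kv: kv[1])
        if s <= best:  # stop when no remaining candidate improves the score
            break
        selected.append(j)
        best = s
    return selected, best

# Synthetic stand-in for a high-dimensional methylation matrix (assumption):
# 200 samples, 500 features, of which the first 5 carry a class signal.
rng = np.random.default_rng(0)
n, p = 200, 500
y = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, p))
X[:, :5] += 1.5 * y[:, None]

# Hold out a validation split; run the filter on training data only
# so that the held-out samples do not leak into feature ranking.
X_tr, y_tr, X_va, y_va = X[:140], y[:140], X[140:], y[140:]
candidates = filter_step(X_tr, y_tr, k=20)
selected, acc = wrapper_step(X_tr, y_tr, X_va, y_va, candidates, max_features=5)
print("selected features:", selected)
print("validation accuracy:", round(acc, 3))
```

The filter stage cheaply narrows hundreds of features to a short candidate list, and the wrapper stage then spends its more expensive model-based evaluations only on those candidates. In a real study the validation score used by the wrapper would itself need to be kept separate from the final test set.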