Machine-learning diagnostics of breast cancer using piRNA biomarkers.

IF 2 | Zone 4 (Medicine) | JCR Q3, Biotechnology & Applied Microbiology | Biomarkers | Pub Date: 2025-03-04 | DOI: 10.1080/1354750X.2025.2461067
Amy R Zhao, Valentina L Kouznetsova, Santosh Kesari, Igor F Tsigelny
Citations: 0

Abstract

Background and objectives: Prior studies have shown that small non-coding RNAs (sncRNAs) are associated with cancer occurrence or development. PIWI-interacting RNAs (piRNAs), a recently discovered class of sncRNAs, have been found to play a vital role in physiological processes and cancer initiation. This study aims to use piRNAs as innovative, noninvasive diagnostic biomarkers for breast cancer. Our objective is to develop computational methods that leverage piRNA attributes for breast cancer prediction and to apply them in diagnostics.

Methods: We created a set of piRNA sequence descriptors using information extracted from the piRNA sequences. To ensure accuracy, we established a procedure for converting non-standard piRNA names to standard ones, enabling precise identification of these sequences. Using these descriptors, we applied machine-learning (ML) techniques in WEKA (Waikato Environment for Knowledge Analysis) to a dataset of piRNAs to assess the predictive accuracy of the following classifiers: Logistic Regression, Sequential Minimal Optimization (SMO), Random Forest, and Logistic Model Tree (LMT). Furthermore, we performed SHapley Additive exPlanations (SHAP) analysis to determine which descriptors were most relevant to prediction accuracy. The ML models were then validated on an independent dataset to evaluate their effectiveness in predicting breast cancer.
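
As a rough illustration of what "sequence descriptors" can look like, the sketch below derives a few composition features from a single RNA sequence. The abstract does not list the descriptors actually used, so the function name `sequence_descriptors`, the choice of features (length, base fractions, GC content, dinucleotide frequencies), and the example sequence are all assumptions made for illustration, not the authors' feature set.

```python
from collections import Counter
from itertools import product

BASES = "ACGU"


def sequence_descriptors(seq: str) -> dict[str, float]:
    """Compute simple composition descriptors for one RNA sequence.

    Hypothetical feature set for illustration only; the study's actual
    descriptors are not given in the abstract.
    """
    # Normalise case and treat DNA-style T as RNA U.
    seq = seq.upper().replace("T", "U")
    if not seq or any(base not in BASES for base in seq):
        raise ValueError(f"unexpected sequence: {seq!r}")

    n = len(seq)
    counts = Counter(seq)
    features = {"length": float(n)}

    # Single-base fractions and GC content.
    for base in BASES:
        features[f"frac_{base}"] = counts[base] / n
    features["gc_content"] = (counts["G"] + counts["C"]) / n

    # Overlapping dinucleotide frequencies (all 16 pairs, zeros included).
    pairs = Counter(seq[i:i + 2] for i in range(n - 1))
    total_pairs = max(n - 1, 1)
    for first, second in product(BASES, repeat=2):
        features[f"di_{first}{second}"] = pairs[first + second] / total_pairs

    return features


# Made-up six-base sequence, not a real piRNA.
example = sequence_descriptors("UGACGU")
```

Each sequence then becomes one fixed-length numeric row, which is the form a tool such as WEKA expects as classifier input.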

Results: The top three performing classifiers in WEKA were Logistic Regression, SMO, and LMT. The Logistic Regression model achieved an accuracy of 90.7% in predicting breast cancer, while SMO and LMT attained 89.7% and 85.65%, respectively.
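
For readers unfamiliar with the metric, accuracy is the fraction of samples whose predicted class matches the true class. The minimal sketch below shows that calculation on toy labels; the label vectors are invented for illustration and are not data from the study.

```python
def accuracy(y_true: list[int], y_pred: list[int]) -> float:
    """Return the fraction of predictions that match the true labels."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical labels: 1 = breast-cancer-associated, 0 = control.
y_true = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 0, 1, 0, 1, 0]

acc = accuracy(y_true, y_pred)  # 9 of 10 predictions agree
```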

Conclusions: Our study demonstrates the effectiveness of using ML-based piRNA classifiers in diagnosing breast cancer and contributes to the growing body of evidence supporting piRNAs as biomarkers in cancer diagnosis. However, additional research is needed to validate these findings and further assess the clinical applicability of this approach.
