Developing a gene expression classifier for breast cancer diagnosis.

IF 2.6 4区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Medical & Biological Engineering & Computing Pub Date : 2025-09-01 Epub Date: 2025-03-13 DOI:10.1007/s11517-025-03329-7
Zahra Hosseinpour, Mostafa Rezaei-Tavirani, Mohammad-Esmaeil Akbari, Masoumeh Farahani
{"title":"Developing a gene expression classifier for breast cancer diagnosis.","authors":"Zahra Hosseinpour, Mostafa Rezaei-Tavirani, Mohammad-Esmaeil Akbari, Masoumeh Farahani","doi":"10.1007/s11517-025-03329-7","DOIUrl":null,"url":null,"abstract":"<p><p>Breast cancer (BC) is the most common type of cancer in women worldwide. Solid tumors are complex structures composed of many cell types and extracellular matrix components. Understanding solid tumors is crucial for developing effective treatments. This study aimed to develop a gene expression classifier to predict BC with high accuracy. The study first identified the most important genes for cancer through differential expression analysis (DEA) between breast cancer and adjacent normal breast samples. The R package STRINGdb was then used to create a protein-protein interaction network (PPI) to examine upregulated genes and find clusters. Enrichment analyses were performed to identify overrepresented biological functions and pathways. A logistic regression prediction model was developed using a breast cancer dataset from TCGA and evaluated using discrimination and calibration measures. BUB1 expression in breast cancer was also investigated using quantitative analysis. Two significant clusters were identified, with cell cycle checkpoints and M phase key pathways in one cluster and extracellular matrix organization in the other. A prediction model using the hub gene set (COMP, FN1, SDC1, BUB1, TTK, and NUSAP1) showed high sensitivity (97.2%) and specificity (96.1%), and an AUC of 0.994. Three hub genes (COMP, FN1, and SDC1) were identified through the PPI network, strongly linked to extracellular matrix organization (BUB1, TTK, and NUSAP1) as hub genes involved in M phase and cell cycle checkpoints. Overall, the study identified hub pathways and genes that accurately distinguish between cancer and normal samples, presenting promising new possibilities for early cancer detection and improved BC therapy.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"2567-2583"},"PeriodicalIF":2.6000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03329-7","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/13 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Breast cancer (BC) is the most common type of cancer in women worldwide. Solid tumors are complex structures composed of many cell types and extracellular matrix components. Understanding solid tumors is crucial for developing effective treatments. This study aimed to develop a gene expression classifier to predict BC with high accuracy. The study first identified the most important genes for cancer through differential expression analysis (DEA) between breast cancer and adjacent normal breast samples. The R package STRINGdb was then used to create a protein-protein interaction network (PPI) to examine upregulated genes and find clusters. Enrichment analyses were performed to identify overrepresented biological functions and pathways. A logistic regression prediction model was developed using a breast cancer dataset from TCGA and evaluated using discrimination and calibration measures. BUB1 expression in breast cancer was also investigated using quantitative analysis. Two significant clusters were identified, with cell cycle checkpoints and M phase key pathways in one cluster and extracellular matrix organization in the other. A prediction model using the hub gene set (COMP, FN1, SDC1, BUB1, TTK, and NUSAP1) showed high sensitivity (97.2%) and specificity (96.1%), and an AUC of 0.994. Three hub genes (COMP, FN1, and SDC1) were identified through the PPI network, strongly linked to extracellular matrix organization (BUB1, TTK, and NUSAP1) as hub genes involved in M phase and cell cycle checkpoints. Overall, the study identified hub pathways and genes that accurately distinguish between cancer and normal samples, presenting promising new possibilities for early cancer detection and improved BC therapy.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
开发一种用于乳腺癌诊断的基因表达分类器。
乳腺癌(BC)是世界范围内女性最常见的癌症类型。实体瘤是由多种细胞类型和细胞外基质成分组成的复杂结构。了解实体瘤对于开发有效的治疗方法至关重要。本研究旨在建立一个基因表达分类器来预测BC的准确性。该研究首先通过乳腺癌和邻近正常乳腺样本之间的差异表达分析(DEA)确定了最重要的癌症基因。然后使用R包STRINGdb创建蛋白相互作用网络(protein-protein interaction network, PPI)来检查上调的基因并找到簇。进行富集分析以确定过度代表的生物学功能和途径。使用TCGA的乳腺癌数据集建立逻辑回归预测模型,并使用判别和校准措施进行评估。用定量分析方法研究了乳腺癌中BUB1的表达。鉴定出两个重要的集群,其中一个集群具有细胞周期检查点和M期关键通路,另一个集群具有细胞外基质组织。使用枢纽基因集(COMP、FN1、SDC1、BUB1、TTK和NUSAP1)的预测模型具有较高的敏感性(97.2%)和特异性(96.1%),AUC为0.994。通过PPI网络鉴定出三个中心基因(COMP, FN1和SDC1),它们与细胞外基质组织(BUB1, TTK和NUSAP1)密切相关,是参与M期和细胞周期检查点的中心基因。总体而言,该研究确定了准确区分癌症和正常样本的中枢通路和基因,为早期癌症检测和改进BC治疗提供了有希望的新可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Medical & Biological Engineering & Computing
Medical & Biological Engineering & Computing 医学-工程:生物医学
CiteScore
6.00
自引率
3.10%
发文量
249
审稿时长
3.5 months
期刊介绍: Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging. MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field. MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).
期刊最新文献
Accurate segmentation of pulmonary arteries and veins via a human-in-the-loop framework with application in COPD. Hemodynamic performance and blood damage of the Intra-aortic pumps: A CFD-Based investigation. Uncertainty quantification of conduction velocity in models of cardiac spread of activation. MRI image segmentation of the major lower leg muscles using deep learning: application in biomechanical analysis. Simultaneous detection of physical and mental fatigue using limited-channel EEG for practical workplace monitoring.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1