Application of Machine Learning Techniques in Predicting of Breast Cancer Metastases Using Decision Tree Algorithm, in Sokoto Northwestern Nigeria

A. Musa, U. Aliyu
{"title":"Application of Machine Learning Techniques in Predicting of Breast Cancer Metastases Using Decision Tree Algorithm, in Sokoto Northwestern Nigeria","authors":"A. Musa, U. Aliyu","doi":"10.35248/2153-0602.20.11.220","DOIUrl":null,"url":null,"abstract":"According to international agency for research on cancer, female breast cancer was the leading type of cancer worldwide in terms of the number of new cases (approximately 2.1 million) diagnosed in 2018. Predicting outcome of a disease is a challenging task. Data mining techniques tends to simplify the prediction segment. Automated tools have made it possible to collect large volumes of medical data, which are made available to the medical research groups. This study aimed to apply machine learning algorithms using decision three classifier and descriptive statistics to evaluate the performance of the model in predicting the probability of cancer metastasis in patients that present late. Materials and method: The breast cancer disease dataset has been taken from the department of Radiotherapy and Oncology of Usmanu Danfodiyo University Teaching Hospital, Sokoto state, Nigerian. Dataset has 259 instances and 10 attributes. The experimental results of this study used, decision three classifier in IMB SPSS (version 23) software environment. In the experiment, two classes were used and therefore a 2 × 2 confusion matrix was applied. Class 0=Not Metastasized, Class 1=Metastasized. We applied supervised machine learning approach in which dataset were divided into two classes that is training and testing using 10 fold cross validation. Results: Shows that 259 instance of breast cancer, 218(84.2%) cases were not metastasized while 41(15.8%) cases were metastasized to the other region of the body. The overall accuracy of the model was found to be 87%, with the sensitivity of 88%, specificity 75% and the precision of 98% Conclusion: Based on these findings, the machine learning algorism using decision three classifiers predicted that 87% of the tumor presented at stage IV, indicating that the tumour can spread to the other region of the body.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"28 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Data Mining in Genomics & Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35248/2153-0602.20.11.220","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

According to international agency for research on cancer, female breast cancer was the leading type of cancer worldwide in terms of the number of new cases (approximately 2.1 million) diagnosed in 2018. Predicting outcome of a disease is a challenging task. Data mining techniques tends to simplify the prediction segment. Automated tools have made it possible to collect large volumes of medical data, which are made available to the medical research groups. This study aimed to apply machine learning algorithms using decision three classifier and descriptive statistics to evaluate the performance of the model in predicting the probability of cancer metastasis in patients that present late. Materials and method: The breast cancer disease dataset has been taken from the department of Radiotherapy and Oncology of Usmanu Danfodiyo University Teaching Hospital, Sokoto state, Nigerian. Dataset has 259 instances and 10 attributes. The experimental results of this study used, decision three classifier in IMB SPSS (version 23) software environment. In the experiment, two classes were used and therefore a 2 × 2 confusion matrix was applied. Class 0=Not Metastasized, Class 1=Metastasized. We applied supervised machine learning approach in which dataset were divided into two classes that is training and testing using 10 fold cross validation. Results: Shows that 259 instance of breast cancer, 218(84.2%) cases were not metastasized while 41(15.8%) cases were metastasized to the other region of the body. The overall accuracy of the model was found to be 87%, with the sensitivity of 88%, specificity 75% and the precision of 98% Conclusion: Based on these findings, the machine learning algorism using decision three classifiers predicted that 87% of the tumor presented at stage IV, indicating that the tumour can spread to the other region of the body.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
机器学习技术在使用决策树算法预测乳腺癌转移中的应用,尼日利亚西北部Sokoto
根据国际癌症研究机构的数据,就2018年确诊的新病例数量(约210万)而言,女性乳腺癌是全球领先的癌症类型。预测疾病的结果是一项具有挑战性的任务。数据挖掘技术倾向于简化预测部分。自动化工具使收集大量医学数据成为可能,这些数据可供医学研究小组使用。本研究旨在应用机器学习算法,使用决策三分类器和描述性统计来评估模型在预测晚期患者癌症转移概率方面的性能。材料和方法:乳腺癌疾病数据集取自尼日利亚索科托州Usmanu Danfodiyo大学教学医院放射治疗和肿瘤科。数据集有259个实例和10个属性。本研究使用的实验结果,在IMB SPSS (version 23)软件环境下决定三个分类器。在实验中,使用了两个类,因此使用了一个2 × 2的混淆矩阵。0级=未转移,1级=转移。我们应用了有监督的机器学习方法,其中数据集分为两类,使用10倍交叉验证进行训练和测试。结果:259例乳腺癌中,218例(84.2%)未发生转移,41例(15.8%)发生转移。发现该模型的总体准确率为87%,灵敏度为88%,特异性为75%,精度为98%。结论:基于这些发现,使用决策三分类器的机器学习算法预测87%的肿瘤出现在IV期,表明肿瘤可以扩散到身体的其他部位。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Proteomics Study of the Effect Left Atrial Appendage Resection on theEnergy Metabolism of Atrial Muscle in Beagle Dogs with Rapid Atrial Pacing Expression of NUP62 in the Development of Ovarian Cancer Translocation (2; 5) (q37.3, q14q35.3) in a Case of Male Infertility in Cotonou Editorial on Bioinformatics Tools and Techniques for Data Mining Ribosomes: Atomic Machines Association between Nucleic acids and Proteins
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1