Regularized Logistic Regression Model for Cancer Classification

A. Arafa, M. Radad, M. Badawy, Nawal El - Fishawy
{"title":"Regularized Logistic Regression Model for Cancer Classification","authors":"A. Arafa, M. Radad, M. Badawy, Nawal El - Fishawy","doi":"10.1109/NRSC52299.2021.9509831","DOIUrl":null,"url":null,"abstract":"Cancer is a serious disease and is considered one of the causes of death. Making it worse, many cancers are diagnosed too late. Early, diagnosis of cancer helps in taking correct steps towards treatment. This paper introduces a machine learning model to diagnose and classify different types of cancer. This model is implemented based on regularized logistic regression. The regularization techniques L1, L2 and Elastic Net are evaluated where L2 outperformed other techniques. Also, the proposed model is optimized using Stochastic Gradient Descent (SGD) and Averaged Stochastic Gradient Descent (ASGD) where ASGD outperformed SGD. The results showed that the model with best performance is obtained with L2 regularization when optimized with ASGD. The best model performance is evaluated using cross validation yielding 99.6%, 90.27% and 98.08% test accuracy for Ovarian, Colon and WBCD data sets respectively.","PeriodicalId":231431,"journal":{"name":"2021 38th National Radio Science Conference (NRSC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 38th National Radio Science Conference (NRSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NRSC52299.2021.9509831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Cancer is a serious disease and is considered one of the causes of death. Making it worse, many cancers are diagnosed too late. Early, diagnosis of cancer helps in taking correct steps towards treatment. This paper introduces a machine learning model to diagnose and classify different types of cancer. This model is implemented based on regularized logistic regression. The regularization techniques L1, L2 and Elastic Net are evaluated where L2 outperformed other techniques. Also, the proposed model is optimized using Stochastic Gradient Descent (SGD) and Averaged Stochastic Gradient Descent (ASGD) where ASGD outperformed SGD. The results showed that the model with best performance is obtained with L2 regularization when optimized with ASGD. The best model performance is evaluated using cross validation yielding 99.6%, 90.27% and 98.08% test accuracy for Ovarian, Colon and WBCD data sets respectively.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
肿瘤分类的正则化逻辑回归模型
癌症是一种严重的疾病,被认为是导致死亡的原因之一。更糟糕的是,许多癌症被诊断得太晚了。癌症的早期诊断有助于采取正确的治疗步骤。本文介绍了一种用于诊断和分类不同类型癌症的机器学习模型。该模型是基于正则化逻辑回归实现的。正则化技术L1、L2和Elastic Net在L2优于其他技术的地方进行了评估。此外,采用随机梯度下降(SGD)和平均随机梯度下降(ASGD)对模型进行了优化,其中ASGD优于SGD。结果表明,在用ASGD优化后,采用L2正则化得到了性能最好的模型。通过交叉验证,对卵巢、结肠和WBCD数据集的测试准确率分别为99.6%、90.27%和98.08%,评估了模型的最佳性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Modeling and Simulation of Respiratory System for Acute Respiratory Distress Syndrome (ARDS) Associated with COVID-19 NRSC 2021 Authors Index Dual-Band Cavity-Backed Ka-Band Antenna for Satellite Communication Regularized Logistic Regression Model for Cancer Classification High-Gain Annular Ring with Meander Slots Antenna Array for RFID Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1