Advanced Optimization Techniques & Its Application in AI-Powered Breast Cancer Classification

Surajit Das, Subhodeep Mukherjee
{"title":"Advanced Optimization Techniques & Its Application in AI-Powered Breast Cancer Classification","authors":"Surajit Das, Subhodeep Mukherjee","doi":"10.1109/DeSE58274.2023.10099678","DOIUrl":null,"url":null,"abstract":"In this paper, an advanced optimization technique will be used to find the cut-off of base model(s) and meta model along with the weights of the weighted blending. In this work, XGBoost, Random Forest, Logistic Regression have been used as the base model and also K-Fold cross validation has been used to capture the average score of individual base model. Here F-score will be used to assess the goodness of the models. The techniques have been applied for classification of Breast Carcinoma which is the one of the most prevailing diseases that thrives amid the human beings over decades. According to a report, published in March '21, in the web site of WHO, in 2020, about 2.3 million women diagnosed with breast cancer and according to International Agency for Research on Cancer (IARC) in December 2020, breast cancer has overtaken the lung cancer and has reached at the top position as a commonly diagnosed cancer. In order to determine the breast carcinoma, breast tumors are classified into two categories which are tagged as malignant or benign. For this study the WBCD dataset has been used as the dataset that contains 569 records derived from Fine Needle Aspirates (FNA) of human breast masses has no missing value and is a balanced dataset which minimizes the data pre-processing and EDA steps. In the Optimized weighted Blending, the F-1 Score goes maximum 0.99 (approx.) compared to other approaches within our scope.","PeriodicalId":346847,"journal":{"name":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE58274.2023.10099678","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, an advanced optimization technique will be used to find the cut-off of base model(s) and meta model along with the weights of the weighted blending. In this work, XGBoost, Random Forest, Logistic Regression have been used as the base model and also K-Fold cross validation has been used to capture the average score of individual base model. Here F-score will be used to assess the goodness of the models. The techniques have been applied for classification of Breast Carcinoma which is the one of the most prevailing diseases that thrives amid the human beings over decades. According to a report, published in March '21, in the web site of WHO, in 2020, about 2.3 million women diagnosed with breast cancer and according to International Agency for Research on Cancer (IARC) in December 2020, breast cancer has overtaken the lung cancer and has reached at the top position as a commonly diagnosed cancer. In order to determine the breast carcinoma, breast tumors are classified into two categories which are tagged as malignant or benign. For this study the WBCD dataset has been used as the dataset that contains 569 records derived from Fine Needle Aspirates (FNA) of human breast masses has no missing value and is a balanced dataset which minimizes the data pre-processing and EDA steps. In the Optimized weighted Blending, the F-1 Score goes maximum 0.99 (approx.) compared to other approaches within our scope.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
先进优化技术及其在人工智能乳腺癌分类中的应用
本文将采用一种先进的优化技术,随着加权混合的权重,找到基本模型和元模型的截止点。在这项工作中,使用XGBoost,随机森林,逻辑回归作为基础模型,并使用K-Fold交叉验证来捕获单个基础模型的平均得分。这里将使用F-score来评估模型的优劣。该技术已被应用于乳腺癌的分类,乳腺癌是几十年来在人类中蓬勃发展的最普遍的疾病之一。世界卫生组织网站21年3月发布的一份报告显示,2020年,约有230万妇女被诊断患有乳腺癌,根据国际癌症研究机构(IARC) 2020年12月的报告,乳腺癌已超过肺癌,成为最常见的癌症。为了确定乳腺癌,将乳腺肿瘤分为恶性和良性两类。在本研究中,WBCD数据集被用作包含569条记录的数据集,这些记录来自于人类乳房肿块的细针抽吸(FNA),没有缺失值,是一个平衡的数据集,最大限度地减少了数据预处理和EDA步骤。在优化加权混合中,与我们范围内的其他方法相比,F-1分数达到最大0.99(大约)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using Simulation for Investigating Emergency Traffic Situations Real- Time Healthcare Monitoring and Treatment System Based Microcontroller with IoT Automated Face Mask Detection using Artificial Intelligence and Video Surveillance Management Improvement of the Personnel Delivery System in the Mining Complex using Simulation Models An Exploratory Study on the Impact of Hosting Blockchain Applications in Cloud Infrastructures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1