{"title":"Machine Learning for Breast Cancer Classification With ANN and Decision Tree","authors":"Reetodeep Hazra, Megha Banerjee, L. Badia","doi":"10.1109/IEMCON51383.2020.9284936","DOIUrl":null,"url":null,"abstract":"Breast cancer is one of the commonest cause of cancer deaths in women. It starts developing when threatening bumps start forming from the breast cells, and unfortunately most diagnoses happen in later stages, thus resulting in low chances of survival for the patient. So for early detection and prognosis, it is necessary to detect the benign or threatening nature of the bumps. In this paper, Artificial Neural Networks (ANN) and Decision Tree (DT) classifiers are used to develop a machine learning (ML) model using the Wisconsin diagnostic breast cancer (WDBC) dataset, in order to evaluate the attributes of a breast cancer development at beginning phases and classify it as malignant or benign. In the proposed scheme, feature selection and feature extraction are done to extract statistical features from the dataset and comparison between the models is provided based on their performance to identify the most suitable approach for diagnosis. The dataset apportioned into various arrangements of train-test split. The presentation of the framework is estimated, depending on accuracy, sensitivity, specificity, precision, and recall. The binary classification problem achieved a maximum accuracy of 98.55%.","PeriodicalId":6871,"journal":{"name":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"23 1","pages":"0522-0527"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON51383.2020.9284936","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
Breast cancer is one of the commonest cause of cancer deaths in women. It starts developing when threatening bumps start forming from the breast cells, and unfortunately most diagnoses happen in later stages, thus resulting in low chances of survival for the patient. So for early detection and prognosis, it is necessary to detect the benign or threatening nature of the bumps. In this paper, Artificial Neural Networks (ANN) and Decision Tree (DT) classifiers are used to develop a machine learning (ML) model using the Wisconsin diagnostic breast cancer (WDBC) dataset, in order to evaluate the attributes of a breast cancer development at beginning phases and classify it as malignant or benign. In the proposed scheme, feature selection and feature extraction are done to extract statistical features from the dataset and comparison between the models is provided based on their performance to identify the most suitable approach for diagnosis. The dataset apportioned into various arrangements of train-test split. The presentation of the framework is estimated, depending on accuracy, sensitivity, specificity, precision, and recall. The binary classification problem achieved a maximum accuracy of 98.55%.