Mohammad Batah, M. Alzyoud, Raed Alazaidah, Malek Toubat, Haneen Alzoubi, Areej Olaiyat
{"title":"使用机器学习技术进行宫颈癌的早期预测","authors":"Mohammad Batah, M. Alzyoud, Raed Alazaidah, Malek Toubat, Haneen Alzoubi, Areej Olaiyat","doi":"10.5455/jjcit.71-1661691447","DOIUrl":null,"url":null,"abstract":"According to recent studies and statistics, Cervical Cancer (CC) is one of the most common causes of death worldwide, and mainly in the developing countries. CC has a mortality rate around 60%, in less developing countries and the percentages could go even higher, due to poor screening processes, lack of sensitization, and several other reasons. Therefore, this paper aims to utilize the high capabilities of machine learning techniques in the early prediction of CC. In specific, three well-known feature selection and ranking methods have been used to identify the most significant features that help in the diagnosis process. Also, eighteen different classifiers that belong to six learning strategies have been trained and extensively evaluated against a primary data which consists of five hundred images. Moreover, an investigation regarding the problem of imbalance class distribution which is common in medical dataset is conducted. The results revealed that LWNB and RandomForest classifiers showed the best performance in general, and considering four different evaluation metrics. Also, LWNB and Logistic classifiers were the best choices to handle the problem of imbalance class distribution which is common in medical diagnosis task. The final conclusion could be made is that using an ensemble model which consists of several classifiers such as LWNB, RandomForest, and Logistic is the best solution to handle this type of problems.","PeriodicalId":36757,"journal":{"name":"Jordanian Journal of Computers and Information Technology","volume":"1 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"EARLY PREDICTION OF CERVICAL CANCER USING MACHINE LEARNING TECHNIQUES\",\"authors\":\"Mohammad Batah, M. Alzyoud, Raed Alazaidah, Malek Toubat, Haneen Alzoubi, Areej Olaiyat\",\"doi\":\"10.5455/jjcit.71-1661691447\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"According to recent studies and statistics, Cervical Cancer (CC) is one of the most common causes of death worldwide, and mainly in the developing countries. CC has a mortality rate around 60%, in less developing countries and the percentages could go even higher, due to poor screening processes, lack of sensitization, and several other reasons. Therefore, this paper aims to utilize the high capabilities of machine learning techniques in the early prediction of CC. In specific, three well-known feature selection and ranking methods have been used to identify the most significant features that help in the diagnosis process. Also, eighteen different classifiers that belong to six learning strategies have been trained and extensively evaluated against a primary data which consists of five hundred images. Moreover, an investigation regarding the problem of imbalance class distribution which is common in medical dataset is conducted. The results revealed that LWNB and RandomForest classifiers showed the best performance in general, and considering four different evaluation metrics. Also, LWNB and Logistic classifiers were the best choices to handle the problem of imbalance class distribution which is common in medical diagnosis task. The final conclusion could be made is that using an ensemble model which consists of several classifiers such as LWNB, RandomForest, and Logistic is the best solution to handle this type of problems.\",\"PeriodicalId\":36757,\"journal\":{\"name\":\"Jordanian Journal of Computers and Information Technology\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jordanian Journal of Computers and Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5455/jjcit.71-1661691447\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jordanian Journal of Computers and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5455/jjcit.71-1661691447","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
EARLY PREDICTION OF CERVICAL CANCER USING MACHINE LEARNING TECHNIQUES
According to recent studies and statistics, Cervical Cancer (CC) is one of the most common causes of death worldwide, and mainly in the developing countries. CC has a mortality rate around 60%, in less developing countries and the percentages could go even higher, due to poor screening processes, lack of sensitization, and several other reasons. Therefore, this paper aims to utilize the high capabilities of machine learning techniques in the early prediction of CC. In specific, three well-known feature selection and ranking methods have been used to identify the most significant features that help in the diagnosis process. Also, eighteen different classifiers that belong to six learning strategies have been trained and extensively evaluated against a primary data which consists of five hundred images. Moreover, an investigation regarding the problem of imbalance class distribution which is common in medical dataset is conducted. The results revealed that LWNB and RandomForest classifiers showed the best performance in general, and considering four different evaluation metrics. Also, LWNB and Logistic classifiers were the best choices to handle the problem of imbalance class distribution which is common in medical diagnosis task. The final conclusion could be made is that using an ensemble model which consists of several classifiers such as LWNB, RandomForest, and Logistic is the best solution to handle this type of problems.