Seyed Mohammad Saleh Hadavi, Shahram Oliaei, Sandra Saidi, Elham Nadimi, Mohammad Hassan Kazemi-Galougahi
{"title":"Using Data Mining and Association Rules for Early Diagnosis of Esophageal Cancer.","authors":"Seyed Mohammad Saleh Hadavi, Shahram Oliaei, Sandra Saidi, Elham Nadimi, Mohammad Hassan Kazemi-Galougahi","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>From 17,000 new cases of esophageal cancer worldwide during last year, 16,000 proved to be fatal. Late or incorrect diagnosis of esophageal cancer cases increases its fatality rate. Today, a data-mining technique can predict the course of the disease with the help of an upto-date technology. With this knowledge, we can reduce esophageal cancer mortality. This study aims to find an association between general characteristics, screening tests, and esophageal cancer based on raw data from the Cancer Research Center within-person interviews, using data mining and classification techniques on mortality. The 5-year medical records of 512 esophageal cancer patients and those with problems related to this cancer, with 50 functional characteristics, were included in this model. In order to provide a prognostic and rule discovery model for esophageal cancer suffering, we used preprocessing EM Algorithm. After accurate identification of the data, WEKA Software tools and Java programming language was used to create Association Rule Classifier and Apriori algorithm for the associated rule discovery. We created 6 significant rules of the association for classification generated by rule miner with 95% and 91% confidence based on screening tests and general attributes, respectively. These substantial rules showed significant association between age, history of medication, smoking, gender, carcinoembryonic antigen (CEA), creatinine, WBCs, and Platelets. The findings of this study can be used as a clue for physicians to consider patients with these characteristics as people who are more likely to develop esophageal cancer and help them for early diagnosis of patients. Keywords:Data mining, esophageal cancer, association rule, healthcare.</p>","PeriodicalId":53633,"journal":{"name":"The gulf journal of oncology","volume":"1 40","pages":"38-46"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The gulf journal of oncology","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
From 17,000 new cases of esophageal cancer worldwide during last year, 16,000 proved to be fatal. Late or incorrect diagnosis of esophageal cancer cases increases its fatality rate. Today, a data-mining technique can predict the course of the disease with the help of an upto-date technology. With this knowledge, we can reduce esophageal cancer mortality. This study aims to find an association between general characteristics, screening tests, and esophageal cancer based on raw data from the Cancer Research Center within-person interviews, using data mining and classification techniques on mortality. The 5-year medical records of 512 esophageal cancer patients and those with problems related to this cancer, with 50 functional characteristics, were included in this model. In order to provide a prognostic and rule discovery model for esophageal cancer suffering, we used preprocessing EM Algorithm. After accurate identification of the data, WEKA Software tools and Java programming language was used to create Association Rule Classifier and Apriori algorithm for the associated rule discovery. We created 6 significant rules of the association for classification generated by rule miner with 95% and 91% confidence based on screening tests and general attributes, respectively. These substantial rules showed significant association between age, history of medication, smoking, gender, carcinoembryonic antigen (CEA), creatinine, WBCs, and Platelets. The findings of this study can be used as a clue for physicians to consider patients with these characteristics as people who are more likely to develop esophageal cancer and help them for early diagnosis of patients. Keywords:Data mining, esophageal cancer, association rule, healthcare.