{"title":"Diagnosis of Diseases: Classification Rule Discovery from Medical Data using Genetic Algorithm with Suppressor Mutation","authors":"E. Thamizhselvi, Geetha Vaithianathan","doi":"10.1109/ICSCAN49426.2020.9262429","DOIUrl":null,"url":null,"abstract":"Data mining is a powerful method to extract knowledge from data. Data mining area focused an attractive research challenges on Knowledge Discovery in Databases (KDD), with the aim to discover interesting and useful data from various transactional databases. Association Rule Mining is one of the most well-known techniques for extracting relations among attributes from larger databases which is given in the form of rules to the user. Data mining techniques in medical domain is inevitable in diagnosing various diseases which leads to reduce time and performance, by taking number of tests required for the patients. This paper presents a rule extraction for diabetes disease using genetic algorithm, with a newly proposed genetic operator named suppressor mutation (GA-SM), to suppress the activity of genes which are over expressive in nature i.e. to suppress the useless attributes from the rules. By using this new operator in association rule mining techniques, it generates a smaller number of rules for diagnosing diabetes. Genetic algorithm shows a wonderful performance with suppressor mutation when compared to CN2, J48, BF tree with respect to performance factors.","PeriodicalId":6744,"journal":{"name":"2020 International Conference on System, Computation, Automation and Networking (ICSCAN)","volume":"13 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on System, Computation, Automation and Networking (ICSCAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCAN49426.2020.9262429","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Data mining is a powerful method to extract knowledge from data. Data mining area focused an attractive research challenges on Knowledge Discovery in Databases (KDD), with the aim to discover interesting and useful data from various transactional databases. Association Rule Mining is one of the most well-known techniques for extracting relations among attributes from larger databases which is given in the form of rules to the user. Data mining techniques in medical domain is inevitable in diagnosing various diseases which leads to reduce time and performance, by taking number of tests required for the patients. This paper presents a rule extraction for diabetes disease using genetic algorithm, with a newly proposed genetic operator named suppressor mutation (GA-SM), to suppress the activity of genes which are over expressive in nature i.e. to suppress the useless attributes from the rules. By using this new operator in association rule mining techniques, it generates a smaller number of rules for diagnosing diabetes. Genetic algorithm shows a wonderful performance with suppressor mutation when compared to CN2, J48, BF tree with respect to performance factors.
数据挖掘是一种从数据中提取知识的强大方法。数据挖掘领域关注数据库中的知识发现(Knowledge Discovery in Databases, KDD)这一具有吸引力的研究挑战,旨在从各种事务数据库中发现有趣和有用的数据。关联规则挖掘是从大型数据库中提取属性之间的关系的最著名的技术之一,这些关系以规则的形式提供给用户。医学领域的数据挖掘技术在各种疾病的诊断中是不可避免的,由于需要对患者进行多次检查,减少了时间和性能。本文提出了一种基于遗传算法的糖尿病规则提取方法,并提出了一种新的遗传算子抑制突变(GA-SM),以抑制本质上过度表达的基因的活性,即抑制规则中的无用属性。通过在关联规则挖掘技术中使用这个新的算子,它生成的诊断糖尿病的规则数量更少。在性能因子方面,与CN2、J48、BF树相比,遗传算法具有较好的抑制因子突变性能。