{"title":"Improving disease diagnosis by a new hybrid model","authors":"Bikash Kanti Sarkar","doi":"10.1016/j.nhtm.2017.07.001","DOIUrl":null,"url":null,"abstract":"<div><p>Knowledge extraction is an important part of e-Health system. However, datasets in health domain are highly <em>imbalanced</em>, <em>voluminous, conflicting</em> and <em>complex</em> in nature, and these can lead to erroneous diagnosis of diseases. So, designing accurate and robust clinical diagnosis models for such datasets is a challenging task in data mining. In literature, numerous standard intelligent models have been proposed for this purpose but they usually suffer from several drawbacks like lack of <em>understandability</em>, incapability of operating <em>rare cases</em>, inefficiency in making <em>quick</em> and <em>correct</em> decision, etc. In fact, specific health application using standard intelligent methods may not satisfy multiple criteria. However, recent research indicates that hybrid intelligent methods (integrating several standard ones, can achieve better performance for health applications. Addressing the limitations of the existing approaches, the present research introduces a new hybrid predictive model (integrating C4.5 and PRISM learners) for diagnosing effectively the diseases (instead of any specific disease) in comprehensible way by the practitioners with better prediction results in comparison to the traditional approaches. The empirical results (in terms of <em>accuracy</em>, <em>sensitivity</em> and <em>false positive rate</em>) obtained over fourteen benchmark datasets demonstrate that the model outperforms the base learners in almost all cases. The performance of the model also claims that it can be good alternative to the specialized learners (each designed for specific disease) published in the literature. After all, the presented intelligent system is effective in undertaking medical data classification task.</p></div>","PeriodicalId":90660,"journal":{"name":"New horizons in translational medicine","volume":"4 1","pages":"Pages 31-47"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.nhtm.2017.07.001","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New horizons in translational medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2307502317300231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Knowledge extraction is an important part of e-Health system. However, datasets in health domain are highly imbalanced, voluminous, conflicting and complex in nature, and these can lead to erroneous diagnosis of diseases. So, designing accurate and robust clinical diagnosis models for such datasets is a challenging task in data mining. In literature, numerous standard intelligent models have been proposed for this purpose but they usually suffer from several drawbacks like lack of understandability, incapability of operating rare cases, inefficiency in making quick and correct decision, etc. In fact, specific health application using standard intelligent methods may not satisfy multiple criteria. However, recent research indicates that hybrid intelligent methods (integrating several standard ones, can achieve better performance for health applications. Addressing the limitations of the existing approaches, the present research introduces a new hybrid predictive model (integrating C4.5 and PRISM learners) for diagnosing effectively the diseases (instead of any specific disease) in comprehensible way by the practitioners with better prediction results in comparison to the traditional approaches. The empirical results (in terms of accuracy, sensitivity and false positive rate) obtained over fourteen benchmark datasets demonstrate that the model outperforms the base learners in almost all cases. The performance of the model also claims that it can be good alternative to the specialized learners (each designed for specific disease) published in the literature. After all, the presented intelligent system is effective in undertaking medical data classification task.