Liang Dai, Jia Zhang, Candong Li, Changen Zhou, Shaozi Li
{"title":"Multi-label feature selection with application to TCM state identification","authors":"Liang Dai, Jia Zhang, Candong Li, Changen Zhou, Shaozi Li","doi":"10.1002/cpe.4634","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The goal of TCM state identification is to identify the patient's syndromes and locations and natures of diseases according to symptoms. Generally, symptoms of a patient are associated with several syndromes and multiple locations and natures of diseases; hence, the TCM state identification is a typical multi-label problem. In this paper, a new method is proposed to predict syndromes and locations and natures of diseases according to the diagnostic information of TCM. In detail, the correlation between features and the correlation between class labels are combined into a new uniform feature space. After that, the MDMR algorithm is used to select the most discriminatory features from the new uniform feature space, which is helpful to reduce the data dimensionality. Lastly, a KNN-like algorithm is modified to calculate the label similarity of test data, and the finite set of labels of test data is predicted by ML-KNN. In this paper, the test data is collected by Fujian University of Traditional Chinese Medicine according to the theory of TCM and medical ethics. The experiments show that the performance of the proposed method is superior to some other popular methods and is helpful in the identification of health state in TCM.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"31 23","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2018-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpe.4634","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.4634","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 14
Abstract
The goal of TCM state identification is to identify the patient's syndromes and locations and natures of diseases according to symptoms. Generally, symptoms of a patient are associated with several syndromes and multiple locations and natures of diseases; hence, the TCM state identification is a typical multi-label problem. In this paper, a new method is proposed to predict syndromes and locations and natures of diseases according to the diagnostic information of TCM. In detail, the correlation between features and the correlation between class labels are combined into a new uniform feature space. After that, the MDMR algorithm is used to select the most discriminatory features from the new uniform feature space, which is helpful to reduce the data dimensionality. Lastly, a KNN-like algorithm is modified to calculate the label similarity of test data, and the finite set of labels of test data is predicted by ML-KNN. In this paper, the test data is collected by Fujian University of Traditional Chinese Medicine according to the theory of TCM and medical ethics. The experiments show that the performance of the proposed method is superior to some other popular methods and is helpful in the identification of health state in TCM.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.