{"title":"cLP: Linear programming with biological constraints and its application in classification problems","authors":"Manli Zhou, Youxi Luo, Guoqin Mai, F. Zhou","doi":"10.1109/ISB.2014.6990747","DOIUrl":null,"url":null,"abstract":"Feature selection represents a major challenge in the biomedical data mining problem, and numerous algorithms have been proposed to select an optimal subset of features with the best classification performance. However, the existing algorithms do not take into account the vast amount of biomedical knowledge from the literature and experienced researchers. This work proposes a novel feature selection algorithm, cLP, with the optimized binary classification accuracy. The proposed algorithm incorporates the biomedical knowledge as constraints in the linear programming based optimization model. The experimental data shows that cLP outperforms the other feature selection algorithms, and its constrained version performs similarly well with the unconstrained version. Although theoretically constraints will reduce the classification model performance, our data shows that the constrained cLP sometimes even outperforms the unconstrained version. This suggests that besides the benefit of including biomedical knowledge in the model, the constrained cLP may also achieve better classification performance.","PeriodicalId":249103,"journal":{"name":"2014 8th International Conference on Systems Biology (ISB)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 8th International Conference on Systems Biology (ISB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISB.2014.6990747","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Feature selection represents a major challenge in the biomedical data mining problem, and numerous algorithms have been proposed to select an optimal subset of features with the best classification performance. However, the existing algorithms do not take into account the vast amount of biomedical knowledge from the literature and experienced researchers. This work proposes a novel feature selection algorithm, cLP, with the optimized binary classification accuracy. The proposed algorithm incorporates the biomedical knowledge as constraints in the linear programming based optimization model. The experimental data shows that cLP outperforms the other feature selection algorithms, and its constrained version performs similarly well with the unconstrained version. Although theoretically constraints will reduce the classification model performance, our data shows that the constrained cLP sometimes even outperforms the unconstrained version. This suggests that besides the benefit of including biomedical knowledge in the model, the constrained cLP may also achieve better classification performance.