{"title":"A New Filter Approach to Extract Relevant Features from Mass Spectrum Datasets","authors":"Tri-Thanh Le, T. Vu, N. Trang, Ha-Nam Nguyen","doi":"10.1109/KSE.2009.36","DOIUrl":null,"url":null,"abstract":"We propose an approach to extract relevant features from SELDI-TOF mass spectrum datasets. The proposed method can deal with both two-class and multiple-class problems. In the method, the relevance value of a feature representing how well the value of a feature helps to separate a sample from a given class was defined based on the difference between the numbers of samples in the given class with greater and less feature value than the sample. Using the relevance value as a basic factor, several ranked feature lists were established. Searching strategies to obtain optimal feature sets were also proposed by utilizing the relevance indices of features without using learning algorithms. The new method was applied to the three public mass spectrum datasets and showed better or comparable results than conventional filter methods","PeriodicalId":347175,"journal":{"name":"2009 International Conference on Knowledge and Systems Engineering","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Knowledge and Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE.2009.36","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We propose an approach to extract relevant features from SELDI-TOF mass spectrum datasets. The proposed method can deal with both two-class and multiple-class problems. In the method, the relevance value of a feature representing how well the value of a feature helps to separate a sample from a given class was defined based on the difference between the numbers of samples in the given class with greater and less feature value than the sample. Using the relevance value as a basic factor, several ranked feature lists were established. Searching strategies to obtain optimal feature sets were also proposed by utilizing the relevance indices of features without using learning algorithms. The new method was applied to the three public mass spectrum datasets and showed better or comparable results than conventional filter methods