{"title":"Variable Selection for Discrimination between Low and High Yielding Populations of Indian Mustard","authors":"P. Godara","doi":"10.18782/2320-7051.7792","DOIUrl":null,"url":null,"abstract":"Variable Selection is an important problem in classification and discriminant analysis. The selection of important variables for the purpose of discrimination between populations is important from the point of view of time and resources required for the experimentation. Keeping this in view, the present study has been designed to find important characters of Indian mustard which can discriminate between high and low yielding genotypes. Secondary data set on 310 genotypes of Indian mustard recorded for 12 characters was used for discrimination between populations of low and high yielding genotypes of Indian mustard. Three variable selection methods (Univariate t-test, Rao ́s F test for additional Information and Random Forests Algorithm) for classification and discrimination were used and compared. Performance of the methods was assessed in terms of leave one out cross-validation error and out of bag error rate for classification. The Four most important variables for discrimination among genotypes based on seed yield per plants were secondary branches, primary branches, days to maturity and siliqua number on main shoot.","PeriodicalId":14249,"journal":{"name":"International Journal of Pure & Applied Bioscience","volume":"68 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Pure & Applied Bioscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18782/2320-7051.7792","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Variable Selection is an important problem in classification and discriminant analysis. The selection of important variables for the purpose of discrimination between populations is important from the point of view of time and resources required for the experimentation. Keeping this in view, the present study has been designed to find important characters of Indian mustard which can discriminate between high and low yielding genotypes. Secondary data set on 310 genotypes of Indian mustard recorded for 12 characters was used for discrimination between populations of low and high yielding genotypes of Indian mustard. Three variable selection methods (Univariate t-test, Rao ́s F test for additional Information and Random Forests Algorithm) for classification and discrimination were used and compared. Performance of the methods was assessed in terms of leave one out cross-validation error and out of bag error rate for classification. The Four most important variables for discrimination among genotypes based on seed yield per plants were secondary branches, primary branches, days to maturity and siliqua number on main shoot.