E. R. Rêgo, M. M. Rêgo, C. Cruz, P. Cecon, F. Finger
{"title":"Genetic diversity analysis of peppers: a comparison of discarding variable methods","authors":"E. R. Rêgo, M. M. Rêgo, C. Cruz, P. Cecon, F. Finger","doi":"10.12702/1984-7033.V03N01A03","DOIUrl":null,"url":null,"abstract":"There are a lot of variables in genetic diversity studies, and it is necessary to know whether or not they are all important and which ones can be discarded. There are often little changes in clustering patterns if a subset of these variables is used, because the discarded variables are redundant or of little contribution to the variability. This study aimed at comparing two discards of variables methods – the Singh method and the principal components method – as well as evaluating the effect of the discards on the cluster analysis. In this analysis data of six ripe fruits traits were used. Other characters with previously known variability or collinearity were added to the analysis. The method considered being the most efficient was the one, which indicated variables that did not alter the initial clustering pattern when discarded. The Singh method did not detect variation differences when standardized data were used. When the distance was obtained by the non-standardized data, the pericarp thickness (0.018%), total soluble solids (0.1668%) and minimum width (2.99%) had the lowest contribution to the divergence. The principal components pointed out that the characteristics fruit length, total soluble solids and seeds yield/fruit were considered as dispensable variables. There were changes in the initial clustering pattern when the variable pericarp thickness was discarded, and the Singh method was not efficient in detecting the importance of this variable. There were no changes in the initial clustering pattern when fruit length was discarded. The data showed that the two compared methods differed, since Singh’s and principal component methods showed different variables to be discarded. The Singh method was not efficient in detecting multicollinearity among variables. The principal component method was more efficient in pointing out the variables that can be discarded. It is advisable that the genetic divergence is calculated based on the scores of the principal components. In future studies, when there is no replicated data, the genetic divergence and the pinpoint of characters should be calculated based on the principal component scores to avoid discarding some important variables when determining divergence. However, if the variable values differ independently, the Singh method based on Euclidean distance is appropriate.","PeriodicalId":49085,"journal":{"name":"Crop Breeding and Applied Biotechnology","volume":"1 1","pages":"19-26"},"PeriodicalIF":1.3000,"publicationDate":"2003-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crop Breeding and Applied Biotechnology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.12702/1984-7033.V03N01A03","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 27
Abstract
There are a lot of variables in genetic diversity studies, and it is necessary to know whether or not they are all important and which ones can be discarded. There are often little changes in clustering patterns if a subset of these variables is used, because the discarded variables are redundant or of little contribution to the variability. This study aimed at comparing two discards of variables methods – the Singh method and the principal components method – as well as evaluating the effect of the discards on the cluster analysis. In this analysis data of six ripe fruits traits were used. Other characters with previously known variability or collinearity were added to the analysis. The method considered being the most efficient was the one, which indicated variables that did not alter the initial clustering pattern when discarded. The Singh method did not detect variation differences when standardized data were used. When the distance was obtained by the non-standardized data, the pericarp thickness (0.018%), total soluble solids (0.1668%) and minimum width (2.99%) had the lowest contribution to the divergence. The principal components pointed out that the characteristics fruit length, total soluble solids and seeds yield/fruit were considered as dispensable variables. There were changes in the initial clustering pattern when the variable pericarp thickness was discarded, and the Singh method was not efficient in detecting the importance of this variable. There were no changes in the initial clustering pattern when fruit length was discarded. The data showed that the two compared methods differed, since Singh’s and principal component methods showed different variables to be discarded. The Singh method was not efficient in detecting multicollinearity among variables. The principal component method was more efficient in pointing out the variables that can be discarded. It is advisable that the genetic divergence is calculated based on the scores of the principal components. In future studies, when there is no replicated data, the genetic divergence and the pinpoint of characters should be calculated based on the principal component scores to avoid discarding some important variables when determining divergence. However, if the variable values differ independently, the Singh method based on Euclidean distance is appropriate.
期刊介绍:
The CBAB – CROP BREEDING AND APPLIED BIOTECHNOLOGY (ISSN 1984-7033) – is the official quarterly journal of the Brazilian Society of Plant Breeding, abbreviated CROP BREED APPL BIOTECHNOL.
It publishes original scientific articles, which contribute to the scientific and technological development of plant breeding and agriculture. Articles should be to do with basic and applied research on improvement of perennial and annual plants, within the fields of genetics, conservation of germplasm, biotechnology, genomics, cytogenetics, experimental statistics, seeds, food quality, biotic and abiotic stress, and correlated areas. The article must be unpublished. Simultaneous submitting to another periodical is ruled out. Authors are held solely responsible for the opinions and ideas expressed, which do not necessarily reflect the view of the Editorial board. However, the Editorial board reserves the right to suggest or ask for any modifications required. The journal adopts the Ithenticate software for identification of plagiarism. Complete or partial reproduction of articles is permitted, provided the source is cited. All content of the journal, except where identified, is licensed under a Creative Commons attribution-type BY. All articles are published free of charge. This is an open access journal.