{"title":"Phylogenetic analysis of some leguminous trees using CLUSTALW2 bioinformatics tool","authors":"Sagar Patel, H. Panchal, K. Anjaria","doi":"10.1109/BIBMW.2012.6470264","DOIUrl":null,"url":null,"abstract":"A multiple sequence alignment (MSA) is a sequence alignment of three or more biological sequences, generally for Protein. MSA has wide range of applications and to cite few of them such as phylogenetic analysis, protein pattern identification, protein domain identification, prediction of protein structure, structural similarity of amino acids and to get evolutionary similarity. ClustalW2 is a general purpose global multiple sequence alignment program for proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. The output from ClustalW2 shows the best match for the selected sequences and lines up them in such a way that the identities, similarities and differences can be easily understood. Evolutionary relationships can be seen by creating Cladograms or Phylograms. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening up of new gaps at these positions. In this paper, protein sequences of few legume species from UNIPROT database were taken and focused on MSA for protein sequences for these tree species of family Leguminosae, where ClustalW2 tool have used to generate biological data. The results are discussed with the help of Cladograms and Phylograms for selected tree species.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2012.6470264","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
A multiple sequence alignment (MSA) is a sequence alignment of three or more biological sequences, generally for Protein. MSA has wide range of applications and to cite few of them such as phylogenetic analysis, protein pattern identification, protein domain identification, prediction of protein structure, structural similarity of amino acids and to get evolutionary similarity. ClustalW2 is a general purpose global multiple sequence alignment program for proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. The output from ClustalW2 shows the best match for the selected sequences and lines up them in such a way that the identities, similarities and differences can be easily understood. Evolutionary relationships can be seen by creating Cladograms or Phylograms. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening up of new gaps at these positions. In this paper, protein sequences of few legume species from UNIPROT database were taken and focused on MSA for protein sequences for these tree species of family Leguminosae, where ClustalW2 tool have used to generate biological data. The results are discussed with the help of Cladograms and Phylograms for selected tree species.