{"title":"A Statistical Analysis of Sequence Features within Genes from Neurospora crassa","authors":"Stephanie E. Edelmann, Chuck Staben","doi":"10.1006/emyc.1994.1007","DOIUrl":null,"url":null,"abstract":"<div><p>Edelmann, S. E., and Staben, C. 1994. A statistical analysis of features within genes from <em>Neurospora crassa. Experimental Mycology</em> 18, 70-81. We analyzed gene sequences from <em>Neurospora crassa</em> deposited in GenBank or EMBL for GC content, codon usage, intron prevalence, intron length, exon length, translation initiation sites, and mRNA splice sites. Protein coding regions were 59% GC, and noncoding regions were 49% GC. Codon usage was biased, primarily due to a strong preference for C in the final position of the codons. Over 80% of <em>N. crassa</em> protein coding genes had introns, which are typically 60 nucleotides long. Exons varied greatly in length, but were typically much longer than introns. The distribution of nucleotides surrounding translation initiation and intron splice sites was clearly nonrandom. We derived a consensus translation initiation site of CAMMATGGCT, a 5′-intron donor site of G^GTAAGTnnYCnYY, an internal branchpoint of WRCTRACMnnnnnnYY, and a 3′-acceptor site of WACAG^.</p></div>","PeriodicalId":12110,"journal":{"name":"Experimental Mycology","volume":"18 1","pages":"Pages 70-81"},"PeriodicalIF":0.0000,"publicationDate":"1994-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1006/emyc.1994.1007","citationCount":"111","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Experimental Mycology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0147597584710073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 111
Abstract
Edelmann, S. E., and Staben, C. 1994. A statistical analysis of features within genes from Neurospora crassa. Experimental Mycology 18, 70-81. We analyzed gene sequences from Neurospora crassa deposited in GenBank or EMBL for GC content, codon usage, intron prevalence, intron length, exon length, translation initiation sites, and mRNA splice sites. Protein coding regions were 59% GC, and noncoding regions were 49% GC. Codon usage was biased, primarily due to a strong preference for C in the final position of the codons. Over 80% of N. crassa protein coding genes had introns, which are typically 60 nucleotides long. Exons varied greatly in length, but were typically much longer than introns. The distribution of nucleotides surrounding translation initiation and intron splice sites was clearly nonrandom. We derived a consensus translation initiation site of CAMMATGGCT, a 5′-intron donor site of G^GTAAGTnnYCnYY, an internal branchpoint of WRCTRACMnnnnnnYY, and a 3′-acceptor site of WACAG^.