{"title":"Complexity charts can be used to map functional domains in DNA","authors":"Andrzej K. Konopka, John Owens","doi":"10.1016/0735-0651(90)90010-D","DOIUrl":null,"url":null,"abstract":"<div><p>We measured local compositional complexity (LCC) of DNA sequences by calculating Shannon information content over mononucleotide frequencies. Eukaryotic DNA appeared to be “simpler” than bacterial DNA even at the level of short oligonucleotides. Moreover, different DNA functional domains displayed different compositional complexity in a systematic manner. In particular, the complexity of exon sequences was systematically higher than the complexity of corresponding introns. We therefore present examples of complexity charts (plots of complexity versus position in sequence) for pre-mRNA sequences from higher eukaryotes. By taking a window width of 100 nucleotides and a window step of 1 nucleotide, introns can be distinguished from exons in the majority of cases studied. Complexity charts of immunoglobulin variable regions allowed correct mapping of exons and introns in these sequences as well, a task was impossible with commercial programs available to date.</p></div>","PeriodicalId":77714,"journal":{"name":"Gene analysis techniques","volume":"7 2","pages":"Pages 35-38"},"PeriodicalIF":0.0000,"publicationDate":"1990-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0735-0651(90)90010-D","citationCount":"33","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gene analysis techniques","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/073506519090010D","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 33
Abstract
We measured local compositional complexity (LCC) of DNA sequences by calculating Shannon information content over mononucleotide frequencies. Eukaryotic DNA appeared to be “simpler” than bacterial DNA even at the level of short oligonucleotides. Moreover, different DNA functional domains displayed different compositional complexity in a systematic manner. In particular, the complexity of exon sequences was systematically higher than the complexity of corresponding introns. We therefore present examples of complexity charts (plots of complexity versus position in sequence) for pre-mRNA sequences from higher eukaryotes. By taking a window width of 100 nucleotides and a window step of 1 nucleotide, introns can be distinguished from exons in the majority of cases studied. Complexity charts of immunoglobulin variable regions allowed correct mapping of exons and introns in these sequences as well, a task was impossible with commercial programs available to date.