{"title":"Linguistic Behavior of Well-Defined Strings in the Non-Coding Human Genome.","authors":"Havard R Glattre, Eystein Glattre, Lars Moe","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>In this article we do a top-down analysis of the non-protein-coding human genome using well-defined parameters, resulting in what we call ?-strings. We show that there are altogether 45,371,328 different ?-strings in the human non-protein-coding genome. We explore statistical properties of the y-strings and demonstrate that they have many characteristics in common with human words. We indicate how they are 'packed' in the chromosomes and that each chromosome has its own specific y-dictionary. We also outline our future work exploring the linguistic features of y-strings and y-text using methods developed to study human, natural language.</p>","PeriodicalId":46218,"journal":{"name":"Nonlinear Dynamics Psychology and Life Sciences","volume":"26 1","pages":"1-19"},"PeriodicalIF":0.6000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nonlinear Dynamics Psychology and Life Sciences","FirstCategoryId":"102","ListUrlMain":"","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PSYCHOLOGY, MATHEMATICAL","Score":null,"Total":0}
引用次数: 0
Abstract
In this article we do a top-down analysis of the non-protein-coding human genome using well-defined parameters, resulting in what we call ?-strings. We show that there are altogether 45,371,328 different ?-strings in the human non-protein-coding genome. We explore statistical properties of the y-strings and demonstrate that they have many characteristics in common with human words. We indicate how they are 'packed' in the chromosomes and that each chromosome has its own specific y-dictionary. We also outline our future work exploring the linguistic features of y-strings and y-text using methods developed to study human, natural language.