{"title":"The Average Profile of Suffix Trees","authors":"Mark Daniel Ward","doi":"10.1137/1.9781611972979.3","DOIUrl":null,"url":null,"abstract":"The internal profile of a tree structure denotes the number of internal nodes found at a specific level of the tree. Similarly, the external profile denotes the number of leaves on a level. The profile is of great interest because of its intimate connection to many other parameters of trees. For instance, the depth, fill-up level, height, path length, shortest path, and size of trees can each be interpreted in terms of the profile. \n \nThe current study is motivated by the work of Park et al. [22], which was a comprehensive study of the profile of tries constructed from independent strings (also, each string generated by a memoryless source). In the present paper, however, we consider suffix trees, which are constructed from suffixes of a common string. The dependency between suffixes demands a careful, intricate treatment of overlaps in words. \n \nWe precisely analyze the average internal and external profiles of suffix trees generated by a memoryless source. We utilize combinatorics on words (in particular, autocorrelation, i.e., the degree to which a word overlaps with itself) generating functions, singularity analysis, and the Mellin transform. We make comparisons of the average profile of suffix trees to the average profile of tries constructed from independent strings. We emphasize that our methods are extensible to higher moments. The present report describes the first moment of both the internal and external profiles of suffix trees.","PeriodicalId":340112,"journal":{"name":"Workshop on Analytic Algorithmics and Combinatorics","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Analytic Algorithmics and Combinatorics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1137/1.9781611972979.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The internal profile of a tree structure denotes the number of internal nodes found at a specific level of the tree. Similarly, the external profile denotes the number of leaves on a level. The profile is of great interest because of its intimate connection to many other parameters of trees. For instance, the depth, fill-up level, height, path length, shortest path, and size of trees can each be interpreted in terms of the profile.
The current study is motivated by the work of Park et al. [22], which was a comprehensive study of the profile of tries constructed from independent strings (also, each string generated by a memoryless source). In the present paper, however, we consider suffix trees, which are constructed from suffixes of a common string. The dependency between suffixes demands a careful, intricate treatment of overlaps in words.
We precisely analyze the average internal and external profiles of suffix trees generated by a memoryless source. We utilize combinatorics on words (in particular, autocorrelation, i.e., the degree to which a word overlaps with itself) generating functions, singularity analysis, and the Mellin transform. We make comparisons of the average profile of suffix trees to the average profile of tries constructed from independent strings. We emphasize that our methods are extensible to higher moments. The present report describes the first moment of both the internal and external profiles of suffix trees.