Tomasz Stanisz, Stanisław Drożdż, Jarosław Kwapień
Statistics of punctuation in experimental literature -- the remarkable case of "Finnegans Wake" by James Joyce
arXiv - STAT - Applications, 2024-08-31, doi: arxiv-2409.00483
Abstract
As recent studies indicate, the structure that punctuation imposes on written
texts develops patterns that reveal certain characteristics of universality.
In particular, based on a large collection of
classic literary works, it has been evidenced that the distances between
consecutive punctuation marks, measured in terms of the number of words, obey
the discrete Weibull distribution - a discrete variant of a distribution often
used in survival analysis. The present work extends the analysis of punctuation
usage patterns to more experimental pieces of world literature. It turns out
that the distances between punctuation marks typically comply with the
discrete Weibull distribution here as well. However, some of
the works by James Joyce are distinct in this regard - in the sense that the
tails of the relevant distributions are significantly thicker and,
consequently, the corresponding hazard functions are decreasing, a behavior not
observed in typical literary texts in prose. "Finnegans Wake" - the same work to
which science owes the word "quarks" for the most fundamental constituents of
matter - is particularly striking in this context. At the same time, in all the
studied texts, the sentence lengths - representing the distances between
sentence-ending punctuation marks - reveal more freedom and are not constrained
by the discrete Weibull distribution. This freedom in some cases translates
into long-range nonlinear correlations, which manifest themselves in
multifractality. Again, "Finnegans Wake" is particularly spectacular in terms of
multifractality.
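
The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes a simple regex tokenizer, an arbitrary set of punctuation marks, and the common (Nakagawa-Osaki) parameterization of the discrete Weibull distribution, P(X = k) = (1-p)^((k-1)^beta) - (1-p)^(k^beta) for k = 1, 2, ..., since the abstract does not specify any of these. All function names and the parameters `p` and `beta` are hypothetical. Under this parameterization the hazard function decreases for beta < 1, is constant (geometric case) for beta = 1, and increases for beta > 1.

```python
import re
from collections import Counter

# Assumption: a token is either a word (optionally with inner apostrophes or
# hyphens) or a single non-word, non-space character.
TOKEN_RE = re.compile(r"\w+(?:['’-]\w+)*|[^\w\s]")


def punctuation_distances(text, marks=frozenset(".,;:!?")):
    """Count the words between consecutive punctuation marks.

    The set of marks is an assumption; runs of adjacent marks with no
    words in between contribute no distance.
    """
    distances, count = [], 0
    for token in TOKEN_RE.findall(text):
        if token in marks:
            if count > 0:
                distances.append(count)
            count = 0
        elif token[0].isalnum():
            count += 1
    return distances


def dw_survival(k, p, beta):
    """P(X > k) = (1 - p)^(k^beta) for k = 0, 1, 2, ..."""
    return (1.0 - p) ** (k ** beta)


def dw_pmf(k, p, beta):
    """P(X = k) for k = 1, 2, ... in the assumed parameterization."""
    return dw_survival(k - 1, p, beta) - dw_survival(k, p, beta)


def dw_hazard(k, p, beta):
    """h(k) = P(X = k | X >= k) = 1 - (1 - p)^(k^beta - (k-1)^beta)."""
    return 1.0 - (1.0 - p) ** (k ** beta - (k - 1) ** beta)


def empirical_hazard(distances):
    """Estimate h(k) as (#distances equal to k) / (#distances >= k)."""
    counts = Counter(distances)
    at_risk = len(distances)
    hazard = {}
    for k in sorted(counts):
        hazard[k] = counts[k] / at_risk
        at_risk -= counts[k]
    return hazard


if __name__ == "__main__":
    sample = "Hello, world. This is a test, is it not?"
    d = punctuation_distances(sample)
    print(d)
    print(empirical_hazard(d))
    # Illustrative parameter values only, not fitted to any text.
    print([round(dw_hazard(k, 0.3, 0.7), 3) for k in range(1, 6)])
```

In this picture, a text with a decreasing hazard function is one in which the longer the run of words since the last punctuation mark, the less likely the next word is to be followed by one, which is the heavy-tailed behavior the abstract attributes to some of Joyce's works.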