Age Rating of Books and Readability: On the Correlation of Two Indices

IF 0.1 0 HUMANITIES, MULTIDISCIPLINARY Tekst Kniga Knigoizdanie-Text Book Publishing Pub Date : 2021-01-01 DOI:10.17223/23062061/26/8
A. V. Glazkova
{"title":"Age Rating of Books and Readability: On the Correlation of Two Indices","authors":"A. V. Glazkova","doi":"10.17223/23062061/26/8","DOIUrl":null,"url":null,"abstract":"The article examines the correlation of two indices characterizing the level of linguistic or semantic complexity of the book content. The first index is the age rating in accordance with the Russian Age Rating System for information products. The second index is the ease of understanding of the text, calculated based on the common readability metrics. The author compares the values of readability metrics for texts with different age rating scores. The experiments were carried out on the collection of 5,516 book previews collected by the author of the article. The previews used are freely available in electronic libraries, and they have age rating scores obtained from their publishers. In accordance with the system adopted in the Russian Federation, age rating scores characterize the book’s targeting to the following age categories: 0+, 6+, 12+, 16+, and 18+. In most cases, the size of the book preview is 10% of the full text, which makes it possible to calculate readability indices. The collected texts were scored according to five commonly used readability metrics: Flash-Kincaid Index, Coleman-Liau Index, ARI Index, SMOG Index, and Dale-Chell Formula. As a result of the readability assessment for the texts of each age category, the author obtained recommended levels of education necessary for their understanding. The obtained values were averaged within the age category and analyzed. The results of the experiments allow asserting that in most cases there is a direct relationship between the age rating score of the book and the expected level of education required to understand it. Moreover, readability scores in accordance with all the considered metrics are directly proportional to age rating scores for age categories from 0+ to 16+. The readability scores of books in the 18+ category roughly correspond to children’s literature, which is apparently explained by the genre characteristics of the books marked by the 18+ label. First of all, the results obtained indicate the adequacy of the existing approach to assessing the book age rating in terms of attributing the text to the target audience by age. Secondly, the relationship between readability indices and age rating scores allow using the values of readability metrics as text features in various computational linguistics tasks aimed at text addressee prediction.","PeriodicalId":40676,"journal":{"name":"Tekst Kniga Knigoizdanie-Text Book Publishing","volume":"35 1","pages":""},"PeriodicalIF":0.1000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tekst Kniga Knigoizdanie-Text Book Publishing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17223/23062061/26/8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"HUMANITIES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The article examines the correlation of two indices characterizing the level of linguistic or semantic complexity of the book content. The first index is the age rating in accordance with the Russian Age Rating System for information products. The second index is the ease of understanding of the text, calculated based on the common readability metrics. The author compares the values of readability metrics for texts with different age rating scores. The experiments were carried out on the collection of 5,516 book previews collected by the author of the article. The previews used are freely available in electronic libraries, and they have age rating scores obtained from their publishers. In accordance with the system adopted in the Russian Federation, age rating scores characterize the book’s targeting to the following age categories: 0+, 6+, 12+, 16+, and 18+. In most cases, the size of the book preview is 10% of the full text, which makes it possible to calculate readability indices. The collected texts were scored according to five commonly used readability metrics: Flash-Kincaid Index, Coleman-Liau Index, ARI Index, SMOG Index, and Dale-Chell Formula. As a result of the readability assessment for the texts of each age category, the author obtained recommended levels of education necessary for their understanding. The obtained values were averaged within the age category and analyzed. The results of the experiments allow asserting that in most cases there is a direct relationship between the age rating score of the book and the expected level of education required to understand it. Moreover, readability scores in accordance with all the considered metrics are directly proportional to age rating scores for age categories from 0+ to 16+. The readability scores of books in the 18+ category roughly correspond to children’s literature, which is apparently explained by the genre characteristics of the books marked by the 18+ label. First of all, the results obtained indicate the adequacy of the existing approach to assessing the book age rating in terms of attributing the text to the target audience by age. Secondly, the relationship between readability indices and age rating scores allow using the values of readability metrics as text features in various computational linguistics tasks aimed at text addressee prediction.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
图书年龄分级与易读性:两项指标的相关性研究
本文考察了表征图书内容的语言或语义复杂程度的两个指标之间的相关性。第一个指标是根据俄罗斯信息产品年龄分级系统进行的年龄分级。第二个指标是文本的易理解性,基于常见的可读性指标计算。作者比较了不同年龄评分文本的可读性指标值。实验是在本文作者收集的5516本图书预览集上进行的。所使用的预览可以在电子图书馆免费获得,并且它们有从出版商那里获得的年龄评级分数。根据俄罗斯联邦采用的系统,年龄分级分数描述了这本书针对以下年龄段的目标:0+,6+,12+,16+和18+。在大多数情况下,图书预览的大小是全文的10%,这使得计算可读性指数成为可能。根据五种常用的可读性指标进行评分:Flash-Kincaid指数、Coleman-Liau指数、ARI指数、SMOG指数和Dale-Chell公式。通过对每个年龄组文本的可读性评估,作者获得了他们理解所需的推荐教育水平。所得值在年龄组内取平均值并进行分析。实验的结果可以断言,在大多数情况下,书的年龄评级分数与理解它所需的预期教育水平之间存在直接关系。此外,根据所有考虑的指标,可读性得分与0+到16+年龄类别的年龄评分得分成正比。18+类别书籍的可读性得分大致对应于儿童文学,这显然是由18+标签所标记的书籍的类型特征所解释的。首先,所获得的结果表明,在按年龄将文本归给目标读者方面,现有的评估图书年龄评级方法是适当的。其次,可读性指标与年龄评分之间的关系允许在各种旨在预测文本收件人的计算语言学任务中使用可读性指标的值作为文本特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Tekst Kniga Knigoizdanie-Text Book Publishing
Tekst Kniga Knigoizdanie-Text Book Publishing HUMANITIES, MULTIDISCIPLINARY-
CiteScore
0.20
自引率
0.00%
发文量
0
期刊最新文献
Titles of editions: Between advertising and pragmatics On church singing in the Fedoseevtsy collection Paternal Testaments. Article I The reader-character in Yuri Felzen’s novel Letters about Lermontov Book review: Tatsumi, Y. & Tsurumi, T. (eds) (2020) Publishing in Tsarist Russia: A history of print media from Enlightenment to Revolution. London: Bloomsbury Academic. 280 р. Perception of pictorial text: Problematization, actualization, new methodological approaches
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1