On the ‘Stickiness’ of Words. A Comparative Language Study Screening the Internet for English, German, French and Latin Phrases

IF 0.7 2区 文学 N/A LANGUAGE & LINGUISTICS Journal of Quantitative Linguistics Pub Date : 2019-01-02 DOI:10.1080/09296174.2018.1451206
M. Berger
{"title":"On the ‘Stickiness’ of Words. A Comparative Language Study Screening the Internet for English, German, French and Latin Phrases","authors":"M. Berger","doi":"10.1080/09296174.2018.1451206","DOIUrl":null,"url":null,"abstract":"Abstract Language, one of the defining attributes of Homo sapiens, not only deploys as a chain of words. Rather, words group together in a non-random way to form phrases. Here, the world-wide web was searched for idiomatic expressions in three living and one extinct language: 1102 English, 1183 German, 1138 French and 1128 Latin phrases distributed into three categories, with high, middle and low frequencies. High-frequency phrases such as in addition to and as a matter of fact constituted 49.5% of all English phrases, but only 9.0% of the French and 2.5% of the German ones. The middle-frequency category with classical idioms such as a bitter pill or carved in stone comprised 34.9% of the English, 33.0% of the French, and 24.9% of the German phrases. Most French and German phrases were of low frequency. Latin phrases were found as often as French and more often than German ones in the world-wide web, and exhibited a frequency distribution similar to those of French and German. Frequency distributions yielded three main categories around similar maxima for all four languages, with differing relative proportions. The internet may prove useful for the quantitative comparison of languages.","PeriodicalId":45514,"journal":{"name":"Journal of Quantitative Linguistics","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2019-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/09296174.2018.1451206","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Quantitative Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1080/09296174.2018.1451206","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"N/A","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Language, one of the defining attributes of Homo sapiens, not only deploys as a chain of words. Rather, words group together in a non-random way to form phrases. Here, the world-wide web was searched for idiomatic expressions in three living and one extinct language: 1102 English, 1183 German, 1138 French and 1128 Latin phrases distributed into three categories, with high, middle and low frequencies. High-frequency phrases such as in addition to and as a matter of fact constituted 49.5% of all English phrases, but only 9.0% of the French and 2.5% of the German ones. The middle-frequency category with classical idioms such as a bitter pill or carved in stone comprised 34.9% of the English, 33.0% of the French, and 24.9% of the German phrases. Most French and German phrases were of low frequency. Latin phrases were found as often as French and more often than German ones in the world-wide web, and exhibited a frequency distribution similar to those of French and German. Frequency distributions yielded three main categories around similar maxima for all four languages, with differing relative proportions. The internet may prove useful for the quantitative comparison of languages.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
关于单词的“粘性”。一项比较语言研究:筛选网络上的英语、德语、法语和拉丁语短语
抽象语言是智人的特征之一,它不仅是一个单词链。相反,单词以非随机的方式组合在一起形成短语。在这里,万维网搜索了三种现存和一种已灭绝语言的习语:1102个英语、1183个德语、1138个法语和1128个拉丁短语,分为三类,频率分别为高、中、低频。事实上,高频短语占所有英语短语的49.5%,但仅占法语短语的9.0%和德语短语的2.5%。英语、法语和德语中分别占34.9%、33.0%和24.9%。大多数法语和德语短语频率较低。在万维网上,拉丁短语与法语一样常见,比德语更常见,其频率分布与法语和德语相似。频率分布在所有四种语言的相似最大值周围产生了三个主要类别,具有不同的相对比例。事实证明,互联网可能有助于语言的定量比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.90
自引率
7.10%
发文量
7
期刊介绍: The Journal of Quantitative Linguistics is an international forum for the publication and discussion of research on the quantitative characteristics of language and text in an exact mathematical form. This approach, which is of growing interest, opens up important and exciting theoretical perspectives, as well as solutions for a wide range of practical problems such as machine learning or statistical parsing, by introducing into linguistics the methods and models of advanced scientific disciplines such as the natural sciences, economics, and psychology.
期刊最新文献
Exploring Colligation Diversity and Grammaticalization in Chinese: An Entropy-Based Approach The Menzerath-Altmann Law at the Paragraph Level in Written Chinese: Why Register and Text Size Matter? An Information-Theoretic Approach to Morphosyntactic Complexity in English, Dutch and German Investigating the Hierarchical Relationship Between Clause and Phrase Using the Menzerath-Altmann Law: Evidence from Academic Research Articles Quantifying Syntactic Complexity in Czech Texts: An Analysis of Mean Dependency Distance and Average Sentence Length Across Genres
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1