语料库语言学:使用古英语词典获得更多古英语拼写变化的数据

IF 0.7 3区 文学 0 HUMANITIES, MULTIDISCIPLINARY Digital Scholarship in the Humanities Pub Date : 2023-10-11 DOI:10.1093/llc/fqad064
Mark Faulkner
{"title":"语料库语言学:使用古英语词典获得更多古英语拼写变化的数据","authors":"Mark Faulkner","doi":"10.1093/llc/fqad064","DOIUrl":null,"url":null,"abstract":"Abstract This article presents a methodology for obtaining large datasets for the spelling of individual phonological segments in Old English texts, based on searching the Dictionary of Old English Corpus for the attested spellings listed in the Dictionary of Old English A-H. It exemplifies this ‘corpus philology’ through a study of 216,526 spellings for words beginning with h followed by a vowel, using a variety of techniques to evaluate the methodology’s precision and recall, which are calculated as very high for <h->initial spellings (precision 100% precision, recall 92.1%) and moderate, but still usable, for <h->less spellings (precision 85.5%, recall 58.3%). Data for fourteen other segments related to the behaviour of h- in Old English is presented in the Supplementary Materials that complement the paper online. This dataset of 379,484 spellings from 2,605 Old English texts is shown to seriously problematize the findings of traditional philology, the conclusions of which are in contrast based on only a handful of spellings from a few texts, and to have the potential to radically enhance our understanding of the literary and linguistic histories of English.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"5 1","pages":"0"},"PeriodicalIF":0.7000,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Corpus philology: Using the Dictionary of Old English to get bigger data for Old English spelling variation\",\"authors\":\"Mark Faulkner\",\"doi\":\"10.1093/llc/fqad064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract This article presents a methodology for obtaining large datasets for the spelling of individual phonological segments in Old English texts, based on searching the Dictionary of Old English Corpus for the attested spellings listed in the Dictionary of Old English A-H. It exemplifies this ‘corpus philology’ through a study of 216,526 spellings for words beginning with h followed by a vowel, using a variety of techniques to evaluate the methodology’s precision and recall, which are calculated as very high for <h->initial spellings (precision 100% precision, recall 92.1%) and moderate, but still usable, for <h->less spellings (precision 85.5%, recall 58.3%). Data for fourteen other segments related to the behaviour of h- in Old English is presented in the Supplementary Materials that complement the paper online. This dataset of 379,484 spellings from 2,605 Old English texts is shown to seriously problematize the findings of traditional philology, the conclusions of which are in contrast based on only a handful of spellings from a few texts, and to have the potential to radically enhance our understanding of the literary and linguistic histories of English.\",\"PeriodicalId\":45315,\"journal\":{\"name\":\"Digital Scholarship in the Humanities\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Scholarship in the Humanities\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/llc/fqad064\",\"RegionNum\":3,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"HUMANITIES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Scholarship in the Humanities","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/llc/fqad064","RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"HUMANITIES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

摘要本文提出了一种基于古英语语料库词典(Dictionary of Old English Corpus)对古英语a - h中列出的已证实的拼写进行检索的方法,用于获取古英语文本中单个音韵段拼写的大型数据集。它通过对以h开头的单词后面跟着一个元音的216,526个拼写的研究来例证这种“语库语言学”,使用各种技术来评估该方法的准确性和召回率,计算结果表明,<h->初始拼写非常高(精确度100%,召回率92.1%),中等,但仍然可用,对于<h->较少拼写(精确度85.5%,召回率58.3%)。与古英语中h-的行为相关的其他14个片段的数据在补充材料中提出,补充在线论文。这个包含2605个古英语文本的379484个拼写的数据集严重质疑了传统文献学的发现,传统文献学的结论仅基于少数文本的少数拼写,并且有可能从根本上增强我们对英语文学和语言历史的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Corpus philology: Using the Dictionary of Old English to get bigger data for Old English spelling variation
Abstract This article presents a methodology for obtaining large datasets for the spelling of individual phonological segments in Old English texts, based on searching the Dictionary of Old English Corpus for the attested spellings listed in the Dictionary of Old English A-H. It exemplifies this ‘corpus philology’ through a study of 216,526 spellings for words beginning with h followed by a vowel, using a variety of techniques to evaluate the methodology’s precision and recall, which are calculated as very high for &lt;h-&gt;initial spellings (precision 100% precision, recall 92.1%) and moderate, but still usable, for &lt;h-&gt;less spellings (precision 85.5%, recall 58.3%). Data for fourteen other segments related to the behaviour of h- in Old English is presented in the Supplementary Materials that complement the paper online. This dataset of 379,484 spellings from 2,605 Old English texts is shown to seriously problematize the findings of traditional philology, the conclusions of which are in contrast based on only a handful of spellings from a few texts, and to have the potential to radically enhance our understanding of the literary and linguistic histories of English.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.80
自引率
25.00%
发文量
78
期刊介绍: DSH or Digital Scholarship in the Humanities is an international, peer reviewed journal which publishes original contributions on all aspects of digital scholarship in the Humanities including, but not limited to, the field of what is currently called the Digital Humanities. Long and short papers report on theoretical, methodological, experimental, and applied research and include results of research projects, descriptions and evaluations of tools, techniques, and methodologies, and reports on work in progress. DSH also publishes reviews of books and resources. Digital Scholarship in the Humanities was previously known as Literary and Linguistic Computing.
期刊最新文献
Social network analysis of the Babylonian Talmud Ancient classical theatre from the digital humanities: a systematic review 2010–21 Language-based machine perception: linguistic perspectives on the compilation of captioning datasets Personality prediction via multi-task transformer architecture combined with image aesthetics Who wrote the first Constitutions of Freemasonry?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1