Making Sense of Normalization

A. Petrulevich
Journal: Amsterdamer Beiträge zur älteren Germanistik
Publication type: Journal Article
Published: 2022-10-26
DOI: https://doi.org/10.1163/18756719-12340262

Abstract

This article examines current practices of normalization of names in Norse philology and computational linguistics that to a large extent build on deductive reasoning and external authoritative sources such as grammars, dictionaries and gazetteers. Instead, a survey of manuscript evidence and quantification of name forms at several levels of abstraction is proposed as an alternative inductive principle of normalization. A case study of name-form distributions in a dataset of 6,633 spatial attestations in East Norse literature from the Norse World resource serves as a point of departure for a discussion of the advantages and disadvantages of the approach. The comparison between attestations linked to the five most frequent place-names in Old Swedish and Old Danish shows the existence of typical spellings. However, there are still examples of norm negotiations and competitive distributions. Thus, the first inductive step of normalization can be complemented by further processing based on correspondences between phonology and spelling. Finally, stratified normalization of place-names pioneered by Norse World is seen as more versatile compared to traditional methods; the approach has a potential to facilitate both more nuanced philological and linguistic research as well as the further development of named-entity recognition tools.
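
The inductive first step described in the abstract, counting attested name forms and treating the most frequent one as the typical spelling, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the article's or Norse World's actual procedure: the function name `typical_spellings`, the `dominance` threshold, and the toy attestations are all hypothetical, and the sketch covers only a single level of abstraction.

```python
from collections import Counter, defaultdict


def typical_spellings(attestations, dominance=0.5):
    """Summarize attested forms per place-name.

    attestations: iterable of (place_name, attested_form) pairs.
    dominance: hypothetical cut-off; if the most frequent form's share
        is at or below it, the distribution is flagged as competitive.
    """
    forms_by_name = defaultdict(Counter)
    for place, form in attestations:
        forms_by_name[place][form] += 1

    summary = {}
    for place, counts in forms_by_name.items():
        total = sum(counts.values())
        top_form, top_count = counts.most_common(1)[0]
        share = top_count / total
        summary[place] = {
            "typical": top_form,             # most frequent attested form
            "share": share,                  # its proportion of all attestations
            "competitive": share <= dominance,
        }
    return summary


# Invented toy data for illustration only; these are NOT forms taken
# from the Norse World dataset.
toy = [
    ("Rome", "rom"), ("Rome", "rom"), ("Rome", "room"), ("Rome", "rom"),
    ("Jerusalem", "iherusalem"), ("Jerusalem", "jerusalem"),
]
result = typical_spellings(toy)
```

A place-name flagged as competitive here would be a candidate for the further processing the abstract mentions, based on correspondences between phonology and spelling, rather than being normalized by frequency alone.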