{"title":"标准化的意义","authors":"A. Petrulevich","doi":"10.1163/18756719-12340262","DOIUrl":null,"url":null,"abstract":"\nThis article examines current practices of normalization of names in Norse philology and computational linguistics that to a large extent build on deductive reasoning and external authoritative sources such as grammars, dictionaries and gazetteers. Instead, a survey of manuscript evidence and quantification of name forms at several levels of abstraction is proposed as an alternative inductive principle of normalization. A case study of name-form distributions in a dataset of 6,633 spatial attestations in East Norse literature from the Norse World resource serves as a point of departure for a discussion of the advantages and disadvantages of the approach. The comparison between attestations linked to the five most frequent place-names in Old Swedish and Old Danish shows the existence of typical spellings. However, there are still examples of norm negotiations and competitive distributions. Thus, the first inductive step of normalization can be complemented by further processing based on correspondences between phonology and spelling. Finally, stratified normalization of place-names pioneered by Norse World is seen as more versatile compared to traditional methods; the approach has a potential to facilitate both more nuanced philological and linguistic research as well as the further development of named-entity recognition tools.","PeriodicalId":108095,"journal":{"name":"Amsterdamer Beiträge zur älteren Germanistik","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Making Sense of Normalization\",\"authors\":\"A. Petrulevich\",\"doi\":\"10.1163/18756719-12340262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\nThis article examines current practices of normalization of names in Norse philology and computational linguistics that to a large extent build on deductive reasoning and external authoritative sources such as grammars, dictionaries and gazetteers. Instead, a survey of manuscript evidence and quantification of name forms at several levels of abstraction is proposed as an alternative inductive principle of normalization. A case study of name-form distributions in a dataset of 6,633 spatial attestations in East Norse literature from the Norse World resource serves as a point of departure for a discussion of the advantages and disadvantages of the approach. The comparison between attestations linked to the five most frequent place-names in Old Swedish and Old Danish shows the existence of typical spellings. However, there are still examples of norm negotiations and competitive distributions. Thus, the first inductive step of normalization can be complemented by further processing based on correspondences between phonology and spelling. Finally, stratified normalization of place-names pioneered by Norse World is seen as more versatile compared to traditional methods; the approach has a potential to facilitate both more nuanced philological and linguistic research as well as the further development of named-entity recognition tools.\",\"PeriodicalId\":108095,\"journal\":{\"name\":\"Amsterdamer Beiträge zur älteren Germanistik\",\"volume\":\"90 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Amsterdamer Beiträge zur älteren Germanistik\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1163/18756719-12340262\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Amsterdamer Beiträge zur älteren Germanistik","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1163/18756719-12340262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This article examines current practices of normalization of names in Norse philology and computational linguistics that to a large extent build on deductive reasoning and external authoritative sources such as grammars, dictionaries and gazetteers. Instead, a survey of manuscript evidence and quantification of name forms at several levels of abstraction is proposed as an alternative inductive principle of normalization. A case study of name-form distributions in a dataset of 6,633 spatial attestations in East Norse literature from the Norse World resource serves as a point of departure for a discussion of the advantages and disadvantages of the approach. The comparison between attestations linked to the five most frequent place-names in Old Swedish and Old Danish shows the existence of typical spellings. However, there are still examples of norm negotiations and competitive distributions. Thus, the first inductive step of normalization can be complemented by further processing based on correspondences between phonology and spelling. Finally, stratified normalization of place-names pioneered by Norse World is seen as more versatile compared to traditional methods; the approach has a potential to facilitate both more nuanced philological and linguistic research as well as the further development of named-entity recognition tools.