首页 > 最新文献

Word Structure最新文献

英文 中文
Realised overabundance in Estonian noun paradigms: A corpus study 爱沙尼亚语名词范式的实现过剩:语料库研究
0 LANGUAGE & LINGUISTICS Pub Date : 2023-11-01 DOI: 10.3366/word.2023.0227
Mari Aigro, Virve-Anneli Vihman
In this paper we investigate the use of parallel forms, or morphological overabundance, in Estonian nouns, which are richly inflected in fourteen cases and two numbers. The complexity of the nominal inflection system derives from the multiplicity of declension classes, which allow for the availability of parallel forms in many morphological paradigm cells. Whereas previous studies of overabundance in nominal inflection have focussed on available overabundance in particular cells, this study focuses on the actual usage of overabundant forms, or ‘realised overabundance’, taking as a starting point those lexemes which are attested with two parallel forms in the corpus. Following the criteria proposed by Thornton (2019) , we investigate the canonicity of realised overabundance in Estonian, in terms of its extent across both lexemes and cells. In addition, frequency ratios of alternate variants and conditions on their usage are investigated by comparing lexemes in the most overabundant cells. Finally, we tackle the connections between morphological overabundance and syntactic argumenthood, exploring the hypothesis that parallel forms serve distinct functions. We find no contextual preference in terms of which of the two forms is used. In general, we find realised overabundance to be restricted and more canonical than potential overabundance.
在本文中,我们研究了爱沙尼亚名词中平行形式或形态过剩的使用,这些名词在14个案例和两个数字中都有丰富的屈折。名义词形变化系统的复杂性源于词形变化类的多样性,它允许在许多形态范式细胞中使用平行形式。鉴于之前关于名义词形变化中过剩的研究主要集中在特定细胞中的可用过剩,本研究侧重于过剩形式的实际使用,或“实现过剩”,以语料库中两个平行形式证明的词汇为起点。根据Thornton(2019)提出的标准,我们调查了爱沙尼亚语中已实现的过剩的规范性,就其在词素和细胞中的程度而言。此外,通过比较最丰富的细胞中的词素,研究了交替变异体的频率比及其使用条件。最后,我们讨论了形态过剩和句法论证之间的联系,探讨了平行形式具有不同功能的假设。我们发现在使用这两种形式中的哪一种方面没有上下文偏好。一般来说,我们发现已实现的过剩比潜在的过剩更受限制和规范。
{"title":"Realised overabundance in Estonian noun paradigms: A corpus study","authors":"Mari Aigro, Virve-Anneli Vihman","doi":"10.3366/word.2023.0227","DOIUrl":"https://doi.org/10.3366/word.2023.0227","url":null,"abstract":"In this paper we investigate the use of parallel forms, or morphological overabundance, in Estonian nouns, which are richly inflected in fourteen cases and two numbers. The complexity of the nominal inflection system derives from the multiplicity of declension classes, which allow for the availability of parallel forms in many morphological paradigm cells. Whereas previous studies of overabundance in nominal inflection have focussed on available overabundance in particular cells, this study focuses on the actual usage of overabundant forms, or ‘realised overabundance’, taking as a starting point those lexemes which are attested with two parallel forms in the corpus. Following the criteria proposed by Thornton (2019) , we investigate the canonicity of realised overabundance in Estonian, in terms of its extent across both lexemes and cells. In addition, frequency ratios of alternate variants and conditions on their usage are investigated by comparing lexemes in the most overabundant cells. Finally, we tackle the connections between morphological overabundance and syntactic argumenthood, exploring the hypothesis that parallel forms serve distinct functions. We find no contextual preference in terms of which of the two forms is used. In general, we find realised overabundance to be restricted and more canonical than potential overabundance.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"23 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135566279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty in the production of Czech noun and verb forms 捷克语名词和动词形式的不确定性
0 LANGUAGE & LINGUISTICS Pub Date : 2023-11-01 DOI: 10.3366/word.2023.0231
Neil Bermel, Luděk Knittl, Alexandre Nikolaev
We examine the reactions of Czech native speakers to cues asking them to supply inflectional forms of nouns and verbs that are either canonical (non-variant), overabundant, or supposedly defective, to see what distinguishing characteristics these three conditions have for production. We find that respondents handle defective material differently from other conditions, producing different sorts of forms at different frequencies, and taking significantly longer to do so. Overabundant cells pattern at the individual level like canonical inflectional cells, but collectively display a significantly more varied and less focused spread of forms produced than our canonical cells. The individual dimension of uncertainty in production is thus limited to defective cells, but the collective dimension of uncertainty is evident between all three conditions.
我们研究了母语为捷克语的人对要求他们提供名词和动词的屈折形式的提示的反应,这些屈折形式要么是规范的(无变体),要么是过多的,要么是有缺陷的,看看这三种条件对生产有什么显著的特征。我们发现,受访者处理有缺陷的材料与其他条件不同,以不同的频率产生不同种类的形式,并且需要更长的时间来这样做。在个体水平上,细胞数量过多,就像典型的屈折细胞一样,但总的来说,它们所产生的形式比我们的典型细胞更多样化,更不集中。因此,生产中个别方面的不确定性仅限于有缺陷的细胞,但在所有三种情况下,集体方面的不确定性是明显的。
{"title":"Uncertainty in the production of Czech noun and verb forms","authors":"Neil Bermel, Luděk Knittl, Alexandre Nikolaev","doi":"10.3366/word.2023.0231","DOIUrl":"https://doi.org/10.3366/word.2023.0231","url":null,"abstract":"We examine the reactions of Czech native speakers to cues asking them to supply inflectional forms of nouns and verbs that are either canonical (non-variant), overabundant, or supposedly defective, to see what distinguishing characteristics these three conditions have for production. We find that respondents handle defective material differently from other conditions, producing different sorts of forms at different frequencies, and taking significantly longer to do so. Overabundant cells pattern at the individual level like canonical inflectional cells, but collectively display a significantly more varied and less focused spread of forms produced than our canonical cells. The individual dimension of uncertainty in production is thus limited to defective cells, but the collective dimension of uncertainty is evident between all three conditions.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"189 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135566138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stem overgeneralizations in the acquisition of Croatian verbal morphology: Evidence from parental questionnaires 克罗地亚语词法习得中的词干过度概括:来自父母调查问卷的证据
0 LANGUAGE & LINGUISTICS Pub Date : 2023-11-01 DOI: 10.3366/word.2023.0228
Gordana Hržica, Tomislava Bošnjak Botica, Sara Košutar
Studies on verbal overgeneralization often focus on languages with low morphological complexity. The Croatian conjugational system exhibits varying degrees of complexity, and this complexity is not primarily based on the number of inflectional morphemes, but on an elaborate system of stem changes. During early language development, children face the difficult task of acquiring this system, using overgeneralized forms to overcome its complexity. To date, studies have used a corpus-based method to retrieve overgeneralizations in child language, which has had limited success in capturing this phenomenon. The aim of this study was to investigate the production of overgeneralized verb forms in Croatian monolingual children aged 2;6 to 5;11 using a questionnaire in which parents report overgeneralizations used by their children. We tested the relationship between the production of overgeneralized forms and features of the input language (token frequency and class size). We hypothesized that the rate of overgeneralizations will depend on input language features, i.e. a higher rate of overgeneralizations for infrequent verbs and for verbs with smaller class size. The items selected for the questionnaire are the verbs with stem change used by parents in the longitudinal Croatian corpus of child language. Parents report overgeneralized forms in all verb classes, and verb frequency and class size negatively correlate with the proportion of overgeneralizations. Our results show that children gradually abstract morphological systems in a way that is highly sensitive to the properties of the input.
言语过度概括的研究往往集中在形态复杂性较低的语言上。克罗地亚语的共轭系统表现出不同程度的复杂性,这种复杂性主要不是基于屈折语素的数量,而是基于一个复杂的系统变化。在早期的语言发展过程中,儿童面临着获取这一系统的困难任务,他们使用过度概括的形式来克服其复杂性。迄今为止,已有研究使用基于语料库的方法来检索儿童语言中的过度概括,但在捕捉这种现象方面收效甚微。本研究的目的是调查2岁至5岁11岁克罗地亚单语儿童过度概括动词形式的产生,使用问卷调查,父母报告他们的孩子使用过度概括。我们测试了过度一般化形式的产生与输入语言的特征(标记频率和类大小)之间的关系。我们假设过度概括的比率将取决于输入语言的特征,即对于不频繁的动词和较小的类规模的动词,过度概括的比率更高。问卷所选项目为纵向克罗埃西亚语儿童语料库中父母使用的词干变化动词。在所有的动词类中,家长都报告了过度概括的形式,动词的使用频率和班级规模与过度概括的比例呈负相关。我们的研究结果表明,儿童逐渐抽象形态系统的方式是高度敏感的输入属性。
{"title":"Stem overgeneralizations in the acquisition of Croatian verbal morphology: Evidence from parental questionnaires","authors":"Gordana Hržica, Tomislava Bošnjak Botica, Sara Košutar","doi":"10.3366/word.2023.0228","DOIUrl":"https://doi.org/10.3366/word.2023.0228","url":null,"abstract":"Studies on verbal overgeneralization often focus on languages with low morphological complexity. The Croatian conjugational system exhibits varying degrees of complexity, and this complexity is not primarily based on the number of inflectional morphemes, but on an elaborate system of stem changes. During early language development, children face the difficult task of acquiring this system, using overgeneralized forms to overcome its complexity. To date, studies have used a corpus-based method to retrieve overgeneralizations in child language, which has had limited success in capturing this phenomenon. The aim of this study was to investigate the production of overgeneralized verb forms in Croatian monolingual children aged 2;6 to 5;11 using a questionnaire in which parents report overgeneralizations used by their children. We tested the relationship between the production of overgeneralized forms and features of the input language (token frequency and class size). We hypothesized that the rate of overgeneralizations will depend on input language features, i.e. a higher rate of overgeneralizations for infrequent verbs and for verbs with smaller class size. The items selected for the questionnaire are the verbs with stem change used by parents in the longitudinal Croatian corpus of child language. Parents report overgeneralized forms in all verb classes, and verb frequency and class size negatively correlate with the proportion of overgeneralizations. Our results show that children gradually abstract morphological systems in a way that is highly sensitive to the properties of the input.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"198 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135566371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The structuralist tradition meets empirical data: Corpus data enhancing the Czech Internet Language Reference Book 结构主义传统与经验数据的相遇:语料库数据对捷克网络语言参考书的增强
0 LANGUAGE & LINGUISTICS Pub Date : 2023-11-01 DOI: 10.3366/word.2023.0230
Dominika Kováříková, Martin Beneš, Kamila Smejkalová, Oleg Kovářík
This paper demonstrates how the corpus grammar tool GramatiKat can be used to improve and refine morphological information in the Internet Language Reference Book (ILRB), which presents complete declension paradigms for 45,632 standard Czech nouns. The paradigm tables are based mainly on morphological types, following structuralist conceptions of language as a fully articulated system. The paper discusses how to update the ILRB and provide users with empirically based grammatical information for individual word forms in each cell of the paradigm. All noun lemmas have been investigated using the GramatiKat tool for research into grammatical categories in Czech. The tool observes the distribution of word forms of a particular lexeme in comparison with the standard distribution across the whole word class. It is capable of identifying nouns that have an unusually high occurrence of a certain word form, as well as nouns with unattested word forms. GramatiKat is based on the data from two corpora of Czech written texts, SYN2015 and SYN2020 (200 million word tokens). The paper investigates the relationship between defectiveness and overabundance on one side and language variability and potentiality on the other. Based on the unique combination of data from the ILRB and GramatiKat, the paper suggests how information about unusually frequent or overabundant word forms as well as unattested ones should be pointed out, so that ILRB provides the user with accurate, empirically based data.
本文展示了语料库语法工具GramatiKat如何改进和完善网络语言工具书(ILRB)中的形态信息,该工具书提供了45,632个标准捷克语名词的完整变格范式。范式表主要基于形态类型,遵循语言作为一个完全铰接系统的结构主义概念。本文讨论了如何更新语料库,为用户提供基于经验的语料库范式中每个单元中单个词形的语法信息。使用GramatiKat工具研究捷克语的语法类别,对所有名词引理进行了调查。该工具观察特定词素的词形分布,并与整个词类的标准分布进行比较。它能够识别在某种词形中出现频率异常高的名词,以及具有未经证实的词形的名词。GramatiKat基于两个捷克语语料库SYN2015和SYN2020(2亿个单词标记)的数据。本文探讨了语言的缺陷和过剩与语言的变异性和潜能之间的关系。基于ILRB和GramatiKat数据的独特组合,本文提出了如何指出异常频繁或过多的词形信息以及未经证实的词形信息,以便ILRB为用户提供准确的、基于经验的数据。
{"title":"The structuralist tradition meets empirical data: Corpus data enhancing the Czech Internet Language Reference Book","authors":"Dominika Kováříková, Martin Beneš, Kamila Smejkalová, Oleg Kovářík","doi":"10.3366/word.2023.0230","DOIUrl":"https://doi.org/10.3366/word.2023.0230","url":null,"abstract":"This paper demonstrates how the corpus grammar tool GramatiKat can be used to improve and refine morphological information in the Internet Language Reference Book (ILRB), which presents complete declension paradigms for 45,632 standard Czech nouns. The paradigm tables are based mainly on morphological types, following structuralist conceptions of language as a fully articulated system. The paper discusses how to update the ILRB and provide users with empirically based grammatical information for individual word forms in each cell of the paradigm. All noun lemmas have been investigated using the GramatiKat tool for research into grammatical categories in Czech. The tool observes the distribution of word forms of a particular lexeme in comparison with the standard distribution across the whole word class. It is capable of identifying nouns that have an unusually high occurrence of a certain word form, as well as nouns with unattested word forms. GramatiKat is based on the data from two corpora of Czech written texts, SYN2015 and SYN2020 (200 million word tokens). The paper investigates the relationship between defectiveness and overabundance on one side and language variability and potentiality on the other. Based on the unique combination of data from the ILRB and GramatiKat, the paper suggests how information about unusually frequent or overabundant word forms as well as unattested ones should be pointed out, so that ILRB provides the user with accurate, empirically based data.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"15 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135565240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Front matter 前页
0 LANGUAGE & LINGUISTICS Pub Date : 2023-11-01 DOI: 10.3366/word.2023.0225
{"title":"Front matter","authors":"","doi":"10.3366/word.2023.0225","DOIUrl":"https://doi.org/10.3366/word.2023.0225","url":null,"abstract":"","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"209 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135566368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Studying negative evidence in Finnish language corpora 芬兰语料库中的否定证据研究
0 LANGUAGE & LINGUISTICS Pub Date : 2023-11-01 DOI: 10.3366/word.2023.0229
Alexandre Nikolaev, Neil Bermel
This study explores the relationship between lower-than-expected frequencies of word forms and inherent gaps in Finnish inflectional paradigms. The research aims to determine whether it is possible to predict paradigmatic gaps from lower-than-expected frequencies of word forms. We examined Finnish nouns inflected in a marginal case (the instructive) and hypothesized that some of these nouns may potentially have gaps in their inflectional paradigms. However, we found that such gaps are contingent and do not cause uncertainty when filled. We find that the correlation between inherent gaps and lower frequencies is one-directional: predicting inherent gaps from lower-than-expected frequencies is problematic. The results suggest that any paradigmatic gap suggested by corpus frequency is more likely to be contingent than inherent, and that the less semantic need there is for a particular word form, the more likely it will be unattested even in a large corpus. The research highlights the importance of considering semantic profiles when analyzing the grammaticality of word forms and suggests that statistical tests like Fisher’s exact are not necessarily the right approach to tackle the problem of negative evidence in corpus studies.
本研究探讨芬兰语屈折范式中低于预期的词形频率与固有差距之间的关系。该研究旨在确定是否有可能从低于预期的词形频率中预测范式间隔。我们研究了芬兰语名词在边缘情况下的屈折变化(指导性的),并假设其中一些名词可能在其屈折变化范式中存在潜在的差距。然而,我们发现这样的差距是偶然的,当填补时不会造成不确定性。我们发现固有间隙和低频之间的相关性是单向的:从低于预期的频率预测固有间隙是有问题的。结果表明,语料库频率暗示的任何范式差距更有可能是偶然的,而不是固有的,并且对特定词形的语义需求越少,即使在大型语料库中,它也越有可能未经证实。该研究强调了在分析词形的语法性时考虑语义轮廓的重要性,并表明像Fisher的精确统计测试不一定是解决语料库研究中负面证据问题的正确方法。
{"title":"Studying negative evidence in Finnish language corpora","authors":"Alexandre Nikolaev, Neil Bermel","doi":"10.3366/word.2023.0229","DOIUrl":"https://doi.org/10.3366/word.2023.0229","url":null,"abstract":"This study explores the relationship between lower-than-expected frequencies of word forms and inherent gaps in Finnish inflectional paradigms. The research aims to determine whether it is possible to predict paradigmatic gaps from lower-than-expected frequencies of word forms. We examined Finnish nouns inflected in a marginal case (the instructive) and hypothesized that some of these nouns may potentially have gaps in their inflectional paradigms. However, we found that such gaps are contingent and do not cause uncertainty when filled. We find that the correlation between inherent gaps and lower frequencies is one-directional: predicting inherent gaps from lower-than-expected frequencies is problematic. The results suggest that any paradigmatic gap suggested by corpus frequency is more likely to be contingent than inherent, and that the less semantic need there is for a particular word form, the more likely it will be unattested even in a large corpus. The research highlights the importance of considering semantic profiles when analyzing the grammaticality of word forms and suggests that statistical tests like Fisher’s exact are not necessarily the right approach to tackle the problem of negative evidence in corpus studies.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"18 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135565237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Affix rivalry in French demonym formation: The role of linguistic and non-linguistic parameters 法语魔词形成中的词缀竞争:语言和非语言参数的作用
IF 0.6 0 LANGUAGE & LINGUISTICS Pub Date : 2023-03-01 DOI: 10.3366/word.2023.0223
Juliette Thuilier, D. Tribout, Marine Wauquier
Affix rivalry is defined as the phenomenon of morphological competition where affixes and meaning are in a many-to-one relationship. Because of their poor semantic content, demonyms are perfect candidates for the investigation of selectional constraints in such a context. Indeed the morphological processes they originate from are characterized by their shared, straightfoward semantic relation, as they denote inhabitants linked to the toponym they derive from, which allows for the apparently simplified scrutinization of non-semantic properties. Investigations suggest a more nuanced and complex reality. The present study provides a quantitative and statistical investigation of the rivalry between French - ois, - ais, - ien and - éen suffixes. It notably relies on phonological and morphological features. Its contribution pertains to the use of statistical modeling to provide a quantitative description and to the integration of extralinguistic features on the nature of geographical proximity in a quantitative approach. The study shows that while the model cannot accurately predict the suffix of a given demonym based on these features, it still draws on the main tendencies underlying affix rivalry in the formation of French demonyms.
词缀竞争是词缀与意义处于多对一关系中的形态竞争现象。由于其较差的语义内容,在这种情况下,同义词是研究选择约束的完美候选者。事实上,它们起源的形态过程以其共享的、直接的语义关系为特征,因为它们表示与它们来源的地名相关的居民,这允许对非语义属性进行明显简化的审查。调查显示了一个更加微妙和复杂的现实。本研究对法语-ois、-ais、-ien和-éen后缀之间的竞争进行了定量和统计调查。它主要依赖于语音和形态特征。它的贡献涉及使用统计建模来提供定量描述,以及以定量方法整合关于地理邻近性质的语言外特征。研究表明,尽管该模型无法根据这些特征准确预测给定地名的后缀,但它仍然借鉴了法语地名形成中词缀竞争的主要趋势。
{"title":"Affix rivalry in French demonym formation: The role of linguistic and non-linguistic parameters","authors":"Juliette Thuilier, D. Tribout, Marine Wauquier","doi":"10.3366/word.2023.0223","DOIUrl":"https://doi.org/10.3366/word.2023.0223","url":null,"abstract":"Affix rivalry is defined as the phenomenon of morphological competition where affixes and meaning are in a many-to-one relationship. Because of their poor semantic content, demonyms are perfect candidates for the investigation of selectional constraints in such a context. Indeed the morphological processes they originate from are characterized by their shared, straightfoward semantic relation, as they denote inhabitants linked to the toponym they derive from, which allows for the apparently simplified scrutinization of non-semantic properties. Investigations suggest a more nuanced and complex reality. The present study provides a quantitative and statistical investigation of the rivalry between French - ois, - ais, - ien and - éen suffixes. It notably relies on phonological and morphological features. Its contribution pertains to the use of statistical modeling to provide a quantitative description and to the integration of extralinguistic features on the nature of geographical proximity in a quantitative approach. The study shows that while the model cannot accurately predict the suffix of a given demonym based on these features, it still draws on the main tendencies underlying affix rivalry in the formation of French demonyms.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"1 1","pages":""},"PeriodicalIF":0.6,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42914169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A distributional assessment of rivalry in word formation 词语形成竞争的分布评价
IF 0.6 0 LANGUAGE & LINGUISTICS Pub Date : 2023-03-01 DOI: 10.3366/word.2023.0222
Matı́as Guzmán Naranjo, Olivier Bonami
We contrast two views of rivalry in word formation. Under the classical, categorical view, two processes are rivals if they are semantically equivalent. Under the more nuanced, gradient view, two processes can be rivals at different degrees, depending on how frequently they are amenable to be deployed as alternatives to one another. We propose to use methods from distributional semantics to explore the usefulness of both views. Building on data from French, we first show that distributional differences between average difference vectors capture semantic similarity across derivational processes in a manner comparable to the expectations of expert morphologists. We then propose an operational implementation of the classical view of rivalry based on computational classifiers: processes are rivals if and only if a classifier is unable to discriminate between them. Experimentation with French data shows that this operationalization correctly captures the broad brushes of rivalry, but also reveals finer, gradient aspects of competition in the spirit of gradient rivalry.
我们对比了两种关于单词构成竞争的观点。在经典的范畴观点下,如果两个过程在语义上等价,那么它们就是竞争对手。在更微妙的梯度观点下,两个过程可以在不同程度上成为竞争对手,这取决于它们作为彼此替代品的部署频率。我们建议使用分布语义的方法来探索这两种观点的有用性。基于来自法语的数据,我们首先表明,平均差异向量之间的分布差异以与专业形态学家的期望相当的方式捕捉了衍生过程中的语义相似性。然后,我们提出了一种基于计算分类器的竞争经典观点的操作实现:当且仅当分类器无法区分进程时,进程才是竞争对手。对法国数据的实验表明,这种操作正确地捕捉到了竞争的全貌,但也以梯度竞争的精神揭示了竞争的更精细、梯度方面。
{"title":"A distributional assessment of rivalry in word formation","authors":"Matı́as Guzmán Naranjo, Olivier Bonami","doi":"10.3366/word.2023.0222","DOIUrl":"https://doi.org/10.3366/word.2023.0222","url":null,"abstract":"We contrast two views of rivalry in word formation. Under the classical, categorical view, two processes are rivals if they are semantically equivalent. Under the more nuanced, gradient view, two processes can be rivals at different degrees, depending on how frequently they are amenable to be deployed as alternatives to one another. We propose to use methods from distributional semantics to explore the usefulness of both views. Building on data from French, we first show that distributional differences between average difference vectors capture semantic similarity across derivational processes in a manner comparable to the expectations of expert morphologists. We then propose an operational implementation of the classical view of rivalry based on computational classifiers: processes are rivals if and only if a classifier is unable to discriminate between them. Experimentation with French data shows that this operationalization correctly captures the broad brushes of rivalry, but also reveals finer, gradient aspects of competition in the spirit of gradient rivalry.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":" ","pages":""},"PeriodicalIF":0.6,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49371806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Affix rivalry: Theoretical and methodological challenges 词缀竞争:理论和方法上的挑战
IF 0.6 0 LANGUAGE & LINGUISTICS Pub Date : 2023-03-01 DOI: 10.3366/word.2023.0218
R. Huyghe, Rossella Varvara
Affix rivalry is a key element in the organization and evolution of derivational systems. Its study provides insight into word-formation semantics, morphological change, productivity, lexical variation, and the many-to-many relationship between form and meaning. Affix rivalry is characterized by its gradient nature and the multiplicity of factors that can be involved in its resolution. On the one hand, rival affixes can be regarded as more or less competing depending on their semantic similarity and on the proportion of semantic functions they have in common. On the other hand, the distinction between rival affixes can rely on different linguistic properties, including not only semantic, but also phonological, morphological, syntactic, stylistic, and sociolinguistic properties. Differences are observed as tendencies and have a variable influence on affix selection. Quantitative methods can be used to precisely assess degrees of rivalry and multifactorial resolution of competition. Based on the statistical analysis of large sets of derivatives, they can provide an accurate description of affix rivalry and help us better understand the mechanisms of competition in word formation.
词缀竞争是衍生系统组织和进化的关键因素。它的研究提供了对构词语义、形态变化、生产力、词汇变化以及形式和意义之间多对多关系的洞察。词缀竞争具有梯度特征,其解决涉及多种因素。一方面,根据词缀的语义相似度和它们共同的语义功能的比例,可以将竞争词缀视为或多或少的竞争。另一方面,对立词缀之间的区分依赖于不同的语言特性,不仅包括语义特性,还包括音系、形态、句法、文体和社会语言学特性。差异是一种趋势,对词缀的选择有不同的影响。定量方法可以用来精确地评估竞争程度和竞争的多因素解决方案。基于对大量衍生词的统计分析,可以准确地描述词缀竞争,帮助我们更好地理解构词法中的竞争机制。
{"title":"Affix rivalry: Theoretical and methodological challenges","authors":"R. Huyghe, Rossella Varvara","doi":"10.3366/word.2023.0218","DOIUrl":"https://doi.org/10.3366/word.2023.0218","url":null,"abstract":"Affix rivalry is a key element in the organization and evolution of derivational systems. Its study provides insight into word-formation semantics, morphological change, productivity, lexical variation, and the many-to-many relationship between form and meaning. Affix rivalry is characterized by its gradient nature and the multiplicity of factors that can be involved in its resolution. On the one hand, rival affixes can be regarded as more or less competing depending on their semantic similarity and on the proportion of semantic functions they have in common. On the other hand, the distinction between rival affixes can rely on different linguistic properties, including not only semantic, but also phonological, morphological, syntactic, stylistic, and sociolinguistic properties. Differences are observed as tendencies and have a variable influence on affix selection. Quantitative methods can be used to precisely assess degrees of rivalry and multifactorial resolution of competition. Based on the statistical analysis of large sets of derivatives, they can provide an accurate description of affix rivalry and help us better understand the mechanisms of competition in word formation.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":" ","pages":""},"PeriodicalIF":0.6,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43534119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A quantitative approach to doublets in Russian denominal adjective construction 俄语名词形容词结构中对偶词的定量分析
IF 0.6 0 LANGUAGE & LINGUISTICS Pub Date : 2023-03-01 DOI: 10.3366/word.2023.0221
N. Bobkova, Fabio Montermini
This article is devoted to the rivalry between suffixes in the derivation of denominal adjectives in Russian. In particular, it proposes a large-scale quantitative analysis based on the Russian National Corpus. Its main goal is to contribute to identifying the properties that determine the choice of the suffix in these derivatives. Denominal adjectival derivation in Russian makes use of a wide variety of exponents. Most of them are phonological variants (extensions) of three main suffixes, - n-, - sk- and - Ov-. These three, which can be considered as basic, constitute the focus of our analysis. Two datasets were built for this research, a general one containing one of the suffixes above, and a more specific one containing doublets, which is to say, adjectives constructed on the same base with different suffixes. Data from the two sets are analyzed by means of various statistical models. Our results globally provide a quantitatively robust confirmation of observations previously made in the literature. In particular, we show that -n- occupies a specific position in the derivational system of Russian, as it is less productive and its derivatives are tendentially less transparent and more prone to display lexicalized meanings that point towards the qualitative pole of the qualitative-relational semantic spectrum. Moreover, - sk- and - Ov- are more likely to form doublets (be attached to the same bases), a further argument in favor of a greater homogeneity between them as opposed to - n-.
本文研究了俄语名形容词派生中后缀之间的竞争。特别地,它提出了基于俄罗斯国家语料库的大规模定量分析。其主要目标是帮助识别决定这些衍生词中后缀选择的属性。俄语中名称形容词的派生使用了各种各样的指数。它们大多是三个主要后缀- n-、- sk-和- Ov-的音系变体(扩展)。这三个可以被认为是基本的,构成了我们分析的重点。为这项研究建立了两个数据集,一个包含上述后缀之一的通用数据集,一个包含双重后缀的更具体的数据集,也就是说,在同一碱基上构建不同后缀的形容词。通过各种统计模型对两组数据进行分析。我们的研究结果在全球范围内为文献中先前的观察结果提供了定量的有力证实。特别是,我们表明-n-在俄语的衍生系统中占有特定的位置,因为它的产出较低,其衍生词往往不太透明,更倾向于显示指向定性关系语义谱的定性极的词汇化意义。此外,- sk-和- Ov-更有可能形成双键(附着在相同的碱基上),这是与- n-相反,它们之间具有更大同质性的进一步论据。
{"title":"A quantitative approach to doublets in Russian denominal adjective construction","authors":"N. Bobkova, Fabio Montermini","doi":"10.3366/word.2023.0221","DOIUrl":"https://doi.org/10.3366/word.2023.0221","url":null,"abstract":"This article is devoted to the rivalry between suffixes in the derivation of denominal adjectives in Russian. In particular, it proposes a large-scale quantitative analysis based on the Russian National Corpus. Its main goal is to contribute to identifying the properties that determine the choice of the suffix in these derivatives. Denominal adjectival derivation in Russian makes use of a wide variety of exponents. Most of them are phonological variants (extensions) of three main suffixes, - n-, - sk- and - Ov-. These three, which can be considered as basic, constitute the focus of our analysis. Two datasets were built for this research, a general one containing one of the suffixes above, and a more specific one containing doublets, which is to say, adjectives constructed on the same base with different suffixes. Data from the two sets are analyzed by means of various statistical models. Our results globally provide a quantitatively robust confirmation of observations previously made in the literature. In particular, we show that -n- occupies a specific position in the derivational system of Russian, as it is less productive and its derivatives are tendentially less transparent and more prone to display lexicalized meanings that point towards the qualitative pole of the qualitative-relational semantic spectrum. Moreover, - sk- and - Ov- are more likely to form doublets (be attached to the same bases), a further argument in favor of a greater homogeneity between them as opposed to - n-.","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":" ","pages":""},"PeriodicalIF":0.6,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47780321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Word Structure
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1