首页 > 最新文献

Slovenščina 2.0: empirical, applied and interdisciplinary research最新文献

英文 中文
Simpozij Obdobja 41
Pub Date : 2022-12-21 DOI: 10.4312/slo2.0.2022.1.136-140
Ina Poteko
Poročilo o 41. simpoziju Obdobja, ki sta ga med 16. in 18. novembrom 2022 organizirala Center za slovenščino kot drugi in tuji jezik (CSDTJ) in Oddelek za slovenistiko Filozofske fakultete Univerze v Ljubljani pod naslovom Na stičišču svetov: slovenščina kot drugi in tuji jezik.
{"title":"Simpozij Obdobja 41","authors":"Ina Poteko","doi":"10.4312/slo2.0.2022.1.136-140","DOIUrl":"https://doi.org/10.4312/slo2.0.2022.1.136-140","url":null,"abstract":"Poročilo o 41. simpoziju Obdobja, ki sta ga med 16. in 18. novembrom 2022 organizirala Center za slovenščino kot drugi in tuji jezik (CSDTJ) in Oddelek za slovenistiko Filozofske fakultete Univerze v Ljubljani pod naslovom Na stičišču svetov: slovenščina kot drugi in tuji jezik.\u0000","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115758071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From verbal to adjectival 从动词变为形容词
Pub Date : 2022-12-21 DOI: 10.4312/slo2.0.2022.1.65-97
G. Paulsen, Maria Tuulik, Ahti Lohk, Ene Vainik
This study addresses categorization issues related to adjective candidates in Estonian, focusing on the category of participles. The aim of the analysis was to assess the ranges of the prototypical adjective and to determine its degree of deviation on the prototypicality scale. The investigation was based on a group of validated adjectives – selected adjectives included in the Basic Estonian Dic­tionary – and two control groups of more and less lexicalized participles. We tested seven morphosyntactic corpus patterns characteristic of adjectives. The test patterns were based on the prototypical features of the adjective, as well as on observations made in the actual lexicographic analysis. To assess the sam­ple words and determine the significance of the test patterns from the point of view of defining adjectivity, we used deviation analysis. The results of this study can be applied to establish a measure of adjectivity for lexicographic judgments when distinguishing, for instance, lexicalized participles from regular ones.
本研究解决了爱沙尼亚语中形容词候选人的分类问题,重点是分词的类别。分析的目的是评估原型形容词的范围,并确定其在原型性量表上的偏差程度。这项调查是基于一组经过验证的形容词——爱沙尼亚基本词典中精选的形容词——和两组词汇化程度不同的分词。我们测试了形容词的七种形态句法语料库模式。测试模式是基于形容词的原型特征,以及在实际词典分析中所做的观察。为了从定义形容词的角度评估样本词并确定测试模式的意义,我们使用了偏差分析。这项研究的结果可以应用于建立词典判断的形容词测量,例如,在区分词汇化分词和规则分词时。
{"title":"From verbal to adjectival","authors":"G. Paulsen, Maria Tuulik, Ahti Lohk, Ene Vainik","doi":"10.4312/slo2.0.2022.1.65-97","DOIUrl":"https://doi.org/10.4312/slo2.0.2022.1.65-97","url":null,"abstract":"This study addresses categorization issues related to adjective candidates in Estonian, focusing on the category of participles. The aim of the analysis was to assess the ranges of the prototypical adjective and to determine its degree of deviation on the prototypicality scale. The investigation was based on a group of validated adjectives – selected adjectives included in the Basic Estonian Dic­tionary – and two control groups of more and less lexicalized participles. We tested seven morphosyntactic corpus patterns characteristic of adjectives. The test patterns were based on the prototypical features of the adjective, as well as on observations made in the actual lexicographic analysis. To assess the sam­ple words and determine the significance of the test patterns from the point of view of defining adjectivity, we used deviation analysis. The results of this study can be applied to establish a measure of adjectivity for lexicographic judgments when distinguishing, for instance, lexicalized participles from regular ones.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"312 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113991311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Raba jezikovnih virov in Kolokacijskega slovarja sodobne slovenščine med študenti prevajalstva
Pub Date : 2022-12-21 DOI: 10.4312/slo2.0.2022.1.23-64
Nataša Hirci, Martin Anton Grad
V prispevku se posvečamo študiji, ki predstavlja uporabniško izkušnjo in odnos študentov prevajalstva na Univerzi v Ljubljani do jezikovnih virov in prevajal­skih pripomočkov, saj se v času informacijske družbe od bodočih prevajalcev pričakuje, da so v kar največji meri digitalno osveščeni, predvsem pa seznanje­ni z relevantnimi viri informacij. Študija se v prvem delu osredotoča na odnos študentov do angleških in slovenskih jezikovnih virov in na strategije reševanja prevajalskih zagat pri prevajanju kolokacij ter rabo različnih relevantnih jezi­kovnih virov. Raziskava je v drugem delu osredinjena na opazovanje prevajal­skega procesa in rabo Kolokacijskega slovarja sodobne slovenščine (KSSS) pri prevajanju izbranih kolokacij iz angleščine v slovenščino. Rezultati študije so pokazali, da se študenti zavedajo prednosti reševanja prevodnih težav s po­močjo relevantnih jezikovnih virov, saj se je strategija vključevanja kolokacij­skega slovarja v prevajalski proces izkazala za relativno uspešno. Vendar pa je obenem pokazala, da je že v času študija nujno dodatno osveščati študente o naprednih funkcijah tega jezikovnega vira, predvsem pa spodbujati iskanje po relevantnih virih in kritično ovrednotenje najdenih informacij z zavedanjem, da je v prevajalskem procesu nujno poiskati takšne prevodne rešitve, ki so v danem kontekstu najustreznejše. 
{"title":"Raba jezikovnih virov in Kolokacijskega slovarja sodobne slovenščine med študenti prevajalstva","authors":"Nataša Hirci, Martin Anton Grad","doi":"10.4312/slo2.0.2022.1.23-64","DOIUrl":"https://doi.org/10.4312/slo2.0.2022.1.23-64","url":null,"abstract":"V prispevku se posvečamo študiji, ki predstavlja uporabniško izkušnjo in odnos študentov prevajalstva na Univerzi v Ljubljani do jezikovnih virov in prevajal­skih pripomočkov, saj se v času informacijske družbe od bodočih prevajalcev pričakuje, da so v kar največji meri digitalno osveščeni, predvsem pa seznanje­ni z relevantnimi viri informacij. Študija se v prvem delu osredotoča na odnos študentov do angleških in slovenskih jezikovnih virov in na strategije reševanja prevajalskih zagat pri prevajanju kolokacij ter rabo različnih relevantnih jezi­kovnih virov. Raziskava je v drugem delu osredinjena na opazovanje prevajal­skega procesa in rabo Kolokacijskega slovarja sodobne slovenščine (KSSS) pri prevajanju izbranih kolokacij iz angleščine v slovenščino. Rezultati študije so pokazali, da se študenti zavedajo prednosti reševanja prevodnih težav s po­močjo relevantnih jezikovnih virov, saj se je strategija vključevanja kolokacij­skega slovarja v prevajalski proces izkazala za relativno uspešno. Vendar pa je obenem pokazala, da je že v času študija nujno dodatno osveščati študente o naprednih funkcijah tega jezikovnega vira, predvsem pa spodbujati iskanje po relevantnih virih in kritično ovrednotenje najdenih informacij z zavedanjem, da je v prevajalskem procesu nujno poiskati takšne prevodne rešitve, ki so v danem kontekstu najustreznejše.\u0000 ","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114583684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Okrogla miza »Jezikovna ideologija, poklicni stereotipi ter primeri dobre prakse lektorjev in jezikovnih svetovalcev«
Pub Date : 2022-12-21 DOI: 10.4312/slo2.0.2022.1.98-125
Matejka Grgič, Milica Antić Gaber, Vojko Gorjanc, Erika Kržišnik, Tina Lengar Verovnik, Eva Vrtačič
Transkripcija okrogle mize na 2. kongresu Lektorskega društva Slovenije oktobra 2022.
2022 年 10 月斯洛文尼亚讲师协会第二届大会圆桌会议记录。
{"title":"Okrogla miza »Jezikovna ideologija, poklicni stereotipi ter primeri dobre prakse lektorjev in jezikovnih svetovalcev«","authors":"Matejka Grgič, Milica Antić Gaber, Vojko Gorjanc, Erika Kržišnik, Tina Lengar Verovnik, Eva Vrtačič","doi":"10.4312/slo2.0.2022.1.98-125","DOIUrl":"https://doi.org/10.4312/slo2.0.2022.1.98-125","url":null,"abstract":"Transkripcija okrogle mize na 2. kongresu Lektorskega društva Slovenije oktobra 2022.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126668282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spletna orodja za slovenščino in tuji študenti Univerze v Ljubljani 卢布尔雅那大学斯洛文尼亚学生和外国学生的在线工具
Pub Date : 2021-12-29 DOI: 10.4312/slo2.0.2021.2.100-125
Mojca Stritar Kučuk
Redno vpisani tuji študenti Univerze v Ljubljani, ki se v prvem letu študija v okviru modula Leto plus učijo slovensko, se v drugem semestru na posebni delavnici podrobneje spoznajo s spletnimi jezikovnimi viri in tehnologijami za slovenščino. V prispevku je opisana izvedba te delavnice v študijskem letu 2019/20, ko je zaradi pandemije koronavirusa potekala na daljavo, v obliki interaktivnih videoposnetkov z nalogami za preverjanje razumevanja snovi. Drugi del prispevka se osredotoča na mnenje študentov o tovrstnih jezikovnih virih. S spletno anketo sem analizirala stališča in izkušnje študentov dveh generacij: študenti generacije 2018/19 so spletna orodja spoznavali v razredu, študenti generacije 2019/20 pa na daljavo. Sodeč po rezultatih ankete, mlajša generacija študentov jezikovne vire na spletu uporablja pogosteje. Študenti obeh skupin najpogosteje uporabljajo Googlov Prevajalnik, ki mu sledijo Sloleks, pregibnik Besana, Fran in Pons. Kot argumente za uporabo teh virov izpostavljajo predvsem hitrost oz. enostavnost uporabe in navajenost na določen vir.
在卢布尔雅那大学学习斯洛文尼亚语的正规注册外国学生在第二学期的一个特别研讨会上学习了更多关于斯洛文尼亚语的在线语言资源和技术。本文介绍了该研讨会在2019/20学年的实施情况,当时由于冠状病毒大流行,该研讨会以带有理解任务的互动视频的形式远程举行。论文的第二部分侧重于学生对此类语言资源的看法。我利用在线调查分析了两代学生的观点和经验:2018/19 届学生在课堂上接触了在线工具,而 2019/20 届学生则通过远程接触了这些工具。调查结果显示,年轻一代学生使用在线语言资源的频率更高。这两个群体的学生最常使用谷歌翻译,其次是 Sloleks、Besana、Fran 和 Pons。 他们使用这些资源的主要理由是速度或易用性以及对特定资源的熟悉程度。
{"title":"Spletna orodja za slovenščino in tuji študenti Univerze v Ljubljani","authors":"Mojca Stritar Kučuk","doi":"10.4312/slo2.0.2021.2.100-125","DOIUrl":"https://doi.org/10.4312/slo2.0.2021.2.100-125","url":null,"abstract":"Redno vpisani tuji študenti Univerze v Ljubljani, ki se v prvem letu študija v okviru modula Leto plus učijo slovensko, se v drugem semestru na posebni delavnici podrobneje spoznajo s spletnimi jezikovnimi viri in tehnologijami za slovenščino. V prispevku je opisana izvedba te delavnice v študijskem letu 2019/20, ko je zaradi pandemije koronavirusa potekala na daljavo, v obliki interaktivnih videoposnetkov z nalogami za preverjanje razumevanja snovi. Drugi del prispevka se osredotoča na mnenje študentov o tovrstnih jezikovnih virih. S spletno anketo sem analizirala stališča in izkušnje študentov dveh generacij: študenti generacije 2018/19 so spletna orodja spoznavali v razredu, študenti generacije 2019/20 pa na daljavo. Sodeč po rezultatih ankete, mlajša generacija študentov jezikovne vire na spletu uporablja pogosteje. Študenti obeh skupin najpogosteje uporabljajo Googlov Prevajalnik, ki mu sledijo Sloleks, pregibnik Besana, Fran in Pons. Kot argumente za uporabo teh virov izpostavljajo predvsem hitrost oz. enostavnost uporabe in navajenost na določen vir.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"122 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134632139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stalnost, variantnost in modificirana raba frazemov v slovenskem jeziku in slovarjih
Pub Date : 2021-12-29 DOI: 10.4312/slo2.0.2021.2.71-99
Eva Trivunović
Prispevek prinaša pregled variant in modifikacij sedmih (iz)biblijskih frazemov v sodobni slovenščini ter njihove prisotnosti v sodobnem jeziku. Ugotovitve so primerjane z obravnavo frazemov v obstoječih slovarjih, kjer se kaže velik razkorak med slovarskim prikazom in stanjem, ki ga izkazuje korpusno gradivo. Za zanesljivejše ugotavljanje, v katerih primerih lahko govorimo o že ustaljeni variantnosti, so bili v raziskavi uporabljeni trije zvrstno različni korpusi: Gigafida 2.0, Janes in slWaC. Poleg ustaljenih variant so predstavljene neustaljene modifikacije, poseben poudarek je na prenovitvah, vendar se je jasno zastavljena tipologija mestoma izkazala za preveč togo, saj pri nekaterih mejnih primerih ni bilo mogoče nedvoumno ločiti ustaljenih variant od neprenovitvenih modifikacij ter neprenovitvenih modifikacij od prenovitvenih. Vsi izbrani frazemi in njihove prenovitve so najpogostejši v korpusu Janes, kar dokazuje nujnost vključevanja večjega števila raznovrstnih korpusov v jezikoslovne raziskave.
本文概述了当代斯洛文尼亚语中七个(等)圣经短语的变体和修改及其在现代语言中的存在。研究结果与现有词典中对这些短语的处理进行了比较,发现词典中的表述与语料库中的语料存在很大差距。为了更可靠地确定在哪些情况下我们可以谈论已经确立的变体,本研究使用了三个不同体裁的语料库:Gigafida 2.0、Janes 和 slWaC。除既定变体外,还介绍了非既定修改,特别关注更新,但事实证明,明确界定的类型学在某些地方过于死板,因为在一些边缘情况下,无法明确区分既定变体和非既定修改,以及非既定修改和更新。所有被选中的短语及其改写在 Janes 语料库中都是最常见的,这说明在语言学研究中有必要纳入更多的异质语料库。
{"title":"Stalnost, variantnost in modificirana raba frazemov v slovenskem jeziku in slovarjih","authors":"Eva Trivunović","doi":"10.4312/slo2.0.2021.2.71-99","DOIUrl":"https://doi.org/10.4312/slo2.0.2021.2.71-99","url":null,"abstract":"Prispevek prinaša pregled variant in modifikacij sedmih (iz)biblijskih frazemov v sodobni slovenščini ter njihove prisotnosti v sodobnem jeziku. Ugotovitve so primerjane z obravnavo frazemov v obstoječih slovarjih, kjer se kaže velik razkorak med slovarskim prikazom in stanjem, ki ga izkazuje korpusno gradivo. Za zanesljivejše ugotavljanje, v katerih primerih lahko govorimo o že ustaljeni variantnosti, so bili v raziskavi uporabljeni trije zvrstno različni korpusi: Gigafida 2.0, Janes in slWaC. Poleg ustaljenih variant so predstavljene neustaljene modifikacije, poseben poudarek je na prenovitvah, vendar se je jasno zastavljena tipologija mestoma izkazala za preveč togo, saj pri nekaterih mejnih primerih ni bilo mogoče nedvoumno ločiti ustaljenih variant od neprenovitvenih modifikacij ter neprenovitvenih modifikacij od prenovitvenih. Vsi izbrani frazemi in njihove prenovitve so najpogostejši v korpusu Janes, kar dokazuje nujnost vključevanja večjega števila raznovrstnih korpusov v jezikoslovne raziskave.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127067154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sociolingvistični posvet: aktualni sociolingvistični izzivi in prednostne raziskovalne tematike
Pub Date : 2021-12-29 DOI: 10.4312/slo2.0.2021.2.1-40
Maja Bitenc, Marko Stabej, Nataša Gliha Komac, Matejka Grgič, Monika Kalin Golob, K. Kenda-Jež, Albina Nečak Lük, Sonja Novak Lukanovič, Krištof Savski
Zapis posveta o aktualnih sociolingvističnih izzivih in prednostnih raziskovalnih tematikah, ki sta ga organizirala doc. dr. Maja Bitenc in red. prof. dr. Marko Stabej z Oddelka za slovenistiko in je potekal v ponedeljek, 27. 9. 2021, na Filozofski fakulteti Univerze v Ljubljani in s prenosom preko Zooma. V prvem delu so vabljene strokovnjakinje in strokovnjaki predstavili svoje poglede ob izhodiščnih vprašanjih, v drugem je sledila razprava vseh sodelujočih. Zapis posnetka so govornice in govorniki uredili po lastni presoji, načeloma s čim manj intervencijami, iz razprave pa so za branje prilagojene in objavljene vsebinsko tehtnejše replike.
由斯洛文尼亚语研究系的 Maja Bitenc 博士副教授和 Marko Stabej 博士副教授组织,于 2021 年 9 月 27 日星期一在卢布尔雅那大学文学院举行的关于当前社会语言学挑战和优先研究课题的讨论实录,并通过 Zoom 进行了流媒体传输。在第一部分,受邀专家就最初的问题发表了看法,随后所有与会者进行了讨论。录音誊本由发言者自行编辑,原则上尽量减少发言,辩论中更具实质性的反驳意见经改编后供阅读和发表。
{"title":"Sociolingvistični posvet: aktualni sociolingvistični izzivi in prednostne raziskovalne tematike","authors":"Maja Bitenc, Marko Stabej, Nataša Gliha Komac, Matejka Grgič, Monika Kalin Golob, K. Kenda-Jež, Albina Nečak Lük, Sonja Novak Lukanovič, Krištof Savski","doi":"10.4312/slo2.0.2021.2.1-40","DOIUrl":"https://doi.org/10.4312/slo2.0.2021.2.1-40","url":null,"abstract":"Zapis posveta o aktualnih sociolingvističnih izzivih in prednostnih raziskovalnih tematikah, ki sta ga organizirala doc. dr. Maja Bitenc in red. prof. dr. Marko Stabej z Oddelka za slovenistiko in je potekal v ponedeljek, 27. 9. 2021, na Filozofski fakulteti Univerze v Ljubljani in s prenosom preko Zooma. V prvem delu so vabljene strokovnjakinje in strokovnjaki predstavili svoje poglede ob izhodiščnih vprašanjih, v drugem je sledila razprava vseh sodelujočih. Zapis posnetka so govornice in govorniki uredili po lastni presoji, načeloma s čim manj intervencijami, iz razprave pa so za branje prilagojene in objavljene vsebinsko tehtnejše replike.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126862833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collocation ranking: frequency vs semantics 搭配排序:频率vs语义
Pub Date : 2021-12-29 DOI: 10.4312/slo2.0.2021.2.41-70
Nikola Ljubesic, N. Logar, Iztok Kosem
Collocations play a very important role in language description, especially in identifying meanings of words. Modern lexicography’s inevitable part of meaning deduction are lists of collocates ranked by some statistical measurement. In the paper, we present a comparison between two approaches to the ranking of collocates: (a) the logDice method, which is dominantly used and frequency-based, and (b) the fastText word embeddings method, which is new and semantic-based. The comparison was made on two Slovene datasets, one representing general language headwords and their collocates, and the other representing headwords and their collocates extracted from a language for special purposes corpus. In the experiment, two methods were used: for the quantitative part of the evaluation, we used supervised machine learning with the area-under-the-curve (AUC) ROC score and support-vector machines (SVMs) algorithm, and in the qualitative part the ranking results of the two methods were evaluated by lexicographers. The results were somewhat inconsistent; while the quantitative evaluation confirmed that the machine-learning-based approach produced better collocate ranking results than the frequency-based one, lexicographers in most cases considered the listings of collocates of both methods very similar.
搭配在语言描述中起着非常重要的作用,尤其是对词义的识别。现代词典编纂中不可避免的意义演绎部分是通过一些统计测量来排列搭配列表。在本文中,我们提出了两种方法之间的比较:(a) logDice方法,这是主要使用的和基于频率的方法,(b) fastText词嵌入方法,这是一种新的和基于语义的方法。对两个斯洛文尼亚语数据集进行了比较,其中一个数据集代表一般语言词首词及其搭配,另一个数据集代表从特殊用途语言语料库中提取的词首词及其搭配。在实验中,我们使用了两种方法:对于定量部分的评估,我们使用了带有曲线下面积(AUC) ROC评分和支持向量机(svm)算法的监督机器学习,在定性部分,两种方法的排名结果由词典编纂者进行评估。结果有些不一致;虽然定量评估证实,基于机器学习的方法比基于频率的方法产生了更好的搭配排名结果,但词典编纂者在大多数情况下认为这两种方法的搭配列表非常相似。
{"title":"Collocation ranking: frequency vs semantics","authors":"Nikola Ljubesic, N. Logar, Iztok Kosem","doi":"10.4312/slo2.0.2021.2.41-70","DOIUrl":"https://doi.org/10.4312/slo2.0.2021.2.41-70","url":null,"abstract":"Collocations play a very important role in language description, especially in identifying meanings of words. Modern lexicography’s inevitable part of meaning deduction are lists of collocates ranked by some statistical measurement. In the paper, we present a comparison between two approaches to the ranking of collocates: (a) the logDice method, which is dominantly used and frequency-based, and (b) the fastText word embeddings method, which is new and semantic-based. The comparison was made on two Slovene datasets, one representing general language headwords and their collocates, and the other representing headwords and their collocates extracted from a language for special purposes corpus. In the experiment, two methods were used: for the quantitative part of the evaluation, we used supervised machine learning with the area-under-the-curve (AUC) ROC score and support-vector machines (SVMs) algorithm, and in the qualitative part the ranking results of the two methods were evaluated by lexicographers. The results were somewhat inconsistent; while the quantitative evaluation confirmed that the machine-learning-based approach produced better collocate ranking results than the frequency-based one, lexicographers in most cases considered the listings of collocates of both methods very similar.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130593069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Mednarodni konferenci eLex (5.–7. julij 2021) in EURALEX (7.–9. september 2021)
Pub Date : 2021-12-29 DOI: 10.4312/slo2.0.2021.2.126-129
Magdalena Gapsa
Poročilo o dveh pomembnih leksikografskih konferencah, in sicer o sedmi bienalni konferenci združenja Electronic lexicography in the 21st century (na kratko: eLex), ki je potekala med 5. in 7. julijem 2021, ter devetnajsti bienalni konferenci Evropskega leksikografskega združenja (European Association for Lexicography, EURALEX), ki je potekala med 7. in 9. septembrom 2021.
{"title":"Mednarodni konferenci eLex (5.–7. julij 2021) in EURALEX (7.–9. september 2021)","authors":"Magdalena Gapsa","doi":"10.4312/slo2.0.2021.2.126-129","DOIUrl":"https://doi.org/10.4312/slo2.0.2021.2.126-129","url":null,"abstract":"Poročilo o dveh pomembnih leksikografskih konferencah, in sicer o sedmi bienalni konferenci združenja Electronic lexicography in the 21st century (na kratko: eLex), ki je potekala med 5. in 7. julijem 2021, ter devetnajsti bienalni konferenci Evropskega leksikografskega združenja (European Association for Lexicography, EURALEX), ki je potekala med 7. in 9. septembrom 2021.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115032306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Slovene and Croatian word embeddings in terms of gender occupational analogies 斯洛文尼亚语和克罗地亚语在性别职业类比方面的词嵌入
Pub Date : 2021-07-06 DOI: 10.4312/SLO2.0.2021.1.26-59
Matej Ulčar, Anka Supej, M. Robnik-Sikonja, Senja Pollak
In recent years, the use of deep neural networks and dense vector embeddings for text representation have led to excellent results in the field of computational understanding of natural language. It has also been shown that word embeddings often capture gender, racial and other types of bias. The article focuses on evaluating Slovene and Croatian word embeddings in terms of gender bias using word analogy calculations. We compiled a list of masculine and feminine nouns for occupations in Slovene and evaluated the gender bias of fastText, word2vec and ELMo embeddings with different configurations and different approaches to analogy calculations. The lowest occupational gender bias was observed with the fastText embeddings. Similarly, we compared different fastText embeddings on Croatian occupational analogies.
近年来,使用深度神经网络和密集向量嵌入进行文本表示在自然语言的计算理解领域取得了优异的成绩。研究还表明,词嵌入通常会捕捉到性别、种族和其他类型的偏见。本文着重于评估斯洛文尼亚语和克罗地亚语的词嵌入方面的性别偏见使用词类比计算。我们编制了斯洛文尼亚语职业的阳性和阴性名词列表,并评估了fastText、word2vec和ELMo嵌入在不同配置和不同类比计算方法下的性别偏见。快速文本嵌入的职业性别偏见最低。同样,我们比较了克罗地亚职业类比的不同fastText嵌入。
{"title":"Slovene and Croatian word embeddings in terms of gender occupational analogies","authors":"Matej Ulčar, Anka Supej, M. Robnik-Sikonja, Senja Pollak","doi":"10.4312/SLO2.0.2021.1.26-59","DOIUrl":"https://doi.org/10.4312/SLO2.0.2021.1.26-59","url":null,"abstract":"In recent years, the use of deep neural networks and dense vector embeddings for text representation have led to excellent results in the field of computational understanding of natural language. It has also been shown that word embeddings often capture gender, racial and other types of bias. The article focuses on evaluating Slovene and Croatian word embeddings in terms of gender bias using word analogy calculations. We compiled a list of masculine and feminine nouns for occupations in Slovene and evaluated the gender bias of fastText, word2vec and ELMo embeddings with different configurations and different approaches to analogy calculations. The lowest occupational gender bias was observed with the fastText embeddings. Similarly, we compared different fastText embeddings on Croatian occupational analogies.","PeriodicalId":371035,"journal":{"name":"Slovenščina 2.0: empirical, applied and interdisciplinary research","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120962954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Slovenščina 2.0: empirical, applied and interdisciplinary research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1