Pub Date : 2023-09-01DOI: 10.5117/tet2023.1.004.huis
John L.A. Huisman, Roeland van Hout
Recent work in dialectometry has proposed the use of linear mixed-effects regression (LMER) for analysing full distance matrices. While the outcomes are promising, work is needed to confirm that such outcomes are valid, given that the analysis of distance matrices using this method is not established. The current contribution provides a supporting framework for this approach by testing its validity through a series of simulated datasets. We analysed the generated data using LMER, and compared its performance to that of the well-established multiple regression on distance matrices (MRM) approach. We find that the LMER results are on par with—and sometimes even exceed—the results obtained from MRM. The potential to include random effects makes LMER a more powerful tool than MRM to examine a linguistic area as a whole, with all pairwise comparisons included, making it an ideal candidate for big data analyses that are becoming more prevalent with the ongoing digitisation of large dialect databases.
{"title":"The validity of mixed-effects regression for analysing linguistic distance matrices: a simulation study","authors":"John L.A. Huisman, Roeland van Hout","doi":"10.5117/tet2023.1.004.huis","DOIUrl":"https://doi.org/10.5117/tet2023.1.004.huis","url":null,"abstract":"Recent work in dialectometry has proposed the use of linear mixed-effects regression (LMER) for analysing full distance matrices. While the outcomes are promising, work is needed to confirm that such outcomes are valid, given that the analysis of distance matrices using this method is not established. The current contribution provides a supporting framework for this approach by testing its validity through a series of simulated datasets. We analysed the generated data using LMER, and compared its performance to that of the well-established multiple regression on distance matrices (MRM) approach. We find that the LMER results are on par with—and sometimes even exceed—the results obtained from MRM. The potential to include random effects makes LMER a more powerful tool than MRM to examine a linguistic area as a whole, with all pairwise comparisons included, making it an ideal candidate for big data analyses that are becoming more prevalent with the ongoing digitisation of large dialect databases.","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135433978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.5117/tet2023.1.003.gill
Peter Gilles
Amsterdam University Press is a leading publisher of academic books, journals and textbooks in the Humanities and Social Sciences. Our aim is to make current research available to scholars, students, innovators, and the general public. AUP stands for scholarly excellence, global presence, and engagement with the international academic community.
{"title":"Regional variation, internal change and language contact in Luxembourgish: results from an app-based language survey1","authors":"Peter Gilles","doi":"10.5117/tet2023.1.003.gill","DOIUrl":"https://doi.org/10.5117/tet2023.1.003.gill","url":null,"abstract":"Amsterdam University Press is a leading publisher of academic books, journals and textbooks in the Humanities and Social Sciences. Our aim is to make current research available to scholars, students, innovators, and the general public. AUP stands for scholarly excellence, global presence, and engagement with the international academic community.","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135433980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.5117/tet2023.1.001.brei
Anne Breitbarth, Anne-Sophie Ghyselen, Roeland van Hout, Martijn Wieling
Amsterdam University Press is a leading publisher of academic books, journals and textbooks in the Humanities and Social Sciences. Our aim is to make current research available to scholars, students, innovators, and the general public. AUP stands for scholarly excellence, global presence, and engagement with the international academic community.
{"title":"Big data: New perspectives for research on language variation and change","authors":"Anne Breitbarth, Anne-Sophie Ghyselen, Roeland van Hout, Martijn Wieling","doi":"10.5117/tet2023.1.001.brei","DOIUrl":"https://doi.org/10.5117/tet2023.1.001.brei","url":null,"abstract":"Amsterdam University Press is a leading publisher of academic books, journals and textbooks in the Humanities and Social Sciences. Our aim is to make current research available to scholars, students, innovators, and the general public. AUP stands for scholarly excellence, global presence, and engagement with the international academic community.","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135433975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.5117/tet2023.1.002.buur
Raoul Sergio Samuel Jan Buurke, Martijn Wieling
Large phonetic corpora are frequently used to investigate language variation and change in dialects, but these corpora are often constructed by many researchers in a collaborative effort. This typically results in inter-transcriber issues that may impact the reliability of analyses using these data. This problem is exacerbated when multiple phonetic corpora are compared when investigating real time dialect change. In this study, we therefore propose a method to automatically and iteratively merge phonetic symbols used in the transcriptions to obtain a more coarse-grained, but better comparable, phonetic transcription. Our approach is evaluated using two large phonetic Netherlandic dialect corpora in an attempt to estimate sound change in the area in the 20th century. The results are discussed in the context of the available literature about dialect change in the Netherlandic area.
{"title":"Sound Change Estimation in Netherlandic Regional Languages: Reducing Inter-Transcriber Variability in Dialect Corpora","authors":"Raoul Sergio Samuel Jan Buurke, Martijn Wieling","doi":"10.5117/tet2023.1.002.buur","DOIUrl":"https://doi.org/10.5117/tet2023.1.002.buur","url":null,"abstract":"Large phonetic corpora are frequently used to investigate language variation and change in dialects, but these corpora are often constructed by many researchers in a collaborative effort. This typically results in inter-transcriber issues that may impact the reliability of analyses using these data. This problem is exacerbated when multiple phonetic corpora are compared when investigating real time dialect change. In this study, we therefore propose a method to automatically and iteratively merge phonetic symbols used in the transcriptions to obtain a more coarse-grained, but better comparable, phonetic transcription. Our approach is evaluated using two large phonetic Netherlandic dialect corpora in an attempt to estimate sound change in the area in the 20th century. The results are discussed in the context of the available literature about dialect change in the Netherlandic area.","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135433972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.5117/tet2023.1.005.pijp
Dirk Pijpops, Stefano De Pascale, Freek Van de Velde, Eline Zenner
This article illustrates some of the opportunities and challenges of pursuing a big data approach in linguistic research. To do so, we investigate the diffusion of the loan verb pimpen ‘to fancify’ in Dutch based on Twitter data. First, we focus on the derivations of the verb (e.g.: terugpimpen ‘to pimp back’, herpimpen ‘to repimp’, etc.) and plot the diversity of these forms through time, using the Chao-Wang-Jost estimation of Shannon entropy. We follow this up with an alternation study that compares pimpen not only to its ‘native’ alternative opleuken, but also its most frequent derivation oppimpen, using multinomial regression. It is found that, while pimpen’s early expansion in Dutch has proceeded at breakneck speed, resulting e.g. in a plethora of derivations that has so far gone undetected, its current momentum seems to be waning.
{"title":"Big Pimpin’. Een big data-benadering van de verspreiding van het leenwoord pimpen in het Nederlands","authors":"Dirk Pijpops, Stefano De Pascale, Freek Van de Velde, Eline Zenner","doi":"10.5117/tet2023.1.005.pijp","DOIUrl":"https://doi.org/10.5117/tet2023.1.005.pijp","url":null,"abstract":"This article illustrates some of the opportunities and challenges of pursuing a big data approach in linguistic research. To do so, we investigate the diffusion of the loan verb pimpen ‘to fancify’ in Dutch based on Twitter data. First, we focus on the derivations of the verb (e.g.: terugpimpen ‘to pimp back’, herpimpen ‘to repimp’, etc.) and plot the diversity of these forms through time, using the Chao-Wang-Jost estimation of Shannon entropy. We follow this up with an alternation study that compares pimpen not only to its ‘native’ alternative opleuken, but also its most frequent derivation oppimpen, using multinomial regression. It is found that, while pimpen’s early expansion in Dutch has proceeded at breakneck speed, resulting e.g. in a plethora of derivations that has so far gone undetected, its current momentum seems to be waning.","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135433966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.5117/tet2023.1.006.vand
Iris Van de Voorde, Gijsbert Rutten, Rik Vosters, Marijke van der Wal, Wim Vandenbussche
In this contribution, we present the Historical Corpus of Dutch (HCD), a new multi-genre, diachronic corpus of Early and Late Modern Dutch (ca. 1550-1850). It consists of a digitised collection of handwritten administrative texts (e.g. town council meeting reports), handwritten ego-documents (e.g. diaries and travelogues), and printed pamphlets (e.g. of a political or religious nature). The corpus is also balanced between northern and southern material, with data from the provinces of Holland and Zeeland for the North, and from Flanders and Brabant for the South. After having discussed its structure and composition, we will illustrate the value of the new corpus with a number of smaller case studies. Based on our experiences with the corpus, we will conclude by launching a plea for historical corpus building not to focus too much on the quantity of data (‘big data’), but rather shift attention to data quality.
{"title":"Historical Corpus of Dutch: A new multi-genre corpus of Early and Late Modern Dutch","authors":"Iris Van de Voorde, Gijsbert Rutten, Rik Vosters, Marijke van der Wal, Wim Vandenbussche","doi":"10.5117/tet2023.1.006.vand","DOIUrl":"https://doi.org/10.5117/tet2023.1.006.vand","url":null,"abstract":"In this contribution, we present the Historical Corpus of Dutch (HCD), a new multi-genre, diachronic corpus of Early and Late Modern Dutch (ca. 1550-1850). It consists of a digitised collection of handwritten administrative texts (e.g. town council meeting reports), handwritten ego-documents (e.g. diaries and travelogues), and printed pamphlets (e.g. of a political or religious nature). The corpus is also balanced between northern and southern material, with data from the provinces of Holland and Zeeland for the North, and from Flanders and Brabant for the South. After having discussed its structure and composition, we will illustrate the value of the new corpus with a number of smaller case studies. Based on our experiences with the corpus, we will conclude by launching a plea for historical corpus building not to focus too much on the quantity of data (‘big data’), but rather shift attention to data quality.","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135433660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-01DOI: 10.5117/tet2022.2.003.verg
Philip C. Vergeiner
{"title":"Variation und Wandel des postvokalischen r","authors":"Philip C. Vergeiner","doi":"10.5117/tet2022.2.003.verg","DOIUrl":"https://doi.org/10.5117/tet2022.2.003.verg","url":null,"abstract":"","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81602154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-01DOI: 10.5117/tet2022.2.001.brei
Anne Breitbarth
{"title":"Continuity, change, and linguistic recycling in Flemish dialects: Negation, polarity focus, and mirativity1","authors":"Anne Breitbarth","doi":"10.5117/tet2022.2.001.brei","DOIUrl":"https://doi.org/10.5117/tet2022.2.001.brei","url":null,"abstract":"","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77303840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-01DOI: 10.5117/tet2022.2.002.buur
Raoul Sergio Samuel Jan Buurke, Hedwig Sekeres, W. Heeringa, Remco Knooihuizen, Martijn Wieling
{"title":"Estimating the level and direction of aggregated sound change of dialects in the northern Netherlands1","authors":"Raoul Sergio Samuel Jan Buurke, Hedwig Sekeres, W. Heeringa, Remco Knooihuizen, Martijn Wieling","doi":"10.5117/tet2022.2.002.buur","DOIUrl":"https://doi.org/10.5117/tet2022.2.002.buur","url":null,"abstract":"","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79230977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-08-01DOI: 10.5117/tet2022.1.002.phei
Jeffrey Pheiff, L. Schäfer
{"title":"komen ‘come’ + Verb of Movement","authors":"Jeffrey Pheiff, L. Schäfer","doi":"10.5117/tet2022.1.002.phei","DOIUrl":"https://doi.org/10.5117/tet2022.1.002.phei","url":null,"abstract":"","PeriodicalId":30675,"journal":{"name":"Taal en Tongval Language Variation in the Low Countries","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73571771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}