Slavic Corpus and Computational Linguistics
Dagmar Divjak, Serge Sharoff, Tomaž Erjavec
Journal of Slavic Linguistics. Published 2018-02-22. DOI: 10.1353/JSL.2017.0008 (https://doi.org/10.1353/JSL.2017.0008)
Citations: 5
Abstract
In this paper we focus on corpus-linguistic studies that address theoretical questions and on computational linguistic work on corpus annotation that makes corpora useful for linguistic analysis. First we discuss why the corpus linguistic approach was discredited by generative linguists in the second half of the 20th century, how it made a comeback through advances in computing and was finally adopted by usage-based linguistics at the beginning of the 21st century. Then we move on to an overview of necessary and common annotation layers and the issues that are encountered when performing automatic annotation, with special emphasis on Slavic languages. Finally we survey the types of research requiring corpora that Slavic linguists are involved in worldwide, and the resources they have at their disposal.
About the journal:
Journal of Slavic Linguistics, or JSL, is the official journal of the Slavic Linguistics Society. JSL publishes research articles and book reviews that address the description and analysis of Slavic languages and that are of general interest to linguists. Published papers deal with any aspect of synchronic or diachronic Slavic linguistics – phonetics, phonology, morphology, syntax, semantics, or pragmatics – which raises substantive problems of broad theoretical concern or proposes significant descriptive generalizations. Comparative studies and formal analyses are also published. Different theoretical orientations are represented in the journal. One volume (two issues) is published per year, ca. 360 pp.