Korpusomat – a Tool for Creating Searchable Morphosyntactically Tagged Corpora
Witold Kieraś, Ł. Kobyliński, Maciej Ogrodniczuk
Computational Methods in Science and Technology, pp. 21-27, published 2018-03-31
DOI: 10.12921/CMST.2018.0000005 (https://doi.org/10.12921/CMST.2018.0000005)
The paper presents Korpusomat, a web application for building annotated corpora for corpus linguistic studies. Korpusomat combines existing tools, such as a morphological analyser, a tagger and a corpus search engine, and provides an easy-to-use environment for building corpora that are technically compatible with the National Corpus of Polish from almost any text, including texts in binary formats. We present the current state of the project, its features and functionalities, and some plans and tasks for future development. A usage example is also presented.