Mathematical modeling of the frequencies of letters for their occurrence in corpora, words (types) and in the initial positions of words of corpora
H. Pande
Glottotheory, vol. 12, no. 1, pp. 57-69. Published 2020-07-27. DOI: 10.1515/glot-2020-2010 (https://doi.org/10.1515/glot-2020-2010)
Abstract: This paper attempts to determine a mathematical model for the frequencies of occurrence of letters in corpora, in the word types of those corpora, and in the initial positions of their words, taking both word tokens and word types into account. The study uses corpora written in American English, selected from the Open American National Corpus (OANC).
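The abstract distinguishes several ways of counting letters: over all running text (tokens), over the distinct word types, and over word-initial positions for both tokens and types. The sketch below only illustrates that distinction on a toy string; it is not the author's procedure or model, and the function name `letter_counts`, the lowercase a-z tokenization, and the sample sentence are assumptions made here for illustration.

```python
import re
from collections import Counter


def letter_counts(text):
    """Count letters under four schemes (illustrative, not the paper's method)."""
    # Assumption: a word is a maximal run of the letters a-z after lowercasing.
    tokens = re.findall(r"[a-z]+", text.lower())
    types = set(tokens)  # distinct word forms
    return {
        # every letter of every running word
        "corpus": Counter("".join(tokens)),
        # every letter of each distinct word, counted once per type
        "types": Counter("".join(types)),
        # first letter of every running word
        "initial_tokens": Counter(word[0] for word in tokens),
        # first letter of each distinct word
        "initial_types": Counter(word[0] for word in types),
    }


counts = letter_counts("The cat and the hat")
for scheme, counter in counts.items():
    print(scheme, sorted(counter.items()))
```

In this toy sentence the repeated word "the" adds to the token-based counts but contributes only once to the type-based counts, which is why the four frequency tables can differ.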
About the journal:
The foci of Glottotheory are: observations and descriptions of all aspects of language and text phenomena, including areas such as psycholinguistics, sociolinguistics, dialectology, and pragmatics, on all levels of linguistic analysis; applications of methods, models, or findings from quantitative linguistics to problems of natural language processing, language teaching, documentation, and information retrieval; methodological problems of linguistic measurement, model construction, sampling, and test theory; and epistemological issues such as the explanation of language and text phenomena, contributions to theory construction, systems theory, and philosophy of science. The journal considers itself a platform for dialogue between quantitative and qualitative linguistics.