{"title":"利用向量空间模型探索印尼语前缀PE-和PEN-之间的语义差异","authors":"Karlina Denistia, E. Shafaei-Bajestan, R. Baayen","doi":"10.1515/cllt-2020-0023","DOIUrl":null,"url":null,"abstract":"Abstract Indonesian has two prefixes, PE- and PEN-, that are similar in form and meaning, but are probably not allomorphs. In this study, we applied a distributional vector space model to clarify whether these prefixes have discriminable semantics. Comparisons of pairs of words within and across morphologically defined sets of words revealed that cosine similarities of pairs consisting of a word with PE- and a word with PEN- were reduced compared to pairs of only PE- words, or of only PEN- words. Furthermore, nouns with PE- were more similar to their base words than was the case for words with PEN-. The specialized use of PE- for words denoting agents, and the specialized use of PEN- for denoting instruments, was also visible in the semantic vector space. These differences in the semantics of PE- and PEN- thus provide further quantitative support for the independent status of PE- as opposed to PEN-.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":"18 1","pages":"573 - 598"},"PeriodicalIF":1.0000,"publicationDate":"2021-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2020-0023","citationCount":"6","resultStr":"{\"title\":\"Exploring semantic differences between the Indonesian prefixes PE- and PEN- using a vector space model\",\"authors\":\"Karlina Denistia, E. Shafaei-Bajestan, R. Baayen\",\"doi\":\"10.1515/cllt-2020-0023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Indonesian has two prefixes, PE- and PEN-, that are similar in form and meaning, but are probably not allomorphs. In this study, we applied a distributional vector space model to clarify whether these prefixes have discriminable semantics. Comparisons of pairs of words within and across morphologically defined sets of words revealed that cosine similarities of pairs consisting of a word with PE- and a word with PEN- were reduced compared to pairs of only PE- words, or of only PEN- words. Furthermore, nouns with PE- were more similar to their base words than was the case for words with PEN-. The specialized use of PE- for words denoting agents, and the specialized use of PEN- for denoting instruments, was also visible in the semantic vector space. These differences in the semantics of PE- and PEN- thus provide further quantitative support for the independent status of PE- as opposed to PEN-.\",\"PeriodicalId\":45605,\"journal\":{\"name\":\"Corpus Linguistics and Linguistic Theory\",\"volume\":\"18 1\",\"pages\":\"573 - 598\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2021-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1515/cllt-2020-0023\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Corpus Linguistics and Linguistic Theory\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1515/cllt-2020-0023\",\"RegionNum\":2,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Corpus Linguistics and Linguistic Theory","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1515/cllt-2020-0023","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
Exploring semantic differences between the Indonesian prefixes PE- and PEN- using a vector space model
Abstract Indonesian has two prefixes, PE- and PEN-, that are similar in form and meaning, but are probably not allomorphs. In this study, we applied a distributional vector space model to clarify whether these prefixes have discriminable semantics. Comparisons of pairs of words within and across morphologically defined sets of words revealed that cosine similarities of pairs consisting of a word with PE- and a word with PEN- were reduced compared to pairs of only PE- words, or of only PEN- words. Furthermore, nouns with PE- were more similar to their base words than was the case for words with PEN-. The specialized use of PE- for words denoting agents, and the specialized use of PEN- for denoting instruments, was also visible in the semantic vector space. These differences in the semantics of PE- and PEN- thus provide further quantitative support for the independent status of PE- as opposed to PEN-.
期刊介绍:
Corpus Linguistics and Linguistic Theory (CLLT) is a peer-reviewed journal publishing high-quality original corpus-based research focusing on theoretically relevant issues in all core areas of linguistic research, or other recognized topic areas. It provides a forum for researchers from different theoretical backgrounds and different areas of interest that share a commitment to the systematic and exhaustive analysis of naturally occurring language. Contributions from all theoretical frameworks are welcome but they should be addressed at a general audience and thus be explicit about their assumptions and discovery procedures and provide sufficient theoretical background to be accessible to researchers from different frameworks. Topics Corpus Linguistics Quantitative Linguistics Phonology Morphology Semantics Syntax Pragmatics.