{"title":"The Relational Parts of Speech in Text Analysis for Definition Detection, for Romanian Language","authors":"Cristian Niculiță, L. Dumitriu","doi":"10.1109/ROEDUNET.2019.8909642","DOIUrl":null,"url":null,"abstract":"This article presents a method for detecting definitions in text, as well as identifying their formal parts, using the predictive capacity of certain relational parts of speech, mainly prepositions but also including words from other lexical categories such as adverbs, pronouns and conjunctions. The method is specifically tailored for Romanian language. Based on their predicting capacity, the relational words can be distributed into four categories that influence the way the definitions are detected. Using an annotated corpus of definitions, a series of statistics is extracted and used to formulate measures describing the impact and usage patterns of these relational words.","PeriodicalId":309683,"journal":{"name":"2019 18th RoEduNet Conference: Networking in Education and Research (RoEduNet)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 18th RoEduNet Conference: Networking in Education and Research (RoEduNet)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROEDUNET.2019.8909642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This article presents a method for detecting definitions in text, as well as identifying their formal parts, using the predictive capacity of certain relational parts of speech, mainly prepositions but also including words from other lexical categories such as adverbs, pronouns and conjunctions. The method is specifically tailored for Romanian language. Based on their predicting capacity, the relational words can be distributed into four categories that influence the way the definitions are detected. Using an annotated corpus of definitions, a series of statistics is extracted and used to formulate measures describing the impact and usage patterns of these relational words.