D. Mati, Jaumin Ajdari, Bujar Raufi, Mentor Hamiti, B. Selimi
{"title":"A Systematic Mapping Study of Language Features Identification from Large Text Collection","authors":"D. Mati, Jaumin Ajdari, Bujar Raufi, Mentor Hamiti, B. Selimi","doi":"10.1109/MECO.2019.8760042","DOIUrl":null,"url":null,"abstract":"Natural Language Processing11Henceforth: NLP is an emerging research area in today's era. The NLP resources are quite useful when it comes to building a machine capable of translating between linguistic pairs – a solution that strives to resolve the language barrier problems. Based on this premise, we are focusing our research on feature identification from large text collections of Albanian language. ‘Rule-based’ or statistical Part-of-Speech22Henceforth: POS (POS) taggers are sought to be utilized that would either need considerable time for rule development or a sufficient amount of manually labelled data. In light of this, the impact of this research is based on exploring numerous cases that are conducive to progress and further development of this field. One of the goals of this paper is to conduct a systematic review study; to explore and analyze existing research that seek to target low resources language such as is the case of the Albanian language. According to prior observation of published research conducted since 2015, we are focusing our research on studies that have been published in areas that are relevant to Natural Language Processing. Based on considerable load of related research on this field, it is essential to conduct a review and provide an outline of the research situation as well as current developments in this specific but important field of research.","PeriodicalId":141324,"journal":{"name":"2019 8th Mediterranean Conference on Embedded Computing (MECO)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 8th Mediterranean Conference on Embedded Computing (MECO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MECO.2019.8760042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Natural Language Processing11Henceforth: NLP is an emerging research area in today's era. The NLP resources are quite useful when it comes to building a machine capable of translating between linguistic pairs – a solution that strives to resolve the language barrier problems. Based on this premise, we are focusing our research on feature identification from large text collections of Albanian language. ‘Rule-based’ or statistical Part-of-Speech22Henceforth: POS (POS) taggers are sought to be utilized that would either need considerable time for rule development or a sufficient amount of manually labelled data. In light of this, the impact of this research is based on exploring numerous cases that are conducive to progress and further development of this field. One of the goals of this paper is to conduct a systematic review study; to explore and analyze existing research that seek to target low resources language such as is the case of the Albanian language. According to prior observation of published research conducted since 2015, we are focusing our research on studies that have been published in areas that are relevant to Natural Language Processing. Based on considerable load of related research on this field, it is essential to conduct a review and provide an outline of the research situation as well as current developments in this specific but important field of research.