{"title":"Toward the Resolution of Arabic Lexical Ambiguities with Transduction on Text Automaton","authors":"Nadia Ghezaiel, K. Haddar","doi":"10.1109/ACLING.2015.12","DOIUrl":null,"url":null,"abstract":"Lexical analysis can be a way to remove ambiguities in Arabic language. So their resolution is an important task in several Natural Language Processing (NLP) applications. In this context that this paper is inscribed. Our proposed resolution method is based essentially on the use of transducers on text automata. Indeed these transducers specify the lexical and contextual rules for Arabic language. They allow the resolution of lexical ambiguities. In order to achieve this resolution method, different types of lexical ambiguities are identified and studied to extract an appropriate set of rules. After that, we described lexical rules in ELAG [19] system (Elimination of Lexical Ambiguities by Grammars), which can delete paths representing morphosyntactic ambiguities. In addition, we present an experimentation implemented in the Unitex platform and conducted by various linguistic resources to obtain disambiguated syntactic structures suitable for the syntactic analysis. The obtained results are ambitious and can be improved by adding other rules and heuristics.","PeriodicalId":404268,"journal":{"name":"2015 First International Conference on Arabic Computational Linguistics (ACLing)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 First International Conference on Arabic Computational Linguistics (ACLing)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACLING.2015.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Lexical analysis can be a way to remove ambiguities in Arabic language. So their resolution is an important task in several Natural Language Processing (NLP) applications. In this context that this paper is inscribed. Our proposed resolution method is based essentially on the use of transducers on text automata. Indeed these transducers specify the lexical and contextual rules for Arabic language. They allow the resolution of lexical ambiguities. In order to achieve this resolution method, different types of lexical ambiguities are identified and studied to extract an appropriate set of rules. After that, we described lexical rules in ELAG [19] system (Elimination of Lexical Ambiguities by Grammars), which can delete paths representing morphosyntactic ambiguities. In addition, we present an experimentation implemented in the Unitex platform and conducted by various linguistic resources to obtain disambiguated syntactic structures suitable for the syntactic analysis. The obtained results are ambitious and can be improved by adding other rules and heuristics.
词法分析是消除阿拉伯语歧义的一种方法。因此,它们的解析是自然语言处理(NLP)应用中的一个重要任务。这篇论文就是在这样的背景下题写的。我们提出的解决方法基本上是基于在文本自动机上使用换能器。实际上,这些换能器指定了阿拉伯语的词汇和上下文规则。它们允许解决词汇歧义。为了实现这种解决方法,对不同类型的词汇歧义进行识别和研究,以提取合适的规则集。之后,我们描述了ELAG[19]系统中的词法规则(Elimination of lexical ambigities by Grammars),它可以删除表示形态句法歧义的路径。此外,我们提出了在Unitex平台上实施的实验,并利用各种语言资源进行了实验,以获得适合句法分析的消歧句法结构。所获得的结果是雄心勃勃的,可以通过添加其他规则和启发式来改进。