Mohamed A. Meselhi, Hitham M. Abo Bakr, I. Ziedan, K. Shaalan
{"title":"Hybrid Named Entity Recognition - Application to Arabic Language","authors":"Mohamed A. Meselhi, Hitham M. Abo Bakr, I. Ziedan, K. Shaalan","doi":"10.1109/ICCES.2014.7030933","DOIUrl":null,"url":null,"abstract":"Most Named Entity Recognition (NER) systems follow either a rule-based approach or machine learning approach. In this paper, we introduce out attempt at developing a hybrid NER system, which combines the rule-based approach with a machine learning approach in order to obtain the advantages of both approaches and overcomes their problems [1]. The system is able to recognize eight types of named entities including Location, Person, Organization, Date, Time, Price, Measurement and Percent. Experimental results on ANERcorp dataset indicated that our hybrid approach outperforms the rule-based approach and the machine learning approach when they are processed separately. Moreover, our hybrid approach outperforms the state-of-the-art of Arabic NER.","PeriodicalId":339697,"journal":{"name":"2014 9th International Conference on Computer Engineering & Systems (ICCES)","volume":"65 41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 9th International Conference on Computer Engineering & Systems (ICCES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCES.2014.7030933","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Most Named Entity Recognition (NER) systems follow either a rule-based approach or machine learning approach. In this paper, we introduce out attempt at developing a hybrid NER system, which combines the rule-based approach with a machine learning approach in order to obtain the advantages of both approaches and overcomes their problems [1]. The system is able to recognize eight types of named entities including Location, Person, Organization, Date, Time, Price, Measurement and Percent. Experimental results on ANERcorp dataset indicated that our hybrid approach outperforms the rule-based approach and the machine learning approach when they are processed separately. Moreover, our hybrid approach outperforms the state-of-the-art of Arabic NER.