{"title":"Chinese named entity recognition based on Heterogeneous Graph and Dynamic Attention Network","authors":"Yuke Wang, Ling Lu, Wu Yang, Yinong Chen","doi":"10.1109/ISADS56919.2023.10092180","DOIUrl":null,"url":null,"abstract":"Abstract Lexicon have been proved to enhance character representation to help Chinese named entity recognition (NER) model distinguish entities. Although lexicon information includes both semantic and boundary information of words, existing studies usually use only part of them and has low utilization of lexicon information. To efficiently extract dictionary features and integrate character representation, we propose a Heterogeneous Graph and Dynamic Attention Network (HGDAN), aiming at fusing contextual information and capturing dynamic associations between characters and words, thus improving the performance of Chinese NER. PGDNA uses the boundary information of the dictionary to construct a heterogeneous graph and uses the graph attention method to extract semantic information, as well as suppressing lexical noise through the gating unit. In addition, we found the traditional attention model has a non-zero attention problem that will distract the attention of the model, and proposed a simple and effective method to solve it. Experiments on the performance and inference speed of HGDAN on four Chinese datasets have proved its superiority.","PeriodicalId":412453,"journal":{"name":"2023 IEEE 15th International Symposium on Autonomous Decentralized System (ISADS)","volume":"60 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 15th International Symposium on Autonomous Decentralized System (ISADS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISADS56919.2023.10092180","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Lexicon have been proved to enhance character representation to help Chinese named entity recognition (NER) model distinguish entities. Although lexicon information includes both semantic and boundary information of words, existing studies usually use only part of them and has low utilization of lexicon information. To efficiently extract dictionary features and integrate character representation, we propose a Heterogeneous Graph and Dynamic Attention Network (HGDAN), aiming at fusing contextual information and capturing dynamic associations between characters and words, thus improving the performance of Chinese NER. PGDNA uses the boundary information of the dictionary to construct a heterogeneous graph and uses the graph attention method to extract semantic information, as well as suppressing lexical noise through the gating unit. In addition, we found the traditional attention model has a non-zero attention problem that will distract the attention of the model, and proposed a simple and effective method to solve it. Experiments on the performance and inference speed of HGDAN on four Chinese datasets have proved its superiority.