Named Entity Recognition Based on Pre-training Model and Multi-head Attention Mechanism

Q3 | Arts and Humanities | Icon | Pub Date: 2023-03-01 | DOI: 10.1109/icnlp58431.2023.00040
GuoHua Zhu, Jian Wang
Citations: 0

Abstract

In Chinese named entity recognition (CNER), traditional models suffer from word-segmentation ambiguity and single, context-independent word vectors, so they train poorly. To address this, a BERT-BiLSTM multi-head attention model (PMA-CNER) is proposed to improve CNER accuracy. The model uses BERT to embed words before the BiLSTM layer, extracting global contextual semantic features more effectively. A multi-head attention layer is then added after the BiLSTM layer; it extracts multiple semantic features and compensates for the BiLSTM's weakness at capturing local features. Experimental results on the CLUSER2020 and Yudu-S4K datasets show that accuracy improves significantly, reaching 93.94% and 91.83% respectively.
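The pipeline the abstract describes (contextual embeddings → BiLSTM → multi-head attention → per-token classification) can be sketched as follows. This is a minimal illustrative PyTorch sketch, not the authors' implementation: the BERT encoder is stubbed out with a plain embedding layer, and all dimensions, head counts, and the label count are assumptions.

```python
# Hypothetical sketch of the BiLSTM + multi-head attention tagging head.
# BERT is replaced by a random embedding layer for self-containment;
# hidden sizes, head count, and label set are illustrative assumptions.
import torch
import torch.nn as nn

class BiLSTMMultiHeadTagger(nn.Module):
    def __init__(self, vocab_size=100, emb_dim=128, hidden=64,
                 num_heads=4, num_labels=7):
        super().__init__()
        # Stand-in for BERT contextual embeddings.
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # BiLSTM captures left-to-right and right-to-left context.
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True,
                              bidirectional=True)
        # Multi-head self-attention after the BiLSTM, as the paper
        # proposes, to extract multiple semantic subspaces.
        self.attn = nn.MultiheadAttention(2 * hidden, num_heads,
                                          batch_first=True)
        self.classifier = nn.Linear(2 * hidden, num_labels)

    def forward(self, token_ids):
        x = self.embed(token_ids)   # (B, T, emb_dim)
        h, _ = self.bilstm(x)       # (B, T, 2*hidden)
        a, _ = self.attn(h, h, h)   # self-attention over BiLSTM states
        return self.classifier(a)   # per-token label logits

model = BiLSTMMultiHeadTagger()
logits = model(torch.randint(0, 100, (2, 10)))  # batch of 2, length 10
print(logits.shape)
```

In a faithful reproduction, `self.embed` would be replaced by a frozen or fine-tuned BERT encoder, and the linear classifier would typically feed a CRF layer for sequence-level label consistency.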