Yuekun Ma, He Liu, Dezheng Zhang, Chang Gao, Yujue Liu, Yujie Liu
{"title":"基于词典信息和文本局部特征的命名实体识别方法","authors":"Yuekun Ma, He Liu, Dezheng Zhang, Chang Gao, Yujue Liu, Yujie Liu","doi":"10.17559/tv-20230121000257","DOIUrl":null,"url":null,"abstract":": At present, Named Entity Recognition (NER) is one of the fundamental tasks for extracting knowledge from traditional Chinese medicine (TCM) texts. The variability of the length of TCM entities and the characteristics of the language of TCM texts lead to ambiguity of TCM entity boundaries. In addition, better extracting and exploiting local features of text can improve the accuracy of named entity recognition. In this paper, we proposed a TCM NER model with lexicon information and text local feature enhancement of text. In this model, a lexicon is introduced to encode the characters in the text to obtain the context-sensitive global semantic representation of the text. The convolutional neural network (CNN) and gate joined collaborative attention network are used to form a text local feature extraction module to capture the important semantic features of local text. Experiments were conducted on two TCM domain datasets and the F 1 values are 91.13% and 90.21% respectively.","PeriodicalId":49443,"journal":{"name":"Tehnicki Vjesnik-Technical Gazette","volume":"37 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Named Entity Recognition Method Enhanced with Lexicon Information and Text Local Feature\",\"authors\":\"Yuekun Ma, He Liu, Dezheng Zhang, Chang Gao, Yujue Liu, Yujie Liu\",\"doi\":\"10.17559/tv-20230121000257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": At present, Named Entity Recognition (NER) is one of the fundamental tasks for extracting knowledge from traditional Chinese medicine (TCM) texts. The variability of the length of TCM entities and the characteristics of the language of TCM texts lead to ambiguity of TCM entity boundaries. In addition, better extracting and exploiting local features of text can improve the accuracy of named entity recognition. In this paper, we proposed a TCM NER model with lexicon information and text local feature enhancement of text. In this model, a lexicon is introduced to encode the characters in the text to obtain the context-sensitive global semantic representation of the text. The convolutional neural network (CNN) and gate joined collaborative attention network are used to form a text local feature extraction module to capture the important semantic features of local text. Experiments were conducted on two TCM domain datasets and the F 1 values are 91.13% and 90.21% respectively.\",\"PeriodicalId\":49443,\"journal\":{\"name\":\"Tehnicki Vjesnik-Technical Gazette\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2023-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tehnicki Vjesnik-Technical Gazette\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.17559/tv-20230121000257\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tehnicki Vjesnik-Technical Gazette","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.17559/tv-20230121000257","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
A Named Entity Recognition Method Enhanced with Lexicon Information and Text Local Feature
: At present, Named Entity Recognition (NER) is one of the fundamental tasks for extracting knowledge from traditional Chinese medicine (TCM) texts. The variability of the length of TCM entities and the characteristics of the language of TCM texts lead to ambiguity of TCM entity boundaries. In addition, better extracting and exploiting local features of text can improve the accuracy of named entity recognition. In this paper, we proposed a TCM NER model with lexicon information and text local feature enhancement of text. In this model, a lexicon is introduced to encode the characters in the text to obtain the context-sensitive global semantic representation of the text. The convolutional neural network (CNN) and gate joined collaborative attention network are used to form a text local feature extraction module to capture the important semantic features of local text. Experiments were conducted on two TCM domain datasets and the F 1 values are 91.13% and 90.21% respectively.
期刊介绍:
The journal TEHNIČKI VJESNIK - TECHNICAL GAZETTE publishes scientific and professional papers in the area of technical sciences (mostly from mechanical, electrical and civil engineering, and also from their boundary areas).
All articles have undergone peer review and upon acceptance are permanently free of all restrictions on access, for everyone to read and download.
For all articles authors will be asked to pay a publication fee prior to the article appearing in the journal. However, this fee only to be paid after the article has been positively reviewed and accepted for publishing. All details can be seen at http://www.tehnicki-vjesnik.com/web/public/page
First year of publication: 1994
Frequency (annually): 6