Research on Named Entity Recognition Method of Chinese Classics Under the Supervision of Domain Knowledge
Authors: Wenjuan Zhao, Zhongbao Liu, Jian Lian
Journal: Advances in Engineering Technology Research
Publication date: 2023-10-17
DOI: https://doi.org/10.56028/aetr.8.1.344.2023
Abstract
Current mainstream named entity recognition (NER) methods for Chinese classics are data-driven and are therefore limited by data quality. This paper introduces domain knowledge to supervise the NER process, addressing the poor performance caused by low-quality data. Experiments on the Historical Records corpus show that, compared with the case without domain-knowledge supervision, the average accuracy, recall, and F1 value improve by 2.76%, 2.70%, and 2.75%, respectively, under domain-knowledge supervision. Domain knowledge thus plays an important role in improving the performance of NER methods for Chinese classics.
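The three reported metrics can be illustrated with a minimal sketch of entity-level scoring. The abstract does not describe the evaluation protocol, so several things here are assumptions rather than the authors' setup: that "accuracy" is read as precision, that labels follow a BIO tagging scheme, and that an entity counts as correct only when both its type and its boundaries match exactly. The function names `extract_entities` and `prf1` and the toy tag sequences are hypothetical.

```python
from typing import List, Set, Tuple

Entity = Tuple[str, int, int]  # (entity type, start index, end index exclusive)


def extract_entities(tags: List[str]) -> Set[Entity]:
    """Collect (type, start, end) spans from a BIO tag sequence (assumed scheme)."""
    entities: Set[Entity] = set()
    start, etype = None, None
    for i, tag in enumerate(tags + ["O"]):  # trailing sentinel closes an open span
        continues = tag.startswith("I-") and etype == tag[2:]
        if start is not None and not continues:
            entities.add((etype, start, i))  # close the span that was open
            start, etype = None, None
        if tag.startswith("B-"):
            start, etype = i, tag[2:]  # open a new span; stray I- tags are ignored
    return entities


def prf1(gold: Set[Entity], pred: Set[Entity]) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 over entity spans."""
    tp = len(gold & pred)  # spans whose type and boundaries both match
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy example (invented tags, not data from the paper):
gold_tags = ["B-PER", "I-PER", "O", "B-LOC", "I-LOC", "O"]
pred_tags = ["B-PER", "I-PER", "O", "B-LOC", "O", "O"]  # LOC boundary is wrong
p, r, f = prf1(extract_entities(gold_tags), extract_entities(pred_tags))
print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

Under this exact-match convention a boundary error costs both precision and recall, which is one reason noisy annotations can depress all three scores at once.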