Joint span and token framework for few-shot named entity recognition

IF 8.9 AI Open Pub Date : 2023-01-01 Epub Date: 2023-09-04 DOI:10.1016/j.aiopen.2023.08.009

Wenlong Fang, Yongbin Liu, Chunping Ouyang, Lin Ren, Jiale Li, Yaping Wan

AI Open, volume 4, pages 111-119. Journal article. Full text: https://www.sciencedirect.com/science/article/pii/S2666651023000116

Citations: 0



Joint span and token framework for few-shot named entity recognition

Few-shot Named Entity Recognition (NER) is a challenging task that involves identifying new entity types from only a limited number of labeled training instances. Currently, most few-shot NER methods are span-based: they focus on the boundary information of candidate entity spans and on entity-level information. However, these methods often overlook token-level semantic information, which can limit their effectiveness. To address this issue, we propose a novel Joint Span and Token (JST) framework that integrates the boundary information of an entity with the semantic information of each token that makes up the entity. The JST framework uses span features to capture the entity's boundaries and token features to capture the semantics of each token. Additionally, to reduce the negative impact of the Other class, we introduce a method that separates named entities from the Other class in semantic space, which improves the distinction between the two. We also use GPT to augment the support sentences by generating sentences similar to the originals, which increases the diversity of the samples and the reliability of our model. Experimental results on the Few-NERD and SNIPS datasets show that our model outperforms existing methods.
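The abstract does not give the JST architecture in detail, so the following is only an illustrative sketch of the general idea of combining span-level boundary features with token-level semantic features. Every choice here is an assumption and not the authors' method: the boundary feature is taken as the concatenated first and last token vectors, the semantic feature as the mean of the token vectors in the span, and classification uses nearest class prototypes built from the support set. All function names are hypothetical, and the toy 2-dimensional embeddings stand in for real encoder outputs.

```python
import math

def span_feature(tokens, start, end):
    """Boundary feature (assumed): concatenate the first and last token vectors."""
    return tokens[start] + tokens[end]

def token_feature(tokens, start, end):
    """Semantic feature (assumed): average the vectors of all tokens in the span."""
    width = end - start + 1
    dim = len(tokens[start])
    return [sum(tokens[i][d] for i in range(start, end + 1)) / width
            for d in range(dim)]

def joint_feature(tokens, start, end):
    """Joint representation: boundary feature followed by semantic feature."""
    return span_feature(tokens, start, end) + token_feature(tokens, start, end)

def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[d] for v in vectors) / n for d in range(len(vectors[0]))]

def build_prototypes(support):
    """Average the joint features per label; support holds (tokens, start, end, label)."""
    by_label = {}
    for tokens, start, end, label in support:
        by_label.setdefault(label, []).append(joint_feature(tokens, start, end))
    return {label: mean_vector(feats) for label, feats in by_label.items()}

def classify(tokens, start, end, prototypes):
    """Assign the label whose prototype is closest in Euclidean distance."""
    query = joint_feature(tokens, start, end)
    return min(prototypes, key=lambda label: math.dist(query, prototypes[label]))

# Toy support set: one labeled span per class (made-up vectors and labels).
support = [
    ([[1.0, 0.0], [0.9, 0.1]], 0, 1, "PER"),
    ([[0.0, 1.0], [0.1, 0.9]], 0, 1, "LOC"),
]
prototypes = build_prototypes(support)

# Query sentence of three tokens; the candidate span covers tokens 1..2.
query_tokens = [[0.5, 0.5], [0.95, 0.05], [0.85, 0.15]]
print(classify(query_tokens, 1, 2, prototypes))
```

In this sketch the two feature types are simply concatenated, so both contribute to the distance. The Other-class separation and the GPT-based augmentation mentioned in the abstract are not modeled here.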

