Yong Song, Zhiwei Yan, Yukun Qin, Xiaozhou Ye, Ye Ouyang
{"title":"Self-distilled Named Entity Recognition Based on Boundary Detection and Biaffine Attention","authors":"Yong Song, Zhiwei Yan, Yukun Qin, Xiaozhou Ye, Ye Ouyang","doi":"10.1109/smartworld-uic-atc-scalcom-digitaltwin-pricomp-metaverse56740.2022.00162","DOIUrl":null,"url":null,"abstract":"Named Entity Recognition (NER) is an important down-streaming task in natural language processing. Span-based methods are applicable to both flat and nested entities. However, they lack explicit boundary supervision. To address this issue, we propose a multi-task and self-distilled model which combines biaffine span classification and entity boundary detection tasks. Firstly, the boundary detection and biaffine span classification models are jointly trained under a multi-task learning framework to address the problem of lacking supervision of boundaries. Then, self-distillation technique is applied on the model to reassign entity probabilities from annotated spans to surrounding spans and more entity types, further improving the accuracy of NER by soft labels that contain richer knowledge. Experiments were based on a high-density entity text dataset of the commodity titles from an e-commerce company. Finally, the experimental results show that our model exhibited a better F1 score than the existing common models.","PeriodicalId":43791,"journal":{"name":"Scalable Computing-Practice and Experience","volume":"1 1","pages":"1132-1137"},"PeriodicalIF":0.9000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scalable Computing-Practice and Experience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/smartworld-uic-atc-scalcom-digitaltwin-pricomp-metaverse56740.2022.00162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Named Entity Recognition (NER) is an important down-streaming task in natural language processing. Span-based methods are applicable to both flat and nested entities. However, they lack explicit boundary supervision. To address this issue, we propose a multi-task and self-distilled model which combines biaffine span classification and entity boundary detection tasks. Firstly, the boundary detection and biaffine span classification models are jointly trained under a multi-task learning framework to address the problem of lacking supervision of boundaries. Then, self-distillation technique is applied on the model to reassign entity probabilities from annotated spans to surrounding spans and more entity types, further improving the accuracy of NER by soft labels that contain richer knowledge. Experiments were based on a high-density entity text dataset of the commodity titles from an e-commerce company. Finally, the experimental results show that our model exhibited a better F1 score than the existing common models.
期刊介绍:
The area of scalable computing has matured and reached a point where new issues and trends require a professional forum. SCPE will provide this avenue by publishing original refereed papers that address the present as well as the future of parallel and distributed computing. The journal will focus on algorithm development, implementation and execution on real-world parallel architectures, and application of parallel and distributed computing to the solution of real-life problems.