A Bidirectional Extraction-Then-Evaluation Framework for Complex Relation Extraction

IF 8.9 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Knowledge and Data Engineering Pub Date : 2024-07-30 DOI:10.1109/TKDE.2024.3435765

Weiyan Zhang;Jiacheng Wang;Chuang Chen;Wanpeng Lu;Wen Du;Haofen Wang;Jingping Liu;Tong Ruan

{"title":"A Bidirectional Extraction-Then-Evaluation Framework for Complex Relation Extraction","authors":"Weiyan Zhang;Jiacheng Wang;Chuang Chen;Wanpeng Lu;Wen Du;Haofen Wang;Jingping Liu;Tong Ruan","doi":"10.1109/TKDE.2024.3435765","DOIUrl":null,"url":null,"abstract":"Relation extraction is an important task in the field of natural language processing. Previous works mainly focus on adopting pipeline methods or joint methods to model relation extraction in general scenarios. However, these existing methods face challenges when adapting to complex relation extraction scenarios, such as handling overlapped triplets, multiple triplets, and cross-sentence triplets. In this paper, we revisit the advantages and disadvantages of the aforementioned methods in complex relation extraction. Based on the in-depth analysis, we propose a novel two-stage bidirectional extract-then-evaluate framework named \n<sc>BeeRe</small>\n. In the extraction stage, we first obtain the subject set, relation set, and object set. Then, we design subject- and object-oriented triplet extractors to iteratively recurrent obtain candidate triplets, ensuring high recall. In the evaluation stage, we adopt a relation-oriented triplet filter to determine subject-object pairs based on relations in triplets obtained in the first stage, ensuring high precision. We conduct extensive experiments on three public datasets to show that \n<sc>BeeRe</small>\n achieves state-of-the-art performance in both complex and general relation extraction scenarios. Even when compared to large language models like closed-source/open-source LLMs, \n<sc>BeeRe</small>\n still has significant performance gains.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"36 12","pages":"7442-7454"},"PeriodicalIF":8.9000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10614870/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Relation extraction is an important task in the field of natural language processing. Previous works mainly focus on adopting pipeline methods or joint methods to model relation extraction in general scenarios. However, these existing methods face challenges when adapting to complex relation extraction scenarios, such as handling overlapped triplets, multiple triplets, and cross-sentence triplets. In this paper, we revisit the advantages and disadvantages of the aforementioned methods in complex relation extraction. Based on the in-depth analysis, we propose a novel two-stage bidirectional extract-then-evaluate framework named BeeRe . In the extraction stage, we first obtain the subject set, relation set, and object set. Then, we design subject- and object-oriented triplet extractors to iteratively recurrent obtain candidate triplets, ensuring high recall. In the evaluation stage, we adopt a relation-oriented triplet filter to determine subject-object pairs based on relations in triplets obtained in the first stage, ensuring high precision. We conduct extensive experiments on three public datasets to show that BeeRe achieves state-of-the-art performance in both complex and general relation extraction scenarios. Even when compared to large language models like closed-source/open-source LLMs, BeeRe still has significant performance gains.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

复杂关系提取的 "先提取后评估 "双向框架

关系提取是自然语言处理领域的一项重要任务。以往的工作主要集中在采用管道方法或联合方法对一般场景下的关系提取进行建模。然而，这些现有方法在适应复杂的关系提取场景时面临挑战，如处理重叠三元组、多重三元组和跨句三元组等。在本文中，我们重新审视了上述方法在复杂关系提取中的优缺点。在深入分析的基础上，我们提出了一种名为 BeeRe 的新颖的两阶段双向提取-评估框架。在提取阶段，我们首先获得主题集、关系集和对象集。然后，我们设计了面向主体和对象的三元组提取器，反复循环地获取候选三元组，确保高召回率。在评估阶段，我们采用面向关系的三元组过滤器，根据第一阶段获得的三元组中的关系确定主客体对，确保高精度。我们在三个公共数据集上进行了广泛的实验，结果表明，BeeRe 在复杂和一般关系提取场景中都达到了最先进的性能。即使与闭源/开源 LLM 等大型语言模型相比，BeeRe 的性能也有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.