GEGA: Graph Convolutional Networks and Evidence Retrieval Guided Attention for Enhanced Document-level Relation Extraction

arXiv - CS - Computation and Language Pub Date : 2024-07-31 DOI:arxiv-2407.21384

Yanxu Mao, Peipei Liu, Tiehan Cui

{"title":"GEGA: Graph Convolutional Networks and Evidence Retrieval Guided Attention for Enhanced Document-level Relation Extraction","authors":"Yanxu Mao, Peipei Liu, Tiehan Cui","doi":"arxiv-2407.21384","DOIUrl":null,"url":null,"abstract":"Document-level relation extraction (DocRE) aims to extract relations between\nentities from unstructured document text. Compared to sentence-level relation\nextraction, it requires more complex semantic understanding from a broader text\ncontext. Currently, some studies are utilizing logical rules within evidence\nsentences to enhance the performance of DocRE. However, in the data without\nprovided evidence sentences, researchers often obtain a list of evidence\nsentences for the entire document through evidence retrieval (ER). Therefore,\nDocRE suffers from two challenges: firstly, the relevance between evidence and\nentity pairs is weak; secondly, there is insufficient extraction of complex\ncross-relations between long-distance multi-entities. To overcome these\nchallenges, we propose GEGA, a novel model for DocRE. The model leverages graph\nneural networks to construct multiple weight matrices, guiding attention\nallocation to evidence sentences. It also employs multi-scale representation\naggregation to enhance ER. Subsequently, we integrate the most efficient\nevidence information to implement both fully supervised and weakly supervised\ntraining processes for the model. We evaluate the GEGA model on three widely\nused benchmark datasets: DocRED, Re-DocRED, and Revisit-DocRED. The\nexperimental results indicate that our model has achieved comprehensive\nimprovements compared to the existing SOTA model.","PeriodicalId":501030,"journal":{"name":"arXiv - CS - Computation and Language","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computation and Language","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.21384","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Document-level relation extraction (DocRE) aims to extract relations between entities from unstructured document text. Compared to sentence-level relation extraction, it requires more complex semantic understanding from a broader text context. Currently, some studies are utilizing logical rules within evidence sentences to enhance the performance of DocRE. However, in the data without provided evidence sentences, researchers often obtain a list of evidence sentences for the entire document through evidence retrieval (ER). Therefore, DocRE suffers from two challenges: firstly, the relevance between evidence and entity pairs is weak; secondly, there is insufficient extraction of complex cross-relations between long-distance multi-entities. To overcome these challenges, we propose GEGA, a novel model for DocRE. The model leverages graph neural networks to construct multiple weight matrices, guiding attention allocation to evidence sentences. It also employs multi-scale representation aggregation to enhance ER. Subsequently, we integrate the most efficient evidence information to implement both fully supervised and weakly supervised training processes for the model. We evaluate the GEGA model on three widely used benchmark datasets: DocRED, Re-DocRED, and Revisit-DocRED. The experimental results indicate that our model has achieved comprehensive improvements compared to the existing SOTA model.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

GEGA：图卷积网络和证据检索引导注意力用于增强文档级关系提取

文档级关系提取（DocRE）旨在从非结构化文档文本中提取实体之间的关系。与句子级关系提取相比，它需要从更广泛的文本语境中获得更复杂的语义理解。目前，一些研究利用证据信息中的逻辑规则来提高 DocRE 的性能。然而，在没有提供证据句的数据中，研究人员通常通过证据检索（ER）获得整个文档的证据句列表。因此，DocRE 面临两个挑战：第一，证据和实体对之间的相关性较弱；第二，远距离多实体之间的复杂交叉关系提取不足。为了克服这些挑战，我们为 DocRE 提出了一个新模型--GEGA。该模型利用图神经网络来构建多个权重矩阵，从而指导对证据句子的注意力分配。它还采用了多尺度表征聚合（multi-scale representationaggregation）来增强ER。随后，我们整合了最有效的证据信息，为模型实施了全监督和弱监督训练过程。我们在三个广泛使用的基准数据集上对 GEGA 模型进行了评估：DocRED、Re-DocRED 和 Revisit-DocRED。实验结果表明，与现有的 SOTA 模型相比，我们的模型取得了全面的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - CS - Computation and Language

自引率

0.00%

发文量