Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation

Q3 Environmental Science AACL Bioflux Pub Date : 2022-10-14 DOI:10.48550/arXiv.2210.07544

A. Shukla, Paheli Bhattacharya, Soham Poddar, Rajdeep Mukherjee, Kripabandhu Ghosh, Pawan Goyal, Saptarshi Ghosh

引用次数: 13

Abstract

Summarization of legal case judgement documents is a challenging problem in Legal NLP. However, not much analyses exist on how different families of summarization models (e.g., extractive vs. abstractive) perform when applied to legal case documents. This question is particularly important since many recent transformer-based abstractive summarization models have restrictions on the number of input tokens, and legal documents are known to be very long. Also, it is an open question on how best to evaluate legal case document summarization systems. In this paper, we carry out extensive experiments with several extractive and abstractive summarization methods (both supervised and unsupervised) over three legal summarization datasets that we have developed. Our analyses, that includes evaluation by law practitioners, lead to several interesting insights on legal summarization in specific and long document summarization in general.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

法律案件文书摘要:抽取与抽象方法及其评价

案件判决书摘要是法律自然语言处理中的一个难题。然而，对于不同类型的摘要模型(例如，抽取型与抽象型)在应用于法律案例文件时的表现，并没有太多的分析。这个问题尤其重要，因为许多最新的基于转换器的抽象摘要模型对输入令牌的数量有限制，而且法律文档非常长。此外，如何最好地评估法律案件文件摘要系统也是一个悬而未决的问题。在本文中，我们在我们开发的三个法律摘要数据集上使用几种提取和抽象摘要方法(监督和无监督)进行了广泛的实验。我们的分析，包括法律从业人员的评估，导致了几个有趣的见解，具体的法律摘要和一般的长文件摘要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

AACL Bioflux Environmental Science-Management, Monitoring, Policy and Law

CiteScore

1.40

自引率

0.00%

发文量