首页 > 最新文献

Artificial Intelligence and Law最新文献

英文 中文
LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdicts 诉讼:意大利宪法法院判决的一个大型专家撰写的摘要数据集
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-09 DOI: 10.1007/s10506-024-09414-w
Luca Ragazzi, Gianluca Moro, Stefano Guidi, Giacomo Frisoni

Large-scale public datasets are vital for driving the progress of abstractive summarization, especially in law, where documents have highly specialized jargon. However, the available resources are English-centered, limiting research advancements in other languages. This paper introduces LAWSUIT, a collection of 14K Italian legal verdicts with expert-authored abstractive maxims drawn from the Constitutional Court of the Italian Republic. LAWSUIT presents an arduous task with lengthy source texts and evenly distributed salient content. We offer extensive experiments with sequence-to-sequence and segmentation-based approaches, revealing that the latter achieve better results in full and few-shot settings. We openly release LAWSUIT to foster the development and automation of real-world legal applications.

大规模的公共数据集对于推动抽象摘要的进步至关重要,特别是在法律领域,其中的文档具有高度专业化的术语。然而,可用的资源以英语为中心,限制了其他语言的研究进展。本文介绍了诉讼,14K意大利法律判决与专家撰写的抽象格言从意大利共和国宪法法院的集合。诉讼提出了一个艰巨的任务,冗长的源文本和均匀分布的突出内容。我们对序列到序列和基于分割的方法进行了大量的实验,结果表明,后者在全镜头和少镜头设置下取得了更好的效果。我们公开发布诉讼,以促进现实世界法律应用程序的开发和自动化。
{"title":"LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdicts","authors":"Luca Ragazzi,&nbsp;Gianluca Moro,&nbsp;Stefano Guidi,&nbsp;Giacomo Frisoni","doi":"10.1007/s10506-024-09414-w","DOIUrl":"10.1007/s10506-024-09414-w","url":null,"abstract":"<div><p>Large-scale public datasets are vital for driving the progress of abstractive summarization, especially in law, where documents have highly specialized jargon. However, the available resources are English-centered, limiting research advancements in other languages. This paper introduces <span>LAWSUIT</span>, a collection of 14K Italian legal verdicts with expert-authored abstractive maxims drawn from the Constitutional Court of the Italian Republic. <span>LAWSUIT</span> presents an arduous task with lengthy source texts and evenly distributed salient content. We offer extensive experiments with sequence-to-sequence and segmentation-based approaches, revealing that the latter achieve better results in full and few-shot settings. We openly release <span>LAWSUIT</span> to foster the development and automation of real-world legal applications.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"1151 - 1187"},"PeriodicalIF":3.1,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-024-09414-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145493398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DGGCCM: a hybrid neural model for legal event detection DGGCCM:一种用于法律事件检测的混合神经模型
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-05 DOI: 10.1007/s10506-024-09418-6
Shutao Gong, Xudong Luo

This paper introduces an advanced event detection model for legal intelligence, focusing on identifying event types in legal cases by examining trigger word candidates. It employs the DeBERTa pre-trained language model for encoding sentences into enriched word representations, supplemented by the Global Pointer neural network for initial scoring. The model further uses a graph convolutional network, conditional layer normalisation, and a convolutional neural network to extract features from these representations. A multilayer perceptron then determines the event type based on these features and initial scores. Additionally, a dictionary-matching method revises the predicted event types, with adversarial training and a sentence-length mask employed to enhance model performance and address missing trigger words. The model’s effectiveness is proven through extensive experimentation, outperforming state-of-the-art baselines (including some large language models) and securing third prize in the event detection task at the Challenge of AI in Law (CAIL) 2022. The code of our model is available at https://github.com/1gst/DGGCCN/tree/main.

本文介绍了一种用于法律智能的高级事件检测模型,重点是通过检测触发词候选者来识别法律案件中的事件类型。它采用DeBERTa预训练语言模型将句子编码为丰富的单词表示,并辅以Global Pointer神经网络进行初始评分。该模型进一步使用图卷积网络、条件层归一化和卷积神经网络从这些表示中提取特征。然后多层感知器根据这些特征和初始分数确定事件类型。此外,字典匹配方法修正预测的事件类型,使用对抗性训练和句子长度掩码来提高模型性能并解决缺失的触发词。通过广泛的实验,该模型的有效性得到了证明,优于最先进的基线(包括一些大型语言模型),并在2022年人工智能法律挑战(CAIL)的事件检测任务中获得了三等奖。我们模型的代码可以在https://github.com/1gst/DGGCCN/tree/main上找到。
{"title":"DGGCCM: a hybrid neural model for legal event detection","authors":"Shutao Gong,&nbsp;Xudong Luo","doi":"10.1007/s10506-024-09418-6","DOIUrl":"10.1007/s10506-024-09418-6","url":null,"abstract":"<div><p>This paper introduces an advanced event detection model for legal intelligence, focusing on identifying event types in legal cases by examining trigger word candidates. It employs the DeBERTa pre-trained language model for encoding sentences into enriched word representations, supplemented by the Global Pointer neural network for initial scoring. The model further uses a graph convolutional network, conditional layer normalisation, and a convolutional neural network to extract features from these representations. A multilayer perceptron then determines the event type based on these features and initial scores. Additionally, a dictionary-matching method revises the predicted event types, with adversarial training and a sentence-length mask employed to enhance model performance and address missing trigger words. The model’s effectiveness is proven through extensive experimentation, outperforming state-of-the-art baselines (including some large language models) and securing third prize in the event detection task at the Challenge of AI in Law (CAIL) 2022. The code of our model is available at https://github.com/1gst/DGGCCN/tree/main.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"1109 - 1149"},"PeriodicalIF":3.1,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145493397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An interdisciplinary account of the terminological choices by EU policymakers ahead of the final agreement on the AI Act: AI system, general purpose AI system, foundation model, and generative AI 对欧盟政策制定者在就《人工智能法》达成最终协议之前的术语选择进行跨学科阐述:人工智能系统、通用人工智能系统、基础模型和生成式人工智能
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-09 DOI: 10.1007/s10506-024-09412-y
David Fernández-Llorca, Emilia Gómez, Ignacio Sánchez, Gabriele Mazzini

The European Union’s Artificial Intelligence Act (AI Act) is a groundbreaking regulatory framework that integrates technical concepts and terminology from the rapidly evolving ecosystems of AI research and innovation into the legal domain. Precise definitions accessible to both AI experts and lawyers are crucial for the legislation to be effective. This paper provides an interdisciplinary analysis of the concepts of AI system, general purpose AI system, foundation model and generative AI across the different versions of the legal text (Commission proposal, Parliament position and Council General Approach) before the final political agreement. The goal is to help bridge the understanding of these key terms between the technical and legal communities and contribute to a proper implementation of the AI Act. We provide an analysis of the concept of AI system considering its scientific foundation and the crucial role that it plays in the regulation, which requires a sound definition both from legal and technical standpoints. We connect the outcomes of this discussion with the analysis of the concept of general purpose AI system and its evolution during the negotiations. We also address the distinct conceptual meanings of AI system vs AI model and explore the technical nuances of the term foundation model. We conclude that rooting the definition of foundation model to its general purpose capabilities following standardised evaluation methodologies appears to be most appropriate approach. Lastly, we tackle the concept of generative AI, arguing that definitions of AI system that include “content” as one of the system’s outputs already captures it, and concluding that not all generative AI is based on foundation models.

欧盟的人工智能法案(AI法案)是一个开创性的监管框架,将人工智能研究和创新快速发展的生态系统中的技术概念和术语整合到法律领域。人工智能专家和律师都能获得的精确定义对于立法的有效性至关重要。本文在最终政治协议之前,对不同版本的法律文本(委员会提案、议会立场和理事会一般方法)中的人工智能系统、通用人工智能系统、基础模型和生成式人工智能的概念进行了跨学科分析。其目标是帮助弥合技术和法律界对这些关键术语的理解,并为《人工智能法案》的适当实施做出贡献。我们对人工智能系统的概念进行了分析,考虑到它的科学基础和它在监管中发挥的关键作用,这需要从法律和技术的角度对其进行合理的定义。我们将讨论的结果与分析通用人工智能系统的概念及其在谈判过程中的演变联系起来。我们还讨论了人工智能系统与人工智能模型的不同概念含义,并探讨了术语基础模型的技术细微差别。我们得出的结论是,将基础模型的定义根植于其通用功能,遵循标准化的评估方法似乎是最合适的方法。最后,我们讨论了生成人工智能的概念,认为将“内容”作为系统输出之一的人工智能系统的定义已经捕获了它,并得出结论,并非所有生成人工智能都基于基础模型。
{"title":"An interdisciplinary account of the terminological choices by EU policymakers ahead of the final agreement on the AI Act: AI system, general purpose AI system, foundation model, and generative AI","authors":"David Fernández-Llorca,&nbsp;Emilia Gómez,&nbsp;Ignacio Sánchez,&nbsp;Gabriele Mazzini","doi":"10.1007/s10506-024-09412-y","DOIUrl":"10.1007/s10506-024-09412-y","url":null,"abstract":"<div><p>The European Union’s Artificial Intelligence Act (AI Act) is a groundbreaking regulatory framework that integrates technical concepts and terminology from the rapidly evolving ecosystems of AI research and innovation into the legal domain. Precise definitions accessible to both AI experts and lawyers are crucial for the legislation to be effective. This paper provides an interdisciplinary analysis of the concepts of <i>AI system</i>, <i>general purpose AI system</i>, <i>foundation model</i> and <i>generative AI</i> across the different versions of the legal text (Commission proposal, Parliament position and Council General Approach) before the final political agreement. The goal is to help bridge the understanding of these key terms between the technical and legal communities and contribute to a proper implementation of the AI Act. We provide an analysis of the concept of <i>AI system</i> considering its scientific foundation and the crucial role that it plays in the regulation, which requires a sound definition both from legal and technical standpoints. We connect the outcomes of this discussion with the analysis of the concept of <i>general purpose AI system</i> and its evolution during the negotiations. We also address the distinct conceptual meanings of <i>AI system</i> vs <i>AI model</i> and explore the technical nuances of the term <i>foundation model</i>. We conclude that rooting the definition of <i>foundation model</i> to its general purpose capabilities following standardised evaluation methodologies appears to be most appropriate approach. Lastly, we tackle the concept of <i>generative AI</i>, arguing that definitions of <i>AI system</i> that include “content” as one of the system’s outputs already captures it, and concluding that not all <i>generative AI</i> is based on <i>foundation models</i>.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"875 - 888"},"PeriodicalIF":3.1,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-024-09412-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141922867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Jurisprudence in hard and soft law output of international organizations: a network analysis of the use of precedent in UN Security Council and general assembly resolutions 国际组织的软硬法律产出的法理学:联合国安理会和大会决议中使用先例的网络分析
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-07 DOI: 10.1007/s10506-024-09416-8
Rafael Mesquita, Antonio Pires

Do hard law international organizations use jurisprudence differently than soft law ones? Precedent can be asset or an encumbrance to international organizations and their members, depending on their aims and on the policy area. Linking current decisions to previously-agreed ones helps to increase cohesion, facilitate consensus among members, and borrow authority – benefits that might be more necessary for some organizations than for others. To compare whether the features of norm-producing organizations correlate with their preference for jurisprudence, we compare two organs from the United Nations system: the Security Council, which produces binding decisions, and the General Assembly, which delivers soft law resolutions. We explore the citation networks formed by the approximately 20,400 resolutions adopted by each organ between 1946 and 2019 to test their differences with regards to the dynamics of citation formation, concentration of citations, and timing. Descriptive results reveal the main periods of jurisprudential activity by the Security Council and the General Assembly, but find no sizeable difference in their overall rate of precedent usage. We apply the Citation Exponential Random Graph Model (cERGM) to test for network determinants of citations and find additional similarities on transitivity and homophily, but also variations regarding preferential attachment.

硬法国际组织使用的法学与软法国际组织不同吗?先例可以是国际组织及其成员的资产,也可以是累赘,这取决于它们的目标和政策领域。将当前的决定与先前商定的决定联系起来有助于增强凝聚力,促进成员之间的共识,并借用权威——这些好处对某些组织来说可能比其他组织更必要。为了比较制定规范的组织的特征是否与他们对法律的偏好相关,我们比较了联合国系统的两个机构:产生约束性决定的安理会和提供软性法律决议的联合国大会。我们研究了1946年至2019年间各机构通过的约20,400项决议所形成的引文网络,以测试它们在引文形成动态、引文集中和时间方面的差异。描述性的结果揭示了安全理事会和大会的法学活动的主要时期,但发现它们在使用先例的总比率方面没有大的差别。我们应用引文指数随机图模型(cERGM)来测试引文的网络决定因素,并发现在传递性和同质性方面的其他相似性,以及关于优先依恋的变化。
{"title":"Jurisprudence in hard and soft law output of international organizations: a network analysis of the use of precedent in UN Security Council and general assembly resolutions","authors":"Rafael Mesquita,&nbsp;Antonio Pires","doi":"10.1007/s10506-024-09416-8","DOIUrl":"10.1007/s10506-024-09416-8","url":null,"abstract":"<div><p>Do hard law international organizations use jurisprudence differently than soft law ones? Precedent can be asset or an encumbrance to international organizations and their members, depending on their aims and on the policy area. Linking current decisions to previously-agreed ones helps to increase cohesion, facilitate consensus among members, and borrow authority – benefits that might be more necessary for some organizations than for others. To compare whether the features of norm-producing organizations correlate with their preference for jurisprudence, we compare two organs from the United Nations system: the Security Council, which produces binding decisions, and the General Assembly, which delivers soft law resolutions. We explore the citation networks formed by the approximately 20,400 resolutions adopted by each organ between 1946 and 2019 to test their differences with regards to the dynamics of citation formation, concentration of citations, and timing. Descriptive results reveal the main periods of jurisprudential activity by the Security Council and the General Assembly, but find no sizeable difference in their overall rate of precedent usage. We apply the Citation Exponential Random Graph Model (cERGM) to test for network determinants of citations and find additional similarities on transitivity and homophily, but also variations regarding preferential attachment.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"1079 - 1108"},"PeriodicalIF":3.1,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145493400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classifying proportionality - identification of a legal argument 分类比例-法律论点的识别
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-03 DOI: 10.1007/s10506-024-09415-9
Kilian Lüders, Bent Stohlmann

Proportionality is a central and globally spread argumentation technique in public law. This article provides a conceptual introduction to proportionality and argues that such a domain-specific form of argumentation is particularly interesting for argument mining. As a major contribution of this article, we share a new dataset for which proportionality has been annotated. The dataset consists of 300 German Federal Constitutional Court decisions annotated at the sentence level (54,929 sentences). In addition to separating textual parts, a fine-grained system of proportionality categories was used. Finally, we used these data for a classification task. We built classifiers that predict whether or not proportionality is invoked in a sentence. We employed several models, including neural and deep learning models and transformers. A BERT-BiLSTM-CRF model performed best.

比例论是公法中一种核心的、在全球广泛传播的论证方法。本文提供了比例性的概念介绍,并认为这种特定于领域的论证形式对于论证挖掘特别有趣。作为本文的主要贡献,我们共享了一个新的数据集,其中对比例性进行了注释。该数据集由300个德国联邦宪法法院判决组成,在句子级别(54,929个句子)进行了注释。除了分离文本部分外,还使用了比例分类的细粒度系统。最后,我们将这些数据用于分类任务。我们建立了分类器来预测句子中是否调用了比例性。我们使用了几个模型,包括神经和深度学习模型以及变形器。BERT-BiLSTM-CRF模型表现最好。
{"title":"Classifying proportionality - identification of a legal argument","authors":"Kilian Lüders,&nbsp;Bent Stohlmann","doi":"10.1007/s10506-024-09415-9","DOIUrl":"10.1007/s10506-024-09415-9","url":null,"abstract":"<div><p>Proportionality is a central and globally spread argumentation technique in public law. This article provides a conceptual introduction to proportionality and argues that such a domain-specific form of argumentation is particularly interesting for argument mining. As a major contribution of this article, we share a new dataset for which proportionality has been annotated. The dataset consists of 300 German Federal Constitutional Court decisions annotated at the sentence level (54,929 sentences). In addition to separating textual parts, a fine-grained system of proportionality categories was used. Finally, we used these data for a classification task. We built classifiers that predict whether or not proportionality is invoked in a sentence. We employed several models, including neural and deep learning models and transformers. A BERT-BiLSTM-CRF model performed best.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"1051 - 1078"},"PeriodicalIF":3.1,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-024-09415-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145493399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: A support system for the detection of abusive clauses in B2C contracts 更正:B2C合同中滥用条款检测支持系统
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-24 DOI: 10.1007/s10506-024-09413-x
Sławomir Dadas, Marek Kozłowski, Rafał Poświata, Michał Perełkiewicz, Marcin Białas, Małgorzata Grębowiec
{"title":"Correction: A support system for the detection of abusive clauses in B2C contracts","authors":"Sławomir Dadas,&nbsp;Marek Kozłowski,&nbsp;Rafał Poświata,&nbsp;Michał Perełkiewicz,&nbsp;Marcin Białas,&nbsp;Małgorzata Grębowiec","doi":"10.1007/s10506-024-09413-x","DOIUrl":"10.1007/s10506-024-09413-x","url":null,"abstract":"","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"953 - 954"},"PeriodicalIF":3.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-024-09413-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145493402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applicability of large language models and generative models for legal case judgement summarization 大语言模型和生成模型在法律案件判决摘要中的适用性
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-15 DOI: 10.1007/s10506-024-09411-z
Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh

Automatic summarization of legal case judgements, which are known to be long and complex, has traditionally been tried via extractive summarization models. In recent years, generative models including abstractive summarization models and Large language models (LLMs) have gained huge popularity. In this paper, we explore the applicability of such models for legal case judgement summarization. We applied various domain-specific abstractive summarization models and general-domain LLMs as well as extractive summarization models over two sets of legal case judgements – from the United Kingdom (UK) Supreme Court and the Indian Supreme Court – and evaluated the quality of the generated summaries. We also perform experiments on a third dataset of legal documents of a different type – Government reports from the United States. Results show that abstractive summarization models and LLMs generally perform better than the extractive methods as per traditional metrics for evaluating summary quality. However, detailed investigation shows the presence of inconsistencies and hallucinations in the outputs of the generative models, and we explore ways to reduce the hallucinations and inconsistencies in the summaries. Overall, the investigation suggests that further improvements are needed to enhance the reliability of abstractive models and LLMs for legal case judgement summarization. At present, a human-in-the-loop technique is more suitable for performing manual checks to identify inconsistencies in the generated summaries.

法律案件判决的自动摘要是冗长而复杂的,传统上是通过抽取摘要模型进行的。近年来,包括抽象摘要模型和大语言模型在内的生成模型得到了广泛的应用。在本文中,我们探讨了这些模型在法律案件判决总结中的适用性。我们在英国最高法院和印度最高法院的两组法律案件判决中应用了各种特定领域的抽象摘要模型和通用领域的法学硕士以及抽取摘要模型,并评估了生成摘要的质量。我们还对另一种类型的法律文件的第三个数据集——美国政府报告进行了实验。结果表明,抽象摘要模型和llm在评价摘要质量方面通常优于传统的提取方法。然而,详细的调查表明,在生成模型的输出中存在不一致和幻觉,我们探索了减少总结中的幻觉和不一致的方法。总体而言,调查表明,在案件判决总结中,抽象模型和llm的可靠性有待进一步提高。目前,人在循环技术更适合于执行手动检查,以识别生成摘要中的不一致之处。
{"title":"Applicability of large language models and generative models for legal case judgement summarization","authors":"Aniket Deroy,&nbsp;Kripabandhu Ghosh,&nbsp;Saptarshi Ghosh","doi":"10.1007/s10506-024-09411-z","DOIUrl":"10.1007/s10506-024-09411-z","url":null,"abstract":"<div><p>Automatic summarization of legal case judgements, which are known to be long and complex, has traditionally been tried via extractive summarization models. In recent years, generative models including abstractive summarization models and Large language models (LLMs) have gained huge popularity. In this paper, we explore the applicability of such models for legal case judgement summarization. We applied various domain-specific abstractive summarization models and general-domain LLMs as well as extractive summarization models over two sets of legal case judgements – from the United Kingdom (UK) Supreme Court and the Indian Supreme Court – and evaluated the quality of the generated summaries. We also perform experiments on a third dataset of legal documents of a different type – Government reports from the United States. Results show that abstractive summarization models and LLMs generally perform better than the extractive methods as per traditional metrics for evaluating summary quality. However, detailed investigation shows the presence of inconsistencies and hallucinations in the outputs of the generative models, and we explore ways to reduce the hallucinations and inconsistencies in the summaries. Overall, the investigation suggests that further improvements are needed to enhance the reliability of abstractive models and LLMs for legal case judgement summarization. At present, a human-in-the-loop technique is more suitable for performing manual checks to identify inconsistencies in the generated summaries.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"1007 - 1050"},"PeriodicalIF":3.1,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141836870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SIM-GCN: similarity graph convolutional networks for charges prediction SIM-GCN:用于电荷预测的相似性图卷积网络
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-13 DOI: 10.1007/s10506-024-09410-0
Qiang Ge, Jing Zhang, Xiaoding Guo

In recent years, the analysis of legal judgments and the prediction of outcomes based on case factual descriptions have become hot research topics in the field of judiciary. Among them, the task of charge prediction aims to predict the applicable charges of a judicial case based on its factual description, making it an important research area in the intelligent judiciary. While significant progress has been made in machine learning and deep learning, traditional methods are limited to handling data in Euclidean space and cannot effectively capture the semantic information in the text. To overcome the limitations of traditional learning approaches, many studies have started exploring the use of graphs to represent rich relationships between entities in text and employing graph convolutional neural networks to learn text representations. In this paper, we propose a charge prediction method based on graph convolutional neural networks. By constructing a similarity graph between cases and utilizing graph convolutional neural networks to learn case feature representations, we can better capture the relational information between cases and improve the accuracy of charge prediction. Experimental results on multiple benchmark datasets demonstrate that our proposed model outperforms traditional methods in charge prediction tasks.

近年来,基于案件事实描述的法律判决分析和结果预测成为司法领域的研究热点。其中,收费预测任务旨在根据司法案件的事实描述,对其适用的收费进行预测,是智能司法的一个重要研究领域。虽然在机器学习和深度学习方面取得了重大进展,但传统的方法仅限于在欧几里得空间中处理数据,不能有效地捕获文本中的语义信息。为了克服传统学习方法的局限性,许多研究开始探索使用图来表示文本中实体之间的丰富关系,并使用图卷积神经网络来学习文本表示。本文提出了一种基于图卷积神经网络的电荷预测方法。通过构建案例之间的相似图,利用图卷积神经网络学习案例特征表示,可以更好地捕捉案例之间的关系信息,提高收费预测的准确性。在多个基准数据集上的实验结果表明,该模型在电荷预测任务上优于传统方法。
{"title":"SIM-GCN: similarity graph convolutional networks for charges prediction","authors":"Qiang Ge,&nbsp;Jing Zhang,&nbsp;Xiaoding Guo","doi":"10.1007/s10506-024-09410-0","DOIUrl":"10.1007/s10506-024-09410-0","url":null,"abstract":"<div><p>In recent years, the analysis of legal judgments and the prediction of outcomes based on case factual descriptions have become hot research topics in the field of judiciary. Among them, the task of charge prediction aims to predict the applicable charges of a judicial case based on its factual description, making it an important research area in the intelligent judiciary. While significant progress has been made in machine learning and deep learning, traditional methods are limited to handling data in Euclidean space and cannot effectively capture the semantic information in the text. To overcome the limitations of traditional learning approaches, many studies have started exploring the use of graphs to represent rich relationships between entities in text and employing graph convolutional neural networks to learn text representations. In this paper, we propose a charge prediction method based on graph convolutional neural networks. By constructing a similarity graph between cases and utilizing graph convolutional neural networks to learn case feature representations, we can better capture the relational information between cases and improve the accuracy of charge prediction. Experimental results on multiple benchmark datasets demonstrate that our proposed model outperforms traditional methods in charge prediction tasks.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"983 - 1005"},"PeriodicalIF":3.1,"publicationDate":"2024-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141650807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From PARIS to LE-PARIS: toward patent response automation with recommender systems and collaborative large language models 从PARIS到LE-PARIS:采用推荐系统和协作式大型语言模型实现专利响应自动化
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-12 DOI: 10.1007/s10506-024-09409-7
Jung-Mei Chu, Hao-Cheng Lo, Jieh Hsiang, Chun-Chieh Cho

In patent prosecution, timely and effective responses to Office Actions (OAs) are crucial for securing patents. However, past automation and artificial intelligence research have largely overlooked this aspect. To bridge this gap, our study introduces the Patent Office Action Response Intelligence System (PARIS) and its advanced version, the Large Language Model (LLM) Enhanced PARIS (LE-PARIS). These systems are designed to enhance the efficiency of patent attorneys in handling OA responses through collaboration with AI. The systems’ key features include the construction of an OA Topics Database, development of Response Templates, and implementation of Recommender Systems and LLM-based Response Generation. To validate the effectiveness of the systems, we have employed a multi-paradigm analysis using the USPTO Office Action database and longitudinal data based on attorney interactions with our systems over six years. Through five studies, we have examined the constructiveness of OA topics (studies 1 and 2) using topic modeling and our proposed Delphi process, the efficacy of our proposed hybrid LLM-based recommender system tailored for OA responses (study 3), the quality of generated responses (study 4), and the systems’ practical value in real-world scenarios through user studies (study 5). The results indicate that both PARIS and LE-PARIS significantly achieve key metrics and have a positive impact on attorney performance.

在专利申请中,及时有效地回应专利局行动(OAs)对于保护专利至关重要。然而,过去的自动化和人工智能研究在很大程度上忽略了这一点。为了弥补这一差距,我们的研究引入了专利局行动响应智能系统(PARIS)及其高级版本,大型语言模型(LLM)增强型PARIS (LE-PARIS)。这些系统旨在通过与人工智能合作,提高专利代理人处理OA响应的效率。该系统的主要特点包括构建OA主题数据库、开发响应模板、实现推荐系统和基于法学硕士的响应生成。为了验证系统的有效性,我们采用了多范式分析,使用了USPTO Office Action数据库和基于律师与我们系统在六年内互动的纵向数据。通过五项研究,我们使用主题建模和我们提出的德尔菲过程检验了OA主题的建设性(研究1和2),我们提出的针对OA响应量身定制的基于llm的混合推荐系统的有效性(研究3),生成的响应的质量(研究4),以及通过用户研究检验了系统在现实场景中的实用价值(研究5)。结果表明,PARIS和LE-PARIS均显著达到关键指标,并对律师绩效产生积极影响。
{"title":"From PARIS to LE-PARIS: toward patent response automation with recommender systems and collaborative large language models","authors":"Jung-Mei Chu,&nbsp;Hao-Cheng Lo,&nbsp;Jieh Hsiang,&nbsp;Chun-Chieh Cho","doi":"10.1007/s10506-024-09409-7","DOIUrl":"10.1007/s10506-024-09409-7","url":null,"abstract":"<div><p>In patent prosecution, timely and effective responses to Office Actions (OAs) are crucial for securing patents. However, past automation and artificial intelligence research have largely overlooked this aspect. To bridge this gap, our study introduces the Patent Office Action Response Intelligence System (PARIS) and its advanced version, the Large Language Model (LLM) Enhanced PARIS (LE-PARIS). These systems are designed to enhance the efficiency of patent attorneys in handling OA responses through collaboration with AI. The systems’ key features include the construction of an OA Topics Database, development of Response Templates, and implementation of Recommender Systems and LLM-based Response Generation. To validate the effectiveness of the systems, we have employed a multi-paradigm analysis using the USPTO Office Action database and longitudinal data based on attorney interactions with our systems over six years. Through five studies, we have examined the constructiveness of OA topics (studies 1 and 2) using topic modeling and our proposed Delphi process, the efficacy of our proposed hybrid LLM-based recommender system tailored for OA responses (study 3), the quality of generated responses (study 4), and the systems’ practical value in real-world scenarios through user studies (study 5). The results indicate that both PARIS and LE-PARIS significantly achieve key metrics and have a positive impact on attorney performance.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 4","pages":"955 - 981"},"PeriodicalIF":3.1,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145493396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The digital transformation of jurisprudence: an evaluation of ChatGPT-4’s applicability to solve cases in business law 法学的数字化转型:评估 ChatGPT-4 在解决商业法案件中的适用性
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-01 DOI: 10.1007/s10506-024-09406-w
Sascha Schweitzer, Markus Conrads

In the evolving landscape of legal information systems, ChatGPT-4 and other advanced conversational agents (CAs) offer the potential to disruptively transform the law industry. This study evaluates commercially available CAs within the German legal context, thereby assessing the generalizability of previous U.S.-based findings. Employing a unique corpus of 200 distinct legal tasks, ChatGPT-4 was benchmarked against Google Bard, Google Gemini, and its predecessor, ChatGPT-3.5. Human-expert and automated assessments of 4000 CA-generated responses reveal ChatGPT-4 to be the first CA to surpass the threshold of solving realistic legal tasks and passing a German business law exam. While ChatGPT-4 outperforms ChatGPT-3.5, Google Bard, and Google Gemini in both consistency and quality, the results demonstrate a considerable degree of variability, especially in complex cases with no predefined response options. Based on these findings, legal professionals should manually verify all texts produced by CAs before use. Novices must exercise caution with CA-generated legal advice, given the expertise needed for its assessment.

在不断发展的法律信息系统中,ChatGPT-4和其他高级对话代理(ca)提供了颠覆性地改变法律行业的潜力。本研究在德国法律背景下评估了商业上可获得的ca,从而评估了以前美国研究结果的普遍性。ChatGPT-4采用了包含200个不同法律任务的独特语料库,与谷歌Bard、谷歌Gemini及其前身ChatGPT-3.5进行了基准测试。人类专家和对4000个CA生成的回复的自动评估表明,ChatGPT-4是第一个超越解决现实法律任务和通过德国商法考试门槛的CA。虽然ChatGPT-4在一致性和质量上都优于ChatGPT-3.5、谷歌Bard和谷歌Gemini,但结果显示出相当大的差异,特别是在没有预定义响应选项的复杂情况下。基于这些发现,法律专业人员应该在使用ca生成的所有文本之前手动验证。鉴于评估所需的专业知识,新手必须谨慎对待核证机关提供的法律意见。
{"title":"The digital transformation of jurisprudence: an evaluation of ChatGPT-4’s applicability to solve cases in business law","authors":"Sascha Schweitzer,&nbsp;Markus Conrads","doi":"10.1007/s10506-024-09406-w","DOIUrl":"10.1007/s10506-024-09406-w","url":null,"abstract":"<div><p>In the evolving landscape of legal information systems, ChatGPT-4 and other advanced conversational agents (CAs) offer the potential to disruptively transform the law industry. This study evaluates commercially available CAs within the German legal context, thereby assessing the generalizability of previous U.S.-based findings. Employing a unique corpus of 200 distinct legal tasks, ChatGPT-4 was benchmarked against Google Bard, Google Gemini, and its predecessor, ChatGPT-3.5. Human-expert and automated assessments of 4000 CA-generated responses reveal ChatGPT-4 to be the first CA to surpass the threshold of solving realistic legal tasks and passing a German business law exam. While ChatGPT-4 outperforms ChatGPT-3.5, Google Bard, and Google Gemini in both consistency and quality, the results demonstrate a considerable degree of variability, especially in complex cases with no predefined response options. Based on these findings, legal professionals should manually verify all texts produced by CAs before use. Novices must exercise caution with CA-generated legal advice, given the expertise needed for its assessment.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"33 3","pages":"847 - 871"},"PeriodicalIF":3.1,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-024-09406-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141707348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence and Law
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1