首页 > 最新文献

Artificial Intelligence and Law最新文献

英文 中文
Improving abstractive summarization of legal rulings through textual entailment 通过文本蕴涵改进法律裁决书的抽象概括
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-11-27 DOI: 10.1007/s10506-021-09305-4
Diego de Vargas Feijo, Viviane P. Moreira

The standard approach for abstractive text summarization is to use an encoder-decoder architecture. The encoder is responsible for capturing the general meaning from the source text, and the decoder is in charge of generating the final text summary. While this approach can compose summaries that resemble human writing, some may contain unrelated or unfaithful information. This problem is called “hallucination” and it represents a serious issue in legal texts as legal practitioners rely on these summaries when looking for precedents, used to support legal arguments. Another concern is that legal documents tend to be very long and may not be fed entirely to the encoder. We propose our method called LegalSumm for addressing these issues by creating different “views” over the source text, training summarization models to generate independent versions of summaries, and applying entailment module to judge how faithful these candidate summaries are with respect to the source text. We show that the proposed approach can select candidate summaries that improve ROUGE scores in all metrics evaluated.

抽象文本摘要的标准方法是使用编码器-解码器架构。编码器负责从源文本中获取一般含义,解码器负责生成最终文本摘要。虽然这种方法可以编写类似于人类写作的摘要,但有些可能包含无关或不忠的信息。这个问题被称为“幻觉”,它代表了法律文本中的一个严重问题,因为法律从业者在寻找先例时依赖这些摘要来支持法律论点。另一个令人担忧的问题是,法律文件往往很长,可能无法完全提供给编码器。我们提出了一种称为LegalSumm的方法来解决这些问题,方法是对源文本创建不同的“视图”,训练摘要模型以生成独立版本的摘要,并应用蕴涵模块来判断这些候选摘要对源文本的忠实程度。我们表明,所提出的方法可以选择在所有评估指标中提高ROGE分数的候选摘要。
{"title":"Improving abstractive summarization of legal rulings through textual entailment","authors":"Diego de Vargas Feijo,&nbsp;Viviane P. Moreira","doi":"10.1007/s10506-021-09305-4","DOIUrl":"10.1007/s10506-021-09305-4","url":null,"abstract":"<div><p>The standard approach for abstractive text summarization is to use an encoder-decoder architecture. The encoder is responsible for capturing the general meaning from the source text, and the decoder is in charge of generating the final text summary. While this approach can compose summaries that resemble human writing, some may contain unrelated or unfaithful information. This problem is called “hallucination” and it represents a serious issue in legal texts as legal practitioners rely on these summaries when looking for precedents, used to support legal arguments. Another concern is that legal documents tend to be very long and may not be fed entirely to the encoder. We propose our method called LegalSumm for addressing these issues by creating different “views” over the source text, training summarization models to generate independent versions of summaries, and applying entailment module to judge how faithful these candidate summaries are with respect to the source text. We show that the proposed approach can select candidate summaries that improve ROUGE scores in all metrics evaluated.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"31 1","pages":"91 - 113"},"PeriodicalIF":4.1,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47866335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
DeepRhole: deep learning for rhetorical role labeling of sentences in legal case documents DeepRhole:法律案件文件中句子修辞角色标注的深度学习
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-11-13 DOI: 10.1007/s10506-021-09304-5
Paheli Bhattacharya, Shounak Paul, Kripabandhu Ghosh, Saptarshi Ghosh, Adam Wyner

The task of rhetorical role labeling is to assign labels (such as Fact, Argument, Final Judgement, etc.) to sentences of a court case document. Rhetorical role labeling is an important problem in the field of Legal Analytics, since it can aid in various downstream tasks as well as enhances the readability of lengthy case documents. The task is challenging as case documents are highly various in structure and the rhetorical labels are often subjective. Previous works for automatic rhetorical role identification (i) mainly used Conditional Random Fields over manually handcrafted features, and (ii) focused on certain law domains only (e.g., Immigration cases, Rent law), and a particular jurisdiction/country (e.g., US, Canada, India). In this work, we improve upon the prior works on rhetorical role identification by proposing novel Deep Learning models for automatically identifying rhetorical roles, which substantially outperform the prior methods. Additionally, we show the effectiveness of the proposed models over documents from five different law domains, and from two different jurisdictions—the Supreme Court of India and the Supreme Court of the UK. Through extensive experiments over different variations of the Deep Learning models, including Transformer models based on BERT and LegalBERT, we show the robustness of the methods for the task. We also perform an extensive inter-annotator study and analyse the agreement of the predictions of the proposed model with the annotations by domain experts. We find that some rhetorical labels are inherently hard/subjective and both law experts and neural models frequently get confused in predicting them correctly.

修辞角色标签的任务是为法庭案件文件的句子分配标签(如事实、论据、终审判决等)。修辞角色标签是法律分析领域的一个重要问题,因为它可以帮助完成各种下游任务,并提高冗长案件文件的可读性。这项任务具有挑战性,因为案例文件的结构高度多样,修辞标签往往是主观的。先前的自动修辞角色识别工作(i)主要使用条件随机场而不是手动手工制作的特征,以及(ii)仅关注某些法律领域(例如,移民案件、租金法)和特定管辖区/国家(例如,美国、加拿大、印度)。在这项工作中,我们通过提出新的用于自动识别修辞角色的深度学习模型来改进先前关于修辞角色识别的工作,该模型大大优于先前的方法。此外,我们还展示了所提出的模型对来自五个不同法律领域和两个不同司法管辖区(印度最高法院和英国最高法院)的文件的有效性。通过对深度学习模型的不同变体进行广泛实验,包括基于BERT和LegalBERT的Transformer模型,我们展示了该任务方法的稳健性。我们还进行了广泛的注释器间研究,并分析了所提出的模型的预测与领域专家的注释的一致性。我们发现,一些修辞标签本质上是硬/主观的,法律专家和神经模型在正确预测它们时经常会感到困惑。
{"title":"DeepRhole: deep learning for rhetorical role labeling of sentences in legal case documents","authors":"Paheli Bhattacharya,&nbsp;Shounak Paul,&nbsp;Kripabandhu Ghosh,&nbsp;Saptarshi Ghosh,&nbsp;Adam Wyner","doi":"10.1007/s10506-021-09304-5","DOIUrl":"10.1007/s10506-021-09304-5","url":null,"abstract":"<div><p>The task of rhetorical role labeling is to assign labels (such as Fact, Argument, Final Judgement, etc.) to sentences of a court case document. Rhetorical role labeling is an important problem in the field of Legal Analytics, since it can aid in various downstream tasks as well as enhances the readability of lengthy case documents. The task is challenging as case documents are highly various in structure and the rhetorical labels are often subjective. Previous works for automatic rhetorical role identification (i) mainly used Conditional Random Fields over manually handcrafted features, and (ii) focused on certain law domains only (e.g., Immigration cases, Rent law), and a particular jurisdiction/country (e.g., US, Canada, India). In this work, we improve upon the prior works on rhetorical role identification by proposing novel Deep Learning models for automatically identifying rhetorical roles, which substantially outperform the prior methods. Additionally, we show the effectiveness of the proposed models over documents from five different law domains, and from two different jurisdictions—the Supreme Court of India and the Supreme Court of the UK. Through extensive experiments over different variations of the Deep Learning models, including Transformer models based on BERT and LegalBERT, we show the robustness of the methods for the task. We also perform an extensive inter-annotator study and analyse the agreement of the predictions of the proposed model with the annotations by domain experts. We find that some rhetorical labels are inherently hard/subjective and both law experts and neural models frequently get confused in predicting them correctly.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"31 1","pages":"53 - 90"},"PeriodicalIF":4.1,"publicationDate":"2021-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44962668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Human-Algorithm Interaction: Algorithmic Pricing in Hybrid Laboratory Markets 人-算法交互:混合实验室市场中的算法定价
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-10-29 DOI: 10.2139/ssrn.3840789
Hans-Theo Normann, Martin Sternberg
This paper investigates pricing in laboratory markets when human players interact with an algorithm. We compare the degree of competition when exclusively humans interact to the case of one firm delegating its decisions to an algorithm. We further vary whether participants know about the presence of the algorithm. When one of three firms in a market is an algorithm, we observe significantly higher prices compared to humanonly markets. Firms employing an algorithm earn significantly less profit than their rivals. For four-firm markets, we find no significant differences. (Un)certainty about the actual presence of an algorithm does not significantly affect collusion, although humans seem to perceive algorithms as more disruptive.
本文研究了人类玩家与算法交互时实验室市场的定价。我们将纯人类互动时的竞争程度与一家公司将其决策委托给算法的情况进行比较。我们进一步改变参与者是否知道算法的存在。当市场中的三家公司中有一家是算法时,我们观察到的价格明显高于只有人类的市场。采用算法的公司比他们的竞争对手赚取的利润少得多。对于四家公司的市场,我们没有发现显著差异。对算法实际存在的不确定性不会显著影响共谋,尽管人类似乎认为算法更具破坏性。
{"title":"Human-Algorithm Interaction: Algorithmic Pricing in Hybrid Laboratory Markets","authors":"Hans-Theo Normann, Martin Sternberg","doi":"10.2139/ssrn.3840789","DOIUrl":"https://doi.org/10.2139/ssrn.3840789","url":null,"abstract":"This paper investigates pricing in laboratory markets when human players interact with an algorithm. We compare the degree of competition when exclusively humans interact to the case of one firm delegating its decisions to an algorithm. We further vary whether participants know about the presence of the algorithm. When one of three firms in a market is an algorithm, we observe significantly higher prices compared to humanonly markets. Firms employing an algorithm earn significantly less profit than their rivals. For four-firm markets, we find no significant differences. (Un)certainty about the actual presence of an algorithm does not significantly affect collusion, although humans seem to perceive algorithms as more disruptive.","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"67 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78899557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Towards a machine understanding of Malawi legal text 实现对马拉维法律文本的机器理解
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-10-23 DOI: 10.1007/s10506-021-09303-6
Amelia V. Taylor, Eva Mfutso-Bengo

Legal professionals in Malawi rely on a limited number of textbooks, outdated law reports and inadequate library services. Most documents available are in image form, are un-structured, i.e. contain no useful legal meta-data, summaries, keynotes, and do not support a system of citation that is essential to legal research. While advances in document processing and machine learning have benefited many fields, legal research is still only marginally affected. In this interdisciplinary research, the authors build semi-automatic tools for creating a corpus of Malawi criminal law decisions annotated with legal meta-data, case and law citations. We used this corpus to extract legal meta-data, including law and case citations as used in Malawi by employing machine learning tools, spaCy and Gensim LDA. We set the foundation for a new methodology for classifying Malawi criminal case law according to the recently introduced International Classification of Crime for Statistical Purposes (ICCS).

马拉维的法律专业人员依赖数量有限的教科书、过时的法律报告和不足的图书馆服务。大多数可用的文件都是图像形式的,没有结构,即不包含有用的法律元数据、摘要、主题演讲,并且不支持对法律研究至关重要的引用系统。尽管文档处理和机器学习的进步使许多领域受益,但法律研究仍然只受到轻微影响。在这项跨学科研究中,作者构建了半自动工具,用于创建马拉维刑法判决语料库,并用法律元数据、案例和法律引文进行注释。我们使用该语料库提取法律元数据,包括马拉维使用的法律和案例引文,方法是使用机器学习工具spaCy和Gensim LDA。我们为根据最近推出的《国际统计犯罪分类法》对马拉维刑事判例法进行分类的新方法奠定了基础。
{"title":"Towards a machine understanding of Malawi legal text","authors":"Amelia V. Taylor,&nbsp;Eva Mfutso-Bengo","doi":"10.1007/s10506-021-09303-6","DOIUrl":"10.1007/s10506-021-09303-6","url":null,"abstract":"<div><p>Legal professionals in Malawi rely on a limited number of textbooks, outdated law reports and inadequate library services. Most documents available are in image form, are un-structured, i.e. contain no useful legal meta-data, summaries, keynotes, and do not support a system of citation that is essential to legal research. While advances in document processing and machine learning have benefited many fields, legal research is still only marginally affected. In this interdisciplinary research, the authors build semi-automatic tools for creating a corpus of Malawi criminal law decisions annotated with legal meta-data, case and law citations. We used this corpus to extract legal meta-data, including law and case citations as used in Malawi by employing machine learning tools, spaCy and Gensim LDA. We set the foundation for a new methodology for classifying Malawi criminal case law according to the recently introduced International Classification of Crime for Statistical Purposes (ICCS).</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"31 1","pages":"1 - 11"},"PeriodicalIF":4.1,"publicationDate":"2021-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45360285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Contract as automaton: representing a simple financial agreement in computational form 作为自动机的合同:以计算形式表示简单的财务协议
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-10-13 DOI: 10.1007/s10506-021-09300-9
Mark D. Flood, Oliver R. Goodenough

We show that the fundamental legal structure of a well-written financial contract follows a state-transition logic that can be formalized mathematically as a finite-state machine (specifically, a deterministic finite automaton or DFA). The automaton defines the states that a financial relationship can be in, such as “default,” “delinquency,” “performing,” etc., and it defines an “alphabet” of events that can trigger state transitions, such as “payment arrives,” “due date passes,” etc. The core of a contract describes the rules by which different sequences of events trigger particular sequences of state transitions in the relationship between the counterparties. By conceptualizing and representing the legal structure of a contract in this way, we expose it to a range of powerful tools and results from the theory of computation. These allow, for example, automated reasoning to determine whether a contract is internally coherent and whether it is complete relative to a particular event alphabet. We illustrate the process by representing a simple loan agreement as an automaton.

我们证明,写得好的金融合同的基本法律结构遵循状态转换逻辑,该逻辑可以在数学上形式化为有限状态机(特别是确定性有限自动机或DFA)。自动机定义了财务关系可能处于的状态,如“违约”、“拖欠”、“执行”等,并定义了可以触发状态转换的事件的“字母表”,如“付款到达”、“到期日过去”等。合同的核心描述了不同事件序列触发交易对手之间关系中特定状态转换序列的规则。通过以这种方式概念化和表示合同的法律结构,我们将其暴露在一系列强大的工具和计算理论的结果中。例如,这些允许自动推理来确定合同是否在内部连贯,以及它相对于特定事件字母表是否完整。我们通过将一个简单的贷款协议表示为一个自动机来说明这个过程。
{"title":"Contract as automaton: representing a simple financial agreement in computational form","authors":"Mark D. Flood,&nbsp;Oliver R. Goodenough","doi":"10.1007/s10506-021-09300-9","DOIUrl":"10.1007/s10506-021-09300-9","url":null,"abstract":"<div><p>We show that the fundamental legal structure of a well-written financial contract follows a state-transition logic that can be formalized mathematically as a finite-state machine (specifically, a deterministic finite automaton or DFA). The automaton defines the states that a financial relationship can be in, such as “default,” “delinquency,” “performing,” etc., and it defines an “alphabet” of events that can trigger state transitions, such as “payment arrives,” “due date passes,” etc. The core of a contract describes the rules by which different sequences of events trigger particular sequences of state transitions in the relationship between the counterparties. By conceptualizing and representing the legal structure of a contract in this way, we expose it to a range of powerful tools and results from the theory of computation. These allow, for example, automated reasoning to determine whether a contract is internally coherent and whether it is complete relative to a particular event alphabet. We illustrate the process by representing a simple loan agreement as an automaton.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"30 3","pages":"391 - 416"},"PeriodicalIF":4.1,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-021-09300-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46590930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Copyright Protection in a Digital Environment: Some Introspection 数字环境下的版权保护:一些反思
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-09-15 DOI: 10.2139/ssrn.3924212
J. Kevins
This paper examines the hindrances to Copyright Protection in the digital era. The author is of the view that there are six factors that pose as a challenge and in equal measure presents remedies to mitigate the challenges.
本文探讨了数字时代版权保护的障碍。提交人认为,有六个因素构成挑战,并在同等程度上提出减轻挑战的补救办法。
{"title":"Copyright Protection in a Digital Environment: Some Introspection","authors":"J. Kevins","doi":"10.2139/ssrn.3924212","DOIUrl":"https://doi.org/10.2139/ssrn.3924212","url":null,"abstract":"This paper examines the hindrances to Copyright Protection in the digital era. The author is of the view that there are six factors that pose as a challenge and in equal measure presents remedies to mitigate the challenges.","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"20 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74339660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised law article mining based on deep pre-trained language representation models with application to the Italian civil code 基于深度预训练语言表示模型的无监督法律文章挖掘及其在意大利民法典中的应用
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-09-15 DOI: 10.1007/s10506-021-09301-8
Andrea Tagarelli, Andrea Simeri

Modeling law search and retrieval as prediction problems has recently emerged as a predominant approach in law intelligence. Focusing on the law article retrieval task, we present a deep learning framework named LamBERTa, which is designed for civil-law codes, and specifically trained on the Italian civil code. To our knowledge, this is the first study proposing an advanced approach to law article prediction for the Italian legal system based on a BERT (Bidirectional Encoder Representations from Transformers) learning framework, which has recently attracted increased attention among deep learning approaches, showing outstanding effectiveness in several natural language processing and learning tasks. We define LamBERTa models by fine-tuning an Italian pre-trained BERT on the Italian civil code or its portions, for law article retrieval as a classification task. One key aspect of our LamBERTa framework is that we conceived it to address an extreme classification scenario, which is characterized by a high number of classes, the few-shot learning problem, and the lack of test query benchmarks for Italian legal prediction tasks. To solve such issues, we define different methods for the unsupervised labeling of the law articles, which can in principle be applied to any law article code system. We provide insights into the explainability and interpretability of our LamBERTa models, and we present an extensive experimental analysis over query sets of different type, for single-label as well as multi-label evaluation tasks. Empirical evidence has shown the effectiveness of LamBERTa, and also its superiority against widely used deep-learning text classifiers and a few-shot learner conceived for an attribute-aware prediction task.

将法律搜索和检索建模为预测问题最近已成为法律智能中的一种主要方法。围绕法律文章检索任务,我们提出了一个名为LamBERTa的深度学习框架,该框架是为民法典设计的,并专门针对意大利民法典进行了培训。据我们所知,这是第一项基于BERT(变压器双向编码器表示)学习框架为意大利法律系统提出法律文章预测高级方法的研究,该方法最近在深度学习方法中引起了越来越多的关注,在一些自然语言处理和学习任务中表现出了突出的有效性。我们通过在意大利民法典或其部分上微调意大利预先训练的BERT来定义LamBERTa模型,将法律文章检索作为一项分类任务。我们的LamBERTa框架的一个关键方面是,我们认为它是为了解决一个极端的分类场景,其特点是类数量多,镜头学习问题少,并且缺乏意大利法律预测任务的测试查询基准。为了解决这些问题,我们定义了不同的方法来对法律条文进行无监督标记,原则上可以应用于任何法律条文编码系统。我们深入了解了LamBERTa模型的可解释性和可解释性,并对不同类型的查询集进行了广泛的实验分析,用于单标签和多标签评估任务。经验证据表明了LamBERTa的有效性,以及它相对于广泛使用的深度学习文本分类器和为属性感知预测任务设计的少数镜头学习器的优势。
{"title":"Unsupervised law article mining based on deep pre-trained language representation models with application to the Italian civil code","authors":"Andrea Tagarelli,&nbsp;Andrea Simeri","doi":"10.1007/s10506-021-09301-8","DOIUrl":"10.1007/s10506-021-09301-8","url":null,"abstract":"<div><p>Modeling law search and retrieval as prediction problems has recently emerged as a predominant approach in law intelligence. Focusing on the law article retrieval task, we present a deep learning framework named LamBERTa, which is designed for civil-law codes, and specifically trained on the Italian civil code. To our knowledge, this is the first study proposing an advanced approach to law article prediction for the Italian legal system based on a BERT (Bidirectional Encoder Representations from Transformers) learning framework, which has recently attracted increased attention among deep learning approaches, showing outstanding effectiveness in several natural language processing and learning tasks. We define LamBERTa models by fine-tuning an Italian pre-trained BERT on the Italian civil code or its portions, for law article retrieval as a classification task. One key aspect of our LamBERTa framework is that we conceived it to address an extreme classification scenario, which is characterized by a high number of classes, the few-shot learning problem, and the lack of test query benchmarks for Italian legal prediction tasks. To solve such issues, we define different methods for the unsupervised labeling of the law articles, which can in principle be applied to any law article code system. We provide insights into the explainability and interpretability of our LamBERTa models, and we present an extensive experimental analysis over query sets of different type, for single-label as well as multi-label evaluation tasks. Empirical evidence has shown the effectiveness of LamBERTa, and also its superiority against widely used deep-learning text classifiers and a few-shot learner conceived for an attribute-aware prediction task.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"30 3","pages":"417 - 473"},"PeriodicalIF":4.1,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-021-09301-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42634721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Correction to: A review of predictive policing from the perspective of fairness 更正:从公平的角度回顾预测性警务
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-08-28 DOI: 10.1007/s10506-021-09299-z
Kiana Alikhademi, Emma Drobina, Diandra Prioleau, Brianna Richardson, Duncan Purves, Juan E. Gilbert
{"title":"Correction to: A review of predictive policing from the perspective of fairness","authors":"Kiana Alikhademi,&nbsp;Emma Drobina,&nbsp;Diandra Prioleau,&nbsp;Brianna Richardson,&nbsp;Duncan Purves,&nbsp;Juan E. Gilbert","doi":"10.1007/s10506-021-09299-z","DOIUrl":"10.1007/s10506-021-09299-z","url":null,"abstract":"","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"30 1","pages":"19 - 20"},"PeriodicalIF":4.1,"publicationDate":"2021-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10506-021-09299-z","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48462414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Logical English meets legal English for swaps and derivatives 交换和衍生品的逻辑英语与法律英语
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-08-12 DOI: 10.1007/s10506-021-09295-3
Robert Kowalski, Akber Datoo

In this paper, we present an informal introduction to Logical English (LE) and illustrate its use to standardise the legal wording of the Automatic Early Termination (AET) clauses of International Swaps and Derivatives Association (ISDA) Agreements. LE can be viewed both as an alternative to conventional legal English for expressing legal documents, and as an alternative to conventional computer languages for automating legal documents. LE is a controlled natural language (CNL), which is designed both to be computer-executable and to be readable by English speakers without special training. The basic form of LE is syntactic sugar for logic programs, in which all sentences have the same standard form, either as rules of the form conclusion if conditions or as unconditional sentences of the form conclusion. However, LE extends normal logic programming by introducing features that are present in other computer languages and other logics. These features include typed variables signalled by common nouns, and existentially quantified variables in the conclusions of sentences signalled by indefinite articles. Although LE translates naturally into a logic programming language such as Prolog or ASP, it can also serve as a neutral standard, which can be compiled into other lower-level computer languages.

在本文中,我们对逻辑英语(LE)进行了非正式介绍,并说明了它用于标准化国际掉期和衍生品协会(ISDA)协议中自动提前终止(AET)条款的法律措辞。LE既可以被视为表达法律文件的传统法律英语的替代品,也可以被视是实现法律文件自动化的传统计算机语言的替代品。LE是一种受控自然语言(CNL),它被设计为计算机可执行和英语使用者无需特殊培训即可阅读。LE的基本形式是逻辑程序的句法糖,其中所有句子都有相同的标准形式,要么作为条件条件条件下的形式结论的规则,要么作为形式结论的无条件句子。然而,LE通过引入其他计算机语言和其他逻辑中存在的特性来扩展普通逻辑编程。这些特征包括常见名词发出的类型化变量,以及不定冠词发出的句子结论中存在的量化变量。尽管LE可以自然地翻译成逻辑编程语言,如Prolog或ASP,但它也可以作为一种中性标准,可以编译成其他较低级别的计算机语言。
{"title":"Logical English meets legal English for swaps and derivatives","authors":"Robert Kowalski,&nbsp;Akber Datoo","doi":"10.1007/s10506-021-09295-3","DOIUrl":"10.1007/s10506-021-09295-3","url":null,"abstract":"<div><p>In this paper, we present an informal introduction to Logical English (LE) and illustrate its use to standardise the legal wording of the Automatic Early Termination (AET) clauses of International Swaps and Derivatives Association (ISDA) Agreements. LE can be viewed both as an alternative to conventional legal English for expressing legal documents, and as an alternative to conventional computer languages for automating legal documents. LE is a controlled natural language (CNL), which is designed both to be computer-executable and to be readable by English speakers without special training. The basic form of LE is syntactic sugar for logic programs, in which all sentences have the same standard form, either as rules of the form <i>conclusion if conditions</i> or as unconditional sentences of the form <i>conclusion.</i> However, LE extends normal logic programming by introducing features that are present in other computer languages and other logics. These features include typed variables signalled by common nouns, and existentially quantified variables in the <i>conclusions</i> of sentences signalled by indefinite articles. Although LE translates naturally into a logic programming language such as Prolog or ASP, it can also serve as a neutral standard, which can be compiled into other lower-level computer languages.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"30 2","pages":"163 - 197"},"PeriodicalIF":4.1,"publicationDate":"2021-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10506-021-09295-3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47103960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
PRILJ: an efficient two-step method based on embedding and clustering for the identification of regularities in legal case judgments PRILJ:一种基于嵌入和聚类的有效的两步法,用于识别法律案件判决中的规则性
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-08-04 DOI: 10.1007/s10506-021-09297-1
Graziella De Martino, Gianvito Pio, Michelangelo Ceci

In an era characterized by fast technological progress that introduces new unpredictable scenarios every day, working in the law field may appear very difficult, if not supported by the right tools. In this respect, some systems based on Artificial Intelligence methods have been proposed in the literature, to support several tasks in the legal sector. Following this line of research, in this paper we propose a novel method, called PRILJ, that identifies paragraph regularities in legal case judgments, to support legal experts during the redaction of legal documents. Methodologically, PRILJ adopts a two-step approach that first groups documents into clusters, according to their semantic content, and then identifies regularities in the paragraphs for each cluster. Embedding-based methods are adopted to properly represent documents and paragraphs into a semantic numerical feature space, and an Approximated Nearest Neighbor Search method is adopted to efficiently retrieve the most similar paragraphs with respect to the paragraphs of a document under preparation. Our extensive experimental evaluation, performed on a real-world dataset provided by EUR-Lex, proves the effectiveness and the efficiency of the proposed method. In particular, its ability of modeling different topics of legal documents, as well as of capturing the semantics of the textual content, appear very beneficial for the considered task, and make PRILJ very robust to the possible presence of noise in the data.

在一个技术进步迅速、每天都会出现新的不可预测的场景的时代,如果没有正确的工具支持,在法律领域工作可能会显得非常困难。在这方面,文献中提出了一些基于人工智能方法的系统,以支持法律部门的多项任务。根据这一研究思路,在本文中,我们提出了一种新的方法,称为PRILJ,用于识别法律案件判决中的段落规则,以支持法律专家在法律文件的编辑过程中。在方法上,PRILJ采用了两步方法,首先根据文档的语义内容将其分组,然后为每个聚类确定段落中的规律。采用基于嵌入的方法将文档和段落适当地表示到语义数字特征空间中,并采用近似最近邻搜索方法来有效地检索与正在准备的文档的段落最相似的段落。我们在EUR-Lex提供的真实世界数据集上进行了广泛的实验评估,证明了所提出方法的有效性和效率。特别是,它对法律文件的不同主题建模的能力,以及捕获文本内容的语义的能力,似乎对所考虑的任务非常有益,并使PRILJ对数据中可能存在的噪声非常鲁棒。
{"title":"PRILJ: an efficient two-step method based on embedding and clustering for the identification of regularities in legal case judgments","authors":"Graziella De Martino,&nbsp;Gianvito Pio,&nbsp;Michelangelo Ceci","doi":"10.1007/s10506-021-09297-1","DOIUrl":"10.1007/s10506-021-09297-1","url":null,"abstract":"<div><p>In an era characterized by fast technological progress that introduces new unpredictable scenarios every day, working in the law field may appear very difficult, if not supported by the right tools. In this respect, some systems based on Artificial Intelligence methods have been proposed in the literature, to support several tasks in the legal sector. Following this line of research, in this paper we propose a novel method, called PRILJ, that identifies paragraph regularities in legal case judgments, to support legal experts during the redaction of legal documents. Methodologically, PRILJ adopts a two-step approach that first groups documents into clusters, according to their semantic content, and then identifies regularities in the paragraphs for each cluster. Embedding-based methods are adopted to properly represent documents and paragraphs into a semantic numerical feature space, and an Approximated Nearest Neighbor Search method is adopted to efficiently retrieve the most similar paragraphs with respect to the paragraphs of a document under preparation. Our extensive experimental evaluation, performed on a real-world dataset provided by EUR-Lex, proves the effectiveness and the efficiency of the proposed method. In particular, its ability of modeling different topics of legal documents, as well as of capturing the semantics of the textual content, appear very beneficial for the considered task, and make PRILJ very robust to the possible presence of noise in the data.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"30 3","pages":"359 - 390"},"PeriodicalIF":4.1,"publicationDate":"2021-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-021-09297-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43617883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
Artificial Intelligence and Law
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1