首页 > 最新文献

Artificial Intelligence and Law最新文献

英文 中文
Methods of incorporating common element characteristics for law article prediction 法律条文预测中融入共同元素特征的方法
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-04-24 DOI: 10.1007/s10506-023-09359-6
Yifan Hou, Ge Cheng, Yun Zhang, Dongliang Zhang

Law article prediction is a task of predicting the relevant laws and regulations involved in a case according to the description text of the case, and it has broad application prospects in improving judicial efficiency. In the existing research work, researchers often only consider a single case, employing the neural network method to extract features for prediction, which lack the mining of related and common element information between different data. In order to solve this problem, we propose a law article prediction method that integrates the characteristics of common elements. It can effectively utilize the co-occurrence information of the training data, fully mine the relevant common elements between cases, and fuse local features. Experiments show that our method performs well.

法条预测是根据案件描述文本对案件涉及的相关法律法规进行预测的一项工作,在提高司法效率方面具有广阔的应用前景。在现有的研究工作中,研究者往往只考虑单个案件,采用神经网络方法提取特征进行预测,缺乏对不同数据间关联和共性要素信息的挖掘。为了解决这一问题,我们提出了一种整合共性要素特征的法条预测方法。它能有效利用训练数据的共现信息,充分挖掘案例间的相关共性要素,并融合局部特征。实验表明,我们的方法性能良好。
{"title":"Methods of incorporating common element characteristics for law article prediction","authors":"Yifan Hou,&nbsp;Ge Cheng,&nbsp;Yun Zhang,&nbsp;Dongliang Zhang","doi":"10.1007/s10506-023-09359-6","DOIUrl":"10.1007/s10506-023-09359-6","url":null,"abstract":"<div><p>Law article prediction is a task of predicting the relevant laws and regulations involved in a case according to the description text of the case, and it has broad application prospects in improving judicial efficiency. In the existing research work, researchers often only consider a single case, employing the neural network method to extract features for prediction, which lack the mining of related and common element information between different data. In order to solve this problem, we propose a law article prediction method that integrates the characteristics of common elements. It can effectively utilize the co-occurrence information of the training data, fully mine the relevant common elements between cases, and fuse local features. Experiments show that our method performs well.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 2","pages":"487 - 503"},"PeriodicalIF":3.1,"publicationDate":"2023-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47160130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The black box problem revisited. Real and imaginary challenges for automated legal decision making 黑匣子问题再次出现。自动化法律决策面临的现实和想象挑战
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-04-04 DOI: 10.1007/s10506-023-09356-9
Bartosz Brożek, Michał Furman, Marek Jakubiec, Bartłomiej Kucharzyk

This paper addresses the black-box problem in artificial intelligence (AI), and the related problem of explainability of AI in the legal context. We argue, first, that the black box problem is, in fact, a superficial one as it results from an overlap of four different – albeit interconnected – issues: the opacity problem, the strangeness problem, the unpredictability problem, and the justification problem. Thus, we propose a framework for discussing both the black box problem and the explainability of AI. We argue further that contrary to often defended claims the opacity issue is not a genuine problem. We also dismiss the justification problem. Further, we describe the tensions involved in the strangeness and unpredictability problems and suggest some ways to alleviate them.

本文探讨了人工智能(AI)中的黑箱问题,以及与之相关的人工智能在法律背景下的可解释性问题。首先,我们认为黑箱问题实际上是一个表面问题,因为它是由四个不同问题--尽管相互关联--的重叠造成的,这四个问题是:不透明性问题、陌生性问题、不可预测性问题和正当性问题。因此,我们提出了一个讨论黑箱问题和人工智能可解释性的框架。我们进一步指出,与经常被辩护的说法相反,不透明问题并不是一个真正的问题。我们还否定了合理性问题。此外,我们还描述了奇怪性和不可预测性问题所涉及的矛盾,并提出了一些缓解这些矛盾的方法。
{"title":"The black box problem revisited. Real and imaginary challenges for automated legal decision making","authors":"Bartosz Brożek,&nbsp;Michał Furman,&nbsp;Marek Jakubiec,&nbsp;Bartłomiej Kucharzyk","doi":"10.1007/s10506-023-09356-9","DOIUrl":"10.1007/s10506-023-09356-9","url":null,"abstract":"<div><p>This paper addresses the black-box problem in artificial intelligence (AI), and the related problem of explainability of AI in the legal context. We argue, first, that the black box problem is, in fact, a superficial one as it results from an overlap of four different – albeit interconnected – issues: the opacity problem, the strangeness problem, the unpredictability problem, and the justification problem. Thus, we propose a framework for discussing both the black box problem and the explainability of AI. We argue further that contrary to often defended claims the opacity issue is not a genuine problem. We also dismiss the justification problem. Further, we describe the tensions involved in the strangeness and unpredictability problems and suggest some ways to alleviate them.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 2","pages":"427 - 440"},"PeriodicalIF":3.1,"publicationDate":"2023-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-023-09356-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46797808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The use of AI in legal systems: determining independent contractor vs. employee status. 人工智能在法律体系中的使用:确定独立承包商与员工身份。
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-03-30 DOI: 10.1007/s10506-023-09353-y
Maxime C Cohen, Samuel Dahan, Warut Khern-Am-Nuai, Hajime Shimao, Jonathan Touboul

The use of artificial intelligence (AI) to aid legal decision making has become prominent. This paper investigates the use of AI in a critical issue in employment law, the determination of a worker's status-employee vs. independent contractor-in two common law countries (the U.S. and Canada). This legal question has been a contentious labor issue insofar as independent contractors are not eligible for the same benefits as employees. It has become an important societal issue due to the ubiquity of the gig economy and the recent disruptions in employment arrangements. To address this problem, we collected, annotated, and structured the data for all Canadian and Californian court cases related to this legal question between 2002 and 2021, resulting in 538 Canadian cases and 217 U.S. cases. In contrast to legal literature focusing on complex and correlated characteristics of the employment relationship, our statistical analyses of the data show very strong correlations between the worker's status and a small subset of quantifiable characteristics of the employment relationship. In fact, despite the variety of situations in the case law, we show that simple, off-the-shelf AI models classify the cases with an out-of-sample accuracy of more than 90%. Interestingly, the analysis of misclassified cases reveals consistent misclassification patterns by most algorithms. Legal analyses of these cases led us to identify how equity is ensured by judges in ambiguous situations. Finally, our findings have practical implications for access to legal advice and justice. We deployed our AI model via the open-access platform, https://MyOpenCourt.org/, to help users answer employment legal questions. This platform has already assisted many Canadian users, and we hope it will help democratize access to legal advice to large crowds.

使用人工智能(AI)来帮助法律决策已经变得突出。本文调查了人工智能在两个普通法国家(美国和加拿大)就业法中的一个关键问题,即工人身份的确定——雇员与独立承包商。这个法律问题一直是一个有争议的劳工问题,因为独立承包商没有资格获得与员工相同的福利。由于零工经济的普遍存在和最近就业安排的混乱,这已成为一个重要的社会问题。为了解决这个问题,我们收集、注释和结构化了2002年至2021年间与这个法律问题有关的所有加拿大和加利福尼亚法院案件的数据,导致538起加拿大案件和217起美国案件。与关注就业关系的复杂和相关特征的法律文献不同,我们对数据的统计分析显示,工人的地位与就业关系的一小部分可量化特征之间存在非常强的相关性。事实上,尽管判例法中的情况多种多样,但我们发现,简单的现成人工智能模型对案件进行分类,样本外准确率超过90%。有趣的是,对错误分类案例的分析揭示了大多数算法一致的错误分类模式。对这些案件的法律分析使我们能够确定法官在模棱两可的情况下是如何确保公平的。最后,我们的调查结果对获得法律咨询和司法公正具有实际意义。我们通过开放访问平台部署了我们的人工智能模型,https://MyOpenCourt.org/,帮助用户回答就业法律问题。这个平台已经为许多加拿大用户提供了帮助,我们希望它将有助于使大量人群获得法律咨询的民主化。
{"title":"The use of AI in legal systems: determining independent contractor vs. employee status.","authors":"Maxime C Cohen, Samuel Dahan, Warut Khern-Am-Nuai, Hajime Shimao, Jonathan Touboul","doi":"10.1007/s10506-023-09353-y","DOIUrl":"10.1007/s10506-023-09353-y","url":null,"abstract":"<p><p>The use of artificial intelligence (AI) to aid legal decision making has become prominent. This paper investigates the use of AI in a critical issue in employment law, the determination of a worker's status-employee vs. independent contractor-in two common law countries (the U.S. and Canada). This legal question has been a contentious labor issue insofar as independent contractors are not eligible for the same benefits as employees. It has become an important societal issue due to the ubiquity of the gig economy and the recent disruptions in employment arrangements. To address this problem, we collected, annotated, and structured the data for all Canadian and Californian court cases related to this legal question between 2002 and 2021, resulting in 538 Canadian cases and 217 U.S. cases. In contrast to legal literature focusing on complex and correlated characteristics of the employment relationship, our statistical analyses of the data show very strong correlations between the worker's status and a small subset of quantifiable characteristics of the employment relationship. In fact, despite the variety of situations in the case law, we show that simple, off-the-shelf AI models classify the cases with an out-of-sample accuracy of more than 90%. Interestingly, the analysis of misclassified cases reveals consistent misclassification patterns by most algorithms. Legal analyses of these cases led us to identify how equity is ensured by judges in ambiguous situations. Finally, our findings have practical implications for access to legal advice and justice. We deployed our AI model via the open-access platform, https://MyOpenCourt.org/, to help users answer employment legal questions. This platform has already assisted many Canadian users, and we hope it will help democratize access to legal advice to large crowds.</p>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":" ","pages":"1-30"},"PeriodicalIF":3.1,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10061388/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9742579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A formalization of the Protagoras court paradox in a temporal logic of epistemic and normative reasons 普罗泰哥拉宫廷悖论在认知和规范原因的时间逻辑中的形式化
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-03-16 DOI: 10.1007/s10506-023-09351-0
Meghdad Ghari

We combine linear temporal logic (with both past and future modalities) with a deontic version of justification logic to provide a framework for reasoning about time and epistemic and normative reasons. In addition to temporal modalities, the resulting logic contains two kinds of justification assertions: epistemic justification assertions and deontic justification assertions. The former presents justification for the agent’s knowledge and the latter gives reasons for why a proposition is obligatory. We present two kinds of semantics for the logic: one based on Fitting models and the other based on neighborhood models. The use of neighborhood semantics enables us to define the dual of deontic justification assertions properly, which corresponds to a notion of permission in deontic logic. We then establish the soundness and completeness of an axiom system of the logic with respect to these semantics. Further, we formalize the Protagoras versus Euathlus paradox in this logic and present a precise analysis of the paradox, and also briefly discuss Leibniz’s solution.

我们将线性时间逻辑(具有过去和未来两种模态)与正当理由逻辑的一个deontic版本结合起来,提供了一个关于时间、认识论和规范论理由的推理框架。除了时间模态,由此产生的逻辑还包含两种理由断言:认识论理由断言和行为论理由断言。前者提出了代理知识的理由,后者给出了命题具有强制性的原因。我们为逻辑提出了两种语义:一种基于拟合模型,另一种基于邻域模型。邻域语义的使用使我们能够正确地定义deontic正当性断言的对偶,这相当于deontic逻辑中的许可概念。然后,我们根据这些语义建立了逻辑公理系统的健全性和完备性。此外,我们还将普罗塔戈拉斯与欧阿特拉斯悖论形式化到这一逻辑中,并提出了对这一悖论的精确分析,还简要讨论了莱布尼茨的解决方案。
{"title":"A formalization of the Protagoras court paradox in a temporal logic of epistemic and normative reasons","authors":"Meghdad Ghari","doi":"10.1007/s10506-023-09351-0","DOIUrl":"10.1007/s10506-023-09351-0","url":null,"abstract":"<div><p>We combine linear temporal logic (with both past and future modalities) with a deontic version of justification logic to provide a framework for reasoning about time and epistemic and normative reasons. In addition to temporal modalities, the resulting logic contains two kinds of justification assertions: epistemic justification assertions and deontic justification assertions. The former presents justification for the agent’s knowledge and the latter gives reasons for why a proposition is obligatory. We present two kinds of semantics for the logic: one based on Fitting models and the other based on neighborhood models. The use of neighborhood semantics enables us to define the dual of deontic justification assertions properly, which corresponds to a notion of permission in deontic logic. We then establish the soundness and completeness of an axiom system of the logic with respect to these semantics. Further, we formalize the Protagoras versus Euathlus paradox in this logic and present a precise analysis of the paradox, and also briefly discuss Leibniz’s solution.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 2","pages":"325 - 367"},"PeriodicalIF":3.1,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44602578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting inmates misconduct using the SHAP approach 使用SHAP方法预测囚犯的不当行为
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-03-15 DOI: 10.1007/s10506-023-09352-z
Fábio M. Oliveira, Marcelo S. Balbino, Luis E. Zarate, Fawn Ngo, Ramakrishna Govindu, Anurag Agarwal, Cristiane N. Nobre

Internal misconduct is a universal problem in prisons and affects the maintenance of social order. Consequently, correctional institutions often develop rehabilitation programs to reduce the likelihood of inmates committing internal offenses and criminal recidivism after release. Therefore, it is necessary to identify the profile of each offender, both for the appropriate indication of a rehabilitation program and the level of internal security to which he must be submitted. In this context, this work aims to discover the most significant characteristics in predicting inmate misconduct from ML methods and the SHAP approach. A database produced in 2004 through the Survey of Inmates in State and Federal Correctional Facilities in the United States of America was used, which provides nationally representative data on prisoners from state and federal facilities. The predictive model based on Random Forest performed the best, thus, we applied the SHAP to it. Overall, the results showed that features related to victimization, type of crime committed, age and age at first arrest, history of association with criminal groups, education, and drug and alcohol use are most relevant in predicting internal misconduct. Thus, it is expected to contribute to the prior classification of an inmate on time, to use programs and practices that aim to improve the lives of offenders, their reintegration into society, and consequently, the reduction of criminal recidivism.

内部不当行为是监狱中普遍存在的问题,影响着社会秩序的维护。因此,惩教机构通常会制定改造计划,以减少囚犯内部犯罪和出狱后重新犯罪的可能性。因此,有必要确定每名罪犯的特征,以便适当指明改造计划和他必须服从的内部安全级别。在此背景下,这项工作旨在从多重参照方法和 SHAP 方法中发现预测囚犯不当行为的最重要特征。研究使用了 2004 年通过 "美国州立和联邦惩教机构囚犯调查 "建立的数据库,该数据库提供了具有全国代表性的州立和联邦惩教机构囚犯数据。基于随机森林的预测模型表现最佳,因此我们将 SHAP 应用于该模型。总体而言,结果表明,与受害情况、所犯罪行类型、年龄和首次被捕年龄、与犯罪团伙的关联史、教育程度以及吸毒和酗酒情况有关的特征与预测内部不当行为最为相关。因此,预计这将有助于按时对囚犯进行事先分类,使用旨在改善罪犯生活、使其重新融入社会的方案和做法,从而减少刑事累犯。
{"title":"Predicting inmates misconduct using the SHAP approach","authors":"Fábio M. Oliveira,&nbsp;Marcelo S. Balbino,&nbsp;Luis E. Zarate,&nbsp;Fawn Ngo,&nbsp;Ramakrishna Govindu,&nbsp;Anurag Agarwal,&nbsp;Cristiane N. Nobre","doi":"10.1007/s10506-023-09352-z","DOIUrl":"10.1007/s10506-023-09352-z","url":null,"abstract":"<div><p>Internal misconduct is a universal problem in prisons and affects the maintenance of social order. Consequently, correctional institutions often develop rehabilitation programs to reduce the likelihood of inmates committing internal offenses and criminal recidivism after release. Therefore, it is necessary to identify the profile of each offender, both for the appropriate indication of a rehabilitation program and the level of internal security to which he must be submitted. In this context, this work aims to discover the most significant characteristics in predicting inmate misconduct from ML methods and the SHAP approach. A database produced in 2004 through the Survey of Inmates in State and Federal Correctional Facilities in the United States of America was used, which provides nationally representative data on prisoners from state and federal facilities. The predictive model based on Random Forest performed the best, thus, we applied the SHAP to it. Overall, the results showed that features related to victimization, type of crime committed, age and age at first arrest, history of association with criminal groups, education, and drug and alcohol use are most relevant in predicting internal misconduct. Thus, it is expected to contribute to the prior classification of an inmate on time, to use programs and practices that aim to improve the lives of offenders, their reintegration into society, and consequently, the reduction of criminal recidivism.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 2","pages":"369 - 395"},"PeriodicalIF":3.1,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43644754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic matching based legal information retrieval system for COVID-19 pandemic 基于语义匹配的新冠肺炎疫情法律信息检索系统。
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-03-14 DOI: 10.1007/s10506-023-09354-x
Junlin Zhu, Jiaye Wu, Xudong Luo, Jie Liu

Recently, the pandemic caused by COVID-19 is severe in the entire world. The prevention and control of crimes associated with COVID-19 are critical for controlling the pandemic. Therefore, to provide efficient and convenient intelligent legal knowledge services during the pandemic, we develop an intelligent system for legal information retrieval on the WeChat platform in this paper. The data source we used for training our system is “The typical cases of national procuratorial authorities handling crimes against the prevention and control of the new coronary pneumonia pandemic following the law”, which is published online by the Supreme People’s Procuratorate of the People’s Republic of China. We base our system on convolutional neural network and use the semantic matching mechanism to capture inter-sentence relationship information and make a prediction. Moreover, we introduce an auxiliary learning process to help the network better distinguish the relation between two sentences. Finally, the system uses the trained model to identify the information entered by a user and responds to the user with a reference case similar to the query case and gives the reference legal gist applicable to the query case.

最近,新冠肺炎引起的疫情在全世界都很严重。预防和控制与新冠肺炎有关的犯罪对于控制这一流行病至关重要。因此,为了在疫情期间提供高效便捷的智能法律知识服务,本文开发了一个在微信平台上进行法律信息检索的智能系统。我们用于培训系统的数据源是中华人民共和国最高人民检察院在线发布的“全国检察机关依法处理危害新型冠状病毒肺炎疫情防控犯罪的典型案例”。我们的系统基于卷积神经网络,并使用语义匹配机制来捕获句间关系信息并进行预测。此外,我们引入了一个辅助学习过程来帮助网络更好地区分两句之间的关系。最后,该系统使用经过训练的模型来识别用户输入的信息,并用与查询案例类似的参考案例来响应用户,并给出适用于查询案例的参考法律依据。
{"title":"Semantic matching based legal information retrieval system for COVID-19 pandemic","authors":"Junlin Zhu,&nbsp;Jiaye Wu,&nbsp;Xudong Luo,&nbsp;Jie Liu","doi":"10.1007/s10506-023-09354-x","DOIUrl":"10.1007/s10506-023-09354-x","url":null,"abstract":"<div><p>Recently, the pandemic caused by COVID-19 is severe in the entire world. The prevention and control of crimes associated with COVID-19 are critical for controlling the pandemic. Therefore, to provide efficient and convenient intelligent legal knowledge services during the pandemic, we develop an intelligent system for legal information retrieval on the WeChat platform in this paper. The data source we used for training our system is “The typical cases of national procuratorial authorities handling crimes against the prevention and control of the new coronary pneumonia pandemic following the law”, which is published online by the Supreme People’s Procuratorate of the People’s Republic of China. We base our system on convolutional neural network and use the semantic matching mechanism to capture inter-sentence relationship information and make a prediction. Moreover, we introduce an auxiliary learning process to help the network better distinguish the relation between two sentences. Finally, the system uses the trained model to identify the information entered by a user and responds to the user with a reference case similar to the query case and gives the reference legal gist applicable to the query case.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 2","pages":"397 - 426"},"PeriodicalIF":3.1,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10074769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble methods for improving extractive summarization of legal case judgements 改进法律案件判决摘要提取的集成方法
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-03-04 DOI: 10.1007/s10506-023-09349-8
Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh

Summarization of legal case judgement documents is a practical and challenging problem, for which many summarization algorithms of different varieties have been tried. In this work, rather than developing yet another summarization algorithm, we investigate if intelligently ensembling (combining) the outputs of multiple (base) summarization algorithms can lead to better summaries of legal case judgements than any of the base algorithms. Using two datasets of case judgement documents from the Indian Supreme Court, one with extractive gold standard summaries and the other with abstractive gold standard summaries, we apply various ensembling techniques on summaries generated by a wide variety of summarization algorithms. The ensembling methods applied range from simple voting-based methods to ranking-based and graph-based ensembling methods. We show that many of our ensembling methods yield summaries that are better than the summaries produced by any of the individual base algorithms, in terms of ROUGE and METEOR scores.

法律案件判决文件的摘要是一个具有挑战性的实际问题,为此人们尝试了许多不同种类的摘要算法。在这项工作中,我们没有开发另一种摘要算法,而是研究智能地组合(结合)多种(基础)摘要算法的输出是否能比任何一种基础算法获得更好的法律案件判决摘要。我们使用印度最高法院的两个案件判决文件数据集(一个是提取型黄金标准摘要,另一个是抽象型黄金标准摘要),对各种摘要算法生成的摘要应用了各种集合技术。所应用的集合方法包括基于投票的简单方法、基于排序的集合方法和基于图的集合方法。我们的研究表明,就 ROUGE 和 METEOR 分数而言,我们的许多合集方法生成的摘要都优于任何单个基础算法生成的摘要。
{"title":"Ensemble methods for improving extractive summarization of legal case judgements","authors":"Aniket Deroy,&nbsp;Kripabandhu Ghosh,&nbsp;Saptarshi Ghosh","doi":"10.1007/s10506-023-09349-8","DOIUrl":"10.1007/s10506-023-09349-8","url":null,"abstract":"<div><p>Summarization of legal case judgement documents is a practical and challenging problem, for which many summarization algorithms of different varieties have been tried. In this work, rather than developing yet another summarization algorithm, we investigate if intelligently ensembling (combining) the outputs of multiple (base) summarization algorithms can lead to better summaries of legal case judgements than any of the base algorithms. Using two datasets of case judgement documents from the Indian Supreme Court, one with extractive gold standard summaries and the other with abstractive gold standard summaries, we apply various ensembling techniques on summaries generated by a wide variety of summarization algorithms. The ensembling methods applied range from simple voting-based methods to ranking-based and graph-based ensembling methods. We show that many of our ensembling methods yield summaries that are better than the summaries produced by any of the individual base algorithms, in terms of ROUGE and METEOR scores.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 1","pages":"231 - 289"},"PeriodicalIF":3.1,"publicationDate":"2023-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46658483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Going beyond the “common suspects”: to be presumed innocent in the era of algorithms, big data and artificial intelligence 超越“常见嫌疑人”:在算法、大数据和人工智能时代被推定无罪
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-02-22 DOI: 10.1007/s10506-023-09347-w
Athina Sachoulidou
{"title":"Going beyond the “common suspects”: to be presumed innocent in the era of algorithms, big data and artificial intelligence","authors":"Athina Sachoulidou","doi":"10.1007/s10506-023-09347-w","DOIUrl":"https://doi.org/10.1007/s10506-023-09347-w","url":null,"abstract":"","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":" ","pages":""},"PeriodicalIF":4.1,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48818700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joining metadata and textual features to advise administrative courts decisions: a cascading classifier approach 结合元数据和文本特征为行政法院裁决提供建议:级联分类器方法
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-02-18 DOI: 10.1007/s10506-023-09348-9
Hugo Mentzingen, Nuno Antonio, Victor Lobo

Decisions of regulatory government bodies and courts affect many aspects of citizens’ lives. These organizations and courts are expected to provide timely and coherent decisions, although they struggle to keep up with the increasing demand. The ability of machine learning (ML) models to predict such decisions based on past cases under similar circumstances was assessed in some recent works. The dominant conclusion is that the prediction goal is achievable with high accuracy. Nevertheless, most of those works do not consider important aspects for ML models that can impact performance and affect real-world usefulness, such as consistency, out-of-sample applicability, generality, and explainability preservation. To our knowledge, none considered all those aspects, and no previous study addressed the joint use of metadata and text-extracted variables to predict administrative decisions. We propose a predictive model that addresses the abovementioned concerns based on a two-stage cascade classifier. The model employs a first-stage prediction based on textual features extracted from the original documents and a second-stage classifier that includes proceedings’ metadata. The study was conducted using time-based cross-validation, built on data available before the predicted judgment. It provides predictions as soon as the decision date is scheduled and only considers the first document in each proceeding, along with the metadata recorded when the infringement is first registered. Finally, the proposed model provides local explainability by preserving visibility on the textual features and employing the SHapley Additive exPlanations (SHAP). Our findings suggest that this cascade approach surpasses the standalone stages and achieves relatively high Precision and Recall when both text and metadata are available while preserving real-world usefulness. With a weighted F1 score of 0.900, the results outperform the text-only baseline by 1.24% and the metadata-only baseline by 5.63%, with better discriminative properties evaluated by the receiver operating characteristic and precision-recall curves.

政府监管机构和法院的决定影响着公民生活的许多方面。人们期望这些机构和法院及时做出一致的决定,但它们却难以满足日益增长的需求。最近的一些著作评估了机器学习(ML)模型在类似情况下根据以往案例预测此类决定的能力。主要结论是,预测目标是可以实现的,而且准确率很高。然而,这些研究大多没有考虑到 ML 模型的一些重要方面,如一致性、样本外适用性、通用性和可解释性保护等,这些方面可能会影响模型的性能并影响其在现实世界中的实用性。据我们所知,没有一项研究考虑到了所有这些方面,而且以前也没有研究探讨过如何联合使用元数据和文本提取变量来预测行政决策。我们提出了一个基于两级级联分类器的预测模型来解决上述问题。该模型的第一阶段预测基于从原始文件中提取的文本特征,第二阶段分类器则包括诉讼程序的元数据。研究采用基于时间的交叉验证,建立在预测判决之前的可用数据上。该模型在判决日期确定后立即提供预测,并且只考虑每个诉讼程序中的第一份文件以及侵权首次登记时记录的元数据。最后,所提议的模型通过保留文本特征的可见性和使用 SHapley Additive exPlanations(SHAP)提供了局部可解释性。我们的研究结果表明,当文本和元数据都可用时,这种级联方法超越了独立阶段,并实现了相对较高的精确度和召回率,同时保留了现实世界中的实用性。加权 F1 得分为 0.900,结果比纯文本基线高出 1.24%,比纯元数据基线高出 5.63%,并通过接收者操作特征和精确率-召回曲线评估了更好的判别特性。
{"title":"Joining metadata and textual features to advise administrative courts decisions: a cascading classifier approach","authors":"Hugo Mentzingen,&nbsp;Nuno Antonio,&nbsp;Victor Lobo","doi":"10.1007/s10506-023-09348-9","DOIUrl":"10.1007/s10506-023-09348-9","url":null,"abstract":"<div><p>Decisions of regulatory government bodies and courts affect many aspects of citizens’ lives. These organizations and courts are expected to provide timely and coherent decisions, although they struggle to keep up with the increasing demand. The ability of machine learning (ML) models to predict such decisions based on past cases under similar circumstances was assessed in some recent works. The dominant conclusion is that the prediction goal is achievable with high accuracy. Nevertheless, most of those works do not consider important aspects for ML models that can impact performance and affect real-world usefulness, such as consistency, out-of-sample applicability, generality, and explainability preservation. To our knowledge, none considered all those aspects, and no previous study addressed the joint use of metadata and text-extracted variables to predict administrative decisions. We propose a predictive model that addresses the abovementioned concerns based on a two-stage cascade classifier. The model employs a first-stage prediction based on textual features extracted from the original documents and a second-stage classifier that includes proceedings’ metadata. The study was conducted using time-based cross-validation, built on data available before the predicted judgment. It provides predictions as soon as the decision date is scheduled and only considers the first document in each proceeding, along with the metadata recorded when the infringement is first registered. Finally, the proposed model provides local explainability by preserving visibility on the textual features and employing the SHapley Additive exPlanations (SHAP). Our findings suggest that this cascade approach surpasses the standalone stages and achieves relatively high Precision and Recall when both text and metadata are available while preserving real-world usefulness. With a weighted F1 score of 0.900, the results outperform the text-only baseline by 1.24% and the metadata-only baseline by 5.63%, with better discriminative properties evaluated by the receiver operating characteristic and precision-recall curves.</p></div>","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 1","pages":"201 - 230"},"PeriodicalIF":3.1,"publicationDate":"2023-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10506-023-09348-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45108787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: Using attention methods to predict judicial outcomes 更正:使用注意力方法预测司法结果
IF 3.1 2区 社会学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-02-09 DOI: 10.1007/s10506-023-09346-x
Vithor Gomes Ferreira Bertalan, Evandro Eduardo Seron Ruiz
{"title":"Correction: Using attention methods to predict judicial outcomes","authors":"Vithor Gomes Ferreira Bertalan,&nbsp;Evandro Eduardo Seron Ruiz","doi":"10.1007/s10506-023-09346-x","DOIUrl":"10.1007/s10506-023-09346-x","url":null,"abstract":"","PeriodicalId":51336,"journal":{"name":"Artificial Intelligence and Law","volume":"32 1","pages":"291 - 291"},"PeriodicalIF":3.1,"publicationDate":"2023-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49128770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence and Law
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1