Modified Hierarchical-Attention Network model for legal judgment predictions

IF 2.7 3区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Data & Knowledge Engineering Pub Date : 2023-09-01 DOI:10.1016/j.datak.2023.102203

G. Sukanya , J. Priyadarshini

{"title":"Modified Hierarchical-Attention Network model for legal judgment predictions","authors":"G. Sukanya , J. Priyadarshini","doi":"10.1016/j.datak.2023.102203","DOIUrl":null,"url":null,"abstract":"<div><p>The impact of Artificial Intelligence in Legal Research has reached a high level in simulating human thought processes. Case Pendency is a long-lasting problem in many countries. The judicial system has to be more competent and reliable to provide justice on time for any developing country. Litigants and attorneys devote more time and effort to trial case preparation in the courtroom. The task of decision prediction is to automatically forecast the type of charge, law article, and term of punishment. Most of the earlier works for verdict prediction focused to work on civil law jurisdictions. Some of the challenges in the task are case facts are highly unstructured lengthy documents with a lack of annotations and mainly used machine learning techniques. While most research works ignore the information loss at the encoding stage, our proposed MHAN overcomes the above issue and long-range dependency problem using the attention model over hierarchical encoders with three tiers namely Sentence encoder, word encoder, and character encoder. To avoid information loss, a brand-new judgment prediction framework called MHAN is developed in this study effort. It is built on a modified Hierarchical-Attention network and a specially designed domain-specific word embedding model. Additionally, it emphasizes the feature extraction phase by joining features obtained using MHAN with an improved cosine similarity feature. Finally, a hybrid Self Improved RNN is employed to provide the projected results. Furthermore, the proposed model is trained on 10 types of real-time criminal cases from the Madras High Court of India and Supreme Court of India. It has outperformed prior methods in terms of verdict prediction. By applying different variations of the deep learning model and ablation tests, the proposed model achieves consistent results over baseline models.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"147 ","pages":"Article 102203"},"PeriodicalIF":2.7000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data & Knowledge Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169023X23000630","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 1

Abstract

The impact of Artificial Intelligence in Legal Research has reached a high level in simulating human thought processes. Case Pendency is a long-lasting problem in many countries. The judicial system has to be more competent and reliable to provide justice on time for any developing country. Litigants and attorneys devote more time and effort to trial case preparation in the courtroom. The task of decision prediction is to automatically forecast the type of charge, law article, and term of punishment. Most of the earlier works for verdict prediction focused to work on civil law jurisdictions. Some of the challenges in the task are case facts are highly unstructured lengthy documents with a lack of annotations and mainly used machine learning techniques. While most research works ignore the information loss at the encoding stage, our proposed MHAN overcomes the above issue and long-range dependency problem using the attention model over hierarchical encoders with three tiers namely Sentence encoder, word encoder, and character encoder. To avoid information loss, a brand-new judgment prediction framework called MHAN is developed in this study effort. It is built on a modified Hierarchical-Attention network and a specially designed domain-specific word embedding model. Additionally, it emphasizes the feature extraction phase by joining features obtained using MHAN with an improved cosine similarity feature. Finally, a hybrid Self Improved RNN is employed to provide the projected results. Furthermore, the proposed model is trained on 10 types of real-time criminal cases from the Madras High Court of India and Supreme Court of India. It has outperformed prior methods in terms of verdict prediction. By applying different variations of the deep learning model and ablation tests, the proposed model achieves consistent results over baseline models.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

法律判决预测的改进层次注意网络模型

人工智能在法律研究中的影响在模拟人类思维过程方面达到了很高的水平。未决案件在许多国家是一个长期存在的问题。司法系统必须更加胜任和可靠，以便及时为任何发展中国家伸张正义。诉讼人和律师在法庭上投入更多的时间和精力准备案件。决策预测的任务是自动预测指控类型、法律条款和处罚期限。早期的判决预测工作大多集中在民法管辖权方面。该任务中的一些挑战是，案例事实是高度非结构化的冗长文档，缺乏注释，主要使用机器学习技术。虽然大多数研究工作忽略了编码阶段的信息损失，但我们提出的MHAN使用三层（即句子编码器、单词编码器和字符编码器）的层次编码器上的注意力模型克服了上述问题和长程依赖性问题。为了避免信息丢失，本研究开发了一个全新的判断预测框架MHAN。它建立在一个改进的层次注意力网络和一个专门设计的领域特定单词嵌入模型之上。此外，它通过将使用MHAN获得的特征与改进的余弦相似性特征相结合来强调特征提取阶段。最后，采用混合自改进RNN来提供预测结果。此外，该模型还针对印度马德拉斯高等法院和印度最高法院的10种实时刑事案件进行了培训。在判决预测方面，它已经超过了以前的方法。通过应用深度学习模型和消融测试的不同变体，所提出的模型实现了与基线模型一致的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Data & Knowledge Engineering 工程技术-计算机：人工智能

CiteScore

5.00

自引率

0.00%

发文量

审稿时长

6 months

期刊介绍： Data & Knowledge Engineering (DKE) stimulates the exchange of ideas and interaction between these two related fields of interest. DKE reaches a world-wide audience of researchers, designers, managers and users. The major aim of the journal is to identify, investigate and analyze the underlying principles in the design and effective use of these systems.

期刊最新文献

When temporary results meet intermediate index: An optimization technique of procedural SQL query processing Heterogeneity in entity matching: A survey and experimental analysis Editorial Board Rankingdom: A cooperative architecture for the on-demand analysis of Wikidata A large-scale multi-disciplinary analysis of uncertainty in research articles