{"title":"Modified Hierarchical-Attention Network model for legal judgment predictions","authors":"G. Sukanya , J. Priyadarshini","doi":"10.1016/j.datak.2023.102203","DOIUrl":null,"url":null,"abstract":"<div><p>The impact of Artificial Intelligence in Legal Research has reached a high level in simulating human thought processes. Case Pendency is a long-lasting problem in many countries. The judicial system has to be more competent and reliable to provide justice on time for any developing country. Litigants and attorneys devote more time and effort to trial case preparation in the courtroom. The task of decision prediction is to automatically forecast the type of charge, law article, and term of punishment. Most of the earlier works for verdict prediction focused to work on civil law jurisdictions. Some of the challenges in the task are case facts are highly unstructured lengthy documents with a lack of annotations and mainly used machine learning techniques. While most research works ignore the information loss at the encoding stage, our proposed MHAN overcomes the above issue and long-range dependency problem using the attention model over hierarchical encoders with three tiers namely Sentence encoder, word encoder, and character encoder. To avoid information loss, a brand-new judgment prediction framework called MHAN is developed in this study effort. It is built on a modified Hierarchical-Attention network and a specially designed domain-specific word embedding model. Additionally, it emphasizes the feature extraction phase by joining features obtained using MHAN with an improved cosine similarity feature. Finally, a hybrid Self Improved RNN is employed to provide the projected results. Furthermore, the proposed model is trained on 10 types of real-time criminal cases from the Madras High Court of India and Supreme Court of India. It has outperformed prior methods in terms of verdict prediction. By applying different variations of the deep learning model and ablation tests, the proposed model achieves consistent results over baseline models.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data & Knowledge Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169023X23000630","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 1
Abstract
The impact of Artificial Intelligence in Legal Research has reached a high level in simulating human thought processes. Case Pendency is a long-lasting problem in many countries. The judicial system has to be more competent and reliable to provide justice on time for any developing country. Litigants and attorneys devote more time and effort to trial case preparation in the courtroom. The task of decision prediction is to automatically forecast the type of charge, law article, and term of punishment. Most of the earlier works for verdict prediction focused to work on civil law jurisdictions. Some of the challenges in the task are case facts are highly unstructured lengthy documents with a lack of annotations and mainly used machine learning techniques. While most research works ignore the information loss at the encoding stage, our proposed MHAN overcomes the above issue and long-range dependency problem using the attention model over hierarchical encoders with three tiers namely Sentence encoder, word encoder, and character encoder. To avoid information loss, a brand-new judgment prediction framework called MHAN is developed in this study effort. It is built on a modified Hierarchical-Attention network and a specially designed domain-specific word embedding model. Additionally, it emphasizes the feature extraction phase by joining features obtained using MHAN with an improved cosine similarity feature. Finally, a hybrid Self Improved RNN is employed to provide the projected results. Furthermore, the proposed model is trained on 10 types of real-time criminal cases from the Madras High Court of India and Supreme Court of India. It has outperformed prior methods in terms of verdict prediction. By applying different variations of the deep learning model and ablation tests, the proposed model achieves consistent results over baseline models.
期刊介绍:
Data & Knowledge Engineering (DKE) stimulates the exchange of ideas and interaction between these two related fields of interest. DKE reaches a world-wide audience of researchers, designers, managers and users. The major aim of the journal is to identify, investigate and analyze the underlying principles in the design and effective use of these systems.