Method to Predict Confidential Words in Japanese Judicial Precedents Using Neural Networks With Part-of-Speech Tags

IF 0.9 Q4 TELECOMMUNICATIONS Infocommunications Journal Pub Date : 2020-01-01 DOI:10.36244/icj.2020.1.3
Masakazu Kanazawa, Atsushi Ito, Kazuyuki Yamasawa, Takehiko Kasahara, Yuya Kiryu, Fubito Toyama
{"title":"Method to Predict Confidential Words in Japanese Judicial Precedents Using Neural Networks With Part-of-Speech Tags","authors":"Masakazu Kanazawa, Atsushi Ito, Kazuyuki Yamasawa, Takehiko Kasahara, Yuya Kiryu, Fubito Toyama","doi":"10.36244/icj.2020.1.3","DOIUrl":null,"url":null,"abstract":"Abstract—Cognitive Infocommunications involve a combination of informatics and telecommunications. In the future, infocommunication is expected to become more intelligent and life supportive. Privacy is one of the most critical concerns in infocommunications. Encryption is a well-recognized technology that ensures privacy; however, it is not easy to completely hide personal information. One technique to protect privacy is by finding confidential words in a file or a website and changing them into meaningless words. In this paper, we investigate a technology used to hide confidential words taken from judicial precedents. In the Japanese judicial field, details of most precedents are not made available to the public on the Japanese court web pages to protect the persons involved. To ensure privacy, confidential words, such as personal names, are replaced by other meaningless words. This operation takes time and effort because it is done manually. Therefore, it is desirable to automatically predict confidential words. We proposed a method for predicting confidential words in Japanese judicial precedents by using part-of-speech (POS) tagging with neural networks. As a result, we obtained 88% accuracy improvement over a previous model. In this paper, we describe the mechanism of our proposed model and the prediction results using perplexity. Then, we evaluated how our proposed model was useful for the actual precedents by using recall and precision. As a result, our proposed model could detect confidential words in certain Japanese precedents.","PeriodicalId":42504,"journal":{"name":"Infocommunications Journal","volume":"7 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infocommunications Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36244/icj.2020.1.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 2

Abstract

Abstract—Cognitive Infocommunications involve a combination of informatics and telecommunications. In the future, infocommunication is expected to become more intelligent and life supportive. Privacy is one of the most critical concerns in infocommunications. Encryption is a well-recognized technology that ensures privacy; however, it is not easy to completely hide personal information. One technique to protect privacy is by finding confidential words in a file or a website and changing them into meaningless words. In this paper, we investigate a technology used to hide confidential words taken from judicial precedents. In the Japanese judicial field, details of most precedents are not made available to the public on the Japanese court web pages to protect the persons involved. To ensure privacy, confidential words, such as personal names, are replaced by other meaningless words. This operation takes time and effort because it is done manually. Therefore, it is desirable to automatically predict confidential words. We proposed a method for predicting confidential words in Japanese judicial precedents by using part-of-speech (POS) tagging with neural networks. As a result, we obtained 88% accuracy improvement over a previous model. In this paper, we describe the mechanism of our proposed model and the prediction results using perplexity. Then, we evaluated how our proposed model was useful for the actual precedents by using recall and precision. As a result, our proposed model could detect confidential words in certain Japanese precedents.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于词性标签的神经网络预测日本司法判例中机密词的方法
摘要-认知信息通信涉及信息学和电信的结合。在未来,信息通信有望变得更加智能化和生活支持性。隐私是信息通信中最重要的问题之一。加密是一种公认的确保隐私的技术;然而,完全隐藏个人信息并不容易。保护隐私的一种方法是在文件或网站中找到机密词,并将其转换为无意义的词。在本文中,我们研究了一种用于隐藏司法判例中的机密词的技术。在日本司法领域,为了保护当事人,大多数判例的细节都没有在日本法院的网页上向公众公布。为了保证隐私,机密词(如人名)会被其他没有意义的词所取代。由于该操作是手动完成的,因此需要花费时间和精力。因此,需要自动预测机密词。本文提出了一种基于神经网络的词性标注的日语司法判例机密词预测方法。结果,我们比以前的模型获得了88%的准确性提高。在本文中,我们描述了我们所提出的模型的机制和使用perplexity的预测结果。然后,我们通过召回率和精确率来评估我们提出的模型对实际案例的有用性。结果表明,我们提出的模型可以检测到某些日本判例中的机密词。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Infocommunications Journal
Infocommunications Journal TELECOMMUNICATIONS-
CiteScore
1.90
自引率
27.30%
发文量
0
期刊最新文献
Evolution of Digitization toward the Internet of Digital & Cognitive Realities and Smart Ecosystems On the Convex Hull of the Achievable Capacity Region of the Two User FDM OMA Downlink A game theoretic framework for controlling the behavior of a content seeking to be popular on social networking sites In-network DDoS detection and mitigation using INT data for IoT ecosystem Optimizing the Performance of the Iptables Stateful NAT44 Solution
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1