C. Zhuang, Yemao Zhou, Jidong Ge, Zhongjin Li, Chuanyi Li, Xiaoyu Zhou, B. Luo
{"title":"Information Extraction from Chinese Judgment Documents","authors":"C. Zhuang, Yemao Zhou, Jidong Ge, Zhongjin Li, Chuanyi Li, Xiaoyu Zhou, B. Luo","doi":"10.1109/WISA.2017.67","DOIUrl":null,"url":null,"abstract":"Judgment documents contain a wealth of valuable information. The original judgment documents are written in pure text format, so we cannot obtain information directly, which hinders the study of the judgment documents. We propose an approach to parse Chinese judgment documents into structured documents to solve this problem. Divide a judgment document into logical segments, and then extract and label information items from these logical segments. Use information items to build analytic document information model and the model is output into a structured XML document.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th Web Information Systems and Applications Conference (WISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2017.67","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Judgment documents contain a wealth of valuable information. The original judgment documents are written in pure text format, so we cannot obtain information directly, which hinders the study of the judgment documents. We propose an approach to parse Chinese judgment documents into structured documents to solve this problem. Divide a judgment document into logical segments, and then extract and label information items from these logical segments. Use information items to build analytic document information model and the model is output into a structured XML document.