Chinese Questions Classification in the Law Domain
Guangyi Xiao, Even Chow, Hao Chen, Jiqian Mo, J. Guo, Zhiguo Gong
2017 IEEE 14th International Conference on e-Business Engineering (ICEBE), 2017-11-01
DOI: 10.1109/ICEBE.2017.41
Citation count: 5
Abstract
Question classification is an essential part of a question answering (QA) system. This paper presents our work on automatic question classification, using a sample set of questions collected from a legal forum. We propose a taxonomy for legal questions that divides them into three main categories, following the Chinese legal system: civil, criminal, and administrative. We experimented with four machine learning algorithms, Nearest Neighbors (NN), Naïve Bayes (NB), Logistic Regression (LR), and Support Vector Machines (SVM), using two kinds of features: TF-IDF and word2vec embeddings. We also used fastText and tuned its parameters to obtain better results. The experiments show high accuracy for Chinese question classification in the law domain. To the best of our knowledge, this work is the first attempt in this promising domain.
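To make the TF-IDF plus Nearest Neighbors combination concrete, here is a minimal sketch using only the Python standard library. It is not the authors' implementation. The toy questions, the function names, the whitespace tokenizer, the smoothed IDF formula, and the choice of a single nearest neighbor under cosine similarity are all assumptions made for illustration. Chinese text would first need a word segmenter in place of the whitespace split.

```python
import math
from collections import Counter

# Toy, made-up questions for illustration only; not from the paper's data set.
TRAIN = [
    ("how do i divide property after a divorce", "civil"),
    ("my landlord refuses to return the rental deposit", "civil"),
    ("what is the sentence for theft", "criminal"),
    ("can a robbery suspect be released on bail", "criminal"),
    ("how do i appeal a traffic fine from the police", "administrative"),
    ("the city bureau revoked my business license", "administrative"),
]


def tokenize(text):
    # Assumption: whitespace tokens. Chinese needs a word segmenter here.
    return text.lower().split()


def fit_idf(docs):
    # Smoothed inverse document frequency: log((1 + n) / (1 + df)) + 1.
    n = len(docs)
    df = Counter(tok for doc in docs for tok in set(doc))
    return {tok: math.log((1 + n) / (1 + c)) + 1.0 for tok, c in df.items()}


def tfidf(tokens, idf):
    # Sparse, L2-normalized TF-IDF vector; out-of-vocabulary tokens are dropped.
    tf = Counter(t for t in tokens if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else {}


def cosine(a, b):
    # Both vectors are unit length, so the dot product is the cosine similarity.
    if len(a) > len(b):
        a, b = b, a
    return sum(v * b.get(t, 0.0) for t, v in a.items())


def train(examples):
    docs = [tokenize(question) for question, _ in examples]
    idf = fit_idf(docs)
    vectors = [(tfidf(doc, idf), label) for doc, (_, label) in zip(docs, examples)]
    return idf, vectors


def classify(question, model):
    # 1-nearest-neighbor: return the label of the most similar training question.
    idf, vectors = model
    query = tfidf(tokenize(question), idf)
    best_vector, best_label = max(vectors, key=lambda pair: cosine(query, pair[0]))
    return best_label


model = train(TRAIN)
print(classify("what is the sentence for robbery", model))
```

On this toy set the query shares most of its weighted terms with the theft question, so it is labeled criminal. The other classifiers named in the abstract would consume the same TF-IDF vectors and differ only in the decision rule.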