Guangyi Xiao, Even Chow, Hao Chen, Jiqian Mo, J. Guo, Zhiguo Gong
{"title":"法律领域的汉语问题分类","authors":"Guangyi Xiao, Even Chow, Hao Chen, Jiqian Mo, J. Guo, Zhiguo Gong","doi":"10.1109/ICEBE.2017.41","DOIUrl":null,"url":null,"abstract":"Question classification is an essential part of Question Answering system(QA). This paper introduces our research work on automatic question classification that depends on the sample set including questions from legal forum. We propose a taxonomy for law question, and divide questions into three main parts: civil, criminal and administrative according to Chinese legal system. We have experimented with four machine learning algorithms: Nearest Neighbors (NN), Naïve Bayes (NB), Logistic Regression (LR) and Support Vector Machines (SVM) using two kinds of features: TF-IDF and word2vec embeddings. Further, we used fastText and adjusted the parameters to get the better results. The research shows high accuracy in Chinese question classification in law domain. Moreover, to the best of our knowledge, our work is the first attempt in this promising domain.","PeriodicalId":347774,"journal":{"name":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Chinese Questions Classification in the Law Domain\",\"authors\":\"Guangyi Xiao, Even Chow, Hao Chen, Jiqian Mo, J. Guo, Zhiguo Gong\",\"doi\":\"10.1109/ICEBE.2017.41\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Question classification is an essential part of Question Answering system(QA). This paper introduces our research work on automatic question classification that depends on the sample set including questions from legal forum. We propose a taxonomy for law question, and divide questions into three main parts: civil, criminal and administrative according to Chinese legal system. We have experimented with four machine learning algorithms: Nearest Neighbors (NN), Naïve Bayes (NB), Logistic Regression (LR) and Support Vector Machines (SVM) using two kinds of features: TF-IDF and word2vec embeddings. Further, we used fastText and adjusted the parameters to get the better results. The research shows high accuracy in Chinese question classification in law domain. Moreover, to the best of our knowledge, our work is the first attempt in this promising domain.\",\"PeriodicalId\":347774,\"journal\":{\"name\":\"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)\",\"volume\":\"119 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEBE.2017.41\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEBE.2017.41","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Chinese Questions Classification in the Law Domain
Question classification is an essential part of Question Answering system(QA). This paper introduces our research work on automatic question classification that depends on the sample set including questions from legal forum. We propose a taxonomy for law question, and divide questions into three main parts: civil, criminal and administrative according to Chinese legal system. We have experimented with four machine learning algorithms: Nearest Neighbors (NN), Naïve Bayes (NB), Logistic Regression (LR) and Support Vector Machines (SVM) using two kinds of features: TF-IDF and word2vec embeddings. Further, we used fastText and adjusted the parameters to get the better results. The research shows high accuracy in Chinese question classification in law domain. Moreover, to the best of our knowledge, our work is the first attempt in this promising domain.