{"title":"二元短文本分类的结构学习框架","authors":"Wuying Liu, Lin Wang","doi":"10.1109/FSKD.2016.7603347","DOIUrl":null,"url":null,"abstract":"With the fast-paced prevalence of smartphones, binary short text classification (STC) is becoming a basic and challenging issue, and relevant STC algorithms can be successfully used in spam filtering for short message service (SMS), wechat, microblogging, and so on. In this manuscript, we address the structural feature of SMS documents and propose a structural learning framework, which decomposes the complex binary STC problem according to the SMS document structure, and predicts the final category by combining several sub-predictions. Supported by our index of string-frequence, we also implement some STC domain classifiers. The experimental results show that the performance of two previous STC algorithms can be upgraded by the structural learning framework, and our STC domain classifiers can achieve the state-of-the-art performance on the task of Chinese SMS spam filtering within the structural learning framework.","PeriodicalId":373155,"journal":{"name":"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Structural learning framework for binary short text classification\",\"authors\":\"Wuying Liu, Lin Wang\",\"doi\":\"10.1109/FSKD.2016.7603347\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the fast-paced prevalence of smartphones, binary short text classification (STC) is becoming a basic and challenging issue, and relevant STC algorithms can be successfully used in spam filtering for short message service (SMS), wechat, microblogging, and so on. In this manuscript, we address the structural feature of SMS documents and propose a structural learning framework, which decomposes the complex binary STC problem according to the SMS document structure, and predicts the final category by combining several sub-predictions. Supported by our index of string-frequence, we also implement some STC domain classifiers. The experimental results show that the performance of two previous STC algorithms can be upgraded by the structural learning framework, and our STC domain classifiers can achieve the state-of-the-art performance on the task of Chinese SMS spam filtering within the structural learning framework.\",\"PeriodicalId\":373155,\"journal\":{\"name\":\"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FSKD.2016.7603347\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FSKD.2016.7603347","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Structural learning framework for binary short text classification
With the fast-paced prevalence of smartphones, binary short text classification (STC) is becoming a basic and challenging issue, and relevant STC algorithms can be successfully used in spam filtering for short message service (SMS), wechat, microblogging, and so on. In this manuscript, we address the structural feature of SMS documents and propose a structural learning framework, which decomposes the complex binary STC problem according to the SMS document structure, and predicts the final category by combining several sub-predictions. Supported by our index of string-frequence, we also implement some STC domain classifiers. The experimental results show that the performance of two previous STC algorithms can be upgraded by the structural learning framework, and our STC domain classifiers can achieve the state-of-the-art performance on the task of Chinese SMS spam filtering within the structural learning framework.