{"title":"基于知识库和混合网络的敏感文本分类研究","authors":"Yongfeng Li, Hongliang Wang, Xueting Li, Pengfei Xiu","doi":"10.1109/ICCC56324.2022.10065790","DOIUrl":null,"url":null,"abstract":"Aiming at the problems that convolution neural network is not sufficient to extract text features, it is difficult to capture long text structure information and sentence semantic relationship, and neural network can only extract the surface features of text, and it is difficult to obtain the implicit features of sentences when classifying text, this paper proposes a hybrid neural network text classification method based on a knowledge base. By extracting keywords from the category text, the keyword weight is calculated according to the co-occurrence relationship, and the category keyword knowledge base is constructed. It integrates CNN and bilstm, introduces attention mechanism, enhances the ability to extract local and temporal features of text, and improves the accuracy of text classification in combination with knowledge base information. The experimental results on the CNews dataset show that compared with TextCNN, TextRCNN, BILSTM and Bert models, this model can effectively improve the accuracy of text classification.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Sensitive Text Classification Based on Knowledge Base and Hybrid Network\",\"authors\":\"Yongfeng Li, Hongliang Wang, Xueting Li, Pengfei Xiu\",\"doi\":\"10.1109/ICCC56324.2022.10065790\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problems that convolution neural network is not sufficient to extract text features, it is difficult to capture long text structure information and sentence semantic relationship, and neural network can only extract the surface features of text, and it is difficult to obtain the implicit features of sentences when classifying text, this paper proposes a hybrid neural network text classification method based on a knowledge base. By extracting keywords from the category text, the keyword weight is calculated according to the co-occurrence relationship, and the category keyword knowledge base is constructed. It integrates CNN and bilstm, introduces attention mechanism, enhances the ability to extract local and temporal features of text, and improves the accuracy of text classification in combination with knowledge base information. The experimental results on the CNews dataset show that compared with TextCNN, TextRCNN, BILSTM and Bert models, this model can effectively improve the accuracy of text classification.\",\"PeriodicalId\":263098,\"journal\":{\"name\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCC56324.2022.10065790\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10065790","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Sensitive Text Classification Based on Knowledge Base and Hybrid Network
Aiming at the problems that convolution neural network is not sufficient to extract text features, it is difficult to capture long text structure information and sentence semantic relationship, and neural network can only extract the surface features of text, and it is difficult to obtain the implicit features of sentences when classifying text, this paper proposes a hybrid neural network text classification method based on a knowledge base. By extracting keywords from the category text, the keyword weight is calculated according to the co-occurrence relationship, and the category keyword knowledge base is constructed. It integrates CNN and bilstm, introduces attention mechanism, enhances the ability to extract local and temporal features of text, and improves the accuracy of text classification in combination with knowledge base information. The experimental results on the CNews dataset show that compared with TextCNN, TextRCNN, BILSTM and Bert models, this model can effectively improve the accuracy of text classification.