{"title":"面向电力智能问答的短文本相似度计算方法研究","authors":"Fanqi Meng, Wenhui Wang, Jingdong Wang","doi":"10.1109/CICN51697.2021.9574692","DOIUrl":null,"url":null,"abstract":"With the development of artificial intelligence, the question answering system has penetrated into various industries and has become an important production factor. In the electricity field, problems such as the diversification of power equipment failures and the complicated terminology of the power industry are challenging the traditional power question answering system solutions. Therefore, it is of great significance to construct a question answering system based on the knowledge base in the electricity field. However, there are two problems to be solved in the question answering system in this field: (1) How to accurately segment the vocabulary (2) How to effectively match the sentence similarity. To solve the above problems, this paper proposes an algorithm model of cosine similarity combined with TF-IDF. First, add a custom electricity power dictionary in the word segmentation stage, secondly use the space vector model (VSM)-based TD-IDF algorithm for vectorization, and finally, use cosine similarity degree to perform similarity comparison. This method is verified on the electricity power question answering data set, and compared with the LDA model, TF -IDF algorithm and LSI model respectively. The experimental results show that the accuracy of the method proposed in this paper reaches 75.8%, which is significantly better than the other three. It proves that the research model can accurately match user questions, effectively reduce labor costs, and help electric power workers better solve the problems encountered in their work.","PeriodicalId":224313,"journal":{"name":"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Research on Short Text Similarity Calculation Method for Power Intelligent Question Answering\",\"authors\":\"Fanqi Meng, Wenhui Wang, Jingdong Wang\",\"doi\":\"10.1109/CICN51697.2021.9574692\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of artificial intelligence, the question answering system has penetrated into various industries and has become an important production factor. In the electricity field, problems such as the diversification of power equipment failures and the complicated terminology of the power industry are challenging the traditional power question answering system solutions. Therefore, it is of great significance to construct a question answering system based on the knowledge base in the electricity field. However, there are two problems to be solved in the question answering system in this field: (1) How to accurately segment the vocabulary (2) How to effectively match the sentence similarity. To solve the above problems, this paper proposes an algorithm model of cosine similarity combined with TF-IDF. First, add a custom electricity power dictionary in the word segmentation stage, secondly use the space vector model (VSM)-based TD-IDF algorithm for vectorization, and finally, use cosine similarity degree to perform similarity comparison. This method is verified on the electricity power question answering data set, and compared with the LDA model, TF -IDF algorithm and LSI model respectively. The experimental results show that the accuracy of the method proposed in this paper reaches 75.8%, which is significantly better than the other three. It proves that the research model can accurately match user questions, effectively reduce labor costs, and help electric power workers better solve the problems encountered in their work.\",\"PeriodicalId\":224313,\"journal\":{\"name\":\"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CICN51697.2021.9574692\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICN51697.2021.9574692","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Short Text Similarity Calculation Method for Power Intelligent Question Answering
With the development of artificial intelligence, the question answering system has penetrated into various industries and has become an important production factor. In the electricity field, problems such as the diversification of power equipment failures and the complicated terminology of the power industry are challenging the traditional power question answering system solutions. Therefore, it is of great significance to construct a question answering system based on the knowledge base in the electricity field. However, there are two problems to be solved in the question answering system in this field: (1) How to accurately segment the vocabulary (2) How to effectively match the sentence similarity. To solve the above problems, this paper proposes an algorithm model of cosine similarity combined with TF-IDF. First, add a custom electricity power dictionary in the word segmentation stage, secondly use the space vector model (VSM)-based TD-IDF algorithm for vectorization, and finally, use cosine similarity degree to perform similarity comparison. This method is verified on the electricity power question answering data set, and compared with the LDA model, TF -IDF algorithm and LSI model respectively. The experimental results show that the accuracy of the method proposed in this paper reaches 75.8%, which is significantly better than the other three. It proves that the research model can accurately match user questions, effectively reduce labor costs, and help electric power workers better solve the problems encountered in their work.