Research on Short Text Similarity Calculation Method for Power Intelligent Question Answering

Fanqi Meng, Wenhui Wang, Jingdong Wang
{"title":"Research on Short Text Similarity Calculation Method for Power Intelligent Question Answering","authors":"Fanqi Meng, Wenhui Wang, Jingdong Wang","doi":"10.1109/CICN51697.2021.9574692","DOIUrl":null,"url":null,"abstract":"With the development of artificial intelligence, the question answering system has penetrated into various industries and has become an important production factor. In the electricity field, problems such as the diversification of power equipment failures and the complicated terminology of the power industry are challenging the traditional power question answering system solutions. Therefore, it is of great significance to construct a question answering system based on the knowledge base in the electricity field. However, there are two problems to be solved in the question answering system in this field: (1) How to accurately segment the vocabulary (2) How to effectively match the sentence similarity. To solve the above problems, this paper proposes an algorithm model of cosine similarity combined with TF-IDF. First, add a custom electricity power dictionary in the word segmentation stage, secondly use the space vector model (VSM)-based TD-IDF algorithm for vectorization, and finally, use cosine similarity degree to perform similarity comparison. This method is verified on the electricity power question answering data set, and compared with the LDA model, TF -IDF algorithm and LSI model respectively. The experimental results show that the accuracy of the method proposed in this paper reaches 75.8%, which is significantly better than the other three. It proves that the research model can accurately match user questions, effectively reduce labor costs, and help electric power workers better solve the problems encountered in their work.","PeriodicalId":224313,"journal":{"name":"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Computational Intelligence and Communication Networks (CICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICN51697.2021.9574692","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

With the development of artificial intelligence, the question answering system has penetrated into various industries and has become an important production factor. In the electricity field, problems such as the diversification of power equipment failures and the complicated terminology of the power industry are challenging the traditional power question answering system solutions. Therefore, it is of great significance to construct a question answering system based on the knowledge base in the electricity field. However, there are two problems to be solved in the question answering system in this field: (1) How to accurately segment the vocabulary (2) How to effectively match the sentence similarity. To solve the above problems, this paper proposes an algorithm model of cosine similarity combined with TF-IDF. First, add a custom electricity power dictionary in the word segmentation stage, secondly use the space vector model (VSM)-based TD-IDF algorithm for vectorization, and finally, use cosine similarity degree to perform similarity comparison. This method is verified on the electricity power question answering data set, and compared with the LDA model, TF -IDF algorithm and LSI model respectively. The experimental results show that the accuracy of the method proposed in this paper reaches 75.8%, which is significantly better than the other three. It proves that the research model can accurately match user questions, effectively reduce labor costs, and help electric power workers better solve the problems encountered in their work.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
面向电力智能问答的短文本相似度计算方法研究
随着人工智能的发展,问答系统已经渗透到各个行业,成为重要的生产要素。在电力领域,电力设备故障的多样化、电力行业术语的复杂化等问题对传统的电力问答系统解决方案提出了挑战。因此,构建基于知识库的电力领域问答系统具有十分重要的意义。然而,该领域的问答系统需要解决两个问题:(1)如何准确地分词(2)如何有效地匹配句子相似度。针对上述问题,本文提出了一种结合TF-IDF的余弦相似度算法模型。首先在分词阶段添加自定义电功率字典,其次使用基于空间向量模型(VSM)的TD-IDF算法进行矢量化,最后使用余弦相似度进行相似度比较。该方法在电力问答数据集上进行了验证,并分别与LDA模型、TF -IDF算法和LSI模型进行了比较。实验结果表明,本文方法的准确率达到75.8%,明显优于其他三种方法。实践证明,该研究模型能够准确匹配用户问题,有效降低人工成本,帮助电力工作人员更好地解决工作中遇到的问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Hybrid Model based on Support Vector Machine and Principal Component Analysis Applied to Arterial Hypertension Detection Sentiment Analysis on Zomato Reviews A Review of Image-Based Deep Learning Algorithms for Cervical Cancer Screening Establish Program WCET and Energy Consumption Prediction Model Based on L-M Algorithm Chicken Swarm Optimization Algorithm Based on Adaptive Dynamic Distribution
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1