Yue Li, Yuanhui Yu, Yaxian Su, Tao Yang, Xu Zhang, Jiandong Shi
{"title":"A Study of Review Hot Words Extraction Technology Based on the LSTM Web Review Validity Model","authors":"Yue Li, Yuanhui Yu, Yaxian Su, Tao Yang, Xu Zhang, Jiandong Shi","doi":"10.1109/CISP-BMEI56279.2022.9979915","DOIUrl":null,"url":null,"abstract":"The huge amount of online review text data brings a great challenge to the extraction of valid information and hot words extraction work. This paper addresses this problem and designs a study on hot words extraction based on a Bidirectional LSTM(Long short term memory) online review text validity model. Firstly, data pre-processing is performed on the data set of online review texts collected by crawlers, secondly, a validity model of online review texts based on LSTM neural network is established to filter the valid online review texts, and finally, hot words are extracted from the valid review texts to get the hot words containing valuable information. In this paper, we take hotel review text as an example to conduct experiments, and the experimental results prove that the accuracy of LSTM online review text validity model reaches 90%, the loss value reaches 0.2, and the screening of valid text for hot words extraction achieves good results.","PeriodicalId":198522,"journal":{"name":"2022 15th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 15th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI56279.2022.9979915","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The huge amount of online review text data brings a great challenge to the extraction of valid information and hot words extraction work. This paper addresses this problem and designs a study on hot words extraction based on a Bidirectional LSTM(Long short term memory) online review text validity model. Firstly, data pre-processing is performed on the data set of online review texts collected by crawlers, secondly, a validity model of online review texts based on LSTM neural network is established to filter the valid online review texts, and finally, hot words are extracted from the valid review texts to get the hot words containing valuable information. In this paper, we take hotel review text as an example to conduct experiments, and the experimental results prove that the accuracy of LSTM online review text validity model reaches 90%, the loss value reaches 0.2, and the screening of valid text for hot words extraction achieves good results.