J. Niu, Qingjuan Zhao, Lei Wang, Huanpei Chen, Shichao Zheng
{"title":"Opinion summarization for short texts based on BM25 and syntactic parsing","authors":"J. Niu, Qingjuan Zhao, Lei Wang, Huanpei Chen, Shichao Zheng","doi":"10.1109/INDIN.2016.7819344","DOIUrl":null,"url":null,"abstract":"Online short texts of hot topics submitted to social media by users can provide valuable personal opinions, which are useful for service providers and individuals. However, it is difficult for readers to grasp the main opinions of massive short texts. In this paper, to cope with the summarization challenge of short texts, we proposed a novel approach, which makes full use of BM25 to weight each short text and syntactic parsing to generate important information of each opinion cluster. The approach also utilizes the feature pruning to reduce the dimensions of the vectors. We conduct our experiments on real datasets and evaluate the results by standard metrics and manual evaluation. The experimental results show that our proposed approach improves the accuracy when compared to the state-of-the-art method.","PeriodicalId":421680,"journal":{"name":"2016 IEEE 14th International Conference on Industrial Informatics (INDIN)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 14th International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN.2016.7819344","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Online short texts of hot topics submitted to social media by users can provide valuable personal opinions, which are useful for service providers and individuals. However, it is difficult for readers to grasp the main opinions of massive short texts. In this paper, to cope with the summarization challenge of short texts, we proposed a novel approach, which makes full use of BM25 to weight each short text and syntactic parsing to generate important information of each opinion cluster. The approach also utilizes the feature pruning to reduce the dimensions of the vectors. We conduct our experiments on real datasets and evaluate the results by standard metrics and manual evaluation. The experimental results show that our proposed approach improves the accuracy when compared to the state-of-the-art method.