{"title":"Conditioning Customers' Product Reviews for Accurate Classification Performance","authors":"Dorothy Yao, Ishani Chatterjee, Mengchu Zhou","doi":"10.1109/ICNSC55942.2022.10004165","DOIUrl":null,"url":null,"abstract":"In recent years, people use Internet as a platform to express their own ideas and opinions about various subjects or products. The data from these sites serve as sources for sentiment analysis. On e-commerce websites, the costumer product review conventionally expresses sentiment that corresponds with the given star rating; however, this is not always true; there are reviews that express sentiments opposite to the given star rating, which can be labeled as outliers. This paper builds on previous work that finds outliers in product review datasets, scraped from Amazon.com, using a statistics-based outlier detection and correction method (SODCM). This work focuses on 3-star reviews specifically and studies the correct polarity assignment of 3-star reviews. It investigates the behavior of SODCM when 3-star reviews are classified as negative and positive respectively.","PeriodicalId":230499,"journal":{"name":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNSC55942.2022.10004165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, people use Internet as a platform to express their own ideas and opinions about various subjects or products. The data from these sites serve as sources for sentiment analysis. On e-commerce websites, the costumer product review conventionally expresses sentiment that corresponds with the given star rating; however, this is not always true; there are reviews that express sentiments opposite to the given star rating, which can be labeled as outliers. This paper builds on previous work that finds outliers in product review datasets, scraped from Amazon.com, using a statistics-based outlier detection and correction method (SODCM). This work focuses on 3-star reviews specifically and studies the correct polarity assignment of 3-star reviews. It investigates the behavior of SODCM when 3-star reviews are classified as negative and positive respectively.