{"title":"Shannon Entropy is better Feature than Category and Sentiment in User Feedback Processing","authors":"Andres Rojas Paredes, Brenda Mareco","doi":"arxiv-2409.12012","DOIUrl":null,"url":null,"abstract":"App reviews in mobile app stores contain useful information which is used to\nimprove applications and promote software evolution. This information is\nprocessed by automatic tools which prioritize reviews. In order to carry out\nthis prioritization, reviews are decomposed into features like category and\nsentiment. Then, a weighted function assigns a weight to each feature and a\nreview ranking is calculated. Unfortunately, in order to extract category and\nsentiment from reviews, its is required at least a classifier trained in an\nannotated corpus. Therefore this task is computational demanding. Thus, in this\nwork, we propose Shannon Entropy as a simple feature which can replace standard\nfeatures. Our results show that a Shannon Entropy based ranking is better than\na standard ranking according to the NDCG metric. This result is promising even\nif we require fairness by means of algorithmic bias. Finally, we highlight a\ncomputational limit which appears in the search of the best ranking.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"41 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.12012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
App reviews in mobile app stores contain useful information which is used to
improve applications and promote software evolution. This information is
processed by automatic tools which prioritize reviews. In order to carry out
this prioritization, reviews are decomposed into features like category and
sentiment. Then, a weighted function assigns a weight to each feature and a
review ranking is calculated. Unfortunately, in order to extract category and
sentiment from reviews, its is required at least a classifier trained in an
annotated corpus. Therefore this task is computational demanding. Thus, in this
work, we propose Shannon Entropy as a simple feature which can replace standard
features. Our results show that a Shannon Entropy based ranking is better than
a standard ranking according to the NDCG metric. This result is promising even
if we require fairness by means of algorithmic bias. Finally, we highlight a
computational limit which appears in the search of the best ranking.