Sentiment Analysis for Software Engineering: How Far Can We Go?

2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE) Pub Date : 2018-05-27 DOI:10.1145/3180155.3180195

B. Lin, Fiorella Zampetti, G. Bavota, M. D. Penta, Michele Lanza, R. Oliveto

{"title":"Sentiment Analysis for Software Engineering: How Far Can We Go?","authors":"B. Lin, Fiorella Zampetti, G. Bavota, M. D. Penta, Michele Lanza, R. Oliveto","doi":"10.1145/3180155.3180195","DOIUrl":null,"url":null,"abstract":"Sentiment analysis has been applied to various software engineering (SE) tasks, such as evaluating app reviews or analyzing developers' emotions in commit messages. Studies indicate that sentiment analysis tools provide unreliable results when used out-of-the-box, since they are not designed to process SE datasets. The silver bullet for a successful application of sentiment analysis tools to SE datasets might be their customization to the specific usage context. We describe our experience in building a software library recommender exploiting crowdsourced opinions mined from Stack Overflow (e.g., what is the sentiment of developers about the usability of a library). To reach our goal, we retrained—on a set of 40k manually labeled sentences/words extracted from Stack Overflow—a state-of-the-art sentiment analysis tool exploiting deep learning. Despite such an effort- and time-consuming training process, the results were negative. We changed our focus and performed a thorough investigation of the accuracy of these tools on a variety of SE datasets. Our results should warn the research community about the strong limitations of current sentiment analysis tools.","PeriodicalId":6560,"journal":{"name":"2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE)","volume":"33 1","pages":"94-104"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"155","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3180155.3180195","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 155

Abstract

Sentiment analysis has been applied to various software engineering (SE) tasks, such as evaluating app reviews or analyzing developers' emotions in commit messages. Studies indicate that sentiment analysis tools provide unreliable results when used out-of-the-box, since they are not designed to process SE datasets. The silver bullet for a successful application of sentiment analysis tools to SE datasets might be their customization to the specific usage context. We describe our experience in building a software library recommender exploiting crowdsourced opinions mined from Stack Overflow (e.g., what is the sentiment of developers about the usability of a library). To reach our goal, we retrained—on a set of 40k manually labeled sentences/words extracted from Stack Overflow—a state-of-the-art sentiment analysis tool exploiting deep learning. Despite such an effort- and time-consuming training process, the results were negative. We changed our focus and performed a thorough investigation of the accuracy of these tools on a variety of SE datasets. Our results should warn the research community about the strong limitations of current sentiment analysis tools.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

面向软件工程的情感分析:我们能走多远?

情感分析已经应用于各种软件工程(SE)任务，例如评估应用程序评论或分析提交消息中的开发人员情绪。研究表明，情绪分析工具在开箱即用时提供的结果不可靠，因为它们不是为处理SE数据集而设计的。情感分析工具成功应用于SE数据集的灵丹妙药可能是它们对特定使用上下文的定制。我们描述了我们建立软件库推荐的经验，利用从Stack Overflow中挖掘的众包意见(例如，开发人员对库的可用性的看法)。为了达到我们的目标，我们重新训练了一组从Stack overflow中提取的40k手动标记的句子/单词，这是一种利用深度学习的最先进的情感分析工具。尽管这样一个努力和耗时的训练过程，结果是消极的。我们改变了我们的重点，并对这些工具在各种SE数据集上的准确性进行了彻底的调查。我们的研究结果应该提醒研究界，当前的情感分析工具有很强的局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE)

自引率

0.00%

发文量

期刊最新文献

Launch-Mode-Aware Context-Sensitive Activity Transition Analysis A Combinatorial Approach for Exposing Off-Nominal Behaviors Perses: Syntax-Guided Program Reduction Fine-Grained Test Minimization From UI Design Image to GUI Skeleton: A Neural Machine Translator to Bootstrap Mobile GUI Implementation