An exploratory evaluation of code smell agglomerations

IF 2.3 3区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Software Quality Journal Pub Date : 2024-07-11 DOI:10.1007/s11219-024-09680-6

Amanda Santana, Eduardo Figueiredo, Juliana Alves Pereira, Alessandro Garcia

{"title":"An exploratory evaluation of code smell agglomerations","authors":"Amanda Santana, Eduardo Figueiredo, Juliana Alves Pereira, Alessandro Garcia","doi":"10.1007/s11219-024-09680-6","DOIUrl":null,"url":null,"abstract":"<p>Code smell is a symptom of decisions about the system design or code that may degrade its modularity. For example, they may indicate inheritance misuse, excessive coupling and size. When two or more code smells occur in the same snippet of code, they form a code smell agglomeration. Few studies evaluate how agglomerations may impact code modularity. In this work, we evaluate which aspects of modularity are being hindered by agglomerations. This way, we can support practitioners in improving their code, by refactoring the code involved with code smell agglomeration that was found as harmful to the system modularity. We analyze agglomerations composed of four types of code smells: Large Class, Long Method, Feature Envy, and Refused Bequest. We then conduct a comparison study between 20 systems mined from the Qualita Corpus dataset with 10 systems mined from GitHub. In total, we analyzed 1789 agglomerations in 30 software projects, from both repositories: Qualita Corpus and GitHub. We rely on frequent itemset mining and non-parametric hypothesis testing for our analysis. Agglomerations formed by two or more Feature Envy smells have a significant frequency in the source code for both repositories. Agglomerations formed by different smell types impact the modularity more than classes with only one smell type and classes without smells. For some metrics, when Large Class appears alone, it has a significant and large impact when compared to classes that have two or more method-level smells of the same type. We have identified which agglomerations are more frequent in the source code, and how they may impact the code modularity. Consequently, we provide supporting evidence of which agglomerations developers should refactor to improve the code modularity.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"42 1","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software Quality Journal","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11219-024-09680-6","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Code smell is a symptom of decisions about the system design or code that may degrade its modularity. For example, they may indicate inheritance misuse, excessive coupling and size. When two or more code smells occur in the same snippet of code, they form a code smell agglomeration. Few studies evaluate how agglomerations may impact code modularity. In this work, we evaluate which aspects of modularity are being hindered by agglomerations. This way, we can support practitioners in improving their code, by refactoring the code involved with code smell agglomeration that was found as harmful to the system modularity. We analyze agglomerations composed of four types of code smells: Large Class, Long Method, Feature Envy, and Refused Bequest. We then conduct a comparison study between 20 systems mined from the Qualita Corpus dataset with 10 systems mined from GitHub. In total, we analyzed 1789 agglomerations in 30 software projects, from both repositories: Qualita Corpus and GitHub. We rely on frequent itemset mining and non-parametric hypothesis testing for our analysis. Agglomerations formed by two or more Feature Envy smells have a significant frequency in the source code for both repositories. Agglomerations formed by different smell types impact the modularity more than classes with only one smell type and classes without smells. For some metrics, when Large Class appears alone, it has a significant and large impact when compared to classes that have two or more method-level smells of the same type. We have identified which agglomerations are more frequent in the source code, and how they may impact the code modularity. Consequently, we provide supporting evidence of which agglomerations developers should refactor to improve the code modularity.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

代码气味聚集的探索性评估

代码气味是系统设计或代码决策的一种症状，可能会降低其模块性。例如，它们可能表明继承使用不当、过度耦合和规模过大。当两个或两个以上的代码气味出现在同一代码片段中时，它们就形成了代码气味聚集。很少有研究会评估聚集会如何影响代码模块化。在这项工作中，我们将评估聚合会阻碍模块化的哪些方面。这样，我们就可以通过重构代码气味聚集所涉及的代码，帮助实践者改进他们的代码，因为我们发现这些代码气味聚集对系统的模块性是有害的。我们分析了由四种代码气味组成的聚集：大类、长方法、嫉妒特征和拒绝请求。然后，我们对从 Qualita 语料库数据集中挖掘的 20 个系统和从 GitHub 挖掘的 10 个系统进行了对比研究。我们总共分析了 30 个软件项目中的 1789 个聚类，它们都来自这两个资源库：Qualita语料库和GitHub。我们的分析依赖于频繁项集挖掘和非参数假设检验。在这两个资源库的源代码中，由两种或两种以上的 Feature Envy 气味形成的聚合具有显著的频率。与只有一种气味类型的类和没有气味的类相比，由不同气味类型形成的聚合体对模块化的影响更大。就某些指标而言，当大类单独出现时，与有两个或更多相同类型的方法级气味的类相比，大类的影响更大。我们已经确定了源代码中哪些聚集现象更为频繁，以及它们会如何影响代码模块化。因此，我们提供了开发人员应重构哪些聚类以提高代码模块化程度的佐证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Software Quality Journal 工程技术-计算机：软件工程

CiteScore

4.90

自引率

5.30%

发文量

审稿时长

>12 weeks

期刊介绍： The aims of the Software Quality Journal are: (1) To promote awareness of the crucial role of quality management in the effective construction of the software systems developed, used, and/or maintained by organizations in pursuit of their business objectives. (2) To provide a forum of the exchange of experiences and information on software quality management and the methods, tools and products used to measure and achieve it. (3) To provide a vehicle for the publication of academic papers related to all aspects of software quality. The Journal addresses all aspects of software quality from both a practical and an academic viewpoint. It invites contributions from practitioners and academics, as well as national and international policy and standard making bodies, and sets out to be the definitive international reference source for such information. The Journal will accept research, technique, case study, survey and tutorial submissions that address quality-related issues including, but not limited to: internal and external quality standards, management of quality within organizations, technical aspects of quality, quality aspects for product vendors, software measurement and metrics, software testing and other quality assurance techniques, total quality management and cultural aspects. Other technical issues with regard to software quality, including: data management, formal methods, safety critical applications, and CASE.