Architecture smells caused by poor design decisions can significantly degrade the quality of microservices; granularity smells, in particular, strongly affect the quality of a microservice architecture. State-of-the-art methods for detecting microservice granularity smells focus primarily on the service level, ignoring finer-grained information such as interfaces, and they also overlook semantic information related to business logic, which lowers the accuracy of their detection results. To address these issues, we introduce ASDMG, which takes semantic information within the Abstract Syntax Tree (AST) into consideration, integrating it with data dependencies to extract the business topic relationships of functions. It performs interface-oriented business topic clustering, allowing comprehensive detection of granularity smells both within individual microservices and across the overall microservice architecture. Experiments were conducted on five open-source microservice systems of different scales and domains. The results show that ASDMG achieves an average precision of 83.41%, an average recall of 95.84%, and an average accuracy of 95.85% in detecting architectural granularity smells. Compared to state-of-the-art methods, it achieves better detection results and can improve the quality of a microservice architecture.
Title: "ASDMG: business topic clustering-based architecture smell detection for microservice granularity"
Authors: Sixuan Wang, Baoqing Jin, Dongjin Yu, Shuhan Cheng
Journal: Software Quality Journal
Pub Date: 2024-07-08
DOI: 10.1007/s11219-024-09681-5
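To make the business-topic clustering idea concrete, here is a minimal, self-contained sketch of grouping functions by the lexical topics in their names — a toy stand-in (the tokenizer, similarity threshold, and function names are all our own illustration), not ASDMG's AST- and data-dependency-based algorithm:

```python
import math, re
from collections import Counter

def tokens(identifier):
    # Split camelCase / snake_case identifiers into lowercase terms.
    parts = re.findall(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])", identifier.replace("_", " "))
    return [p.lower() for p in parts]

def cosine(a, b):
    common = set(a) & set(b)
    num = sum(a[t] * b[t] for t in common)
    den = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0

def cluster_by_topic(functions, threshold=0.4):
    # Greedy single-pass clustering: a function joins the first cluster
    # whose representative (first member) is lexically similar enough.
    vecs = {f: Counter(tokens(f)) for f in functions}
    clusters = []
    for f in functions:
        for c in clusters:
            if cosine(vecs[f], vecs[c[0]]) >= threshold:
                c.append(f)
                break
        else:
            clusters.append([f])
    return clusters

funcs = ["createOrder", "cancelOrder", "orderCreate_db", "sendInvoiceEmail", "emailInvoice"]
print(cluster_by_topic(funcs))  # two topic clusters: "order" vs "invoice email"
```

In ASDMG's setting, such clusters would be compared against actual service and interface boundaries: a service spanning several unrelated topic clusters hints at overly coarse granularity, while one topic split across services hints at excessive fragmentation.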
Pub Date: 2024-07-05
DOI: 10.1007/s11219-024-09685-1
Xufeng Zhao, Yilei Bu, Wendi Pang, Jiajia Cai
For a 24/7 database system, backups should be implemented right after a large volume of data has been updated, placing the backup windows in non-busy periods for the user's convenience. From this viewpoint, this paper studies periodic and random incremental backup policies, in which an incremental backup is implemented right after each data update and a full backup is performed either at periodic times \(KT\) or at a number \(N\) of data updates, respectively. We first describe the stochastic processes of data update and database failure, and then model the expected cost rates for data backup and data restoration. The respective optima \(K^*\), \(N^*\), \(K_f^*\), and \(N_f^*\) are obtained analytically to minimize the expected cost rates. Finally, numerical examples are given to illustrate the optimum policies.
Title: "Periodic and random incremental backup policies in reliability theory"
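As a rough illustration of the kind of trade-off such policies optimize (our own simplified notation, not the paper's model): suppose updates arrive at rate \(\lambda\), each incremental backup costs \(c_I\), a full backup costs \(c_F\), failures occur at rate \(\mu\), and restoring after a failure costs \(c_R\) per update redone since the last full backup (on average \(\lambda KT/2\) of them). The expected cost rate for full backups every \(KT\) is then

```latex
C(K) \;=\; \frac{c_F}{KT} \;+\; c_I\,\lambda \;+\; c_R\,\mu\,\frac{\lambda K T}{2},
\qquad
K^{*}T \;=\; \sqrt{\frac{2\,c_F}{c_R\,\mu\,\lambda}},
```

where \(K^{*}T\) balances the fixed cost of frequent full backups against the growing restoration cost. The paper derives the analogous optima \(K^*\), \(N^*\), \(K_f^*\), \(N_f^*\) for its own stochastic model.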
Development of machine learning (ML) systems differs from traditional approaches. The probabilistic nature of ML leads to a more experimental development approach, which often results in a disparity between the quality of ML models and other aspects such as business, safety, and the overall system architecture. Herein, the Multi-view Modeling Framework for ML Systems (M3S) is proposed as a solution to this problem. M3S provides an analysis framework that integrates different views. It is supported by an integrated metamodel to ensure the connection and consistency between the different models. To accommodate the experimental nature of ML training, M3S provides an integrated platform between the modeling environment and the ML training pipeline. M3S is validated through a case study and a controlled experiment. M3S shows promise, but future research needs to confirm its generality.
Title: "Integrated multi-view modeling for reliable machine learning-intensive software engineering"
Authors: Jati H. Husen, Hironori Washizaki, Jomphon Runpakprakun, Nobukazu Yoshioka, Hnin Thandar Tun, Yoshiaki Fukazawa, Hironori Takeuchi
Pub Date: 2024-07-03
DOI: 10.1007/s11219-024-09687-z
Pub Date: 2024-07-03
DOI: 10.1007/s11219-024-09683-3
Hongwei Tao, Xiaoxu Niu, Lang Xu, Lianyou Fu, Qiaoling Cao, Haoran Chen, Songtao Shang, Yang Xian
As information technology continues to advance, software applications are becoming increasingly critical. However, the growing size and complexity of software development can lead to serious flaws resulting in significant financial losses. To address this issue, Software Defect Prediction (SDP) technology is being developed to detect and resolve defects early in the software development process, ensuring high software quality. As a result, SDP research has become a major focus for academics worldwide. This study aims to compare various machine learning-based SDP algorithm models and determine whether traditional machine learning algorithms affect SDP outcomes. Unlike previous studies that aimed to identify the best prediction model for all datasets, this paper constructs SDP superiority models separately for different datasets. Using the publicly available ESEM2016 dataset, 13 machine learning classification algorithms are employed to predict software defects. Evaluation indicators such as Accuracy, AUC (Area Under the Curve), F-measure, and Running Time (RT) are utilized to assess the performance of the classification algorithms. Due to the serious class imbalance problem in this dataset, 10 sampling methods are combined with the 13 machine learning algorithms to explore the effect of sampling techniques on the performance of traditional machine learning classification models. Finally, a comprehensive evaluation is conducted to identify the best combination of sampling techniques and classification models to construct the final dominant model for SDP.
Title: "A comparative study of software defect binomial classification prediction models based on machine learning"
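To illustrate why sampling techniques are paired with classifiers on imbalanced defect data, here is a toy, stdlib-only sketch (the data, the oversampler, and the nearest-centroid classifier are our own illustration — the study itself evaluates 13 algorithms and 10 sampling methods on the ESEM2016 dataset):

```python
import random
from statistics import mean

def oversample(X, y, seed=0):
    # Random oversampling: duplicate minority-class rows until classes balance.
    rng = random.Random(seed)
    by_cls = {}
    for xi, yi in zip(X, y):
        by_cls.setdefault(yi, []).append(xi)
    target = max(len(v) for v in by_cls.values())
    Xb, yb = [], []
    for cls, rows in by_cls.items():
        rows = rows + [rng.choice(rows) for _ in range(target - len(rows))]
        Xb += rows
        yb += [cls] * target
    return Xb, yb

def nearest_centroid(train_X, train_y, x):
    # Classify by squared distance to each class's mean feature vector.
    cents = {}
    for cls in set(train_y):
        rows = [xi for xi, yi in zip(train_X, train_y) if yi == cls]
        cents[cls] = [mean(col) for col in zip(*rows)]
    dist = lambda a, b: sum((u - v) ** 2 for u, v in zip(a, b))
    return min(cents, key=lambda c: dist(cents[c], x))

# Toy defect data: [lines of code, complexity]; 1 = defective (minority class).
X = [[10, 1], [12, 1], [11, 2], [90, 9], [100, 8], [13, 1], [95, 10], [12, 2]]
y = [0, 0, 0, 1, 1, 0, 1, 0]
Xb, yb = oversample(X, y)
assert yb.count(0) == yb.count(1)      # classes are balanced after sampling
print(nearest_centroid(Xb, yb, [97, 9]))  # a large, complex module
```

Without balancing, a classifier trained on such data tends to favor the majority (non-defective) class; the study's comprehensive evaluation searches for the sampler-classifier pairing that best counteracts this per dataset.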
Pub Date: 2024-07-03
DOI: 10.1007/s11219-024-09686-0
Stefan Karlsson, Robbert Jongeling, Adnan Čaušević, Daniel Sundmark
A common way of exposing functionality in contemporary systems is to provide a Web API based on the REST API architectural guidelines. To describe REST APIs, the current industry standard is the OpenAPI specification. Test generation and fuzzing methods targeting OpenAPI-described REST APIs have been a very active research area in recent years. An open research challenge is to aid users in better understanding their API, in addition to finding faults and covering all of the code. In this paper, we address this challenge by proposing a set of behavioural properties, common to REST APIs, which are used to generate examples of the behaviours that these APIs exhibit. These examples can be used both (i) to further the understanding of the API and (ii) as a source of automatic test cases. Our evaluation shows that our approach can generate examples deemed relevant by practitioners, both for understanding the system and as a source of test generation. In addition, we show that basing test generation on behavioural properties yields tests that are less dependent on the state of the system, while achieving code coverage similar to state-of-the-art REST API fuzzing methods within a given time limit.
Title: "Exploring behaviours of RESTful APIs in an industrial setting"
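As an illustration of what a behavioural property can look like, the following sketch checks a "created resources are retrievable" property against an in-memory stand-in for a REST service (the FakeApi class and the property name are hypothetical, not the authors' tooling):

```python
import uuid

class FakeApi:
    # Minimal in-memory stand-in for a REST service under test.
    def __init__(self):
        self.store = {}

    def post(self, path, body):
        rid = str(uuid.uuid4())
        self.store[(path, rid)] = body
        return 201, {"id": rid, **body}

    def get(self, path, rid):
        key = (path, rid)
        return (200, self.store[key]) if key in self.store else (404, None)

def prop_created_is_retrievable(api, path, body):
    # Behavioural property: a successful POST yields an id whose GET
    # returns the same representation.
    status, created = api.post(path, body)
    if status != 201:
        return True  # property holds vacuously if creation failed
    status, fetched = api.get(path, created["id"])
    return status == 200 and all(fetched.get(k) == v for k, v in body.items())

api = FakeApi()
print(prop_created_is_retrievable(api, "/invoices", {"name": "invoice-42", "amount": 10}))  # True
```

Each concrete run of such a property doubles as both an explanatory example of how the API behaves and an automatically generated test case, which is the dual use the paper evaluates.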
Pub Date: 2024-06-27
DOI: 10.1007/s11219-024-09684-2
Jiaqi Yin, Sini Chen, Yixiao Lv, Huibiao Zhu
Inter-Component Communication (ICC) plays a crucial role in facilitating information exchange and functionality integration within the complex ecosystem of Android systems. However, the security and safety implications arising from ICC interactions pose significant challenges. This paper extends our previously published research on verifying safety properties of the ICC mechanism. We address the previously observed issues of data leakage and privilege escalation by incorporating a sandbox mechanism and permission control. The sandbox mechanism provides an isolated and controlled environment in which ICC components can operate, while permission control mechanisms are introduced to enforce fine-grained access controls, ensuring that only authorized entities have access to sensitive resources. We further leverage formal methods, specifically communicating sequential processes (CSP), to verify several properties of the enhanced ICC mechanism. By employing CSP, we aim to systematically model and analyze the flow of information, the behavior of components, and the potential vulnerabilities associated with the enhanced ICC mechanism. The verification results highlight the effectiveness of our approach in enhancing the security and reliability of ICC mechanisms, ultimately contributing to the development of safer and more trustworthy Android systems.
Title: "Enhancement and formal verification of the ICC mechanism with a sandbox approach in android system"
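The access-control idea behind the enhancement can be illustrated with a toy model (our own sketch in ordinary code — the paper itself models and verifies this formally in CSP):

```python
class SandboxedIcc:
    # Toy model of ICC dispatch with permission control: a component may
    # receive an intent only if it holds the intent's required permission,
    # blocking the privilege-escalation path of unchecked delivery.
    def __init__(self):
        self.granted = {}  # component name -> set of granted permissions
        self.log = []

    def grant(self, component, permission):
        self.granted.setdefault(component, set()).add(permission)

    def send(self, sender, receiver, intent, required_permission):
        if required_permission not in self.granted.get(receiver, set()):
            self.log.append(("denied", sender, receiver, intent))
            return False
        self.log.append(("delivered", sender, receiver, intent))
        return True

icc = SandboxedIcc()
icc.grant("Contacts", "READ_CONTACTS")
assert icc.send("Mail", "Contacts", "queryContacts", "READ_CONTACTS")       # authorized
assert not icc.send("Mail", "Camera", "queryContacts", "READ_CONTACTS")     # blocked
```

In the paper's formal treatment, the analogous check is a CSP process whose traces are verified to exclude deliveries to unauthorized components.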
Auxiliary functions in software systems, often overlooked due to their perceived simplicity, play a crucial role in overall system reliability. This study focuses on the effectiveness of agile practices, specifically pair programming and test-first programming. Despite the importance of these functions, there is a dearth of empirical evidence on the impact of agile practices on their development, raising questions about their potential to enhance correctness without affecting time-to-market. This paper aims to bridge this gap by comparing the application of agile practices with traditional approaches in the context of auxiliary function development. We conducted six experiments involving 122 participants (85 novices and 37 professionals) who used both traditional and agile methods to develop six auxiliary functions across three different domains. Our analysis of 244 implementations suggests potential benefits of agile practices in auxiliary function development. Pair programming showed a tendency towards improved correctness, while test-first programming did not significantly extend the total development time, particularly among professionals. However, these findings should be interpreted cautiously, as they do not conclusively establish that agile practices outperform traditional approaches universally. As our results indicate, the potential benefits of agile practices may vary depending on factors such as the programmer's experience level and the nature of the functions being developed. Further research is needed to fully understand the contexts in which these practices can be most effectively applied and to address the potential limitations of our study.
Title: "Unraveling the code: an in-depth empirical study on the impact of development practices in auxiliary functions implementation"
Authors: Otávio Lemos, Fábio Silveira, Fabiano Ferrari, Tiago Silva, Eduardo Guerra, Alessandro Garcia
Pub Date: 2024-06-25
DOI: 10.1007/s11219-024-09682-4
Pub Date: 2024-05-14
DOI: 10.1007/s11219-024-09667-3
Monika Rani Golla, Sangharatna Godboley
One of the main objectives of testing is to achieve adequate code coverage. Modern code coverage standards suggest MC/DC (Modified Condition/Decision Coverage) instead of MCC (Multiple Condition Coverage) due to its ability to generate a feasible number of test cases. In contrast to MC/DC, which only takes independent pairs into consideration, MCC considers every combination of conditions. In our work, we propose SC-MCC, i.e., MCC with short-circuit evaluation. The key aspect of this paper is to demonstrate the effectiveness of SC-MCC-based test cases compared to MC/DC using the Coverage-Guided Fuzzing (CGF) technique. In this work, we use the American Fuzzy Lop (AFL) tool to generate both the SC-MCC and MC/DC test cases for 54 RERS benchmark programs. As part of this paper, we propose unique goal-constraint generation and fuzz-instrumentation techniques that help mitigate the masking problem of AFL. Subsequently, we performed mutation testing with the GCOV tool and computed the mutation score in order to evaluate the quality of the generated test cases. Finally, based on our observations, SC-MCC performed better for over 85% of the programs considered.
Title: "Automated SC-MCC test case generation using coverage-guided fuzzing"
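The effect of short-circuit evaluation on MCC can be demonstrated directly: enumerate all assignments, but keep only one test case per distinct short-circuit evaluation trace, so assignments differing only in conditions that were never evaluated collapse together (a simplified, hypothetical sketch of the SC-MCC idea, unrelated to the paper's AFL-based generation):

```python
from itertools import product

def sc_mcc_cases(n_conditions, expr):
    # Enumerate all 2^n assignments, evaluate expr with short-circuit
    # semantics, and keep one test case per distinct evaluation trace.
    cases = {}
    for bits in product([False, True], repeat=n_conditions):
        evaluated = []

        def cond(i):
            evaluated.append(i)   # record which conditions were actually read
            return bits[i]

        outcome = expr(cond)
        trace = tuple((i, bits[i]) for i in evaluated)
        cases.setdefault(trace, (bits, outcome))
    return list(cases.values())

# (A and B) or C — full MCC would need all 8 assignments.
expr = lambda c: (c(0) and c(1)) or c(2)
cases = sc_mcc_cases(3, expr)
print(len(cases))  # → 5: short-circuiting collapses 8 assignments into 5 cases
```

For example, when A is false, B is never evaluated under Python's short-circuit `and`, so the two assignments differing only in B produce the same trace and count as one SC-MCC test case.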
Pub Date: 2024-05-14
DOI: 10.1007/s11219-024-09669-1
Umut Şener, Ebru Gökalp, P. Erhan Eren
Organizations are increasingly migrating from on-premise enterprise information systems (EIS) to cloud products due to cloud computing benefits, such as flexibility, elasticity, and on-demand service. However, identifying the most suitable option becomes challenging with the proliferation of Cloud-EIS solutions in the market. To address this challenge, this study introduces a novel quality model named Cloud-QM, based on ISO/IEC 250nn standards. It diagnoses the quality of Cloud-EIS products, benchmarks available options, and identifies the most suitable choice for the organization. Cloud-QM comprises 10 main dimensions, 33 sub-dimensions, and corresponding metrics for a systematic quality assessment. Furthermore, the practical use of Cloud-QM is illustrated through a case study that evaluates two substitute Cloud-EIS products. The results from the case study highlight the effectiveness of Cloud-QM in enabling decision-makers to delve into the quality dimensions and facilitate the selection of the most suitable product for their organizations. The main contributions are as follows: (1) proposing a comprehensive and hierarchically structured quality model for Cloud-EIS products; (2) offering a quantifiable and standardized assessment approach through a set of metrics for quality evaluation; and (3) demonstrating applicability and usability of Cloud-QM by benchmarking Cloud-EIS products.
Title: "CLOUD-QM: a quality model for benchmarking cloud-based enterprise information systems"
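A minimal sketch of the benchmarking step — aggregating per-dimension quality scores into a single weighted value (the three dimension names, weights, and scores below are invented for illustration; Cloud-QM itself defines 10 dimensions and 33 sub-dimensions with their own metrics):

```python
def weighted_score(scores, weights):
    # Aggregate per-dimension scores (here on a 0-10 scale) into one value.
    assert abs(sum(weights.values()) - 1.0) < 1e-9, "weights must sum to 1"
    return sum(scores[d] * w for d, w in weights.items())

# Hypothetical dimensions and scores for two candidate Cloud-EIS products.
weights   = {"functional suitability": 0.4, "security": 0.35, "portability": 0.25}
product_a = {"functional suitability": 8, "security": 6, "portability": 9}
product_b = {"functional suitability": 7, "security": 9, "portability": 6}

products = {"A": product_a, "B": product_b}
ranked = sorted(products, key=lambda p: weighted_score(products[p], weights), reverse=True)
print(ranked[0])  # the product with the higher weighted score
```

In practice the weights would come from the organization's priorities, which is what lets the same model yield different "most suitable" products for different decision-makers.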
Pub Date : 2024-05-10
DOI: 10.1007/s11219-024-09670-8
Xinjie Wei, Chang-ai Sun, Xiao-Yi Zhang
Large-scale distributed systems are becoming key engines of the IT industry due to their scalability and extensibility. A distributed system often involves numerous complex interactions among components and can suffer from anomalies such as data inconsistencies between components and unanticipated delays in response times. Existing anomaly detection techniques, which extract knowledge from system logs using either statistical or machine learning techniques, exhibit limitations. Statistical techniques often miss implicit anomalies related to the complex interactions manifested by logs, whereas machine learning techniques lack explainability and are usually sensitive to log variations. In this paper, we propose KAD, a knowledge formalization-based anomaly detection approach for distributed systems. KAD includes a general knowledge description language (KDL), which leverages the general structure of system logs and the extended Backus-Naur form (EBNF) for complex knowledge extraction. In particular, the semantic set is constructed based on the Bidirectional Encoder Representations from Transformers (BERT) model to improve the expressive capabilities of KDL in knowledge description. In addition, KAD incorporates a distributed scheduling computation module to improve the efficiency of the anomaly detection process. Experimental results based on two widely used benchmarks show that KAD can accurately describe the knowledge associated with anomalies, achieving a high F1-score in detecting various anomaly types.
{"title":"KAD: a knowledge formalization-based anomaly detection approach for distributed systems","authors":"Xinjie Wei, Chang-ai Sun, Xiao-Yi Zhang","doi":"10.1007/s11219-024-09670-8","DOIUrl":"https://doi.org/10.1007/s11219-024-09670-8","url":null,"abstract":"<p>Large-scale distributed systems are becoming key engines of the IT industry due to their scalability and extensibility. A distributed system often involves numerous complex interactions among components and can suffer from anomalies such as data inconsistencies between components and unanticipated delays in response times. Existing anomaly detection techniques, which extract knowledge from system logs using either statistical or machine learning techniques, exhibit limitations. Statistical techniques often miss implicit anomalies related to the complex interactions manifested by logs, whereas machine learning techniques lack explainability and are usually sensitive to log variations. In this paper, we propose KAD, a knowledge formalization-based anomaly detection approach for distributed systems. KAD includes a general knowledge description language (KDL), which leverages the general structure of system logs and the extended Backus-Naur form (EBNF) for complex knowledge extraction. In particular, the semantic set is constructed based on the Bidirectional Encoder Representations from Transformers (BERT) model to improve the expressive capabilities of KDL in knowledge description. In addition, KAD incorporates a distributed scheduling computation module to improve the efficiency of the anomaly detection process. 
Experimental results based on two widely used benchmarks show that KAD can accurately describe the knowledge associated with anomalies, with a high F1-score in detecting various anomaly types.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"40 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140939196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
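The core idea in the KAD abstract above — formalizing knowledge about anomalies and matching it against structured log lines — can be sketched loosely. The regex "rules" below are hypothetical stand-ins for KDL's EBNF-based grammar and BERT-derived semantic sets, which are not reproduced here; only the match-rules-against-logs pattern carries over:

```python
import re

# Hypothetical knowledge rules for two anomaly types named in the abstract:
# data inconsistencies between components and unanticipated response delays.
RULES = {
    # Replica state differs from primary state (backreference \2 in a
    # negative lookahead rejects the consistent case).
    "data_inconsistency": re.compile(
        r"replica (\w+) state=(\w+); primary state=(?!\2)\w+"
    ),
    # Response latency reported in milliseconds.
    "slow_response": re.compile(r"latency=(\d+)ms"),
}

def detect(log_line: str, latency_threshold_ms: int = 500) -> list:
    """Return the anomaly types whose formalized rules match this log line."""
    anomalies = []
    if RULES["data_inconsistency"].search(log_line):
        anomalies.append("data_inconsistency")
    m = RULES["slow_response"].search(log_line)
    if m and int(m.group(1)) > latency_threshold_ms:
        anomalies.append("slow_response")
    return anomalies
```

KAD's actual approach is richer (semantic sets make rules robust to log wording variations, and a distributed scheduler parallelizes the matching), but each rule plays the role sketched here: an explainable, formalized piece of knowledge checked against the log stream.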