Exploring API behaviours through generated examples
Pub Date: 2024-04-22 | DOI: 10.1007/s11219-024-09668-2
Stefan Karlsson, John Hughes, Robbert Jongeling, Adnan Čaušević, Daniel Sundmark
Understanding the behaviour of a system’s API can be hard. Giving users access to relevant examples of how an API behaves has been shown to make this easier for them. In addition, such examples can be used to verify expected behaviour or identify unwanted behaviours. Methods for automatically generating examples have existed for a long time. However, state-of-the-art methods rely on either white-box information, such as source code, or on formal specifications of the system behaviour. But what if you do not have access to either? This may be the case, for example, when interacting with a third-party API. In this paper, we present an approach to automatically generate relevant examples of behaviours of an API, without requiring either source code or a formal specification of behaviour. Evaluation on an industry-grade REST API shows that our method can produce small and relevant examples that can help engineers to understand the system under exploration.
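Since the approach is black-box, a rough way to picture it is to fire generated requests at a running API, record how it responds, and keep only small, distinct examples. The sketch below is only an illustration of that idea, not the authors' method: the endpoint `http://localhost:8080/items/{id}`, the random integer inputs, and the "smallest input per status code" reduction are all assumptions made for the example.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.HashMap;
import java.util.Map;
import java.util.Random;

// Hypothetical sketch: probe a REST endpoint with random inputs and keep,
// for each distinct status code, the smallest input that produced it.
// Endpoint, parameter, and reduction rule are illustrative assumptions,
// not the method described in the paper.
public class ApiExampleProbe {
    public static void main(String[] args) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        Random random = new Random(42);
        Map<Integer, Integer> smallestInputPerStatus = new HashMap<>();

        for (int i = 0; i < 200; i++) {
            int id = random.nextInt(1_000_000);              // generated input
            HttpRequest request = HttpRequest.newBuilder(
                    URI.create("http://localhost:8080/items/" + id)).GET().build();
            HttpResponse<String> response =
                    client.send(request, HttpResponse.BodyHandlers.ofString());

            // Keep only the smallest input observed for each behaviour (status code),
            // a crude stand-in for shrinking examples to something humans can read.
            smallestInputPerStatus.merge(response.statusCode(), id, Math::min);
        }
        smallestInputPerStatus.forEach((status, id) ->
                System.out.println("status " + status + " <- /items/" + id));
    }
}
```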
{"title":"Exploring API behaviours through generated examples","authors":"Stefan Karlsson, John Hughes, Robbert Jongeling, Adnan Čaušević, Daniel Sundmark","doi":"10.1007/s11219-024-09668-2","DOIUrl":"https://doi.org/10.1007/s11219-024-09668-2","url":null,"abstract":"<p>Understanding the behaviour of a system’s API can be hard. Giving users access to <i>relevant</i> examples of how an API behaves has been shown to make this easier for them. In addition, such examples can be used to verify expected behaviour or identify unwanted behaviours. Methods for automatically generating examples have existed for a long time. However, state-of-the-art methods rely on either white-box information, such as source code, or on formal specifications of the system behaviour. But what if you do not have access to either? This may be the case, for example, when interacting with a third-party API. In this paper, we present an approach to automatically generate relevant examples of behaviours of an API, without requiring either source code or a formal specification of behaviour. Evaluation on an industry-grade REST API shows that our method can produce small and relevant examples that can help engineers to understand the system under exploration.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"30 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140798909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Enhancing big data analytics deployment: uncovering stakeholder dynamics and balancing salience in project roles
Pub Date: 2024-03-13 | DOI: 10.1007/s11219-024-09665-5
Maria Hoffmann Jensen, Maja Due Kadenic
Deployment constitutes a pivotal aspect of data science projects, such as big data analytics (BDA). A comprehensive definition of successful deployment necessitates the integration of perspectives from both the project stakeholders and the end-users. However, adequate consideration of project stakeholders remains notably absent within the broader view of project deployment. This paper investigates the role of stakeholders in the deployment of BDA projects by applying an ethnographic research design over a 12-month period within a large multinational organization. The study employs critical systems heuristics concepts to identify stakeholder roles, which are subsequently classified and analyzed according to the salience model. The empirical findings point towards the missing link between the technical and the business aspects of a BDA project. The organizational function of product management, capable of comprehending both the technical and business dimensions, must undertake a highly salient stakeholder role to effectively guide the project toward successful deployment. Additionally, this role will be able to identify the exact beneficiaries, enabling them to increase their salience and to make their interests resonate across the spectrum of project stakeholders. This study advances the knowledge and understanding of BDA deployment through the lens of a stakeholder perspective and systems thinking. It uncovers the necessary resources by mapping the social roles of a project and assessing their salience. Balancing role-based salience contributes to successful BDA project deployment.
{"title":"Enhancing big data analytics deployment: uncovering stakeholder dynamics and balancing salience in project roles","authors":"Maria Hoffmann Jensen, Maja Due Kadenic","doi":"10.1007/s11219-024-09665-5","DOIUrl":"https://doi.org/10.1007/s11219-024-09665-5","url":null,"abstract":"<p>Deployment constitutes a pivotal aspect of data science projects, such as big data analytics (BDA). A comprehensive definition of successful deployment necessitates the integration of perspectives from both the project stakeholders and the end-users. However, adequate consideration of project stakeholders remains notably absent within the broader view of project deployment. This paper investigates the role of stakeholders in the deployment of BDA projects by applying an ethnographic research design throughout a 12-month period within a large multinational organization. The study employs critical systems heuristics concepts to identify stakeholder roles, which are subsequently classified and analyzed according to the salience model. The empirical findings point towards the missing link between the technical and the business aspects of a BDA project. The organizational function and product management, (capable of comprehending both the technical and business dimensions) must undertake a highly salient stakeholder role to effectively guide the project toward the successful deployment. Additionally, this role will be able to identify the exact beneficiaries, thus enabling them to increase their salience and their interests to resonate across the spectrum of project stakeholders. This study advances the knowledge and understanding of BDA deployment through the lens of a stakeholder perspective and systems thinking. It uncovers the necessary resources by mapping the social roles of a project and assessing their salience. Balancing role-based salience contributes to successful BDA project deployment.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"91 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140126377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tackling visual and conceptual complexity of problem-oriented modeling of requirements
Pub Date: 2024-03-11 | DOI: 10.1007/s11219-024-09662-8
Waqas Junaid, Zhi Li
In the contemporary age of cyber-physical systems (CPS), where software governs the control, coordination, and management of physical objects to tackle real-world problems, engineering requirements for such systems present considerable challenges. Consequently, it is not surprising that the requirements engineering (RE) community, encompassing both academia and industry, has turned to established RE methodologies and applied them within the context of CPS; notably, Jackson’s Problem Frames (PF) approach deploys problem diagrams for modeling CPS. However, problem diagrams of realistic CPS often present visual and conceptual complexities, which must be addressed before PF can be usefully applied in practice. In this paper, the above problem is addressed in two steps. Firstly, preliminary findings are presented, derived from the application of eye-tracking software to assess a technique designed to mitigate the visual complexity inherent in the CARE (Computer-Aided Requirements Engineering) tool developed for PF: an auto-layout technique is developed for detecting and resolving overlaps, aimed at enhancing the tool’s usability from the perspectives of cognition, psychology, and user studies. Secondly, the technique of a complexity matrix is applied to calculate and evaluate the conceptual complexity inherent in problem diagrams. Our eye-tracking results show the successful implementation of overlap detection and resolution. The case studies in the second step also demonstrate the applicability and effectiveness of the complexity matrix calculations.
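To make the first step concrete, the following is a minimal sketch, under assumptions not taken from the paper, of what detecting and resolving overlaps between diagram nodes can look like: nodes are treated as axis-aligned rectangles and an overlapping node is pushed to the right until the overlap disappears. The CARE tool's actual auto-layout technique is not described in the abstract and may work quite differently.

```java
import java.awt.Rectangle;
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch of overlap detection and resolution for diagram nodes,
// not the algorithm used in the CARE tool: if two axis-aligned boxes overlap,
// push the second one to the right until the overlap disappears.
public class OverlapResolver {

    static boolean resolveOnce(List<Rectangle> nodes) {
        for (int i = 0; i < nodes.size(); i++) {
            for (int j = i + 1; j < nodes.size(); j++) {
                Rectangle a = nodes.get(i), b = nodes.get(j);
                if (a.intersects(b)) {
                    int dx = a.x + a.width - b.x;   // horizontal overlap amount
                    b.translate(dx + 5, 0);         // shift right plus a small margin
                    return true;                    // one overlap resolved this pass
                }
            }
        }
        return false;                               // layout is overlap-free
    }

    public static void main(String[] args) {
        List<Rectangle> nodes = new ArrayList<>(List.of(
                new Rectangle(0, 0, 100, 40),
                new Rectangle(60, 10, 100, 40),
                new Rectangle(80, 20, 100, 40)));
        // Repeat until no overlaps remain (capped to keep the sketch safe).
        for (int pass = 0; pass < 1000 && resolveOnce(nodes); pass++) { }
        nodes.forEach(r -> System.out.println(r.x + "," + r.y));
    }
}
```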
{"title":"Tackling visual and conceptual complexity of problem-oriented modeling of requirements","authors":"Waqas Junaid, Zhi Li","doi":"10.1007/s11219-024-09662-8","DOIUrl":"https://doi.org/10.1007/s11219-024-09662-8","url":null,"abstract":"<p>In the contemporary age of cyber-physical systems (CPS), where software governs the control, coordination, and management of physical objects to tackle real-world problems, engineering requirements for such systems present considerable challenges. Consequently, it is not surprising that the requirements engineering (RE) community, encompassing both academia and industry, has turned to established RE methodologies and applied them within the context of CPS; notably, Jackson’s Problem Frames (PF) approach deploys problem diagrams for modeling CPS. However, problem diagrams of realistic CPS often present visual and conceptual complexities, which must be addressed before PF can be usefully applied in practice. In this paper, the above problem is addressed in two steps: firstly, preliminary findings are presented by deriving from the application of eye-tracking software in the assessment of a technique designed to mitigate the visual complexity inherent in the CARE (Computer-Aided Requirements Engineering) tool developed for PF. An auto-layout technique is developed for detecting and resolving overlaps, aimed at enhancing its usability from the perspectives of cognition, psychology, and user studies; secondly, the technique of a complexity matrix is applied for calculating and evaluating the conceptual complexity inherent in problem diagrams. Our eye-tracking results show the successful implementation of overlap detection and resolution. The case studies in the second step also demonstrate the applicability and effectiveness of the complexity matrix calculations.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"98 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140097611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A comprehensive catalog of refactoring strategies to handle test smells in Java-based systems
Pub Date: 2024-03-08 | DOI: 10.1007/s11219-024-09663-7
Luana Martins, Taher A. Ghaleb, Heitor Costa, Ivan Machado
Test smells, detrimental coding practices that hinder high-quality test code development, pose a significant challenge in software testing and maintenance. Software refactoring, traditionally a powerful approach for addressing code smells and improving software quality without changing functionality, often focuses on production code, leaving test code overlooked. Despite extensive research in test smell refactoring, understanding the efficacy of existing refactoring operations on test code quality remains limited. Investigating real-world developer refactoring practices is crucial to bridge this knowledge gap. In this study, we investigate refactorings performed by developers to address test smells, resulting in a comprehensive catalog of test smells and their corresponding test-specific refactorings. Two test-specific refactorings closely tied to JUnit5 and seven version-agnostic refactorings for various JUnit versions have been identified. While many of these test-specific refactorings are documented in the literature, this analysis unveils new test-specific refactorings aimed at dealing with the “Inappropriate Assertion” test smell. This research provides insights into the challenges faced by developers and prevailing practices for effectively refactoring test code, thereby enhancing software testing and maintenance.
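As an illustration of the kind of smell-to-refactoring mapping such a catalog contains, the JUnit 5 snippet below shows one hypothetical reading of an "Inappropriate Assertion": a general-purpose assertion is used where a dedicated one would both express the intent and produce a useful failure message. The class under test and the concrete refactoring are assumptions for the example; the paper's catalog defines the actual smells and refactorings.

```java
import static org.junit.jupiter.api.Assertions.assertEquals;
import static org.junit.jupiter.api.Assertions.assertTrue;

import org.junit.jupiter.api.Test;

// Hypothetical illustration only; the paper's catalog defines the actual smell
// and refactorings. Here, a weak assertion hides the expected/actual values on
// failure, and the refactored test uses the assertion that fits the intent.
class PriceCalculatorTest {

    @Test
    void totalPrice_smelly() {
        int totalCents = 2 * 999;
        // Smell: assertTrue on an equality check reports only "expected: <true>",
        // giving no diagnostic information about the actual value on failure.
        assertTrue(totalCents == 1998);
    }

    @Test
    void totalPrice_refactored() {
        int totalCents = 2 * 999;
        // Refactoring: the dedicated equality assertion reports expected vs. actual.
        assertEquals(1998, totalCents);
    }
}
```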
{"title":"A comprehensive catalog of refactoring strategies to handle test smells in Java-based systems","authors":"Luana Martins, Taher A. Ghaleb, Heitor Costa, Ivan Machado","doi":"10.1007/s11219-024-09663-7","DOIUrl":"https://doi.org/10.1007/s11219-024-09663-7","url":null,"abstract":"<p>Test smells, detrimental coding practices that hinder high-quality test code development, pose a significant challenge in software testing and maintenance. Software refactoring, traditionally a powerful approach for addressing code smells and improving software quality without changing functionality, often focuses on production code, leaving test code overlooked. Despite extensive research in test smell refactoring, understanding the efficacy of existing refactoring operations on test code quality remains limited. Investigating real-world developer refactoring practices is crucial to bridge this knowledge gap. In this study, we investigate refactorings performed by developers to address test smells, resulting in a comprehensive catalog of test smells and their corresponding test-specific refactorings. Two test-specific refactorings closely tied to <span>JUnit5</span> and seven version-agnostic refactorings for various <span>JUnit</span> versions have been identified. While many of these test-specific refactorings are documented in the literature, this analysis unveils new test-specific refactorings aimed at dealing with the “Inappropriate Assertion” test smell. This research provides insights into the challenges faced by developers and prevailing practices for effectively refactoring test code, thereby enhancing software testing and maintenance.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"20 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140076241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Exploring the role of big data analytics and dynamic capabilities in ESG programs within pharmaceuticals
Pub Date: 2024-03-06 | DOI: 10.1007/s11219-024-09666-4
Achieving the United Nations’ Sustainable Development Goals (SDGs) requires environmental, social, and governance (ESG) programs in the pharmaceutical industry. Using the Millennium Development Goals, the 2030 agenda aims to transform European Union companies toward sustainability. In pharmaceuticals, in particular, ESG programs come with complexities such as employee skills, corporate goals, and management expectations. Managing these programs effectively requires advanced technologies such as big data analytics (BDA) and dynamic capabilities (DC). In this study, DC theory is used to develop an architecture for managing ESG criteria, focusing on provenance, traceability, and availability. BDA’s role in ESG programs is explored, along with its use cases and benefits, and how DC drives success in ESG implementation. The study examined five pharmaceutical companies in Germany, Portugal, and Switzerland, all consulting the same firm for BDA systems, to identify the characteristics of effective BDA implementation. The research explores how BDA and DC jointly enhance ESG efforts, the essential skills needed, and how DC aids in real-time decision-making in BDA projects aligned with ESG standards. It highlights the BDA system’s accuracy and effectiveness in managing ESG programs, with DC as a pivotal facilitator. Findings reveal BDA’s value in operational efficiency and aligning business models with ESG goals, underscoring the need for diverse skills in BDA implementation and DC’s importance in integrating various managerial capacities into effective strategies. The study promotes a dynamic, data-driven approach in the pharmaceutical industry for managing complex ESG initiatives. It stresses continuous learning, adaptation, and integrating technological advances with ethical business practices. The research concludes by emphasizing BDA and DC’s vital roles in advocating ethical, socially responsible, and environmentally sustainable practices in the pharmaceutical sector, marrying technology with ethical business strategies.
{"title":"Exploring the role of big data analytics and dynamic capabilities in ESG programs within pharmaceuticals","authors":"","doi":"10.1007/s11219-024-09666-4","DOIUrl":"https://doi.org/10.1007/s11219-024-09666-4","url":null,"abstract":"<h3>Abstract</h3> <p>Achieving the United Nations’ Sustainable Development Goals (SDGs) requires environmental, social, and governance (ESG) programs in the pharmaceutical industry. Using the Millennium Development Goals, the 2030 agenda aims to transform European Union companies toward sustainability. In pharmaceuticals, in particular, ESG programs come with complexities such as employee skills, corporate goals, and management expectations. Managing these programs effectively requires advanced technologies such as big data analytics (BDA) and dynamic capabilities (DC). In this study, DC theory is used to develop an architecture for managing ESG criteria, focusing on provenance, traceability, and availability. BDA’s role in ESG programs is explored, along with its use cases and benefits, and how DC drives success in ESG implementation. The study examined five pharmaceutical companies in Germany, Portugal, and Switzerland, all consulting the same firm for BDA systems, to identify the characteristics of effective BDA implementation. The research explores how BDA and DC jointly enhance ESG efforts, the essential skills needed, and how DC aids in real-time decision-making in BDA projects aligned with ESG standards. It highlights the BDA system’s accuracy and effectiveness in managing ESG programs, with DC as a pivotal facilitator. Findings reveal BDA’s value in operational efficiency and aligning business models with ESG goals, underscoring the need for diverse skills in BDA implementation and DC’s importance in integrating various managerial capacities into effective strategies. The study promotes a dynamic, data-driven approach in the pharmaceutical industry for managing complex ESG initiatives. It stresses continuous learning, adaptation, and integrating technological advances with ethical business practices. The research concludes by emphasizing BDA and DC’s vital roles in advocating ethical, socially responsible, and environmentally sustainable practices in the pharmaceutical sector, marrying technology with ethical business strategies.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"192 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140045191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A family of experiments about how developers perceive delayed system response time
Pub Date: 2024-03-04 | DOI: 10.1007/s11219-024-09660-w
Collecting and analyzing data about developers working on their development tasks can help improve development practices, ultimately increasing the productivity of teams. Indeed, monitoring and analysis tools have already been used to collect data from productivity tools. Monitoring inevitably consumes resources and, depending on its extensiveness, may significantly slow down software systems, interfering with developers’ activity. There is thus a challenging trade-off between monitoring and validating applications in their operational environment and preventing the degradation of the user experience. The lack of studies about when developers perceive an overhead introduced in an application makes it extremely difficult to fine-tune techniques working in the field. In this paper, we address this challenge by presenting an empirical study that quantifies how developers perceive overhead. The study consists of three replications of an experiment that involved 99 computer science students in total, followed by a small-scale experimental assessment of the key findings with 12 professional developers. Results show that non-negligible overhead can be introduced into applications for a short period without developers perceiving it, and that the sequence in which complex operations are executed influences the perception of the system response time. This information can be exploited to design better monitoring techniques.
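The following sketch shows, under assumptions not taken from the paper, the basic mechanism such experiments rely on: an operation is timed once as-is and once with an artificial delay injected, so the response time developers perceive can be compared between the two runs. The workload, the 100 ms delay, and the timing harness are illustrative only.

```java
import java.util.concurrent.TimeUnit;
import java.util.function.Supplier;

// Minimal sketch (not the experimental setup from the paper): run an operation
// with and without an injected delay and report the response times a developer
// would actually experience.
public class OverheadProbe {

    static <T> long timedRunMillis(Supplier<T> operation, long injectedDelayMillis) {
        long start = System.nanoTime();
        if (injectedDelayMillis > 0) {
            try {
                TimeUnit.MILLISECONDS.sleep(injectedDelayMillis);  // simulated monitoring overhead
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        operation.get();
        return (System.nanoTime() - start) / 1_000_000;            // elapsed milliseconds
    }

    public static void main(String[] args) {
        Supplier<Integer> operation = () -> {
            int sum = 0;
            for (int i = 0; i < 5_000_000; i++) sum += i;           // stand-in for a real task
            return sum;
        };
        System.out.println("baseline ms:      " + timedRunMillis(operation, 0));
        System.out.println("with overhead ms: " + timedRunMillis(operation, 100));
    }
}
```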
{"title":"A family of experiments about how developers perceive delayed system response time","authors":"","doi":"10.1007/s11219-024-09660-w","DOIUrl":"https://doi.org/10.1007/s11219-024-09660-w","url":null,"abstract":"<h3>Abstract</h3> <p>Collecting and analyzing data about developers working on their development tasks can help improve development practices, finally increasing the productivity of teams. Indeed, monitoring and analysis tools have already been used to collect data from productivity tools. Monitoring inevitably consumes resources and, depending on their extensiveness, may significantly slow down software systems, interfering with developers’ activity. There is thus a challenging trade-off between monitoring and validating applications in their operational environment and preventing the degradation of the user experience. The lack of studies about <em>when</em> developers perceive an overhead introduced in an application makes it extremely difficult to fine-tune techniques working in the field. In this paper, we address this challenge by presenting an empirical study that quantifies how developers perceive overhead. The study consists of three replications of an experiment that involved 99 computer science students in total, followed by a small-scale experimental assessment of the key findings with 12 professional developers. Results show that non-negligible overhead can be introduced for a short period into applications without developers perceiving it and that the sequence in which complex operations are executed influences the perception of the system response time. This information can be exploited to design better monitoring techniques.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"5 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140034661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Optimal test case generation for boundary value analysis
Pub Date: 2024-02-13 | DOI: 10.1007/s11219-023-09659-9
Xiujing Guo, Hiroyuki Okamura, Tadashi Dohi
Boundary value analysis (BVA) is a common technique in software testing that uses input values lying at the boundaries where significant changes in behavior are expected. This approach is widely recognized and used as a natural and effective strategy for testing software. Test coverage is one of the criteria used to measure the extent to which the software’s execution paths are covered by a set of test cases. This paper focuses on evaluating test coverage with respect to BVA by defining a metric called boundary coverage distance (BCD). The BCD metric measures the extent to which a test set covers the boundaries. In addition, based on BCD, we consider optimal test input generation to minimize BCD under a random testing scheme. We propose three algorithms, each representing a different test input generation strategy, and evaluate their fault detection capabilities through experimental validation. The results indicate that the BCD-based approach has the potential to generate boundary values and improve the effectiveness of software testing.
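One plausible way to formalise this, shown in the sketch below, is to read BCD as the average distance from each boundary point to its nearest test input, and to grow the test set greedily with the random candidate that reduces that distance most. The boundary points, input range, and greedy loop are assumptions made for illustration; the paper's definition of BCD and its three algorithms may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Sketch under an assumed definition: BCD is the average, over all boundary
// points, of the distance to the nearest test input. The greedy loop keeps the
// random candidate that reduces BCD the most. This illustrates the idea only;
// the paper's metric and algorithms may be defined differently.
public class BoundaryCoverageDistance {

    static double bcd(List<Integer> tests, int[] boundaries) {
        double total = 0;
        for (int b : boundaries) {
            double nearest = Double.MAX_VALUE;
            for (int t : tests) nearest = Math.min(nearest, Math.abs(t - b));
            total += nearest;
        }
        return total / boundaries.length;
    }

    public static void main(String[] args) {
        int[] boundaries = {0, 17, 100};          // e.g. valid range [0, 100] with a branch at 17
        List<Integer> tests = new ArrayList<>();
        Random random = new Random(1);

        for (int round = 0; round < 5; round++) {
            int best = -1;
            double bestBcd = Double.MAX_VALUE;
            for (int k = 0; k < 50; k++) {        // sample random candidates
                int candidate = random.nextInt(121) - 10;   // inputs in [-10, 110]
                tests.add(candidate);
                double d = bcd(tests, boundaries);
                tests.remove(tests.size() - 1);
                if (d < bestBcd) { bestBcd = d; best = candidate; }
            }
            tests.add(best);                      // keep the candidate that lowers BCD the most
            System.out.println("added " + best + ", BCD = " + bestBcd);
        }
    }
}
```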
{"title":"Optimal test case generation for boundary value analysis","authors":"Xiujing Guo, Hiroyuki Okamura, Tadashi Dohi","doi":"10.1007/s11219-023-09659-9","DOIUrl":"https://doi.org/10.1007/s11219-023-09659-9","url":null,"abstract":"<p>Boundary value analysis (BVA) is a common technique in software testing that uses input values that lie at the boundaries where significant changes in behavior are expected. This approach is widely recognized and used as a natural and effective strategy for testing software. Test coverage is one of the criteria to measure how much the software execution paths are covered by the set of test cases. This paper focuses on evaluating test coverage with respect to BVA by defining a metric called boundary coverage distance (BCD). The BCD metric measures the extent to which a test set covers the boundaries. In addition, based on BCD, we consider the optimal test input generation to minimize BCD under the random testing scheme. We propose three algorithms, each representing a different test input generation strategy, and evaluate their fault detection capabilities through experimental validation. The results indicate that the BCD-based approach has the potential to generate boundary values and improve the effectiveness of software testing.\u0000</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"217 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139770058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Quality evaluation meta-model for open-source software: multi-method validation study
Pub Date: 2024-02-06 | DOI: 10.1007/s11219-023-09658-w
Nebi Yılmaz, Ayça Kolukısa Tarhan
In recent years, open-source software (OSS) has attracted increasing attention due to its easy accessibility via cloud repositories, voluntary community, no vendor lock-in, and low total cost of ownership. In turn, specifying and evaluating OSS quality has become a significant challenge for OSS adoption in organizations that are inclined to use them. Although many OSS quality models have been proposed in the literature, the dynamic and diverse nature of OSS has caused these models to be heterogeneous in terms of structure and content. This has adversely affected the standardization of evaluations and led to the evaluation results obtained from different OSS quality models for the same purpose being incomparable and sometimes unreliable. Therefore, in this study, a meta-model for OSS quality (OSS-QMM), which employs a unified structure from existing quality models and enables the derivation of homogeneous models, has been proposed. For this purpose, a systematic and laborious effort has been spent via a step-based meta-model creation process including review-and-revise iterations. In order to validate the OSS-QMM, case study and expert opinion methods have been applied to answer three research questions (RQs) targeted to investigate practical applicability, results comparability, and effectiveness of using the meta-model. Multiple and embedded case study designs have been employed for evaluating three real ERP systems, and 20 subject matter experts have been interviewed during the validation process. The results of multi-faceted empirical studies have indicated that the OSS-QMM has addressed solving problems in OSS quality evaluation and its adoption with high degrees of confidence.
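As an illustration of what deriving homogeneous models from one meta-model can mean in practice, the sketch below fixes a single structure (characteristics aggregating weighted metrics) so that every derived model is evaluated the same way. The element names, weights, and scoring rule are hypothetical and are not the OSS-QMM's actual vocabulary.

```java
import java.util.List;
import java.util.Map;

// Illustrative-only sketch of a quality meta-model: every derived model is an
// instance of the same structure (characteristics that aggregate measurable
// metrics), so evaluations of different OSS products stay comparable.
// Element names are hypothetical, not the OSS-QMM's actual vocabulary.
public class QualityMetaModel {

    record Metric(String name, double weight) {}
    record Characteristic(String name, List<Metric> metrics) {}
    record QualityModel(String name, List<Characteristic> characteristics) {}

    static double evaluate(QualityModel model, Map<String, Double> measurements) {
        double score = 0, weights = 0;
        for (Characteristic c : model.characteristics()) {
            for (Metric m : c.metrics()) {
                score += m.weight() * measurements.getOrDefault(m.name(), 0.0);
                weights += m.weight();
            }
        }
        return weights == 0 ? 0 : score / weights;   // weighted average in [0, 1]
    }

    public static void main(String[] args) {
        QualityModel erpModel = new QualityModel("ERP evaluation", List.of(
                new Characteristic("Maintainability", List.of(
                        new Metric("documentationCoverage", 0.4),
                        new Metric("issueResolutionRate", 0.6))),
                new Characteristic("Community", List.of(
                        new Metric("activeContributors", 1.0)))));

        Map<String, Double> measured = Map.of(
                "documentationCoverage", 0.7,
                "issueResolutionRate", 0.8,
                "activeContributors", 0.5);
        System.out.println("score = " + evaluate(erpModel, measured));
    }
}
```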
{"title":"Quality evaluation meta-model for open-source software: multi-method validation study","authors":"Nebi Yılmaz, Ayça Kolukısa Tarhan","doi":"10.1007/s11219-023-09658-w","DOIUrl":"https://doi.org/10.1007/s11219-023-09658-w","url":null,"abstract":"<p>In recent years, open-source software (OSS) has attracted increasing attention due to its easy accessibility via cloud repositories, voluntary community, no vendor lock-in, and low total cost of ownership. In turn, specifying and evaluating OSS quality has become a significant challenge for OSS adoption in organizations that are inclined to use them. Although many OSS quality models have been proposed in the literature, the dynamic and diverse nature of OSS has caused these models to be heterogeneous in terms of structure and content. This has adversely affected the standardization of evaluations and led to the evaluation results obtained from different OSS quality models for the same purpose being incomparable and sometimes unreliable. Therefore, in this study, a meta-model for OSS quality (OSS-QMM), which employs a unified structure from existing quality models and enables the derivation of homogeneous models, has been proposed. For this purpose, a systematic and laborious effort has been spent via a step-based meta-model creation process including review-and-revise iterations. In order to validate the OSS-QMM, case study and expert opinion methods have been applied to answer three research questions (RQs) targeted to investigate practical applicability, results comparability, and effectiveness of using the meta-model. Multiple and embedded case study designs have been employed for evaluating three real ERP systems, and 20 subject matter experts have been interviewed during the validation process. The results of multi-faceted empirical studies have indicated that the OSS-QMM has addressed solving problems in OSS quality evaluation and its adoption with high degrees of confidence.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"29 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139770061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Towards improving agility in public administration
Pub Date: 2024-01-20 | DOI: 10.1007/s11219-023-09657-x
Hanna Looks, Jannik Fangmann, Jörg Thomaschewski, María-José Escalona, Eva-Maria Schön
In times of crisis, such as the refugee crisis or the COVID-19 pandemic, the workload in public administrations increases because of the demands of citizens or short-term legal changes. In addition, there is an increasing need for digitalization and for the ability to react flexibly to changes. Agile process models and agile practices are appropriate for overcoming these challenges. The objective of this paper is to investigate how public administrations can measure their degree of agility in order to identify potential for improving it. The authors conducted a descriptive single-case study which included multiple units of analysis in a public administration in Germany. The case study was supported by their questionnaire for measuring the degree of agility. One outcome of this study is a conceptual framework that can be used to drive agile transformation in public administrations by continuously measuring agility. To this end, a questionnaire for measuring agility at team level in public administrations has been developed. The application of the questionnaire in three teams provides insights into dysfunctionality in the interdisciplinary teams as well as optimization potential in terms of affinity to change. The adoption of agility in public administration is a challenge, given that resistance to change is still prevalent. A transformational change is a constant journey, and therefore the measurement of progress plays an important role in the continuous improvement of an organization. The applied approach offers high potential for improvement in terms of agility and provides interesting insights for both practitioners and academics.
{"title":"Towards improving agility in public administration","authors":"Hanna Looks, Jannik Fangmann, Jörg Thomaschewski, María-José Escalona, Eva-Maria Schön","doi":"10.1007/s11219-023-09657-x","DOIUrl":"https://doi.org/10.1007/s11219-023-09657-x","url":null,"abstract":"<p>In times of crises, such as the refugee crisis or the corona pandemic, the workload in public administrations increases because of the demands of citizens or short-term legal changes. In addition, there is an increasing need for digitalization or to be able to react flexibly to changes. Agile process models and agile practices are appropriate to overcome these challenges. The objective of this paper is to investigate how public administrations can measure their degree of agility to identify potential for improving it. The authors conducted a descriptive single-case study which included multiple units of analysis in a public administration in Germany. The case study was supported by their questionnaire for measuring the degree of agility. One outcome of this study is a conceptual framework that can be used to drive agile transformation in public administrations by continuously measuring agility. Therefore, a questionnaire for measuring agility at team level in public administrations has been developed. The application of the questionnaire in three teams provide insights into dysfunctionality in the interdisciplinary teams as well as optimization potential in terms of affinity to change. The adoption of agility in public administration is a challenge, given that resistance to change is still prevalent. A transformational change is a constant journey, and therefore, the measurement of progress plays an important role in the continuous improvement of an organization. The applied approach delivers high potential for improvement in terms of agility and provides interesting insights for both practitioners and academics.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"136 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139508233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Exploring better alternatives to size metrics for explainable software defect prediction
Pub Date: 2023-12-29 | DOI: 10.1007/s11219-023-09656-y
Chenchen Chai, Guisheng Fan, Huiqun Yu, Zijie Huang, Jianshu Ding, Yao Guan
Delivering reliable software under the constraint of limited time and budget is a significant challenge. Recent progress in software defect prediction is helping developers to locate defect-prone code components and allocate quality assurance resources more efficiently. However, practitioners criticize defect predictors from academia as impractical, since these predictors rely heavily on size metrics such as lines of code (LOC), which over-abstract technical details and provide limited insights for software maintenance. Thus, the performance of predictors may be overclaimed. In response, based on a state-of-the-art defect prediction model, we (1) exclude size metrics and evaluate the impact on performance, (2) include new features such as network dependency metrics, and (3) explore which ones are better alternatives to size metrics using explainable artificial intelligence (XAI) techniques. We find that excluding size metrics decreases model performance by 1.99% and 0.66% on AUC-ROC in within- and cross-project prediction, respectively. The results show that two of the involved network dependency metrics (i.e., Betweenness and pWeakC(out)) and four other code metrics (i.e., LCOM, AVG(CC), LCOM3, and CAM) could effectively preserve or improve the prediction performance, even if we exclude size metrics. In conclusion, we suggest discarding size metrics and involving the mentioned network dependency metrics for better performance and explainability.
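To illustrate the kind of network dependency metric the study points to, the sketch below computes betweenness centrality for classes in a small, invented dependency graph using Brandes' algorithm; such metrics are derived purely from the dependency structure, with no size information such as LOC involved. The class names and edges are assumptions for the example, and the study's actual feature extraction is not shown here.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

// Sketch: betweenness centrality of classes in a hypothetical dependency graph,
// computed with Brandes' algorithm on an unweighted directed graph. The class
// names and edges are invented; the point is that such network dependency
// metrics require no size information like LOC.
public class DependencyBetweenness {

    static double[] betweenness(List<List<Integer>> adj) {
        int n = adj.size();
        double[] centrality = new double[n];
        for (int s = 0; s < n; s++) {
            // Standard Brandes bookkeeping: BFS order, shortest-path counts, predecessors.
            Deque<Integer> stack = new ArrayDeque<>();
            List<List<Integer>> pred = new ArrayList<>();
            for (int i = 0; i < n; i++) pred.add(new ArrayList<>());
            int[] dist = new int[n];
            double[] sigma = new double[n];
            Arrays.fill(dist, -1);
            dist[s] = 0;
            sigma[s] = 1;
            Deque<Integer> queue = new ArrayDeque<>(List.of(s));
            while (!queue.isEmpty()) {
                int v = queue.poll();
                stack.push(v);
                for (int w : adj.get(v)) {
                    if (dist[w] < 0) { dist[w] = dist[v] + 1; queue.add(w); }
                    if (dist[w] == dist[v] + 1) { sigma[w] += sigma[v]; pred.get(w).add(v); }
                }
            }
            // Back-propagate dependencies from the farthest vertices towards the source.
            double[] delta = new double[n];
            while (!stack.isEmpty()) {
                int w = stack.pop();
                for (int v : pred.get(w)) delta[v] += sigma[v] / sigma[w] * (1 + delta[w]);
                if (w != s) centrality[w] += delta[w];
            }
        }
        return centrality;
    }

    public static void main(String[] args) {
        String[] classes = {"OrderService", "PaymentGateway", "Invoice", "AuditLog", "Mailer"};
        // Directed "depends on" edges between the classes above (illustrative only).
        List<List<Integer>> adj = List.of(
                List.of(1, 2), List.of(3), List.of(3), List.of(4), List.of());
        double[] c = betweenness(adj);
        for (int i = 0; i < classes.length; i++)
            System.out.println(classes[i] + " betweenness = " + c[i]);
    }
}
```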
{"title":"Exploring better alternatives to size metrics for explainable software defect prediction","authors":"Chenchen Chai, Guisheng Fan, Huiqun Yu, Zijie Huang, Jianshu Ding, Yao Guan","doi":"10.1007/s11219-023-09656-y","DOIUrl":"https://doi.org/10.1007/s11219-023-09656-y","url":null,"abstract":"<p>Delivering reliable software under the constraint of limited time and budget is a significant challenge. Recent progress in software defect prediction is helping developers to locate defect-prone code components and allocate quality assurance resources more efficiently. However, practitioners’ criticisms on defect predictors from academia are not practical since they rely heavily on size metrics such as lines of code (LOC), which over-abstracts technical details and provides limited insights for software maintenance. Thus, the performance of predictors may be overclaimed. In response, based on a state-of-the-art defect prediction model, we (1) exclude size metrics and evaluate the impact on performance, (2) include new features such as network dependency metrics, and (3) explore which ones are better alternatives to size metrics using explainable artificial intelligence (XAI) technique. We find that excluding size metrics decreases model performance by 1.99% and 0.66% on AUC-ROC in within- and cross-project prediction respectively. The results show that two involved network dependence metrics (i.e., Betweenness and pWeakC(out)) and four other code metrics (i.e., LCOM, AVG(CC), LCOM3, and CAM) could effectively preserve or improve the prediction performance, even if we exclude size metrics. In conclusion, we suggest discarding size metrics and involving the mentioned network dependency metrics for better performance and explainability.\u0000</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"43 7 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139063487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}