A number of empirical studies assess the benefit of documentation and models in the execution of maintenance tasks. Most of these studies are quantitative and fail to analyze the values, beliefs, and assumptions that inform and shape source code comprehensibility and maintainability. We designed and conducted a qualitative study to understand the role of source code comments and identifiers in source code comprehensibility and maintainability. In particular, we sought to understand how novice and young professional developers perceive comments and identifier names after they have inspected the system behavior visible in its user interfaces. The novice developers were 18 third-year Bachelor students in Computer Science; the 12 young professional developers had between 3 months and 2.5 years of work experience. The qualitative methodological approach used is ethnographic. We asked the participants to inspect the behavior of a Java application visible in its user interfaces and then to comprehend and modify the source code of that application. We immersed ourselves in and participated in the study, collecting data by means of contemporaneous field notes, audio recordings, and copies of various artifacts. From the collected data, we identified insights into comprehension and maintenance practices. The main insights can be summarized as follows: (i) compared with novice developers, professional developers prefer to deal with identifier names rather than comments; (ii) all the participants indicate the use of naming conventions for identifiers as essential; and (iii) for all the participants the names of identifiers are important and should be properly chosen. In summary, regardless of the kind of developer, it is advisable to use naming conventions and to choose identifier names carefully.
Felice Salviulo and G. Scanniello. "Dealing with identifiers and comments in source code comprehension and maintenance: results from an ethnographically-informed study with students and professionals." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2601251
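The participants' emphasis on well-chosen identifier names over comments is easy to illustrate. The snippet below is our own hypothetical example, not code from the study: the same routine first with cryptic identifiers that force the reader onto a comment, then with convention-following, self-describing names.

```python
# Hypothetical illustration (not from the study): cryptic identifiers
# make a comment mandatory; descriptive, convention-following names
# carry the meaning themselves.

def calc(a, b, c):
    # a: unit price, b: quantity, c: discount rate -- comment is required
    return a * b * (1 - c)

def total_price(unit_price, quantity, discount_rate):
    # With self-describing names, no explanatory comment is needed.
    return unit_price * quantity * (1 - discount_rate)

# Both compute the same value; only readability differs.
assert calc(10.0, 3, 0.1) == total_price(10.0, 3, 0.1)
print(round(total_price(10.0, 3, 0.1), 2))  # 27.0
```

The second version also survives maintenance better: a developer changing the discount logic does not need to keep a comment synchronized with the code.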
In this paper, we propose a new technique for testing holonic agents. The technique is model-based, uses genetic algorithms, and considers the successive versions of an agent. The approach is organized in two main phases that are conducted iteratively. The first phase detects a new version of an agent under test. The second phase tests each newly detected version: the new version is analyzed to generate a behavioral model, on which the generation of test cases is based. The test case generation process focuses on the new parts of the agent behavior. In this way, the technique supports an incremental update of test cases.
N. H. Dehimi. "Towards a novel testing approach for holonic agents." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2613079
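The abstract names the ingredients (a behavioral model of the new agent version, genetic search focused on the changed behavior) without giving details. The following is a minimal, hypothetical sketch of that idea, not the author's algorithm: a toy fitness function rewards test inputs that exercise transitions assumed to be new in the latest version, so the genetic search concentrates test generation on the delta between versions.

```python
import random

random.seed(1)

# Hypothetical behavioral model: transitions of the agent, and the
# subset assumed to be new in the latest version (illustrative only).
ALL_TRANSITIONS = {"idle->work", "work->done", "work->retry", "retry->done"}
NEW_TRANSITIONS = {"work->retry", "retry->done"}

def run_agent(x):
    """Toy stand-in for executing the agent: returns the transitions
    that input x exercises."""
    covered = {"idle->work"}
    if x % 3 == 0:
        covered |= {"work->retry", "retry->done"}
    else:
        covered.add("work->done")
    return covered

def fitness(x):
    # Reward only coverage of transitions new in this version, so the
    # search focuses on the changed parts of the agent behavior.
    return len(run_agent(x) & NEW_TRANSITIONS)

def evolve(pop_size=10, generations=20):
    pop = [random.randint(0, 100) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]          # selection
        children = [random.choice(parents) + random.randint(-5, 5)
                    for _ in range(pop_size - len(parents))]  # mutation
        pop = parents + children
    return max(pop, key=fitness)

best = evolve()
assert fitness(best) == len(NEW_TRANSITIONS)  # covers all new behavior
```

Rerunning the search after each detected version, with an updated `NEW_TRANSITIONS` delta, is what would make the test suite update incremental.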
Context: A Serious Game (SG) is a game for purposes other than entertainment [12]. SGs are in widespread use and their popularity continues to grow steadily; their application areas now extend beyond education to the military, health, and corporate [9][12] sectors. SGs matter because they can be a means to achieve relevant goals from both a personal and an institutional point of view, in fields as diverse as defense, education, scientific exploration, health care, emergency management, city planning, engineering, religion, and politics. The number of users of these systems grows each day, so their impact is very high, and it is precisely for this reason that more extensive research on SG quality is needed. Objective: The aim of this study is to discover the current state of SG quality initiatives, identifying gaps that merit future investigation. Method: We conducted a systematic mapping study (SMS) on SG quality, following the guidelines proposed by Kitchenham and Charters [7]. We selected 112 papers found in six digital libraries up to April 2013. Results: Since 2007, research on SG quality has grown significantly. It has focused mostly on the effectiveness of SGs (78.57%) and on several entertainment characteristics principally related to pleasure (62.50%). The most widely researched software artifact was the final product (97.32%), with design very far behind (7.14%). Less than half of the research reviewed had been validated by means of experiments, and in most cases the experiments were conducted by the same researchers who had proposed the SG. The majority of experiments have not been replicated. The most common research outcome was questionnaires, closely followed by the confirmation of knowledge; most of these outcomes evaluated the quality of a particular SG. Conclusion: The results show that research on SG quality has grown considerably, making SG quality an area of opportunity for future research. Researchers are mainly concerned with demonstrating or confirming the effectiveness of SGs, but very little research addresses the playability characteristics that make SGs more effective. Since effectiveness and playability are evaluated on the final product, quality assurance methods are needed that incorporate quality issues from the early stages of SG development. Further empirical validation is also needed; in particular, external replications must be performed to corroborate and generalize the findings.
M. Genero, J. A. Vargas, Lilia García-Mundo, and M. Piattini. "A systematic mapping study on serious game quality." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2601261
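As a quick sanity check (our own arithmetic, not part of the paper), the reported percentages map back cleanly to whole-paper counts out of the 112 selected studies:

```python
# Convert the mapping study's reported percentages back into paper
# counts over the 112 selected studies (our own check, not the paper's).
TOTAL = 112
reported = {"effectiveness": 78.57, "pleasure-related": 62.50,
            "final product": 97.32, "design": 7.14}
counts = {topic: round(pct / 100 * TOTAL) for topic, pct in reported.items()}
print(counts)
# {'effectiveness': 88, 'pleasure-related': 70, 'final product': 109, 'design': 8}
```

Each percentage corresponds to an integer number of papers, which is consistent with the stated total of 112.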
Context: Ownership metrics measure how the workload of software modules is shared among their developers. They have been shown to be accurate indicators of software quality. Objective: Since ownership metrics had previously been studied only on industrial software projects, we replicated such a study on Java free/libre and open source software (FLOSS) projects. Our goal was to test whether an "ownership law" stating that minor developers should be avoided generalizes. Method: We explored the relationship between ownership metrics and fault-proneness on seven FLOSS projects, using publicly available corpora to retrieve the fault-related information. Results: In our corpus, the relationship between ownership metrics and module faults is weak: at best, fewer than half of the projects exhibit a significant correlation, and at worst none do. Moreover, fault-proneness seems to be much more influenced by module size than by ownership. Conclusion: The results of ownership studies done on closed-source projects do not generalize to FLOSS projects. To understand why, we performed an in-depth analysis and found that the lack of correlation between ownership metrics and module faults is due to the distribution of contributions among developers and the presence of "heroes" in FLOSS projects.
Matthieu Foucault, Jean-Rémy Falleri, and Xavier Blanc. "Code ownership in open-source software." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2601283
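The kind of analysis described here can be sketched in a few lines. This is a simplified illustration on invented data, not the authors' pipeline: ownership of a module is taken as the share of its changes made by its top contributor, a minor contributor is one below a threshold (commonly 5% of changes), and the association with fault counts is checked with a rank correlation.

```python
from collections import Counter

def ownership_metrics(changes, threshold=0.05):
    """changes: list of developer names, one entry per change to a module.
    Returns (share of the top contributor, number of minor contributors)."""
    counts = Counter(changes)
    total = sum(counts.values())
    shares = [c / total for c in counts.values()]
    return max(shares), sum(1 for s in shares if s < threshold)

def spearman(xs, ys):
    """Spearman rank correlation (no tie handling; enough for a sketch)."""
    def ranks(v):
        order = sorted(range(len(v)), key=lambda i: v[i])
        r = [0] * len(v)
        for rank, i in enumerate(order):
            r[i] = rank
        return r
    rx, ry = ranks(xs), ranks(ys)
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

# Invented module histories and fault counts (illustrative only).
modules = {
    "core": (["ann"] * 18 + ["bob"] * 2, 1),
    "gui":  (["ann"] * 5 + ["bob"] * 5 + ["eve"] * 10, 4),
    "io":   (["eve"] * 19 + ["dan"], 2),
    "net":  (["ann", "bob", "eve", "dan"] * 5, 6),
}
own = [ownership_metrics(history)[0] for history, _ in modules.values()]
faults = [f for _, f in modules.values()]
print(round(spearman(own, faults), 2))  # -0.8 for this toy data
```

On real projects the paper's point is precisely that this correlation is weak or absent once module size and the skewed ("hero"-dominated) contribution distributions of FLOSS projects are taken into account.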
This paper presents the main goals, questions, and methods of my PhD research, which focuses on facilitating the conduct of distributed human-oriented experiments in software engineering (SE). First, I am studying how experiments are conducted in SE and what the challenges are in conducting distributed experiments. Second, I plan to propose a platform able to mitigate the challenges encountered. Finally, I will evaluate the proposed solution using both academic and industrial experiments. The research is divided into phases, each based on a research question, and I propose a set of empirical studies intended to answer each research question.
Waldemar Ferreira. "Together we are stronger: facilitating the conduction of distributed human-oriented experiments." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2613083
Test-driven development (TDD) is an iterative software development technique in which unit tests are defined before production code. Proponents of TDD claim that it improves both external quality and developers' productivity. In particular, Erdogmus et al. (the original study) proposed a two-stage model to investigate these claims about TDD's effects. Our aim is to enhance the model proposed in the original study by investigating an additional factor: TDD process conformance. We conducted a close, external replication of the original study, accompanied by a correlation analysis to check whether process conformance is related to improvements for the subjects using TDD. We partially confirmed the results of the original study. Moreover, we observed a correlation between process conformance and quality, but not productivity. We found no evidence to support the claim that external quality and productivity are improved by the adoption of TDD compared to test-last development, whereas conformance to the TDD process improves quality but does not affect productivity. We conclude that process conformance is a relevant factor when studying the quality- and productivity-related effects of TDD.
D. Fucci, Burak Turhan, and M. Oivo. "Conformance factor in test-driven development: initial results from an enhanced replication." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2601272
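Process conformance, the factor this replication adds, boils down to how closely a developer's edit stream follows the test-first cycle. A minimal, hypothetical way to score it (not the instrument used in the study) is the fraction of production-code edits immediately preceded by a test edit:

```python
def tdd_conformance(events):
    """events: sequence of 'test' / 'prod' edit events in session order.
    Score = fraction of 'prod' edits immediately preceded by a 'test'
    edit -- a crude, hypothetical proxy for test-first discipline."""
    prods = [i for i, e in enumerate(events) if e == "prod"]
    if not prods:
        return 1.0  # nothing to conform: vacuously test-first
    conforming = sum(1 for i in prods if i > 0 and events[i - 1] == "test")
    return conforming / len(prods)

# A strict test-first session vs. a test-last session.
assert tdd_conformance(["test", "prod", "test", "prod"]) == 1.0
assert tdd_conformance(["prod", "prod", "test"]) == 0.0
print(tdd_conformance(["test", "prod", "prod", "test", "prod"]))  # ~0.667
```

Correlating such a score with quality and productivity outcomes across subjects is the kind of analysis the replication describes.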
Context: In modern systems such as cyber-physical systems, where software and physical services interact, safety, security, and performance play an important role. Integration testing is an effective measure to guarantee the correct interoperability of such systems with respect to functional and non-functional requirements. Model-based testing, moreover, enables not only early definition and validation but also test automation, which makes it a good choice for overcoming the urgent challenges of integration testing. Objective: Many publications on model-based integration testing (MBIT) approaches exist, but a study giving a systematic overview of the underlying software paradigms, the measures used to guide the integration testing process, and the non-functional requirements the approaches are suitable for is missing. The aim of this paper is to find and synthesize the relevant primary studies to gain a comprehensive understanding of the current state of model-based integration testing. Method: To synthesize the relevant studies, we conducted a systematic literature review (SLR) according to the guidelines of Kitchenham. Results: The systematic search and selection retrieved 83 relevant studies from which data were extracted. Our review identified three kinds of assessment criteria for guiding the testing process: static metrics, dynamic metrics, and stochastic & random approaches. It also shows that only a small fraction of the studies consider non-functional requirements, and that most approaches target component-oriented systems. Conclusion: The SLR reveals two major research gaps. First, there is an accumulated need for MBIT approaches that support non-functional requirements, as these are gaining importance. Second, means for steering the integration testing process, especially together with automation, need to evolve.
Florian Häser, M. Felderer, and R. Breu. "Software paradigms, assessment types and non-functional requirements in model-based integration testing: a systematic literature review." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2601257
Context: Although the use of agile software development (ASD) methods has grown dramatically in recent years, there is a lack of empirical evidence for their effective use and for the improvements they are predicted to bring to software development. This suggests an imprecise understanding of their use and practice in organisations beyond the adoption phase, and a lack of clarity on what 'usage' or 'effectiveness' mean in the context of ASD. In particular, the relationship between agile usage and agile effectiveness has not been investigated. Objective: To study the factors that affect the usage of ASD methods and to analyse the relationship between post-adoption use of ASD methods (agile usage) and the results effectively achieved in terms of improvements in development outcomes (agile effectiveness). Method: An agile usage model of post-adoption, or continued, agile usage is tested using a survey instrument. The study employs variance-based partial least squares structural equation modelling (PLS-SEM) implemented in SmartPLS version 2.0. Special attention is given to the quality of the measures used by developing reliable and valid measures of the study variables. Results: The partial least squares (PLS) analysis shows that relative advantage and the presence of an agile coach significantly influence the usage of agile methods. The results also support the thesis that agile usage, as measured by the intensity and extent of agile method use, significantly affects agile effectiveness. Conclusions: The overall contribution of this research is to improve the effectiveness of ASD methods by increasing the likelihood that they will be effectively used and sustained beyond the adoption phase.
Mali Senapathi and A. Srinivasan. "An empirical investigation of the factors affecting agile usage." International Conference on Evaluation & Assessment in Software Engineering, 2014. doi:10.1145/2601248.2601253
[Objective] The objective of this paper is to extend a previously conducted systematic literature review (SLR) that investigated under what circumstances individual organizations can rely on cross-company estimation models. [Method] We applied the same methodology used in the SLR we are extending (covering the period 2006-2013), based on primary studies that compared predictions from cross-company models with predictions from within-company models built from analysis of project data. [Results] We identified 11 additional papers; however, two of these did not present independent results and one had inconclusive findings. Two of the remaining eight papers presented both trials where cross-company predictions were not significantly different from within-company predictions and trials where they were. Four papers found that cross-company models gave prediction accuracy significantly different from within-company models (one of them in favor of cross-company models), while two found no significant difference. The main pattern among the study-related factors was that studies where cross-company predictions differed significantly from within-company predictions employed larger within-company data sets. [Conclusions] Overall, half of the analyzed evidence indicates that cross-company estimation models are not significantly worse than within-company estimation models. Moreover, there is some evidence that a larger sample size does not imply higher estimation accuracy, and that samples for building estimation models should be carefully selected and filtered based on quality control and project similarity. These results need to be combined with the findings from the SLR we are extending to allow further investigation of this topic.
{"title":"Cross- vs. within-company cost estimation studies revisited: an extended systematic review","authors":"E. Mendes, Marcos Kalinowski, Daves Martins, F. Ferrucci, Federica Sarro","doi":"10.1145/2601248.2601284","DOIUrl":"https://doi.org/10.1145/2601248.2601284","url":null,"abstract":"[Objective] The objective of this paper is to extend a previously conducted systematic literature review (SLR) that investigated under what circumstances individual organizations would be able to rely on cross-company based estimation models. [Method] We applied the same methodology used in the SLR we are extending herein (covering the period 2006-2013) based on primary studies that compared predictions from cross-company models with predictions from within-company models constructed from analysis of project data. [Results] We identified 11 additional papers; however two of these did not present independent results and one had inconclusive findings. Two of the remaining eight papers presented both, trials where cross-company predictions were not significantly different from within-company predictions and others where they were significantly different. Four found that cross-company models gave prediction accuracy significantly different from within-company models (one of them in favor of cross-company models), while two found no significant difference. The main pattern when examining the study related factors was that studies where cross-company predictions were significantly different from within-company predictions employed larger within-company data sets. [Conclusions] Overall, half of the analyzed evidence indicated that cross-company estimation models are not significantly worse than within-company estimation models. Moreover, there is some evidence that sample size does not imply in higher estimation accuracy, and that samples for building estimation models should be carefully selected/filtered based on quality control and project similarity aspects. 
The results need to be combined with the findings from the SLR we are extending to allow further investigating this topic.","PeriodicalId":115935,"journal":{"name":"International Conference on Evaluation & Assessment in Software Engineering","volume":"48 16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130999683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
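The primary studies behind this SLR all share one experimental shape: estimate effort for a company's projects once with a model trained on that company's own history (within-company) and once with a model trained on other companies' data (cross-company), then compare prediction error. A minimal sketch of that comparison, using an ordinary least-squares size-to-effort model and the MMRE accuracy measure on entirely synthetic, illustrative data (none of the numbers come from the reviewed studies):

```python
import random

def fit_linear(points):
    """Least-squares fit of effort = a * size + b."""
    n = len(points)
    sx = sum(s for s, _ in points)
    sy = sum(e for _, e in points)
    sxx = sum(s * s for s, _ in points)
    sxy = sum(s * e for s, e in points)
    a = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    b = (sy - a * sx) / n
    return a, b

def mmre(model, points):
    """Mean Magnitude of Relative Error on held-out projects."""
    a, b = model
    return sum(abs(e - (a * s + b)) / e for s, e in points) / len(points)

random.seed(1)
# Synthetic (size in function points, effort in person-hours) pairs.
# The target company is deliberately given a different productivity
# rate (12 h/FP) than the cross-company pool (9 h/FP) to mimic the
# company-heterogeneity issue the reviewed studies investigate.
within = [(s, 12 * s + random.gauss(0, 40)) for s in range(50, 300, 25)]
cross = [(s, 9 * s + random.gauss(0, 120)) for s in range(40, 400, 15)]
holdout, training = within[::2], within[1::2]

within_model = fit_linear(training)   # trained on own history
cross_model = fit_linear(cross)       # trained on other companies' data
print(f"within-company MMRE: {mmre(within_model, holdout):.2f}")
print(f"cross-company MMRE:  {mmre(cross_model, holdout):.2f}")
```

With this setup the cross-company model inherits the pool's productivity rate and so misestimates the target company systematically; real studies refine exactly this comparison with filtering and project-similarity selection, as the conclusions above suggest.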
Essence reflection meetings: field study
Cécile Péraire, Todd Sedano
DOI: 10.1145/2601248.2601296

This paper presents an empirical evaluation of the team reflection support provided by the Software Engineering Method and Theory (SEMAT) Essence framework, and compares Essence reflection meetings to other types of team reflection meetings. The researchers conducted a field study involving seven master's student teams running Essence reflection meetings throughout their practicum projects, each aiming to deliver a working product for an industry client. The main result validates that Essence meetings generate reflective team discussions through a thinking framework that is holistic, state-based, goal-driven, and method-agnostic. Student teams benefit from stepping back and assessing the project holistically throughout its lifecycle. The goals set by the framework's checklists lead the teams to address critical aspects of the project that had not otherwise been considered. All team members are encouraged to express their views and influence the various project dimensions. Essence reflection meetings are comparable and complementary to Agile retrospectives, and project teams might want to leverage both techniques. The value added by Essence reflections is to surface unknown issues, help monitor progress, steer the project to a higher state, and prevent retrospectives from becoming repetitive by varying their style.