Inf. Softw. Technol.最新文献

英文中文

Squeeziness for non-deterministic systems 非确定性系统的挤压性

Inf. Softw. Technol.

Pub Date : 2023-02-01 DOI: 10.2139/ssrn.4151287

Alfredo Ibias, Manuel Núñez

引用次数: 0

An investigation of causes and effects of trust in Boundary Artefacts 边界人工制品信任的因果研究

Inf. Softw. Technol.

Pub Date : 2023-02-01 DOI: 10.2139/ssrn.4253491

R. Ouriques, Fabian Fagerholm, Daniel Mendez, Baldvin Gislason Bern

Context: Boundary Artefacts (BAs) support software development activities in many aspects because it carries lots of information in the same object that can be used and interpreted by several social groups within an organisation. When the BAs are inconsistent regarding their content, such as many meanings or lack of contextual information, their efficiency is reduced because stakeholders will not trust them. Objective: This study aimed to understand the implications of differences in the perception of trust on software projects and their influence on stakeholders’ behaviour. Methods: We conducted an exploratory case study to observe the creation and utilisation of one specific BA and the implications of differences in trust and their influence on stakeholders’ behaviour. Results : Our investigation has shown that practitioners adding and adjusting existing content do not entirely understand the stakeholders’ needs. Together with the partial management of the content, trust is impacted. When the content of BAs does not meet the trust factors, specifically reliability and predictability, the stakeholders cannot execute their tasks appropriately, and several implications affect the software development project. Additionally, they create workarounds to supply their needs. Conclusion: The differences in trust in BAs affect software projects in different areas of the organisation and interfere with the task execution of various stakeholders. The decrease in trust results from inconsistencies in the content associated with the lack of management of the BA. A structured strategy for representing and managing a BA’s content seems appropriate to increase trust levels and efficiency.

上下文:边界工件(BAs)在许多方面支持软件开发活动，因为它在同一个对象中携带了许多信息，这些信息可以被组织内的几个社会团体使用和解释。当BAs的内容不一致时，例如许多含义或缺乏上下文信息，它们的效率就会降低，因为涉众不会信任它们。目的:本研究旨在了解软件项目信任感知差异的含义及其对利益相关者行为的影响。方法:我们进行了一个探索性案例研究，以观察一个特定BA的创建和使用，以及信任差异的含义及其对利益相关者行为的影响。结果:我们的调查显示，从业者增加和调整现有内容并没有完全理解利益相关者的需求。再加上内容的部分管理，信任受到了影响。当BAs的内容不满足信任因素，特别是可靠性和可预测性时，涉众就不能适当地执行他们的任务，并且会影响软件开发项目。此外，他们还会创建变通方法来满足自己的需求。结论:ba的信任差异会影响组织不同领域的软件项目，并干扰各种利益相关者的任务执行。信任的下降是由于缺乏BA管理导致的内容不一致。表示和管理BA内容的结构化策略似乎适合于提高信任级别和效率。

{"title":"An investigation of causes and effects of trust in Boundary Artefacts","authors":"R. Ouriques, Fabian Fagerholm, Daniel Mendez, Baldvin Gislason Bern","doi":"10.2139/ssrn.4253491","DOIUrl":"https://doi.org/10.2139/ssrn.4253491","url":null,"abstract":"Context: Boundary Artefacts (BAs) support software development activities in many aspects because it carries lots of information in the same object that can be used and interpreted by several social groups within an organisation. When the BAs are inconsistent regarding their content, such as many meanings or lack of contextual information, their efficiency is reduced because stakeholders will not trust them. Objective: This study aimed to understand the implications of differences in the perception of trust on software projects and their influence on stakeholders’ behaviour. Methods: We conducted an exploratory case study to observe the creation and utilisation of one specific BA and the implications of differences in trust and their influence on stakeholders’ behaviour. Results : Our investigation has shown that practitioners adding and adjusting existing content do not entirely understand the stakeholders’ needs. Together with the partial management of the content, trust is impacted. When the content of BAs does not meet the trust factors, specifically reliability and predictability, the stakeholders cannot execute their tasks appropriately, and several implications affect the software development project. Additionally, they create workarounds to supply their needs. Conclusion: The differences in trust in BAs affect software projects in different areas of the organisation and interfere with the task execution of various stakeholders. The decrease in trust results from inconsistencies in the content associated with the lack of management of the BA. A structured strategy for representing and managing a BA’s content seems appropriate to increase trust levels and efficiency.","PeriodicalId":133352,"journal":{"name":"Inf. Softw. Technol.","volume":"466 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134261627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

COSMOS: A comprehensive framework for automatically generating domain-oriented test suite COSMOS:用于自动生成面向领域的测试套件的综合框架

Inf. Softw. Technol.

Pub Date : 2023-02-01 DOI: 10.2139/ssrn.4092941

A. Kalaee, S. Parsa, Negar Fathi

引用次数: 1

A systematic literature review of capstone courses in software engineering 软件工程顶点课程的系统文献综述

Inf. Softw. Technol.

Pub Date : 2023-01-09 DOI: 10.48550/arXiv.2301.03554

Saara Tenhunen, T. Männistö, Matti Luukkainen, Petri Ihantola

Tertiary education institutions aim to prepare their computer science and software engineering students for working life. While much of the technical principles are covered in lower-level courses, team-based capstone projects are a common way to provide students with hands-on experience and teach soft skills. This paper explores the characteristics of software engineering capstone courses presented in the literature. The goal of this work is to understand the pros and cons of different approaches by synthesising the various aspects of software engineering capstone courses and related experiences. In a systematic literature review for 2007-2022, we identified 127 primary studies. These studies were analysed based on their presented course characteristics and the reported course outcomes. The characteristics were synthesised into a taxonomy consisting of duration, team sizes, client and project sources, project implementation, and student assessment. We found out that capstone courses generally last one semester and divide students into groups of 4-5 where they work on a project for a client. For a slight majority of courses, the clients are external to the course staff and students are often expected to produce a proof-of-concept level software product as the main end deliverable. The courses also offer versatile assessments for students throughout the project. This paper provides researchers and educators with a classification of characteristics of software engineering capstone courses based on previous research. We further synthesise insights on the reported outcomes of capstone courses. Our review study aims to help educators to identify various ways of organising capstones and effectively plan and deliver their own capstone courses. The characterisation also helps researchers to conduct further studies on software engineering capstones.

高等教育机构的目标是让计算机科学和软件工程专业的学生为工作生活做好准备。虽然低级别的课程涵盖了许多技术原理，但基于团队的顶点项目是为学生提供实践经验和教授软技能的常用方法。本文探讨了在文献中提出的软件工程顶点课程的特点。这项工作的目标是通过综合软件工程顶点课程和相关经验的各个方面来理解不同方法的优缺点。在2007-2022年的系统文献综述中，我们确定了127项主要研究。这些研究是根据他们所呈现的课程特征和报告的课程结果进行分析的。这些特征被综合成一个由持续时间、团队规模、客户和项目来源、项目实施和学生评估组成的分类法。我们发现顶点课程通常持续一个学期，并将学生分成4-5人一组，在那里他们为客户完成一个项目。对于大多数课程来说，客户是外部的课程员工，学生通常被期望生产一个概念验证级别的软件产品作为主要的最终交付物。课程还为学生在整个项目中提供多种评估。本文在前人研究的基础上，为研究人员和教育工作者提供了软件工程顶点课程的特征分类。我们进一步综合了对顶点课程报告结果的见解。我们的回顾研究旨在帮助教育工作者确定各种组织顶点课程的方法，并有效地计划和交付他们自己的顶点课程。这种特征也有助于研究人员对软件工程顶点进行进一步的研究。

{"title":"A systematic literature review of capstone courses in software engineering","authors":"Saara Tenhunen, T. Männistö, Matti Luukkainen, Petri Ihantola","doi":"10.48550/arXiv.2301.03554","DOIUrl":"https://doi.org/10.48550/arXiv.2301.03554","url":null,"abstract":"Tertiary education institutions aim to prepare their computer science and software engineering students for working life. While much of the technical principles are covered in lower-level courses, team-based capstone projects are a common way to provide students with hands-on experience and teach soft skills. This paper explores the characteristics of software engineering capstone courses presented in the literature. The goal of this work is to understand the pros and cons of different approaches by synthesising the various aspects of software engineering capstone courses and related experiences. In a systematic literature review for 2007-2022, we identified 127 primary studies. These studies were analysed based on their presented course characteristics and the reported course outcomes. The characteristics were synthesised into a taxonomy consisting of duration, team sizes, client and project sources, project implementation, and student assessment. We found out that capstone courses generally last one semester and divide students into groups of 4-5 where they work on a project for a client. For a slight majority of courses, the clients are external to the course staff and students are often expected to produce a proof-of-concept level software product as the main end deliverable. The courses also offer versatile assessments for students throughout the project. This paper provides researchers and educators with a classification of characteristics of software engineering capstone courses based on previous research. We further synthesise insights on the reported outcomes of capstone courses. Our review study aims to help educators to identify various ways of organising capstones and effectively plan and deliver their own capstone courses. The characterisation also helps researchers to conduct further studies on software engineering capstones.","PeriodicalId":133352,"journal":{"name":"Inf. Softw. Technol.","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124452210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Many-Objective Optimization of Non-Functional Attributes based on Refactoring of Software Models 基于软件模型重构的非功能属性多目标优化

Inf. Softw. Technol.

Pub Date : 2023-01-01 DOI: 10.48550/arXiv.2301.09531

V. Cortellessa, Daniele Di Pompeo, Vincenzo Stoico, Michele Tucci

Software quality estimation is a challenging and time-consuming activity, and models are crucial to face the complexity of such activity on modern software applications. In this context, software refactoring is a crucial activity within development life-cycles where requirements and functionalities rapidly evolve. One main challenge is that the improvement of distinctive quality attributes may require contrasting refactoring actions on software, as for trade-off between performance and reliability (or other non-functional attributes). In such cases, multi-objective optimization can provide the designer with a wider view on these trade-offs and, consequently, can lead to identify suitable refactoring actions that take into account independent or even competing objectives. In this paper, we present an approach that exploits NSGA-II as the genetic algorithm to search optimal Pareto frontiers for software refactoring while considering many objectives. We consider performance and reliability variations of a model alternative with respect to an initial model, the amount of performance antipatterns detected on the model alternative, and the architectural distance, which quantifies the effort to obtain a model alternative from the initial one. We applied our approach on two case studies: a Train Ticket Booking Service, and CoCoME. We observed that our approach is able to improve performance (by up to 42%) while preserving or even improving the reliability (by up to 32%) of generated model alternatives. We also observed that there exists an order of preference of refactoring actions among model alternatives. We can state that performance antipatterns confirmed their ability to improve performance of a subject model in the context of many-objective optimization. In addition, the metric that we adopted for the architectural distance seems to be suitable for estimating the refactoring effort.

软件质量评估是一项具有挑战性且耗时的活动，而模型对于面对现代软件应用程序中此类活动的复杂性至关重要。在这种情况下，软件重构是需求和功能快速发展的开发生命周期中的关键活动。一个主要的挑战是，不同质量属性的改进可能需要对软件进行不同的重构操作，比如在性能和可靠性(或其他非功能属性)之间进行权衡。在这种情况下，多目标优化可以为设计人员提供一个更广泛的视角来看待这些权衡，因此，可以确定适当的重构操作，考虑到独立甚至竞争的目标。在本文中，我们提出了一种利用NSGA-II作为遗传算法来搜索软件重构的最优Pareto边界的方法，同时考虑了许多目标。我们考虑与初始模型相关的模型替代的性能和可靠性变化，在模型替代上检测到的性能反模式的数量，以及体系结构距离，它量化了从初始模型替代中获得模型替代的努力。我们在两个案例研究中应用了我们的方法:火车票预订服务和CoCoME。我们观察到，我们的方法能够提高性能(高达42%)，同时保留甚至提高生成的模型替代方案的可靠性(高达32%)。我们还观察到，在模型备选方案中存在重构行为的优先顺序。我们可以说，性能反模式证实了它们在多目标优化上下文中提高主题模型性能的能力。此外，我们采用的体系结构距离度量似乎适合于估计重构工作。

{"title":"Many-Objective Optimization of Non-Functional Attributes based on Refactoring of Software Models","authors":"V. Cortellessa, Daniele Di Pompeo, Vincenzo Stoico, Michele Tucci","doi":"10.48550/arXiv.2301.09531","DOIUrl":"https://doi.org/10.48550/arXiv.2301.09531","url":null,"abstract":"Software quality estimation is a challenging and time-consuming activity, and models are crucial to face the complexity of such activity on modern software applications. In this context, software refactoring is a crucial activity within development life-cycles where requirements and functionalities rapidly evolve. One main challenge is that the improvement of distinctive quality attributes may require contrasting refactoring actions on software, as for trade-off between performance and reliability (or other non-functional attributes). In such cases, multi-objective optimization can provide the designer with a wider view on these trade-offs and, consequently, can lead to identify suitable refactoring actions that take into account independent or even competing objectives. In this paper, we present an approach that exploits NSGA-II as the genetic algorithm to search optimal Pareto frontiers for software refactoring while considering many objectives. We consider performance and reliability variations of a model alternative with respect to an initial model, the amount of performance antipatterns detected on the model alternative, and the architectural distance, which quantifies the effort to obtain a model alternative from the initial one. We applied our approach on two case studies: a Train Ticket Booking Service, and CoCoME. We observed that our approach is able to improve performance (by up to 42%) while preserving or even improving the reliability (by up to 32%) of generated model alternatives. We also observed that there exists an order of preference of refactoring actions among model alternatives. We can state that performance antipatterns confirmed their ability to improve performance of a subject model in the context of many-objective optimization. In addition, the metric that we adopted for the architectural distance seems to be suitable for estimating the refactoring effort.","PeriodicalId":133352,"journal":{"name":"Inf. Softw. Technol.","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130195832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

A light-weight data augmentation method for fault localization 一种轻量级数据增强故障定位方法

Inf. Softw. Technol.

Pub Date : 2023-01-01 DOI: 10.2139/ssrn.4182106

Jian Hu, Huan Xie, Yan Lei, Ke Yu

引用次数: 1

Requirements Engineering for Artificial Intelligence Systems: A Systematic Mapping Study 人工智能系统需求工程:系统映射研究

Inf. Softw. Technol.

Pub Date : 2022-12-20 DOI: 10.48550/arXiv.2212.10693

Khlood Ahmad, Mohamed Almorsy, Chetan Arora, Muneera Bano, John C. Grundy

[Context] In traditional software systems, Requirements Engineering (RE) activities are well-established and researched. However, building Artificial Intelligence (AI) based software with limited or no insight into the system's inner workings poses significant new challenges to RE. Existing literature has focused on using AI to manage RE activities, with limited research on RE for AI (RE4AI). [Objective] This paper investigates current approaches for specifying requirements for AI systems, identifies available frameworks, methodologies, tools, and techniques used to model requirements, and finds existing challenges and limitations. [Method] We performed a systematic mapping study to find papers on current RE4AI approaches. We identified 43 primary studies and analysed the existing methodologies, models, tools, and techniques used to specify and model requirements in real-world scenarios. [Results] We found several challenges and limitations of existing RE4AI practices. The findings highlighted that current RE applications were not adequately adaptable for building AI systems and emphasised the need to provide new techniques and tools to support RE4AI. [Conclusion] Our results showed that most of the empirical studies on RE4AI focused on autonomous, self-driving vehicles and managing data requirements, and areas such as ethics, trust, and explainability need further research.

在传统的软件系统中，需求工程(RE)活动是建立和研究的。然而，构建基于人工智能(AI)的软件，对系统内部工作原理的了解有限或没有，对可再生能源构成了重大的新挑战。现有文献主要集中在使用人工智能管理可再生能源活动，而对可再生能源用于人工智能(RE4AI)的研究有限。[目的]本文调查了当前用于指定AI系统需求的方法，确定了可用的框架、方法、工具和用于建模需求的技术，并发现了现有的挑战和限制。[方法]我们进行了系统的图谱研究，查找当前RE4AI方法的论文。我们确定了43项主要研究，并分析了用于指定和建模真实场景中的需求的现有方法、模型、工具和技术。[结果]我们发现了现有RE4AI实践的一些挑战和局限性。研究结果强调，目前的可再生能源应用不能充分适应构建人工智能系统，并强调需要提供新的技术和工具来支持可再生能源人工智能。[结论]我们的研究结果表明，大多数关于RE4AI的实证研究都集中在自动驾驶、自动驾驶车辆和管理数据需求上，道德、信任和可解释性等领域需要进一步研究。

{"title":"Requirements Engineering for Artificial Intelligence Systems: A Systematic Mapping Study","authors":"Khlood Ahmad, Mohamed Almorsy, Chetan Arora, Muneera Bano, John C. Grundy","doi":"10.48550/arXiv.2212.10693","DOIUrl":"https://doi.org/10.48550/arXiv.2212.10693","url":null,"abstract":"[Context] In traditional software systems, Requirements Engineering (RE) activities are well-established and researched. However, building Artificial Intelligence (AI) based software with limited or no insight into the system's inner workings poses significant new challenges to RE. Existing literature has focused on using AI to manage RE activities, with limited research on RE for AI (RE4AI). [Objective] This paper investigates current approaches for specifying requirements for AI systems, identifies available frameworks, methodologies, tools, and techniques used to model requirements, and finds existing challenges and limitations. [Method] We performed a systematic mapping study to find papers on current RE4AI approaches. We identified 43 primary studies and analysed the existing methodologies, models, tools, and techniques used to specify and model requirements in real-world scenarios. [Results] We found several challenges and limitations of existing RE4AI practices. The findings highlighted that current RE applications were not adequately adaptable for building AI systems and emphasised the need to provide new techniques and tools to support RE4AI. [Conclusion] Our results showed that most of the empirical studies on RE4AI focused on autonomous, self-driving vehicles and managing data requirements, and areas such as ethics, trust, and explainability need further research.","PeriodicalId":133352,"journal":{"name":"Inf. Softw. Technol.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131806638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Synthesizing Research on Programmers' Mental Models of Programs, Tasks and Concepts - a Systematic Literature Review 程序员程序、任务和概念心理模型的综合研究——系统文献综述

Inf. Softw. Technol.

Pub Date : 2022-12-15 DOI: 10.48550/arXiv.2212.07763

Ava Heinonen, Bettina Lehtelä, Arto Hellas, Fabian Fagerholm

Programmers' mental models represent their knowledge and understanding of programs, programming concepts, and programming in general. They guide programmers' work and influence their task performance. Understanding mental models is important for designing work systems and practices that support programmers. Although the importance of programmers' mental models is widely acknowledged, research on mental models has decreased over the years. The results are scattered and do not take into account recent developments in software engineering. We analyze the state of research into programmers' mental models and provide an overview of existing research. We connect results on mental models from different strands of research to form a more unified knowledge base on the topic. We conducted a systematic literature review on programmers' mental models. We analyzed literature addressing mental models in different contexts, including mental models of programs, programming tasks, and programming concepts. Using nine search engines, we found 3678 articles (excluding duplicates). 84 were selected for further analysis. Using the snowballing technique, we obtained a final result set containing 187 articles. We show that the literature shares a kernel of shared understanding of mental models. By collating and connecting results on mental models from different fields of research, we uncovered some well-researched aspects, which we argue are fundamental characteristics of programmers' mental models. This work provides a basis for future work on mental models. The research field on programmers' mental models still faces many challenges rising from a lack of a shared knowledge base and poorly defined constructs. We created a unified knowledge base on the topic. We also point to directions for future studies. In particular, we call for studies that examine programmers working with modern practices and tools.

程序员的心智模型代表了他们对程序、编程概念和一般编程的知识和理解。它们指导程序员的工作并影响他们的任务表现。理解心智模型对于设计支持程序员的工作系统和实践是很重要的。尽管程序员的心智模型的重要性得到了广泛的认可，但对心智模型的研究近年来却有所减少。结果是分散的，并且没有考虑到软件工程的最新发展。我们分析了程序员心理模型的研究状态，并提供了现有研究的概述。我们将来自不同研究方向的心智模型的结果联系起来，形成关于该主题的更统一的知识库。我们对程序员的心智模型进行了系统的文献回顾。我们分析了在不同背景下处理心理模型的文献，包括程序的心理模型、编程任务和编程概念。使用9个搜索引擎，我们发现了3678篇文章(不包括重复的)。选取84例进行进一步分析。使用滚雪球技术，我们获得了包含187篇文章的最终结果集。我们表明，这些文献共享了对心智模型的共同理解的核心。通过整理和连接来自不同研究领域的心智模型的结果，我们发现了一些研究得很好的方面，我们认为这些是程序员心智模型的基本特征。本研究为今后心理模型的研究奠定了基础。由于缺乏共享的知识库和定义不清的结构，程序员心智模型的研究领域仍然面临着许多挑战。我们创建了一个关于这个主题的统一知识库。我们还指出了未来研究的方向。特别是，我们要求研究使用现代实践和工具的程序员。

{"title":"Synthesizing Research on Programmers' Mental Models of Programs, Tasks and Concepts - a Systematic Literature Review","authors":"Ava Heinonen, Bettina Lehtelä, Arto Hellas, Fabian Fagerholm","doi":"10.48550/arXiv.2212.07763","DOIUrl":"https://doi.org/10.48550/arXiv.2212.07763","url":null,"abstract":"Programmers' mental models represent their knowledge and understanding of programs, programming concepts, and programming in general. They guide programmers' work and influence their task performance. Understanding mental models is important for designing work systems and practices that support programmers. Although the importance of programmers' mental models is widely acknowledged, research on mental models has decreased over the years. The results are scattered and do not take into account recent developments in software engineering. We analyze the state of research into programmers' mental models and provide an overview of existing research. We connect results on mental models from different strands of research to form a more unified knowledge base on the topic. We conducted a systematic literature review on programmers' mental models. We analyzed literature addressing mental models in different contexts, including mental models of programs, programming tasks, and programming concepts. Using nine search engines, we found 3678 articles (excluding duplicates). 84 were selected for further analysis. Using the snowballing technique, we obtained a final result set containing 187 articles. We show that the literature shares a kernel of shared understanding of mental models. By collating and connecting results on mental models from different fields of research, we uncovered some well-researched aspects, which we argue are fundamental characteristics of programmers' mental models. This work provides a basis for future work on mental models. The research field on programmers' mental models still faces many challenges rising from a lack of a shared knowledge base and poorly defined constructs. We created a unified knowledge base on the topic. We also point to directions for future studies. In particular, we call for studies that examine programmers working with modern practices and tools.","PeriodicalId":133352,"journal":{"name":"Inf. Softw. Technol.","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127678797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

CLeBPI: Contrastive Learning for Bug Priority Inference CLeBPI: Bug优先级推理的对比学习

Inf. Softw. Technol.

Pub Date : 2022-12-02 DOI: 10.48550/arXiv.2212.01011

Wen-Yen Wang, Chenhao Wu, Jie He

Automated bug priority inference can reduce the time overhead of bug triagers for priority assignments, improving the efficiency of software maintenance. Currently, there are two orthogonal lines for this task, i.e., traditional machine learning based (TML-based) and neural network based (NN-based) approaches. Although these approaches achieve competitive performance, our observation finds that existing approaches face the following two issues: 1) TML-based approaches require much manual feature engineering and cannot learn the semantic information of bug reports; 2) Both TML-based and NN-based approaches cannot effectively address the label imbalance problem because they are difficult to distinguish the semantic difference between bug reports with different priorities. In this paper, we propose CLeBPI (Contrastive Learning for Bug Priority Inference), which leverages pre-trained language model and contrastive learning to tackle the above-mentioned two issues. Specifically, CLeBPI is first pre-trained on a large-scale bug report corpus in a self-supervised way, thus it can automatically learn contextual representations of bug reports without manual feature engineering. Afterward, it is further pre-trained by a contrastive learning objective, which enables it to distinguish semantic differences between bug reports, learning more precise contextual representations for each bug report. When finishing pre-training, we can connect a classification layer to CLeBPI and fine-tune it for bug priority inference in a supervised way. To verify the effectiveness of CLeBPI, we choose four baseline approaches and conduct comparison experiments on a public dataset. The experimental results show that CLeBPI outperforms all baseline approaches by 23.86%-77.80% in terms of weighted average F1-score, showing its effectiveness.

自动化的bug优先级推断可以减少bug触发器分配优先级的时间开销，提高软件维护的效率。目前，该任务有两条正交线，即基于传统机器学习(tm -based)和基于神经网络(NN-based)的方法。虽然这些方法取得了相当的性能，但我们的观察发现，现有的方法面临以下两个问题:1)基于xml的方法需要大量的手动特征工程，并且无法学习bug报告的语义信息;2)由于难以区分不同优先级的bug报告之间的语义差异，基于html和基于nn的方法都不能有效地解决标签不平衡问题。在本文中，我们提出了CLeBPI (Bug优先级推理的对比学习)，它利用预训练语言模型和对比学习来解决上述两个问题。具体来说，CLeBPI首先以一种自监督的方式在大规模的错误报告语料库上进行预训练，因此它可以自动学习错误报告的上下文表示，而无需手动特征工程。之后，通过对比学习目标对它进行进一步的预训练，这使它能够区分错误报告之间的语义差异，为每个错误报告学习更精确的上下文表示。在完成预训练后，我们可以将分类层连接到CLeBPI上，并以监督的方式对其进行微调以进行bug优先级推断。为了验证CLeBPI的有效性，我们选择了四种基线方法，并在公共数据集上进行了对比实验。实验结果表明，在加权平均f1得分方面，CLeBPI优于所有基线方法23.86% ~ 77.80%，显示了其有效性。

{"title":"CLeBPI: Contrastive Learning for Bug Priority Inference","authors":"Wen-Yen Wang, Chenhao Wu, Jie He","doi":"10.48550/arXiv.2212.01011","DOIUrl":"https://doi.org/10.48550/arXiv.2212.01011","url":null,"abstract":"Automated bug priority inference can reduce the time overhead of bug triagers for priority assignments, improving the efficiency of software maintenance. Currently, there are two orthogonal lines for this task, i.e., traditional machine learning based (TML-based) and neural network based (NN-based) approaches. Although these approaches achieve competitive performance, our observation finds that existing approaches face the following two issues: 1) TML-based approaches require much manual feature engineering and cannot learn the semantic information of bug reports; 2) Both TML-based and NN-based approaches cannot effectively address the label imbalance problem because they are difficult to distinguish the semantic difference between bug reports with different priorities. In this paper, we propose CLeBPI (Contrastive Learning for Bug Priority Inference), which leverages pre-trained language model and contrastive learning to tackle the above-mentioned two issues. Specifically, CLeBPI is first pre-trained on a large-scale bug report corpus in a self-supervised way, thus it can automatically learn contextual representations of bug reports without manual feature engineering. Afterward, it is further pre-trained by a contrastive learning objective, which enables it to distinguish semantic differences between bug reports, learning more precise contextual representations for each bug report. When finishing pre-training, we can connect a classification layer to CLeBPI and fine-tune it for bug priority inference in a supervised way. To verify the effectiveness of CLeBPI, we choose four baseline approaches and conduct comparison experiments on a public dataset. The experimental results show that CLeBPI outperforms all baseline approaches by 23.86%-77.80% in terms of weighted average F1-score, showing its effectiveness.","PeriodicalId":133352,"journal":{"name":"Inf. Softw. Technol.","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116718287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Composite refactoring: Representations, characteristics and effects on software projects 复合重构:对软件项目的表示、特征和影响

Inf. Softw. Technol.

Pub Date : 2022-12-01 DOI: 10.2139/ssrn.4119519

Ana Carla Bibiano, Anderson G. Uchôa, W. K. Assunção, Daniel Oliveira, T. Colanzi, S. Vergilio, Alessandro F. Garcia

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Inf. Softw. Technol.

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀