首页 > 最新文献

2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)最新文献

英文 中文
Reducing Experiment Costs in Automated Software Performance Regression Detection 降低自动化软件性能回归检测的实验成本
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00017
Milad Abdullah, L. Bulej, T. Bures, P. Hnetynka, Vojtech Horký, P. Tůma
In this position paper we formulate performance regression testing as an automated experimentation problem and focus on the problem of controlling the experiment so as to provide more computation time to experiments that are more likely to detect performance changes. Conversely, this requires detecting and stopping experiments early if they are unlikely to detect any performance changes. To this end, we present a method that uses results from previous performance testing experiments to predict the outcome of new experiments in early stages of their execution.
在本文中,我们将性能回归测试表述为一个自动化的实验问题,并重点关注控制实验的问题,以便为更有可能检测到性能变化的实验提供更多的计算时间。相反,如果不太可能检测到任何性能变化,则需要及早检测并停止实验。为此,我们提出了一种方法,该方法使用以前的性能测试实验的结果来预测新实验执行的早期阶段的结果。
{"title":"Reducing Experiment Costs in Automated Software Performance Regression Detection","authors":"Milad Abdullah, L. Bulej, T. Bures, P. Hnetynka, Vojtech Horký, P. Tůma","doi":"10.1109/SEAA56994.2022.00017","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00017","url":null,"abstract":"In this position paper we formulate performance regression testing as an automated experimentation problem and focus on the problem of controlling the experiment so as to provide more computation time to experiments that are more likely to detect performance changes. Conversely, this requires detecting and stopping experiments early if they are unlikely to detect any performance changes. To this end, we present a method that uses results from previous performance testing experiments to predict the outcome of new experiments in early stages of their execution.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115932299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Effort Prediction with Limited Data: A Case Study for Data Warehouse Projects 有限数据下的工作量预测:数据仓库项目的案例研究
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00044
Hüseyin Ünlü, Ali Yildiz, Onur Demirörs
Organizations may create a sustainable competitive advantage against competitors by using data warehouse systems with which they can assess the current status of their operations at any moment. They can analyze trends and connections using up-to-date data. However, data warehouse projects tend to fail more often than other projects as it can be tough to estimate the effort required to build a data warehouse system. Functional size measurement is one of the methods used as an input for estimating the amount of work in a software project. In this study, we formed a measurement basis for DWH projects in an organization based on the COSMIC Functional Size Measurement Method. We mapped COSMIC rules on two different architectures used for DWH projects in the organization and measured the size of the projects. We calculated the productivity of the projects and compared them with the organization’s previous projects and DWH projects in the ISBSG repository. We could not create an organization-wide effort estimation model as we had a limited number of projects. As an alternative, we evaluated the success of effort estimation using DWH projects in the ISBSG repository. We also reported the challenges we faced during the size measurement process.
通过使用数据仓库系统,组织可以随时评估其操作的当前状态,从而创建相对于竞争对手的可持续竞争优势。他们可以使用最新的数据分析趋势和联系。然而,数据仓库项目比其他项目更容易失败,因为很难估计构建数据仓库系统所需的工作量。功能大小度量是用于估计软件项目中工作量的输入方法之一。在本研究中,我们基于COSMIC功能规模测量方法形成了一个组织DWH项目的测量基础。我们将COSMIC规则映射到组织中用于DWH项目的两种不同体系结构上,并测量了项目的大小。我们计算了项目的生产率,并将它们与ISBSG存储库中的组织以前的项目和DWH项目进行了比较。由于项目数量有限,我们无法创建组织范围内的工作量评估模型。作为替代方案,我们使用ISBSG存储库中的DWH项目评估了工作量估算的成功。我们还报告了我们在尺寸测量过程中所面临的挑战。
{"title":"Effort Prediction with Limited Data: A Case Study for Data Warehouse Projects","authors":"Hüseyin Ünlü, Ali Yildiz, Onur Demirörs","doi":"10.1109/SEAA56994.2022.00044","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00044","url":null,"abstract":"Organizations may create a sustainable competitive advantage against competitors by using data warehouse systems with which they can assess the current status of their operations at any moment. They can analyze trends and connections using up-to-date data. However, data warehouse projects tend to fail more often than other projects as it can be tough to estimate the effort required to build a data warehouse system. Functional size measurement is one of the methods used as an input for estimating the amount of work in a software project. In this study, we formed a measurement basis for DWH projects in an organization based on the COSMIC Functional Size Measurement Method. We mapped COSMIC rules on two different architectures used for DWH projects in the organization and measured the size of the projects. We calculated the productivity of the projects and compared them with the organization’s previous projects and DWH projects in the ISBSG repository. We could not create an organization-wide effort estimation model as we had a limited number of projects. As an alternative, we evaluated the success of effort estimation using DWH projects in the ISBSG repository. We also reported the challenges we faced during the size measurement process.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115559382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Microservices smell detection through dynamic analysis 通过动态分析进行微服务气味检测
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00052
Paolo Bacchiega, Ilaria Pigazzini, F. Fontana
The past few years saw the rise of microservices studies and best practices, along with wide industrial adoption of this architectural style. We now witness the birth of another challenging topic: microservices quality. Like other kinds of architectures, also microservices suffer from erosion and technical debt, whose symptoms can be the appearance of microservices smells, which impact negatively on the system’s quality, by hindering, for example, its maintainability. In this paper we propose a tool called Aroma, to reconstruct microservices architectures and detect microservices smells, based on the dynamic analysis of microservices execution traces. We describe the main features of the tool, the strategies adopted for microservice smells detection and the first preliminary experimentation.
在过去的几年里,微服务研究和最佳实践的兴起,以及这种架构风格在工业上的广泛采用。现在我们见证了另一个具有挑战性的话题的诞生:微服务质量。与其他类型的体系结构一样,微服务也受到侵蚀和技术债务的影响,其症状可能是微服务气味的出现,这会对系统的质量产生负面影响,例如阻碍其可维护性。在本文中,我们提出了一个名为Aroma的工具,基于对微服务执行轨迹的动态分析,来重建微服务架构并检测微服务气味。我们描述了该工具的主要特征,微服务气味检测所采用的策略和第一个初步实验。
{"title":"Microservices smell detection through dynamic analysis","authors":"Paolo Bacchiega, Ilaria Pigazzini, F. Fontana","doi":"10.1109/SEAA56994.2022.00052","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00052","url":null,"abstract":"The past few years saw the rise of microservices studies and best practices, along with wide industrial adoption of this architectural style. We now witness the birth of another challenging topic: microservices quality. Like other kinds of architectures, also microservices suffer from erosion and technical debt, whose symptoms can be the appearance of microservices smells, which impact negatively on the system’s quality, by hindering, for example, its maintainability. In this paper we propose a tool called Aroma, to reconstruct microservices architectures and detect microservices smells, based on the dynamic analysis of microservices execution traces. We describe the main features of the tool, the strategies adopted for microservice smells detection and the first preliminary experimentation.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114605005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The Role Of Post-Release Software Traceability in Release Engineering: A Software-Intensive Embedded Systems Case Study From The Telecommunications Domain 发布后软件可追溯性在发布工程中的作用:来自电信领域的软件密集型嵌入式系统案例研究
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00034
Anas Dakkak, Jan Bosch, H. H. Olsson
Modern release engineering practices such as continuous integration and delivery have allowed software development companies to transition from a long release cycle to a shorter one. The shorter release cycle has led to more software releases available to customers. At the same time, companies developing high-volume software-intensive embedded systems often deliver patch releases and maintenance releases on top of major and minor releases to customers who pick and choose what releases apply to them and decide when to upgrade the system, if to upgrade at all. While release engineering has been studied before in web-based, desktop-based, and embedded software, the focus has been on pre-release activities. Few studies have investigated what happens after the release, particularly the role of tracing software from release to deployment in high-volume software-intensive embedded systems. To address this gap, we conducted a qualitative case study at a multi-national telecommunications systems provider focusing on Radio Access Network (RAN) software. RAN software is a complex and large-scale embedded software used in mobile networks Base Stations (BS), providing software functionality for RAN mobile technologies ranging from 2G to 5G. Our study shed light on post-release software traceability and how it is used in the release engineering process.
像持续集成和交付这样的现代发布工程实践已经允许软件开发公司从较长的发布周期过渡到较短的发布周期。更短的发布周期为客户提供了更多的软件版本。与此同时,开发大量软件密集型嵌入式系统的公司经常在主要和次要版本之上向客户交付补丁版本和维护版本,客户选择适用于他们的版本,并决定何时升级系统,如果要升级的话。虽然发布工程以前已经在基于web的、基于桌面的和嵌入式软件中进行了研究,但重点一直放在发布前的活动上。很少有研究调查发布之后发生了什么,特别是在大容量软件密集型嵌入式系统中跟踪软件从发布到部署的角色。为了解决这一差距,我们在一家专注于无线接入网络(RAN)软件的跨国电信系统供应商进行了定性案例研究。RAN软件是一种复杂的大规模嵌入式软件,用于移动网络基站(BS),为2G到5G的RAN移动技术提供软件功能。我们的研究揭示了发布后软件的可追溯性,以及如何在发布工程过程中使用它。
{"title":"The Role Of Post-Release Software Traceability in Release Engineering: A Software-Intensive Embedded Systems Case Study From The Telecommunications Domain","authors":"Anas Dakkak, Jan Bosch, H. H. Olsson","doi":"10.1109/SEAA56994.2022.00034","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00034","url":null,"abstract":"Modern release engineering practices such as continuous integration and delivery have allowed software development companies to transition from a long release cycle to a shorter one. The shorter release cycle has led to more software releases available to customers. At the same time, companies developing high-volume software-intensive embedded systems often deliver patch releases and maintenance releases on top of major and minor releases to customers who pick and choose what releases apply to them and decide when to upgrade the system, if to upgrade at all. While release engineering has been studied before in web-based, desktop-based, and embedded software, the focus has been on pre-release activities. Few studies have investigated what happens after the release, particularly the role of tracing software from release to deployment in high-volume software-intensive embedded systems. To address this gap, we conducted a qualitative case study at a multi-national telecommunications systems provider focusing on Radio Access Network (RAN) software. RAN software is a complex and large-scale embedded software used in mobile networks Base Stations (BS), providing software functionality for RAN mobile technologies ranging from 2G to 5G. Our study shed light on post-release software traceability and how it is used in the release engineering process.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114711039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Simple and Complex Models’ Performance When Predicting Accepted Answers on Stack Overflow 在堆栈溢出预测可接受答案时评估简单和复杂模型的性能
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00014
Osayande P. Omondiagbe, Sherlock A. Licorish, Stephen G. MacDonell
Stack Overflow is used to solve programming issues during software development. Research efforts have looked to identify relevant content on this platform. In particular, researchers have proposed various modelling techniques to predict acceptable Stack Overflow answers. Less interest, however, has been dedicated to examining the performance and quality of typically used modelling methods with respect to the model and feature complexity. Such insights could be of practical significance to the many practitioners who develop models for Stack Overflow. This study examines the performance and quality of two modelling methods, of varying degree of complexity, used for predicting Java and JavaScript acceptable answers on Stack Overflow. Our dataset comprised 249,588 posts drawn from years 2014-2016. Outcomes reveal significant differences in models’ performances and quality given the type of features and complexity of models used. Researchers examining model performance and quality and feature complexity may leverage these findings in selecting suitable modelling approaches for Q&A prediction.
堆栈溢出用于解决软件开发过程中的编程问题。研究工作旨在确定该平台上的相关内容。特别是,研究人员提出了各种建模技术来预测可接受的堆栈溢出答案。然而,较少的兴趣被用于检查关于模型和特征复杂性的典型建模方法的性能和质量。这样的见解对于许多为Stack Overflow开发模型的实践者来说可能具有实际意义。本研究考察了两种不同复杂程度的建模方法的性能和质量,用于预测Java和JavaScript在堆栈溢出上的可接受答案。我们的数据集包括2014-2016年的249588个帖子。结果显示,给定的特征类型和模型的复杂性,模型的性能和质量存在显著差异。研究人员检查模型的性能、质量和特征复杂性,可以利用这些发现来选择合适的建模方法进行问答预测。
{"title":"Evaluating Simple and Complex Models’ Performance When Predicting Accepted Answers on Stack Overflow","authors":"Osayande P. Omondiagbe, Sherlock A. Licorish, Stephen G. MacDonell","doi":"10.1109/SEAA56994.2022.00014","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00014","url":null,"abstract":"Stack Overflow is used to solve programming issues during software development. Research efforts have looked to identify relevant content on this platform. In particular, researchers have proposed various modelling techniques to predict acceptable Stack Overflow answers. Less interest, however, has been dedicated to examining the performance and quality of typically used modelling methods with respect to the model and feature complexity. Such insights could be of practical significance to the many practitioners who develop models for Stack Overflow. This study examines the performance and quality of two modelling methods, of varying degree of complexity, used for predicting Java and JavaScript acceptable answers on Stack Overflow. Our dataset comprised 249,588 posts drawn from years 2014-2016. Outcomes reveal significant differences in models’ performances and quality given the type of features and complexity of models used. Researchers examining model performance and quality and feature complexity may leverage these findings in selecting suitable modelling approaches for Q&A prediction.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129957589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-end Timing Model Extraction from TSN-Aware Distributed Vehicle Software 基于tsn感知的分布式车辆软件端到端时序模型提取
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00064
Bahar Houtan, Mehmet Onur Aybek, M. Ashjaei, M. Daneshtalab, Mikael Sjödin, S. Mubeen
Extraction of end-to-end timing information from software architectures of vehicular systems to support their timing analysis is a daunting challenge. To address this challenge, this paper presents a systematic method to extract this information from vehicular software architectures that can be distributed over several electronic control units connected by Time-Sensitive Networking (TSN) networks. As a proof of concept, the proposed extraction method is applied to an industrial component model, namely the Rubus Component Model (RCM), and its toolchain. Furthermore, the usability of the proposed method is demonstrated in an industrial use case from the vehicular domain.
从车辆系统的软件架构中提取端到端定时信息以支持其定时分析是一项艰巨的挑战。为了应对这一挑战,本文提出了一种系统的方法,从车辆软件架构中提取这些信息,这些信息可以分布在通过时间敏感网络(TSN)网络连接的多个电子控制单元上。作为概念验证,将提出的提取方法应用于工业组件模型,即Rubus组件模型(RCM)及其工具链。此外,在车辆领域的一个工业用例中验证了所提出方法的可用性。
{"title":"End-to-end Timing Model Extraction from TSN-Aware Distributed Vehicle Software","authors":"Bahar Houtan, Mehmet Onur Aybek, M. Ashjaei, M. Daneshtalab, Mikael Sjödin, S. Mubeen","doi":"10.1109/SEAA56994.2022.00064","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00064","url":null,"abstract":"Extraction of end-to-end timing information from software architectures of vehicular systems to support their timing analysis is a daunting challenge. To address this challenge, this paper presents a systematic method to extract this information from vehicular software architectures that can be distributed over several electronic control units connected by Time-Sensitive Networking (TSN) networks. As a proof of concept, the proposed extraction method is applied to an industrial component model, namely the Rubus Component Model (RCM), and its toolchain. Furthermore, the usability of the proposed method is demonstrated in an industrial use case from the vehicular domain.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129739894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning in Scientific Applications 在科学应用中通过基于交互式聚类的自动调优简化机器学习解决方案的重用
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00011
H. Hajiabadi, Lennart Hilbert, A. Koziolek
Machine learning techniques have revolutionised scientific software projects. Scientists are continuously looking for novel approaches to production-quality reuse of machine learning solutions and to make them available to other components of the project with satisfactory quality and low costs. However, scientists often have limited knowledge about how to effectively reuse and adjust machine learning solutions in their particular scientific project. One challenge is that many machine learning solutions require parameter tuning based on the input data to achieve satisfactory results, which is difficult and cumbersome for users not familiar with machine learning. Autotuning is the common technique for potentially adjusting the parameters based on the data, but it requires a well-defined objective function to optimize for. Such an objective function is commonly unknown in exploratory scientific research such as biological image segmentation tasks. In this paper, we propose a framework based on the novel combination of autotuning and active learning to ease and partially automate the reuse effort of machine learning solutions for scientists in biological image segmentation cases. Underlying this combination is a mapping between an object type and specific parameters applied during the segmentation process. This mapping is iteratively adjusted by asking users for visual feedback. We then through a biological case study demonstrate that our method enables tuning of the segmentation specifically to object types, while the selective requests of user input reduce the number of user interactions required for this task.
机器学习技术已经彻底改变了科学软件项目。科学家们一直在寻找新的方法来实现机器学习解决方案的生产质量重用,并使它们能够以令人满意的质量和低成本提供给项目的其他组件。然而,科学家们通常对如何在特定的科学项目中有效地重用和调整机器学习解决方案知之甚少。一个挑战是,许多机器学习解决方案需要根据输入数据进行参数调优才能达到满意的结果,这对于不熟悉机器学习的用户来说是困难和繁琐的。自动调优是基于数据调整参数的常用技术,但它需要一个定义良好的目标函数来进行优化。这样的目标函数在探索性科学研究如生物图像分割任务中通常是未知的。在本文中,我们提出了一个基于自动调整和主动学习的新组合的框架,以减轻和部分自动化科学家在生物图像分割案例中的机器学习解决方案的重用工作。这种组合的基础是对象类型和在分割过程中应用的特定参数之间的映射。这种映射是通过要求用户提供视觉反馈来迭代调整的。然后,我们通过一个生物学案例研究证明,我们的方法可以根据对象类型调整分割,而用户输入的选择性请求减少了该任务所需的用户交互次数。
{"title":"Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning in Scientific Applications","authors":"H. Hajiabadi, Lennart Hilbert, A. Koziolek","doi":"10.1109/SEAA56994.2022.00011","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00011","url":null,"abstract":"Machine learning techniques have revolutionised scientific software projects. Scientists are continuously looking for novel approaches to production-quality reuse of machine learning solutions and to make them available to other components of the project with satisfactory quality and low costs. However, scientists often have limited knowledge about how to effectively reuse and adjust machine learning solutions in their particular scientific project. One challenge is that many machine learning solutions require parameter tuning based on the input data to achieve satisfactory results, which is difficult and cumbersome for users not familiar with machine learning. Autotuning is the common technique for potentially adjusting the parameters based on the data, but it requires a well-defined objective function to optimize for. Such an objective function is commonly unknown in exploratory scientific research such as biological image segmentation tasks. In this paper, we propose a framework based on the novel combination of autotuning and active learning to ease and partially automate the reuse effort of machine learning solutions for scientists in biological image segmentation cases. Underlying this combination is a mapping between an object type and specific parameters applied during the segmentation process. This mapping is iteratively adjusted by asking users for visual feedback. We then through a biological case study demonstrate that our method enables tuning of the segmentation specifically to object types, while the selective requests of user input reduce the number of user interactions required for this task.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123085129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maintainability Challenges in ML: A Systematic Literature Review 机器学习中的可维护性挑战:系统文献综述
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00018
Karthik Shivashankar, A. Martini
Background: As Machine Learning (ML) advances rapidly in many fields, it is being adopted by academics and businesses alike. However, ML has a number of different challenges in terms of maintenance not found in traditional software projects. Identifying what causes these maintainability challenges can help mitigate them early and continue delivering value in the long run without degrading ML performance. Aim: This study aims to identify and synthesise the maintainability challenges in different stages of the ML workflow and understand how these stages are interdependent and impact each other’s maintainability. Method: Using a systematic literature review, we screened more than 13000 papers, then selected and qualitatively analysed 56 of them. Results: (i) a catalogue of maintainability challenges in different stages of Data Engineering, Model Engineering workflows and the current challenges when building ML systems are discussed; (ii) a map of 13 maintainability challenges to different interdependent stages of ML that impact the overall workflow; (iii) Provided insights to developers of ML tools and researchers. Conclusions: In this study, practitioners and organisations will learn about maintainability challenges and their impact at different stages of ML workflow. This will enable them to avoid pitfalls and help to build a maintainable ML system. The implications and challenges will also serve as a basis for future research to strengthen our understanding of the ML system’s maintainability.
背景:随着机器学习(ML)在许多领域的迅速发展,它正在被学术界和企业界所采用。然而,ML在维护方面有许多不同的挑战,这在传统软件项目中是没有的。确定导致这些可维护性挑战的原因可以帮助尽早减轻这些挑战,并在不降低ML性能的情况下长期持续交付价值。目的:本研究旨在识别和综合机器学习工作流程不同阶段的可维护性挑战,并了解这些阶段是如何相互依赖并相互影响的。方法:采用系统文献综述的方法,从13000余篇文献中筛选出56篇进行定性分析。结果:(i)讨论了数据工程、模型工程工作流程不同阶段的可维护性挑战,以及构建ML系统时当前面临的挑战;(ii) 13个可维护性挑战的地图,这些挑战涉及机器学习的不同相互依存阶段,影响整个工作流程;(iii)为机器学习工具的开发人员和研究人员提供见解。结论:在本研究中,从业者和组织将了解可维护性挑战及其在ML工作流程不同阶段的影响。这将使他们能够避免陷阱,并有助于构建可维护的ML系统。这些影响和挑战也将作为未来研究的基础,以加强我们对机器学习系统可维护性的理解。
{"title":"Maintainability Challenges in ML: A Systematic Literature Review","authors":"Karthik Shivashankar, A. Martini","doi":"10.1109/SEAA56994.2022.00018","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00018","url":null,"abstract":"Background: As Machine Learning (ML) advances rapidly in many fields, it is being adopted by academics and businesses alike. However, ML has a number of different challenges in terms of maintenance not found in traditional software projects. Identifying what causes these maintainability challenges can help mitigate them early and continue delivering value in the long run without degrading ML performance. Aim: This study aims to identify and synthesise the maintainability challenges in different stages of the ML workflow and understand how these stages are interdependent and impact each other’s maintainability. Method: Using a systematic literature review, we screened more than 13000 papers, then selected and qualitatively analysed 56 of them. Results: (i) a catalogue of maintainability challenges in different stages of Data Engineering, Model Engineering workflows and the current challenges when building ML systems are discussed; (ii) a map of 13 maintainability challenges to different interdependent stages of ML that impact the overall workflow; (iii) Provided insights to developers of ML tools and researchers. Conclusions: In this study, practitioners and organisations will learn about maintainability challenges and their impact at different stages of ML workflow. This will enable them to avoid pitfalls and help to build a maintainable ML system. The implications and challenges will also serve as a basis for future research to strengthen our understanding of the ML system’s maintainability.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122911930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Search Budget in Multi-Objective Refactoring optimization: a Model-Based Empirical Study 多目标重构优化中的搜索预算:基于模型的实证研究
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00070
Daniele Di Pompeo, Michele Tucci
Software model optimization is the task of automatically generate design alternatives, usually to improve quality aspects of software that are quantifiable, like performance and reliability. In this context, multi-objective optimization techniques have been applied to help the designer find suitable tradeoffs among several non-functional properties. In this process, design alternatives can be generated through automated model refactoring, and evaluated on non-functional models. Due to their complexity, this type of optimization tasks require considerable time and resources, often limiting their application in software engineering processes.In this paper, we investigate the effects of using a search budget, specifically a time limit, to the search for new solutions. We performed experiments to quantify the impact that a change in the search budget may have on the quality of solutions. Furthermore, we analyzed how different genetic algorithms (i.e., NSGh-II, SPEh2, and PESA2) perform when imposing different budgets. We experimented on two case studies of different size, complexity, and domain.We observed that imposing a search budget considerably deteriorates the quality of the generated solutions, but the specific algorithm we choose seems to play a crucial role. From our experiments, NSGh-II is the fastest algorithm, while PESA2 generates solutions with the highest quality. Differently, SPEh2 is the slowest algorithm, and produces the solutions with the lowest quality.
软件模型优化是自动生成设计备选方案的任务,通常是为了改进可量化的软件质量方面,如性能和可靠性。在这种情况下,多目标优化技术被应用于帮助设计者在几个非功能属性之间找到合适的权衡。在此过程中,可以通过自动模型重构生成设计备选方案,并在非功能模型上进行评估。由于其复杂性,这种类型的优化任务需要大量的时间和资源,通常限制了它们在软件工程过程中的应用。在本文中,我们研究了使用搜索预算,特别是时间限制,对搜索新解决方案的影响。我们进行了实验来量化搜索预算的变化对解决方案质量的影响。此外,我们分析了不同的遗传算法(即NSGh-II, SPEh2和PESA2)在施加不同预算时的表现。我们对两个不同规模、复杂性和领域的案例研究进行了实验。我们观察到,强加搜索预算大大降低了生成解决方案的质量,但我们选择的特定算法似乎起着至关重要的作用。从我们的实验来看,NSGh-II是最快的算法,而PESA2生成的解质量最高。不同的是,SPEh2是最慢的算法,产生的解质量最低。
{"title":"Search Budget in Multi-Objective Refactoring optimization: a Model-Based Empirical Study","authors":"Daniele Di Pompeo, Michele Tucci","doi":"10.1109/SEAA56994.2022.00070","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00070","url":null,"abstract":"Software model optimization is the task of automatically generate design alternatives, usually to improve quality aspects of software that are quantifiable, like performance and reliability. In this context, multi-objective optimization techniques have been applied to help the designer find suitable tradeoffs among several non-functional properties. In this process, design alternatives can be generated through automated model refactoring, and evaluated on non-functional models. Due to their complexity, this type of optimization tasks require considerable time and resources, often limiting their application in software engineering processes.In this paper, we investigate the effects of using a search budget, specifically a time limit, to the search for new solutions. We performed experiments to quantify the impact that a change in the search budget may have on the quality of solutions. Furthermore, we analyzed how different genetic algorithms (i.e., NSGh-II, SPEh2, and PESA2) perform when imposing different budgets. We experimented on two case studies of different size, complexity, and domain.We observed that imposing a search budget considerably deteriorates the quality of the generated solutions, but the specific algorithm we choose seems to play a crucial role. From our experiments, NSGh-II is the fastest algorithm, while PESA2 generates solutions with the highest quality. Differently, SPEh2 is the slowest algorithm, and produces the solutions with the lowest quality.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126406123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Handling Environmental Uncertainty in Design Time Access Control Analysis 设计时访问控制分析中的环境不确定性处理
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00067
Nicolas Boltz, Sebastian Hahner, Maximilian Walter, Stephan Seifermann, R. Heinrich, T. Bures, P. Hnetynka
The high complexity, connectivity, and data exchange of modern software systems make it crucial to consider confidentiality early. An often used mechanism to ensure confidentiality is access control. When the system is modeled during design time, access control can already be analyzed. This enables early identification of confidentiality violations and the ability to analyze the impact of what-if scenarios. However, due to the abstract view of the design time model and the ambiguity in the early stages of development, uncertainties exist in the system environment. These uncertainties can have a direct effect on the validity of access control attributes in use, which might result in compromised confidentiality.To handle such known uncertainty, we present a notion of confidence in the context of design time access control. We define confidence as a composition of known uncertainties in the environment of the system, which influence the validity of access control attributes. We extend an existing modeling and analysis approach for design time access control with our notion of confidence. For evaluation, we apply the notion of confidence to multiple real-world case studies and discuss the resulting benefits for different stages of system development. We also analyze the expressiveness of the extended approach in defining confidentiality constraints and measure the accuracy in identifying confidentiality violations. Our results show that using the notion of confidence increases expressiveness while being able to accurately identify access control violations.
现代软件系统的高度复杂性、连接性和数据交换使得尽早考虑机密性至关重要。确保机密性的常用机制是访问控制。当在设计阶段对系统进行建模时,就可以对访问控制进行分析。这使得能够及早识别违反机密性的行为,并能够分析假设情景的影响。然而,由于设计时间模型的抽象观点和开发初期的模糊性,使得系统环境存在不确定性。这些不确定性会直接影响使用中的访问控制属性的有效性,从而可能导致机密性受损。为了处理这种已知的不确定性,我们在设计时访问控制上下文中提出了信心的概念。我们将置信度定义为系统环境中已知不确定性的组合,这些不确定性会影响访问控制属性的有效性。我们扩展了现有的建模和分析方法,使用我们的信心概念来进行设计时访问控制。为了评估,我们将信心的概念应用到多个真实世界的案例研究中,并讨论系统开发不同阶段的结果收益。我们还分析了扩展方法在定义保密约束方面的表达性,并测量了识别保密违规行为的准确性。我们的结果表明,使用信心的概念增加了表达性,同时能够准确地识别访问控制违规。
{"title":"Handling Environmental Uncertainty in Design Time Access Control Analysis","authors":"Nicolas Boltz, Sebastian Hahner, Maximilian Walter, Stephan Seifermann, R. Heinrich, T. Bures, P. Hnetynka","doi":"10.1109/SEAA56994.2022.00067","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00067","url":null,"abstract":"The high complexity, connectivity, and data exchange of modern software systems make it crucial to consider confidentiality early. An often used mechanism to ensure confidentiality is access control. When the system is modeled during design time, access control can already be analyzed. This enables early identification of confidentiality violations and the ability to analyze the impact of what-if scenarios. However, due to the abstract view of the design time model and the ambiguity in the early stages of development, uncertainties exist in the system environment. These uncertainties can have a direct effect on the validity of access control attributes in use, which might result in compromised confidentiality.To handle such known uncertainty, we present a notion of confidence in the context of design time access control. We define confidence as a composition of known uncertainties in the environment of the system, which influence the validity of access control attributes. We extend an existing modeling and analysis approach for design time access control with our notion of confidence. For evaluation, we apply the notion of confidence to multiple real-world case studies and discuss the resulting benefits for different stages of system development. We also analyze the expressiveness of the extended approach in defining confidentiality constraints and measure the accuracy in identifying confidentiality violations. Our results show that using the notion of confidence increases expressiveness while being able to accurately identify access control violations.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114480454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1