首页 > 最新文献

2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)最新文献

英文 中文
WALTS: Walmart AutoML Libraries, Tools and Services WALTS:沃尔玛自动化库、工具和服务
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00013
Rahul Bajaj, Kunal Banerjee, Lalitdutt Parsai, Deepanshu Goyal, Sachin Parmar, Divyajyothi Bn, Balamurugan Subramaniam, Chaitanya Sai, Tarun Balotia, Anirban Chatterjee, Kailash Sati
Automated Machine Learning (AutoML) is an upcoming field in machine learning (ML) that searches the candidate model space for a given task, dataset and an evaluation metric and returns the best performing model on the supplied dataset as per the given metric. AutoML not only reduces the man-power and expertise needed to develop ML models but also decreases the time-to-market for ML models substantially. In Walmart, we have designed an enterprise-scale AutoML frame-work called WALTS to meet the rising demand of employing ML in the retail business, and thus help democratize ML within our organization. In this work, we delve into the design of WALTS from both algorithmic and architectural perspectives. Specfiically, we elaborate on how we explore models from a pool of candidates along with describing our choice of technology stack to make the whole process scalable and robust. We illustrate the process with the help of a business use-case, and finally underline how WALTS has impacted our business so far.
自动化机器学习(AutoML)是机器学习(ML)中一个即将到来的领域,它为给定的任务、数据集和评估指标搜索候选模型空间,并根据给定的指标返回所提供数据集上表现最好的模型。AutoML不仅减少了开发ML模型所需的人力和专业知识,还大大缩短了ML模型的上市时间。在沃尔玛,我们设计了一个名为WALTS的企业级AutoML框架,以满足在零售业务中使用ML的日益增长的需求,从而帮助ML在我们的组织中民主化。在这项工作中,我们从算法和建筑的角度深入研究了WALTS的设计。具体来说,我们详细说明了如何从候选模型池中探索模型,并描述了我们选择的技术堆栈,以使整个过程可扩展和健壮。我们在一个业务用例的帮助下说明了这个过程,最后强调了到目前为止,华尔兹是如何影响我们的业务的。
{"title":"WALTS: Walmart AutoML Libraries, Tools and Services","authors":"Rahul Bajaj, Kunal Banerjee, Lalitdutt Parsai, Deepanshu Goyal, Sachin Parmar, Divyajyothi Bn, Balamurugan Subramaniam, Chaitanya Sai, Tarun Balotia, Anirban Chatterjee, Kailash Sati","doi":"10.1109/SEAA56994.2022.00013","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00013","url":null,"abstract":"Automated Machine Learning (AutoML) is an upcoming field in machine learning (ML) that searches the candidate model space for a given task, dataset and an evaluation metric and returns the best performing model on the supplied dataset as per the given metric. AutoML not only reduces the man-power and expertise needed to develop ML models but also decreases the time-to-market for ML models substantially. In Walmart, we have designed an enterprise-scale AutoML frame-work called WALTS to meet the rising demand of employing ML in the retail business, and thus help democratize ML within our organization. In this work, we delve into the design of WALTS from both algorithmic and architectural perspectives. Specfiically, we elaborate on how we explore models from a pool of candidates along with describing our choice of technology stack to make the whole process scalable and robust. We illustrate the process with the help of a business use-case, and finally underline how WALTS has impacted our business so far.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129405293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Utilization of Three Software Size Measures for Effort Estimation in Agile World: A Case Study 在敏捷世界中,三种软件规模度量在工作量估算中的应用:一个案例研究
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00045
Hüseyin Ünlü, Tuna Hacaloglu, Fatma Büber, Kıvılcım Berrak, Onur Leblebici, Onur Demirörs
Functional size measurement (FSM) methods, by being systematic and repeatable, are beneficial in the early phases of the software life cycle for core project management activities such as effort, cost, and schedule estimation. However, in agile projects, requirements are kept minimal in the early phases and are detailed over time as the project progresses. This situation makes it challenging to identify measurement components of FSM methods from requirements in the early phases, hence complicates applying FSM in agile projects. In addition, the existing FSM methods are not fully compatible with today’s architectural styles, which are evolving into event-driven decentralized structures. In this study, we present the results of a case study to compare the effectiveness of different size measures: functional -COSMIC Function Points (CFP)-, event-based - Event Points-, and code length-based - Line of Code (LOC)-on projects that were developed with agile methods and utilized a microservice-based architecture. For this purpose, we measured the size of the project and created effort estimation models based on three methods. It is found that the event-based method estimated effort with better accuracy than the CFP and LOC-based methods.
功能大小度量(FSM)方法由于具有系统性和可重复性,在软件生命周期的早期阶段对核心项目管理活动(如工作量、成本和进度估计)是有益的。然而,在敏捷项目中,需求在早期阶段保持最小化,并随着项目的进展逐渐细化。这种情况使得在早期阶段从需求中识别FSM方法的度量组件变得具有挑战性,因此使FSM在敏捷项目中的应用变得复杂。此外,现有的FSM方法与当今的体系结构风格并不完全兼容,它们正在演变成事件驱动的分散结构。在本研究中,我们展示了一个案例研究的结果,以比较不同规模度量的有效性:功能性- cosmic功能点(CFP)-,基于事件的-事件点-和基于代码长度的-代码行(LOC)-在使用敏捷方法开发并利用基于微服务的架构的项目上。为此,我们测量了项目的大小,并基于三种方法创建了工作量估计模型。结果表明,基于事件的估算方法比基于CFP和loc的方法具有更好的估算精度。
{"title":"Utilization of Three Software Size Measures for Effort Estimation in Agile World: A Case Study","authors":"Hüseyin Ünlü, Tuna Hacaloglu, Fatma Büber, Kıvılcım Berrak, Onur Leblebici, Onur Demirörs","doi":"10.1109/SEAA56994.2022.00045","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00045","url":null,"abstract":"Functional size measurement (FSM) methods, by being systematic and repeatable, are beneficial in the early phases of the software life cycle for core project management activities such as effort, cost, and schedule estimation. However, in agile projects, requirements are kept minimal in the early phases and are detailed over time as the project progresses. This situation makes it challenging to identify measurement components of FSM methods from requirements in the early phases, hence complicates applying FSM in agile projects. In addition, the existing FSM methods are not fully compatible with today’s architectural styles, which are evolving into event-driven decentralized structures. In this study, we present the results of a case study to compare the effectiveness of different size measures: functional -COSMIC Function Points (CFP)-, event-based - Event Points-, and code length-based - Line of Code (LOC)-on projects that were developed with agile methods and utilized a microservice-based architecture. For this purpose, we measured the size of the project and created effort estimation models based on three methods. It is found that the event-based method estimated effort with better accuracy than the CFP and LOC-based methods.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123233076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Timing is Everything! A Test and Production Class View of Self-Admitted Technical Debt 时机就是一切!自我承认技术债务的测试和生产类观点
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00056
S. Counsell, S. Swift
In this short paper, we investigate whether the “time of day” when recognised changes are made to code influences the self-admission of technical debt (SATD). We look at this question from a test and production class perspective. We examine if there is a specific time of day when technical debt is “self-admitted” more frequently and whether there are any similarities in this sense between test and production classes. We also analyse whether class complexity makes a difference to SATD occurrence. To facilitate our analysis, we used a data set of over 300k changes developed by Riquet et al., as a basis. Results suggest that a lower proportion of SATD occur in afternoons as opposed to mornings and that class complexity has a significant say in the role and application of SATD.
在这篇简短的论文中,我们调查了对代码进行公认更改的“时间”是否会影响技术债务的自我承认(SATD)。我们从测试类和生产类的角度来看这个问题。我们检查是否有一个特定的时间,技术债务更频繁地“自我承认”,以及测试类和生产类之间在这个意义上是否有任何相似之处。我们还分析了类复杂性是否会对SATD的发生产生影响。为了便于分析,我们使用了Riquet等人开发的超过30万次变化的数据集作为基础。结果表明,与上午相比,下午的SATD比例较低,班级复杂性对SATD的作用和应用有重要影响。
{"title":"Timing is Everything! A Test and Production Class View of Self-Admitted Technical Debt","authors":"S. Counsell, S. Swift","doi":"10.1109/SEAA56994.2022.00056","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00056","url":null,"abstract":"In this short paper, we investigate whether the “time of day” when recognised changes are made to code influences the self-admission of technical debt (SATD). We look at this question from a test and production class perspective. We examine if there is a specific time of day when technical debt is “self-admitted” more frequently and whether there are any similarities in this sense between test and production classes. We also analyse whether class complexity makes a difference to SATD occurrence. To facilitate our analysis, we used a data set of over 300k changes developed by Riquet et al., as a basis. Results suggest that a lower proportion of SATD occur in afternoons as opposed to mornings and that class complexity has a significant say in the role and application of SATD.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123326215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Living in a Pink Cloud or Fighting a Whack-a-Mole? On the Creation of Recurring Revenue Streams in the Embedded Systems Domain 生活在粉红云中还是打地鼠?论嵌入式系统领域经常性收入流的创造
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00033
H. H. Olsson, Jan Bosch
For companies in the embedded systems domain, digitalization and digital technologies allow endless opportunities for new business models and continuous value delivery. While physical products still provide the core revenue, these are rapidly being complemented with offerings that allow for recurring revenue and that are based on software, data and artificial intelligence (AI). However, while new digital offerings allow for fundamentally new and recurring revenue streams and continuous value delivery to customers, the creation of these proves to be a challenging endeavour. In this paper, we study how companies explore ways to create new or additional value with the intention to complement their product portfolio with offerings that allow for recurring revenue. Based on multi-case study research, we identify the key challenges that companies in the embedded systems domain experience and we derive four organizational patterns that we see slow down innovation. Second, we present a framework outlining alternative types of offerings to customers. Third, we provide a value taxonomy in which we detail the different types of offerings and the value these provide to customers. For each value offering, we indicate whether this offering is (1) static or evolving, (2) bundled or unbundled, (3) free or monetized, and we provide examples from the case companies we studied.
对于嵌入式系统领域的公司来说,数字化和数字技术为新的商业模式和持续的价值交付提供了无限的机会。虽然实体产品仍然提供核心收入,但这些产品正在迅速被基于软件、数据和人工智能(AI)的经常性收入产品所补充。然而,尽管新的数字产品从根本上带来了新的、经常性的收入流,并为客户提供了持续的价值,但这些产品的创造被证明是一项具有挑战性的努力。在本文中,我们研究了公司如何探索创造新价值或附加价值的方法,以便通过提供允许经常性收入的产品来补充其产品组合。基于多案例研究,我们确定了嵌入式系统领域公司面临的主要挑战,并得出了四种我们认为会减缓创新的组织模式。其次,我们提出了一个框架,概述了可供客户选择的产品类型。第三,我们提供了一个价值分类法,其中详细说明了不同类型的产品及其为客户提供的价值。对于每个价值产品,我们指出该产品是(1)静态的还是不断发展的,(2)捆绑的还是非捆绑的,(3)免费的还是货币化的,并且我们提供了我们研究的案例公司的例子。
{"title":"Living in a Pink Cloud or Fighting a Whack-a-Mole? On the Creation of Recurring Revenue Streams in the Embedded Systems Domain","authors":"H. H. Olsson, Jan Bosch","doi":"10.1109/SEAA56994.2022.00033","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00033","url":null,"abstract":"For companies in the embedded systems domain, digitalization and digital technologies allow endless opportunities for new business models and continuous value delivery. While physical products still provide the core revenue, these are rapidly being complemented with offerings that allow for recurring revenue and that are based on software, data and artificial intelligence (AI). However, while new digital offerings allow for fundamentally new and recurring revenue streams and continuous value delivery to customers, the creation of these proves to be a challenging endeavour. In this paper, we study how companies explore ways to create new or additional value with the intention to complement their product portfolio with offerings that allow for recurring revenue. Based on multi-case study research, we identify the key challenges that companies in the embedded systems domain experience and we derive four organizational patterns that we see slow down innovation. Second, we present a framework outlining alternative types of offerings to customers. Third, we provide a value taxonomy in which we detail the different types of offerings and the value these provide to customers. For each value offering, we indicate whether this offering is (1) static or evolving, (2) bundled or unbundled, (3) free or monetized, and we provide examples from the case companies we studied.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131422194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the impact of scenario and distance information on the reliability assessment of multi-sensor systems 探讨场景和距离信息对多传感器系统可靠性评估的影响
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00058
Minhao Qiu, T. Antesberger, Florian Bock, Reinhard German
With the growth of self-driving technologies, the reliability analysis of automated driving systems has received considerable attention from both academia and industry. Safety of the intended functionality (SOTIF) serves as one of the primary standards to assure the reliability and safety of the automated driving system. One of its key issues is the performance limitations of perception sensor systems. Generally, the reliability of the perception sensor system depends on the different scenarios of the driving environment. In this work, we investigate the sensor features and dependencies of the front camera and the top LiDAR of the nuTonomy scenes (nuScenes) dataset with respect to scenarios (e.g., rain and night) and distance information (e.g., two distance-based regions of interest). In addition, we apply the obtained parameters to a proven analytical reliability model to examine the impact of scenario and distance information on the reliability assessment.
随着自动驾驶技术的发展,自动驾驶系统的可靠性分析受到了学术界和工业界的广泛关注。预期功能安全(SOTIF)是确保自动驾驶系统可靠性和安全性的主要标准之一。其中一个关键问题是感知传感器系统的性能限制。一般来说,感知传感器系统的可靠性取决于驾驶环境的不同场景。在这项工作中,我们研究了nuTonomy场景(nuScenes)数据集的前置摄像头和顶部激光雷达的传感器特征和依赖关系,这些特征和依赖关系涉及场景(例如,雨和夜)和距离信息(例如,两个基于距离的感兴趣区域)。此外,我们将获得的参数应用于一个经过验证的分析可靠性模型,以检查场景和距离信息对可靠性评估的影响。
{"title":"Exploring the impact of scenario and distance information on the reliability assessment of multi-sensor systems","authors":"Minhao Qiu, T. Antesberger, Florian Bock, Reinhard German","doi":"10.1109/SEAA56994.2022.00058","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00058","url":null,"abstract":"With the growth of self-driving technologies, the reliability analysis of automated driving systems has received considerable attention from both academia and industry. Safety of the intended functionality (SOTIF) serves as one of the primary standards to assure the reliability and safety of the automated driving system. One of its key issues is the performance limitations of perception sensor systems. Generally, the reliability of the perception sensor system depends on the different scenarios of the driving environment. In this work, we investigate the sensor features and dependencies of the front camera and the top LiDAR of the nuTonomy scenes (nuScenes) dataset with respect to scenarios (e.g., rain and night) and distance information (e.g., two distance-based regions of interest). In addition, we apply the obtained parameters to a proven analytical reliability model to examine the impact of scenario and distance information on the reliability assessment.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132692814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthesis of Pareto-optimal Policies for Continuous-Time Markov Decision Processes 连续时间马尔可夫决策过程pareto最优策略的综合
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00071
Naif Alasmari, R. Calinescu
We present a work-in-progress method for the synthesis of continuous-time Markov decision process (CTMDP) policies–an important problem not handled by current probabilistic model checkers. The policies synthesised by this method correspond to configurations of software systems or software controllers of cyber-physical systems (CPS) that satisfy predefined nonfunctional constraints and are Pareto-optimal with respect to a set of optimisation objectives. We illustrate the effectiveness of our method by using it to synthesise optimal configurations for a client-server system, and optimal controllers for a driver-attention management CPS.
我们提出了一种合成连续时间马尔可夫决策过程(CTMDP)策略的方法,这是目前概率模型检查器没有处理的一个重要问题。通过这种方法合成的策略对应于软件系统或网络物理系统(CPS)的软件控制器的配置,这些配置满足预定义的非功能约束,并且相对于一组优化目标是帕累托最优的。我们通过使用它来综合客户端-服务器系统的最优配置和驾驶员-注意力管理CPS的最优控制器来说明我们方法的有效性。
{"title":"Synthesis of Pareto-optimal Policies for Continuous-Time Markov Decision Processes","authors":"Naif Alasmari, R. Calinescu","doi":"10.1109/SEAA56994.2022.00071","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00071","url":null,"abstract":"We present a work-in-progress method for the synthesis of continuous-time Markov decision process (CTMDP) policies–an important problem not handled by current probabilistic model checkers. The policies synthesised by this method correspond to configurations of software systems or software controllers of cyber-physical systems (CPS) that satisfy predefined nonfunctional constraints and are Pareto-optimal with respect to a set of optimisation objectives. We illustrate the effectiveness of our method by using it to synthesise optimal configurations for a client-server system, and optimal controllers for a driver-attention management CPS.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128987363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anomaly Detection in Cloud-Native Systems 云原生系统中的异常检测
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00023
Francesco Lomio, Sergio Moreschini, Xiaozhou Li, Valentina Lenarduzzi
Companies develop cloud-native systems deployed on public and private clouds. Since private clouds have limited resources, the systems should run efficiently by keeping performance related anomalies under control. The goal of this work is to understand whether a set of five performance-related KPIs depends on the metrics collected at runtime by Kafka, Zookeeper, and other tools (168 different metrics). We considered four weeks worth of runtime data collected from a system running in production. We trained eight Machine Learning algorithms on three weeks worth of data and tested them on one week’s worth of data to compare their prediction accuracy and their training and testing time. It is possible to detect performance-related anomalies with a very high level of accuracy (higher than 95% AUC) and with very limited training time (between 8 and 17 minutes). Machine Learning algorithms can help to identify runtime anomalies and to detect them efficiently. Future work will include the identification of a proactive approach to recognize the root cause of the anomalies and to prevent them as early as possible.
公司开发部署在公共云和私有云上的云原生系统。由于私有云的资源有限,因此系统应该通过控制与性能相关的异常来高效运行。这项工作的目标是了解一组五个与性能相关的kpi是否依赖于Kafka, Zookeeper和其他工具在运行时收集的指标(168个不同的指标)。我们考虑了从生产中运行的系统收集的4周的运行时数据。我们在三周的数据上训练了八种机器学习算法,并在一周的数据上对它们进行了测试,以比较它们的预测准确性和训练和测试时间。它可以以非常高的准确度(高于95% AUC)和非常有限的训练时间(8到17分钟)检测与性能相关的异常。机器学习算法可以帮助识别运行时异常并有效地检测它们。未来的工作将包括确定一种主动的方法,以识别异常的根本原因,并尽早预防它们。
{"title":"Anomaly Detection in Cloud-Native Systems","authors":"Francesco Lomio, Sergio Moreschini, Xiaozhou Li, Valentina Lenarduzzi","doi":"10.1109/SEAA56994.2022.00023","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00023","url":null,"abstract":"Companies develop cloud-native systems deployed on public and private clouds. Since private clouds have limited resources, the systems should run efficiently by keeping performance related anomalies under control. The goal of this work is to understand whether a set of five performance-related KPIs depends on the metrics collected at runtime by Kafka, Zookeeper, and other tools (168 different metrics). We considered four weeks worth of runtime data collected from a system running in production. We trained eight Machine Learning algorithms on three weeks worth of data and tested them on one week’s worth of data to compare their prediction accuracy and their training and testing time. It is possible to detect performance-related anomalies with a very high level of accuracy (higher than 95% AUC) and with very limited training time (between 8 and 17 minutes). Machine Learning algorithms can help to identify runtime anomalies and to detect them efficiently. Future work will include the identification of a proactive approach to recognize the root cause of the anomalies and to prevent them as early as possible.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128848700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Investigating the Adoption of History-based Prioritization in the Context of Manual Testing in a Real Industrial Setting 在真实工业环境中手工测试的背景下,调查采用基于历史的优先级
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00030
Vinicius Siqueira, Breno Miranda
Many test case prioritization techniques have been proposed with the ultimate goal of speeding up fault detection. History-based prioritization, in particular, has been shown to be an effective strategy. Most of the empirical studies conducted on this topic, however, have focused on the context of automated testing. Investigating the effectiveness of history-based prioritization in the context of manual testing is important because, despite the popularity of automated approaches, manual testing is still largely adopted in industry. In this work we propose two history-based prioritization heuristics and evaluate them in the context of manual testing in a real industrial setting. For our evaluation we collected historical test execution information for 23 products, spanning over seven years of historical information, accounting for a total of 2,352 unique test cases and 3,993,863 test results. The results of our experiments showed that the effectiveness of the proposed approach is not far from a theoretical optimal prioritization, and that they are significantly better than alternative orderings of the test suite, including the order suggested by the test management tool and the execution order followed by the testers during the real execution of the test suites evaluated as part of our study.
许多测试用例优先级划分技术被提出,其最终目标是加速故障检测。特别是基于历史的优先排序,已被证明是一种有效的策略。然而,大多数关于这个主题的实证研究都集中在自动化测试的背景上。在手工测试的上下文中调查基于历史的优先级的有效性是很重要的,因为尽管自动化方法很流行,但手工测试在工业中仍然被大量采用。在这项工作中,我们提出了两种基于历史的优先级启发式方法,并在实际工业环境中的手动测试环境中对它们进行了评估。为了我们的评估,我们收集了23个产品的历史测试执行信息,跨越了7年的历史信息,总共有2352个独特的测试用例和3993863个测试结果。我们的实验结果表明,所提出的方法的有效性与理论上的最优优先级相差不远,并且它们明显优于测试套件的其他排序,包括测试管理工具建议的顺序和测试人员在实际执行测试套件期间所遵循的执行顺序,这是我们研究的一部分。
{"title":"Investigating the Adoption of History-based Prioritization in the Context of Manual Testing in a Real Industrial Setting","authors":"Vinicius Siqueira, Breno Miranda","doi":"10.1109/SEAA56994.2022.00030","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00030","url":null,"abstract":"Many test case prioritization techniques have been proposed with the ultimate goal of speeding up fault detection. History-based prioritization, in particular, has been shown to be an effective strategy. Most of the empirical studies conducted on this topic, however, have focused on the context of automated testing. Investigating the effectiveness of history-based prioritization in the context of manual testing is important because, despite the popularity of automated approaches, manual testing is still largely adopted in industry. In this work we propose two history-based prioritization heuristics and evaluate them in the context of manual testing in a real industrial setting. For our evaluation we collected historical test execution information for 23 products, spanning over seven years of historical information, accounting for a total of 2,352 unique test cases and 3,993,863 test results. The results of our experiments showed that the effectiveness of the proposed approach is not far from a theoretical optimal prioritization, and that they are significantly better than alternative orderings of the test suite, including the order suggested by the test management tool and the execution order followed by the testers during the real execution of the test suites evaluated as part of our study.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129029245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Software Reuse and Evolution in JavaScript Applications JavaScript应用程序中的软件重用和演化
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00048
Anastasia Terzi, Orfeas Christou, S. Bibi, P. Angelidis
JavaScript (JS) is one of the most popular programming languages on GitHub. Most JavaScript applications are reusing third-party components to acquire various functionalities. Despite the benefits offered by software reuse there are still challenges, during the evolution of JavaScript applications, related to the management and maintenance of the third-party dependencies. Our key objective is to explore the evolution of library dependencies constraints in the context of JavaScript applications in terms of (a) the changeability (i.e., number of removed, added, or maintained libraries) (b) the update frequency of the library dependencies. For this purpose, we conducted a case study on the 86 most forked JavaScript applications hosted on GitHub and analyzed reuse data from a total of 2.363 successive releases. In general, 39% of the packages introduced in the first version of the project are being reused in the entire project’s lifetime. The number of package dependencies slightly grows over time, while several other are being permanently removed. Regarding the evolution of third-party applications, it is observed that developers do not update the dependencies constraints to a most recent version, waiting to reach probably “breaking points” when the updates will be inevitable.
JavaScript是GitHub上最流行的编程语言之一。大多数JavaScript应用程序都重用第三方组件来获得各种功能。尽管软件重用带来了好处,但在JavaScript应用程序的发展过程中,与第三方依赖关系的管理和维护相关的挑战仍然存在。我们的主要目标是从(a)可变性(即删除、添加或维护的库的数量)(b)库依赖的更新频率方面探索JavaScript应用程序上下文中库依赖约束的演变。为此,我们对托管在GitHub上的86个分叉最多的JavaScript应用程序进行了案例研究,并分析了总共2.363个连续版本的重用数据。一般来说,在项目的第一个版本中引入的包中有39%在整个项目的生命周期中被重用。随着时间的推移,包依赖项的数量会略有增长,而其他一些依赖项将被永久删除。关于第三方应用程序的发展,可以观察到开发人员不会将依赖约束更新到最新版本,而是等待可能达到不可避免的更新时的“断点”。
{"title":"Software Reuse and Evolution in JavaScript Applications","authors":"Anastasia Terzi, Orfeas Christou, S. Bibi, P. Angelidis","doi":"10.1109/SEAA56994.2022.00048","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00048","url":null,"abstract":"JavaScript (JS) is one of the most popular programming languages on GitHub. Most JavaScript applications are reusing third-party components to acquire various functionalities. Despite the benefits offered by software reuse there are still challenges, during the evolution of JavaScript applications, related to the management and maintenance of the third-party dependencies. Our key objective is to explore the evolution of library dependencies constraints in the context of JavaScript applications in terms of (a) the changeability (i.e., number of removed, added, or maintained libraries) (b) the update frequency of the library dependencies. For this purpose, we conducted a case study on the 86 most forked JavaScript applications hosted on GitHub and analyzed reuse data from a total of 2.363 successive releases. In general, 39% of the packages introduced in the first version of the project are being reused in the entire project’s lifetime. The number of package dependencies slightly grows over time, while several other are being permanently removed. Regarding the evolution of third-party applications, it is observed that developers do not update the dependencies constraints to a most recent version, waiting to reach probably “breaking points” when the updates will be inevitable.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121993072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Impact of Forced Working-From-Home on Code Technical Debt: An Industrial Case Study 强迫在家工作对代码技术债务的影响:一个工业案例研究
Pub Date : 2022-08-01 DOI: 10.1109/SEAA56994.2022.00054
Ehsan Zabardast, J. Gonzalez-Huerta, Francis Palma
Background: The COVID-19 outbreak interrupted regular activities for over a year in many countries and resulted in a radical change in ways of working for software development companies, i.e., most software development companies switched to a forced Working-From-Home (WFH) mode. Aim: Although several studies have analysed different aspects of forced WFH mode, it is unknown whether and to what extent WFH impacted the accumulation of technical debt (TD) when developers have different ways to coordinate and communicate with peers. Method: Using the year 2019 as a baseline, we carried out an industrial case study to analyse the evolution of TD in five components that are part of a large project while WFH. As part of the data collection, we carried out a focus group with developers to explain the different patterns observed from the quantitative data analysis. Results: TD accumulated at a slower pace during WFH as compared with the working-from-office period in four components out of five. These differences were found to be statistically significant. Through a focus group, we have identified different factors that might explain the changes in TD accumulation. One of these factors is responsibility diffusion which seems to explain why TD grows faster during the WFH period in one of the components. Conclusion: The results suggest that when the ways of working change, the change between working from office and working from home does not result in an increased accumulation of TD.
背景:新冠肺炎疫情在许多国家中断了一年多的正常活动,导致软件开发公司的工作方式发生了根本性变化,即大多数软件开发公司转向强制在家工作(WFH)模式。虽然有一些研究分析了强制WFH模式的不同方面,但当开发人员与同行的协调和沟通方式不同时,WFH是否以及在多大程度上影响了技术债务(TD)的积累,这是未知的。方法:以2019年为基准,我们开展了一个工业案例研究,分析了TD在大型项目中五个组成部分的演变,而WFH是其中的一部分。作为数据收集的一部分,我们与开发人员进行了焦点小组讨论,以解释从定量数据分析中观察到的不同模式。结果:与下班期间相比,在5个组成部分中,有4个组成部分的TD累积速度较慢。这些差异在统计学上是显著的。通过焦点小组,我们已经确定了可能解释TD积累变化的不同因素。其中一个因素是责任扩散,这似乎解释了为什么TD在其中一个组成部分的WFH期间增长更快。结论:研究结果表明,当工作方式发生变化时,办公室工作与在家工作之间的变化并不会导致TD的积累增加。
{"title":"The Impact of Forced Working-From-Home on Code Technical Debt: An Industrial Case Study","authors":"Ehsan Zabardast, J. Gonzalez-Huerta, Francis Palma","doi":"10.1109/SEAA56994.2022.00054","DOIUrl":"https://doi.org/10.1109/SEAA56994.2022.00054","url":null,"abstract":"Background: The COVID-19 outbreak interrupted regular activities for over a year in many countries and resulted in a radical change in ways of working for software development companies, i.e., most software development companies switched to a forced Working-From-Home (WFH) mode. Aim: Although several studies have analysed different aspects of forced WFH mode, it is unknown whether and to what extent WFH impacted the accumulation of technical debt (TD) when developers have different ways to coordinate and communicate with peers. Method: Using the year 2019 as a baseline, we carried out an industrial case study to analyse the evolution of TD in five components that are part of a large project while WFH. As part of the data collection, we carried out a focus group with developers to explain the different patterns observed from the quantitative data analysis. Results: TD accumulated at a slower pace during WFH as compared with the working-from-office period in four components out of five. These differences were found to be statistically significant. Through a focus group, we have identified different factors that might explain the changes in TD accumulation. One of these factors is responsibility diffusion which seems to explain why TD grows faster during the WFH period in one of the components. Conclusion: The results suggest that when the ways of working change, the change between working from office and working from home does not result in an increased accumulation of TD.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126799031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1