首页 > 最新文献

2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)最新文献

英文 中文
FlakeFlagger: Predicting Flakiness Without Rerunning Tests FlakeFlagger:在不重新运行测试的情况下预测异常
A. Alshammari, Christopher Morris, Michael C Hilton, Jonathan Bell
This is an extended abstract that describes the code and data artifact [1] of our paper “FlakeFlagger: Predicting Flakiness Without Rerunning Tests” [2]. The goal of our artifact is to let researchers reproduce our experiment on our provided flaky dataset or reuse our tool on different flaky tests datasets.
这是一个扩展的摘要,描述了我们的论文“FlakeFlagger:在不重新运行测试的情况下预测Flakiness”[2]的代码和数据工件[1]。我们的工件的目标是让研究人员在我们提供的不可靠数据集上重现我们的实验,或者在不同的不可靠测试数据集上重用我们的工具。
{"title":"FlakeFlagger: Predicting Flakiness Without Rerunning Tests","authors":"A. Alshammari, Christopher Morris, Michael C Hilton, Jonathan Bell","doi":"10.1109/ICSE-Companion52605.2021.00081","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00081","url":null,"abstract":"This is an extended abstract that describes the code and data artifact [1] of our paper “FlakeFlagger: Predicting Flakiness Without Rerunning Tests” [2]. The goal of our artifact is to let researchers reproduce our experiment on our provided flaky dataset or reuse our tool on different flaky tests datasets.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122844282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
PLELog: Semi-Supervised Log-Based Anomaly Detection via Probabilistic Label Estimation 基于概率标签估计的半监督日志异常检测
Lin Yang, Junjie Chen, Zan Wang, Weijing Wang, Jiajun Jiang, Xuyuan Dong, Wenbin Zhang
PLELog is a novel approach for log-based anomaly detection via probabilistic label estimation. It is designed to effectively detect anomalies in unlabeled logs and meanwhile avoid the manual labeling effort for training data generation. We use semantic information within log events as fixed-length vectors and apply HDBSCAN to automatically clustering log sequences. After that, we also propose a Probabilistic Label Estimation approach to reduce the noises introduced by error labeling and put "labeled" instances into an attention-based GRU network for training. We conducted an empirical study to evaluate the effectiveness of PLELog on two open-source log data (i.e., HDFS and BGL). The results demonstrate the effectiveness of PLELog. In particular, PLELog has been applied to two real-world systems from a university and a large corporation, further demonstrating its practicability.
PLELog是一种基于概率标签估计的基于日志的异常检测新方法。它的目的是有效地检测未标记日志中的异常,同时避免人工标记训练数据生成的工作。我们使用日志事件中的语义信息作为固定长度的向量,并应用HDBSCAN对日志序列进行自动聚类。之后,我们还提出了一种概率标签估计方法来减少错误标记带来的噪声,并将“标记”的实例放入基于注意力的GRU网络中进行训练。我们进行了一项实证研究,以评估PLELog在两个开源日志数据(即HDFS和BGL)上的有效性。结果证明了PLELog的有效性。特别地,PLELog已经应用到一个大学和一个大公司的两个现实系统中,进一步证明了它的实用性。
{"title":"PLELog: Semi-Supervised Log-Based Anomaly Detection via Probabilistic Label Estimation","authors":"Lin Yang, Junjie Chen, Zan Wang, Weijing Wang, Jiajun Jiang, Xuyuan Dong, Wenbin Zhang","doi":"10.1109/ICSE-Companion52605.2021.00106","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00106","url":null,"abstract":"PLELog is a novel approach for log-based anomaly detection via probabilistic label estimation. It is designed to effectively detect anomalies in unlabeled logs and meanwhile avoid the manual labeling effort for training data generation. We use semantic information within log events as fixed-length vectors and apply HDBSCAN to automatically clustering log sequences. After that, we also propose a Probabilistic Label Estimation approach to reduce the noises introduced by error labeling and put \"labeled\" instances into an attention-based GRU network for training. We conducted an empirical study to evaluate the effectiveness of PLELog on two open-source log data (i.e., HDFS and BGL). The results demonstrate the effectiveness of PLELog. In particular, PLELog has been applied to two real-world systems from a university and a large corporation, further demonstrating its practicability.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124576041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Bayesian Data Analysis for Software Engineering 软件工程中的贝叶斯数据分析
R. Torkar, Carlo A. Furia, R. Feldt
n/a
{"title":"Bayesian Data Analysis for Software Engineering","authors":"R. Torkar, Carlo A. Furia, R. Feldt","doi":"10.1109/ICSE-Companion52605.2021.00140","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00140","url":null,"abstract":"n/a","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128120413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Technical Briefing: Hands-On Session on the Development of Trustworthy AI Software 技术简报:可信赖人工智能软件开发实训环节
Ville Vakkuri, Kai-Kristian Kemell, P. Abrahamsson
Following various real-world incidents involving both purely digital and cyber-physical Artificial Intelligence (AI) systems, AI Ethics has become a prominent topic of discussion in both research and practice, accompanied by various calls for trustworthy AI systems. Failures are often costly, and many of them stem from issues that could have been avoided during development. For example, AI ethics issues, such as data privacy are currently highly topical. However, implementing AI ethics in practice remains a challenge for organizations. Various guidelines have been published to aid companies in doing so, but these have not seen widespread adoption and may feel impractical. In this technical briefing, we discuss how to implement AI ethics. We showcase a method developed for this purpose, ECCOLA, which is based on academic research. ECCOLA is intended to make AI ethics more practical for developers in order to make it easier to incorporate into AI development to create trustworthy AI systems. It is a sprint-based and adaptive tool designed for agile development that facilitates reflection within the development team and helps developers make ethics into tangible product backlog items.
在涉及纯数字和网络物理人工智能(AI)系统的各种现实世界事件之后,人工智能伦理已经成为研究和实践中讨论的一个突出话题,伴随着对值得信赖的人工智能系统的各种呼吁。失败通常是代价高昂的,其中许多失败源于开发过程中可以避免的问题。例如,人工智能伦理问题,如数据隐私,目前是高度热门的话题。然而,在实践中实施人工智能伦理对组织来说仍然是一个挑战。已经发布了各种指导方针来帮助公司这样做,但这些指导方针尚未得到广泛采用,而且可能感觉不切实际。在本次技术简报中,我们将讨论如何实施人工智能伦理。我们展示了为此目的开发的一种方法,ECCOLA,这是基于学术研究的。ECCOLA旨在使人工智能伦理对开发人员来说更加实用,以便更容易融入人工智能开发,以创建值得信赖的人工智能系统。它是一种基于sprint的自适应工具,专为敏捷开发而设计,促进了开发团队内部的反思,并帮助开发人员将道德规范转化为有形的产品待办事项项。
{"title":"Technical Briefing: Hands-On Session on the Development of Trustworthy AI Software","authors":"Ville Vakkuri, Kai-Kristian Kemell, P. Abrahamsson","doi":"10.1109/ICSE-Companion52605.2021.00142","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00142","url":null,"abstract":"Following various real-world incidents involving both purely digital and cyber-physical Artificial Intelligence (AI) systems, AI Ethics has become a prominent topic of discussion in both research and practice, accompanied by various calls for trustworthy AI systems. Failures are often costly, and many of them stem from issues that could have been avoided during development. For example, AI ethics issues, such as data privacy are currently highly topical. However, implementing AI ethics in practice remains a challenge for organizations. Various guidelines have been published to aid companies in doing so, but these have not seen widespread adoption and may feel impractical. In this technical briefing, we discuss how to implement AI ethics. We showcase a method developed for this purpose, ECCOLA, which is based on academic research. ECCOLA is intended to make AI ethics more practical for developers in order to make it easier to incorporate into AI development to create trustworthy AI systems. It is a sprint-based and adaptive tool designed for agile development that facilitates reflection within the development team and helps developers make ethics into tangible product backlog items.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133161069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MigrationAdvisor: Recommending Library Migrations from Large-Scale Open-Source Data MigrationAdvisor:从大规模开源数据中推荐库迁移
Hao He, Yulin Xu, Xiaoyan Cheng, Guangtai Liang, Minghui Zhou
During software maintenance, developers may need to migrate an already in-use library to another library with similar functionalities. However, it is difficult to make the optimal migration decision with limited information, knowledge, or expertise. In this paper, we present MigrationAdvisor, an evidence-based tool to recommend library migration targets through intelligent analysis upon a large number of GitHub repositories and Java libraries. The migration advisories are provided through a search engine style web service where developers can seek migration suggestions for a specific library. We conduct systematic evaluations on the correctness of results, and evaluate the usefulness of the tool by collecting usage feedback from industry developers. Video: https://youtu.be/4I75W22TqwQ.
在软件维护期间,开发人员可能需要将已经在使用的库迁移到具有类似功能的另一个库。然而,在信息、知识或专业知识有限的情况下,很难做出最优的迁移决策。在本文中,我们介绍了MigrationAdvisor,这是一个基于证据的工具,可以通过对大量GitHub存储库和Java库的智能分析来推荐库迁移目标。迁移建议是通过搜索引擎样式的web服务提供的,开发人员可以在其中寻找特定库的迁移建议。我们对结果的正确性进行系统的评估,并通过收集来自行业开发人员的使用反馈来评估工具的有用性。视频:https://youtu.be/4I75W22TqwQ。
{"title":"MigrationAdvisor: Recommending Library Migrations from Large-Scale Open-Source Data","authors":"Hao He, Yulin Xu, Xiaoyan Cheng, Guangtai Liang, Minghui Zhou","doi":"10.1109/ICSE-Companion52605.2021.00023","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00023","url":null,"abstract":"During software maintenance, developers may need to migrate an already in-use library to another library with similar functionalities. However, it is difficult to make the optimal migration decision with limited information, knowledge, or expertise. In this paper, we present MigrationAdvisor, an evidence-based tool to recommend library migration targets through intelligent analysis upon a large number of GitHub repositories and Java libraries. The migration advisories are provided through a search engine style web service where developers can seek migration suggestions for a specific library. We conduct systematic evaluations on the correctness of results, and evaluate the usefulness of the tool by collecting usage feedback from industry developers. Video: https://youtu.be/4I75W22TqwQ.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131958657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Metamorphic Testing of Autonomous Vehicles: A Case Study on Simulink 自动驾驶汽车的变形测试:以Simulink为例
P. Valle
Autonomous Vehicles (AVs) will revolutionize the way people travel by car. However, in order to deploy autonomous vehicles, effective testing techniques are required. The driving quality of an AV should definitely be considered when testing such systems. However, as in other complex systems, determining the outcome of a test in the driving quality on an AV can be extremely complex. To solve this issue, in this paper we explore the application of Quality-of-Service (QoS) aware metamorphic testing to test AVs modeled in MATLAB/Simulink, one of the predominant modeling tools in the market. We first defined a set of QoS measures applied to AVs by considering as input a recent study. With them, we define metamorphic relations. Lastly we assess the approach in an AV modeled in Simulink by using mutation testing. The results suggests that our approach is effective at detecting faults.
自动驾驶汽车(AVs)将彻底改变人们开车出行的方式。然而,为了部署自动驾驶汽车,需要有效的测试技术。在测试自动驾驶汽车系统时,一定要考虑到自动驾驶汽车的驾驶质量。然而,与其他复杂系统一样,确定自动驾驶汽车驾驶质量测试的结果可能极其复杂。为了解决这一问题,本文探索了服务质量感知(QoS)变形测试的应用,以测试在市场上主要的建模工具之一MATLAB/Simulink中建模的自动驾驶汽车。我们首先通过考虑最近的一项研究作为输入,定义了一组应用于自动驾驶汽车的QoS度量。利用它们,我们定义了变质关系。最后,我们通过使用突变测试在Simulink中建模的AV中评估了该方法。结果表明,我们的方法在检测故障方面是有效的。
{"title":"Metamorphic Testing of Autonomous Vehicles: A Case Study on Simulink","authors":"P. Valle","doi":"10.1109/ICSE-Companion52605.2021.00048","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00048","url":null,"abstract":"Autonomous Vehicles (AVs) will revolutionize the way people travel by car. However, in order to deploy autonomous vehicles, effective testing techniques are required. The driving quality of an AV should definitely be considered when testing such systems. However, as in other complex systems, determining the outcome of a test in the driving quality on an AV can be extremely complex. To solve this issue, in this paper we explore the application of Quality-of-Service (QoS) aware metamorphic testing to test AVs modeled in MATLAB/Simulink, one of the predominant modeling tools in the market. We first defined a set of QoS measures applied to AVs by considering as input a recent study. With them, we define metamorphic relations. Lastly we assess the approach in an AV modeled in Simulink by using mutation testing. The results suggests that our approach is effective at detecting faults.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124544173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Replication Package for It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports 一个用于探戈需要两个人的复制包:结合视觉和文本信息来检测重复的基于视频的Bug报告
Nathan Cooper, Carlos Bernal-Cárdenas, Oscar Chaparro, Kevin Moran, D. Poshyvanyk
When a bug manifests in a user-facing application, it is likely to be exposed through the graphical user interface (GUI). Given the importance of visual information to the process of identifying and understanding such bugs, users are increasingly making use of screenshots and screen-recordings as a means to report issues to developers. Due to their graphical nature, screen-recordings present challenges for automated analysis that preclude the use of current duplicate bug report detection techniques. This paper describes in detail our reproduction package artifact for TANGO, a duplicate detection technique that operates purely on video-based bug reports by leveraging both visual and textual information to overcome these challenges and aid developers in this task. Specifically, this reproduction package contains the data and code that enables our TANGO’s empirical evaluation replication and future research in the area of duplicate video-based bug report detection.
当在面向用户的应用程序中出现错误时,它很可能通过图形用户界面(GUI)暴露出来。考虑到视觉信息在识别和理解这些bug过程中的重要性,用户越来越多地使用截图和屏幕记录作为向开发人员报告问题的一种手段。由于它们的图形特性,屏幕记录为自动分析提出了挑战,这就排除了使用当前的重复错误报告检测技术。本文详细描述了TANGO的复制包工件,这是一种复制检测技术,通过利用视觉和文本信息来克服这些挑战并帮助开发人员完成这项任务,从而完全基于视频的错误报告进行操作。具体来说,这个复制包包含数据和代码,这些数据和代码使我们的TANGO的经验评估复制和未来在基于重复视频的错误报告检测领域的研究成为可能。
{"title":"A Replication Package for It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports","authors":"Nathan Cooper, Carlos Bernal-Cárdenas, Oscar Chaparro, Kevin Moran, D. Poshyvanyk","doi":"10.1109/ICSE-Companion52605.2021.00067","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00067","url":null,"abstract":"When a bug manifests in a user-facing application, it is likely to be exposed through the graphical user interface (GUI). Given the importance of visual information to the process of identifying and understanding such bugs, users are increasingly making use of screenshots and screen-recordings as a means to report issues to developers. Due to their graphical nature, screen-recordings present challenges for automated analysis that preclude the use of current duplicate bug report detection techniques. This paper describes in detail our reproduction package artifact for TANGO, a duplicate detection technique that operates purely on video-based bug reports by leveraging both visual and textual information to overcome these challenges and aid developers in this task. Specifically, this reproduction package contains the data and code that enables our TANGO’s empirical evaluation replication and future research in the area of duplicate video-based bug report detection.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124327913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Replication of SOAR: A Synthesis Approach for Data Science API Refactoring SOAR的复制:数据科学API重构的综合方法
Ansong Ni, Daniel Ramos, Aidan Z. H. Yang, I. Lynce, Vasco M. Manquinho, R. Martins, Claire Le Goues
This paper provides provides a guide to the replication package of SOAR: A Synthesis Approach for Data Science API Refactoring. Our replication package provides a reliable way of reproducing results of the paper using a virtual machine. The replication packages includes scripts to generate the tables and figures presented in results section of the paper. Details on how to use those scripts and run SOAR are explained throughout this guide.
本文提供了SOAR的复制包指南:一种用于数据科学API重构的综合方法。我们的复制包提供了一种使用虚拟机复制论文结果的可靠方法。复制包包括脚本,用于生成论文结果部分中呈现的表格和图形。本指南将详细说明如何使用这些脚本和运行SOAR。
{"title":"Replication of SOAR: A Synthesis Approach for Data Science API Refactoring","authors":"Ansong Ni, Daniel Ramos, Aidan Z. H. Yang, I. Lynce, Vasco M. Manquinho, R. Martins, Claire Le Goues","doi":"10.1109/ICSE-Companion52605.2021.00083","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00083","url":null,"abstract":"This paper provides provides a guide to the replication package of SOAR: A Synthesis Approach for Data Science API Refactoring. Our replication package provides a reliable way of reproducing results of the paper using a virtual machine. The replication packages includes scripts to generate the tables and figures presented in results section of the paper. Details on how to use those scripts and run SOAR are explained throughout this guide.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117157073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Replication of Are Machine Learning Cloud APIs Used Correctly 正确使用机器学习云api的复制
Chengcheng Wan, Shicheng Liu, H. Hoffmann, M. Maire, Shan Lu
This artifact aims to provide benchmark suite, data, and script used in our study "Are Machine Learning Cloud APIs Used Correctly?". We collected a suite of 360 non-trivial applications that use ML cloud APIs for manual study. We also developed checkers and tool to detect and fix API mis-uses. We hope this artifact can motivate and help future research to further tackle ML API mis-uses. All related data are available online.
该工件旨在提供基准套件、数据和脚本,用于我们的研究“机器学习云api是否被正确使用?”我们收集了一套360个重要的应用程序,这些应用程序使用ML云api进行手工研究。我们还开发了检查器和工具来检测和修复API的误用。我们希望这个工件可以激励和帮助未来的研究进一步解决ML API的误用问题。所有相关数据均可在线获取。
{"title":"A Replication of Are Machine Learning Cloud APIs Used Correctly","authors":"Chengcheng Wan, Shicheng Liu, H. Hoffmann, M. Maire, Shan Lu","doi":"10.1109/ICSE-Companion52605.2021.00066","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00066","url":null,"abstract":"This artifact aims to provide benchmark suite, data, and script used in our study \"Are Machine Learning Cloud APIs Used Correctly?\". We collected a suite of 360 non-trivial applications that use ML cloud APIs for manual study. We also developed checkers and tool to detect and fix API mis-uses. We hope this artifact can motivate and help future research to further tackle ML API mis-uses. All related data are available online.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129596171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WebEvo: Taming Web Application Evolution via Semantic Structure Change Detection WebEvo:通过语义结构变化检测来驯服Web应用程序的演变
Fei Shao
In order to prevent information retrieval (IR) and robotic process automation (RPA) tools from functioning improperly due to website evolution, it is important to develop web monitoring tools to monitor changes in a website and report them to the developers and testers. Existing monitoring tools commonly make use of DOM-tree based similarity and visual analysis between different versions of web pages. However, DOM-tree based similarity suffers are prone to false positives, since they cannot identify content-based changes (i.e., contents refreshed every time a web page is retrieved) and GUI widget evolution (e.g., moving a button). Such imprecision adversely affect IR tools or test scripts. To address this problem, we propose approach, WebEvo, that first performs DOM-based change detection, and then leverages historic pages to identify the regions that represent content-based changes, which can be safely ignored. Further, to identify refactoring changes that preserve semantics and appearances of GUI widgets, WebEvo adapts computer vision (CV) techniques to identify the mappings of the GUI widgets from the old web page to the new web page on an element-by-element basis. We evaluated WebEvo on 10 real-world websites from 8 popular categories to demonstrate the superiority of WebEvo over the existing work that relies on DOM-tree based detection or whole-page visual comparison.
为了防止信息检索(IR)和机器人过程自动化(RPA)工具由于网站的发展而功能不正常,开发网络监控工具来监控网站的变化并向开发人员和测试人员报告是很重要的。现有的监控工具通常在不同版本的网页之间使用基于dom树的相似性和可视化分析。然而,dom树的相似度很容易出现误报,因为它们不能识别基于内容的变化(例如,每次检索网页时刷新的内容)和GUI小部件的演变(例如,移动按钮)。这种不精确会对IR工具或测试脚本产生不利影响。为了解决这个问题,我们提出了一种方法,WebEvo,它首先执行基于dom的变化检测,然后利用历史页面来识别代表基于内容的变化的区域,这些变化可以被安全地忽略。此外,为了识别那些保留GUI小部件语义和外观的重构变化,WebEvo采用计算机视觉(CV)技术,逐个元素地识别GUI小部件从旧网页到新网页的映射。我们在8个流行类别的10个真实网站上对WebEvo进行了评估,以证明WebEvo优于现有的基于dom树的检测或全页面视觉比较的工作。
{"title":"WebEvo: Taming Web Application Evolution via Semantic Structure Change Detection","authors":"Fei Shao","doi":"10.1109/ICSE-Companion52605.2021.00060","DOIUrl":"https://doi.org/10.1109/ICSE-Companion52605.2021.00060","url":null,"abstract":"In order to prevent information retrieval (IR) and robotic process automation (RPA) tools from functioning improperly due to website evolution, it is important to develop web monitoring tools to monitor changes in a website and report them to the developers and testers. Existing monitoring tools commonly make use of DOM-tree based similarity and visual analysis between different versions of web pages. However, DOM-tree based similarity suffers are prone to false positives, since they cannot identify content-based changes (i.e., contents refreshed every time a web page is retrieved) and GUI widget evolution (e.g., moving a button). Such imprecision adversely affect IR tools or test scripts. To address this problem, we propose approach, WebEvo, that first performs DOM-based change detection, and then leverages historic pages to identify the regions that represent content-based changes, which can be safely ignored. Further, to identify refactoring changes that preserve semantics and appearances of GUI widgets, WebEvo adapts computer vision (CV) techniques to identify the mappings of the GUI widgets from the old web page to the new web page on an element-by-element basis. We evaluated WebEvo on 10 real-world websites from 8 popular categories to demonstrate the superiority of WebEvo over the existing work that relies on DOM-tree based detection or whole-page visual comparison.","PeriodicalId":136929,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129733501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 IEEE/ACM 43rd International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1