
Latest publications from the 2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE)

Automatically Quantifying the Impact of a Change in Systems (Journal-First Abstract)
Nada Almasri, L. Tahat, B. Korel
Software maintenance is becoming more challenging with the increasing complexity of software and the frequency of changes. Performing impact analysis before actually implementing a change is a crucial task during system maintenance. While many tools and techniques are available to measure the impact of a change at the code level, little work has been done to measure it at earlier stages of the development process. This work introduces an approach to measure the impact of a change at the model level.
DOI: 10.1145/3238147.3241984 · pp. 952-952
Citations: 1
Effective API Recommendation without Historical Software Repositories
Xiaoyu Liu, LiGuo Huang, Vincent Ng
It is time-consuming and labor-intensive to learn and locate the correct API for programming tasks. Thus, it is beneficial to perform API recommendation automatically. The graph-based statistical model has been shown to recommend top-10 API candidates effectively. It falls short, however, in accurately recommending an actual top-1 API. To address this weakness, we propose RecRank, an approach and tool that applies a novel ranking-based discriminative approach leveraging API usage path features to improve top-1 API recommendation. Empirical evaluation on a large corpus of (1385+8) open source projects shows that RecRank significantly improves top-1 API recommendation accuracy and mean reciprocal rank when compared to state-of-the-art API recommendation approaches.
DOI: 10.1145/3238147.3238216 · pp. 282-292
Citations: 37
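The re-ranking step described in the RecRank abstract above can be sketched as follows. This is a hypothetical illustration, not RecRank's actual implementation: it assumes a learned linear scorer over API usage-path features that re-orders the top-10 candidates produced by a graph-based model; all names and feature values are made up.

```python
# Hypothetical sketch of ranking-based API re-ranking: a graph-based model
# proposes candidate APIs; a linear scorer over usage-path features re-orders
# them so the best candidate lands at rank 1.

def rerank(candidates, weights):
    """candidates: list of (api_name, feature_vector). Returns names sorted
    by the discriminative score, highest first."""
    def score(features):
        return sum(w * f for w, f in zip(weights, features))
    return [name for name, feats in
            sorted(candidates, key=lambda c: score(c[1]), reverse=True)]

# Toy example: two illustrative path features per candidate.
candidates = [
    ("List.add",    (0.2, 0.9)),
    ("List.append", (0.8, 0.7)),
    ("Map.put",     (0.1, 0.3)),
]
weights = (1.0, 0.5)
print(rerank(candidates, weights))  # highest-scoring candidate first
```

The point of the discriminative step is that the final ordering is driven by the learned weights rather than by the raw graph-model ranking.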
Descartes: A PITest Engine to Detect Pseudo-Tested Methods: Tool Demonstration
O. Vera-Pérez, M. Monperrus, B. Baudry
Descartes is a tool that implements extreme mutation operators and aims at finding pseudo-tested methods in Java projects. It leverages the efficient transformation and runtime features of PITest. The demonstration compares Descartes with Gregor, the default mutation engine provided by PITest, in a set of real open source projects. It considers the execution time, number of mutants created and the relationship between the mutation scores produced by both engines. It provides some insights on the main features exposed by Descartes.
DOI: 10.1145/3238147.3240474 · pp. 908-911
Citations: 17
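The idea behind the extreme mutation operators mentioned above can be illustrated in a few lines. This is a language-agnostic sketch of the concept, not Descartes' Java/PITest implementation: an extreme mutant replaces a whole method body with a constant return, and a method is pseudo-tested if every such mutant survives the test suite.

```python
# Illustrative sketch of extreme mutation and pseudo-tested-method detection
# (concept only; Descartes operates on Java bytecode via PITest).

def make_extreme_mutant(constant):
    """Extreme mutation: replace the whole body with `return constant`."""
    return lambda *args, **kwargs: constant

def is_pseudo_tested(original, mutants, test_suite):
    """Pseudo-tested: the suite passes on the original AND on every mutant,
    i.e., no extreme mutant is ever killed."""
    assert all(t(original) for t in test_suite), "suite must pass on original"
    return all(all(t(m) for t in test_suite) for m in mutants)

# Toy subject and two suites: one never checks the return value, one does.
def absolute(x):
    return x if x >= 0 else -x

weak_suite = [lambda f: (f(-3), True)[1]]   # calls f but asserts nothing
strong_suite = [lambda f: f(-3) == 3]

mutants = [make_extreme_mutant(0), make_extreme_mutant(1)]
print(is_pseudo_tested(absolute, mutants, weak_suite))    # True
print(is_pseudo_tested(absolute, mutants, strong_suite))  # False
```

A surviving-only-mutants outcome like the `weak_suite` case is exactly the signal Descartes reports: the method is covered but effectively untested.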
Delta Debugging Microservice Systems
Xiaoping Zhou, Xin Peng, Tao Xie, Jun Sun, Wenhai Li, Chao Ji, Dan Ding
Debugging microservice systems involves deploying and manipulating them in a containerized environment and faces unique challenges due to the high complexity and dynamism of microservices. To address these challenges, in this paper we propose a debugging approach for microservice systems based on the delta debugging algorithm, which minimizes failure-inducing deltas of circumstances (e.g., deployment, environmental configurations) for effective debugging. Our approach includes novel techniques for defining, deploying/manipulating, and executing deltas, following the idea of delta debugging. In particular, to construct a (failing) circumstance space for delta debugging to minimize, our approach defines a set of dimensions that can affect the execution of microservice systems. Our experimental study on a medium-size microservice benchmark system shows that our approach can effectively identify failure-inducing deltas that help diagnose root causes.
DOI: 10.1145/3238147.3240730 · pp. 802-807
Citations: 32
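The classic ddmin algorithm that the paper above adapts to microservice circumstances can be sketched generically. This is a simplified, complement-only version of Zeller's delta debugging, minimizing any failure-inducing set of deltas; the toy failure predicate is made up.

```python
# Minimal sketch of the ddmin delta debugging algorithm: repeatedly split the
# delta set and keep any complement that still reproduces the failure, until
# no single delta can be removed.

def ddmin(deltas, fails):
    """Return a 1-minimal subset of `deltas` for which fails(subset) is True."""
    assert fails(deltas), "the full delta set must reproduce the failure"
    n = 2
    while len(deltas) >= 2:
        chunk = max(1, len(deltas) // n)
        subsets = [deltas[i:i + chunk] for i in range(0, len(deltas), chunk)]
        reduced = False
        for subset in subsets:
            complement = [d for d in deltas if d not in subset]
            if fails(complement):               # failure persists without subset
                deltas, n = complement, max(n - 1, 2)
                reduced = True
                break
        if not reduced:
            if n >= len(deltas):                # already at single-delta granularity
                break
            n = min(n * 2, len(deltas))         # refine the split
    return deltas

# Toy circumstance space: the system fails iff deltas 3 and 7 are both applied
# (think: a specific deployment option combined with one environment setting).
fails = lambda ds: 3 in ds and 7 in ds
print(ddmin(list(range(10)), fails))  # -> [3, 7]
```

In the paper's setting, each delta would be a deployment or configuration change and `fails` would rerun the containerized system.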
SRCIROR: A Toolset for Mutation Testing of C Source Code and LLVM Intermediate Representation
Farah Hariri, A. Shi
We present SRCIROR (pronounced “sorcerer”), a toolset for performing mutation testing at the levels of C/C++ source code (SRC) and the LLVM compiler intermediate representation (IR). At the SRC level, SRCIROR identifies program constructs for mutation by pattern-matching on the Clang AST. At the IR level, SRCIROR directly mutates the LLVM IR instructions through LLVM passes. Our implementation enables SRCIROR to (1) handle any program that Clang can handle, extending to large programs with a minimal overhead, and (2) have a small percentage of invalid mutants that do not compile. SRCIROR enables performing mutation testing using the same classes of mutation operators at both the SRC and IR levels, and it is easily extensible to support more operators. In addition, SRCIROR can collect coverage to generate mutants only for covered code elements. Our tool is publicly available on GitHub (https://github.com/TestingResearchIllinois/srciror). We evaluate SRCIROR on Coreutils subjects. Our evaluation shows interesting differences between SRC and IR, demonstrating the value of SR-CIROR in enabling mutation testing research across different levels of code representation.
DOI: 10.1145/3238147.3240482 · pp. 860-863
Citations: 17
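SRCIROR's SRC-level approach pattern-matches on the Clang AST; as an illustrative analogy (not SRCIROR itself, which targets C/C++), the same idea in Python's standard `ast` module: match AST nodes and apply a mutation operator, here arithmetic operator replacement (`+` to `-`).

```python
# AST pattern-matching mutation, analogous to SRC-level mutation on the
# Clang AST: find BinOp nodes with Add and replace the operator with Sub.
import ast

class AddToSub(ast.NodeTransformer):
    def visit_BinOp(self, node):
        self.generic_visit(node)          # mutate nested expressions too
        if isinstance(node.op, ast.Add):
            node.op = ast.Sub()
        return node

def mutate(source):
    """Return the source of the mutant program."""
    tree = AddToSub().visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    return ast.unparse(tree)              # requires Python 3.9+

print(mutate("def f(a, b):\n    return a + b"))
# the mutant computes a - b instead of a + b
```

Working on the AST (rather than raw text) is what keeps the fraction of non-compiling mutants small, which is one of the properties the abstract highlights.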
Mining File Histories: Should We Consider Branches?
V. Kovalenko, Fabio Palomba, Alberto Bacchelli
Modern distributed version control systems, such as Git, support branching: the possibility to develop parts of the software outside the master trunk. Accounting for repository structure in Mining Software Repositories (MSR) studies requires a thorough approach to mining, yet there is no well-documented, widespread methodology for handling merge commits and branches. Moreover, little is known about the extent to which considering branches during MSR studies affects their results. In this study, we set out to evaluate the importance of properly handling branches when computing file modification histories. We analyze over 1,400 Git repositories from four open source ecosystems and compute modification histories for over two million files, using two different algorithms: one follows only the first parent of each commit when traversing the repository; the other returns the full modification history of a file across all branches. We show that the two algorithms consistently deliver different results, but the scale of the difference varies across projects and ecosystems. Further, we evaluate the importance of accurate mining of file histories by comparing, for the two retrieval algorithms, the performance of common techniques that rely on file modification history: reviewer recommendation, change recommendation, and defect prediction. We find that considering full file histories yields only a modest increase in the techniques' performance.
DOI: 10.1145/3238147.3238169 · pp. 202-213
Citations: 31
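The two history-retrieval algorithms compared in the abstract above can be sketched over a toy commit DAG. This is an illustrative model (not the paper's implementation): `parents` maps each commit to its parent list, with the first entry being the first parent as in git, and `touched` maps each commit to the files it modified.

```python
# First-parent traversal vs. full-DAG traversal of a file's modification
# history, over an in-memory commit graph.

def first_parent_history(head, parents, touched, path):
    """Follow only the first parent of each commit (ignores side branches)."""
    history, commit = [], head
    while commit is not None:
        if path in touched[commit]:
            history.append(commit)
        commit = parents[commit][0] if parents[commit] else None
    return history

def full_history(head, parents, touched, path):
    """Walk all parents of every commit (includes branch-side modifications)."""
    history, seen, stack = [], set(), [head]
    while stack:
        commit = stack.pop()
        if commit in seen:
            continue
        seen.add(commit)
        if path in touched[commit]:
            history.append(commit)
        stack.extend(parents[commit])
    return sorted(history)

# Mainline: A -> C -> M (head); side branch: A -> B, merged at M (2nd parent).
parents = {"A": [], "B": ["A"], "C": ["A"], "M": ["C", "B"]}
touched = {"A": {"f.py"}, "B": {"f.py"}, "C": set(), "M": set()}
print(first_parent_history("M", parents, touched, "f.py"))  # ['A']
print(full_history("M", parents, touched, "f.py"))          # ['A', 'B']
```

The divergence shown here (`B`'s modification is invisible to the first-parent walk) is exactly the kind of difference the paper measures at scale.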
Reducing Interactive Refactoring Effort via Clustering-Based Multi-objective Search
Vahid Alizadeh, M. Kessentini
Refactoring is nowadays widely adopted in industry because bad design decisions can be very costly and extremely risky. On the one hand, automated refactoring does not always lead to the desired design. On the other hand, manual refactoring is error-prone, time-consuming, and impractical for radical changes. Thus, recent research in the field has focused on integrating developers' feedback into automated refactoring recommendations, because developers understand the problem domain intuitively and may have a clear target design in mind. However, this interactive process can be repetitive, expensive, and tedious, since developers must evaluate recommended refactorings and adapt them to the target design, especially in large systems where the number of possible strategies can grow exponentially. In this paper, we propose an interactive approach that combines multi-objective search and unsupervised learning to reduce the developer's interaction effort when refactoring systems. We first generate different possible refactoring strategies using multi-objective search, finding trade-offs between several conflicting quality attributes. Then, an unsupervised learning algorithm clusters the trade-off solutions, called the Pareto front, to guide developers in selecting their regions of interest and to reduce the number of refactoring options to explore. Feedback from the developer, at both the cluster and solution levels, is used to automatically generate constraints that reduce the search space in subsequent iterations and focus on the region of developer preferences. We selected 14 active developers to manually evaluate the effectiveness of our tool on 5 open source projects and one industrial system. The results show that participants found their desired refactorings faster and more accurately than with the current state of the art.
DOI: 10.1145/3238147.3238217 · pp. 464-474
Citations: 35
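The clustering step described above (grouping Pareto-front solutions so the developer inspects one region at a time) can be sketched with a tiny k-means over two-objective fitness vectors. This is an illustration of the idea, not the paper's exact clustering algorithm; the objective values are made up.

```python
# Cluster Pareto-front refactoring solutions, represented as
# (quality-improvement, refactoring-effort) pairs, into regions of interest.

def kmeans(points, centroids, iterations=10):
    """Plain k-means with fixed iteration count; returns the final clusters."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            clusters[dists.index(min(dists))].append(p)
        centroids = [
            tuple(sum(xs) / len(xs) for xs in zip(*cl)) if cl else c
            for cl, c in zip(clusters, centroids)
        ]
    return clusters

# Four trade-off solutions forming two clearly separated regions.
front = [(0.9, 0.8), (0.85, 0.75), (0.3, 0.2), (0.25, 0.15)]
clusters = kmeans(front, centroids=[(1.0, 1.0), (0.0, 0.0)])
print(clusters)
```

A developer would then pick a representative from one cluster, and that feedback constrains the next search iteration instead of forcing a review of every solution on the front.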
A Unified Lattice Model and Framework for Purity Analyses
D. Helm, Florian Kübler, Michael Eichberg, Michael Reif, M. Mezini
Analyzing whether methods in object-oriented programs are side-effect free and also deterministic, i.e., mathematically pure, has been the target of extensive research. Identifying such methods helps to find code smells and security-related issues, and also helps analyses that detect concurrency bugs. Pure methods are also used by formal verification approaches as the foundation for specifications, and proving purity is necessary to ensure correct specifications. However, so far no common terminology exists to describe the purity of methods; some terms (e.g., pure or side-effect free) are even used inconsistently. Further, all current approaches report only selected purity information, making them suitable for only a subset of the potential use cases. In this paper, we present a fine-grained, unified lattice model that relates the purity levels found in the literature and adds a new level that generalizes existing definitions. We have also implemented a scalable, modularized purity analysis that produces significantly more precise results for real-world programs than the best-performing related work. The analysis shows that all defined levels occur in real-world projects.
DOI: 10.1145/3238147.3238226 · pp. 340-350
Citations: 8
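The lattice idea above can be illustrated with a deliberately simplified purity order. The levels and their ordering here are a made-up three-level total order, far coarser than the paper's actual lattice; the point is only the meet operation: a method is at most as pure as the least pure of its own effects and its callees.

```python
# Toy purity lattice: a method's purity is the meet (greatest lower bound)
# of its own effects and the purity of everything it calls.

LEVELS = ["impure", "side_effect_free", "pure"]   # least to most pure
RANK = {level: i for i, level in enumerate(LEVELS)}

def meet(*levels):
    """Greatest lower bound: the impurest contributing level wins."""
    return min(levels, key=RANK.get)

# A side-effect-free method calling a pure one stays side-effect free;
# any impure callee drags the caller down to impure.
print(meet("side_effect_free", "pure"))   # side_effect_free
print(meet("pure", "pure", "impure"))     # impure
```

A modular analysis like the paper's can propagate such levels bottom-up over the call graph, iterating to a fixed point for recursive methods.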
αDiff: Cross-Version Binary Code Similarity Detection with DNN
Bingchang Liu, Wei Huo, Chao Zhang, Wenchao Li, Feng Li, Aihua Piao, Wei Zou
Binary code similarity detection (BCSD) has many applications, including patch analysis, plagiarism detection, malware detection, and vulnerability search. Existing solutions usually compare specific syntactic features extracted from binary code based on expert knowledge; they suffer from either high performance overhead or low detection accuracy. Moreover, few solutions are suitable for detecting similarities between cross-version binaries, which may diverge not only in syntactic structure but also slightly in semantics. In this paper, we propose a solution, αDiff, that employs three semantic features to address the cross-version BCSD challenge. It first extracts the intra-function feature of each binary function using a deep neural network (DNN). The DNN works directly on the raw bytes of each function, rather than on features (e.g., syntactic structures) provided by experts. αDiff further analyzes the function call graph of each binary, which is relatively stable across versions, and extracts inter-function and inter-module features. A distance is then computed from these three features and used for BCSD. We have implemented a prototype of αDiff and evaluated it on a dataset of about 2.5 million samples. The results show that αDiff outperforms state-of-the-art static solutions by over 10 percentage points on average in different BCSD settings.
DOI: 10.1145/3238147.3238199 · pp. 667-678
Citations: 148
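The αDiff abstract describes computing a single distance from three feature vectors (intra-function, inter-function, inter-module). As a rough illustration of that final combination step only, here is a hedged sketch; the feature names, the weights, and the use of cosine distance are assumptions for illustration, not details taken from the paper (which learns the intra-function embedding with a DNN):

```python
import math

def cosine_distance(a, b):
    """1 - cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    if na == 0.0 or nb == 0.0:
        return 1.0  # treat an empty feature as maximally dissimilar
    return 1.0 - dot / (na * nb)

def cross_version_distance(f1, f2, weights=(0.6, 0.2, 0.2)):
    """Weighted sum of the three per-feature distances; lower means the
    two binary functions are more similar. f1/f2 are dicts holding the
    'intra', 'inter_fn', and 'inter_mod' vectors. The weights here are
    hypothetical, not values reported by the authors."""
    keys = ("intra", "inter_fn", "inter_mod")
    return sum(w * cosine_distance(f1[k], f2[k])
               for w, k in zip(weights, keys))
```

With identical feature dicts the distance is 0; functions whose intra-function embeddings are orthogonal but whose call-graph features match score only the intra-function penalty.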
Code2graph: Automatic Generation of Static Call Graphs for Python Source Code 自动生成Python源代码的静态调用图
Gharib Gharibi, Rashmi Tripathi, Yugyung Lee
A static call graph is an imperative prerequisite used in most interprocedural analyses and software comprehension tools. However, there is a lack of software tools that can automatically analyze Python source code and construct its static call graph. In this paper, we introduce a prototype Python tool, named code2graph, which automates the tasks of (1) analyzing the Python source code and extracting its structure, (2) constructing static call graphs from the source code, and (3) generating a similarity matrix of all possible execution paths in the system. Our goal is twofold: First, assist developers in understanding the overall structure of the system. Second, provide a stepping stone for further research that can utilize the tool in software searching and similarity detection applications. For example, clustering the execution paths into a logical workflow of the system could be used to automate specific software tasks. Code2graph has been successfully used to generate static call graphs and similarity matrices of the paths for three popular open-source Deep Learning projects (TensorFlow, Keras, PyTorch). A tool demo is available at https://youtu.be/ecctePpcAKU.
{"title":"Code2graph: Automatic Generation of Static Call Graphs for Python Source Code","authors":"Gharib Gharibi, Rashmi Tripathi, Yugyung Lee","doi":"10.1145/3238147.3240484","DOIUrl":"https://doi.org/10.1145/3238147.3240484","url":null,"abstract":"A static call graph is an imperative prerequisite used in most interprocedural analyses and software comprehension tools. However, there is a lack of software tools that can automatically analyze the Python source-code and construct its static call graph. In this paper, we introduce a prototype Python tool, named code2graph, which automates the tasks of (1) analyzing the Python source-code and extracting its structure, (2) constructing static call graphs from the source code, and (3) generating a similarity matrix of all possible execution paths in the system. Our goal is twofold: First, assist the developers in understanding the overall structure of the system. Second, provide a stepping stone for further research that can utilize the tool in software searching and similarity detection applications. For example, clustering the execution paths into a logical workflow of the system would be applied to automate specific software tasks. Code2graph has been successfully used to generate static call graphs and similarity matrices of the paths for three popular open-source Deep Learning projects (TensorFlow, Keras, PyTorch). A tool demo is available at https://youtu.be/ecctePpcAKU.","PeriodicalId":6622,"journal":{"name":"2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE)","volume":"63 1","pages":"880-883"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82542976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 25
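Code2graph's first task, extracting a static call graph from Python source, can be approximated with the standard `ast` module. This is a minimal sketch under simplifying assumptions, not the tool's actual implementation: it records only direct calls by simple name inside function definitions of a single module, and ignores methods, imports, and aliasing:

```python
import ast
from collections import defaultdict

def build_call_graph(source: str) -> dict:
    """Very rough static call graph for one Python module: maps each
    function name to the set of simple names it calls. Nested
    functions, attribute calls (obj.m()), and aliases are ignored."""
    tree = ast.parse(source)
    graph = defaultdict(set)
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef):
            for sub in ast.walk(node):
                if isinstance(sub, ast.Call) and isinstance(sub.func, ast.Name):
                    graph[node.name].add(sub.func.id)
    return dict(graph)
```

For example, for a module where `a()` calls `b()` and `c()`, and `b()` calls `c()`, the result maps `a` to `{"b", "c"}` and `b` to `{"c"}`. The tool itself goes further, working across whole projects and deriving a similarity matrix over execution paths, which this single-module sketch does not cover.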