Proceedings of the ACM on Programming Languages最新文献_第2页

Semantic Code Refactoring for Abstract Data Types 抽象数据类型的语义代码重构

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632870

Shankara Pailoor, Yuepeng Wang, Işıl Dillig

Modifications to the data representation of an abstract data type (ADT) can require significant semantic refactoring of the code. Motivated by this observation, this paper presents a new method to automate semantic code refactoring tasks. Our method takes as input the original ADT implementation, a new data representation, and a so-called relational representation invariant (relating the old and new data representations), and automatically generates a new ADT implementation that is semantically equivalent to the original version. Our method is based on counterexample-guided inductive synthesis (CEGIS) but leverages three key ideas that allow it to handle real-world refactoring tasks. First, our approach reduces the underlying relational synthesis problem to a set of (simpler) programming-by-example problems, one for each method in the ADT. Second, it leverages symbolic reasoning techniques, based on logical abduction, to deduce code snippets that should occur in the refactored version. Finally, it utilizes a notion of partial equivalence to make inductive synthesis much more effective in this setting. We have implemented the proposed approach in a new tool called Revamp for automatically refactoring Java classes and evaluated it on 30 Java class mined from Github. Our evaluation shows that Revamp can correctly refactor the entire ADT in 97% of the cases and that it can successfully re-implement 144 out of the 146 methods that require modifications.

修改抽象数据类型（ADT）的数据表示可能需要对代码进行大量语义重构。受此启发，本文提出了一种自动执行语义代码重构任务的新方法。我们的方法将原始 ADT 实现、新数据表示和所谓的关系表示不变式（连接新旧数据表示）作为输入，并自动生成与原始版本在语义上等价的新 ADT 实现。我们的方法基于反例引导归纳综合（CEGIS），但利用了三个关键思想，使其能够处理现实世界中的重构任务。首先，我们的方法将底层关系合成问题简化为一组（更简单的）逐例编程问题，ADT 中的每个方法都有一个逐例编程问题。其次，它利用基于逻辑归纳的符号推理技术，推导出重构版本中应该出现的代码片段。最后，它利用部分等价的概念，使归纳综合在这种情况下更加有效。我们在名为 "Revamp "的用于自动重构 Java 类的新工具中实施了所提出的方法，并在从 Github 挖掘出的 30 个 Java 类上对其进行了评估。评估结果表明，在 97% 的情况下，Revamp 可以正确地重构整个 ADT，并且可以成功地重新实现 146 个需要修改的方法中的 144 个。

{"title":"Semantic Code Refactoring for Abstract Data Types","authors":"Shankara Pailoor, Yuepeng Wang, Işıl Dillig","doi":"10.1145/3632870","DOIUrl":"https://doi.org/10.1145/3632870","url":null,"abstract":"Modifications to the data representation of an abstract data type (ADT) can require significant semantic refactoring of the code. Motivated by this observation, this paper presents a new method to automate semantic code refactoring tasks. Our method takes as input the original ADT implementation, a new data representation, and a so-called relational representation invariant (relating the old and new data representations), and automatically generates a new ADT implementation that is semantically equivalent to the original version. Our method is based on counterexample-guided inductive synthesis (CEGIS) but leverages three key ideas that allow it to handle real-world refactoring tasks. First, our approach reduces the underlying relational synthesis problem to a set of (simpler) programming-by-example problems, one for each method in the ADT. Second, it leverages symbolic reasoning techniques, based on logical abduction, to deduce code snippets that should occur in the refactored version. Finally, it utilizes a notion of partial equivalence to make inductive synthesis much more effective in this setting. We have implemented the proposed approach in a new tool called Revamp for automatically refactoring Java classes and evaluated it on 30 Java class mined from Github. Our evaluation shows that Revamp can correctly refactor the entire ADT in 97% of the cases and that it can successfully re-implement 144 out of the 146 methods that require modifications.","PeriodicalId":20697,"journal":{"name":"Proceedings of the ACM on Programming Languages","volume":"31 29","pages":"816 - 847"},"PeriodicalIF":1.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139382656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Quotient Haskell: Lightweight Quotient Types for All Quotient Haskell：面向所有人的轻量级商类型

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632869

Brandon Hewer, Graham Hutton

Subtypes and quotient types are dual type abstractions. However, while subtypes are widely used both explicitly and implicitly, quotient types have not seen much practical use outside of proof assistants. A key difficulty to wider adoption of quotient types lies in the significant burden of proof-obligations that arises from their use. In this article, we address this issue by introducing a class of quotient types for which the proof-obligations are decidable by an SMT solver. We demonstrate this idea in practice by presenting Quotient Haskell, an extension of Liquid Haskell with support for quotient types.

子类型和商类型是双重类型抽象。然而，虽然子类型在显式和隐式中都得到了广泛应用，但商数类型在证明助手之外的实际应用并不多。商类型的广泛应用所面临的一个主要困难在于，使用商类型会产生大量的证明义务。在本文中，我们通过引入一类商类型来解决这个问题，商类型的证明义务是可由 SMT 求解器解密的。我们通过介绍支持商类型的 Liquid Haskell 扩展 Quotient Haskell 来在实践中证明这一想法。

引用次数: 0

Regular Abstractions for Array Systems 阵列系统的规则抽象

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632864

Chih-Duo Hong, Anthony W. Lin

Verifying safety and liveness over array systems is a highly challenging problem. Array systems naturally capture parameterized systems such as distributed protocols with an unbounded number of processes. Such distributed protocols often exploit process IDs during their computation, resulting in array systems whose element values range over an infinite domain. In this paper, we develop a novel framework for proving safety and liveness over array systems. The crux of the framework is to overapproximate an array system as a string rewriting system (i.e. over a finite alphabet) by means of a new predicate abstraction that exploits the so-called indexed predicates. This allows us to tap into powerful verification methods for string rewriting systems that have been heavily developed in the last two decades or so (e.g. regular model checking). We demonstrate how our method yields simple, automatically verifiable proofs of safety and liveness properties for challenging examples, including Dijkstra's self-stabilizing protocol and the Chang-Roberts leader election protocol.

验证阵列系统的安全性和有效性是一个极具挑战性的问题。数组系统天然地捕获了参数化系统，例如具有无限进程数的分布式协议。此类分布式协议在计算过程中通常会利用进程 ID，从而导致数组系统的元素值遍布无限域。在本文中，我们开发了一种新型框架，用于证明数组系统的安全性和有效性。该框架的核心是通过利用所谓索引谓词的新谓词抽象，将数组系统过度近似为字符串重写系统（即在有限字母表上）。这样，我们就能利用过去二十多年来大量开发的强大的字符串重写系统验证方法（如常规模型检查）。我们演示了我们的方法如何为具有挑战性的例子（包括 Dijkstra 的自稳定协议和 Chang-Roberts 领导者选举协议）提供简单、可自动验证的安全性和有效性证明。

引用次数: 0

Automatic Parallelism Management 自动并行管理

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632880

Sam Westrick, M. Fluet, Mike Rainey, Umut A. Acar

On any modern computer architecture today, parallelism comes with a modest cost, born from the creation and management of threads or tasks. Today, programmers battle this cost by manually optimizing/tuning their codes to minimize the cost of parallelism without harming its benefit, performance. This is a difficult battle: programmers must reason about architectural constant factors hidden behind layers of software abstractions, including thread schedulers and memory managers, and their impact on performance, also at scale. In languages that support higher-order functions, the battle hardens: higher order functions can make it difficult, if not impossible, to reason about the cost and benefits of parallelism. Motivated by these challenges and the numerous advantages of high-level languages, we believe that it has become essential to manage parallelism automatically so as to minimize its cost and maximize its benefit. This is a challenging problem, even when considered on a case-by-case, application-specific basis. But if a solution were possible, then it could combine the many correctness benefits of high-level languages with performance by managing parallelism without the programmer effort needed to ensure performance. This paper proposes techniques for such automatic management of parallelism by combining static (compilation) and run-time techniques. Specifically, we consider the Parallel ML language with task parallelism, and describe a compiler pipeline that embeds "potential parallelism" directly into the call-stack and avoids the cost of task creation by default. We then pair this compilation pipeline with a run-time system that dynamically converts potential parallelism into actual parallel tasks. Together, the compiler and run-time system guarantee that the cost of parallelism remains low without losing its benefit. We prove that our techniques have no asymptotic impact on the work and span of parallel programs and thus preserve their asymptotic properties. We implement the proposed techniques by extending the MPL compiler for Parallel ML and show that it can eliminate the burden of manual optimization while delivering good practical performance.

在当今任何现代计算机体系结构中，并行性都会产生一定的成本，这来自于线程或任务的创建和管理。如今，程序员通过手动优化/调整代码来降低并行成本，同时又不损害并行的优势--性能。这是一场艰苦的战斗：程序员必须推理隐藏在层层软件抽象（包括线程调度器和内存管理器）背后的架构常量因素，以及它们对性能（同样是大规模性能）的影响。在支持高阶函数的语言中，这场战斗更加艰苦：高阶函数可能会使推理并行性的成本和收益变得困难，甚至不可能。在这些挑战和高级语言众多优势的激励下，我们认为，自动管理并行性以最小化其成本和最大化其收益已变得至关重要。这是一个极具挑战性的问题，即使是根据具体情况和特定应用来考虑也是如此。但是，如果有可能找到一种解决方案，那么它就可以通过管理并行性，将高级语言的许多正确性优势与性能结合起来，而无需程序员为确保性能而付出努力。本文结合静态（编译）和运行时技术，提出了自动管理并行性的技术。具体来说，我们考虑了具有任务并行性的 Parallel ML 语言，并描述了一种编译器流水线，该流水线可将 "潜在并行性 "直接嵌入调用堆栈，并在默认情况下避免任务创建的成本。然后，我们将该编译管道与运行时系统配对，后者可将潜在并行性动态转换为实际并行任务。编译器和运行时系统共同保证了并行的低成本，同时又不失其优势。我们证明，我们的技术对并行程序的工作和跨度没有渐进影响，因此保留了它们的渐进特性。我们通过为并行 ML 扩展 MPL 编译器来实现所提出的技术，并证明它可以消除手动优化的负担，同时提供良好的实用性能。

{"title":"Automatic Parallelism Management","authors":"Sam Westrick, M. Fluet, Mike Rainey, Umut A. Acar","doi":"10.1145/3632880","DOIUrl":"https://doi.org/10.1145/3632880","url":null,"abstract":"On any modern computer architecture today, parallelism comes with a modest cost, born from the creation and management of threads or tasks. Today, programmers battle this cost by manually optimizing/tuning their codes to minimize the cost of parallelism without harming its benefit, performance. This is a difficult battle: programmers must reason about architectural constant factors hidden behind layers of software abstractions, including thread schedulers and memory managers, and their impact on performance, also at scale. In languages that support higher-order functions, the battle hardens: higher order functions can make it difficult, if not impossible, to reason about the cost and benefits of parallelism. Motivated by these challenges and the numerous advantages of high-level languages, we believe that it has become essential to manage parallelism automatically so as to minimize its cost and maximize its benefit. This is a challenging problem, even when considered on a case-by-case, application-specific basis. But if a solution were possible, then it could combine the many correctness benefits of high-level languages with performance by managing parallelism without the programmer effort needed to ensure performance. This paper proposes techniques for such automatic management of parallelism by combining static (compilation) and run-time techniques. Specifically, we consider the Parallel ML language with task parallelism, and describe a compiler pipeline that embeds \"potential parallelism\" directly into the call-stack and avoids the cost of task creation by default. We then pair this compilation pipeline with a run-time system that dynamically converts potential parallelism into actual parallel tasks. Together, the compiler and run-time system guarantee that the cost of parallelism remains low without losing its benefit. We prove that our techniques have no asymptotic impact on the work and span of parallel programs and thus preserve their asymptotic properties. We implement the proposed techniques by extending the MPL compiler for Parallel ML and show that it can eliminate the burden of manual optimization while delivering good practical performance.","PeriodicalId":20697,"journal":{"name":"Proceedings of the ACM on Programming Languages","volume":"48 19","pages":"1118 - 1149"},"PeriodicalIF":1.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139384261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ReLU Hull Approximation ReLU 船体近似法

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632917

Zhongkui Ma, Jiaying Li, Guangdong Bai

Convex hulls are commonly used to tackle the non-linearity of activation functions in the verification of neural networks. Computing the exact convex hull is a costly task though. In this work, we propose a fast and precise approach to over-approximating the convex hull of the ReLU function (referred to as the ReLU hull), one of the most used activation functions. Our key insight is to formulate a convex polytope that ”wraps” the ReLU hull, by reusing the linear pieces of the ReLU function as the lower faces and constructing upper faces that are adjacent to the lower faces. The upper faces can be efficiently constructed based on the edges and vertices of the lower faces, given that an n-dimensional (or simply nd hereafter) hyperplane can be determined by an (n−1)d hyperplane and a point outside of it. We implement our approach as WraLU, and evaluate its performance in terms of precision, efficiency, constraint complexity, and scalability. WraLU outperforms existing advanced methods by generating fewer constraints to achieve tighter approximation in less time. It exhibits versatility by effectively addressing arbitrary input polytopes and higher-dimensional cases, which are beyond the capabilities of existing methods. We integrate WraLU into PRIMA, a state-of-the-art neural network verifier, and apply it to verify large-scale ReLU-based neural networks. Our experimental results demonstrate that WraLU achieves a high efficiency without compromising precision. It reduces the number of constraints that need to be solved by the linear programming solver by up to half, while delivering comparable or even superior results compared to the state-of-the-art verifiers.

在神经网络验证中，凸壳通常用于解决激活函数的非线性问题。不过，计算精确的凸壳是一项代价高昂的任务。在这项工作中，我们提出了一种快速、精确的方法来过度逼近 ReLU 函数的凸壳（简称 ReLU 壳），ReLU 是最常用的激活函数之一。我们的主要见解是，通过重复使用 ReLU 函数的线性片段作为下部面，并构建与下部面相邻的上部面，制定一个 "包裹 "ReLU 凸壳的凸多胞形。鉴于一个 n 维（以下简称 nd）超平面可以由一个 (n-1)d 超平面及其外一点决定，因此可以根据下层面的边和顶点高效地构造上层面。我们用 WraLU 实现了我们的方法，并从精度、效率、约束复杂度和可扩展性等方面对其性能进行了评估。WraLU 通过生成更少的约束条件，在更短的时间内实现更严格的逼近，从而超越了现有的先进方法。它还能有效处理任意输入多边形和高维情况，这超出了现有方法的能力范围，从而展现了其多功能性。我们将 WraLU 集成到最先进的神经网络验证器 PRIMA 中，并将其用于验证基于 ReLU 的大规模神经网络。我们的实验结果表明，WraLU 在不影响精度的前提下实现了高效率。它将线性规划求解器需要求解的约束条件数量最多减少了一半，同时提供了与最先进验证器相当甚至更优的结果。

{"title":"ReLU Hull Approximation","authors":"Zhongkui Ma, Jiaying Li, Guangdong Bai","doi":"10.1145/3632917","DOIUrl":"https://doi.org/10.1145/3632917","url":null,"abstract":"Convex hulls are commonly used to tackle the non-linearity of activation functions in the verification of neural networks. Computing the exact convex hull is a costly task though. In this work, we propose a fast and precise approach to over-approximating the convex hull of the ReLU function (referred to as the ReLU hull), one of the most used activation functions. Our key insight is to formulate a convex polytope that ”wraps” the ReLU hull, by reusing the linear pieces of the ReLU function as the lower faces and constructing upper faces that are adjacent to the lower faces. The upper faces can be efficiently constructed based on the edges and vertices of the lower faces, given that an n-dimensional (or simply nd hereafter) hyperplane can be determined by an (n−1)d hyperplane and a point outside of it. We implement our approach as WraLU, and evaluate its performance in terms of precision, efficiency, constraint complexity, and scalability. WraLU outperforms existing advanced methods by generating fewer constraints to achieve tighter approximation in less time. It exhibits versatility by effectively addressing arbitrary input polytopes and higher-dimensional cases, which are beyond the capabilities of existing methods. We integrate WraLU into PRIMA, a state-of-the-art neural network verifier, and apply it to verify large-scale ReLU-based neural networks. Our experimental results demonstrate that WraLU achieves a high efficiency without compromising precision. It reduces the number of constraints that need to be solved by the linear programming solver by up to half, while delivering comparable or even superior results compared to the state-of-the-art verifiers.","PeriodicalId":20697,"journal":{"name":"Proceedings of the ACM on Programming Languages","volume":"6 15","pages":"2260 - 2287"},"PeriodicalIF":1.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139380971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Effectful Software Contracts 有效的软件合同

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632930

Cameron Moy, Christos Dimoulas, Matthias Felleisen

Software contracts empower programmers to describe functional properties of components. When it comes to constraining effects, though, the literature offers only one-off solutions for various effects. It lacks a universal principle. This paper presents the design of an effectful contract system in the context of effect handlers. A key metatheorem shows that contracts cannot unduly interfere with a program's execution. An implementation of this design, along with an evaluation of its generality, demonstrates that the theory can guide practice.

软件合约赋予程序员描述组件功能属性的能力。不过，在约束效果方面，文献只提供了针对各种效果的一次性解决方案。它缺乏一个通用的原则。本文以效果处理程序为背景，介绍了效果合约系统的设计。一个关键的元定理表明，契约不能过度干扰程序的执行。对这一设计的实现以及对其通用性的评估表明，理论可以指导实践。

引用次数: 0

Fusing Direct Manipulations into Functional Programs 将直接操作融合到功能程序中

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632883

Xing Zhang, Ruifeng Xie, Guanchen Guo, Xiao He, Tao Zan, Zhenjiang Hu

Bidirectional live programming systems (BLP) enable developers to modify a program by directly manipulating the program output, so that the updated program can produce the manipulated output. One state-of-the-art approach to BLP systems is operation-based, which captures the developer's intention of program modifications by taking how the developer manipulates the output into account. The program modifications are usually hard coded for each direct manipulation in these BLP systems, which are difficult to extend. Moreover, to reflect the manipulations to the source program, these BLP systems trace the modified output to appropriate code fragments and perform corresponding code transformations. Accordingly, they require direct manipulation users be aware of the source code and how it is changed, making "direct" manipulation (on output) be "indirect". In this paper, we resolve this problem by presenting a novel operation-based framework for bidirectional live programming, which can automatically fuse direct manipulations into the source code, thus supporting code-insensitive direct manipulations. Firstly, we design a simple but expressive delta language DM capable of expressing common direct manipulations for output values. Secondly, we present a fusion algorithm that propagates direct manipulations into the source functional programs and applies them to the constants whenever possible; otherwise, the algorithm embeds manipulations into the "proper positions" of programs. We prove the correctness of the fusion algorithm that the updated program executes to get the manipulated output. To demonstrate the expressiveness of DM and the effectiveness of our fusion algorithm, we have implemented FuseDM, a prototype SVG editor that supports GUI-based operations for direct manipulation, and successfully designed 14 benchmark examples starting from blank code using FuseDM.

双向实时编程系统（BLP）使开发人员能够通过直接操作程序输出来修改程序，从而使更新后的程序能够产生被操作的输出。基于操作的双向实时编程系统是最先进的双向实时编程系统方法之一，它通过考虑开发者如何操作输出来捕捉开发者修改程序的意图。在这些 BLP 系统中，程序修改通常是为每次直接操作硬编码的，很难扩展。此外，为了将操作反映到源程序中，这些 BLP 系统会将修改后的输出追踪到相应的代码片段，并执行相应的代码转换。因此，这些系统要求直接操作用户了解源代码及其更改方式，从而使（对输出的）"直接 "操作变成了 "间接 "操作。在本文中，我们提出了一种新颖的基于操作的双向实时编程框架来解决这个问题，它可以自动将直接操作融合到源代码中，从而支持对代码不敏感的直接操作。首先，我们设计了一种简单但富有表现力的 delta 语言 DM，它能够表达输出值的常见直接操作。其次，我们提出了一种融合算法，该算法将直接操作传播到源代码函数式程序中，并尽可能将其应用到常量中；否则，该算法将操作嵌入程序的 "适当位置"。我们证明了融合算法的正确性，更新后的程序会执行融合算法以获得操作输出。为了证明 DM 的表现力和融合算法的有效性，我们实现了 FuseDM，一个支持基于图形用户界面的直接操作的 SVG 编辑器原型，并使用 FuseDM 从空白代码开始成功设计了 14 个基准示例。

{"title":"Fusing Direct Manipulations into Functional Programs","authors":"Xing Zhang, Ruifeng Xie, Guanchen Guo, Xiao He, Tao Zan, Zhenjiang Hu","doi":"10.1145/3632883","DOIUrl":"https://doi.org/10.1145/3632883","url":null,"abstract":"Bidirectional live programming systems (BLP) enable developers to modify a program by directly manipulating the program output, so that the updated program can produce the manipulated output. One state-of-the-art approach to BLP systems is operation-based, which captures the developer's intention of program modifications by taking how the developer manipulates the output into account. The program modifications are usually hard coded for each direct manipulation in these BLP systems, which are difficult to extend. Moreover, to reflect the manipulations to the source program, these BLP systems trace the modified output to appropriate code fragments and perform corresponding code transformations. Accordingly, they require direct manipulation users be aware of the source code and how it is changed, making \"direct\" manipulation (on output) be \"indirect\". In this paper, we resolve this problem by presenting a novel operation-based framework for bidirectional live programming, which can automatically fuse direct manipulations into the source code, thus supporting code-insensitive direct manipulations. Firstly, we design a simple but expressive delta language DM capable of expressing common direct manipulations for output values. Secondly, we present a fusion algorithm that propagates direct manipulations into the source functional programs and applies them to the constants whenever possible; otherwise, the algorithm embeds manipulations into the \"proper positions\" of programs. We prove the correctness of the fusion algorithm that the updated program executes to get the manipulated output. To demonstrate the expressiveness of DM and the effectiveness of our fusion algorithm, we have implemented FuseDM, a prototype SVG editor that supports GUI-based operations for direct manipulation, and successfully designed 14 benchmark examples starting from blank code using FuseDM.","PeriodicalId":20697,"journal":{"name":"Proceedings of the ACM on Programming Languages","volume":"17 4","pages":"1211 - 1238"},"PeriodicalIF":1.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139383233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Decision and Complexity of Dolev-Yao Hyperproperties 多列夫-姚超特性的判定与复杂性

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632906

Itsaka Rakotonirina, Gilles Barthe, Clara Schneidewind

The formal analysis of cryptographic protocols traditionally focuses on trace and equivalence properties, for which decision procedures in the symbolic (or Dolev-Yao, or DY) model are known. However, many relevant security properties are expressed as DY hyperproperties that involve quantifications over both execution paths and attacker computations (which are constrained by the attacker's knowledge in the underlying model of computation). DY hyperproperties generalise hyperproperties, for which many decision procedures exist, to the setting of DY models. Unfortunately, the subtle interactions between both forms of quantifications have been an obstacle to lifting decision procedures from hyperproperties to DY hyperproperties. The central contribution of the paper is the first procedure for deciding DY hyperproperties, in the usual setting where the number of protocol sessions is bounded and where the equational theory modelling cryptography is subterm-convergent. We prove that our decision procedure can decide the validity of any hyperproperty in which quantifications over messages are guarded and quantifications over attacker computations are limited to expressing the attacker's knowledge. We also establish the complexity of the decision problem for several important fragments of the hyperlogic. Further, we illustrate the techniques and scope of our contributions through examples of related hyperproperties.

密码协议的形式分析传统上侧重于跟踪和等价特性，而符号模型（或 Dolev-Yao 模型，或 DY 模型）中的决策程序是已知的。然而，许多相关的安全属性都可以用 DY 超属性来表示，其中涉及对执行路径和攻击者计算的量化（受攻击者在底层计算模型中的知识限制）。DY 超属性将超属性泛化到了 DY 模型的环境中，而对于超属性，已有许多决策程序。遗憾的是，两种量化形式之间微妙的相互作用一直是将决策程序从超属性提升到 DY 超属性的障碍。本文的核心贡献在于，在协议会话次数有界、等式理论建模密码学是子项收敛的通常情况下，首次提出了 DY 超属性决策程序。我们证明了我们的判定过程可以判定任何超属性的有效性，在这些超属性中，对信息的量化是受保护的，对攻击者计算的量化仅限于表达攻击者的知识。我们还为超逻辑的几个重要片段确定了决策问题的复杂性。此外，我们还通过相关超属性的例子来说明我们所贡献的技术和范围。

{"title":"Decision and Complexity of Dolev-Yao Hyperproperties","authors":"Itsaka Rakotonirina, Gilles Barthe, Clara Schneidewind","doi":"10.1145/3632906","DOIUrl":"https://doi.org/10.1145/3632906","url":null,"abstract":"The formal analysis of cryptographic protocols traditionally focuses on trace and equivalence properties, for which decision procedures in the symbolic (or Dolev-Yao, or DY) model are known. However, many relevant security properties are expressed as DY hyperproperties that involve quantifications over both execution paths and attacker computations (which are constrained by the attacker's knowledge in the underlying model of computation). DY hyperproperties generalise hyperproperties, for which many decision procedures exist, to the setting of DY models. Unfortunately, the subtle interactions between both forms of quantifications have been an obstacle to lifting decision procedures from hyperproperties to DY hyperproperties. The central contribution of the paper is the first procedure for deciding DY hyperproperties, in the usual setting where the number of protocol sessions is bounded and where the equational theory modelling cryptography is subterm-convergent. We prove that our decision procedure can decide the validity of any hyperproperty in which quantifications over messages are guarded and quantifications over attacker computations are limited to expressing the attacker's knowledge. We also establish the complexity of the decision problem for several important fragments of the hyperlogic. Further, we illustrate the techniques and scope of our contributions through examples of related hyperproperties.","PeriodicalId":20697,"journal":{"name":"Proceedings of the ACM on Programming Languages","volume":"109 5","pages":"1913 - 1944"},"PeriodicalIF":1.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139383285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Inference of Robust Reachability Constraints 鲁棒可达性约束推理

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632933

Yanis Sellami, Guillaume Girol, Frédéric Recoules, Damien Couroussé, S. Bardin

Characterization of bugs and attack vectors is in many practical scenarios as important as their finding. Recently, Girol et. al. have introduced the concept of robust reachability, which ensures a perfect reproducibility of the reported violations by distinguishing inputs that are under the control of the attacker (controlled inputs) from those that are not (uncontrolled inputs), and proposed first automated analysis for it. While it is a step toward distinguishing severe bugs from benign ones, it fails for example to describe violations that are mostly reproducible, i.e., when triggering conditions are likely to happen, meaning that they happen for all uncontrolled inputs but a few corner cases. To address this issue, we propose to leverage theory-agnostic abduction techniques to generate constraints on the uncontrolled program inputs that ensure that a target property is robustly satisfied. Our proposal comes with an extension of robust reachability that is generic on the type of trace property and on the technology used to verify the properties. We show that our approach is complete w.r.t its inference language, and we additionally discuss strategies for the efficient exploration of the inference space. We demonstrate the feasibility of the method and its practical ability to refine the notion of robust reachability with an implementation that uses robust reachability oracles to generate constraints on standard benchmarks from software verification and security analysis. We illustrate the use of our implementation to a vulnerability characterization problem in the context of fault injection attacks. Our method overcomes a major limitation of the initial proposal of robust reachability, without complicating its definition. From a practical view, this is a step toward new verification tools that are able to characterize program violations through high-level feedback.

在许多实际应用场景中，描述漏洞和攻击向量与发现漏洞和攻击向量同样重要。最近，Girol 等人引入了鲁棒可达性概念，通过区分受攻击者控制的输入（受控输入）和不受攻击者控制的输入（不受控输入），确保报告的违规行为具有完美的可重复性，并首次提出了自动分析方法。虽然这是区分严重漏洞和良性漏洞的一个步骤，但它无法描述大部分情况下可重现的违规行为，即触发条件很可能发生，也就是说，除了少数几个角落情况外，所有不受控制的输入都会发生违规行为。为解决这一问题，我们建议利用与理论无关的归纳技术，生成不受控制程序输入的约束条件，确保目标属性得到稳健满足。我们的建议是对稳健可达性的扩展，它与跟踪属性的类型和用于验证属性的技术有关。我们证明了我们的方法在推理语言方面是完整的，我们还讨论了有效探索推理空间的策略。我们通过使用鲁棒可及性奥义来生成软件验证和安全分析标准基准约束的实现，证明了该方法的可行性及其完善鲁棒可及性概念的实际能力。我们以故障注入攻击为背景，说明了如何将我们的实现方法用于漏洞特征描述问题。我们的方法克服了最初提出的鲁棒可达性的一个主要局限性，而没有使其定义复杂化。从实用的角度来看，这是向新验证工具迈出的一步，新验证工具能够通过高级反馈来表征程序违规行为。

{"title":"Inference of Robust Reachability Constraints","authors":"Yanis Sellami, Guillaume Girol, Frédéric Recoules, Damien Couroussé, S. Bardin","doi":"10.1145/3632933","DOIUrl":"https://doi.org/10.1145/3632933","url":null,"abstract":"Characterization of bugs and attack vectors is in many practical scenarios as important as their finding. Recently, Girol et. al. have introduced the concept of robust reachability, which ensures a perfect reproducibility of the reported violations by distinguishing inputs that are under the control of the attacker (controlled inputs) from those that are not (uncontrolled inputs), and proposed first automated analysis for it. While it is a step toward distinguishing severe bugs from benign ones, it fails for example to describe violations that are mostly reproducible, i.e., when triggering conditions are likely to happen, meaning that they happen for all uncontrolled inputs but a few corner cases. To address this issue, we propose to leverage theory-agnostic abduction techniques to generate constraints on the uncontrolled program inputs that ensure that a target property is robustly satisfied. Our proposal comes with an extension of robust reachability that is generic on the type of trace property and on the technology used to verify the properties. We show that our approach is complete w.r.t its inference language, and we additionally discuss strategies for the efficient exploration of the inference space. We demonstrate the feasibility of the method and its practical ability to refine the notion of robust reachability with an implementation that uses robust reachability oracles to generate constraints on standard benchmarks from software verification and security analysis. We illustrate the use of our implementation to a vulnerability characterization problem in the context of fault injection attacks. Our method overcomes a major limitation of the initial proposal of robust reachability, without complicating its definition. From a practical view, this is a step toward new verification tools that are able to characterize program violations through high-level feedback.","PeriodicalId":20697,"journal":{"name":"Proceedings of the ACM on Programming Languages","volume":"59 5","pages":"2731 - 2760"},"PeriodicalIF":1.8,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139383522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Complex(ity) Landscape of Checking Infinite Descent 检查无限后裔的复杂景观

IF 1.8 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on Programming Languages

Pub Date : 2024-01-05 DOI: 10.1145/3632888

Liron Cohen, Adham Jabarin, Andrei Popescu, R. Rowe

Cyclic proof systems, in which induction is managed implicitly, are a promising approach to automatic verification. The soundness of cyclic proof graphs is ensured by checking them against a trace-based Infinite Descent property. Although the problem of checking Infinite Descent is known to be PSPACE-complete, this leaves much room for variation in practice. Indeed, a number of different approaches are employed across the various cyclic proof systems described in the literature. In this paper, we study criteria for Infinite Descent in an abstract, logic-independent setting. We look at criteria based on Büchi automata encodings and relational abstractions, and determine their parameterized time complexities in terms of natural dimensions of cyclic proofs: the numbers of vertices of the proof-tree graphs, and the vertex width—an upper bound on the number of components (e.g., formulas) of a sequent that can be simultaneously tracked for descent. We identify novel algorithms that improve upon the parameterised complexity of the existing algorithms. We implement the studied criteria and compare their performance on various benchmarks.

在循环证明系统中，归纳是隐式管理的，它是一种很有前途的自动验证方法。循环证明图的完备性是通过检查它们是否符合基于轨迹的 "无限后裔"（Infinite Descent）属性来确保的。虽然检查无限后裔的问题已知是 PSPACE-complete，但这在实践中留下了很大的变化空间。事实上，文献中描述的各种循环证明系统都采用了许多不同的方法。在本文中，我们将在一个抽象的、与逻辑无关的环境中研究无限后裔的标准。我们研究了基于布基自动机编码和关系抽象的标准，并根据循环证明的自然维度确定了它们的参数化时间复杂性：证明树图的顶点数和顶点宽度--可同时跟踪下降的序列成分（如公式）数量的上限。我们发现了一些新算法，这些算法提高了现有算法的参数化复杂度。我们实现了所研究的标准，并在各种基准上比较了它们的性能。

引用次数: 0