Journal of the ACM最新文献_第10页

Sampling-based Sublinear Low-rank Matrix Arithmetic Framework for Dequantizing Quantum Machine Learning 基于采样的亚线性低秩矩阵去量化量子机器学习算法框架

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2022-08-10 DOI: 10.1145/3549524

Nai-Hui Chia, A. Gilyén, Tongyang Li, Han-Hsuan Lin, Ewin Tang, C. Wang

We present an algorithmic framework for quantum-inspired classical algorithms on close-to-low-rank matrices, generalizing the series of results started by Tang’s breakthrough quantum-inspired algorithm for recommendation systems [STOC’19]. Motivated by quantum linear algebra algorithms and the quantum singular value transformation (SVT) framework of Gilyén et al. [STOC’19], we develop classical algorithms for SVT that run in time independent of input dimension, under suitable quantum-inspired sampling assumptions. Our results give compelling evidence that in the corresponding QRAM data structure input model, quantum SVT does not yield exponential quantum speedups. Since the quantum SVT framework generalizes essentially all known techniques for quantum linear algebra, our results, combined with sampling lemmas from previous work, suffice to generalize all prior results about dequantizing quantum machine learning algorithms. In particular, our classical SVT framework recovers and often improves the dequantization results on recommendation systems, principal component analysis, supervised clustering, support vector machines, low-rank regression, and semidefinite program solving. We also give additional dequantization results on low-rank Hamiltonian simulation and discriminant analysis. Our improvements come from identifying the key feature of the quantum-inspired input model that is at the core of all prior quantum-inspired results: ℓ2-norm sampling can approximate matrix products in time independent of their dimension. We reduce all our main results to this fact, making our exposition concise, self-contained, and intuitive.

我们提出了一个基于近低秩矩阵的量子启发经典算法的算法框架，推广了Tang在推荐系统中的突破性量子启发算法[STOC ' 19]所带来的一系列结果。受量子线性代数算法和gily等人[STOC ' 19]的量子奇异值变换(SVT)框架的启发，我们在合适的量子启发采样假设下，开发了与输入维无关的SVT经典算法。我们的结果给出了令人信服的证据，在相应的QRAM数据结构输入模型中，量子SVT不会产生指数级的量子速度。由于量子SVT框架基本上概括了所有已知的量子线性代数技术，我们的结果，结合以前工作的采样引理，足以概括所有关于去量子化量子机器学习算法的先前结果。特别是，我们的经典SVT框架在推荐系统、主成分分析、监督聚类、支持向量机、低秩回归和半定规划求解等方面恢复并经常改善去量化结果。我们还给出了低秩哈密顿模拟和判别分析的去量化结果。我们的改进来自于识别量子启发输入模型的关键特征，这是所有先前量子启发结果的核心:2范数采样可以在时间上独立于它们的维数近似矩阵乘积。我们将所有的主要结果归结为这一事实，使我们的阐述简洁、独立、直观。

{"title":"Sampling-based Sublinear Low-rank Matrix Arithmetic Framework for Dequantizing Quantum Machine Learning","authors":"Nai-Hui Chia, A. Gilyén, Tongyang Li, Han-Hsuan Lin, Ewin Tang, C. Wang","doi":"10.1145/3549524","DOIUrl":"https://doi.org/10.1145/3549524","url":null,"abstract":"We present an algorithmic framework for quantum-inspired classical algorithms on close-to-low-rank matrices, generalizing the series of results started by Tang’s breakthrough quantum-inspired algorithm for recommendation systems [STOC’19]. Motivated by quantum linear algebra algorithms and the quantum singular value transformation (SVT) framework of Gilyén et al. [STOC’19], we develop classical algorithms for SVT that run in time independent of input dimension, under suitable quantum-inspired sampling assumptions. Our results give compelling evidence that in the corresponding QRAM data structure input model, quantum SVT does not yield exponential quantum speedups. Since the quantum SVT framework generalizes essentially all known techniques for quantum linear algebra, our results, combined with sampling lemmas from previous work, suffice to generalize all prior results about dequantizing quantum machine learning algorithms. In particular, our classical SVT framework recovers and often improves the dequantization results on recommendation systems, principal component analysis, supervised clustering, support vector machines, low-rank regression, and semidefinite program solving. We also give additional dequantization results on low-rank Hamiltonian simulation and discriminant analysis. Our improvements come from identifying the key feature of the quantum-inspired input model that is at the core of all prior quantum-inspired results: ℓ2-norm sampling can approximate matrix products in time independent of their dimension. We reduce all our main results to this fact, making our exposition concise, self-contained, and intuitive.","PeriodicalId":50022,"journal":{"name":"Journal of the ACM","volume":"75 1","pages":"1 - 72"},"PeriodicalIF":2.5,"publicationDate":"2022-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82026044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Decentralized Asynchronous Crash-resilient Runtime Verification 分散式异步抗崩溃运行时验证

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2022-08-10 DOI: 10.1145/3550483

Borzoo Bonakdarpour, P. Fraigniaud, S. Rajsbaum, D. Rosenblueth, Corentin Travers

Runtime verification is a lightweight method for monitoring the formal specification of a system during its execution. It has recently been shown that a given state predicate can be monitored consistently by a set of crash-prone asynchronous distributed monitors observing the system, only if each monitor can emit verdicts taken from a large enough finite set. We revisit this impossibility result in the concrete context of linear-time logic (ltl) semantics for runtime verification, that is, when the correctness of the system is specified by an ltl formula on its execution traces. First, we show that monitors synthesized based on the 4-valued semantics of ltl (rv-ltl) may result in inconsistent distributed monitoring, even for some simple ltl formulas. More generally, given any ltl formula φ, we relate the number of different verdicts required by the monitors for consistently monitoring φ, with a specific structural characteristic of φ called its alternation number. Specifically, we show that, for every k ≥ 0, there is an ltl formula φ with alternation number k that cannot be verified at runtime by distributed monitors emitting verdicts from a set of cardinality smaller than k + 1. On the positive side, we define a family of logics, called distributed ltl (abbreviated as dltl), parameterized by k ≥ 0, which refines rv-ltl by incorporating 2k + 4 truth values. Our main contribution is to show that, for every k ≥ 0, every ltl formula φ with alternation number k can be consistently monitored by distributed monitors, each running an automaton based on a (2 ⌈ k/2 ⌉ +4)-valued logic taken from the dltl family.

运行时验证是一种轻量级方法，用于在系统执行期间监视系统的正式规范。最近的研究表明，一个给定的状态谓词可以由一组观察系统的容易崩溃的异步分布式监视器一致地监视，前提是每个监视器都可以从一个足够大的有限集合中发出结论。我们在用于运行时验证的线性时间逻辑(ltl)语义的具体上下文中重新讨论这个不可能结果，也就是说，当系统的正确性由其执行轨迹上的ltl公式指定时。首先，我们展示了基于ltl的4值语义(rv-ltl)合成的监视器可能导致不一致的分布式监视，即使对于一些简单的ltl公式也是如此。更一般地说，给定任何ltl公式φ，我们将监视器为持续监测φ所需的不同判决的数目与φ的特定结构特征(称为其交替数)联系起来。具体地说，我们表明，对于每一个k≥0，存在一个具有交替数k的小公式φ，该公式不能在运行时通过分布式监视器从小于k + 1的基数集发出判决来验证。在积极的方面，我们定义了一个逻辑族，称为分布式ltl(缩写为dltl)，参数化为k≥0，它通过合并2k + 4个真值来改进rv-ltl。我们的主要贡献是表明，对于每一个k≥0，每一个具有交替数k的小公式φ都可以由分布式监视器一致地监视，每个监视器都运行基于从小函数族中取的(2≤k/2²+4)值逻辑的自动机。

{"title":"Decentralized Asynchronous Crash-resilient Runtime Verification","authors":"Borzoo Bonakdarpour, P. Fraigniaud, S. Rajsbaum, D. Rosenblueth, Corentin Travers","doi":"10.1145/3550483","DOIUrl":"https://doi.org/10.1145/3550483","url":null,"abstract":"Runtime verification is a lightweight method for monitoring the formal specification of a system during its execution. It has recently been shown that a given state predicate can be monitored consistently by a set of crash-prone asynchronous distributed monitors observing the system, only if each monitor can emit verdicts taken from a large enough finite set. We revisit this impossibility result in the concrete context of linear-time logic (ltl) semantics for runtime verification, that is, when the correctness of the system is specified by an ltl formula on its execution traces. First, we show that monitors synthesized based on the 4-valued semantics of ltl (rv-ltl) may result in inconsistent distributed monitoring, even for some simple ltl formulas. More generally, given any ltl formula φ, we relate the number of different verdicts required by the monitors for consistently monitoring φ, with a specific structural characteristic of φ called its alternation number. Specifically, we show that, for every k ≥ 0, there is an ltl formula φ with alternation number k that cannot be verified at runtime by distributed monitors emitting verdicts from a set of cardinality smaller than k + 1. On the positive side, we define a family of logics, called distributed ltl (abbreviated as dltl), parameterized by k ≥ 0, which refines rv-ltl by incorporating 2k + 4 truth values. Our main contribution is to show that, for every k ≥ 0, every ltl formula φ with alternation number k can be consistently monitored by distributed monitors, each running an automaton based on a (2 ⌈ k/2 ⌉ +4)-valued logic taken from the dltl family.","PeriodicalId":50022,"journal":{"name":"Journal of the ACM","volume":"22 1","pages":"1 - 31"},"PeriodicalIF":2.5,"publicationDate":"2022-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91172810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 29

Co-lexicographically Ordering Automata and Regular Languages - Part I 自动机和规则语言的共词典排序。第1部分

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2022-08-09 DOI: 10.48550/arXiv.2208.04931

Nicola Cotumaccio, G. D’Agostino, A. Policriti, N. Prezza

The states of a finite-state automaton 𝒩 can be identified with collections of words in the prefix closure of the regular language accepted by 𝒩. But words can be ordered, and among the many possible orders a very natural one is the co-lexicographic order. Such naturalness stems from the fact that it suggests a transfer of the order from words to the automaton’s states. This suggestion is, in fact, concrete and in a number of articles automata admitting a total co-lexicographic (co-lex for brevity) ordering of states have been proposed and studied. Such class of ordered automata — Wheeler automata — turned out to require just a constant number of bits per transition to be represented and enable regular expression matching queries in constant time per matched character. Unfortunately, not all automata can be totally ordered as previously outlined. In the present work, we lay out a new theory showing that all automata can always be partially ordered, and an intrinsic measure of their complexity can be defined and effectively determined, namely, the minimum width p of one of their admissible co-lex partial orders–dubbed here the automaton’s co-lex width. We first show that this new measure captures at once the complexity of several seemingly-unrelated hard problems on automata. Any NFA of co-lex width p: (i) has an equivalent powerset DFA whose size is exponential in p rather than (as a classic analysis shows) in the NFA’s size; (ii) can be encoded using just Θ(log p) bits per transition; (iii) admits a linear-space data structure solving regular expression matching queries in time proportional to p2 per matched character. Some consequences of this new parameterization of automata are that PSPACE-hard problems such as NFA equivalence are FPT in p, and quadratic lower bounds for the regular expression matching problem do not hold for sufficiently small p. Having established that the co-lex width of an automaton is a fundamental complexity measure, we proceed by (i) determining its computational complexity and (ii) extending this notion from automata to regular languages by studying their smallest-width accepting NFAs and DFAs. In this work we focus on the deterministic case and prove that a canonical minimum-width DFA accepting a language ℒ–dubbed the Hasse automaton ℋ of ℒ–can be exhibited. ℋ provides, in a precise sense, the best possible way to (partially) order the states of any DFA accepting ℒ, as long as we want to maintain an operational link with the (co-lexicographic) order of ℒ’s prefixes. Finally, we explore the relationship between two conflicting objectives: minimizing the width and minimizing the number of states of a DFA. In this context, we provide an analogue of the Myhill-Nerode Theorem for co-lexicographically ordered regular languages.

有限状态自动机的状态可以用正则语言的前缀闭包中的单词集合来标识。但是单词是可以排序的，在许多可能的顺序中，一个非常自然的顺序是词典编纂顺序。这种自然源于这样一个事实，即它表明了从单词到自动机状态的顺序转移。事实上，这个建议是具体的，并且在许多文章中已经提出和研究了承认状态的完全共词典排序(简称共词典)的自动机。这类有序自动机——惠勒自动机——证明每次转换只需要固定数量的比特来表示，并且在每个匹配字符的恒定时间内支持正则表达式匹配查询。不幸的是，并非所有自动机都可以像前面所述的那样完全有序。在目前的工作中，我们提出了一个新的理论，表明所有自动机总是可以部分有序的，并且可以定义和有效地确定其复杂性的内在度量，即它们的一个可容许的协环偏序的最小宽度p -这里称为自动机的协环宽度。我们首先表明，这种新方法可以立即捕捉到自动机上几个看似不相关的难题的复杂性。任何协环宽度为p:(i)的NFA都有一个等效的幂集DFA，其大小在p上呈指数增长，而不是(如经典分析所示)在NFA的大小上呈指数增长;(ii)每个转换只使用Θ(log p)位进行编码;(iii)允许线性空间数据结构以与每个匹配字符p2成比例的时间解决正则表达式匹配查询。这种自动机的新参数化的一些结果是，PSPACE-hard问题，如NFA等价，在p中是FPT，而正则表达式匹配问题的二次下界对于足够小的p不成立。建立了自动机的协环宽度是一个基本的复杂性度量，我们通过(i)确定其计算复杂度和(ii)通过研究它们的最小宽度接受nfa和dfa，将这一概念从自动机扩展到常规语言。在本文中，我们重点讨论了确定性情况，并证明了一个正则最小宽度DFA可以被展示出来，该DFA接受一种语言，被称为Hasse自动机。从精确的意义上说，只要我们想要保持一个具有(共字典)次序的操作链接，那么对于任何接受(部分)排序的DFA的状态，h提供了最好的可能方法。最后，我们探讨了两个相互冲突的目标之间的关系:最小化DFA的宽度和最小化状态数。在这种情况下，我们为共字典顺序有序的正则语言提供了Myhill-Nerode定理的类比。

{"title":"Co-lexicographically Ordering Automata and Regular Languages - Part I","authors":"Nicola Cotumaccio, G. D’Agostino, A. Policriti, N. Prezza","doi":"10.48550/arXiv.2208.04931","DOIUrl":"https://doi.org/10.48550/arXiv.2208.04931","url":null,"abstract":"The states of a finite-state automaton 𝒩 can be identified with collections of words in the prefix closure of the regular language accepted by 𝒩. But words can be ordered, and among the many possible orders a very natural one is the co-lexicographic order. Such naturalness stems from the fact that it suggests a transfer of the order from words to the automaton’s states. This suggestion is, in fact, concrete and in a number of articles automata admitting a total co-lexicographic (co-lex for brevity) ordering of states have been proposed and studied. Such class of ordered automata — Wheeler automata — turned out to require just a constant number of bits per transition to be represented and enable regular expression matching queries in constant time per matched character. Unfortunately, not all automata can be totally ordered as previously outlined. In the present work, we lay out a new theory showing that all automata can always be partially ordered, and an intrinsic measure of their complexity can be defined and effectively determined, namely, the minimum width p of one of their admissible co-lex partial orders–dubbed here the automaton’s co-lex width. We first show that this new measure captures at once the complexity of several seemingly-unrelated hard problems on automata. Any NFA of co-lex width p: (i) has an equivalent powerset DFA whose size is exponential in p rather than (as a classic analysis shows) in the NFA’s size; (ii) can be encoded using just Θ(log p) bits per transition; (iii) admits a linear-space data structure solving regular expression matching queries in time proportional to p2 per matched character. Some consequences of this new parameterization of automata are that PSPACE-hard problems such as NFA equivalence are FPT in p, and quadratic lower bounds for the regular expression matching problem do not hold for sufficiently small p. Having established that the co-lex width of an automaton is a fundamental complexity measure, we proceed by (i) determining its computational complexity and (ii) extending this notion from automata to regular languages by studying their smallest-width accepting NFAs and DFAs. In this work we focus on the deterministic case and prove that a canonical minimum-width DFA accepting a language ℒ–dubbed the Hasse automaton ℋ of ℒ–can be exhibited. ℋ provides, in a precise sense, the best possible way to (partially) order the states of any DFA accepting ℒ, as long as we want to maintain an operational link with the (co-lexicographic) order of ℒ’s prefixes. Finally, we explore the relationship between two conflicting objectives: minimizing the width and minimizing the number of states of a DFA. In this context, we provide an analogue of the Myhill-Nerode Theorem for co-lexicographically ordered regular languages.","PeriodicalId":50022,"journal":{"name":"Journal of the ACM","volume":"15 1","pages":"1 - 73"},"PeriodicalIF":2.5,"publicationDate":"2022-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85355525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Chains, Koch Chains, and Point Sets with Many Triangulations 链，科赫链，和点集与许多三角

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2022-03-15 DOI: 10.1145/3585535

Daniel Rutschmann, Manuel Wettstein

We introduce the abstract notion of a chain, which is a sequence of n points in the plane, ordered by x-coordinates, so that the edge between any two consecutive points is unavoidable as far as triangulations are concerned. A general theory of the structural properties of chains is developed, alongside a general understanding of their number of triangulations. We also describe an intriguing new and concrete configuration, which we call the Koch chain due to its similarities to the Koch curve. A specific construction based on Koch chains is then shown to have Ω (9.08n) triangulations. This is a significant improvement over the previous and long-standing lower bound of Ω (8.65n) for the maximum number of triangulations of planar point sets.

我们引入了链的抽象概念，它是平面上n个点的序列，由x坐标排序，因此对于三角剖分而言，任意两个连续点之间的边是不可避免的。关于链的结构性质的一般理论被开发出来，同时对它们的三角剖分数量有了一般的理解。我们还描述了一个有趣的新的和具体的结构，我们称之为科赫链，因为它与科赫曲线相似。基于科赫链的特定结构随后显示具有Ω (9.08n)三角剖分。对于平面点集的最大三角剖分数，这是对先前和长期存在的Ω (8.65n)下界的重大改进。

引用次数: 1

Chasing Convex Bodies with Linear Competitive Ratio 具有线性竞争比的凸体追逐

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2021-08-12 DOI: 10.1145/3450349

J. ArgueC., GuptaAnupam, TangZiye, GuruganeshGuru

We study the problem of chasing convex bodies online: given a sequence of convex bodies the algorithm must respond with points in an online fashion (i.e., is chosen before is revealed). The objecti...

我们研究了在线追踪凸体的问题:给定一个凸体序列，算法必须以在线方式响应点(即，在被揭示之前被选择)。objecti……

引用次数: 1

How to Construct Quantum Random Functions 如何构造量子随机函数

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2021-08-12 DOI: 10.1145/3450745

ZhandryMark

Pseudorandom functions (PRFs) are one of the foundational concepts in theoretical computer science, with numerous applications in complexity theory and cryptography. In this work, we study the secu...

伪随机函数是理论计算机科学的基本概念之一，在复杂性理论和密码学中有着广泛的应用。在这项工作中，我们研究了…

引用次数: 228

Rate-independent Computation in Continuous Chemical Reaction Networks 连续化学反应网络中的速率无关计算

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2021-07-29 DOI: 10.1145/3590776

Ho-Lin Chen, David Doty, Wyatt Reeves, D. Soloveichik

Understanding the algorithmic behaviors that are in principle realizable in a chemical system is necessary for a rigorous understanding of the design principles of biological regulatory networks. Further, advances in synthetic biology herald the time when we will be able to rationally engineer complex chemical systems and when idealized formal models will become blueprints for engineering. Coupled chemical interactions in a well-mixed solution are commonly formalized as chemical reaction networks (CRNs). However, despite the widespread use of CRNs in the natural sciences, the range of computational behaviors exhibited by CRNs is not well understood. Here, we study the following problem: What functions f : ℝk → ℝ can be computed by a CRN, in which the CRN eventually produces the correct amount of the “output” molecule, no matter the rate at which reactions proceed? This captures a previously unexplored but very natural class of computations: For example, the reaction X1 + X2 → Y can be thought to compute the function y = min (x1, x2). Such a CRN is robust in the sense that it is correct whether its evolution is governed by the standard model of mass-action kinetics, alternatives such as Hill-function or Michaelis-Menten kinetics, or other arbitrary models of chemistry that respect the (fundamentally digital) stoichiometric constraints (what are the reactants and products?). We develop a reachability relation based on a broad notion of “what could happen” if reaction rates can vary arbitrarily over time. Using reachability, we define stable computation analogously to probability 1 computation in distributed computing and connect it with a seemingly stronger notion of rate-independent computation based on convergence in the limit t → ∞ under a wide class of generalized rate laws. Besides the direct mapping of a concentration to a nonnegative analog value, we also consider the “dual-rail representation” that can represent negative values as the difference of two concentrations and allows the composition of CRN modules. We prove that a function is rate-independently computable if and only if it is piecewise linear (with rational coefficients) and continuous (dual-rail representation), or non-negative with discontinuities occurring only when some inputs switch from zero to positive (direct representation). The many contexts where continuous piecewise linear functions are powerful targets for implementation, combined with the systematic construction we develop for computing these functions, demonstrate the potential of rate-independent chemical computation.

理解原则上在化学系统中可实现的算法行为对于严格理解生物调节网络的设计原则是必要的。此外，合成生物学的进步预示着我们将能够合理地设计复杂的化学系统，理想化的正式模型将成为工程的蓝图。在混合良好的溶液中，耦合的化学相互作用通常形式化为化学反应网络(crn)。然而，尽管crn在自然科学中广泛使用，但crn所表现出的计算行为的范围尚未得到很好的理解。在这里，我们研究了以下问题:什么函数f:∈k→∈可以由CRN计算，其中CRN最终产生正确数量的“输出”分子，无论反应进行的速度如何?这捕获了以前未探索但非常自然的一类计算:例如，反应X1 + X2→Y可以被认为是计算函数Y = min (X1, X2)。这样的CRN在某种意义上是可靠的，它是正确的，无论它的进化是由质量作用动力学的标准模型，希尔函数或Michaelis-Menten动力学等替代品，还是其他尊重(基本上是数字的)化学计量学约束(什么是反应物和产物?)的任意化学模型控制的。如果反应速率随时间任意变化，我们基于“可能发生的事情”这一广义概念开发了可达性关系。利用可达性，我们将稳定计算类比地定义为分布式计算中的概率1计算，并将其与广义速率定律下基于极限t→∞收敛的看似更强的速率无关计算概念联系起来。除了将浓度直接映射到非负模拟值之外，我们还考虑了“双轨表示”，它可以将负值表示为两个浓度的差值，并允许组成CRN模块。我们证明了一个函数是速率独立可计算的，当且仅当它是分段线性(具有有理系数)和连续(双轨道表示)，或者非负的，只有当一些输入从零切换到正(直接表示)时才发生不连续。在许多情况下，连续分段线性函数是实现的强大目标，结合我们为计算这些函数而开发的系统结构，展示了速率无关化学计算的潜力。

{"title":"Rate-independent Computation in Continuous Chemical Reaction Networks","authors":"Ho-Lin Chen, David Doty, Wyatt Reeves, D. Soloveichik","doi":"10.1145/3590776","DOIUrl":"https://doi.org/10.1145/3590776","url":null,"abstract":"Understanding the algorithmic behaviors that are in principle realizable in a chemical system is necessary for a rigorous understanding of the design principles of biological regulatory networks. Further, advances in synthetic biology herald the time when we will be able to rationally engineer complex chemical systems and when idealized formal models will become blueprints for engineering. Coupled chemical interactions in a well-mixed solution are commonly formalized as chemical reaction networks (CRNs). However, despite the widespread use of CRNs in the natural sciences, the range of computational behaviors exhibited by CRNs is not well understood. Here, we study the following problem: What functions f : ℝk → ℝ can be computed by a CRN, in which the CRN eventually produces the correct amount of the “output” molecule, no matter the rate at which reactions proceed? This captures a previously unexplored but very natural class of computations: For example, the reaction X1 + X2 → Y can be thought to compute the function y = min (x1, x2). Such a CRN is robust in the sense that it is correct whether its evolution is governed by the standard model of mass-action kinetics, alternatives such as Hill-function or Michaelis-Menten kinetics, or other arbitrary models of chemistry that respect the (fundamentally digital) stoichiometric constraints (what are the reactants and products?). We develop a reachability relation based on a broad notion of “what could happen” if reaction rates can vary arbitrarily over time. Using reachability, we define stable computation analogously to probability 1 computation in distributed computing and connect it with a seemingly stronger notion of rate-independent computation based on convergence in the limit t → ∞ under a wide class of generalized rate laws. Besides the direct mapping of a concentration to a nonnegative analog value, we also consider the “dual-rail representation” that can represent negative values as the difference of two concentrations and allows the composition of CRN modules. We prove that a function is rate-independently computable if and only if it is piecewise linear (with rational coefficients) and continuous (dual-rail representation), or non-negative with discontinuities occurring only when some inputs switch from zero to positive (direct representation). The many contexts where continuous piecewise linear functions are powerful targets for implementation, combined with the systematic construction we develop for computing these functions, demonstrate the potential of rate-independent chemical computation.","PeriodicalId":50022,"journal":{"name":"Journal of the ACM","volume":"103 1","pages":"1 - 61"},"PeriodicalIF":2.5,"publicationDate":"2021-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73684848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Domain-Theoretic Approach to Statistical Programming Languages 统计程序设计语言的领域理论方法

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2021-06-30 DOI: 10.1145/3611660

J. Goubault-Larrecq, Xiaodong Jia, Clément Théron

We give a domain-theoretic semantics to a statistical programming language, using the plain old category of dcpos, in contrast to some more sophisticated recent proposals. Remarkably, our monad of minimal valuations is commutative, which allows for program transformations that permute the order of independent random draws, as one would expect. A similar property is not known for Jones and Plotkin’ s monad of continuous valuations. Instead of working with true real numbers, we work with exact real arithmetic, providing a bridge towards possible implementations. (Implementations by themselves are not addressed here.) Rather remarkably, we show that restricting ourselves to minimal valuations does not restrict us much: all measures on the real line can be modeled by minimal valuations on the domain (mathbf {I}mathbb {R}_bot ) of exact real arithmetic. We give three operational semantics for our language, and we show that they are all adequate with respect to the denotational semantics. We also explore quite a few examples in order to demonstrate that our semantics computes exactly as one would expect, and in order to debunk the myth that a semantics based on continuous maps would not be expressive enough to encode measures with non-compact support using only measures with compact support, or to encode measures via non-continuous density functions, for instance. Our examples also include some useful, non-trivial cases of distributions on higher-order objects.

我们为统计编程语言提供了一个领域理论语义，使用dcpos的普通旧类别，而不是最近提出的一些更复杂的建议。值得注意的是，我们的最小值单子是可交换的，它允许程序转换，可以改变独立随机抽取的顺序，正如人们所期望的那样。Jones和Plotkin的连续估值单子中没有类似的性质。我们不是使用真正的实数，而是使用精确的实数算术，为可能的实现提供了一个桥梁。(这里不讨论实现本身。)值得注意的是，我们证明了将自己限制在最小值上并不会对我们造成很大的限制:实数线上的所有度量都可以通过精确实数算法的(mathbf {I}mathbb {R}_bot )域上的最小值来建模。我们为我们的语言给出了三种操作语义，并且证明了它们对于指称语义来说都是充分的。我们还探讨了一些例子，以证明我们的语义计算完全符合人们的期望，并且为了揭穿基于连续映射的语义不足以表达的神话，例如，仅使用具有紧凑支持的度量来编码具有非紧凑支持的度量，或者通过非连续密度函数来编码度量。我们的示例还包括高阶对象上的一些有用的、不平凡的分布情况。

{"title":"A Domain-Theoretic Approach to Statistical Programming Languages","authors":"J. Goubault-Larrecq, Xiaodong Jia, Clément Théron","doi":"10.1145/3611660","DOIUrl":"https://doi.org/10.1145/3611660","url":null,"abstract":"We give a domain-theoretic semantics to a statistical programming language, using the plain old category of dcpos, in contrast to some more sophisticated recent proposals. Remarkably, our monad of minimal valuations is commutative, which allows for program transformations that permute the order of independent random draws, as one would expect. A similar property is not known for Jones and Plotkin’ s monad of continuous valuations. Instead of working with true real numbers, we work with exact real arithmetic, providing a bridge towards possible implementations. (Implementations by themselves are not addressed here.) Rather remarkably, we show that restricting ourselves to minimal valuations does not restrict us much: all measures on the real line can be modeled by minimal valuations on the domain (mathbf {I}mathbb {R}_bot ) of exact real arithmetic. We give three operational semantics for our language, and we show that they are all adequate with respect to the denotational semantics. We also explore quite a few examples in order to demonstrate that our semantics computes exactly as one would expect, and in order to debunk the myth that a semantics based on continuous maps would not be expressive enough to encode measures with non-compact support using only measures with compact support, or to encode measures via non-continuous density functions, for instance. Our examples also include some useful, non-trivial cases of distributions on higher-order objects.","PeriodicalId":50022,"journal":{"name":"Journal of the ACM","volume":"2016 1","pages":""},"PeriodicalIF":2.5,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82622227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

A Universal Law of Robustness via Isoperimetry 等径法鲁棒性的普遍规律

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2021-05-26 DOI: 10.1145/3578580

Sébastien Bubeck, Mark Sellke

Classically, data interpolation with a parametrized model class is possible as long as the number of parameters is larger than the number of equations to be satisfied. A puzzling phenomenon in deep learning is that models are trained with many more parameters than what this classical theory would suggest. We propose a partial theoretical explanation for this phenomenon. We prove that for a broad class of data distributions and model classes, overparametrization is necessary if one wants to interpolate the data smoothly. Namely we show that smooth interpolation requires d times more parameters than mere interpolation, where d is the ambient data dimension. We prove this universal law of robustness for any smoothly parametrized function class with polynomial size weights, and any covariate distribution verifying isoperimetry (or a mixture thereof). In the case of two-layer neural networks and Gaussian covariates, this law was conjectured in prior work by Bubeck, Li, and Nagaraj. We also give an interpretation of our result as an improved generalization bound for model classes consisting of smooth functions.

经典地，只要参数的数量大于要满足的方程的数量，就可以使用参数化模型类进行数据插值。深度学习中一个令人困惑的现象是，模型训练时使用的参数比经典理论所建议的要多得多。我们对这一现象提出了部分的理论解释。我们证明了对于一类广泛的数据分布和模型类，如果想要平滑地插值数据，过度参数化是必要的。也就是说，我们表明平滑插值需要的参数是单纯插值的d倍，其中d是环境数据维数。对于任何具有多项式大小权重的光滑参数化函数类，以及任何验证等规性的协变量分布(或其混合物)，我们证明了这一鲁棒性的普遍定律。在双层神经网络和高斯协变量的情况下，Bubeck、Li和Nagaraj在之前的工作中推测了这一定律。对于由光滑函数组成的模型类，我们也给出了一个改进的泛化界的解释。

引用次数: 146

Near-optimal Distributed Triangle Enumeration via Expander Decompositions 基于扩展器分解的近最优分布三角枚举

IF 2.5 2区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of the ACM

Pub Date : 2021-05-13 DOI: 10.1145/3446330

Yi-Jun Chang, S. Pettie, Thatchaphol Saranurak, Hengjie Zhang

We present improved distributed algorithms for variants of the triangle finding problem in the <?TeX $mathsf {CONGEST}$?> model. We show that triangle detection, counting, and enumeration can be solved in <?TeX $tilde{O}(n^{1/3})$?> rounds using expander decompositions . This matches the triangle enumeration lower bound of <?TeX $tilde{Omega }(n^{1/3})$?> by Izumi and Le Gall [PODC’17] and Pandurangan, Robinson, and Scquizzato [SPAA’18], which holds even in the <?TeX $mathsf {CONGESTED}text{-}mathsf {CLIQUE}$?> model. The previous upper bounds for triangle detection and enumeration in <?TeX $mathsf {CONGEST}$?> were <?TeX $tilde{O}(n^{2/3})$?> and <?TeX $tilde{O}(n^{3/4})$?> , respectively, due to Izumi and Le Gall [PODC’17]. An <?TeX $(epsilon ,phi)$?> -expander decomposition of a graph <?TeX $G=(V,E)$?>

我们提出了改进的分布式算法来解决模型中三角形查找问题的变体。我们证明了三角形检测、计数和枚举可以使用扩展器分解在轮询中解决。这与Izumi和Le Gall [PODC ' 17]以及Pandurangan、Robinson和Scquizzato [SPAA ' 18]的三角枚举下界相吻合，即使在模型中也成立。由于Izumi和Le Gall [PODC ' 17]，之前的三角形检测和枚举上界分别为和。图的扩展分解是顶点的聚类，使得(i)每个聚类诱导出一个电导最少且(ii)簇间边数最多的子图。我们证明了对于任意的-展开式分解都可以被构造为整数一个正整数。例如，-展开器分解只需要计算轮数，这是最优的次多项式因子，对于任意小的常数，展开分解都可以以轮为单位计算。我们的三角查找算法基于以下使用扩展器分解的通用框架，这是独立的兴趣。我们首先构造一个展开器分解。对于每个集群，我们通过应用基于Ghaffari、Kuhn和Su [PODC ' 17]的扩展路由算法来模拟开销较小的算法。最后，我们使用递归调用处理集群间边缘。

{"title":"Near-optimal Distributed Triangle Enumeration via Expander Decompositions","authors":"Yi-Jun Chang, S. Pettie, Thatchaphol Saranurak, Hengjie Zhang","doi":"10.1145/3446330","DOIUrl":"https://doi.org/10.1145/3446330","url":null,"abstract":"<jats:p>\u0000 We present improved distributed algorithms for variants of the triangle finding problem in the\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $mathsf {CONGEST}$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 model. We show that triangle detection, counting, and enumeration can be solved in\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $tilde{O}(n^{1/3})$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 rounds using\u0000 <jats:italic>expander decompositions</jats:italic>\u0000 . This matches the triangle enumeration lower bound of\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $tilde{Omega }(n^{1/3})$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 by Izumi and Le Gall [PODC’17] and Pandurangan, Robinson, and Scquizzato [SPAA’18], which holds even in the\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $mathsf {CONGESTED}text{-}mathsf {CLIQUE}$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 model. The previous upper bounds for triangle detection and enumeration in\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $mathsf {CONGEST}$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 were\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $tilde{O}(n^{2/3})$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 and\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $tilde{O}(n^{3/4})$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 , respectively, due to Izumi and Le Gall [PODC’17].\u0000 </jats:p>\u0000 <jats:p>\u0000 An\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $(epsilon ,phi)$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 </jats:inline-formula>\u0000 -expander decomposition of a graph\u0000 <jats:inline-formula>\u0000 <jats:alternatives>\u0000 <jats:tex-math>\u0000 <?TeX $G=(V,E)$?>\u0000 </jats:tex-math>\u0000 </jats:alternatives>\u0000 ","PeriodicalId":50022,"journal":{"name":"Journal of the ACM","volume":"30 1","pages":"21:1-21:36"},"PeriodicalIF":2.5,"publicationDate":"2021-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76620735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13