Exponential Lower Bounds for Threshold Circuits of Sub-Linear Depth and Energy

International Symposium on Mathematical Foundations of Computer Science Pub Date : 2021-07-01 DOI:10.4230/LIPIcs.MFCS.2023.85

Kei Uchizawa, Haruki Abe

{"title":"Exponential Lower Bounds for Threshold Circuits of Sub-Linear Depth and Energy","authors":"Kei Uchizawa, Haruki Abe","doi":"10.4230/LIPIcs.MFCS.2023.85","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate computational power of threshold circuits and other theoretical models of neural networks in terms of the following four complexity measures: size (the number of gates), depth, weight and energy. Here the energy complexity of a circuit measures sparsity of their computation, and is defined as the maximum number of gates outputting non-zero values taken over all the input assignments. As our main result, we prove that any threshold circuit $C$ of size $s$, depth $d$, energy $e$ and weight $w$ satisfies $\\log (rk(M_C)) \\le ed (\\log s + \\log w + \\log n)$, where $rk(M_C)$ is the rank of the communication matrix $M_C$ of a $2n$-variable Boolean function that $C$ computes. Thus, such a threshold circuit $C$ is able to compute only a Boolean function of which communication matrix has rank bounded by a product of logarithmic factors of $s,w$ and linear factors of $d,e$. This implies an exponential lower bound on the size of even sublinear-depth threshold circuit if energy and weight are sufficiently small. For other models of neural networks such as a discretized ReLE circuits and decretized sigmoid circuits, we prove that a similar inequality also holds for a discretized circuit $C$: $rk(M_C) = O(ed(\\log s + \\log w + \\log n)^3)$.","PeriodicalId":369104,"journal":{"name":"International Symposium on Mathematical Foundations of Computer Science","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Symposium on Mathematical Foundations of Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.MFCS.2023.85","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, we investigate computational power of threshold circuits and other theoretical models of neural networks in terms of the following four complexity measures: size (the number of gates), depth, weight and energy. Here the energy complexity of a circuit measures sparsity of their computation, and is defined as the maximum number of gates outputting non-zero values taken over all the input assignments. As our main result, we prove that any threshold circuit $C$ of size $s$, depth $d$, energy $e$ and weight $w$ satisfies $\log (rk(M_C)) \le ed (\log s + \log w + \log n)$, where $rk(M_C)$ is the rank of the communication matrix $M_C$ of a $2n$-variable Boolean function that $C$ computes. Thus, such a threshold circuit $C$ is able to compute only a Boolean function of which communication matrix has rank bounded by a product of logarithmic factors of $s,w$ and linear factors of $d,e$. This implies an exponential lower bound on the size of even sublinear-depth threshold circuit if energy and weight are sufficiently small. For other models of neural networks such as a discretized ReLE circuits and decretized sigmoid circuits, we prove that a similar inequality also holds for a discretized circuit $C$: $rk(M_C) = O(ed(\log s + \log w + \log n)^3)$.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

次线性深度和能量阈值电路的指数下界

在本文中，我们研究了阈值电路和其他神经网络理论模型的计算能力，根据以下四个复杂性指标:大小(门的数量)，深度，重量和能量。这里，电路的能量复杂度衡量其计算的稀疏性，并被定义为在所有输入赋值上输出非零值的门的最大数量。作为我们的主要结果，我们证明了任何阈值电路$C$的大小为$s$，深度为$d$，能量为$e$，权值为$w$满足$\log (rk(M_C)) \le ed (\log s + \log w + \log n)$，其中$rk(M_C)$是$C$计算的$2n$变量布尔函数的通信矩阵$M_C$的秩。因此，这样的阈值电路$C$只能计算一个布尔函数，其通信矩阵的秩以对数因子$s,w$和线性因子$d,e$的乘积为界。这意味着，如果能量和重量足够小，甚至亚线性深度阈值电路的大小有一个指数下界。对于神经网络的其他模型，如离散化的ReLE电路和非离散的s形电路，我们证明了一个类似的不等式也适用于离散化的电路$C$: $rk(M_C) = O(ed(\log s + \log w + \log n)^3)$。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Symposium on Mathematical Foundations of Computer Science

自引率

0.00%

发文量

期刊最新文献

FPT Approximation and Subexponential Algorithms for Covering Few or Many Edges Dynamic constant time parallel graph algorithms with sub-linear work Polynomial-Delay Enumeration of Large Maximal Common Independent Sets in Two Matroids Entropic Risk for Turn-Based Stochastic Games On the Expressive Power of Regular Expressions with Backreferences