Exponential Lower Bounds for Threshold Circuits of Sub-Linear Depth and Energy

Kei Uchizawa, Haruki Abe
{"title":"Exponential Lower Bounds for Threshold Circuits of Sub-Linear Depth and Energy","authors":"Kei Uchizawa, Haruki Abe","doi":"10.4230/LIPIcs.MFCS.2023.85","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate computational power of threshold circuits and other theoretical models of neural networks in terms of the following four complexity measures: size (the number of gates), depth, weight and energy. Here the energy complexity of a circuit measures sparsity of their computation, and is defined as the maximum number of gates outputting non-zero values taken over all the input assignments. As our main result, we prove that any threshold circuit $C$ of size $s$, depth $d$, energy $e$ and weight $w$ satisfies $\\log (rk(M_C)) \\le ed (\\log s + \\log w + \\log n)$, where $rk(M_C)$ is the rank of the communication matrix $M_C$ of a $2n$-variable Boolean function that $C$ computes. Thus, such a threshold circuit $C$ is able to compute only a Boolean function of which communication matrix has rank bounded by a product of logarithmic factors of $s,w$ and linear factors of $d,e$. This implies an exponential lower bound on the size of even sublinear-depth threshold circuit if energy and weight are sufficiently small. For other models of neural networks such as a discretized ReLE circuits and decretized sigmoid circuits, we prove that a similar inequality also holds for a discretized circuit $C$: $rk(M_C) = O(ed(\\log s + \\log w + \\log n)^3)$.","PeriodicalId":369104,"journal":{"name":"International Symposium on Mathematical Foundations of Computer Science","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Symposium on Mathematical Foundations of Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.MFCS.2023.85","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we investigate computational power of threshold circuits and other theoretical models of neural networks in terms of the following four complexity measures: size (the number of gates), depth, weight and energy. Here the energy complexity of a circuit measures sparsity of their computation, and is defined as the maximum number of gates outputting non-zero values taken over all the input assignments. As our main result, we prove that any threshold circuit $C$ of size $s$, depth $d$, energy $e$ and weight $w$ satisfies $\log (rk(M_C)) \le ed (\log s + \log w + \log n)$, where $rk(M_C)$ is the rank of the communication matrix $M_C$ of a $2n$-variable Boolean function that $C$ computes. Thus, such a threshold circuit $C$ is able to compute only a Boolean function of which communication matrix has rank bounded by a product of logarithmic factors of $s,w$ and linear factors of $d,e$. This implies an exponential lower bound on the size of even sublinear-depth threshold circuit if energy and weight are sufficiently small. For other models of neural networks such as a discretized ReLE circuits and decretized sigmoid circuits, we prove that a similar inequality also holds for a discretized circuit $C$: $rk(M_C) = O(ed(\log s + \log w + \log n)^3)$.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
次线性深度和能量阈值电路的指数下界
在本文中,我们研究了阈值电路和其他神经网络理论模型的计算能力,根据以下四个复杂性指标:大小(门的数量),深度,重量和能量。这里,电路的能量复杂度衡量其计算的稀疏性,并被定义为在所有输入赋值上输出非零值的门的最大数量。作为我们的主要结果,我们证明了任何阈值电路$C$的大小为$s$,深度为$d$,能量为$e$,权值为$w$满足$\log (rk(M_C)) \le ed (\log s + \log w + \log n)$,其中$rk(M_C)$是$C$计算的$2n$变量布尔函数的通信矩阵$M_C$的秩。因此,这样的阈值电路$C$只能计算一个布尔函数,其通信矩阵的秩以对数因子$s,w$和线性因子$d,e$的乘积为界。这意味着,如果能量和重量足够小,甚至亚线性深度阈值电路的大小有一个指数下界。对于神经网络的其他模型,如离散化的ReLE电路和非离散的s形电路,我们证明了一个类似的不等式也适用于离散化的电路$C$: $rk(M_C) = O(ed(\log s + \log w + \log n)^3)$。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FPT Approximation and Subexponential Algorithms for Covering Few or Many Edges Dynamic constant time parallel graph algorithms with sub-linear work Polynomial-Delay Enumeration of Large Maximal Common Independent Sets in Two Matroids Entropic Risk for Turn-Based Stochastic Games On the Expressive Power of Regular Expressions with Backreferences
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1