首页 > 最新文献

arXiv - CS - Computational Complexity最新文献

英文 中文
Extensively Not P-Bi-Immune promiseBQP-Complete Languages 广泛的非 Pi-Immune promiseBQP-Complete 语言
Pub Date : 2024-06-24 DOI: arxiv-2406.16764
Andrew Jackson
In this paper, I first establish -- via methods other than theGottesman-Knill theorem -- the existence of an infinite set of instances ofsimulating a quantum circuit to decide a decision problem that can be simulatedclassically. I then examine under what restrictions on quantum circuits theexistence of infinitely many classically simulable instances persists. Thereturns out to be a vast number of such restrictions, and any combination ofthose found can be applied at the same time without eliminating the infiniteset of classically simulable instances. Further analysis of the tools used inthis then shows there exists a language that every (promise) BQP language isone-one reducible to. This language is also not P-bi-immune under very manypromises.
在本文中,我首先通过戈特曼-克尼尔定理以外的方法,证明了存在无限套模拟量子电路的实例,以决定一个可以经典模拟的决策问题。然后,我研究了量子电路在哪些限制条件下存在无限多的经典可模拟实例。结果发现有大量这样的限制,而且任何这些限制的组合都可以同时应用,而不会消除经典可模拟实例的无限集。对其中所用工具的进一步分析表明,存在一种语言,每一种(承诺)BQP 语言都是一一可还原的。这种语言在许诺非常多的情况下也不是 P-bi-immune 语言。
{"title":"Extensively Not P-Bi-Immune promiseBQP-Complete Languages","authors":"Andrew Jackson","doi":"arxiv-2406.16764","DOIUrl":"https://doi.org/arxiv-2406.16764","url":null,"abstract":"In this paper, I first establish -- via methods other than the\u0000Gottesman-Knill theorem -- the existence of an infinite set of instances of\u0000simulating a quantum circuit to decide a decision problem that can be simulated\u0000classically. I then examine under what restrictions on quantum circuits the\u0000existence of infinitely many classically simulable instances persists. There\u0000turns out to be a vast number of such restrictions, and any combination of\u0000those found can be applied at the same time without eliminating the infinite\u0000set of classically simulable instances. Further analysis of the tools used in\u0000this then shows there exists a language that every (promise) BQP language is\u0000one-one reducible to. This language is also not P-bi-immune under very many\u0000promises.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assembly Theory and its Relationship with Computational Complexity 组装理论及其与计算复杂性的关系
Pub Date : 2024-06-18 DOI: arxiv-2406.12176
Christopher Kempes, Sara I. Walker, Michael Lachmann, Leroy Cronin
Assembly theory (AT) quantifies selection using the assembly equation andidentifies complex objects that occur in abundance based on two measurements,assembly index and copy number. The assembly index is determined by the minimalnumber of recursive joining operations necessary to construct an object frombasic parts, and the copy number is how many of the given object(s) areobserved. Together these allow defining a quantity, called Assembly, whichcaptures the amount of causation required to produce the observed objects inthe sample. AT's focus on how selection generates complexity offers a distinctapproach to that of computational complexity theory which focuses on minimumdescriptions via compressibility. To explore formal differences between the twoapproaches, we show several simple and explicit mathematical examplesdemonstrating that the assembly index, itself only one piece of the theoreticalframework of AT, is formally not equivalent to other commonly used complexitymeasures from computer science and information theory including Huffmanencoding and Lempel-Ziv-Welch compression.
组装理论(AT)利用组装方程对选择进行量化,并根据组装指数和拷贝数这两个测量值来识别大量出现的复杂对象。装配指数由从基本部件构建一个物体所需的递归连接操作的最小数量决定,而拷贝数则是观测到的给定物体的数量。这些因素结合在一起,就可以定义一个称为 "集合"(Assembly)的量,它可以捕捉到产生样本中观察到的对象所需的因果关系量。计算复杂性理论关注的是选择如何产生复杂性,这与计算复杂性理论关注通过可压缩性进行最小描述的方法截然不同。为了探讨这两种方法在形式上的差异,我们展示了几个简单明了的数学例子,证明装配指数本身只是 AT 理论框架的一部分,在形式上并不等同于计算机科学和信息论中其他常用的复杂性度量,包括哈夫曼编码和 Lempel-Ziv-Welch 压缩。
{"title":"Assembly Theory and its Relationship with Computational Complexity","authors":"Christopher Kempes, Sara I. Walker, Michael Lachmann, Leroy Cronin","doi":"arxiv-2406.12176","DOIUrl":"https://doi.org/arxiv-2406.12176","url":null,"abstract":"Assembly theory (AT) quantifies selection using the assembly equation and\u0000identifies complex objects that occur in abundance based on two measurements,\u0000assembly index and copy number. The assembly index is determined by the minimal\u0000number of recursive joining operations necessary to construct an object from\u0000basic parts, and the copy number is how many of the given object(s) are\u0000observed. Together these allow defining a quantity, called Assembly, which\u0000captures the amount of causation required to produce the observed objects in\u0000the sample. AT's focus on how selection generates complexity offers a distinct\u0000approach to that of computational complexity theory which focuses on minimum\u0000descriptions via compressibility. To explore formal differences between the two\u0000approaches, we show several simple and explicit mathematical examples\u0000demonstrating that the assembly index, itself only one piece of the theoretical\u0000framework of AT, is formally not equivalent to other commonly used complexity\u0000measures from computer science and information theory including Huffman\u0000encoding and Lempel-Ziv-Welch compression.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On $NP$ versus ${rm co}NP$ 关于 $NP$ 与 ${rm co}NP$
Pub Date : 2024-06-15 DOI: arxiv-2406.10476
Tianrong Lin
We prove in this paper that there is a language $L_d$ accepted by somenondeterministic Turing machines but not by any ${rm co}mathcal{NP}$-machines(defined later). We further show that $L_d$ is in $mathcal{NP}$, thus provingthat $mathcal{NP}neq{rm co}mathcal{NP}$. The techniques used in this paperare lazy-diagonalization and the novel new technique developed in author'srecent work cite{Lin21}. As a by-product, we reach the important resultcite{Lin21} that $mathcal{P}neqmathcal{NP}$ once again, which is clear fromthe above outcome and the well-known fact that $mathcal{P}={rmco}mathcal{P}$. Other direct consequences are also summarized.
我们在本文中证明,有一种语言$L_d$被一些非确定性图灵机所接受,但不被任何${rm co}mathcal{NP}$机器所接受(定义在后)。我们进一步证明 $L_d$ 在 $mathcal{NP}$ 中,从而证明 $mathcal{NP}neq{rm co}mathcal{NP}$.本文使用的技术是懒对角化技术和作者在新近的工作中开发的新技术(cite{Lin21})。作为一个副产品,我们再次得出了 $mathcal{P}neqmathcal{NP}$ 这个重要结果,这一点从上述结果和众所周知的事实 $mathcal{P}={rmco}mathcal{P}$ 中可以清楚地看出。我们还总结了其他直接后果。
{"title":"On $NP$ versus ${rm co}NP$","authors":"Tianrong Lin","doi":"arxiv-2406.10476","DOIUrl":"https://doi.org/arxiv-2406.10476","url":null,"abstract":"We prove in this paper that there is a language $L_d$ accepted by some\u0000nondeterministic Turing machines but not by any ${rm co}mathcal{NP}$-machines\u0000(defined later). We further show that $L_d$ is in $mathcal{NP}$, thus proving\u0000that $mathcal{NP}neq{rm co}mathcal{NP}$. The techniques used in this paper\u0000are lazy-diagonalization and the novel new technique developed in author's\u0000recent work cite{Lin21}. As a by-product, we reach the important result\u0000cite{Lin21} that $mathcal{P}neqmathcal{NP}$ once again, which is clear from\u0000the above outcome and the well-known fact that $mathcal{P}={rm\u0000co}mathcal{P}$. Other direct consequences are also summarized.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Refinement of the McCreight-Meyer Union Theorem 麦克雷特-迈耶联合定理的完善
Pub Date : 2024-06-12 DOI: arxiv-2406.08600
Matthew Fox, Chaitanya Karamchedu
Using properties of Blum complexity measures and certain complexity classoperators, we exhibit a total computable and non-decreasing function$t_{mathsf{poly}}$ such that for all $k$, $Sigma_kmathsf{P} =Sigma_kmathsf{TIME}(t_{mathsf{poly}})$, $mathsf{BPP} =mathsf{BPTIME}(t_{mathsf{poly}})$, $mathsf{RP} =mathsf{RTIME}(t_{mathsf{poly}})$, $mathsf{UP} =mathsf{UTIME}(t_{mathsf{poly}})$, $mathsf{PP} =mathsf{PTIME}(t_{mathsf{poly}})$, $mathsf{Mod}_kmathsf{P} =mathsf{Mod}_kmathsf{TIME}(t_{mathsf{poly}})$, $mathsf{PSPACE} =mathsf{DSPACE}(t_{mathsf{poly}})$, and so forth. A similar statement holdsfor any collection of language classes, provided that each class is definableby applying a certain complexity class operator to some Blum complexity class.
利用布卢姆复杂性度量和某些复杂性类运算符的特性,我们展示了一个可计算且不递减的函数$t_{mathsf{poly}}$,对于所有$k$,该函数都是可计算的、$Sigma_kmathsf{P} =Sigma_kmathsf{TIME}(t_{mathsf{poly}}$, $mathsf{BPP} =mathsf{BPTIME}(t_{mathsf{poly}}$、$mathsf{RP} =mathsf{RTIME}(t_{mathsf{poly}})$, $mathsf{UP} =mathsf{UTIME}(t_{mathsf{poly}})$, $mathsf{PP} =mathsf{PTIME}(t_{mathsf{poly}})$、$mathsf{Mod}_kmathsf{P} =mathsf{Mod}_kmathsf{TIME}(t_{mathsf{poly}})$, $mathsf{PSPACE} =mathsf{DSPACE}(t_{mathsf{poly}})$, 等等。对于任何语言类集合,只要每个类都可以通过将某个复杂度类算子应用于某个布卢姆复杂度类来定义,那么类似的说法都是成立的。
{"title":"A Refinement of the McCreight-Meyer Union Theorem","authors":"Matthew Fox, Chaitanya Karamchedu","doi":"arxiv-2406.08600","DOIUrl":"https://doi.org/arxiv-2406.08600","url":null,"abstract":"Using properties of Blum complexity measures and certain complexity class\u0000operators, we exhibit a total computable and non-decreasing function\u0000$t_{mathsf{poly}}$ such that for all $k$, $Sigma_kmathsf{P} =\u0000Sigma_kmathsf{TIME}(t_{mathsf{poly}})$, $mathsf{BPP} =\u0000mathsf{BPTIME}(t_{mathsf{poly}})$, $mathsf{RP} =\u0000mathsf{RTIME}(t_{mathsf{poly}})$, $mathsf{UP} =\u0000mathsf{UTIME}(t_{mathsf{poly}})$, $mathsf{PP} =\u0000mathsf{PTIME}(t_{mathsf{poly}})$, $mathsf{Mod}_kmathsf{P} =\u0000mathsf{Mod}_kmathsf{TIME}(t_{mathsf{poly}})$, $mathsf{PSPACE} =\u0000mathsf{DSPACE}(t_{mathsf{poly}})$, and so forth. A similar statement holds\u0000for any collection of language classes, provided that each class is definable\u0000by applying a certain complexity class operator to some Blum complexity class.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141518331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering PSMC:基于动机传导的图形聚类的可证明和可扩展算法
Pub Date : 2024-06-11 DOI: arxiv-2406.07357
Longlong Lin, Tao Jia, Zeli Wang, Jin Zhao, Rong-Hua Li
Higher-order graph clustering aims to partition the graph using frequentlyoccurring subgraphs. Motif conductance is one of the most promisinghigher-order graph clustering models due to its strong interpretability.However, existing motif conductance based graph clustering algorithms aremainly limited by a seminal two-stage reweighting computing framework, needingto enumerate all motif instances to obtain an edge-weighted graph forpartitioning. However, such a framework has two-fold vital defects: (1) It canonly provide a quadratic bound for the motif with three vertices, and whetherthere is provable clustering quality for other motifs is still an openquestion. (2) The enumeration procedure of motif instances incurs prohibitivelyhigh costs against large motifs or large dense graphs due to combinatorialexplosions. Besides, expensive spectral clustering or local graph diffusion onthe edge-weighted graph also makes existing methods unable to handle massivegraphs with millions of nodes. To overcome these dilemmas, we propose aProvable and Scalable Motif Conductance algorithm PSMC, which has a fixed andmotif-independent approximation ratio for any motif. Specifically, PSMC firstdefines a new vertex metric Motif Resident based on the given motif, which canbe computed locally. Then, it iteratively deletes the vertex with the smallestmotif resident value very efficiently using novel dynamic update technologies.Finally, it outputs the locally optimal result during the above iterativeprocess. To further boost efficiency, we propose several effective bounds toestimate the motif resident value of each vertex, which can greatly reducecomputational costs. Empirical results show that our proposed algorithmsachieve 3.2-32 times speedup and improve the quality by at least 12 times thanthe baselines.
高阶图聚类旨在利用经常出现的子图对图进行划分。然而,现有的基于图案传导的图聚类算法主要受限于一个开创性的两阶段加权计算框架,即需要枚举所有图案实例以获得一个边加权图来进行划分。然而,这样的框架有两个重要缺陷:(1)它只能为有三个顶点的图案提供二次约束,而对于其他图案是否有可证明的聚类质量仍是一个未决问题。(2)由于组合爆炸,对大型图案或大型密集图来说,图案实例的枚举过程会产生过高的成本。此外,在边缘加权图上进行昂贵的谱聚类或局部图扩散也使得现有方法无法处理数百万节点的大型图。为了克服这些难题,我们提出了一种可预测和可扩展的图案传导算法 PSMC,它对任何图案都有固定的、与图案无关的近似率。具体来说,PSMC 首先根据给定的图案定义一个新的顶点度量 Motif Resident,该度量可在本地计算。然后,它利用新颖的动态更新技术,非常高效地迭代删除 Motif Resident 值最小的顶点。最后,它输出上述迭代过程中的局部最优结果。为了进一步提高效率,我们提出了几个有效的边界来估计每个顶点的图案驻留值,这可以大大降低计算成本。实证结果表明,我们提出的算法速度提高了 3.2-32 倍,质量提高了至少 12 倍。
{"title":"PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering","authors":"Longlong Lin, Tao Jia, Zeli Wang, Jin Zhao, Rong-Hua Li","doi":"arxiv-2406.07357","DOIUrl":"https://doi.org/arxiv-2406.07357","url":null,"abstract":"Higher-order graph clustering aims to partition the graph using frequently\u0000occurring subgraphs. Motif conductance is one of the most promising\u0000higher-order graph clustering models due to its strong interpretability.\u0000However, existing motif conductance based graph clustering algorithms are\u0000mainly limited by a seminal two-stage reweighting computing framework, needing\u0000to enumerate all motif instances to obtain an edge-weighted graph for\u0000partitioning. However, such a framework has two-fold vital defects: (1) It can\u0000only provide a quadratic bound for the motif with three vertices, and whether\u0000there is provable clustering quality for other motifs is still an open\u0000question. (2) The enumeration procedure of motif instances incurs prohibitively\u0000high costs against large motifs or large dense graphs due to combinatorial\u0000explosions. Besides, expensive spectral clustering or local graph diffusion on\u0000the edge-weighted graph also makes existing methods unable to handle massive\u0000graphs with millions of nodes. To overcome these dilemmas, we propose a\u0000Provable and Scalable Motif Conductance algorithm PSMC, which has a fixed and\u0000motif-independent approximation ratio for any motif. Specifically, PSMC first\u0000defines a new vertex metric Motif Resident based on the given motif, which can\u0000be computed locally. Then, it iteratively deletes the vertex with the smallest\u0000motif resident value very efficiently using novel dynamic update technologies.\u0000Finally, it outputs the locally optimal result during the above iterative\u0000process. To further boost efficiency, we propose several effective bounds to\u0000estimate the motif resident value of each vertex, which can greatly reduce\u0000computational costs. Empirical results show that our proposed algorithms\u0000achieve 3.2-32 times speedup and improve the quality by at least 12 times than\u0000the baselines.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141507164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Completeness classes in algebraic complexity theory 代数复杂性理论中的完备性类
Pub Date : 2024-06-10 DOI: arxiv-2406.06217
Peter Bürgisser
The purpose of this overview is to explain the enormous impact of LesValiant's eponymous short conference contribution from 1979 on the developmentof algebraic complexity.
本综述旨在解释莱斯-瓦里昂特 1979 年发表的同名短篇会议论文对代数复杂性发展的巨大影响。
{"title":"Completeness classes in algebraic complexity theory","authors":"Peter Bürgisser","doi":"arxiv-2406.06217","DOIUrl":"https://doi.org/arxiv-2406.06217","url":null,"abstract":"The purpose of this overview is to explain the enormous impact of Les\u0000Valiant's eponymous short conference contribution from 1979 on the development\u0000of algebraic complexity.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141518333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Kolmogorov Structure Functions 关于柯尔莫哥洛夫结构函数
Pub Date : 2024-06-09 DOI: arxiv-2406.05903
Samuel Epstein
All strings with low mutual information with the halting sequence will haveflat Kolmogorov Structure Functions, in the context of Algorithmic Statistics.Assuming the Independence Postulate, strings with non-negligible informationwith the halting sequence are purely mathematical constructions, and cannot befound in nature. Thus Algorithmic Statistics does not study strings in thephysical world. This leads to the general thesis that two part codes requirelimitations as shown in the Minimum Description Length Principle. We alsodiscuss issues with set-restricted Kolmogorov Structure Functions.
在《算法统计》中,所有与停止序列互信息较低的字符串都具有平坦的科尔莫哥洛夫结构函数。假定独立公设成立,与停止序列互信息不可忽略的字符串都是纯数学构造,不可能在自然界中找到。因此,《算法统计》并不研究物理世界中的字符串。这就引出了一个一般性的论点,即两部分代码需要限制,正如最小描述长度原则所显示的那样。我们还讨论了限制集合的柯尔莫哥洛夫结构函数的问题。
{"title":"On Kolmogorov Structure Functions","authors":"Samuel Epstein","doi":"arxiv-2406.05903","DOIUrl":"https://doi.org/arxiv-2406.05903","url":null,"abstract":"All strings with low mutual information with the halting sequence will have\u0000flat Kolmogorov Structure Functions, in the context of Algorithmic Statistics.\u0000Assuming the Independence Postulate, strings with non-negligible information\u0000with the halting sequence are purely mathematical constructions, and cannot be\u0000found in nature. Thus Algorithmic Statistics does not study strings in the\u0000physical world. This leads to the general thesis that two part codes require\u0000limitations as shown in the Minimum Description Length Principle. We also\u0000discuss issues with set-restricted Kolmogorov Structure Functions.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141518332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints 行动推理平台(ActionReasoningBench):推理有拉姆约束和无拉姆约束的行动
Pub Date : 2024-06-06 DOI: arxiv-2406.04046
Divij Handa, Pavel Dolin, Shrinidhi Kumbhar, Chitta Baral, Tran Cao Son
Reasoning about actions and change (RAC) has historically driven thedevelopment of many early AI challenges, such as the frame problem, and many AIdisciplines, including non-monotonic and commonsense reasoning. The role of RACremains important even now, particularly for tasks involving dynamicenvironments, interactive scenarios, and commonsense reasoning. Despite theprogress of Large Language Models (LLMs) in various AI domains, theirperformance on RAC is underexplored. To address this gap, we introduce a newbenchmark, ActionReasoningBench, encompassing 13 domains and rigorouslyevaluating LLMs across eight different areas of RAC. These include - ObjectTracking, Fluent Tracking, State Tracking, Action Executability, Effects ofActions, Numerical RAC, Hallucination Detection, and Composite Questions.Furthermore, we also investigate the indirect effect of actions due toramification constraints for every domain. Finally, we evaluate our benchmarkusing open-sourced and commercial state-of-the-art LLMs, including GPT-4o,Gemini-1.0-Pro, Llama2-7b-chat, Llama2-13b-chat, Llama3-8b-instruct,Gemma-2b-instruct, and Gemma-7b-instruct. Our findings indicate that thesemodels face significant challenges across all categories included in ourbenchmark.
行动与变化推理(RAC)在历史上推动了许多早期人工智能挑战(如框架问题)和许多人工智能学科(包括非单调推理和常识推理)的发展。即使是现在,RAC 的作用依然重要,尤其是在涉及动态环境、交互场景和常识推理的任务中。尽管大语言模型(LLM)在各种人工智能领域取得了进展,但它们在 RAC 上的表现却未得到充分探索。为了填补这一空白,我们引入了一个新的基准--行动推理基准(ActionReasoningBench),涵盖 13 个领域,并对 RAC 的 8 个不同领域中的 LLM 进行了严格评估。这些领域包括:对象跟踪(ObjectTracking)、流体跟踪(Fluent Tracking)、状态跟踪(State Tracking)、动作可执行性(Action Executability)、动作效果(Effects ofActions)、数字 RAC、幻觉检测(Hallucination Detection)和复合问题(Composite Questions)。最后,我们使用开源和商业的最先进 LLM(包括 GPT-4o、Gemini-1.0-Pro、Llama2-7b-chat、Llama2-13b-chat、Llama3-8b-instruct、Gemma-2b-instruct 和 Gemma-7b-instruct)评估了我们的基准。我们的研究结果表明,这些模型在基准测试的所有类别中都面临重大挑战。
{"title":"ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints","authors":"Divij Handa, Pavel Dolin, Shrinidhi Kumbhar, Chitta Baral, Tran Cao Son","doi":"arxiv-2406.04046","DOIUrl":"https://doi.org/arxiv-2406.04046","url":null,"abstract":"Reasoning about actions and change (RAC) has historically driven the\u0000development of many early AI challenges, such as the frame problem, and many AI\u0000disciplines, including non-monotonic and commonsense reasoning. The role of RAC\u0000remains important even now, particularly for tasks involving dynamic\u0000environments, interactive scenarios, and commonsense reasoning. Despite the\u0000progress of Large Language Models (LLMs) in various AI domains, their\u0000performance on RAC is underexplored. To address this gap, we introduce a new\u0000benchmark, ActionReasoningBench, encompassing 13 domains and rigorously\u0000evaluating LLMs across eight different areas of RAC. These include - Object\u0000Tracking, Fluent Tracking, State Tracking, Action Executability, Effects of\u0000Actions, Numerical RAC, Hallucination Detection, and Composite Questions.\u0000Furthermore, we also investigate the indirect effect of actions due to\u0000ramification constraints for every domain. Finally, we evaluate our benchmark\u0000using open-sourced and commercial state-of-the-art LLMs, including GPT-4o,\u0000Gemini-1.0-Pro, Llama2-7b-chat, Llama2-13b-chat, Llama3-8b-instruct,\u0000Gemma-2b-instruct, and Gemma-7b-instruct. Our findings indicate that these\u0000models face significant challenges across all categories included in our\u0000benchmark.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141546382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proportionally dense subgraphs of maximum size in degree-constrained graphs 度受限图中最大尺寸的比例致密子图
Pub Date : 2024-05-31 DOI: arxiv-2405.20847
Narmina Baghirova, Antoine Castillon
A proportionally dense subgraph (PDS) of a graph is an induced subgraph ofsize at least two such that every vertex in the subgraph has proportionally asmany neighbors inside as outside of the subgraph. Then, maxPDS is the problemof determining a PDS of maximum size in a given graph. If we further requirethat a PDS induces a connected subgraph, we refer to such problem as connectedmaxPDS. In this paper, we study the complexity of maxPDS with respect toparameters representing the density of a graph and its complement. We consider$Delta$, representing the maximum degree, $h$, representing the $h$-index, anddegen, representing the degeneracy of a graph. We show that maxPDS is NP-hardparameterized by $Delta,h$ and degen. More specifically, we show that maxPDSis NP-hard on graphs with $Delta=4$, $h=4$ and degen=2. Then, we show thatmaxPDS is NP-hard when restricted to dense graphs, more specifically graphs $G$such that $Delta(overline{G})leq 6$, and graphs $G$ such that$degen(overline{G}) leq 2$ and $overline{G}$ is bipartite, where$overline{G}$ represents the complement of $G$. On the other hand, we showthat maxPDS is polynomial-time solvable on graphs with $hle2$. Finally, weconsider graphs $G$ such that $h(overline{G})le 2$ and show that there existsa polynomial-time algorithm for finding a PDS of maximum size in such graphs.This result implies polynomial-time complexity on graphs with $n$ vertices ofminimum degree $n-3$, i.e. graphs $G$ such that $Delta(overline{G})le 2$.For each result presented in this paper, we consider connected maxPDS andexplain how to extend it when we require connectivity.
一个图的比例密集子图(PDS)是一个大小至少为 2 的诱导子图,子图中的每个顶点在子图内和子图外都有比例相同数量的邻居。那么,maxPDS 就是确定给定图中最大尺寸的 PDS 的问题。如果我们进一步要求 PDS 诱导一个连通的子图,我们就把这样的问题称为 connectedmaxPDS。在本文中,我们研究了 maxPDS 在表示图及其补集密度的参数方面的复杂性。我们考虑了代表最大度的 $/Delta$、代表 $h$-index 的 $h$ 和代表图的退化度的 degen。我们证明 maxPDS 是由 $Delta, h$ 和 degen 参数化的 NP-hard。更具体地说,我们证明了在 $Delta=4$、$h=4$ 和 degen=2 的图上,maxPDS 是 NP-hard。然后,我们证明了当局限于密集图,更具体地说,当 $Delta(overline{G})leq 6$,以及当 $degen(overline{G}) leq 2$,并且 $overline{G}$ 是双分部图,其中 $overline{G}$ 代表 $G$ 的补码时,maxPDS 是 NP-hard。另一方面,我们证明了 maxPDS 在具有 $hle2$ 的图上是多项式时间可解的。最后,我们考虑了$G$这样的图,即$h(overline{G})le 2$,并证明存在一种多项式时间算法,可以在这样的图中找到最大尺寸的 PDS。对于本文提出的每个结果,我们都考虑了连通的最大 PDS,并解释了当我们需要连通性时如何扩展它。
{"title":"Proportionally dense subgraphs of maximum size in degree-constrained graphs","authors":"Narmina Baghirova, Antoine Castillon","doi":"arxiv-2405.20847","DOIUrl":"https://doi.org/arxiv-2405.20847","url":null,"abstract":"A proportionally dense subgraph (PDS) of a graph is an induced subgraph of\u0000size at least two such that every vertex in the subgraph has proportionally as\u0000many neighbors inside as outside of the subgraph. Then, maxPDS is the problem\u0000of determining a PDS of maximum size in a given graph. If we further require\u0000that a PDS induces a connected subgraph, we refer to such problem as connected\u0000maxPDS. In this paper, we study the complexity of maxPDS with respect to\u0000parameters representing the density of a graph and its complement. We consider\u0000$Delta$, representing the maximum degree, $h$, representing the $h$-index, and\u0000degen, representing the degeneracy of a graph. We show that maxPDS is NP-hard\u0000parameterized by $Delta,h$ and degen. More specifically, we show that maxPDS\u0000is NP-hard on graphs with $Delta=4$, $h=4$ and degen=2. Then, we show that\u0000maxPDS is NP-hard when restricted to dense graphs, more specifically graphs $G$\u0000such that $Delta(overline{G})leq 6$, and graphs $G$ such that\u0000$degen(overline{G}) leq 2$ and $overline{G}$ is bipartite, where\u0000$overline{G}$ represents the complement of $G$. On the other hand, we show\u0000that maxPDS is polynomial-time solvable on graphs with $hle2$. Finally, we\u0000consider graphs $G$ such that $h(overline{G})le 2$ and show that there exists\u0000a polynomial-time algorithm for finding a PDS of maximum size in such graphs.\u0000This result implies polynomial-time complexity on graphs with $n$ vertices of\u0000minimum degree $n-3$, i.e. graphs $G$ such that $Delta(overline{G})le 2$.\u0000For each result presented in this paper, we consider connected maxPDS and\u0000explain how to extend it when we require connectivity.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141257054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A linear bound for the size of the finite terminal assembly of a directed non-cooperative tile assembly system 有向非合作瓦片组装系统有限终端组装规模的线性约束
Pub Date : 2024-05-28 DOI: arxiv-2405.18630
Sergiu Ivanov, Damien Regnault
The abstract tile assembly model (aTam) is a model of DNA self-assembly. Mostof the studies focus on cooperative aTam where a form of synchronizationbetween the tiles is possible. Simulating Turing machines is achievable in thiscontext. Few results and constructions are known for the non-cooperative case(a variant of Wang tilings where assemblies do not need to cover the wholeplane and some mismatches may occur). Introduced by P.E. Meunier and D. Regnault, efficient paths are a non-trivialconstruction for non-cooperative aTam. These paths of width nlog(n) aredesigned with n different tile types. Assembling them relies heavily on a formof ``non-determinism''. Indeed, the set of tiles may produced different finiteterminal assemblies but they all contain the same efficient path. Directednon-cooperative aTam does not allow this non-determinism as only one assemblymay be produced by a tile assembly system. This variant of aTam is the only onewho was shown to be decidable. In this paper, we show that if the terminal assembly of a directednon-cooperative tile assembly system is finite then its width and length are oflinear size according to the size of the tile assembly system. This resultimplies that the construction of efficient paths cannot be generalized to thedirected case and that some computation must rely on a competition betweendifferent paths. It also implies that the construction of a square of width nusing 2n-1 tiles types is asymptotically optimal. Moreover, we hope that thetechniques introduced here will lead to a better comprehension of thenon-directed case.
抽象瓦片组装模型(aTam)是一种 DNA 自组装模型。大多数研究都集中在合作型 aTam 上,在这种模型中,瓦片之间可能存在某种形式的同步。在这种情况下,模拟图灵机是可以实现的。对于非合作的情况(Wang tilings 的一种变体,在这种情况下,组装不需要覆盖整个平面,也可能出现一些错配),已知的结果和构造很少。由 P.E. Meunier 和 D. Regnault 提出的高效路径是非合作平面的一种非难构造。这些宽度为 nlog(n) 的路径由 n 种不同的瓦片类型设计而成。组装它们在很大程度上依赖于一种 "非确定性"。事实上,瓦片集合可能产生不同的有限确定性组合,但它们都包含相同的有效路径。有向非合作 aTam 不允许这种非确定性,因为瓦片装配系统只能产生一个装配体。这种 aTam 变体是唯一被证明是可解的。在本文中,我们证明了如果有向非合作瓦片装配系统的终端装配是有限的,那么它的宽度和长度与瓦片装配系统的大小成线性关系。这一结果表明,高效路径的构造不能推广到有向情况,某些计算必须依赖于不同路径之间的竞争。这也意味着,用 2n-1 种瓦片构建宽度为 n 的正方形是渐进最优的。此外,我们希望这里介绍的技术能让我们更好地理解非定向情况。
{"title":"A linear bound for the size of the finite terminal assembly of a directed non-cooperative tile assembly system","authors":"Sergiu Ivanov, Damien Regnault","doi":"arxiv-2405.18630","DOIUrl":"https://doi.org/arxiv-2405.18630","url":null,"abstract":"The abstract tile assembly model (aTam) is a model of DNA self-assembly. Most\u0000of the studies focus on cooperative aTam where a form of synchronization\u0000between the tiles is possible. Simulating Turing machines is achievable in this\u0000context. Few results and constructions are known for the non-cooperative case\u0000(a variant of Wang tilings where assemblies do not need to cover the whole\u0000plane and some mismatches may occur). Introduced by P.E. Meunier and D. Regnault, efficient paths are a non-trivial\u0000construction for non-cooperative aTam. These paths of width nlog(n) are\u0000designed with n different tile types. Assembling them relies heavily on a form\u0000of ``non-determinism''. Indeed, the set of tiles may produced different finite\u0000terminal assemblies but they all contain the same efficient path. Directed\u0000non-cooperative aTam does not allow this non-determinism as only one assembly\u0000may be produced by a tile assembly system. This variant of aTam is the only one\u0000who was shown to be decidable. In this paper, we show that if the terminal assembly of a directed\u0000non-cooperative tile assembly system is finite then its width and length are of\u0000linear size according to the size of the tile assembly system. This result\u0000implies that the construction of efficient paths cannot be generalized to the\u0000directed case and that some computation must rely on a competition between\u0000different paths. It also implies that the construction of a square of width n\u0000using 2n-1 tiles types is asymptotically optimal. Moreover, we hope that the\u0000techniques introduced here will lead to a better comprehension of the\u0000non-directed case.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141196664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Computational Complexity
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1