首页 > 最新文献

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence最新文献

英文 中文
Second-Order Quantified Boolean Logic 二阶量化布尔逻辑
J. H. Jiang
Second-order quantified Boolean formulas (SOQBFs) generalize quantified Boolean formulas (QBFs) by admitting second-order quantifiers on function variables in addition to first-order quantifiers on atomic variables. Recent endeavors establish that the complexity of SOQBF satisfiability corresponds to the exponential-time hierarchy (EXPH), similar to that of QBF satisfiability corresponding to the polynomial-time hierarchy (PH). This fact reveals the succinct expression power of SOQBFs in encoding decision problems not efficiently doable by QBFs. In this paper, we investigate the second-order quantified Boolean logic with the following main results: First, we present a procedure of quantifier elimination converting SOQBFs to QBFs and a game interpretation of SOQBF semantics. Second, we devise a sound and complete refutation-proof system for SOQBF. Third, we develop an algorithm for countermodel extraction from a refutation proof. Finally, we show potential applications of SOQBFs in system design and multi-agent planning. With these advances, we anticipate practical tools for development.
二阶量化布尔公式(SOQBFs)是对量化布尔公式(QBFs)的推广,它在原子变量上除了允许一阶量词外,还允许函数变量上的二阶量词。最近的研究建立了SOQBF可满足性的复杂性对应于指数时间层次(EXPH),类似于QBF可满足性对应于多项式时间层次(PH)的复杂性。这一事实揭示了SOQBFs在编码QBFs无法有效解决的决策问题时简洁的表达能力。本文研究了二阶量化布尔逻辑,得到了以下主要结果:首先,我们给出了一个量词消去的过程,将SOQBF转换为qbf,并给出了SOQBF语义的博弈解释。其次,设计了完善的SOQBF防反驳系统。第三,我们开发了一种从反驳证明中提取反模型的算法。最后,我们展示了SOQBFs在系统设计和多智能体规划中的潜在应用。有了这些进展,我们期望有实用的发展工具。
{"title":"Second-Order Quantified Boolean Logic","authors":"J. H. Jiang","doi":"10.1609/aaai.v37i4.25515","DOIUrl":"https://doi.org/10.1609/aaai.v37i4.25515","url":null,"abstract":"Second-order quantified Boolean formulas (SOQBFs) generalize quantified Boolean formulas (QBFs) by admitting second-order quantifiers on function variables in addition to first-order quantifiers on atomic variables. Recent endeavors establish that the complexity of SOQBF satisfiability corresponds to the exponential-time hierarchy (EXPH), similar to that of QBF satisfiability corresponding to the polynomial-time hierarchy (PH). This fact reveals the succinct expression power of SOQBFs in encoding decision problems not efficiently doable by QBFs. In this paper, we investigate the second-order quantified Boolean logic with the following main results: First, we present a procedure of quantifier elimination converting SOQBFs to QBFs and a game interpretation of SOQBF semantics. Second, we devise a sound and complete refutation-proof system for SOQBF. Third, we develop an algorithm for countermodel extraction from a refutation proof. Finally, we show potential applications of SOQBFs in system design and multi-agent planning. With these advances, we anticipate practical tools for development.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"47 1","pages":"4007-4015"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84300241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MaskBooster: End-to-End Self-Training for Sparsely Supervised Instance Segmentation MaskBooster:稀疏监督实例分割的端到端自我训练
Shida Zheng, Chenshu Chen, Xi Yang, Wenming Tan
The present paper introduces sparsely supervised instance segmentation, with the datasets being fully annotated bounding boxes and sparsely annotated masks. A direct solution to this task is self-training, which is not fully explored for instance segmentation yet. In this paper, we propose MaskBooster for sparsely supervised instance segmentation (SpSIS) with comprehensive usage of pseudo masks. MaskBooster is featured with (1) dynamic and progressive pseudo masks from an online updating teacher model, (2) refining binary pseudo masks with the help of bounding box prior, (3) learning inter-class prediction distribution via knowledge distillation for soft pseudo masks. As an end-to-end and universal self-training framework, MaskBooster can empower fully supervised algorithms and boost their segmentation performance on SpSIS. Abundant experiments are conducted on COCO and BDD100K datasets and validate the effectiveness of MaskBooster. Specifically, on different COCO protocols and BDD100K, we surpass sparsely supervised baseline by a large margin for both Mask RCNN and ShapeProp. MaskBooster on SpSIS also outperforms weakly and semi-supervised instance segmentation state-of-the-art on the datasets with similar annotation budgets.
本文引入了稀疏监督实例分割,数据集是完全带注释的边界框和稀疏带注释的掩码。这个任务的一个直接解决方案是自我训练,这在实例分割方面还没有得到充分的探索。在本文中,我们提出了MaskBooster用于稀疏监督实例分割(SpSIS),并综合使用伪掩码。MaskBooster的特点是:(1)在线更新教师模型的动态渐进式伪掩码,(2)借助边界盒先验优化二进制伪掩码,(3)通过知识升华学习软伪掩码的类间预测分布。作为一个端到端和通用的自我训练框架,MaskBooster可以授权完全监督算法,并提高其在SpSIS上的分割性能。在COCO和BDD100K数据集上进行了大量的实验,验证了MaskBooster的有效性。具体来说,在不同的COCO协议和BDD100K上,我们在Mask RCNN和ShapeProp上都大大超过了稀疏监督的基线。在具有类似注释预算的数据集上,SpSIS上的MaskBooster也优于弱监督和半监督实例分割技术。
{"title":"MaskBooster: End-to-End Self-Training for Sparsely Supervised Instance Segmentation","authors":"Shida Zheng, Chenshu Chen, Xi Yang, Wenming Tan","doi":"10.1609/aaai.v37i3.25481","DOIUrl":"https://doi.org/10.1609/aaai.v37i3.25481","url":null,"abstract":"The present paper introduces sparsely supervised instance segmentation, with the datasets being fully annotated bounding boxes and sparsely annotated masks. A direct solution to this task is self-training, which is not fully explored for instance segmentation yet. In this paper, we propose MaskBooster for sparsely supervised instance segmentation (SpSIS) with comprehensive usage of pseudo masks. MaskBooster is featured with (1) dynamic and progressive pseudo masks from an online updating teacher model, (2) refining binary pseudo masks with the help of bounding box prior, (3) learning inter-class prediction distribution via knowledge distillation for soft pseudo masks. As an end-to-end and universal self-training framework, MaskBooster can empower fully supervised algorithms and boost their segmentation performance on SpSIS. Abundant experiments are conducted on COCO and BDD100K datasets and validate the effectiveness of MaskBooster. Specifically, on different COCO protocols and BDD100K, we surpass sparsely supervised baseline by a large margin for both Mask RCNN and ShapeProp. MaskBooster on SpSIS also outperforms weakly and semi-supervised instance segmentation state-of-the-art on the datasets with similar annotation budgets.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"10 1","pages":"3696-3704"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84328246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Embeddings of Logical Variables for Query Answering over Incomplete Knowledge Graphs 不完全知识图查询回答中逻辑变量的高效嵌入
Dingmin Wang, Yeyuan Chen, B. C. Grau
The problem of answering complex First-order Logic queries over incomplete knowledge graphs is receiving growing attention in the literature. A promising recent approach to this problem has been to exploit neural link predictors, which can be effective in identifying individual missing triples in the incomplete graph, in order to efficiently answer complex queries. A crucial advantage of this approach over other methods is that it does not require example answers to complex queries for training, as it relies only on the availability of a trained link predictor for the knowledge graph at hand. This approach, however, can be computationally expensive during inference, and cannot deal with queries involving negation. In this paper, we propose a novel approach that addresses all of these limitations. Experiments on established benchmark datasets demonstrate that our approach offers superior performance while significantly reducing inference times.
在不完全知识图上回答复杂一阶逻辑查询的问题越来越受到文献的关注。最近一种很有前途的方法是利用神经链接预测器,它可以有效地识别不完全图中单个缺失的三元组,以便有效地回答复杂的查询。与其他方法相比,这种方法的一个关键优势是,它不需要对复杂的查询进行示例回答来进行训练,因为它只依赖于手边知识图的训练链接预测器的可用性。然而,这种方法在推理过程中计算开销很大,并且不能处理涉及否定的查询。在本文中,我们提出了一种解决所有这些限制的新方法。在已建立的基准数据集上的实验表明,我们的方法在显著减少推理时间的同时提供了卓越的性能。
{"title":"Efficient Embeddings of Logical Variables for Query Answering over Incomplete Knowledge Graphs","authors":"Dingmin Wang, Yeyuan Chen, B. C. Grau","doi":"10.1609/aaai.v37i4.25588","DOIUrl":"https://doi.org/10.1609/aaai.v37i4.25588","url":null,"abstract":"The problem of answering complex First-order Logic queries over incomplete knowledge graphs is receiving growing attention in the literature. A promising recent approach to this problem has been to exploit neural link predictors, which can be effective in identifying individual missing triples in the incomplete graph, in order to efficiently answer complex queries. A crucial advantage of this approach over other methods is that it does not require example answers to complex queries for training, as it relies only on the availability of a trained link predictor for the knowledge graph at hand. This approach, however, can be computationally expensive during inference, and cannot deal with queries involving negation. \u0000\u0000In this paper, we propose a novel approach that addresses all of these limitations. Experiments on established benchmark datasets demonstrate that our approach offers superior performance while significantly reducing inference times.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"195 2 1","pages":"4652-4659"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85603025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Fast Fluid Simulation via Dynamic Multi-Scale Gridding 基于动态多尺度网格的快速流体仿真
Jinxian Liu, Ye Chen, Bingbing Ni, Wei Ren, Zhenbo Yu, Xiaoyang Huang
Recent works on learning-based frameworks for Lagrangian (i.e., particle-based) fluid simulation, though bypassing iterative pressure projection via efficient convolution operators, are still time-consuming due to excessive amount of particles. To address this challenge, we propose a dynamic multi-scale gridding method to reduce the magnitude of elements that have to be processed, by observing repeated particle motion patterns within certain consistent regions. Specifically, we hierarchically generate multi-scale micelles in Euclidean space by grouping particles that share similar motion patterns/characteristics based on super-light motion and scale estimation modules. With little internal motion variation, each micelle is modeled as a single rigid body with convolution only applied to a single representative particle. In addition, a distance-based interpolation is conducted to propagate relative motion message among micelles. With our efficient design, the network produces high visual fidelity fluid simulations with the inference time to be only 4.24 ms/frame (with 6K fluid particles), hence enables real-time human-computer interaction and animation. Experimental results on multiple datasets show that our work achieves great simulation acceleration with negligible prediction error increase.
最近关于拉格朗日(即基于粒子)流体模拟的基于学习框架的工作,尽管通过有效的卷积算子绕过了迭代压力投影,但由于粒子数量过多,仍然很耗时。为了解决这一挑战,我们提出了一种动态多尺度网格化方法,通过观察某些一致区域内重复的粒子运动模式来减少必须处理的元素的大小。具体来说,我们基于超轻运动和尺度估计模块,通过将具有相似运动模式/特征的粒子分组,在欧几里得空间中分层生成多尺度胶束。由于内部运动变化很小,每个胶束被建模为单个刚体,卷积仅应用于单个代表性粒子。此外,采用基于距离的插值方法在胶束间传播相对运动信息。通过我们高效的设计,网络产生高视觉保真度的流体模拟,推理时间仅为4.24 ms/帧(6K流体粒子),从而实现实时人机交互和动画。在多个数据集上的实验结果表明,我们的工作在预测误差增加可以忽略不计的情况下实现了很大的仿真加速。
{"title":"Fast Fluid Simulation via Dynamic Multi-Scale Gridding","authors":"Jinxian Liu, Ye Chen, Bingbing Ni, Wei Ren, Zhenbo Yu, Xiaoyang Huang","doi":"10.1609/aaai.v37i2.25255","DOIUrl":"https://doi.org/10.1609/aaai.v37i2.25255","url":null,"abstract":"Recent works on learning-based frameworks for Lagrangian (i.e., particle-based) fluid simulation, though bypassing iterative pressure projection via efficient convolution operators, are still time-consuming due to excessive amount of particles. To address this challenge, we propose a dynamic multi-scale gridding method to reduce the magnitude of elements that have to be processed, by observing repeated particle motion patterns within certain consistent regions. Specifically, we hierarchically generate multi-scale micelles in Euclidean space by grouping particles that share similar motion patterns/characteristics based on super-light motion and scale estimation modules. With little internal motion variation, each micelle is modeled as a single rigid body with convolution only applied to a single representative particle. In addition, a distance-based interpolation is conducted to propagate relative motion message among micelles. With our efficient design, the network produces high visual fidelity fluid simulations with the inference time to be only 4.24 ms/frame (with 6K fluid particles), hence enables real-time human-computer interaction and animation. Experimental results on multiple datasets show that our work achieves great simulation acceleration with negligible prediction error increase.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"63 1","pages":"1675-1682"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85630739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Fine-Grained Explainability for Heterogeneous Graph Neural Network 面向异构图神经网络的细粒度可解释性
Tong Li, Jiale Deng, Yanyan Shen, Luyu Qiu, Hu Yongxiang, Caleb Chen Cao
Heterogeneous graph neural networks (HGNs) are prominent approaches to node classification tasks on heterogeneous graphs. Despite the superior performance, insights about the predictions made from HGNs are obscure to humans. Existing explainability techniques are mainly proposed for GNNs on homogeneous graphs. They focus on highlighting salient graph objects to the predictions whereas the problem of how these objects affect the predictions remains unsolved. Given heterogeneous graphs with complex structures and rich semantics, it is imperative that salient objects can be accompanied with their influence paths to the predictions, unveiling the reasoning process of HGNs. In this paper, we develop xPath, a new framework that provides fine-grained explanations for black-box HGNs specifying a cause node with its influence path to the target node. In xPath, we differentiate the influence of a node on the prediction w.r.t. every individual influence path, and measure the influence by perturbing graph structure via a novel graph rewiring algorithm. Furthermore, we introduce a greedy search algorithm to find the most influential fine-grained explanations efficiently. Empirical results on various HGNs and heterogeneous graphs show that xPath yields faithful explanations efficiently, outperforming the adaptations of advanced GNN explanation approaches.
异构图神经网络(HGNs)是解决异构图节点分类问题的重要方法。尽管表现优异,但人类对hgn所做预测的见解却很模糊。现有的可解释性技术主要针对齐次图上的gnn提出。他们专注于突出突出的图形对象来预测,而这些对象如何影响预测的问题仍然没有解决。对于具有复杂结构和丰富语义的异构图,重要对象及其对预测的影响路径至关重要,从而揭示hgn的推理过程。在本文中,我们开发了xPath,这是一个新的框架,它为指定原因节点及其到目标节点的影响路径的黑箱hgn提供细粒度解释。在xPath中,我们区分节点对预测的影响与每个单独的影响路径,并通过一种新的图重布线算法通过扰动图结构来测量影响。此外,我们引入了贪婪搜索算法,以有效地找到最具影响力的细粒度解释。对各种GNN和异构图的实证结果表明,xPath有效地产生忠实的解释,优于高级GNN解释方法的适应性。
{"title":"Towards Fine-Grained Explainability for Heterogeneous Graph Neural Network","authors":"Tong Li, Jiale Deng, Yanyan Shen, Luyu Qiu, Hu Yongxiang, Caleb Chen Cao","doi":"10.1609/aaai.v37i7.26040","DOIUrl":"https://doi.org/10.1609/aaai.v37i7.26040","url":null,"abstract":"Heterogeneous graph neural networks (HGNs) are prominent approaches to node classification tasks on heterogeneous graphs. Despite the superior performance, insights about the predictions made from HGNs are obscure to humans. Existing explainability techniques are mainly proposed for GNNs on homogeneous graphs. They focus on highlighting salient graph objects to the predictions whereas the problem of how these objects affect the predictions remains unsolved. Given heterogeneous graphs with complex structures and rich semantics, it is imperative that salient objects can be accompanied with their influence paths to the predictions, unveiling the reasoning process of HGNs. In this paper, we develop xPath, a new framework that provides fine-grained explanations for black-box HGNs specifying a cause node with its influence path to the target node. In xPath, we differentiate the influence of a node on the prediction w.r.t. every individual influence path, and measure the influence by perturbing graph structure via a novel graph rewiring algorithm. Furthermore, we introduce a greedy search algorithm to find the most influential fine-grained explanations efficiently. Empirical results on various HGNs and heterogeneous graphs show that xPath yields faithful explanations efficiently, outperforming the adaptations of advanced GNN explanation approaches.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"16 1","pages":"8640-8647"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85875455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Social Relation Reasoning Based on Triangular Constraints 基于三角约束的社会关系推理
Yunfei Guo, Fei Yin, Wei Feng, Xudong Yan, Tao Xue, Shuqi Mei, Chengxiao Liu
Social networks are essentially in a graph structure where persons act as nodes and the edges connecting nodes denote social relations. The prediction of social relations, therefore, relies on the context in graphs to model the higher-order constraints among relations, which has not been exploited sufficiently by previous works, however. In this paper, we formulate the paradigm of the higher-order constraints in social relations into triangular relational closed-loop structures, i.e., triangular constraints, and further introduce the triangular reasoning graph attention network (TRGAT). Our TRGAT employs the attention mechanism to aggregate features with triangular constraints in the graph, thereby exploiting the higher-order context to reason social relations iteratively. Besides, to acquire better feature representations of persons, we introduce node contrastive learning into relation reasoning. Experimental results show that our method outperforms existing approaches significantly, with higher accuracy and better consistency in generating social relation graphs.
社交网络本质上是一个图形结构,其中人充当节点,连接节点的边表示社会关系。因此,社会关系的预测依赖于图中的上下文来模拟关系之间的高阶约束,然而,以前的工作并未充分利用这一点。本文将社会关系中的高阶约束范式形式化为三角关系闭环结构,即三角约束,并进一步引入三角推理图注意网络(TRGAT)。我们的TRGAT使用注意机制来聚合图中具有三角形约束的特征,从而利用高阶上下文来迭代地推理社会关系。此外,为了获得更好的人物特征表示,我们在关系推理中引入了节点对比学习。实验结果表明,我们的方法明显优于现有的方法,在生成社会关系图方面具有更高的准确性和更好的一致性。
{"title":"Social Relation Reasoning Based on Triangular Constraints","authors":"Yunfei Guo, Fei Yin, Wei Feng, Xudong Yan, Tao Xue, Shuqi Mei, Chengxiao Liu","doi":"10.1609/aaai.v37i1.25151","DOIUrl":"https://doi.org/10.1609/aaai.v37i1.25151","url":null,"abstract":"Social networks are essentially in a graph structure where persons act as nodes and the edges connecting nodes denote social relations. The prediction of social relations, therefore, relies on the context in graphs to model the higher-order constraints among relations, which has not been exploited sufficiently by previous works, however. In this paper, we formulate the paradigm of the higher-order constraints in social relations into triangular relational closed-loop structures, i.e., triangular constraints, and further introduce the triangular reasoning graph attention network (TRGAT). Our TRGAT employs the attention mechanism to aggregate features with triangular constraints in the graph, thereby exploiting the higher-order context to reason social relations iteratively. Besides, to acquire better feature representations of persons, we introduce node contrastive learning into relation reasoning. Experimental results show that our method outperforms existing approaches significantly, with higher accuracy and better consistency in generating social relation graphs.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"18 1","pages":"737-745"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85945175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-End Pipeline for Trigger Detection on Hit and Track Graphs 端到端管道触发检测命中和轨迹图
Tingting Xuan, Yimin Zhu, Giorgian Borca-Tasciuc, Ming Liu, Yu Sun, Cameron Dean, Y. C. Morales, Z. Shi, Dantong Yu
There has been a surge of interest in applying deep learning in particle and nuclear physics to replace labor-intensive offline data analysis with automated online machine learning tasks. This paper details a novel AI-enabled triggering solution for physics experiments in Relativistic Heavy Ion Collider and future Electron-Ion Collider. The triggering system consists of a comprehensive end-to-end pipeline based on Graph Neural Networks that classifies trigger events versus background events, makes online decisions to retain signal data, and enables efficient data acquisition. The triggering system first starts with the coordinates of pixel hits lit up by passing particles in the detector, applies three stages of event processing (hits clustering, track reconstruction, and trigger detection), and labels all processed events with the binary tag of trigger versus background events. By switching among different objective functions, we train the Graph Neural Networks in the pipeline to solve multiple tasks: the edge-level track reconstruction problem, the edge-level track adjacency matrix prediction, and the graph-level trigger detection problem. We propose a novel method to treat the events as track-graphs instead of hit-graphs. This method focuses on intertrack relations and is driven by underlying physics processing. As a result, it attains a solid performance (around 72% accuracy) for trigger detection and outperforms the baseline method using hit-graphs by 2% higher accuracy.
人们对将深度学习应用于粒子和核物理学,用自动化的在线机器学习任务取代劳动密集型的离线数据分析产生了浓厚的兴趣。本文详细介绍了一种新的人工智能触发方案,用于相对论重离子对撞机和未来的电子-离子对撞机的物理实验。触发系统由一个全面的端到端管道组成,该管道基于图神经网络,可对触发事件与背景事件进行分类,做出在线决策以保留信号数据,并实现高效的数据采集。触发系统首先从检测器中经过的粒子点亮的像素命中坐标开始,应用三个阶段的事件处理(命中聚类、轨迹重建、触发检测),用触发与背景事件的二元标记标记所有处理过的事件。通过在不同目标函数之间切换,我们训练流水线中的图神经网络来解决多个任务:边缘级轨道重建问题、边缘级轨道邻接矩阵预测问题和图级触发检测问题。我们提出了一种新的方法来处理事件的轨迹图,而不是命中图。该方法侧重于轨间关系,并由底层物理处理驱动。因此,它在触发检测方面获得了可靠的性能(大约72%的准确率),并且比使用命中图的基准方法的准确率高出2%。
{"title":"End-to-End Pipeline for Trigger Detection on Hit and Track Graphs","authors":"Tingting Xuan, Yimin Zhu, Giorgian Borca-Tasciuc, Ming Liu, Yu Sun, Cameron Dean, Y. C. Morales, Z. Shi, Dantong Yu","doi":"10.1609/aaai.v37i13.26870","DOIUrl":"https://doi.org/10.1609/aaai.v37i13.26870","url":null,"abstract":"There has been a surge of interest in applying deep learning in particle and nuclear physics to replace labor-intensive offline data analysis with automated online machine learning tasks. This paper details a novel AI-enabled triggering solution for physics experiments in Relativistic Heavy Ion Collider and future Electron-Ion Collider. The triggering system consists of a comprehensive end-to-end pipeline based on Graph Neural Networks that classifies trigger events versus background events, makes online decisions to retain signal data, and enables efficient data acquisition. The triggering system first starts with the coordinates of pixel hits lit up by passing particles in the detector, applies three stages of event processing (hits clustering, track reconstruction, and trigger detection), and labels all processed events with the binary tag of trigger versus background events. By switching among different objective functions, we train the Graph Neural Networks in the pipeline to solve multiple tasks: the edge-level track reconstruction problem, the edge-level track adjacency matrix prediction, and the graph-level trigger detection problem. We propose a novel method to treat the events as track-graphs instead of hit-graphs. This method focuses on intertrack relations and is driven by underlying physics processing. As a result, it attains a solid performance (around 72% accuracy) for trigger detection and outperforms the baseline method using hit-graphs by 2% higher accuracy.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"13 1","pages":"15752-15758"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85974712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Poisoning with Cerberus: Stealthy and Colluded Backdoor Attack against Federated Learning 用Cerberus中毒:对联邦学习的秘密和串通后门攻击
Xiaoting Lyu, Yufei Han, Wen Wang, Jingkai Liu, Bin Wang, Jiqiang Liu, Xiangliang Zhang
Are Federated Learning (FL) systems free from backdoor poisoning with the arsenal of various defense strategies deployed? This is an intriguing problem with significant practical implications regarding the utility of FL services. Despite the recent flourish of poisoning-resilient FL methods, our study shows that carefully tuning the collusion between malicious participants can minimize the trigger-induced bias of the poisoned local model from the poison-free one, which plays the key role in delivering stealthy backdoor attacks and circumventing a wide spectrum of state-of-the-art defense methods in FL. In our work, we instantiate the attack strategy by proposing a distributed backdoor attack method, namely Cerberus Poisoning (CerP). It jointly tunes the backdoor trigger and controls the poisoned model changes on each malicious participant to achieve a stealthy yet successful backdoor attack against a wide spectrum of defensive mechanisms of federated learning techniques. Our extensive study on 3 large-scale benchmark datasets and 13 mainstream defensive mechanisms confirms that Cerberus Poisoning raises a significantly severe threat to the integrity and security of federated learning practices, regardless of the flourish of robust Federated Learning methods.
联邦学习(FL)系统是否可以通过部署各种防御策略免于后门中毒?这是一个有趣的问题,对于FL服务的效用具有重要的实际意义。尽管最近毒弹性FL方法蓬勃发展,但我们的研究表明,仔细调整恶意参与者之间的勾结可以最大限度地减少触发引起的中毒局部模型偏差,这在提供隐形后门攻击和规避FL中广泛的最先进防御方法中起着关键作用。在我们的工作中,我们通过提出分布式后门攻击方法来实例化攻击策略。即Cerberus中毒(CerP)它联合调整后门触发器并控制每个恶意参与者的中毒模型更改,以实现针对联邦学习技术的广泛防御机制的隐蔽但成功的后门攻击。我们对3个大规模基准数据集和13种主流防御机制进行了广泛的研究,证实了Cerberus中毒对联邦学习实践的完整性和安全性构成了严重的威胁,无论健壮的联邦学习方法如何蓬勃发展。
{"title":"Poisoning with Cerberus: Stealthy and Colluded Backdoor Attack against Federated Learning","authors":"Xiaoting Lyu, Yufei Han, Wen Wang, Jingkai Liu, Bin Wang, Jiqiang Liu, Xiangliang Zhang","doi":"10.1609/aaai.v37i7.26083","DOIUrl":"https://doi.org/10.1609/aaai.v37i7.26083","url":null,"abstract":"Are Federated Learning (FL) systems free from backdoor poisoning with the arsenal of various defense strategies deployed? This is an intriguing problem with significant practical implications regarding the utility of FL services. Despite the recent flourish of poisoning-resilient FL methods, our study shows that carefully tuning the collusion between malicious participants can minimize the trigger-induced bias of the poisoned local model from the poison-free one, which plays the key role in delivering stealthy backdoor attacks and circumventing a wide spectrum of state-of-the-art defense methods in FL. In our work, we instantiate the attack strategy by proposing a distributed backdoor attack method, namely Cerberus Poisoning (CerP). It jointly tunes the backdoor trigger and controls the poisoned model changes on each malicious participant to achieve a stealthy yet successful backdoor attack against a wide spectrum of defensive mechanisms of federated learning techniques. Our extensive study on 3 large-scale benchmark datasets and 13 mainstream defensive mechanisms confirms that Cerberus Poisoning raises a significantly severe threat to the integrity and security of federated learning practices, regardless of the flourish of robust Federated Learning methods.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"31 1","pages":"9020-9028"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76604395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
DisGUIDE: Disagreement-Guided Data-Free Model Extraction DisGUIDE:分歧引导的无数据模型提取
Jonathan Rosenthal, Eric Enouen, H. Pham, Lin Tan
Recent model-extraction attacks on Machine Learning as a Service (MLaaS) systems have moved towards data-free approaches, showing the feasibility of stealing models trained with difficult-to-access data. However, these attacks are ineffective or limited due to the low accuracy of extracted models and the high number of queries to the models under attack. The high query cost makes such techniques infeasible for online MLaaS systems that charge per query.We create a novel approach to get higher accuracy and query efficiency than prior data-free model extraction techniques. Specifically, we introduce a novel generator training scheme that maximizes the disagreement loss between two clone models that attempt to copy the model under attack. This loss, combined with diversity loss and experience replay, enables the generator to produce better instances to train the clone models. Our evaluation on popular datasets CIFAR-10 and CIFAR-100 shows that our approach improves the final model accuracy by up to 3.42% and 18.48% respectively. The average number of queries required to achieve the accuracy of the prior state of the art is reduced by up to 64.95%. We hope this will promote future work on feasible data-free model extraction and defenses against such attacks.
最近针对机器学习即服务(MLaaS)系统的模型提取攻击已经转向无数据方法,这表明窃取用难以访问的数据训练的模型是可行的。然而,这些攻击是无效的或有限的,因为提取的模型的准确性较低,以及对被攻击模型的大量查询。高昂的查询成本使得这种技术对于每次查询收费的在线MLaaS系统来说不可行。我们创造了一种新的方法,比以前的无数据模型提取技术获得更高的准确性和查询效率。具体来说,我们引入了一种新的生成器训练方案,该方案最大限度地减少了试图复制被攻击模型的两个克隆模型之间的分歧损失。这种损失,结合多样性损失和经验重放,使生成器能够产生更好的实例来训练克隆模型。我们对流行数据集CIFAR-10和CIFAR-100的评估表明,我们的方法将模型的最终精度分别提高了3.42%和18.48%。达到现有技术的准确度所需的平均查询次数最多减少了64.95%。我们希望这将促进未来可行的无数据模型提取和防御此类攻击的工作。
{"title":"DisGUIDE: Disagreement-Guided Data-Free Model Extraction","authors":"Jonathan Rosenthal, Eric Enouen, H. Pham, Lin Tan","doi":"10.1609/aaai.v37i8.26150","DOIUrl":"https://doi.org/10.1609/aaai.v37i8.26150","url":null,"abstract":"Recent model-extraction attacks on Machine Learning as a Service (MLaaS) systems have moved towards data-free approaches, showing the feasibility of stealing models trained with difficult-to-access data. However, these attacks are ineffective or limited due to the low accuracy of extracted models and the high number of queries to the models under attack. The high query cost makes such techniques infeasible for online MLaaS systems that charge per query.\u0000We create a novel approach to get higher accuracy and query efficiency than prior data-free model extraction techniques. Specifically, we introduce a novel generator training scheme that maximizes the disagreement loss between two clone models that attempt to copy the model under attack. This loss, combined with diversity loss and experience replay, enables the generator to produce better instances to train the clone models. Our evaluation on popular datasets CIFAR-10 and CIFAR-100 shows that our approach improves the final model accuracy by up to 3.42% and 18.48% respectively. The average number of queries required to achieve the accuracy of the prior state of the art is reduced by up to 64.95%. We hope this will promote future work on feasible data-free model extraction and defenses against such attacks.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"517 1","pages":"9614-9622"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77148921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Leveraging Sub-class Discimination for Compositional Zero-Shot Learning 利用子类差别进行作文零射击学习
Xiaoming Hu, Zilei Wang
Compositional Zero-Shot Learning (CZSL) aims at identifying unseen compositions composed of previously seen attributes and objects during the test phase. In real images, the visual appearances of attributes and objects (primitive concepts) generally interact with each other. Namely, the visual appearances of an attribute may change when composed with different objects, and vice versa. But previous works overlook this important property. In this paper, we introduce a simple yet effective approach with leveraging sub-class discrimination. Specifically, we define the primitive concepts in different compositions as sub-classes, and then maintain the sub-class discrimination to address the above challenge. More specifically, inspired by the observation that the composed recognition models could account for the differences across sub-classes, we first propose to impose the embedding alignment between the composed and disentangled recognition to incorporate sub-class discrimination at the feature level. Then we develop the prototype modulator networks to adjust the class prototypes w.r.t. the composition information, which can enhance sub-class discrimination at the classifier level. We conduct extensive experiments on the challenging benchmark datasets, and the considerable performance improvement over state-of-the-art approaches is achieved, which indicates the effectiveness of our method. Our code is available at https://github.com/hxm97/SCD-CZSL.
组合零射击学习(CZSL)旨在识别在测试阶段由先前看到的属性和对象组成的未见过的组合。在真实图像中,属性和对象(原始概念)的视觉外观通常是相互作用的。也就是说,当与不同的对象组合时,属性的视觉外观可能会改变,反之亦然。但是以前的工作忽略了这一重要性质。在本文中,我们引入了一种简单而有效的利用子类区分的方法。具体来说,我们将不同组合中的原语概念定义为子类,然后保持子类区分来解决上述挑战。更具体地说,由于观察到组合识别模型可以解释子类之间的差异,我们首先提出在组合识别和解纠缠识别之间施加嵌入对齐,以在特征层面结合子类区分。然后,我们开发了原型调制器网络,利用组合信息来调整类原型,从而增强分类器层面的子类识别能力。我们在具有挑战性的基准数据集上进行了广泛的实验,与最先进的方法相比,取得了相当大的性能改进,这表明我们的方法是有效的。我们的代码可在https://github.com/hxm97/SCD-CZSL上获得。
{"title":"Leveraging Sub-class Discimination for Compositional Zero-Shot Learning","authors":"Xiaoming Hu, Zilei Wang","doi":"10.1609/aaai.v37i1.25168","DOIUrl":"https://doi.org/10.1609/aaai.v37i1.25168","url":null,"abstract":"Compositional Zero-Shot Learning (CZSL) aims at identifying unseen compositions composed of previously seen attributes and objects during the test phase. In real images, the visual appearances of attributes and objects (primitive concepts) generally interact with each other. Namely, the visual appearances of an attribute may change when composed with different objects, and vice versa. But previous works overlook this important property. In this paper, we introduce a simple yet effective approach with leveraging sub-class discrimination. Specifically, we define the primitive concepts in different compositions as sub-classes, and then maintain the sub-class discrimination to address the above challenge. More specifically, inspired by the observation that the composed recognition models could account for the differences across sub-classes, we first propose to impose the embedding alignment between the composed and disentangled recognition to incorporate sub-class discrimination at the feature level. Then we develop the prototype modulator networks to adjust the class prototypes w.r.t. the composition information, which can enhance sub-class discrimination at the classifier level. We conduct extensive experiments on the challenging benchmark datasets, and the considerable performance improvement over state-of-the-art approaches is achieved, which indicates the effectiveness of our method. Our code is available at https://github.com/hxm97/SCD-CZSL.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"38 1","pages":"890-898"},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80822271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1