首页 > 最新文献

International Joint Conference on Artificial Intelligence最新文献

英文 中文
Moral Planning Agents with LTL Values 具有LTL价值观的道德规划主体
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/47
Umberto Grandi, E. Lorini, Timothy Parker
A moral planning agent (MPA) seeks to compare two plans or compute an optimal plan in an interactive setting with other agents, where relative ideality and optimality of plans are defined with respect to a prioritized value base. We model MPAs whose values are expressed by formulas of linear temporal logic (LTL) and define comparison for both joint plans and individual plans. We introduce different evaluation criteria for individual plans including an optimistic (risk-seeking) criterion, a pessimistic (risk-averse) one, and two criteria based on the use of anticipated responsibility. We provide complexity results for a variety of MPA problems.
道德规划主体(MPA)寻求在与其他主体的交互环境中比较两个计划或计算一个最优计划,其中计划的相对理想性和最优性是根据优先值基础定义的。我们建立了用线性时间逻辑(LTL)公式表示的MPAs模型,并定义了联合计划和单独计划的比较。我们为单个计划引入了不同的评估标准,包括乐观(寻求风险)标准、悲观(规避风险)标准和基于预期责任使用的两个标准。我们提供了各种MPA问题的复杂性结果。
{"title":"Moral Planning Agents with LTL Values","authors":"Umberto Grandi, E. Lorini, Timothy Parker","doi":"10.24963/ijcai.2023/47","DOIUrl":"https://doi.org/10.24963/ijcai.2023/47","url":null,"abstract":"A moral planning agent (MPA) seeks to compare two plans or compute an optimal plan in an interactive setting with other agents, where relative ideality and optimality of plans are defined with respect to a prioritized value base. We model MPAs whose values are expressed by formulas of linear temporal logic (LTL) and define comparison for both joint plans and individual plans. We introduce different evaluation criteria for individual plans including an optimistic (risk-seeking) criterion, a pessimistic (risk-averse) one, and two criteria based on the use of anticipated responsibility. We provide complexity results for a variety of MPA problems.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115051465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DFVSR: Directional Frequency Video Super-Resolution via Asymmetric and Enhancement Alignment Network 基于非对称和增强对准网络的定向频率视频超分辨率
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/76
Shuting Dong, Feng Lu, Zhe Wu, Chun Yuan
Recently, techniques utilizing frequency-based methods have gained significant attention, as they exhibit exceptional restoration capabilities for detail and structure in video super-resolution tasks. However, most of these frequency-based methods mainly have three major limitations: 1) insufficient exploration of object motion information, 2) inadequate enhancement for high-fidelity regions, and 3) loss of spatial information during convolution. In this paper, we propose a novel network, Directional Frequency Video Super-Resolution (DFVSR), to address these limitations. Specifically, we reconsider object motion from a new perspective and propose Directional Frequency Representation (DFR), which not only borrows the property of frequency representation of detail and structure information but also contains the direction information of the object motion that is extremely significant in videos. Based on this representation, we propose a Directional Frequency-Enhanced Alignment (DFEA) to use double enhancements of task-related information for ensuring the retention of high-fidelity frequency regions to generate the high-quality alignment feature. Furthermore, we design a novel Asymmetrical U-shaped network architecture to progressively fuse these alignment features and output the final output. This architecture enables the intercommunication of the same level of resolution in the encoder and decoder to achieve the supplement of spatial information. Powered by the above designs, our method achieves superior performance over state-of-the-art models on both quantitative and qualitative evaluations.
最近,利用基于频率的方法的技术获得了极大的关注,因为它们在视频超分辨率任务中表现出对细节和结构的卓越恢复能力。然而,这些基于频率的方法大多存在三个主要的局限性:1)对目标运动信息的挖掘不足;2)对高保真区域的增强不足;3)卷积过程中空间信息的丢失。在本文中,我们提出了一种新的网络,定向频率视频超分辨率(DFVSR),以解决这些限制。具体来说,我们从一个新的角度重新考虑物体的运动,提出了方向频率表示(Directional Frequency Representation, DFR),它不仅借用了细节和结构信息的频率表示特性,而且还包含了在视频中非常重要的物体运动的方向信息。在此基础上,我们提出了一种定向频率增强对准(DFEA),利用任务相关信息的双重增强来确保高保真频率区域的保留,从而产生高质量的对准特征。此外,我们设计了一种新颖的非对称u型网络架构,以逐步融合这些对齐特征并输出最终输出。这种结构使得编码器和解码器在相同分辨率的情况下相互通信,从而实现空间信息的补充。在上述设计的支持下,我们的方法在定量和定性评估方面都优于最先进的模型。
{"title":"DFVSR: Directional Frequency Video Super-Resolution via Asymmetric and Enhancement Alignment Network","authors":"Shuting Dong, Feng Lu, Zhe Wu, Chun Yuan","doi":"10.24963/ijcai.2023/76","DOIUrl":"https://doi.org/10.24963/ijcai.2023/76","url":null,"abstract":"Recently, techniques utilizing frequency-based methods have gained significant attention, as they exhibit exceptional restoration capabilities for detail and structure in video super-resolution tasks. However, most of these frequency-based methods mainly have three major limitations: 1) insufficient exploration of object motion information, 2) inadequate enhancement for high-fidelity regions, and 3) loss of spatial information during convolution. In this paper, we propose a novel network, Directional Frequency Video Super-Resolution (DFVSR), to address these limitations. Specifically, we reconsider object motion from a new perspective and propose Directional Frequency Representation (DFR), which not only borrows the property of frequency representation of detail and structure information but also contains the direction information of the object motion that is extremely significant in videos. Based on this representation, we propose a Directional Frequency-Enhanced Alignment (DFEA) to use double enhancements of task-related information for ensuring the retention of high-fidelity frequency regions to generate the high-quality alignment feature. Furthermore, we design a novel Asymmetrical U-shaped network architecture to progressively fuse these alignment features and output the final output. This architecture enables the intercommunication of the same level of resolution in the encoder and decoder to achieve the supplement of spatial information. Powered by the above designs, our method achieves superior performance over state-of-the-art models on both quantitative and qualitative evaluations.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115249546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Action Space Reduction for Planning Domains 规划域的行动空间缩减
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/599
Harsha Kokel, Junkyu Lee, Michael Katz, Shirin Sohrabi, Kavitha Srinivas
Planning tasks succinctly represent labeled transition systems, with each ground action corresponding to a label. This granularity, however, is not necessary for solving planning tasks and can be harmful, especially for model-free methods. In order to apply such methods, the label sets are often manually reduced. In this work, we propose automating this manual process. We characterize a valid label reduction for classical planning tasks and propose an automated way of obtaining such valid reductions by leveraging lifted mutex groups. Our experiments show a significant reduction in the action label space size across a wide collection of planning domains. We demonstrate the benefit of our automated label reduction in two separate use cases: improved sample complexity of model-free reinforcement learning algorithms and speeding up successor generation in lifted planning. The code and supplementary material are available at https://github.com/IBM/Parameter-Seed-Set.
规划任务简洁地表示有标签的过渡系统,每个地面行动对应一个标签。然而,这种粒度对于解决计划任务并不是必需的,而且可能是有害的,特别是对于无模型方法。为了应用这些方法,标签集通常是手动减少的。在这项工作中,我们建议将这一手工过程自动化。我们描述了经典规划任务的有效标签缩减,并提出了一种利用解除互斥锁组获得这种有效缩减的自动化方法。我们的实验表明,在广泛的规划域集合中,动作标签空间大小显着减少。我们在两个独立的用例中展示了我们的自动标签减少的好处:提高了无模型强化学习算法的样本复杂性,加速解除计划中的后继生成。代码和补充材料可在https://github.com/IBM/Parameter-Seed-Set上获得。
{"title":"Action Space Reduction for Planning Domains","authors":"Harsha Kokel, Junkyu Lee, Michael Katz, Shirin Sohrabi, Kavitha Srinivas","doi":"10.24963/ijcai.2023/599","DOIUrl":"https://doi.org/10.24963/ijcai.2023/599","url":null,"abstract":"Planning tasks succinctly represent labeled transition systems, with each ground action corresponding to a label. This granularity, however, is not necessary for solving planning tasks and can be harmful, especially for model-free methods. In order to apply such methods, the label sets are often manually reduced. In this work, we propose automating this manual process. We characterize a valid label reduction for classical planning tasks and propose an automated way of obtaining such valid reductions by leveraging lifted mutex groups. Our experiments show a significant reduction in the action label space size across a wide collection of planning domains. We demonstrate the benefit of our automated label reduction in two separate use cases: improved sample complexity of model-free reinforcement learning algorithms and speeding up successor generation in lifted planning. The code and supplementary material are available at https://github.com/IBM/Parameter-Seed-Set.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115487729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Semantics- and Domain-Aware Adversarial Attacks 面向语义和领域感知的对抗性攻击
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/60
Jianping Zhang, Yung-Chieh Huang, Weibin Wu, Michael R. Lyu
Language models are known to be vulnerable to textual adversarial attacks, which add human-imperceptible perturbations to the input to mislead DNNs. It is thus imperative to devise effective attack algorithms to identify the deficiencies of DNNs before real-world deployment. However, existing word-level attacks have two major deficiencies: (1) They may change the semantics of the original sentence. (2) The generated adversarial sample can appear unnatural to humans due to the introduction of out-of-domain substitute words. In this paper, to address such drawbacks, we propose a semantics- and domain-aware word-level attack method. Specifically, we greedily replace the important words in a sentence with the ones suggested by a language model. The language model is trained to be semantics- and domain-aware via contrastive learning and in-domain pre-training. Furthermore, to balance the quality of adversarial examples and the attack success rate, we propose an iterative updating framework to optimize the contrastive learning loss and the in-domain pre-training loss in circular order. Comprehensive experimental comparisons confirm the superiority of our approach. Notably, compared with state-of-the-art benchmarks, our strategy can achieve over 3% improvement in attack success rates and 9.8% improvement in the quality of adversarial examples.
众所周知,语言模型很容易受到文本对抗性攻击,这种攻击会在输入中添加人类难以察觉的扰动,从而误导dnn。因此,在实际部署之前,必须设计有效的攻击算法来识别dnn的缺陷。然而,现有的词级攻击有两个主要缺陷:(1)可能会改变原句子的语义。(2)由于引入域外替代词,生成的对抗性样本对人类来说可能显得不自然。在本文中,为了解决这些缺陷,我们提出了一种语义和领域感知的词级攻击方法。具体来说,我们会贪婪地将句子中的重要单词替换为语言模型建议的单词。通过对比学习和域内预训练,将语言模型训练为语义感知和域感知。此外,为了平衡对抗样本的质量和攻击成功率,我们提出了一个迭代更新框架,以循环顺序优化对比学习损失和域内预训练损失。综合实验比较证实了我们方法的优越性。值得注意的是,与最先进的基准相比,我们的策略可以在攻击成功率上提高3%以上,在对抗性示例的质量上提高9.8%。
{"title":"Towards Semantics- and Domain-Aware Adversarial Attacks","authors":"Jianping Zhang, Yung-Chieh Huang, Weibin Wu, Michael R. Lyu","doi":"10.24963/ijcai.2023/60","DOIUrl":"https://doi.org/10.24963/ijcai.2023/60","url":null,"abstract":"Language models are known to be vulnerable to textual adversarial attacks, which add human-imperceptible perturbations to the input to mislead DNNs. It is thus imperative to devise effective attack algorithms to identify the deficiencies of DNNs before real-world deployment. However, existing word-level attacks have two major deficiencies: (1) They may change the semantics of the original sentence. (2) The generated adversarial sample can appear unnatural to humans due to the introduction of out-of-domain substitute words. In this paper, to address such drawbacks, we propose a semantics- and domain-aware word-level attack method. Specifically, we greedily replace the important words in a sentence with the ones suggested by a language model. The language model is trained to be semantics- and domain-aware via contrastive learning and in-domain pre-training. Furthermore, to balance the quality of adversarial examples and the attack success rate, we propose an iterative updating framework to optimize the contrastive learning loss and the in-domain pre-training loss in circular order. Comprehensive experimental comparisons confirm the superiority of our approach. Notably, compared with state-of-the-art benchmarks, our strategy can achieve over 3% improvement in attack success rates and 9.8% improvement in the quality of adversarial examples.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"271 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116422536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised and Few-Shot Parsing from Pretrained Language Models (Extended Abstract) 基于预训练语言模型的无监督少镜头解析(扩展摘要)
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/797
Zhiyuan Zeng, Deyi Xiong
This paper proposes two Unsupervised constituent Parsing models (UPOA and UPIO) that calculate inside and outside association scores solely based on the self-attention weight matrix learned in a pretrained language model. The proposed unsupervised parsing models are further extended to few-shot parsing models (FPOA, FPIO) that use a few annotated trees to fine-tune the linear projection matrices in self-attention. Experiments on PTB and SPRML show that both unsupervised and few-shot parsing methods are better than or comparable to the previous methods.
本文提出了两种无监督成分分析模型(UPOA和UPIO),它们仅基于预训练语言模型中学习到的自注意权矩阵计算内部和外部关联分数。提出的无监督解析模型进一步扩展为使用少量注释树对线性投影矩阵进行自关注微调的少镜头解析模型(FPOA、FPIO)。在PTB和SPRML上的实验表明,无监督和少镜头分析方法都优于或可与以前的方法相媲美。
{"title":"Unsupervised and Few-Shot Parsing from Pretrained Language Models (Extended Abstract)","authors":"Zhiyuan Zeng, Deyi Xiong","doi":"10.24963/ijcai.2023/797","DOIUrl":"https://doi.org/10.24963/ijcai.2023/797","url":null,"abstract":"This paper proposes two Unsupervised constituent Parsing models (UPOA and UPIO) that calculate inside and outside association scores solely based on the self-attention weight matrix learned in a pretrained language model. The proposed unsupervised parsing models are further extended to few-shot parsing models (FPOA, FPIO) that use a few annotated trees to fine-tune the linear projection matrices in self-attention. Experiments on PTB and SPRML show that both unsupervised and few-shot parsing methods are better than or comparable to the previous methods.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"304 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122416851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acoustic NLOS Imaging with Cross Modal Knowledge Distillation 基于交叉模态知识蒸馏的声学NLOS成像
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/156
Ui-Hyeon Shin, Seungwoo Jang, Kwangsu Kim
Acoustic non-line-of-sight (NLOS) imaging aims to reconstruct hidden scenes by analyzing reflections of acoustic waves. Despite recent developments in the field, existing methods still have limitations such as sensitivity to noise in a physical model and difficulty in reconstructing unseen objects in a deep learning model. To address these limitations, we propose a novel cross-modal knowledge distillation (CMKD) approach for acoustic NLOS imaging. Our method transfers knowledge from a well-trained image network to an audio network, effectively combining the strengths of both modalities. As a result, it is robust to noise and superior in reconstructing unseen objects. Additionally, we evaluate real-world datasets and demonstrate that the proposed method outperforms state-of-the-art methods in acoustic NLOS imaging. The experimental results indicate that CMKD is an effective solution for addressing the limitations of current acoustic NLOS imaging methods. Our code, model, and data are available at https://github.com/shineh96/Acoustic-NLOS-CMKD.
声学非视距成像(NLOS)旨在通过分析声波的反射来重建隐藏场景。尽管该领域最近取得了进展,但现有方法仍然存在局限性,例如物理模型中对噪声的敏感性以及在深度学习模型中重建看不见的物体的困难。为了解决这些限制,我们提出了一种新的跨模态知识蒸馏(CMKD)方法用于声学NLOS成像。我们的方法将知识从训练有素的图像网络转移到音频网络,有效地结合了两种模式的优势。结果表明,该方法对噪声具有较强的鲁棒性,在重建未见物体方面具有较好的优势。此外,我们评估了真实世界的数据集,并证明了所提出的方法在声学NLOS成像方面优于最先进的方法。实验结果表明,CMKD是解决现有声学近视场成像方法局限性的有效方法。我们的代码、模型和数据可在https://github.com/shineh96/Acoustic-NLOS-CMKD上获得。
{"title":"Acoustic NLOS Imaging with Cross Modal Knowledge Distillation","authors":"Ui-Hyeon Shin, Seungwoo Jang, Kwangsu Kim","doi":"10.24963/ijcai.2023/156","DOIUrl":"https://doi.org/10.24963/ijcai.2023/156","url":null,"abstract":"Acoustic non-line-of-sight (NLOS) imaging aims to reconstruct hidden scenes by analyzing reflections of acoustic waves. Despite recent developments in the field, existing methods still have limitations such as sensitivity to noise in a physical model and difficulty in reconstructing unseen objects in a deep learning model. To address these limitations, we propose a novel cross-modal knowledge distillation (CMKD) approach for acoustic NLOS imaging. Our method transfers knowledge from a well-trained image network to an audio network, effectively combining the strengths of both modalities. As a result, it is robust to noise and superior in reconstructing unseen objects. Additionally, we evaluate real-world datasets and demonstrate that the proposed method outperforms state-of-the-art methods in acoustic NLOS imaging. The experimental results indicate that CMKD is an effective solution for addressing the limitations of current acoustic NLOS imaging methods. Our code, model, and data are available at https://github.com/shineh96/Acoustic-NLOS-CMKD.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"2 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114044493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Game-theoretic Mechanisms for Eliciting Accurate Information 获取准确信息的博弈论机制
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/740
B. Faltings
Artificial Intelligence often relies on information obtained from others through crowdsourcing, federated learning, or data markets. It is crucial to ensure that this data is accurate. Over the past 20 years, a variety of incentive mechanisms have been developed that use game theory to reward the accuracy of contributed data. These techniques are applicable to many settings where AI uses contributed data.This survey categorizes the different techniques and their properties, and shows their limits and tradeoffs. It identifies open issues and points to possible directions to address these.
人工智能通常依赖于通过众包、联合学习或数据市场从他人那里获得的信息。确保这些数据的准确性至关重要。在过去的20年里,利用博弈论来奖励贡献数据的准确性的各种激励机制已经发展起来。这些技术适用于人工智能使用贡献数据的许多设置。本调查对不同的技术及其特性进行了分类,并显示了它们的限制和权衡。它指出了尚未解决的问题,并指出了解决这些问题的可能方向。
{"title":"Game-theoretic Mechanisms for Eliciting Accurate Information","authors":"B. Faltings","doi":"10.24963/ijcai.2023/740","DOIUrl":"https://doi.org/10.24963/ijcai.2023/740","url":null,"abstract":"Artificial Intelligence often relies on information obtained from others through crowdsourcing, federated learning, or data markets. It is crucial to ensure that this data is accurate. Over the past 20 years, a variety of incentive mechanisms have been developed that use game theory to reward the accuracy of contributed data. These techniques are applicable to many settings where AI uses contributed data.\u0000\u0000\u0000\u0000This survey categorizes the different techniques and their properties, and shows their limits and tradeoffs. It identifies open issues and points to possible directions to address these.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114124778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Boosting Decision-Based Black-Box Adversarial Attack with Gradient Priors 基于梯度先验增强决策的黑盒对抗攻击
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/133
Han Liu, Xingshuo Huang, Xiaotong Zhang, Qimai Li, Fenglong Ma, Wen Wang, Hongyang Chen, Hong Yu, Xianchao Zhang
Decision-based methods have shown to be effective in black-box adversarial attacks, as they can obtain satisfactory performance and only require to access the final model prediction. Gradient estimation is a critical step in black-box adversarial attacks, as it will directly affect the query efficiency. Recent works have attempted to utilize gradient priors to facilitate score-based methods to obtain better results. However, these gradient priors still suffer from the edge gradient discrepancy issue and the successive iteration gradient direction issue, thus are difficult to simply extend to decision-based methods. In this paper, we propose a novel Decision-based Black-box Attack framework with Gradient Priors (DBA-GP), which seamlessly integrates the data-dependent gradient prior and time-dependent prior into the gradient estimation procedure. First, by leveraging the joint bilateral filter to deal with each random perturbation, DBA-GP can guarantee that the generated perturbations in edge locations are hardly smoothed, i.e., alleviating the edge gradient discrepancy, thus remaining the characteristics of the original image as much as possible. Second, by utilizing a new gradient updating strategy to automatically adjust the successive iteration gradient direction, DBA-GP can accelerate the convergence speed, thus improving the query efficiency. Extensive experiments have demonstrated that the proposed method outperforms other strong baselines significantly.
基于决策的方法在黑盒对抗攻击中被证明是有效的,因为它们可以获得令人满意的性能,并且只需要访问最终的模型预测。梯度估计是黑盒对抗攻击的关键步骤,它将直接影响到查询效率。最近的工作试图利用梯度先验来促进基于分数的方法,以获得更好的结果。然而,这些梯度先验仍然存在边缘梯度差异问题和连续迭代梯度方向问题,难以简单地扩展到基于决策的方法。在本文中,我们提出了一种新的基于决策的带有梯度先验的黑盒攻击框架(DBA-GP),它将数据相关的梯度先验和时间相关的先验无缝地集成到梯度估计过程中。首先,DBA-GP利用联合双边滤波器处理每个随机扰动,保证在边缘位置产生的扰动几乎不被平滑,即减轻边缘梯度差异,从而尽可能地保留原始图像的特征。其次,利用一种新的梯度更新策略自动调整连续迭代梯度方向,加快了收敛速度,从而提高了查询效率。大量的实验表明,该方法明显优于其他强基线方法。
{"title":"Boosting Decision-Based Black-Box Adversarial Attack with Gradient Priors","authors":"Han Liu, Xingshuo Huang, Xiaotong Zhang, Qimai Li, Fenglong Ma, Wen Wang, Hongyang Chen, Hong Yu, Xianchao Zhang","doi":"10.24963/ijcai.2023/133","DOIUrl":"https://doi.org/10.24963/ijcai.2023/133","url":null,"abstract":"Decision-based methods have shown to be effective in black-box adversarial attacks, as they can obtain satisfactory performance and only require to access the final model prediction. Gradient estimation is a critical step in black-box adversarial attacks, as it will directly affect the query efficiency. Recent works have attempted to utilize gradient priors to facilitate score-based methods to obtain better results. However, these gradient priors still suffer from the edge gradient discrepancy issue and the successive iteration gradient direction issue, thus are difficult to simply extend to decision-based methods. In this paper, we propose a novel Decision-based Black-box Attack framework with Gradient Priors (DBA-GP), which seamlessly integrates the data-dependent gradient prior and time-dependent prior into the gradient estimation procedure. First, by leveraging the joint bilateral filter to deal with each random perturbation, DBA-GP can guarantee that the generated perturbations in edge locations are hardly smoothed, i.e., alleviating the edge gradient discrepancy, thus remaining the characteristics of the original image as much as possible. Second, by utilizing a new gradient updating strategy to automatically adjust the successive iteration gradient direction, DBA-GP can accelerate the convergence speed, thus improving the query efficiency. Extensive experiments have demonstrated that the proposed method outperforms other strong baselines significantly.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122090117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Customized Positional Encoding to Combine Static and Time-varying Data in Robust Representation Learning for Crop Yield Prediction 结合静态和时变数据的自定义位置编码鲁棒表示学习用于作物产量预测
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/676
Qinqing Liu, Fei Dou, Meijian Yang, Ezana Amdework, Guiling Wang, J. Bi
Accurate prediction of crop yield under the conditions of climate change is crucial to ensure food security. Transformers have shown remarkable success in modeling sequential data and hold the potential for improving crop yield prediction. To understand how weather and meteorological sequence variables affect crop yield, the positional encoding used in Transformers is typically shared across different sample sequences. We argue that it is necessary and beneficial to differentiate the positional encoding for distinct samples based on time-invariant properties of the sequences. Particularly, the sequence variables influencing crop yield vary according to static variables such as geographical locations. Sample data from southern areas may benefit from more tailored positional encoding different from that for northern areas. We propose a novel transformer based architecture for accurate and robust crop yield prediction, by introducing a Customized Positional Encoding (CPE) that encodes a sequence adaptively according to static information associated with the sequence. Empirical studies demonstrate the effectiveness of the proposed novel architecture and show that partially lin-earized attention better captures the bias introduced by side information than softmax re-weighting. The resultant crop yield prediction model is robust to climate change, with mean-absolute-error reduced by up to 26% compared to the best baseline model in extreme drought years.
准确预测气候变化条件下的作物产量对保障粮食安全至关重要。变压器在序列数据建模方面取得了显著的成功,并具有改善作物产量预测的潜力。为了理解天气和气象序列变量如何影响作物产量,变形金刚中使用的位置编码通常在不同的样本序列中共享。我们认为基于序列的时不变特性区分不同样本的位置编码是必要和有益的。特别是,影响作物产量的序列变量因地理位置等静态变量而异。南方地区的样本数据可能会受益于与北方地区不同的更定制的位置编码。我们提出了一种新的基于变压器的结构,通过引入自定义位置编码(CPE),根据与序列相关的静态信息自适应编码序列,实现准确和稳健的作物产量预测。实证研究证明了所提出的新架构的有效性,并表明部分线性化的注意力比softmax重新加权更能捕获由侧信息引入的偏见。所建立的作物产量预测模型对气候变化具有鲁棒性,在极端干旱年份,与最佳基线模型相比,平均绝对误差减少了26%。
{"title":"Customized Positional Encoding to Combine Static and Time-varying Data in Robust Representation Learning for Crop Yield Prediction","authors":"Qinqing Liu, Fei Dou, Meijian Yang, Ezana Amdework, Guiling Wang, J. Bi","doi":"10.24963/ijcai.2023/676","DOIUrl":"https://doi.org/10.24963/ijcai.2023/676","url":null,"abstract":"Accurate prediction of crop yield under the conditions of climate change is crucial to ensure food security. Transformers have shown remarkable success in modeling sequential data and hold the potential for improving crop yield prediction. To understand how weather and meteorological sequence variables affect crop yield, the positional encoding used in Transformers is typically shared across different sample sequences. We argue that it is necessary and beneficial to differentiate the positional encoding for distinct samples based on time-invariant properties of the sequences. Particularly, the sequence variables influencing crop yield vary according to static variables such as geographical locations. Sample data from southern areas may benefit from more tailored positional encoding different from that for northern areas. We propose a novel transformer based architecture for accurate and robust crop yield prediction, by introducing a Customized Positional Encoding (CPE) that encodes a sequence adaptively according to static information associated with the sequence. Empirical studies demonstrate the effectiveness of the proposed novel architecture and show that partially lin-\u0000\u0000earized attention better captures the bias introduced by side information than softmax re-weighting. The resultant crop yield prediction model is robust to climate change, with mean-absolute-error reduced by up to 26% compared to the best baseline model in extreme drought years.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122106970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial Behavior Exclusion for Safe Reinforcement Learning 安全强化学习的对抗行为排斥
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/54
Md Asifur Rahman, Tongtong Liu, Sarra M. Alqahtani
Learning by exploration makes reinforcement learning (RL) potentially attractive for many real-world applications. However, this learning process makes RL inherently too vulnerable to be used in real-world applications where safety is of utmost importance. Most prior studies consider exploration at odds with safety and thereby restrict it using either joint optimization of task and safety or imposing constraints for safe exploration. This paper migrates from the current convention to using exploration as a key to safety by learning safety as a robust behavior that completely excludes any behavioral pattern responsible for safety violations. Adversarial Behavior Exclusion for Safe RL (AdvEx-RL) learns a behavioral representation of the agent's safety violations by approximating an optimal adversary utilizing exploration and later uses this representation to learn a separate safety policy that excludes those unsafe behaviors. In addition, AdvEx-RL ensures safety in a task-agnostic manner by acting as a safety firewall and therefore can be integrated with any RL task policy. We demonstrate the robustness of AdvEx-RL via comprehensive experiments in standard constrained Markov decision processes (CMDP) environments under 2 white-box action space perturbations as well as with changes in environment dynamics against 7 baselines. Consistently, AdvEx-RL outperforms the baselines by achieving an average safety performance of over 75% in the continuous action space with 10 times more variations in the testing environment dynamics. By using a standalone safety policy independent of conflicting objectives, AdvEx-RL also paves the way for interpretable safety behavior analysis as we show in our user study.
通过探索学习使得强化学习(RL)对许多现实世界的应用具有潜在的吸引力。然而,这种学习过程使得强化学习本质上太脆弱,无法在安全至关重要的现实应用中使用。大多数先前的研究认为勘探与安全不一致,因此使用任务和安全的联合优化或对安全勘探施加约束来限制勘探。本文从当前的惯例迁移到使用探索作为安全的关键,通过学习安全作为一个健壮的行为,完全排除任何行为模式负责违反安全。安全RL的对抗行为排除(AdvEx-RL)通过利用探索近似最优对手来学习代理违反安全行为的行为表示,然后使用该表示来学习排除这些不安全行为的单独安全策略。此外,AdvEx-RL通过充当安全防火墙以任务无关的方式确保安全,因此可以与任何RL任务策略集成。我们通过在标准约束马尔可夫决策过程(CMDP)环境下的2个白盒行动空间扰动以及7个基线下环境动态变化的综合实验证明了AdvEx-RL的鲁棒性。AdvEx-RL在连续运动空间的平均安全性能超过75%,在测试环境动态变化的10倍以上,始终优于基线。通过使用独立于冲突目标的独立安全策略,AdvEx-RL还为可解释的安全行为分析铺平了道路,正如我们在用户研究中所示。
{"title":"Adversarial Behavior Exclusion for Safe Reinforcement Learning","authors":"Md Asifur Rahman, Tongtong Liu, Sarra M. Alqahtani","doi":"10.24963/ijcai.2023/54","DOIUrl":"https://doi.org/10.24963/ijcai.2023/54","url":null,"abstract":"Learning by exploration makes reinforcement learning (RL) potentially attractive for many real-world applications. However, this learning process makes RL inherently too vulnerable to be used in real-world applications where safety is of utmost importance. Most prior studies consider exploration at odds with safety and thereby restrict it using either joint optimization of task and safety or imposing constraints for safe exploration. This paper migrates from the current convention to using exploration as a key to safety by learning safety as a robust behavior that completely excludes any behavioral pattern responsible for safety violations. Adversarial Behavior Exclusion for Safe RL (AdvEx-RL) learns a behavioral representation of the agent's safety violations by approximating an optimal adversary utilizing exploration and later uses this representation to learn a separate safety policy that excludes those unsafe behaviors. In addition, AdvEx-RL ensures safety in a task-agnostic manner by acting as a safety firewall and therefore can be integrated with any RL task policy. We demonstrate the robustness of AdvEx-RL via comprehensive experiments in standard constrained Markov decision processes (CMDP) environments under 2 white-box action space perturbations as well as with changes in environment dynamics against 7 baselines. Consistently, AdvEx-RL outperforms the baselines by achieving an average safety performance of over 75% in the continuous action space with 10 times more variations in the testing environment dynamics. By using a standalone safety policy independent of conflicting objectives, AdvEx-RL also paves the way for interpretable safety behavior analysis as we show in our user study.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"267 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122176768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Joint Conference on Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1