首页 > 最新文献

International Joint Conference on Artificial Intelligence最新文献

英文 中文
SeRO: Self-Supervised Reinforcement Learning for Recovery from Out-of-Distribution Situations SeRO:自监督强化学习从非分布情况中恢复
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/432
Chan Kim, JaeKyung Cho, C. Bobda, Seung-Woo Seo, Seong-Woo Kim
Robotic agents trained using reinforcement learning have the problem of taking unreliable actions in an out-of-distribution (OOD) state. Agents can easily become OOD in real-world environments because it is almost impossible for them to visit and learn the entire state space during training. Unfortunately, unreliable actions do not ensure that agents perform their original tasks successfully. Therefore, agents should be able to recognize whether they are in OOD states and learn how to return to the learned state distribution rather than continue to take unreliable actions. In this study, we propose a novel method for retraining agents to recover from OOD situations in a self-supervised manner when they fall into OOD states. Our in-depth experimental results demonstrate that our method substantially improves the agent’s ability to recover from OOD situations in terms of sample efficiency and restoration of the performance for the original tasks. Moreover, we show that our method can retrain the agent to recover from OOD situations even when in-distribution states are difficult to visit through exploration. Code and supplementary materials are available at https://github.com/SNUChanKim/SeRO.
使用强化学习训练的机器人代理存在在非分布状态下采取不可靠行动的问题。在现实环境中,智能体很容易成为OOD,因为在训练过程中,它们几乎不可能访问和学习整个状态空间。不幸的是,不可靠的操作不能确保代理成功执行其原始任务。因此,agent应该能够识别自己是否处于OOD状态,并学习如何返回到学习到的状态分布,而不是继续采取不可靠的动作。在这项研究中,我们提出了一种新的方法来重新训练智能体,当它们陷入OOD状态时,以一种自我监督的方式从OOD状态中恢复过来。我们的深入实验结果表明,我们的方法在样本效率和原始任务性能恢复方面大大提高了智能体从OOD情况中恢复的能力。此外,我们证明了我们的方法可以重新训练智能体从OOD情况中恢复,即使在分布状态难以通过探索访问的情况下。代码和补充材料可在https://github.com/SNUChanKim/SeRO上获得。
{"title":"SeRO: Self-Supervised Reinforcement Learning for Recovery from Out-of-Distribution Situations","authors":"Chan Kim, JaeKyung Cho, C. Bobda, Seung-Woo Seo, Seong-Woo Kim","doi":"10.24963/ijcai.2023/432","DOIUrl":"https://doi.org/10.24963/ijcai.2023/432","url":null,"abstract":"Robotic agents trained using reinforcement learning have the problem of taking unreliable actions in an out-of-distribution (OOD) state. Agents can easily become OOD in real-world environments because it is almost impossible for them to visit and learn the entire state space during training. Unfortunately, unreliable actions do not ensure that agents perform their original tasks successfully. Therefore, agents should be able to recognize whether they are in OOD states and learn how to return to the learned state distribution rather than continue to take unreliable actions. In this study, we propose a novel method for retraining agents to recover from OOD situations in a self-supervised manner when they fall into OOD states. Our in-depth experimental results demonstrate that our method substantially improves the agent’s ability to recover from OOD situations in terms of sample efficiency and restoration of the performance for the original tasks. Moreover, we show that our method can retrain the agent to recover from OOD situations even when in-distribution states are difficult to visit through exploration. Code and supplementary materials are available at https://github.com/SNUChanKim/SeRO.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128078158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Optimization with Switching Cost: Regret Analysis and Lookahead Variants 具有切换代价的贝叶斯优化:后悔分析和前瞻变量
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/446
Peng Liu, Haowei Wang, Wei Qiyu
Bayesian Optimization (BO) has recently received increasing attention due to its efficiency in optimizing expensive-to-evaluate functions. For some practical problems, it is essential to consider the path-dependent switching cost between consecutive sampling locations given a total traveling budget. For example, when using a drone to locate cracks in a building wall or search for lost survivors in the wild, the search path needs to be efficiently planned given the limited battery power of the drone. Tackling such problems requires a careful cost-benefit analysis of candidate locations and balancing exploration and exploitation. In this work, we formulate such a problem as a constrained Markov Decision Process (MDP) and solve it by proposing a new distance-adjusted multi-step look-ahead acquisition function, the distUCB, and using rollout approximation. We also provide a theoretical regret analysis of the distUCB-based Bayesian optimization algorithm. In addition, the empirical performance of the proposed algorithm is tested based on both synthetic and real data experiments, and it shows that our cost-aware non-myopic algorithm performs better than other popular alternatives.
近年来,贝叶斯优化(BO)因其在优化昂贵函数方面的效率而受到越来越多的关注。对于一些实际问题,必须考虑给定总行程预算的连续采样点之间的路径依赖切换成本。例如,当使用无人机定位建筑物墙壁上的裂缝或在野外搜寻失踪的幸存者时,由于无人机的电池电量有限,需要有效地规划搜索路径。解决这些问题需要对候选地点进行仔细的成本效益分析,并平衡勘探和开采。在这项工作中,我们将这样的问题表述为约束马尔可夫决策过程(MDP),并通过提出一个新的距离调整多步前瞻获取函数distUCB和使用rollout逼近来解决它。我们还对基于distucb的贝叶斯优化算法进行了理论遗憾分析。此外,基于合成和真实数据实验对本文算法的经验性能进行了测试,结果表明本文算法的成本感知非近视算法的性能优于其他流行的替代算法。
{"title":"Bayesian Optimization with Switching Cost: Regret Analysis and Lookahead Variants","authors":"Peng Liu, Haowei Wang, Wei Qiyu","doi":"10.24963/ijcai.2023/446","DOIUrl":"https://doi.org/10.24963/ijcai.2023/446","url":null,"abstract":"Bayesian Optimization (BO) has recently received increasing attention due to its efficiency in optimizing expensive-to-evaluate functions. For some practical problems, it is essential to consider the path-dependent switching cost between consecutive sampling locations given a total traveling budget. For example, when using a drone to locate cracks in a building wall or search for lost survivors in the wild, the search path needs to be efficiently planned given the limited battery power of the drone. Tackling such problems requires a careful cost-benefit analysis of candidate locations and balancing exploration and exploitation. In this work, we formulate such a problem as a constrained Markov Decision Process (MDP) and solve it by proposing a new distance-adjusted multi-step look-ahead acquisition function, the distUCB, and using rollout approximation. We also provide a theoretical regret analysis of the distUCB-based Bayesian optimization algorithm. In addition, the empirical performance of the proposed algorithm is tested based on both synthetic and real data experiments, and it shows that our cost-aware non-myopic algorithm performs better than other popular alternatives.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132525800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Front-to-End Bidirectional Heuristic Search with Consistent Heuristics: Enumerating and Evaluating Algorithms and Bounds 具有一致启发式的前端到端双向启发式搜索:枚举和评估算法和边界
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/625
Lior Siag, Shahaf S. Shperberg, Ariel Felner, Nathan R Sturtevant
Recent research on bidirectional heuristic search (BiHS) is based on the must-expand pairs theory (MEP theory), which describes which pairs of nodes must be expanded during the search to guarantee the optimality of solutions. A separate line of research in BiHS has proposed algorithms that use lower bounds that are derived from consistent heuristics during search. This paper links these two directions, providing a comprehensive unifying view and showing that both existing and novel algorithms can be derived from the MEP theory. An extended set of bounds is formulated, encompassing both previously discovered bounds and new ones. Finally, the bounds are empirically evaluated by their contribution to the efficiency of the search
双向启发式搜索(BiHS)的最新研究是基于必须展开对理论(MEP理论),该理论描述了在搜索过程中必须展开哪些节点对以保证解的最优性。BiHS的另一条研究路线提出了使用搜索过程中从一致启发式推导出的下界的算法。本文将这两个方向联系起来,提供了一个全面统一的观点,并表明现有的和新的算法都可以从MEP理论中推导出来。一个扩展的边界集被制定,包括以前发现的边界和新的边界。最后,根据它们对搜索效率的贡献对边界进行经验评估
{"title":"Front-to-End Bidirectional Heuristic Search with Consistent Heuristics: Enumerating and Evaluating Algorithms and Bounds","authors":"Lior Siag, Shahaf S. Shperberg, Ariel Felner, Nathan R Sturtevant","doi":"10.24963/ijcai.2023/625","DOIUrl":"https://doi.org/10.24963/ijcai.2023/625","url":null,"abstract":"Recent research on bidirectional heuristic search (BiHS) is based on the must-expand pairs theory (MEP theory), which describes which pairs of nodes must be expanded during the search to guarantee the optimality of solutions. A separate line of research in BiHS has proposed algorithms that use lower bounds that are derived from consistent heuristics during search. This paper links these two directions, providing a comprehensive unifying view and showing that both existing and novel algorithms can be derived from the MEP theory. An extended set of bounds is formulated, encompassing both previously discovered bounds and new ones. Finally, the bounds are empirically evaluated by their contribution to the efficiency of the search","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133314020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plansformer Tool: Demonstrating Generation of Symbolic Plans Using Transformers 变压器工具:示范生成的符号计划使用变压器
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/839
Vishal Pallagani, Bharath Muppasani, Biplav Srivastava, F. Rossi, L. Horesh, K. Murugesan, Andrea Loreggia, F. Fabiano, Rony Joseph, Yathin Kethepalli
Plansformer is a novel tool that utilizes a fine-tuned language model based on transformer architecture to generate symbolic plans. Transformers are a type of neural network architecture that have been shown to be highly effective in a range of natural language processing tasks. Unlike traditional planning systems that use heuristic-based search strategies, Plansformer is fine-tuned on specific classical planning domains to generate high-quality plans that are both fluent and feasible. Plansformer takes the domain and problem files as input (in PDDL) and outputs a sequence of actions that can be executed to solve the problem. We demonstrate the effectiveness of Plansformer on a variety of benchmark problems and provide both qualitative and quantitative results obtained during our evaluation, including its limitations. Plansformer has the potential to significantly improve the efficiency and effectiveness of planning in various domains, from logistics and scheduling to natural language processing and human-computer interaction. In addition, we provide public access to Plansformer via a website as well as an API endpoint; this enables other researchers to utilize our tool for planning and execution. The demo video is available at https://youtu.be/_1rlctCGsrk
plantransformer是一种新颖的工具,它利用基于变压器架构的微调语言模型来生成符号规划。变形金刚是一种神经网络架构,已被证明在一系列自然语言处理任务中非常有效。与使用启发式搜索策略的传统规划系统不同,plantransformer对特定的经典规划领域进行了微调,以生成既流畅又可行的高质量规划。plantransformer将域和问题文件作为输入(在PDDL中),并输出一系列可以执行以解决问题的操作。我们展示了plantransformer在各种基准问题上的有效性,并提供了在评估过程中获得的定性和定量结果,包括其局限性。plantransformer有潜力显著提高各个领域的规划效率和有效性,从物流和调度到自然语言处理和人机交互。此外,我们通过网站和API端点提供对plantransformer的公共访问;这使其他研究人员能够利用我们的工具进行计划和执行。演示视频可在https://youtu.be/_1rlctCGsrk上获得
{"title":"Plansformer Tool: Demonstrating Generation of Symbolic Plans Using Transformers","authors":"Vishal Pallagani, Bharath Muppasani, Biplav Srivastava, F. Rossi, L. Horesh, K. Murugesan, Andrea Loreggia, F. Fabiano, Rony Joseph, Yathin Kethepalli","doi":"10.24963/ijcai.2023/839","DOIUrl":"https://doi.org/10.24963/ijcai.2023/839","url":null,"abstract":"Plansformer is a novel tool that utilizes a fine-tuned language model based on transformer architecture to generate symbolic plans. Transformers are a type of neural network architecture that have been shown to be highly effective in a range of natural language processing tasks. Unlike traditional planning systems that use heuristic-based search strategies, Plansformer is fine-tuned on specific classical planning domains to generate high-quality plans that are both fluent and feasible. Plansformer takes the domain and problem files as input (in PDDL) and outputs a sequence of actions that can be executed to solve the problem. We demonstrate the effectiveness of Plansformer on a variety of benchmark problems and provide both qualitative and quantitative results obtained during our evaluation, including its limitations. Plansformer has the potential to significantly improve the efficiency and effectiveness of planning in various domains, from logistics and scheduling to natural language processing and human-computer interaction. In addition, we provide public access to Plansformer via a website as well as an API endpoint; this enables other researchers to utilize our tool for planning and execution. The demo video is available at https://youtu.be/_1rlctCGsrk","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"5 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132287389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Annealing Genetic-based Preposition Substitution for Text Rubbish Example Generation 基于退火遗传的介词替换文本垃圾样例生成
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/569
Chen Li, Xinghao Yang, Baodi Liu, Weifeng Liu, Honglong Chen
Modern Natural Language Processing (NLP) models expose under-sensitivity towards text rubbish examples. The text rubbish example is the heavily modified input text which is nonsensical to humans but does not change the model’s prediction. Prior work crafts rubbish examples by iteratively deleting words and determining the deletion order with beam search. However, the produced rubbish examples usually cause a reduction in model confidence and sometimes deliver human-readable text. To address these problems, we propose an Annealing Genetic based Preposition Substitution (AGPS) algorithm for text rubbish sample generation with two major merits. Firstly, the AGPS crafts rubbish text examples by substituting input words with meaningless prepositions instead of directly removing them, which brings less degradation to the model’s confidence. Secondly, we design an Annealing Genetic algorithm to optimize the word replacement priority, which allows the Genetic Algorithm (GA) to jump out the local optima with probabilities. This is significant in achieving better objectives, i.e., a high word modification rate and a high model confidence. Experimental results on five popular datasets manifest the superiority of AGPS compared with the baseline and expose the fact: the NLP models can not really understand the semantics of sentences, as they give the same prediction with even higher confidence for the nonsensical preposition sequences.
现代自然语言处理(NLP)模型对文本垃圾样本的敏感性不足。文本垃圾示例是大量修改的输入文本,这些文本对人类来说是无意义的,但不会改变模型的预测。先前的工作是通过迭代删除单词和用波束搜索确定删除顺序来生成垃圾样例。然而,产生的垃圾示例通常会导致模型置信度降低,有时会提供人类可读的文本。为了解决这些问题,我们提出了一种基于退火遗传的介词替换(AGPS)算法用于文本垃圾样本生成,该算法具有两个主要优点。首先,AGPS通过用无意义的介词代替输入词来制作垃圾文本样例,而不是直接删除它们,这对模型的置信度降低较小。其次,我们设计了一种退火遗传算法来优化单词替换优先级,使遗传算法(GA)能够以概率跳出局部最优。这对于实现更好的目标非常重要,例如,高单词修改率和高模型置信度。在5个流行数据集上的实验结果显示了AGPS与基线相比的优势,并揭示了一个事实:NLP模型并不能真正理解句子的语义,因为它们对无意义介词序列给出了相同的预测,甚至更高的置信度。
{"title":"Annealing Genetic-based Preposition Substitution for Text Rubbish Example Generation","authors":"Chen Li, Xinghao Yang, Baodi Liu, Weifeng Liu, Honglong Chen","doi":"10.24963/ijcai.2023/569","DOIUrl":"https://doi.org/10.24963/ijcai.2023/569","url":null,"abstract":"Modern Natural Language Processing (NLP) models expose under-sensitivity towards text rubbish examples. The text rubbish example is the heavily modified input text which is nonsensical to humans but does not change the model’s prediction. Prior work crafts rubbish examples by iteratively deleting words and determining the deletion order with beam search. However, the produced rubbish examples usually cause a reduction in model confidence and sometimes deliver human-readable text. To address these problems, we propose an Annealing Genetic based Preposition Substitution (AGPS) algorithm for text rubbish sample generation with two major merits. Firstly, the AGPS crafts rubbish text examples by substituting input words with meaningless prepositions instead of directly removing them, which brings less degradation to the model’s confidence. Secondly, we design an Annealing Genetic algorithm to optimize the word replacement priority, which allows the Genetic Algorithm (GA) to jump out the local optima with probabilities. This is significant in achieving better objectives, i.e., a high word modification rate and a high model confidence. Experimental results on five popular datasets manifest the superiority of AGPS compared with the baseline and expose the fact: the NLP models can not really understand the semantics of sentences, as they give the same prediction with even higher confidence for the nonsensical preposition sequences.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134532812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic Recognition of the General-Purpose Communicative Functions Defined by the ISO 24617-2 Standard for Dialog Act Annotation (Extended Abstract) ISO 24617-2对话动作注释标准中通用交际功能的自动识别(扩展摘要)
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/788
Eugénio Ribeiro, Ricardo Ribeiro, David Martins de Matos
From the perspective of a dialog system, the identification of the intention behind the segments in a dialog is important, as it provides cues regarding the information present in the segments and how they should be interpreted. The ISO 24617-2 standard for dialog act annotation defines a hierarchically organized set of general-purpose communicative functions that correspond to different intentions that are relevant in the context of a dialog. In this paper, we explore the automatic recognition of these functions. To do so, we propose to adapt existing approaches to dialog act recognition, so that they can deal with the hierarchical classification problem. More specifically, we propose the use of an end-to-end hierarchical network with cascading outputs and maximum a posteriori path estimation to predict the communicative function at each level of the hierarchy, preserve the dependencies between the functions in the path, and decide at which level to stop. Additionally, we rely on transfer learning processes to address the data scarcity problem. Our experiments on the DialogBank show that this approach outperforms both flat and hierarchical approaches based on multiple classifiers and that each of its components plays an important role in the recognition of general-purpose communicative functions.
从对话系统的角度来看,识别对话片段背后的意图非常重要,因为它提供了关于片段中存在的信息以及如何解释它们的线索。对话行为注释的ISO 24617-2标准定义了一组分层组织的通用交流功能,这些功能对应于对话上下文中相关的不同意图。在本文中,我们对这些函数的自动识别进行了探讨。为此,我们提出对现有的对话行为识别方法进行改进,使其能够处理层次分类问题。更具体地说,我们建议使用具有级联输出和最大后测路径估计的端到端分层网络来预测每一层的通信功能,保留路径中功能之间的依赖关系,并决定在哪一层停止。此外,我们依靠迁移学习过程来解决数据稀缺问题。我们在DialogBank上的实验表明,这种方法优于基于多个分类器的扁平和分层方法,并且它的每个组件在通用交际功能的识别中都起着重要作用。
{"title":"Automatic Recognition of the General-Purpose Communicative Functions Defined by the ISO 24617-2 Standard for Dialog Act Annotation (Extended Abstract)","authors":"Eugénio Ribeiro, Ricardo Ribeiro, David Martins de Matos","doi":"10.24963/ijcai.2023/788","DOIUrl":"https://doi.org/10.24963/ijcai.2023/788","url":null,"abstract":"From the perspective of a dialog system, the identification of the intention behind the segments in a dialog is important, as it provides cues regarding the information present in the segments and how they should be interpreted. The ISO 24617-2 standard for dialog act annotation defines a hierarchically organized set of general-purpose communicative functions that correspond to different intentions that are relevant in the context of a dialog. In this paper, we explore the automatic recognition of these functions. To do so, we propose to adapt existing approaches to dialog act recognition, so that they can deal with the hierarchical classification problem. More specifically, we propose the use of an end-to-end hierarchical network with cascading outputs and maximum a posteriori path estimation to predict the communicative function at each level of the hierarchy, preserve the dependencies between the functions in the path, and decide at which level to stop. Additionally, we rely on transfer learning processes to address the data scarcity problem. Our experiments on the DialogBank show that this approach outperforms both flat and hierarchical approaches based on multiple classifiers and that each of its components plays an important role in the recognition of general-purpose communicative functions.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134072340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-level Graph Contrastive Prototypical Clustering 多层次图对比原型聚类
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/513
Yuchao Zhang, Yuan Yuan, Qi Wang
Recently, graph neural networks (GNNs) have drawn a surge of investigations in deep graph clustering. Nevertheless, existing approaches predominantly are inclined to semantic-agnostic since GNNs exhibit inherent limitations in capturing global underlying semantic structures. Meanwhile, multiple objectives are imposed within one latent space, whereas representations from different granularities may presumably conflict with each other, yielding severe performance degradation for clustering. To this end, we propose a novel Multi-Level Graph Contrastive Prototypical Clustering (MLG-CPC) framework for end-to-end clustering. Specifically, a Prototype Discrimination (ProDisc) objective function is proposed to explicitly capture semantic information via cluster assignments. Moreover, to alleviate the issue of objectives conflict, we introduce to perceive representations of different granularities within individual feature-, prototypical-, and cluster-level spaces by the feature decorrelation, prototype contrast, and cluster space consistency respectively. Extensive experiments on four benchmarks demonstrate the superiority of the proposed MLG-CPC against the state-of-the-art graph clustering approaches.
近年来,图神经网络(gnn)在深度图聚类方面引起了广泛的研究。然而,现有的方法主要倾向于语义不可知论,因为gnn在捕获全局底层语义结构方面表现出固有的局限性。同时,在一个潜在空间内强加了多个目标,而来自不同粒度的表示可能会相互冲突,从而导致聚类的性能严重下降。为此,我们提出了一种新的多层次图对比原型聚类(MLG-CPC)框架,用于端到端聚类。具体而言,提出了一个原型判别(ProDisc)目标函数,通过聚类分配明确地捕获语义信息。此外,为了缓解目标冲突问题,我们分别通过特征去相关、原型对比和聚类空间一致性来感知个体特征级、原型级和聚类级空间中不同粒度的表征。在四个基准上的大量实验证明了所提出的MLG-CPC与最先进的图聚类方法相比的优越性。
{"title":"Multi-level Graph Contrastive Prototypical Clustering","authors":"Yuchao Zhang, Yuan Yuan, Qi Wang","doi":"10.24963/ijcai.2023/513","DOIUrl":"https://doi.org/10.24963/ijcai.2023/513","url":null,"abstract":"Recently, graph neural networks (GNNs) have drawn a surge of investigations in deep graph clustering. Nevertheless, existing approaches predominantly are inclined to semantic-agnostic since GNNs exhibit inherent limitations in capturing global underlying semantic structures. Meanwhile, multiple objectives are imposed within one latent space, whereas representations from different granularities may presumably conflict with each other, yielding severe performance degradation for clustering. To this end, we propose a novel Multi-Level Graph Contrastive Prototypical Clustering (MLG-CPC) framework for end-to-end clustering. Specifically, a Prototype Discrimination (ProDisc) objective function is proposed to explicitly capture semantic information via cluster assignments. Moreover, to alleviate the issue of objectives conflict, we introduce to perceive representations of different granularities within individual feature-, prototypical-, and cluster-level spaces by the feature decorrelation, prototype contrast, and cluster space consistency respectively. Extensive experiments on four benchmarks demonstrate the superiority of the proposed MLG-CPC against the state-of-the-art graph clustering approaches.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114229520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VecoCare: Visit Sequences-Clinical Notes Joint Learning for Diagnosis Prediction in Healthcare Data VecoCare:访问序列-临床记录联合学习在医疗保健数据中的诊断预测
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/547
Yongxin Xu, Kai Yang, Chaohe Zhang, Peinie Zou, Zhiyuan Wang, Hongxin Ding, Junfeng Zhao, Yasha Wang, Bing Xie
Due to the insufficiency of electronic health records (EHR) data utilized in practical diagnosis prediction scenarios, most works are devoted to learning powerful patient representations either from structured EHR data (e.g., temporal medical events, lab test results, etc.) or unstructured data (e.g., clinical notes, etc.). However, synthesizing rich information from both of them still needs to be explored. Firstly, the heterogeneous semantic biases across them heavily hinder the synthesis of representation spaces, which is critical for diagnosis prediction. Secondly, the intermingled quality of partial clinical notes leads to inadequate representations of to-be-predicted patients. Thirdly, typical attention mechanisms mainly focus on aggregating information from similar patients, ignoring important auxiliary information from others. To tackle these challenges, we propose a novel visit sequences-clinical notes joint learning approach, dubbed VecoCare. It performs a Gromov-Wasserstein Distance (GWD)-based contrastive learning task and an adaptive masked language model task in a sequential pre-training manner to reduce heterogeneous semantic biases. After pre-training, VecoCare further aggregates information from both similar and dissimilar patients through a dual-channel retrieval mechanism. We conduct diagnosis prediction experiments on two real-world datasets, which indicates that VecoCare outperforms state-of-the-art approaches. Moreover, the findings discovered by VecoCare are consistent with the medical researches.
由于实际诊断预测场景中使用的电子健康记录(EHR)数据的不足,大多数工作都致力于从结构化的EHR数据(例如,时间医疗事件,实验室测试结果等)或非结构化数据(例如,临床记录等)中学习强大的患者表示。然而,从两者中合成丰富的信息仍然需要探索。首先,它们之间的异构语义偏差严重阻碍了表征空间的综合,而表征空间的综合对诊断预测至关重要。其次,部分临床记录的混杂质量导致待预测患者的不充分代表。第三,典型的注意机制主要集中于对同类患者信息的聚合,忽略了来自其他患者的重要辅助信息。为了应对这些挑战,我们提出了一种新颖的就诊顺序-临床记录联合学习方法,称为VecoCare。该算法以顺序预训练的方式执行基于Gromov-Wasserstein距离(GWD)的对比学习任务和自适应屏蔽语言模型任务,以减少异构语义偏差。经过预训练后,VecoCare通过双通道检索机制进一步聚合相似和不相似患者的信息。我们在两个真实世界的数据集上进行了诊断预测实验,这表明VecoCare优于最先进的方法。此外,VecoCare的发现与医学研究结果一致。
{"title":"VecoCare: Visit Sequences-Clinical Notes Joint Learning for Diagnosis Prediction in Healthcare Data","authors":"Yongxin Xu, Kai Yang, Chaohe Zhang, Peinie Zou, Zhiyuan Wang, Hongxin Ding, Junfeng Zhao, Yasha Wang, Bing Xie","doi":"10.24963/ijcai.2023/547","DOIUrl":"https://doi.org/10.24963/ijcai.2023/547","url":null,"abstract":"Due to the insufficiency of electronic health records (EHR) data utilized in practical diagnosis prediction scenarios, most works are devoted to learning powerful patient representations either from structured EHR data (e.g., temporal medical events, lab test results, etc.) or unstructured data (e.g., clinical notes, etc.). However, synthesizing rich information from both of them still needs to be explored. Firstly, the heterogeneous semantic biases across them heavily hinder the synthesis of representation spaces, which is critical for diagnosis prediction. Secondly, the intermingled quality of partial clinical notes leads to inadequate representations of to-be-predicted patients. Thirdly, typical attention mechanisms mainly focus on aggregating information from similar patients, ignoring important auxiliary information from others. To tackle these challenges, we propose a novel visit sequences-clinical notes joint learning approach, dubbed VecoCare. It performs a Gromov-Wasserstein Distance (GWD)-based contrastive learning task and an adaptive masked language model task in a sequential pre-training manner to reduce heterogeneous semantic biases. After pre-training, VecoCare further aggregates information from both similar and dissimilar patients through a dual-channel retrieval mechanism. We conduct diagnosis prediction experiments on two real-world datasets, which indicates that VecoCare outperforms state-of-the-art approaches. Moreover, the findings discovered by VecoCare are consistent with the medical researches.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"194 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114737476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Context-Aware Feature Selection and Classification 上下文感知特征选择和分类
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/480
Juanyan Wang, M. Bilgic
We propose a joint model that performs instance-level feature selection and classification. For a given case, the joint model first skims the full feature vector, decides which features are relevant for that case, and makes a classification decision using only the selected features, resulting in compact, interpretable, and case-specific classification decisions. Because the selected features depend on the case at hand, we refer to this approach as context-aware feature selection and classification. The model can be trained on instances that are annotated by experts with both class labels and instance-level feature selections, so it can select instance-level features that humans would use. Experiments on several datasets demonstrate that the proposed model outperforms eight baselines on a combined classification and feature selection measure, and is able to better emulate the ground-truth instance-level feature selections. The supplementary materials are available at https://github.com/IIT-ML/IJCAI23-CFSC.
我们提出了一个执行实例级特征选择和分类的联合模型。对于给定的情况,联合模型首先略读完整的特征向量,决定哪些特征与该情况相关,然后仅使用选定的特征做出分类决策,从而产生紧凑的、可解释的和特定于情况的分类决策。因为选择的特征取决于手头的情况,所以我们将这种方法称为上下文感知特征选择和分类。该模型可以在由专家使用类标签和实例级特征选择进行注释的实例上进行训练,因此它可以选择人类将使用的实例级特征。在多个数据集上的实验表明,该模型在分类和特征选择的组合度量上优于8个基线,并且能够更好地模拟真实的实例级特征选择。补充材料可在https://github.com/IIT-ML/IJCAI23-CFSC上获得。
{"title":"Context-Aware Feature Selection and Classification","authors":"Juanyan Wang, M. Bilgic","doi":"10.24963/ijcai.2023/480","DOIUrl":"https://doi.org/10.24963/ijcai.2023/480","url":null,"abstract":"We propose a joint model that performs instance-level feature selection and classification. For a given case, the joint model first skims the full feature vector, decides which features are relevant for that case, and makes a classification decision using only the selected features, resulting in compact, interpretable, and case-specific classification decisions. Because the selected features depend on the case at hand, we refer to this approach as context-aware feature selection and classification. The model can be trained on instances that are annotated by experts with both class labels and instance-level feature selections, so it can select instance-level features that humans would use. Experiments on several datasets demonstrate that the proposed model outperforms eight baselines on a combined classification and feature selection measure, and is able to better emulate the ground-truth instance-level feature selections. The supplementary materials are available at https://github.com/IIT-ML/IJCAI23-CFSC.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116728883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unbiased Risk Estimator to Multi-Labeled Complementary Label Learning 多标签互补标签学习的无偏风险估计
Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/415
Yi Gao, Miao Xu, Min-Ling Zhang
Multi-label learning (MLL) usually requires assigning multiple relevant labels to each instance. While a fully supervised MLL dataset needs a large amount of labeling effort, using complementary labels can help alleviate this burden. However, current approaches to learning from complementary labels are mainly designed for multi-class learning and assume that each instance has a single relevant label. This means that these approaches cannot be easily applied to MLL when only complementary labels are provided, where the number of relevant labels is unknown and can vary across instances. In this paper, we first propose the unbiased risk estimator for the multi-labeled complementary label learning (MLCLL) problem. We also provide an estimation error bound to ensure the convergence of the empirical risk estimator. In some cases, the unbiased estimator may give unbounded gradients for certain loss functions and result in overfitting. To mitigate this problem, we improve the risk estimator by minimizing a proper loss function, which has been shown to improve gradient updates. Our experimental results demonstrate the effectiveness of the proposed approach on various datasets.
多标签学习(MLL)通常需要为每个实例分配多个相关标签。虽然完全监督的MLL数据集需要大量的标记工作,但使用补充标签可以帮助减轻这一负担。然而,目前从互补标签中学习的方法主要是为多类学习设计的,并且假设每个实例都有一个单独的相关标签。这意味着,当只提供互补标签时,这些方法不能很容易地应用于MLL,其中相关标签的数量是未知的,并且可能因实例而异。本文首先提出了多标签互补标签学习(MLCLL)问题的无偏风险估计量。为了保证经验风险估计量的收敛性,我们还给出了估计误差界。在某些情况下,无偏估计量可能对某些损失函数给出无界梯度并导致过拟合。为了缓解这个问题,我们通过最小化适当的损失函数来改进风险估计器,这已经被证明可以改善梯度更新。我们的实验结果证明了该方法在各种数据集上的有效性。
{"title":"Unbiased Risk Estimator to Multi-Labeled Complementary Label Learning","authors":"Yi Gao, Miao Xu, Min-Ling Zhang","doi":"10.24963/ijcai.2023/415","DOIUrl":"https://doi.org/10.24963/ijcai.2023/415","url":null,"abstract":"Multi-label learning (MLL) usually requires assigning multiple relevant labels to each instance. While a fully supervised MLL dataset needs a large amount of labeling effort, using complementary labels can help alleviate this burden. However, current approaches to learning from complementary labels are mainly designed for multi-class learning and assume that each instance has a single relevant label. This means that these approaches cannot be easily applied to MLL when only complementary labels are provided, where the number of relevant labels is unknown and can vary across instances. In this paper, we first propose the unbiased risk estimator for the multi-labeled complementary label learning (MLCLL) problem. We also provide an estimation error bound to ensure the convergence of the empirical risk estimator. In some cases, the unbiased estimator may give unbounded gradients for certain loss functions and result in overfitting. To mitigate this problem, we improve the risk estimator by minimizing a proper loss function, which has been shown to improve gradient updates. Our experimental results demonstrate the effectiveness of the proposed approach on various datasets.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115881047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
International Joint Conference on Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1