International Joint Conference on Artificial Intelligence最新文献

英文中文

KDLGT: A Linear Graph Transformer Framework via Kernel Decomposition Approach 基于核分解方法的线性图转换器框架

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/263

Yi Wu, Yanyang Xu, Wenhao Zhu, Guojie Song, Zhouchen Lin, Liangji Wang, Shaoguo Liu

In recent years, graph Transformers (GTs) have been demonstrated as a robust architecture for a wide range of graph learning tasks. However, the quadratic complexity of GTs limits their scalability on large-scale data, in comparison to Graph Neural Networks (GNNs). In this work, we propose the Kernel Decomposition Linear Graph Transformer (KDLGT), an accelerating framework for building scalable and powerful GTs. KDLGT employs the kernel decomposition approach to rearrange the order of matrix multiplication, thereby reducing complexity to linear. Additionally, it categorizes GTs into three distinct types and provides tailored accelerating methods for each category to encompass all types of GTs. Furthermore, we provide a theoretical analysis of the performance gap between KDLGT and self-attention to ensure its effectiveness. Under this framework, we select two representative GTs to design our models. Experiments on both real-world and synthetic datasets indicate that KDLGT not only achieves state-of-the-art performance on various datasets but also reaches an acceleration ratio of approximately 10 on graphs of certain sizes.

近年来，图形转换器(gt)已被证明是一种用于广泛的图形学习任务的鲁棒架构。然而，与图神经网络(gnn)相比，GTs的二次复杂度限制了它们在大规模数据上的可扩展性。在这项工作中，我们提出了核分解线性图转换器(KDLGT)，这是一个用于构建可扩展和强大的gt的加速框架。KDLGT采用核分解方法重新排列矩阵乘法的顺序，从而将复杂度降低到线性。此外，它将gt分为三种不同的类型，并为每种类型提供量身定制的加速方法，以涵盖所有类型的gt。此外，我们还从理论上分析了KDLGT与自我注意之间的性能差距，以确保其有效性。在此框架下，我们选择了两个具有代表性的gt来设计模型。在真实世界和合成数据集上的实验表明，KDLGT不仅在各种数据集上达到了最先进的性能，而且在特定大小的图上达到了大约10的加速比。

{"title":"KDLGT: A Linear Graph Transformer Framework via Kernel Decomposition Approach","authors":"Yi Wu, Yanyang Xu, Wenhao Zhu, Guojie Song, Zhouchen Lin, Liangji Wang, Shaoguo Liu","doi":"10.24963/ijcai.2023/263","DOIUrl":"https://doi.org/10.24963/ijcai.2023/263","url":null,"abstract":"In recent years, graph Transformers (GTs) have been demonstrated as a robust architecture for a wide range of graph learning tasks. However, the quadratic complexity of GTs limits their scalability on large-scale data, in comparison to Graph Neural Networks (GNNs). In this work, we propose the Kernel Decomposition Linear Graph Transformer (KDLGT), an accelerating framework for building scalable and powerful GTs. KDLGT employs the kernel decomposition approach to rearrange the order of matrix multiplication, thereby reducing complexity to linear. Additionally, it categorizes GTs into three distinct types and provides tailored accelerating methods for each category to encompass all types of GTs. Furthermore, we provide a theoretical analysis of the performance gap between KDLGT and self-attention to ensure its effectiveness. Under this framework, we select two representative GTs to design our models. Experiments on both real-world and synthetic datasets indicate that KDLGT not only achieves state-of-the-art performance on various datasets but also reaches an acceleration ratio of approximately 10 on graphs of certain sizes.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129445328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

AutoML for Outlier Detection with Optimal Transport Distances 具有最佳运输距离的离群点自动检测

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/843

Prabhant Singh, J. Vanschoren

Automated machine learning (AutoML) has been widely researched and adopted for supervised problems, but progress in unsupervised settings has been limited. We propose `"LOTUS", a novel framework to automate outlier detection based on meta-learning. Our premise is that the selection of the optimal outlier detection technique depends on the inherent properties of the data distribution. We leverage optimal transport to find the dataset with the most similar underlying distribution, and then apply the outlier detection techniques that proved to work best for that data distribution. We evaluate the robustness of our framework and find that it outperforms all state-of-the-art automated outlier detection tools. This approach can also be easily generalized to automate other unsupervised settings.

自动化机器学习(AutoML)已被广泛研究和应用于有监督问题，但在无监督环境中的进展有限。我们提出了“LOTUS”，这是一个基于元学习的新框架，可以自动检测异常值。我们的前提是，最优离群点检测技术的选择取决于数据分布的固有特性。我们利用最优传输来找到具有最相似底层分布的数据集，然后应用被证明最适合该数据分布的离群值检测技术。我们评估了框架的稳健性，发现它优于所有最先进的自动离群检测工具。这种方法也可以很容易地推广到自动化其他无监督设置。

引用次数: 0

Promoting Gender Equality through Gender-biased Language Analysis in Social Media 通过社会媒体中的性别偏见语言分析促进性别平等

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/689

G. Singh, Soumitra Ghosh, Asif Ekbal

Gender bias is a pervasive issue that impacts women's and marginalized groups' ability to fully participate in social, economic, and political spheres. This study introduces a novel problem of Gender-biased Language Identification and Extraction (GLIdE) from social media interactions and develops a multi-task deep framework that detects gender-biased content and identifies connected causal phrases from the text using emotional information that is present in the input. The method uses a zero-shot strategy with emotional information and a mechanism to represent gender-stereotyped information as a knowledge graph. In this work, we also introduce the first-of-its-kind Gender-biased Analysis Corpus (GAC) of 12,432 social media posts and improve the best-performing baseline for gender-biased language identification and extraction tasks by margins of 4.88% and 5 ROS points, demonstrating this through empirical evaluation and extensive qualitative analysis. By improving the accuracy of identifying and analyzing gender-biased language, this work can contribute to achieving gender equality and promoting inclusive societies, in line with the United Nations Sustainable Development Goals (UN SDGs) and the Leave No One Behind principle (LNOB). We adhere to the principles of transparency and collaboration in line with the UN SDGs by openly sharing our code and dataset.

性别偏见是一个普遍存在的问题，影响着妇女和边缘群体充分参与社会、经济和政治领域的能力。本研究从社交媒体互动中引入了一个新的性别偏见语言识别和提取(GLIdE)问题，并开发了一个多任务深度框架，该框架可以检测性别偏见内容，并使用输入中的情感信息从文本中识别出相关的因果短语。该方法采用了一种带有情感信息的零投策略和一种将性别刻板印象信息表示为知识图谱的机制。在这项工作中，我们还引入了首个包含12,432个社交媒体帖子的性别偏见分析语料库(GAC)，并通过实证评估和广泛的定性分析证明，将性别偏见语言识别和提取任务的最佳表现基线提高了4.88%和5个ROS点。通过提高识别和分析性别偏见语言的准确性，这项工作有助于实现性别平等和促进包容性社会，符合联合国可持续发展目标(UN SDGs)和不让任何人掉队原则(LNOB)。我们通过公开分享代码和数据集，坚持透明和协作的原则，符合联合国可持续发展目标。

{"title":"Promoting Gender Equality through Gender-biased Language Analysis in Social Media","authors":"G. Singh, Soumitra Ghosh, Asif Ekbal","doi":"10.24963/ijcai.2023/689","DOIUrl":"https://doi.org/10.24963/ijcai.2023/689","url":null,"abstract":"Gender bias is a pervasive issue that impacts women's and marginalized groups' ability to fully participate in social, economic, and political spheres. This study introduces a novel problem of Gender-biased Language Identification and Extraction (GLIdE) from social media interactions and develops a multi-task deep framework that detects gender-biased content and identifies connected causal phrases from the text using emotional information that is present in the input. The method uses a zero-shot strategy with emotional information and a mechanism to represent gender-stereotyped information as a knowledge graph. In this work, we also introduce the first-of-its-kind Gender-biased Analysis Corpus (GAC) of 12,432 social media posts and improve the best-performing baseline for gender-biased language identification and extraction tasks by margins of 4.88% and 5 ROS points, demonstrating this through empirical evaluation and extensive qualitative analysis. By improving the accuracy of identifying and analyzing gender-biased language, this work can contribute to achieving gender equality and promoting inclusive societies, in line with the United Nations Sustainable Development Goals (UN SDGs) and the Leave No One Behind principle (LNOB). We adhere to the principles of transparency and collaboration in line with the UN SDGs by openly sharing our code and dataset.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127065231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Automatic Verification for Soundness of Bounded QNP Abstractions for Generalized Planning 广义规划有界QNP抽象完备性的自动验证

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/351

Zhenhe Cui, Weidu Kuang, Yongmei Liu

Generalized planning (GP) studies the computation of general solutions for a set of planning problems. Computing general solutions with correctness guarantee has long been a key issue in GP. Abstractions are widely used to solve GP problems. For example, a popular abstraction model for GP is qualitative numeric planning (QNP), which extends classical planning with non-negative real variables that can be increased or decreased by some arbitrary amount. The refinement of correct solutions of sound abstractions are solutions with correctness guarantees for GP problems. More recent literature proposed a uniform abstraction framework for GP and gave model-theoretic definitions of sound and complete abstractions for GP problems. In this paper, based on the previous work, we explore automatic verification of sound abstractions for GP. Firstly, we present a proof-theoretic characterization for sound abstractions. Secondly, based on the characterization, we give a sufficient condition for sound abstractions with deterministic actions. Then we study how to verify the sufficient condition when the abstraction models are bounded QNPs where integer variables can be incremented or decremented by one. To this end, we develop methods to handle counting and transitive closure, which are often used to define numerical variables. Finally, we implement a sound bounded QNP abstraction verification system and report experimental results on several domains.

广义规划研究一组规划问题的一般解的计算。计算具有正确性保证的通用解一直是GP中的关键问题。抽象被广泛用于解决GP问题。例如，一种流行的GP抽象模型是定性数值规划(QNP)，它扩展了经典规划，使用非负实变量，这些变量可以任意增加或减少。健全抽象的正确解的细化是GP问题的正确性保证的解。最近的文献提出了GP的统一抽象框架，并给出了GP问题的健全和完全抽象的模型理论定义。本文在前人工作的基础上，探讨了GP语音抽象的自动验证。首先，我们提出了声音抽象的证明理论表征。其次，在定性的基础上，给出了具有确定性动作的健全抽象的充分条件。然后，我们研究了如何验证抽象模型是整数变量可以加1或减1的有界qnp的充分条件。为此，我们开发了处理计数和传递闭包的方法，它们通常用于定义数值变量。最后，我们实现了一个完善的有界QNP抽象验证系统，并报告了几个领域的实验结果。

{"title":"Automatic Verification for Soundness of Bounded QNP Abstractions for Generalized Planning","authors":"Zhenhe Cui, Weidu Kuang, Yongmei Liu","doi":"10.24963/ijcai.2023/351","DOIUrl":"https://doi.org/10.24963/ijcai.2023/351","url":null,"abstract":"Generalized planning (GP) studies the computation of general solutions for a set of planning problems. Computing general solutions with correctness guarantee has long been a key issue in GP. Abstractions are widely used to solve GP problems. For example, a popular abstraction model for GP is qualitative numeric planning (QNP), which extends classical planning with non-negative real variables that can be increased or decreased by some arbitrary amount. The refinement of correct solutions of sound abstractions are solutions with correctness guarantees for GP problems. More recent literature proposed a uniform abstraction framework for GP and gave model-theoretic definitions of sound and complete abstractions for GP problems. In this paper, based on the previous work, we explore automatic verification of sound abstractions for GP. Firstly, we present a proof-theoretic characterization for sound abstractions. Secondly, based on the characterization, we give a sufficient condition for sound abstractions with deterministic actions. Then we study how to verify the sufficient condition when the abstraction models are bounded QNPs where integer variables can be incremented or decremented by one. To this end, we develop methods to handle counting and transitive closure, which are often used to define numerical variables. Finally, we implement a sound bounded QNP abstraction verification system and report experimental results on several domains.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127132024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Motion Planning Under Uncertainty with Complex Agents and Environments via Hybrid Search (Extended Abstract) 基于混合搜索的复杂agent和环境下不确定运动规划(扩展摘要)

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/792

Daniel Strawser, B. Williams

As autonomous systems tackle more real-world situations, mission success oftentimes cannot be guaranteed and the planner must reason about the probability of failure. Unfortunately, computing a trajectory that satisfies mission goals while constraining the probability of failure is difficult because of the need to reason about complex, multidimensional probability distributions. Recent methods have seen success using chance-constrained, model-based planning. We argue there are two main drawbacks to these approaches. First, current methods suffer from an inability to deal with expressive environment models such as 3D non-convex obstacles. Second, most planners rely on considerable simplifications when computing trajectory risk including approximating the agent's dynamics, geometry, and uncertainty. We apply hybrid search to the risk-bound, goal-directed planning problem. The hybrid search consists of a region planner and a trajectory planner. The region planner makes discrete choices by reasoning about geometric regions that the agent should visit in order to accomplish its mission. In formulating the region planner, we propose landmark regions that help produce obstacle-free paths. The region planner passes paths through the environment to a trajectory planner; the task of the trajectory planner is to optimize trajectories that respect the agent's dynamics and the user's desired risk of mission failure. We discuss three approaches to modeling trajectory risk: a CDF-based approach, a sampling-based collocation method, and an algorithm named Shooting Method Monte Carlo. A variety of 2D and 3D test cases are presented in the full paper including a linear case, a Dubins car model, and an underwater autonomous vehicle. The method is shown to outperform other methods in terms of speed and utility of the solution. Additionally, the models of trajectory risk are shown to better approximate risk in simulation.

随着自主系统处理更多的现实情况，任务的成功往往不能保证，计划者必须考虑失败的可能性。不幸的是，由于需要对复杂的多维概率分布进行推理，在限制失败概率的同时计算满足任务目标的轨迹是困难的。最近的方法已经成功地使用了机会约束、基于模型的计划。我们认为这些方法有两个主要缺点。首先，当前的方法无法处理具有表现力的环境模型，如3D非凸障碍物。其次，大多数计划者在计算轨迹风险时依赖于相当大的简化，包括逼近agent的动力学、几何和不确定性。我们将混合搜索应用于风险约束、目标导向的规划问题。混合搜索由区域规划器和轨迹规划器组成。区域规划器通过推理智能体为完成任务需要访问的几何区域来做出离散选择。在制定区域规划时，我们提出了有助于产生无障碍路径的地标区域。区域规划器通过环境将路径传递给轨迹规划器;轨迹规划器的任务是在尊重智能体动力学和用户期望的任务失败风险的情况下优化轨迹。我们讨论了三种建模轨迹风险的方法:基于cdf的方法、基于采样的配置方法和射击法蒙特卡罗算法。全文中介绍了各种2D和3D测试案例，包括线性案例，杜宾汽车模型和水下自主车辆。该方法在解决方案的速度和效用方面优于其他方法。仿真结果表明，轨迹风险模型能较好地逼近风险。

{"title":"Motion Planning Under Uncertainty with Complex Agents and Environments via Hybrid Search (Extended Abstract)","authors":"Daniel Strawser, B. Williams","doi":"10.24963/ijcai.2023/792","DOIUrl":"https://doi.org/10.24963/ijcai.2023/792","url":null,"abstract":"As autonomous systems tackle more real-world situations, mission success oftentimes cannot be guaranteed and the planner must reason about the probability of failure. Unfortunately, computing a trajectory that satisfies mission goals while constraining the probability of failure is difficult because of the need to reason about complex, multidimensional probability distributions. Recent methods have seen success using chance-constrained, model-based planning. We argue there are two main drawbacks to these approaches. First, current methods suffer from an inability to deal with expressive environment models such as 3D non-convex obstacles. Second, most planners rely on considerable simplifications when computing trajectory risk including approximating the agent's dynamics, geometry, and uncertainty. We apply hybrid search to the risk-bound, goal-directed planning problem. The hybrid search consists of a region planner and a trajectory planner. The region planner makes discrete choices by reasoning about geometric regions that the agent should visit in order to accomplish its mission. In formulating the region planner, we propose landmark regions that help produce obstacle-free paths. The region planner passes paths through the environment to a trajectory planner; the task of the trajectory planner is to optimize trajectories that respect the agent's dynamics and the user's desired risk of mission failure. We discuss three approaches to modeling trajectory risk: a CDF-based approach, a sampling-based collocation method, and an algorithm named Shooting Method Monte Carlo. A variety of 2D and 3D test cases are presented in the full paper including a linear case, a Dubins car model, and an underwater autonomous vehicle. The method is shown to outperform other methods in terms of speed and utility of the solution. Additionally, the models of trajectory risk are shown to better approximate risk in simulation.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127351209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Automated Content Moderation Using Transparent Solutions and Linguistic Expertise 使用透明解决方案和语言专业知识的自动内容审核

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/823

Veronika Solopova

Since the dawn of Transformer-based models, the trade-off between transparency and accuracy has been a topical issue in the NLP community. Working towards ethical and transparent automated content moderation (ACM), my goal is to find where it is still relevant to implement linguistic expertise. I show that transparent statistical models based on linguistic knowledge can still be competitive, while linguistic features have many other useful applications.

自从基于变形金刚的模型出现以来，透明度和准确性之间的权衡一直是NLP社区的一个热门问题。致力于道德和透明的自动内容审核(ACM)，我的目标是找到仍然与实施语言专业知识相关的地方。我表明，基于语言知识的透明统计模型仍然具有竞争力，而语言特征还有许多其他有用的应用。

引用次数: 0

On Building a Semi-Automated Framework for Generating Causal Bayesian Networks from Raw Text 基于原始文本生成因果贝叶斯网络的半自动化框架的构建

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/822

Solat J. Sheikh

The availability of a large amount of unstructured text has generated interest in utilizing it for future decision-making and developing strategies in various critical domains. Despite some progress, automatically generating accurate reasoning models from the raw text is still an active area of research. Furthermore, most proposed approaches focus on a specific do-main. As such, their suggested transformation methods are usually unreliable when applied to other domains. This research aims to develop a framework, SCANER (Semi-automated CAusal Network Extraction from Raw text), to convert raw text into Causal Bayesian Networks (CBNs). The framework will then be employed in various domains to demonstrate its utilization as a decision-support tool. The preliminary experiments have focused on three domains: political narratives, food insecurity, and medical sciences. The future focus is on developing BNs from political narratives and modifying them through various methods to reduce the level of aggressiveness or extremity in the narratives without causing conflict among the masses or countries.

大量非结构化文本的可用性引起了人们对利用它在各个关键领域进行未来决策和制定战略的兴趣。尽管取得了一些进展，但从原始文本中自动生成准确的推理模型仍然是一个活跃的研究领域。此外，大多数建议的方法都集中在特定的do-main上。因此，他们建议的转换方法在应用于其他领域时通常是不可靠的。本研究旨在开发一个框架，SCANER(原始文本的半自动因果网络提取)，将原始文本转换为因果贝叶斯网络(cbn)。然后将在各个领域中使用该框架，以演示其作为决策支持工具的使用情况。初步实验集中在三个领域:政治叙事、粮食不安全和医学科学。未来的重点是从政治叙事中发展bn，并通过各种方法对其进行修改，以减少叙事中的侵略性或极端程度，而不会引起群众或国家之间的冲突。

引用次数: 0

Relation-enhanced DETR for Component Detection in Graphic Design Reverse Engineering 图形设计逆向工程中构件检测的关系增强DETR

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/532

Xixuan Hao, Danqing Huang, Jieru Lin, Chin-Yew Lin

It is a common practice for designers to create digital prototypes from a mock-up/screenshot. Reverse engineering graphic design by detecting its components (e.g., text, icon, button) helps expedite this process. This paper first conducts a statistical analysis to emphasize the importance of relations in graphic layouts, which further motivates us to incorporate relation modeling into component detection. Built on the current state-of-the-art DETR (DEtection TRansformer), we introduce a learnable relation matrix to model class correlations. Specifically, the matrix will be added in the DETR decoder to update the query-to-query self-attention. Experiment results on three public datasets show that our approach achieves better performance than several strong baselines. We further visualize the learnt relation matrix and observe some reasonable patterns. Moreover, we show an application of component detection where we leverage the detection outputs as augmented training data for layout generation, which achieves promising results.

设计师从模型/截图中创建数字原型是一种常见的做法。逆向工程图形设计通过检测其组件(例如，文本，图标，按钮)有助于加快这一过程。本文首先通过统计分析来强调关系在图形布局中的重要性，这进一步促使我们将关系建模纳入到组件检测中。在当前最先进的DETR(检测变压器)的基础上，我们引入了一个可学习的关系矩阵来建模类相关性。具体来说，矩阵将被添加到DETR解码器中，以更新查询到查询的自关注。在三个公共数据集上的实验结果表明，我们的方法比几个强基线取得了更好的性能。我们进一步将学习到的关系矩阵可视化，并观察到一些合理的模式。此外，我们展示了一个组件检测的应用，我们利用检测输出作为增强训练数据来生成布局，这取得了很好的结果。

引用次数: 1

Learning Preference Models with Sparse Interactions of Criteria 具有稀疏交互条件的学习偏好模型

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/421

Margot Herin, P. Perny, Nataliya Sokolovska

Multicriteria decision making requires defining the result of conflicting and possibly interacting criteria. Allowing criteria interactions in a decision model increases the complexity of the preference learning task due to the combinatorial nature of the possible interactions. In this paper, we propose an approach to learn a decision model in which the interaction pattern is revealed from preference data and kept as simple as possible. We consider weighted aggregation functions like multilinear utilities or Choquet integrals, admitting representations including non-linear terms measuring the joint benefit or penalty attached to some combinations of criteria. The weighting coefficients known as Möbius masses model positive or negative synergies among criteria. We propose an approach to learn the Möbius masses, based on iterative reweighted least square for sparse recovery, and dualization to improve scalability. This approach is applied to learn sparse representations of the multilinear utility model and conjunctive/disjunctive forms of the discrete Choquet integral from preferences examples, in aggregation problems possibly involving more than 20 criteria.

多标准决策需要定义相互冲突和可能相互作用的标准的结果。由于可能的交互具有组合性，在决策模型中允许标准交互增加了偏好学习任务的复杂性。在本文中，我们提出了一种学习决策模型的方法，该方法从偏好数据中揭示交互模式并尽可能保持简单。我们考虑像多线性效用或Choquet积分这样的加权聚合函数，承认包含非线性项的表示，测量附加到某些标准组合的共同利益或惩罚。称为Möbius质量的加权系数模拟了标准之间的积极或消极协同作用。我们提出了一种学习Möbius质量的方法，该方法基于迭代加权最小二乘进行稀疏恢复，并基于二元化来提高可扩展性。在可能涉及超过20个标准的聚合问题中，该方法被应用于从偏好示例中学习多线性实用新型的稀疏表示和离散Choquet积分的合取/析取形式。

{"title":"Learning Preference Models with Sparse Interactions of Criteria","authors":"Margot Herin, P. Perny, Nataliya Sokolovska","doi":"10.24963/ijcai.2023/421","DOIUrl":"https://doi.org/10.24963/ijcai.2023/421","url":null,"abstract":"Multicriteria decision making requires defining the result of conflicting and possibly interacting criteria. Allowing criteria interactions in a decision model increases the complexity of the preference learning task due to the combinatorial nature of the possible interactions. In this paper, we propose an approach to learn a decision model in which the interaction pattern is revealed from preference data and kept as simple as possible. We consider weighted aggregation functions like multilinear utilities or Choquet integrals, admitting representations including non-linear terms measuring the joint benefit or penalty attached to some combinations of criteria. The weighting coefficients known as Möbius masses model positive or negative synergies among criteria. We propose an approach to learn the Möbius masses, based on iterative reweighted least square for sparse recovery, and dualization to improve scalability. This approach is applied to learn sparse representations of the multilinear utility model and conjunctive/disjunctive forms of the discrete Choquet integral from preferences examples, in aggregation problems possibly involving more than 20 criteria.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130264657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Communication-Efficient Stochastic Gradient Descent Ascent with Momentum Algorithms 基于动量算法的高效通信随机梯度下降上升

International Joint Conference on Artificial Intelligence

Pub Date : 2023-08-01 DOI: 10.24963/ijcai.2023/512

Yihan Zhang, M. Qiu, Hongchang Gao

Numerous machine learning models can be formulated as a stochastic minimax optimization problem, such as imbalanced data classification with AUC maximization.Developing efficient algorithms to optimize such kinds of problems is of importance and necessity. However, most existing algorithms restrict their focus on the single-machine setting so that they are incapable of dealing with the large communication overhead in a distributed training system. Moreover, most existing communication-efficient optimization algorithms only focus on the traditional minimization problem, failing to handle the minimax optimization problem. To address these challenging issues, in this paper, we develop two novel communication-efficient stochastic gradient descent ascent with momentum algorithms for the distributed minimax optimization problem, which can significantly reduce the communication cost via the two-way compression scheme. However, the compressed momentum makes it considerably challenging to investigate the convergence rate of our algorithms, especially in the presence of the interaction between the minimization and maximization subproblems. In this paper, we successfully addressed these challenges and established the convergence rate of our algorithms for nonconvex-strongly-concave problems. To the best of our knowledge, our algorithms are the first communication-efficient algorithm with theoretical guarantees for the minimax optimization problem. Finally, we apply our algorithm to the distributed AUC maximization problem for the imbalanced data classification task. Extensive experimental results confirm the efficacy of our algorithm in saving communication costs.

许多机器学习模型可以被表述为一个随机极大极小优化问题，例如AUC最大化的不平衡数据分类。开发有效的算法来优化这类问题是非常重要和必要的。然而，现有的大多数算法都局限于单机设置，无法处理分布式训练系统中庞大的通信开销。此外，现有的通信效率优化算法大多只关注传统的最小化问题，未能处理极大极小优化问题。为了解决这些具有挑战性的问题，本文针对分布式极大极小优化问题开发了两种具有通信效率的随机梯度下降上升动量算法，通过双向压缩方案可以显著降低通信成本。然而，压缩的动量使得研究我们的算法的收敛速度变得相当具有挑战性，特别是在最小化和最大化子问题之间存在相互作用的情况下。在本文中，我们成功地解决了这些挑战，并建立了我们的算法对非凸强凹问题的收敛速度。据我们所知，我们的算法是第一个对极大极小优化问题具有理论保证的通信高效算法。最后，将该算法应用于非平衡数据分类任务的分布式AUC最大化问题。大量的实验结果证实了该算法在节省通信成本方面的有效性。

{"title":"Communication-Efficient Stochastic Gradient Descent Ascent with Momentum Algorithms","authors":"Yihan Zhang, M. Qiu, Hongchang Gao","doi":"10.24963/ijcai.2023/512","DOIUrl":"https://doi.org/10.24963/ijcai.2023/512","url":null,"abstract":"Numerous machine learning models can be formulated as a stochastic minimax optimization problem, such as imbalanced data classification with AUC maximization.\u0000\u0000Developing efficient algorithms to optimize such kinds of problems is of importance and necessity. However, most existing algorithms restrict their focus on the single-machine setting so that they are incapable of dealing with the large communication overhead in a distributed training system. Moreover, most existing communication-efficient optimization algorithms only focus on the traditional minimization problem, failing to handle the minimax optimization problem. To address these challenging issues, in this paper, we develop two novel communication-efficient stochastic gradient descent ascent with momentum algorithms for the distributed minimax optimization problem, which can significantly reduce the communication cost via the two-way compression scheme. However, the compressed momentum makes it considerably challenging to investigate the convergence rate of our algorithms, especially in the presence of the interaction between the minimization and maximization subproblems. In this paper, we successfully addressed these challenges and established the convergence rate of our algorithms for nonconvex-strongly-concave problems. To the best of our knowledge, our algorithms are the first communication-efficient algorithm with theoretical guarantees for the minimax optimization problem. Finally, we apply our algorithm to the distributed AUC maximization problem for the imbalanced data classification task. Extensive experimental results confirm the efficacy of our algorithm in saving communication costs.","PeriodicalId":394530,"journal":{"name":"International Joint Conference on Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130279514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Joint Conference on Artificial Intelligence

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀