Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)最新文献

英文中文

Learning Optimal Transport Between two Empirical Distributions with Normalizing Flows 用正规化流学习两个经验分布之间的最优传输

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-07-04 DOI: 10.48550/arXiv.2207.01246

Florentin Coeurdoux, N. Dobigeon, P. Chainais

Optimal transport (OT) provides effective tools for comparing and mapping probability measures. We propose to leverage the flexibility of neural networks to learn an approximate optimal transport map. More precisely, we present a new and original method to address the problem of transporting a finite set of samples associated with a first underlying unknown distribution towards another finite set of samples drawn from another unknown distribution. We show that a particular instance of invertible neural networks, namely the normalizing flows, can be used to approximate the solution of this OT problem between a pair of empirical distributions. To this aim, we propose to relax the Monge formulation of OT by replacing the equality constraint on the push-forward measure by the minimization of the corresponding Wasserstein distance. The push-forward operator to be retrieved is then restricted to be a normalizing flow which is trained by optimizing the resulting cost function. This approach allows the transport map to be discretized as a composition of functions. Each of these functions is associated to one sub-flow of the network, whose output provides intermediate steps of the transport between the original and target measures. This discretization yields also a set of intermediate barycenters between the two measures of interest. Experiments conducted on toy examples as well as a challenging task of unsupervised translation demonstrate the interest of the proposed method. Finally, some experiments show that the proposed approach leads to a good approximation of the true OT.

最优运输(OT)为比较和映射概率测度提供了有效的工具。我们建议利用神经网络的灵活性来学习一个近似的最优运输图。更准确地说，我们提出了一种新的和原始的方法来解决将与第一个潜在未知分布相关的有限样本集传输到从另一个未知分布提取的另一个有限样本集的问题。我们证明了可逆神经网络的一个特殊实例，即归一化流，可以用来近似这对经验分布之间的OT问题的解。为此，我们提出用相应的Wasserstein距离的最小化来代替推进测度的等式约束，从而放宽OT的Monge公式。然后，要检索的前推算子被限制为通过优化结果代价函数来训练的规范化流。这种方法允许将传输映射离散为函数的组合。这些功能中的每一个都与网络的一个子流相关联，其子流的输出提供了原始度量和目标度量之间传输的中间步骤。这种离散化也产生了两个感兴趣的度量之间的一组中间重心。在玩具示例和具有挑战性的无监督翻译任务上进行的实验证明了所提出方法的有效性。最后，一些实验表明，所提出的方法可以很好地逼近真实的OT。

{"title":"Learning Optimal Transport Between two Empirical Distributions with Normalizing Flows","authors":"Florentin Coeurdoux, N. Dobigeon, P. Chainais","doi":"10.48550/arXiv.2207.01246","DOIUrl":"https://doi.org/10.48550/arXiv.2207.01246","url":null,"abstract":"Optimal transport (OT) provides effective tools for comparing and mapping probability measures. We propose to leverage the flexibility of neural networks to learn an approximate optimal transport map. More precisely, we present a new and original method to address the problem of transporting a finite set of samples associated with a first underlying unknown distribution towards another finite set of samples drawn from another unknown distribution. We show that a particular instance of invertible neural networks, namely the normalizing flows, can be used to approximate the solution of this OT problem between a pair of empirical distributions. To this aim, we propose to relax the Monge formulation of OT by replacing the equality constraint on the push-forward measure by the minimization of the corresponding Wasserstein distance. The push-forward operator to be retrieved is then restricted to be a normalizing flow which is trained by optimizing the resulting cost function. This approach allows the transport map to be discretized as a composition of functions. Each of these functions is associated to one sub-flow of the network, whose output provides intermediate steps of the transport between the original and target measures. This discretization yields also a set of intermediate barycenters between the two measures of interest. Experiments conducted on toy examples as well as a challenging task of unsupervised translation demonstrate the interest of the proposed method. Finally, some experiments show that the proposed approach leads to a good approximation of the true OT.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73301759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Detection of ADHD based on Eye Movements during Natural Viewing 基于自然观看时眼球运动的ADHD检测

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-07-04 DOI: 10.48550/arXiv.2207.01377

Shuwen Deng, Paul Prasse, D. R. Reich, S. Dziemian, Maja Stegenwallner-Schütz, Daniel G. Krakowczyk, Silvia Makowski, N. Langer, T. Scheffer, L. Jäger

Attention-deficit/hyperactivity disorder (ADHD) is a neurodevelopmental disorder that is highly prevalent and requires clinical specialists to diagnose. It is known that an individual's viewing behavior, reflected in their eye movements, is directly related to attentional mechanisms and higher-order cognitive processes. We therefore explore whether ADHD can be detected based on recorded eye movements together with information about the video stimulus in a free-viewing task. To this end, we develop an end-to-end deep learning-based sequence model which we pre-train on a related task for which more data are available. We find that the method is in fact able to detect ADHD and outperforms relevant baselines. We investigate the relevance of the input features in an ablation study. Interestingly, we find that the model's performance is closely related to the content of the video, which provides insights for future experimental designs.

注意缺陷/多动障碍(ADHD)是一种非常普遍的神经发育障碍，需要临床专家来诊断。众所周知，一个人的观看行为反映在他们的眼球运动中，与注意力机制和高阶认知过程直接相关。因此，我们探索是否可以根据记录的眼球运动以及自由观看任务中的视频刺激信息来检测ADHD。为此，我们开发了一个基于端到端深度学习的序列模型，我们在一个相关的任务上进行预训练，其中有更多的数据可用。我们发现，该方法实际上能够检测ADHD，并且优于相关基线。我们研究了消融研究中输入特征的相关性。有趣的是，我们发现模型的性能与视频的内容密切相关，这为未来的实验设计提供了见解。

引用次数: 6

Transforming PageRank into an Infinite-Depth Graph Neural Network 将PageRank转化为无限深度图神经网络

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-07-01 DOI: 10.48550/arXiv.2207.00684

Andreas Roth, T. Liebig

Popular graph neural networks are shallow models, despite the success of very deep architectures in other application domains of deep learning. This reduces the modeling capacity and leaves models unable to capture long-range relationships. The primary reason for the shallow design results from over-smoothing, which leads node states to become more similar with increased depth. We build on the close connection between GNNs and PageRank, for which personalized PageRank introduces the consideration of a personalization vector. Adopting this idea, we propose the Personalized PageRank Graph Neural Network (PPRGNN), which extends the graph convolutional network to an infinite-depth model that has a chance to reset the neighbor aggregation back to the initial state in each iteration. We introduce a nicely interpretable tweak to the chance of resetting and prove the convergence of our approach to a unique solution without placing any constraints, even when taking infinitely many neighbor aggregations. As in personalized PageRank, our result does not suffer from over-smoothing. While doing so, time complexity remains linear while we keep memory complexity constant, independently of the depth of the network, making it scale well to large graphs. We empirically show the effectiveness of our approach for various node and graph classification tasks. PPRGNN outperforms comparable methods in almost all cases.

流行的图神经网络是浅模型，尽管在深度学习的其他应用领域中非常深的架构取得了成功。这降低了建模能力，使模型无法捕获长期关系。浅设计的主要原因是过度平滑，这导致节点状态随着深度的增加而变得更加相似。我们建立在gnn和PageRank之间的密切联系上，其中个性化PageRank引入了个性化向量的考虑。采用这一思想，我们提出了个性化PageRank图神经网络(PPRGNN)，它将图卷积网络扩展为无限深度模型，该模型在每次迭代中都有机会将邻居聚合重置回初始状态。我们引入了一个很好的可解释的调整，以重置的机会，并证明我们的方法收敛到一个唯一的解决方案，而不放置任何约束，即使在取无限多个邻居聚合时也是如此。在个性化PageRank中，我们的结果不会受到过度平滑的影响。在这样做的同时，时间复杂度保持线性，而我们保持内存复杂度不变，独立于网络的深度，使其能够很好地扩展到大型图。我们通过经验证明了我们的方法对各种节点和图分类任务的有效性。PPRGNN在几乎所有情况下都优于同类方法。

{"title":"Transforming PageRank into an Infinite-Depth Graph Neural Network","authors":"Andreas Roth, T. Liebig","doi":"10.48550/arXiv.2207.00684","DOIUrl":"https://doi.org/10.48550/arXiv.2207.00684","url":null,"abstract":"Popular graph neural networks are shallow models, despite the success of very deep architectures in other application domains of deep learning. This reduces the modeling capacity and leaves models unable to capture long-range relationships. The primary reason for the shallow design results from over-smoothing, which leads node states to become more similar with increased depth. We build on the close connection between GNNs and PageRank, for which personalized PageRank introduces the consideration of a personalization vector. Adopting this idea, we propose the Personalized PageRank Graph Neural Network (PPRGNN), which extends the graph convolutional network to an infinite-depth model that has a chance to reset the neighbor aggregation back to the initial state in each iteration. We introduce a nicely interpretable tweak to the chance of resetting and prove the convergence of our approach to a unique solution without placing any constraints, even when taking infinitely many neighbor aggregations. As in personalized PageRank, our result does not suffer from over-smoothing. While doing so, time complexity remains linear while we keep memory complexity constant, independently of the depth of the network, making it scale well to large graphs. We empirically show the effectiveness of our approach for various node and graph classification tasks. PPRGNN outperforms comparable methods in almost all cases.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89750075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Is this bug severe? A text-cum-graph based model for bug severity prediction 这种病严重吗?基于文本和图形的bug严重性预测模型

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-07-01 DOI: 10.48550/arXiv.2207.00623

Rima Hazra, Arpit Dwivedi, Animesh Mukherjee

Repositories of large software systems have become commonplace. This massive expansion has resulted in the emergence of various problems in these software platforms including identification of (i) bug-prone packages, (ii) critical bugs, and (iii) severity of bugs. One of the important goals would be to mine these bugs and recommend them to the developers to resolve them. The first step to this is that one has to accurately detect the extent of severity of the bugs. In this paper, we take up this task of predicting the severity of bugs in the near future. Contextualized neural models built on the text description of a bug and the user comments about the bug help to achieve reasonably good performance. Further information on how the bugs are related to each other in terms of the ways they affect packages can be summarised in the form of a graph and used along with the text to get additional benefits.

大型软件系统的存储库已经变得司空见惯。这种大规模的扩展导致了这些软件平台中出现了各种问题，包括(i)易出错包的识别，(ii)关键bug，以及(iii) bug的严重性。其中一个重要的目标是挖掘这些错误，并将它们推荐给开发人员来解决它们。要做到这一点，第一步是必须准确地检测漏洞的严重程度。在本文中，我们承担了在不久的将来预测bug严重性的任务。基于错误的文本描述和用户对错误的评论建立的上下文化神经模型有助于实现相当好的性能。关于这些bug是如何影响包的，它们之间是如何相互关联的进一步信息可以用图表的形式总结出来，并与文本一起使用，以获得额外的好处。

引用次数: 2

Open Data Science to Fight COVID-19: Winning the 500k XPRIZE Pandemic Response Challenge 开放数据科学对抗COVID-19:赢得50万XPRIZE大流行应对挑战

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-07-01 DOI: 10.1007/978-3-030-86514-6_24

M. Lozano, Òscar Garibo i Orts, Eloy Piñol, M. Rebollo, K. Polotskaya, M. Garcia-March, J. Conejero, F. Escolano, N. Oliver

引用次数: 16

MEAD: A Multi-Armed Approach for Evaluation of Adversarial Examples Detectors MEAD:一种评估对抗样本检测器的多臂方法

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-06-30 DOI: 10.48550/arXiv.2206.15415

Federica Granese, Marine Picot, Marco Romanelli, Francisco Messina, P. Piantanida

Detection of adversarial examples has been a hot topic in the last years due to its importance for safely deploying machine learning algorithms in critical applications. However, the detection methods are generally validated by assuming a single implicitly known attack strategy, which does not necessarily account for real-life threats. Indeed, this can lead to an overoptimistic assessment of the detectors' performance and may induce some bias in the comparison between competing detection schemes. We propose a novel multi-armed framework, called MEAD, for evaluating detectors based on several attack strategies to overcome this limitation. Among them, we make use of three new objectives to generate attacks. The proposed performance metric is based on the worst-case scenario: detection is successful if and only if all different attacks are correctly recognized. Empirically, we show the effectiveness of our approach. Moreover, the poor performance obtained for state-of-the-art detectors opens a new exciting line of research.

由于对抗性示例的检测对于在关键应用中安全部署机器学习算法的重要性，因此在过去几年中一直是一个热门话题。然而，检测方法通常是通过假设一个隐式已知的攻击策略来验证的，这并不一定能解释现实生活中的威胁。事实上，这可能导致对检测器性能的过于乐观的评估，并可能在相互竞争的检测方案之间的比较中引起一些偏差。我们提出了一种新的多臂框架，称为MEAD，用于评估基于几种攻击策略的检测器，以克服这一限制。其中，我们利用了三个新的目标来产生攻击。建议的性能指标基于最坏情况:当且仅当所有不同的攻击都被正确识别时，检测才成功。经验表明，我们的方法是有效的。此外，最先进的探测器所获得的较差性能开辟了一个令人兴奋的新研究方向。

{"title":"MEAD: A Multi-Armed Approach for Evaluation of Adversarial Examples Detectors","authors":"Federica Granese, Marine Picot, Marco Romanelli, Francisco Messina, P. Piantanida","doi":"10.48550/arXiv.2206.15415","DOIUrl":"https://doi.org/10.48550/arXiv.2206.15415","url":null,"abstract":"Detection of adversarial examples has been a hot topic in the last years due to its importance for safely deploying machine learning algorithms in critical applications. However, the detection methods are generally validated by assuming a single implicitly known attack strategy, which does not necessarily account for real-life threats. Indeed, this can lead to an overoptimistic assessment of the detectors' performance and may induce some bias in the comparison between competing detection schemes. We propose a novel multi-armed framework, called MEAD, for evaluating detectors based on several attack strategies to overcome this limitation. Among them, we make use of three new objectives to generate attacks. The proposed performance metric is based on the worst-case scenario: detection is successful if and only if all different attacks are correctly recognized. Empirically, we show the effectiveness of our approach. Moreover, the poor performance obtained for state-of-the-art detectors opens a new exciting line of research.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78695879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Customized Conversational Recommender Systems 定制会话推荐系统

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-06-30 DOI: 10.48550/arXiv.2207.00814

Shuokai Li, Yongchun Zhu, Ruobing Xie, Zhenwei Tang, Zhao Zhang, Fuzhen Zhuang, Qing He, Hui Xiong

Conversational recommender systems (CRS) aim to capture user's current intentions and provide recommendations through real-time multi-turn conversational interactions. As a human-machine interactive system, it is essential for CRS to improve the user experience. However, most CRS methods neglect the importance of user experience. In this paper, we propose two key points for CRS to improve the user experience: (1) Speaking like a human, human can speak with different styles according to the current dialogue context. (2) Identifying fine-grained intentions, even for the same utterance, different users have diverse finegrained intentions, which are related to users' inherent preference. Based on the observations, we propose a novel CRS model, coined Customized Conversational Recommender System (CCRS), which customizes CRS model for users from three perspectives. For human-like dialogue services, we propose multi-style dialogue response generator which selects context-aware speaking style for utterance generation. To provide personalized recommendations, we extract user's current fine-grained intentions from dialogue context with the guidance of user's inherent preferences. Finally, to customize the model parameters for each user, we train the model from the meta-learning perspective. Extensive experiments and a series of analyses have shown the superiority of our CCRS on both the recommendation and dialogue services.

会话推荐系统(CRS)旨在捕捉用户当前的意图，并通过实时的多回合会话交互提供推荐。CRS作为一个人机交互系统，提高用户体验是必不可少的。然而，大多数CRS方法忽视了用户体验的重要性。本文提出了CRS改善用户体验的两个关键点:(1)像人一样说话，人可以根据当前的对话上下文以不同的风格说话。(2)识别细粒度意图，即使是同一话语，不同的用户也有不同的细粒度意图，这与用户的内在偏好有关。在此基础上，我们提出了一种新的CRS模型——定制会话推荐系统(Customized Conversational Recommender System, CCRS)，该模型从三个角度为用户定制CRS模型。对于类人对话服务，我们提出了多风格对话响应生成器，它选择上下文感知的说话风格来生成话语。为了提供个性化的推荐，我们在用户固有偏好的指导下，从对话上下文中提取用户当前的细粒度意图。最后，为了定制每个用户的模型参数，我们从元学习的角度训练模型。大量的实验和一系列的分析表明，我们的CCRS在推荐和对话服务方面都具有优势。

{"title":"Customized Conversational Recommender Systems","authors":"Shuokai Li, Yongchun Zhu, Ruobing Xie, Zhenwei Tang, Zhao Zhang, Fuzhen Zhuang, Qing He, Hui Xiong","doi":"10.48550/arXiv.2207.00814","DOIUrl":"https://doi.org/10.48550/arXiv.2207.00814","url":null,"abstract":"Conversational recommender systems (CRS) aim to capture user's current intentions and provide recommendations through real-time multi-turn conversational interactions. As a human-machine interactive system, it is essential for CRS to improve the user experience. However, most CRS methods neglect the importance of user experience. In this paper, we propose two key points for CRS to improve the user experience: (1) Speaking like a human, human can speak with different styles according to the current dialogue context. (2) Identifying fine-grained intentions, even for the same utterance, different users have diverse finegrained intentions, which are related to users' inherent preference. Based on the observations, we propose a novel CRS model, coined Customized Conversational Recommender System (CCRS), which customizes CRS model for users from three perspectives. For human-like dialogue services, we propose multi-style dialogue response generator which selects context-aware speaking style for utterance generation. To provide personalized recommendations, we extract user's current fine-grained intentions from dialogue context with the guidance of user's inherent preferences. Finally, to customize the model parameters for each user, we train the model from the meta-learning perspective. Extensive experiments and a series of analyses have shown the superiority of our CCRS on both the recommendation and dialogue services.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80832037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Placing (Historical) Facts on a Timeline: A Classification cum Coref Resolution Approach 在时间轴上放置(历史)事实:分类和核心解决方法

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-06-28 DOI: 10.48550/arXiv.2206.14089

Sayantan Adak, Altaf Ahmad, Aditya Basu, Animesh Mukherjee

A timeline provides one of the most effective ways to visualize the important historical facts that occurred over a period of time, presenting the insights that may not be so apparent from reading the equivalent information in textual form. By leveraging generative adversarial learning for important sentence classification and by assimilating knowledge based tags for improving the performance of event coreference resolution we introduce a two staged system for event timeline generation from multiple (historical) text documents. We demonstrate our results on two manually annotated historical text documents. Our results can be extremely helpful for historians, in advancing research in history and in understanding the socio-political landscape of a country as reflected in the writings of famous personas.

时间轴提供了一种最有效的方式，可以将一段时间内发生的重要历史事实形象化，呈现出阅读文本形式的同等信息时可能不那么明显的见解。通过利用生成式对抗学习进行重要的句子分类，并通过吸收基于知识的标签来提高事件共参考分辨率的性能，我们引入了一个两阶段的系统，用于从多个(历史)文本文档生成事件时间轴。我们在两个手工注释的历史文本文档上演示了我们的结果。我们的研究结果对历史学家非常有帮助，有助于推进历史研究，有助于理解一个国家的社会政治景观，这反映在著名人物的著作中。

引用次数: 0

DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems DistSPECTRL:多智能体强化学习系统中的分布式规范

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-06-28 DOI: 10.48550/arXiv.2206.13754

Joe Eappen, S. Jagannathan

While notable progress has been made in specifying and learning objectives for general cyber-physical systems, applying these methods to distributed multi-agent systems still pose significant challenges. Among these are the need to (a) craft specification primitives that allow expression and interplay of both local and global objectives, (b) tame explosion in the state and action spaces to enable effective learning, and (c) minimize coordination frequency and the set of engaged participants for global objectives. To address these challenges, we propose a novel specification framework that allows natural composition of local and global objectives used to guide training of a multi-agent system. Our technique enables learning expressive policies that allow agents to operate in a coordination-free manner for local objectives, while using a decentralized communication protocol for enforcing global ones. Experimental results support our claim that sophisticated multi-agent distributed planning problems can be effectively realized using specification-guided learning.

虽然在一般网络物理系统的指定和学习目标方面取得了显著进展，但将这些方法应用于分布式多智能体系统仍然存在重大挑战。其中需要(a)制作允许本地和全局目标的表达和相互作用的规范原语，(b)驯服状态和行动空间中的爆炸，以实现有效的学习，以及(c)最小化协调频率和参与全球目标的参与者集。为了应对这些挑战，我们提出了一个新的规范框架，该框架允许本地和全局目标的自然组合，用于指导多智能体系统的训练。我们的技术允许学习表达策略，这些策略允许代理以不需要协调的方式操作本地目标，同时使用分散的通信协议来执行全局目标。实验结果支持我们的说法，即复杂的多智能体分布式规划问题可以有效地实现使用规范引导学习。

{"title":"DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems","authors":"Joe Eappen, S. Jagannathan","doi":"10.48550/arXiv.2206.13754","DOIUrl":"https://doi.org/10.48550/arXiv.2206.13754","url":null,"abstract":"While notable progress has been made in specifying and learning objectives for general cyber-physical systems, applying these methods to distributed multi-agent systems still pose significant challenges. Among these are the need to (a) craft specification primitives that allow expression and interplay of both local and global objectives, (b) tame explosion in the state and action spaces to enable effective learning, and (c) minimize coordination frequency and the set of engaged participants for global objectives. To address these challenges, we propose a novel specification framework that allows natural composition of local and global objectives used to guide training of a multi-agent system. Our technique enables learning expressive policies that allow agents to operate in a coordination-free manner for local objectives, while using a decentralized communication protocol for enforcing global ones. Experimental results support our claim that sophisticated multi-agent distributed planning problems can be effectively realized using specification-guided learning.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87348667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Learning to Control Local Search for Combinatorial Optimization 学习控制组合优化中的局部搜索

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

Pub Date : 2022-06-27 DOI: 10.48550/arXiv.2206.13181

Jonas K. Falkner, Daniela Thyssens, Ahmad Bdeir, L. Schmidt-Thieme

Combinatorial optimization problems are encountered in many practical contexts such as logistics and production, but exact solutions are particularly difficult to find and usually NP-hard for considerable problem sizes. To compute approximate solutions, a zoo of generic as well as problem-specific variants of local search is commonly used. However, which variant to apply to which particular problem is difficult to decide even for experts. In this paper we identify three independent algorithmic aspects of such local search algorithms and formalize their sequential selection over an optimization process as Markov Decision Process (MDP). We design a deep graph neural network as policy model for this MDP, yielding a learned controller for local search called NeuroLS. Ample experimental evidence shows that NeuroLS is able to outperform both, well-known general purpose local search controllers from Operations Research as well as latest machine learning-based approaches.

组合优化问题在许多实际环境中都会遇到，例如物流和生产，但是对于相当大的问题规模，很难找到精确的解决方案，并且通常是np困难的。为了计算近似解，通常使用大量的通用和特定于问题的局部搜索变体。然而，即使是专家也很难决定将哪种变体应用于哪个特定问题。在本文中，我们确定了这种局部搜索算法的三个独立算法方面，并将它们的顺序选择形式化为马尔可夫决策过程(MDP)。我们设计了一个深度图神经网络作为该MDP的策略模型，产生了一个局部搜索的学习控制器，称为NeuroLS。大量的实验证据表明，NeuroLS能够超越运筹学中众所周知的通用本地搜索控制器以及最新的基于机器学习的方法。

引用次数: 4

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀