Machine Learning最新文献

英文中文

On metafeatures’ ability of implicit concept identification 关于元特征的内隐概念识别能力

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-18 DOI: 10.1007/s10994-024-06612-0

Joanna Komorniczak, Paweł Ksieniewicz

Concept drift in data stream processing remains an intriguing challenge and states a popular research topic. Methods that actively process data streams usually employ drift detectors, whose performance is often based on monitoring the variability of different stream properties. This publication provides an overview and analysis of metafeatures variability describing data streams with concept drifts. Five experiments conducted on synthetic, semi-synthetic, and real-world data streams examine the ability of over 160 metafeatures from 9 categories to recognize concepts in non-stationary data streams. The work reveals the distinctions in the considered sources of streams and specifies 17 metafeatures with a high ability of concept identification.

数据流处理中的概念漂移仍然是一个引人入胜的挑战，也是一个热门的研究课题。主动处理数据流的方法通常采用漂移检测器，其性能通常基于对不同数据流属性变异性的监测。本出版物概述并分析了描述具有概念漂移的数据流的元特征变异性。在合成、半合成和真实世界数据流上进行的五项实验检验了 9 个类别的 160 多个元特征识别非稳态数据流中概念的能力。这项工作揭示了所考虑的数据流来源的区别，并确定了 17 个具有较高概念识别能力的元特征。

引用次数: 0

Towards a foundation large events model for soccer 建立足球大型活动基础模型

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-13 DOI: 10.1007/s10994-024-06606-y

Tiago Mendes-Neves, Luís Meireles, João Mendes-Moreira

This paper introduces the Large Events Model (LEM) for soccer, a novel deep learning framework for generating and analyzing soccer matches. The framework can simulate games from a given game state, with its primary output being the ensuing probabilities and events from multiple simulations. These can provide insights into match dynamics and underlying mechanisms. We discuss the framework’s design, features, and methodologies, including model optimization, data processing, and evaluation techniques. The models within this framework are developed to predict specific aspects of soccer events, such as event type, success likelihood, and further details. In an applied context, we showcase the estimation of xP+, a metric estimating a player’s contribution to the team’s points earned. This work ultimately enhances the field of sports event prediction and practical applications and emphasizes the potential for this kind of method.

本文介绍了足球大事件模型（LEM），这是一种用于生成和分析足球比赛的新型深度学习框架。该框架可以从给定的比赛状态出发模拟比赛，其主要输出是来自多次模拟的随之而来的概率和事件。这些数据可以帮助我们深入了解比赛动态和内在机制。我们将讨论该框架的设计、特点和方法，包括模型优化、数据处理和评估技术。该框架中的模型是为预测足球赛事的特定方面而开发的，如赛事类型、成功可能性和更多细节。在应用方面，我们展示了 xP+ 的估算，这是一个估算球员对球队得分贡献的指标。这项工作最终加强了体育赛事预测领域和实际应用，并强调了这种方法的潜力。

引用次数: 0

Persistent Laplacian-enhanced algorithm for scarcely labeled data classification 用于稀少标记数据分类的持续拉普拉斯增强算法

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-13 DOI: 10.1007/s10994-024-06616-w

Gokul Bhusal, Ekaterina Merkurjev, Guo-Wei Wei

The success of many machine learning (ML) methods depends crucially on having large amounts of labeled data. However, obtaining enough labeled data can be expensive, time-consuming, and subject to ethical constraints for many applications. One approach that has shown tremendous value in addressing this challenge is semi-supervised learning (SSL); this technique utilizes both labeled and unlabeled data during training, often with much less labeled data than unlabeled data, which is often relatively easy and inexpensive to obtain. In fact, SSL methods are particularly useful in applications where the cost of labeling data is especially expensive, such as medical analysis, natural language processing, or speech recognition. A subset of SSL methods that have achieved great success in various domains involves algorithms that integrate graph-based techniques. These procedures are popular due to the vast amount of information provided by the graphical framework. In this work, we propose an algebraic topology-based semi-supervised method called persistent Laplacian-enhanced graph MBO by integrating persistent spectral graph theory with the classical Merriman–Bence–Osher (MBO) scheme. Specifically, we use a filtration procedure to generate a sequence of chain complexes and associated families of simplicial complexes, from which we construct a family of persistent Laplacians. Overall, it is a very efficient procedure that requires much less labeled data to perform well compared to many ML techniques, and it can be adapted for both small and large datasets. We evaluate the performance of our method on classification, and the results indicate that the technique outperforms other existing semi-supervised algorithms.

许多机器学习（ML）方法的成功在很大程度上取决于是否拥有大量的标记数据。然而，对于许多应用来说，获取足够多的标记数据既昂贵又耗时，而且还受到道德约束。半监督学习（SSL）是一种在应对这一挑战方面显示出巨大价值的方法；这种技术在训练过程中同时使用标记数据和非标记数据，但标记数据往往比非标记数据少得多，而非标记数据通常相对容易获得，而且成本低廉。事实上，在医疗分析、自然语言处理或语音识别等标注数据成本特别昂贵的应用中，SSL 方法尤其有用。在各个领域取得巨大成功的 SSL 方法中，有一个子集涉及集成了基于图的技术的算法。由于图形框架提供了大量信息，这些程序很受欢迎。在这项工作中，我们通过将持久谱图理论与经典的梅里曼-本斯-奥舍（MBO）方案相结合，提出了一种基于代数拓扑的半监督方法，称为持久拉普拉斯增强图 MBO。具体来说，我们使用过滤程序生成链复数序列和相关的简复数族，并由此构建持久拉普拉斯族。总体而言，这是一种非常高效的程序，与许多 ML 技术相比，它所需的标记数据要少得多，而且既适用于小型数据集，也适用于大型数据集。我们对该方法的分类性能进行了评估，结果表明该技术优于其他现有的半监督算法。

{"title":"Persistent Laplacian-enhanced algorithm for scarcely labeled data classification","authors":"Gokul Bhusal, Ekaterina Merkurjev, Guo-Wei Wei","doi":"10.1007/s10994-024-06616-w","DOIUrl":"https://doi.org/10.1007/s10994-024-06616-w","url":null,"abstract":"The success of many machine learning (ML) methods depends crucially on having large amounts of labeled data. However, obtaining enough labeled data can be expensive, time-consuming, and subject to ethical constraints for many applications. One approach that has shown tremendous value in addressing this challenge is semi-supervised learning (SSL); this technique utilizes both labeled and unlabeled data during training, often with much less labeled data than unlabeled data, which is often relatively easy and inexpensive to obtain. In fact, SSL methods are particularly useful in applications where the cost of labeling data is especially expensive, such as medical analysis, natural language processing, or speech recognition. A subset of SSL methods that have achieved great success in various domains involves algorithms that integrate graph-based techniques. These procedures are popular due to the vast amount of information provided by the graphical framework. In this work, we propose an algebraic topology-based semi-supervised method called persistent Laplacian-enhanced graph MBO by integrating persistent spectral graph theory with the classical Merriman–Bence–Osher (MBO) scheme. Specifically, we use a filtration procedure to generate a sequence of chain complexes and associated families of simplicial complexes, from which we construct a family of persistent Laplacians. Overall, it is a very efficient procedure that requires much less labeled data to perform well compared to many ML techniques, and it can be adapted for both small and large datasets. We evaluate the performance of our method on classification, and the results indicate that the technique outperforms other existing semi-supervised algorithms.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"176 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Conformal prediction for regression models with asymmetrically distributed errors: application to aircraft navigation during landing maneuver 具有非对称分布误差的回归模型的共形预测：应用于着陆机动过程中的飞机导航

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-09 DOI: 10.1007/s10994-024-06615-x

Solène Vilfroy, Lionel Bombrun, Thierry Urruty, Florence De Grancey, Jean-Philippe Lebrat, Philippe Carré

Semi-autonomous aircraft navigation is a high-risk domain where confidence on the prediction is required. For that, this paper introduces the use of conformal predictions strategies for regression problems. While standard approaches use an absolute nonconformity scores, we aim at introducing a signed version of the nonconformity scores. Experimental results on synthetic data have shown their interest for non-centered errors. Moreover, in order to reduce the width of the prediction interval, we introduce an optimization procedure which learn the optimal alpha risks for the lower and upper bounds of the interval. In practice, we show that a line search algorithm can be employed to solve it. Practically, this novel adaptive conformal prediction strategy has revealed to be well adapted for skew distributed errors. In addition, an extension of these conformal prediction strategies is introduced to incorporate numeric and categorical auxiliary variables describing the acquisition context. Based on a quantile regression model, they allow to maintain the coverage for each metadata value. All these strategies have then been applied on a real use case of runway localization from data acquired by an aircraft during landing maneuver. Extensive experiments on multiple airports have shown the interest of the proposed conformal prediction strategies, in particular for runways equipped with a very long ramp approach where asymmetric angular deviation error are observed.

半自动飞机导航是一个高风险领域，需要对预测结果有信心。为此，本文介绍了使用保形预测策略来解决回归问题。标准方法使用绝对不符性分数，而我们的目标是引入符号版的不符性分数。在合成数据上的实验结果表明，它们对非中心误差很有意义。此外，为了减小预测区间的宽度，我们引入了一个优化程序，该程序可以学习区间上下限的最佳阿尔法风险。在实践中，我们证明可以采用线性搜索算法来解决这个问题。实践表明，这种新颖的自适应保形预测策略非常适合偏斜分布误差。此外，我们还对这些共形预测策略进行了扩展，以纳入描述采集环境的数字和分类辅助变量。基于量化回归模型，它们可以保持每个元数据值的覆盖范围。随后，所有这些策略都被应用到一个实际案例中，即根据飞机着陆时获取的数据进行跑道定位。在多个机场进行的大量实验表明，所提出的保形预测策略很有意义，特别是对于配备有很长坡道的跑道，在这种跑道上可以观察到不对称的角度偏差误差。

{"title":"Conformal prediction for regression models with asymmetrically distributed errors: application to aircraft navigation during landing maneuver","authors":"Solène Vilfroy, Lionel Bombrun, Thierry Urruty, Florence De Grancey, Jean-Philippe Lebrat, Philippe Carré","doi":"10.1007/s10994-024-06615-x","DOIUrl":"https://doi.org/10.1007/s10994-024-06615-x","url":null,"abstract":"Semi-autonomous aircraft navigation is a high-risk domain where confidence on the prediction is required. For that, this paper introduces the use of conformal predictions strategies for regression problems. While standard approaches use an absolute nonconformity scores, we aim at introducing a signed version of the nonconformity scores. Experimental results on synthetic data have shown their interest for non-centered errors. Moreover, in order to reduce the width of the prediction interval, we introduce an optimization procedure which learn the optimal alpha risks for the lower and upper bounds of the interval. In practice, we show that a line search algorithm can be employed to solve it. Practically, this novel adaptive conformal prediction strategy has revealed to be well adapted for skew distributed errors. In addition, an extension of these conformal prediction strategies is introduced to incorporate numeric and categorical auxiliary variables describing the acquisition context. Based on a quantile regression model, they allow to maintain the coverage for each metadata value. All these strategies have then been applied on a real use case of runway localization from data acquired by an aircraft during landing maneuver. Extensive experiments on multiple airports have shown the interest of the proposed conformal prediction strategies, in particular for runways equipped with a very long ramp approach where asymmetric angular deviation error are observed.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"38 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

In-game soccer outcome prediction with offline reinforcement learning 利用离线强化学习预测场内足球比赛结果

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-06 DOI: 10.1007/s10994-024-06611-1

Pegah Rahimian, Balazs Mark Mihalyi, Laszlo Toka

Predicting outcomes in soccer is crucial for various stakeholders, including teams, leagues, bettors, the betting industry, media, and fans. With advancements in computer vision, player tracking data has become abundant, leading to the development of sophisticated soccer analytics models. However, existing models often rely solely on spatiotemporal features derived from player tracking data, which may not fully capture the complexities of in-game dynamics. In this paper, we present an end-to-end system that leverages raw event and tracking data to predict both offensive and defensive actions, along with the optimal decision for each game scenario, based solely on historical game data. Our model incorporates the effectiveness of these actions to accurately predict win probabilities at every minute of the game. Experimental results demonstrate the effectiveness of our approach, achieving an accuracy of 87% in predicting offensive and defensive actions. Furthermore, our in-game outcome prediction model exhibits an error rate of 0.1, outperforming counterpart models and bookmakers’ odds.

预测足球比赛的结果对包括球队、联赛、投注者、博彩业、媒体和球迷在内的各利益相关方都至关重要。随着计算机视觉技术的进步，球员跟踪数据变得越来越丰富，从而推动了复杂足球分析模型的发展。然而，现有模型通常仅依赖于从球员跟踪数据中提取的时空特征，这可能无法完全捕捉到复杂的比赛动态。在本文中，我们介绍了一种端到端系统，该系统利用原始事件和跟踪数据来预测进攻和防守行动，并根据历史比赛数据预测每个比赛场景的最佳决策。我们的模型结合了这些行动的有效性，可以准确预测比赛每一分钟的获胜概率。实验结果证明了我们方法的有效性，在预测进攻和防守行动方面达到了 87% 的准确率。此外，我们的比赛结果预测模型误差率为 0.1，优于同类模型和博彩公司赔率。

引用次数: 0

Evaluating large language models for user stance detection on X (Twitter) 评估用于 X（推特）上用户立场检测的大型语言模型

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-06 DOI: 10.1007/s10994-024-06587-y

Margherita Gambini, Caterina Senette, Tiziano Fagni, Maurizio Tesconi

Current stance detection methods employ topic-aligned data, resulting in many unexplored topics due to insufficient training samples. Large Language Models (LLMs) pre-trained on a vast amount of web data offer a viable solution when training data is unavailable. This work introduces Tweets2Stance - T2S, an unsupervised stance detection framework based on zero-shot classification, i.e. leveraging an LLM pre-trained on Natural Language Inference tasks. T2S detects a five-valued user’s stance on social-political statements by analyzing their X (Twitter) timeline. The Ground Truth of a user’s stance is obtained from Voting Advice Applications (VAAs). Through comprehensive experiments, a T2S’s optimal setting was identified for each election. Linguistic limitations related to the language model are further addressed by integrating state-of-the-art LLMs like GPT-4 and Mixtral into the T2S framework. The T2S framework’s generalization potential is demonstrated by measuring its performance (F1 and MAE scores) across nine datasets. These datasets were built by collecting tweets from competing parties’ Twitter accounts in nine political elections held in different countries from 2019 to 2021. The results, in terms of F1 and MAE scores, outperformed all baselines and approached the best scores for each election. This showcases the ability of T2S, particularly when combined with state-of-the-art LLMs, to generalize across different cultural-political contexts.

目前的立场检测方法采用的是主题对齐数据，由于训练样本不足，导致许多主题未被探索。在没有训练数据的情况下，在大量网络数据上预先训练的大型语言模型（LLM）提供了一种可行的解决方案。这项工作介绍了 Tweets2Stance - T2S，这是一个基于零镜头分类的无监督立场检测框架，即利用在自然语言推理任务中预先训练的 LLM。T2S 通过分析用户的 X（Twitter）时间线，检测用户对社会政治声明的立场。用户立场的地面真实信息来自投票建议应用程序（VAA）。通过综合实验，为每次选举确定了 T2S 的最佳设置。通过将 GPT-4 和 Mixtral 等最先进的语言模型集成到 T2S 框架中，进一步解决了与语言模型相关的语言限制问题。通过测量 T2S 框架在九个数据集上的性能（F1 和 MAE 分数），证明了该框架的通用潜力。这些数据集是通过收集 2019 年至 2021 年在不同国家举行的九次政治选举中竞争政党的 Twitter 账户推文建立的。就 F1 和 MAE 分数而言，结果优于所有基线，并接近每次选举的最佳分数。这展示了 T2S 的能力，尤其是在与最先进的 LLM 相结合时，能够跨越不同的文化政治背景。

{"title":"Evaluating large language models for user stance detection on X (Twitter)","authors":"Margherita Gambini, Caterina Senette, Tiziano Fagni, Maurizio Tesconi","doi":"10.1007/s10994-024-06587-y","DOIUrl":"https://doi.org/10.1007/s10994-024-06587-y","url":null,"abstract":"Current stance detection methods employ topic-aligned data, resulting in many unexplored topics due to insufficient training samples. Large Language Models (LLMs) pre-trained on a vast amount of web data offer a viable solution when training data is unavailable. This work introduces Tweets2Stance - T2S, an unsupervised stance detection framework based on zero-shot classification, i.e. leveraging an LLM pre-trained on Natural Language Inference tasks. T2S detects a five-valued user’s stance on social-political statements by analyzing their X (Twitter) timeline. The Ground Truth of a user’s stance is obtained from Voting Advice Applications (VAAs). Through comprehensive experiments, a T2S’s optimal setting was identified for each election. Linguistic limitations related to the language model are further addressed by integrating state-of-the-art LLMs like GPT-4 and Mixtral into the T2S framework. The T2S framework’s generalization potential is demonstrated by measuring its performance (F1 and MAE scores) across nine datasets. These datasets were built by collecting tweets from competing parties’ Twitter accounts in nine political elections held in different countries from 2019 to 2021. The results, in terms of F1 and MAE scores, outperformed all baselines and approached the best scores for each election. This showcases the ability of T2S, particularly when combined with state-of-the-art LLMs, to generalize across different cultural-political contexts.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"26 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142226584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Nested barycentric coordinate system as an explicit feature map for polyhedra approximation and learning tasks 嵌套重心坐标系作为多面体近似和学习任务的显式特征图

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-05 DOI: 10.1007/s10994-024-06596-x

Lee-Ad Gottlieb, Eran Kaufman, Aryeh Kontorovich, Gabriel Nivasch, Ofir Pele

We introduce a new embedding technique based on a nested barycentric coordinate system. We show that our embedding can be used to transform the problems of polyhedron approximation, piecewise linear classification and convex regression into one of finding a linear classifier or regressor in a higher dimensional (but nevertheless quite sparse) representation. Our embedding maps a piecewise linear function into an everywhere-linear function, and allows us to invoke well-known algorithms for the latter problem to solve the former. We explain the applications of our embedding to the problems of approximating separating polyhedra—in fact, it can approximate any convex body and unions of convex bodies—as well as to classification by separating polyhedra, and to piecewise linear regression.

我们介绍了一种基于嵌套重心坐标系的新嵌入技术。我们的研究表明，我们的嵌入技术可用于将多面体逼近、片断线性分类和凸回归等问题转化为在高维（但相当稀疏）表示中寻找线性分类器或回归器的问题。我们的嵌入将片面线性函数映射为无处不在的线性函数，并允许我们引用后一个问题的著名算法来解决前一个问题。我们解释了我们的嵌入在近似分离多面体问题上的应用--事实上，它可以近似任何凸体和凸体的联合体--以及在分离多面体分类和分片线性回归中的应用。

引用次数: 0

Self-organizing maps with adaptive distances for multiple dissimilarity matrices 具有自适应距离的自组织图，适用于多个异质性矩阵

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-09-03 DOI: 10.1007/s10994-024-06607-x

Laura Maria Palomino Mariño, Francisco de Assis Tenorio de Carvalho

There has been an increasing interest in multi-view approaches based on their ability to manage data from several sources. However, regarding unsupervised learning, most multi-view approaches are clustering algorithms suitable for analyzing vector data. Currently, only a relatively few SOM algorithms can manage multi-view dissimilarity data, despite their usefulness. This paper proposes two new families of batch SOM algorithms for multi-view dissimilarity data: multi-medoids SOM and relational SOM, both designed to give a crisp partition and learn the relevance weight for each dissimilarity matrix by optimizing an objective function, aiming to preserve the topological properties of the map data. In both families, the weight represents the relevance of each dissimilarity matrix for the learning task being computed, either locally, for each cluster, or globally, for the whole partition. The proposed algorithms were compared with already in the literature single-view SOM and set-medoids SOM for multi-view dissimilarity data. According to the experiments using 14 datasets for F-measure, NMI, Topographic Error, and Silhouette, the relevance weights of the dissimilarity matrices must be considered. In addition, the multi-medoids and relational SOM performed better than the set-medoids SOM. An application study was also carried out on a dermatology dataset, where the proposed methods have the best performance.

多视图方法能够管理来自多个来源的数据，因此越来越受到人们的关注。然而，在无监督学习方面，大多数多视角方法都是适用于分析向量数据的聚类算法。目前，只有相对较少的 SOM 算法可以管理多视角差异数据，尽管它们非常有用。本文针对多视角异质性数据提出了两个新的批量 SOM 算法系列：多媒介 SOM 和关系 SOM，这两个系列都旨在通过优化目标函数来给出一个清晰的分区并学习每个异质性矩阵的相关性权重，目的是保留地图数据的拓扑特性。在这两个系列中，权重代表了每个异质性矩阵对于正在计算的学习任务的相关性，可以是局部的（针对每个群组），也可以是全局的（针对整个分区）。针对多视角异质性数据，我们将所提出的算法与已有文献中的单视角 SOM 和集合媒介 SOM 进行了比较。根据使用 14 个数据集进行的 F-measure、NMI、Topographic Error 和 Silhouette 实验，必须考虑异质性矩阵的相关性权重。此外，多媒介 SOM 和关系 SOM 的性能优于集合媒介 SOM。我们还在一个皮肤科数据集上进行了应用研究，发现所提出的方法在该数据集上表现最佳。

{"title":"Self-organizing maps with adaptive distances for multiple dissimilarity matrices","authors":"Laura Maria Palomino Mariño, Francisco de Assis Tenorio de Carvalho","doi":"10.1007/s10994-024-06607-x","DOIUrl":"https://doi.org/10.1007/s10994-024-06607-x","url":null,"abstract":"There has been an increasing interest in multi-view approaches based on their ability to manage data from several sources. However, regarding unsupervised learning, most multi-view approaches are clustering algorithms suitable for analyzing vector data. Currently, only a relatively few SOM algorithms can manage multi-view dissimilarity data, despite their usefulness. This paper proposes two new families of batch SOM algorithms for multi-view dissimilarity data: multi-medoids SOM and relational SOM, both designed to give a crisp partition and learn the relevance weight for each dissimilarity matrix by optimizing an objective function, aiming to preserve the topological properties of the map data. In both families, the weight represents the relevance of each dissimilarity matrix for the learning task being computed, either locally, for each cluster, or globally, for the whole partition. The proposed algorithms were compared with already in the literature single-view SOM and set-medoids SOM for multi-view dissimilarity data. According to the experiments using 14 datasets for F-measure, NMI, Topographic Error, and Silhouette, the relevance weights of the dissimilarity matrices must be considered. In addition, the multi-medoids and relational SOM performed better than the set-medoids SOM. An application study was also carried out on a dermatology dataset, where the proposed methods have the best performance.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"71 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep negative correlation classification 深度负相关分类

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-08-28 DOI: 10.1007/s10994-024-06604-0

Le Zhang, Qibin Hou, Yun Liu, Jia-Wang Bian, Xun Xu, Joey Tianyi Zhou, Ce Zhu

Ensemble learning serves as a straightforward way to improve the performance of almost any machine learning algorithm. Existing deep ensemble methods usually naïvely train many different models and then aggregate their predictions. This is not optimal in our view from two aspects: (1) Naïvely training multiple models adds much more computational burden, especially in the deep learning era; (2) Purely optimizing each base model without considering their interactions limits the diversity of ensemble and performance gains. We tackle these issues by proposing deep negative correlation classification (DNCC), in which the accuracy and diversity trade-off is systematically controlled by decomposing the loss function seamlessly into individual accuracy and the “correlation” between individual models and the ensemble. DNCC yields a deep classification ensemble where the individual estimator is both accurate and “negatively correlated”. Thanks to the optimized diversities, DNCC works well even when utilizing a shared network backbone, which significantly improves its efficiency when compared with most existing ensemble systems, as illustrated in Fig. 2. Extensive experiments on multiple benchmark datasets and network structures demonstrate the superiority of the proposed method.

集合学习是提高几乎所有机器学习算法性能的直接方法。现有的深度集合方法通常会天真地训练许多不同的模型，然后汇总它们的预测结果。我们认为，从两个方面来看，这种方法并不是最佳的：（1）天真地训练多个模型会增加更多的计算负担，尤其是在深度学习时代；（2）纯粹地优化每个基础模型而不考虑它们之间的相互作用会限制集合的多样性和性能的提高。为了解决这些问题，我们提出了深度负相关分类（DNCC），通过将损失函数无缝分解为单个模型的准确性和单个模型与集合之间的 "相关性"，系统地控制准确性和多样性之间的权衡。DNCC 产生的深度分类集合中，单个估计器既准确又 "负相关"。如图 2 所示，得益于优化的多样性，DNCC 即使在利用共享网络骨干的情况下也能很好地工作，这与大多数现有的集合系统相比，大大提高了其效率。在多个基准数据集和网络结构上进行的大量实验证明了所提出方法的优越性。

{"title":"Deep negative correlation classification","authors":"Le Zhang, Qibin Hou, Yun Liu, Jia-Wang Bian, Xun Xu, Joey Tianyi Zhou, Ce Zhu","doi":"10.1007/s10994-024-06604-0","DOIUrl":"https://doi.org/10.1007/s10994-024-06604-0","url":null,"abstract":"Ensemble learning serves as a straightforward way to improve the performance of almost any machine learning algorithm. Existing deep ensemble methods usually naïvely train many different models and then aggregate their predictions. This is not optimal in our view from two aspects: (1) Naïvely training multiple models adds much more computational burden, especially in the deep learning era; (2) Purely optimizing each base model without considering their interactions limits the diversity of ensemble and performance gains. We tackle these issues by proposing deep negative correlation classification (DNCC), in which the accuracy and diversity trade-off is systematically controlled by decomposing the loss function seamlessly into individual accuracy and the “correlation” between individual models and the ensemble. DNCC yields a deep classification ensemble where the individual estimator is both accurate and “negatively correlated”. Thanks to the optimized diversities, DNCC works well even when utilizing a shared network backbone, which significantly improves its efficiency when compared with most existing ensemble systems, as illustrated in Fig. 2. Extensive experiments on multiple benchmark datasets and network structures demonstrate the superiority of the proposed method.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"17 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Generalization of temporal logic tasks via future dependent options 通过依赖未来的选项实现时间逻辑任务的通用化

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-08-26 DOI: 10.1007/s10994-024-06614-y

Duo Xu, Faramarz Fekri

Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they are common for many real-world applications, such as service and navigation robots. However, it is often inefficient or even infeasible to train reinforcement learning (RL) agents to solve multiple TL tasks, since rewards are sparse and non-Markovian in these tasks. A promising solution to this problem is to learn task-conditioned policies which can zero-shot generalize to new TL tasks without further training. However, influenced by some practical issues, such as issues of lossy symbolic observation and long time-horizon of completing TL task, previous works suffer from sample inefficiency in training and sub-optimality (or even infeasibility) in task execution. In order to tackle these issues, this paper proposes an option-based framework to generalize TL tasks, consisting of option training and task execution parts. We have innovations in both parts. In option training, we propose to learn options dependent on the future subgoals via a novel approach. Additionally, we propose to train a multi-step value function which can propagate the rewards of satisfying future subgoals more efficiently in long-horizon tasks. In task execution, in order to ensure the optimality and safety, we propose a model-free MPC planner for option selection, circumventing the learning of a transition model which is required by previous MPC planners. In experiments on three different domains, we evaluate the generalization capability of the agent trained by the proposed method, showing its significant advantage over previous methods.

时间逻辑（TL）任务由复杂的时间扩展子目标组成，在服务和导航机器人等许多现实世界的应用中很常见。然而，训练强化学习（RL）代理来解决多个 TL 任务往往效率低下，甚至不可行，因为在这些任务中，奖励是稀疏和非马尔可夫的。解决这一问题的一个很有前景的办法是学习任务条件策略，这种策略无需进一步训练就能对新的 TL 任务进行零点泛化。然而，受一些实际问题的影响，例如有损符号观测和完成 TL 任务的时间跨度较长等问题，以前的工作在训练中存在样本效率低下的问题，在任务执行中存在次优性（甚至不可行性）。为了解决这些问题，本文提出了一个基于选项的框架来泛化 TL 任务，由选项训练和任务执行两部分组成。我们在这两部分都有所创新。在选项训练中，我们建议通过一种新颖的方法来学习与未来子目标相关的选项。此外，我们还建议训练一个多步骤价值函数，它可以在长视距任务中更有效地传播满足未来子目标的奖励。在任务执行过程中，为了确保最优性和安全性，我们提出了一种用于选项选择的无模型 MPC 计划程序，避免了以往 MPC 计划程序所要求的过渡模型学习。在三个不同领域的实验中，我们评估了用所提方法训练的代理的泛化能力，结果表明它比以前的方法有显著优势。

{"title":"Generalization of temporal logic tasks via future dependent options","authors":"Duo Xu, Faramarz Fekri","doi":"10.1007/s10994-024-06614-y","DOIUrl":"https://doi.org/10.1007/s10994-024-06614-y","url":null,"abstract":"Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they are common for many real-world applications, such as service and navigation robots. However, it is often inefficient or even infeasible to train reinforcement learning (RL) agents to solve multiple TL tasks, since rewards are sparse and non-Markovian in these tasks. A promising solution to this problem is to learn task-conditioned policies which can zero-shot generalize to new TL tasks without further training. However, influenced by some practical issues, such as issues of lossy symbolic observation and long time-horizon of completing TL task, previous works suffer from sample inefficiency in training and sub-optimality (or even infeasibility) in task execution. In order to tackle these issues, this paper proposes an option-based framework to generalize TL tasks, consisting of option training and task execution parts. We have innovations in both parts. In option training, we propose to learn options dependent on the future subgoals via a novel approach. Additionally, we propose to train a multi-step value function which can propagate the rewards of satisfying future subgoals more efficiently in long-horizon tasks. In task execution, in order to ensure the optimality and safety, we propose a model-free MPC planner for option selection, circumventing the learning of a transition model which is required by previous MPC planners. In experiments on three different domains, we evaluate the generalization capability of the agent trained by the proposed method, showing its significant advantage over previous methods.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"317 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142226586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Machine Learning

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀